* First bit of prescribe forcings GPUization. Only tested with arm so far.
* Cherry-pick merge
* Missed file
* GPUizing more cases
* Rest of cases GPUized
* Little more GPUization and fixes.
* Bug fix to minloc calculation affecting cases with l_modify_bc_for_cnvg_test=.true., unclear why this is a bug. Also adding ifdef around print in GPU code