* Initial GPUization, and testing the multi_col output method in single precision.
* advance_clubb_core tweaks
* Adding radht acc update
* Updates
* Adding new flags to the mono_flux test to prevent CPU and GPU results from diverging, which breaks the test.
* Small changes and improvements.
* Making new monoflux test lines
* Reworking some SILHS GPUization to make it more similar to the GPU code in the rest of CLUBB. This adds some extra parts that run on GPUs, so it is BIT_CHANGING.
* Cleanup and comment update
* Cleanup
* Removing DCUDA flag from compile config scripts to help GPU and CPU results match for silhs cases
* Removing accidentally added file.
* Updating script
* Updating script
* Updating script
* Updating tolerance in script to handle rico_silhs differences, and hopefully final GPU updates
* Small cleanup
* Adding an option to the multi_col diff check script to scale the differences by the field average. So far this is only needed for thlm differences that are slightly too large.
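
The field-average scaling mentioned in the last item can be sketched roughly as follows. This is only an illustration of the idea, not the actual diff check script: the function name `max_scaled_diff`, the `scale_by_avg` argument, the sample thlm values, and the tolerance are all hypothetical.

```python
def max_scaled_diff(field_a, field_b, scale_by_avg=False):
    """Return the largest absolute difference between two equally sized fields.

    With scale_by_avg=True the difference is divided by the mean absolute
    value of both fields, so large-magnitude fields are compared relatively.
    (Hypothetical helper; the real script may compute this differently.)
    """
    if len(field_a) != len(field_b):
        raise ValueError("fields must have the same number of points")

    max_diff = max(abs(a - b) for a, b in zip(field_a, field_b))
    if not scale_by_avg:
        return max_diff

    # Average magnitude over both fields, used as the scaling factor.
    field_avg = sum(abs(a) + abs(b) for a, b in zip(field_a, field_b)) / (
        2 * len(field_a)
    )
    if field_avg == 0.0:
        # Nothing to scale by; fall back to the unscaled difference.
        return max_diff
    return max_diff / field_avg


# Made-up thlm-like values (around 300) with a small CPU/GPU mismatch.
thlm_cpu = [300.0, 301.0, 302.0]
thlm_gpu = [300.0, 301.0, 302.0003]
tolerance = 1.0e-5  # hypothetical threshold

abs_diff = max_scaled_diff(thlm_cpu, thlm_gpu)
rel_diff = max_scaled_diff(thlm_cpu, thlm_gpu, scale_by_avg=True)
print(abs_diff > tolerance, rel_diff > tolerance)
```

With values of this magnitude, an unscaled difference can exceed a fixed tolerance even when the mismatch is tiny relative to the field itself, which is presumably why scaling helps for thlm.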