I added a stripped down version of the run_bindiff_w_flags.py script that simply reads in the JSON file and runs CLUBB with all the different flag setting groups listed. It stores the flag files and the model output in the working directory. It does not do anything fancy like checkout the git repository and compile or compare results.