Scalability of RDataFrames on 16+ cores

ingomueller.net · April 29, 2021, 11:16am

This is great feedback!

About the first point: As a non-physicist, I do not have a great understanding of how big “typical” data sets are. I image that there is quite a variety. Do you have any pointer to material that characterizes this variety? Alternatively, what fix do you suggest? Should be duplicate the current data set to cover more data set sizes (and which ones should we cover)?
About the second: What is the correct way to do this? This forum post suggests to use .L and from the context of the whole thread I understand that that does compile the macro with optimizations. Note that our script does use .L. Are the macros not compiled that way? Update: I just saw the + flag to .L. Is that all it takes to fix our script?