I am one of the project leads on the Binder project (we run mybinder.org). One of the frequent questions we receive is how to get data into a binder repo. In particular large data.
We have collected some example on how to do this in https://github.com/binder-examples/getting-data. I think the ROOT file format and being able to stream it via xrootd is very cool technology that not a lot of other fields of research have. They are mostly still copying the whole file first. Being an ex-CERNiac I think it would be cool to feature xrootd as an example.
Question: where can I find some root files that can be accessed remotely and contain data that is kinda interesting but not too difficult to understand for not-HEP people? The binder compatible HEP repositories on GitHub for teaching and outreach mostly have their data in CSV :-/ Any pointers would be welcome.
As you know Go-HEP has such an example
The issue to make it working with mybinder I encountered is that (it seemed to me) mybinder didn’t open the default port of xrootd.
Good to hear from you again!
We certainly have a good choice of public xrootd and http data. I’ll send some examples on Monday; if it’s more urgent and you want to browse yourself: we have added some as input data for RDataFrame tutorials recently.
Thanks for reaching out, more on Monday!
Here we go:
tutorials/dataframe/df103_NanoAODHiggsAnalysis.C have probably the nicest public data sets.
See https://root.cern.ch/doc/master/df102__NanoAODDimuonAnalysis_8C.html and https://root.cern.ch/doc/master/df103__NanoAODHiggsAnalysis_8C.html for the generated plot; see https://github.com/root-project/root/tree/master/tutorials/dataframe for the code.
Does this help?
Thanks a lot! Those look like good examples.
Now I just need to remember how to use ROOT again
We’re here to help should you have forgotten parts of it
This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.