I am one of the project leads on the Binder project (we run mybinder.org). One of the frequent questions we receive is how to get data, in particular large data, into a Binder repo.
We have collected some examples of how to do this in https://github.com/binder-examples/getting-data. I think the ROOT file format, and being able to stream it via xrootd, is very cool technology that not a lot of other fields of research have. They are mostly still copying the whole file first. Being an ex-CERNiac, I think it would be cool to feature xrootd as an example.
Question: where can I find some ROOT files that can be accessed remotely and contain data that is kinda interesting but not too difficult to understand for non-HEP people? The Binder-compatible HEP repositories on GitHub for teaching and outreach mostly have their data in CSV :-/ Any pointers would be welcome.
As you know, Go-HEP has such an example.
The issue I encountered in making it work with mybinder is that (it seemed to me) mybinder didn’t open the default port of xrootd.
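For what it's worth, one quick way to check that suspicion from inside a Binder session would be to try opening a plain TCP connection to the server's xrootd port (1094 by default). Below is a minimal sketch in Python using only the standard library. The function name `port_reachable` and the hostname in the usage comment are made up for illustration and are not taken from any real setup. A successful connection only shows that the port is not blocked; it does not prove the xrootd handshake itself works.

```python
import socket

# 1094 is the default port of the xrootd protocol.
XROOTD_DEFAULT_PORT = 1094


def port_reachable(host, port=XROOTD_DEFAULT_PORT, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds.

    Any failure (refused connection, timeout, DNS error) is reported as False.
    """
    try:
        # create_connection resolves the host name and connects; the
        # context manager closes the socket again straight away.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # socket.timeout and socket.gaierror are both subclasses of OSError.
        return False


# Usage (placeholder host name, replace with a real xrootd server):
#   print(port_reachable("xrootd.example.org"))
```

If this returns False for a server that is reachable from your own machine, outbound traffic on that port is probably being filtered on the Binder side.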
We certainly have a good selection of public xrootd and HTTP data. I’ll send some examples on Monday; if it’s more urgent and you want to browse yourself, we recently added some as input data for the RDataFrame tutorials.