RDataFrame Display fast feature request?

RDataFrame::Display is return a RDisplay, which is very slow when deal with huge dataset.
even to display only headset 10 events

I’d love to add a feature to dump the first 10 or several events fast in RDataFrame;
or is there already have a function to do it?

Hi @cxwx,
RDisplay is meant to be that feature!

Could it be that in your computation graph you have a Display action together with some other action like Histo1D that requires processing the whole dataset (so Display stops processing after 10 events but you only see the printout when the full event loop is finished)?

Otherwise, could you share a reproducer or run perf record --call-graph dwarf on the reproducer to produce a flamegraph or similar, to figure out where time is being spent?


I’m sorry, it was a mistake from me.

I made a mistake that I define two branch with same branch name, which cause the problem.

Ah, interesting. We should have diagnostics for that. Feel free to report an issue on jira if you think that’s a bug in RDF.


This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.