Distributed RDataFrame - snapshot

Dear experts,

I am trying to use distributed RDataFrame on SWAN. I create a dataframe, fillter it and save the columns into a new root file. I understand while use snapshot in distributed case, the resultant snapshots would be equal to number of partitions.

Is there a way I can obtain one single snapshot (combining all partitions)?


Hello @wandering_particle ,

and welcome to the ROOT forum!

I think the only way is to post-process the outputs yourself, e.g. adding them together with the hadd command line tool that comes with ROOT. @vpadulan can correct me if Iā€™m wrong (likely when the working week restarts :smiley: ).


1 Like

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.