Hello,
Looks like I’m facing the same problem.
Proof-Lite workers randomly crash during TProofOutputFile merging.
~/.proof/newbean-bean-workdir/last-lite-session/worker-0.2.log:
22:16:53 28123 Wrk-0.2 | SvcMsg in <TProofPlayerSlave::CheckMemUsage>: Memory 448736 virtual 111476 resident event 1000
22:16:54 28123 Wrk-0.2 | SvcMsg in <TProofPlayerSlave::CheckMemUsage>: Memory 448736 virtual 111492 resident event 1000
22:16:54 28123 Wrk-0.2 | *** Break ***: segmentation violation
===========================================================
There was a crash (kSigSegmentationViolation).
This is the entire stack trace of all threads:
===========================================================
#0 0x00007f850eb6745e in __libc_waitpid (pid=<value optimized out>, stat_loc=0x7fff990e7c2c, options=<value optimized out>) at ../sysdeps/unix/sysv/linux/waitpid.c:32
#1 0x00007f850eafca99 in do_system (line=<value optimized out>) at ../sysdeps/posix/system.c:149
#2 0x00007f850fc6fbc6 in TUnixSystem::Exec (this=0x168ef60, shellcmd=0x34ec000 "/opt/root_trunk/etc/gdb-backtrace.sh 28123 1>&2") at /opt/root_trunk/core/unix/src/TUnixSystem.cxx:2067
#3 0x00007f850fc704b6 in TUnixSystem::StackTrace (this=0x168ef60) at /opt/root_trunk/core/unix/src/TUnixSystem.cxx:2315
#4 0x00007f850fc6dde5 in TUnixSystem::DispatchSignals (this=0x168ef60, sig=kSigSegmentationViolation) at /opt/root_trunk/core/unix/src/TUnixSystem.cxx:1198
#5 0x00007f850fc6bb89 in SigHandler (sig=kSigSegmentationViolation) at /opt/root_trunk/core/unix/src/TUnixSystem.cxx:356
#6 0x00007f850fc73bac in sighandler (sig=11) at /opt/root_trunk/core/unix/src/TUnixSystem.cxx:3510
#7 <signal handler called>
#8 0x00007f850f2d9248 in __dynamic_cast () from /usr/lib/x86_64-linux-gnu/libstdc++.so.6
#9 0x00007f850c36c79b in TFile::Close (this=0x1f71be0, option=0x7f8507ed74a0 "") at /opt/root_trunk/io/io/src/TFile.cxx:885
#10 0x00007f8507ebf148 in ReadDst::SlaveTerminate (this=0x1f86880) at /home/boger/newbean/bean/BeanCore/ReadDst.cxx:346
#11 0x00007f8507bfc0b9 in TProofPlayer::Process (this=0x1f6fac0, dset=0x1f1b260, selector_file=0x7fff990eaed9 "ReadDst", option=0x7fff990eaeb9 "", nentries=-1, first=-1) at /opt/root_trunk/proof/proofplayer/src/TProofPlayer.cxx:1381
#12 0x00007f850b0e0292 in TProofServ::HandleProcess (this=0x1bf6550, mess=0x1cf8580, slb=0x0) at /opt/root_trunk/proof/proof/src/TProofServ.cxx:3974
#13 0x00007f850b0d2b1a in TProofServ::HandleSocketInput (this=0x1bf6550, mess=0x1cf8580, all=true) at /opt/root_trunk/proof/proof/src/TProofServ.cxx:1629
#14 0x00007f850b0d11de in TProofServ::HandleSocketInput (this=0x1bf6550) at /opt/root_trunk/proof/proof/src/TProofServ.cxx:1352
#15 0x00007f850b0f778b in TProofServLiteInputHandler::Notify (this=0x1bf9990) at /opt/root_trunk/proof/proof/src/TProofServLite.cxx:163
#16 0x00007f850b0fa7e5 in TProofServLiteInputHandler::ReadNotify (this=0x1bf9990) at /opt/root_trunk/proof/proof/src/TProofServLite.cxx:155
#17 0x00007f850fc6e160 in TUnixSystem::CheckDescriptors (this=0x168ef60) at /opt/root_trunk/core/unix/src/TUnixSystem.cxx:1293
#18 0x00007f850fc6d3d4 in TUnixSystem::DispatchOneEvent (this=0x168ef60, pendingOnly=false) at /opt/root_trunk/core/unix/src/TUnixSystem.cxx:1007
#19 0x00007f850fbc3af1 in TSystem::InnerLoop (this=0x168ef60) at /opt/root_trunk/core/base/src/TSystem.cxx:408
#20 0x00007f850fbc3872 in TSystem::Run (this=0x168ef60) at /opt/root_trunk/core/base/src/TSystem.cxx:358
#21 0x00007f850fb4969e in TApplication::Run (this=0x1bf6550, retrn=false) at /opt/root_trunk/core/base/src/TApplication.cxx:1044
#22 0x00007f850b0d724f in TProofServ::Run (this=0x1bf6550, retrn=false) at /opt/root_trunk/proof/proof/src/TProofServ.cxx:2526
#23 0x00000000004027af in main (argc=6, argv=0x7fff990ec5b8) at /opt/root_trunk/main/src/pmain.cxx:325
===========================================================
The lines below might hint at the cause of the crash.
If they do not help you then please submit a bug report at
http://root.cern.ch/bugs. Please post the ENTIRE stack trace
from above as an attachment in addition to anything else
that might help us fixing this issue.
===========================================================
#8 0x00007f850f2d9248 in __dynamic_cast () from /usr/lib/x86_64-linux-gnu/libstdc++.so.6
#9 0x00007f850c36c79b in TFile::Close (this=0x1f71be0, option=0x7f8507ed74a0 "") at /opt/root_trunk/io/io/src/TFile.cxx:885
#10 0x00007f8507ebf148 in ReadDst::SlaveTerminate (this=0x1f86880) at /home/boger/newbean/bean/BeanCore/ReadDst.cxx:346
#11 0x00007f8507bfc0b9 in TProofPlayer::Process (this=0x1f6fac0, dset=0x1f1b260, selector_file=0x7fff990eaed9 "ReadDst", option=0x7fff990eaeb9 "", nentries=-1, first=-1) at /opt/root_trunk/proof/proofplayer/src/TProofPlayer.cxx:1381
#12 0x00007f850b0e0292 in TProofServ::HandleProcess (this=0x1bf6550, mess=0x1cf8580, slb=0x0) at /opt/root_trunk/proof/proof/src/TProofServ.cxx:3974
#13 0x00007f850b0d2b1a in TProofServ::HandleSocketInput (this=0x1bf6550, mess=0x1cf8580, all=true) at /opt/root_trunk/proof/proof/src/TProofServ.cxx:1629
#14 0x00007f850b0d11de in TProofServ::HandleSocketInput (this=0x1bf6550) at /opt/root_trunk/proof/proof/src/TProofServ.cxx:1352
#15 0x00007f850b0f778b in TProofServLiteInputHandler::Notify (this=0x1bf9990) at /opt/root_trunk/proof/proof/src/TProofServLite.cxx:163
#16 0x00007f850b0fa7e5 in TProofServLiteInputHandler::ReadNotify (this=0x1bf9990) at /opt/root_trunk/proof/proof/src/TProofServLite.cxx:155
#17 0x00007f850fc6e160 in TUnixSystem::CheckDescriptors (this=0x168ef60) at /opt/root_trunk/core/unix/src/TUnixSystem.cxx:1293
#18 0x00007f850fc6d3d4 in TUnixSystem::DispatchOneEvent (this=0x168ef60, pendingOnly=false) at /opt/root_trunk/core/unix/src/TUnixSystem.cxx:1007
#19 0x00007f850fbc3af1 in TSystem::InnerLoop (this=0x168ef60) at /opt/root_trunk/core/base/src/TSystem.cxx:408
#20 0x00007f850fbc3872 in TSystem::Run (this=0x168ef60) at /opt/root_trunk/core/base/src/TSystem.cxx:358
#21 0x00007f850fb4969e in TApplication::Run (this=0x1bf6550, retrn=false) at /opt/root_trunk/core/base/src/TApplication.cxx:1044
#22 0x00007f850b0d724f in TProofServ::Run (this=0x1bf6550, retrn=false) at /opt/root_trunk/proof/proof/src/TProofServ.cxx:2526
#23 0x00000000004027af in main (argc=6, argv=0x7fff990ec5b8) at /opt/root_trunk/main/src/pmain.cxx:325
===========================================================
22:16:55 28123 Wrk-0.2 | Error in <TProofServLite::HandleException>: caugth exception triggered by signal '1' while processing dset:'TDSet:Event', file:'/home/boger/data/mc/662/gen/alld_inc/alld_inc_9_10_4.dst' - check logs for possible stacktrace - last event: 999
The exactly same code works flawlessly with ROOT v532 but started to crash with both ROOT v534-3 and trunk (47040). If no TProofOutputFile is used, everything works fine.
The exact stack trace is somewhat random, sometimes the segfault happens in the analysis code, but sometimes also in TCollection::GarbageCollect, TFile::Close() or even in ~TTree().
Do you have any ideas?