Proof crashes with root version 5.34/03

Hello,
I am trying to run proof tutorials on our cluster as mentioned in following link

root.cern.ch/drupal/content/more … d-examples

The root version installed on master and slaves is “5.34/03”. When I tried to execute “runProof(“simple”)”, the program ends with segmentation violation. Please see below.

“”"
[jha2@@hep tutorials]$ root


  •                                     *
    
  •    W E L C O M E  to  R O O T       *
    
  •                                     *
    
  • Version 5.34/03 27 October 2012 *
  •                                     *
    
  • You are welcome to visit our Web site *
  •      [root.cern.ch](http://root.cern.ch)            *
    
  •                                     *
    

ROOT 5.34/03 (branches/v5-34-00-patches@46829, De 07 2012, 11:04:00 on linuxx8664gcc)

CINT/ROOT C/C++ Interpreter version 5.18.00, July 2, 2010
Type ? for help. Commands must be C++ statements.
Enclose multiple statements between { }.

Welcome to my rootlogon.C
root [0] .L proof/runProof.C+
root [1] runProof(“simple”)
tutorial dir: /tmp/jha2/.proof-tutorial
getProof: trying to open a session on the external cluster at 'proof://proof-00:80/'
Starting master: opening connection …
Starting master: OK
Opening connections to workers: OK (96 workers)
Setting up worker servers: OK (96 workers)
PROOF set to parallel mode (96 workers)
getProof: WARNING: started/attached a session on external cluster (proof://proof-00:80/): ‘nwrks=5’ ignored
getProof: WARNING: started/attached a session on external cluster (proof://proof-00:80/): ‘dir="/tmp/jha2/.proof-tutorial"’ ignored
PROOF set to parallel mode (5 workers)
runProof: simple: ACLiC mode: '+'
Collection name=‘TList’, class=‘TList’, size=2
Collection name=‘FeedbackList’, class=‘TList’, size=0
TParameter PROOF_UseParallelUnzip = 0

runProof: running “simple” with nhist= 100, nhist3=-1 and nevt= 100000

Info in TUnixSystem::ACLiC: creating shared library /scratch/scratch95/j/jha2/Work/PROOF/ProofTesting/tutorials/proof/ProofSimple_C.so
(class TCanvas*)0x2aaaac4f2b60
Mst-0: merging output objects … done
Mst-0: grand total: sent 13 objects, size: 174369 bytes
Warning in ProofSimple::Terminate: histograms not found

*** Break *** segmentation violation

===========================================================
There was a crash.
This is the entire stack trace of all threads:

Thread 3 (Thread 0x415c5940 (LWP 5800)):
#0 0x000000326889a541 in nanosleep () from /lib64/libc.so.6
#1 0x000000326889a364 in sleep () from /lib64/libc.so.6
#2 0x00002aff97ce4c44 in GarbageCollectorThread(void*, XrdClientThread*) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/cms/cmssw/CMSSW_6_1_2/external/slc5_amd64_gcc472/lib/libXrdClient.so.1
#3 0x00002aff97c5967f in XrdSysThread_Xeq () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/cms/cmssw/CMSSW_6_1_2/external/slc5_amd64_gcc472/lib/libXrdUtils.so.1
#4 0x000000326940673d in start_thread () from /lib64/libpthread.so.0
#5 0x00000032688d44bd in clone () from /lib64/libc.so.6
#6 0x0000000000000000 in ?? ()

Thread 2 (Thread 0x40b42940 (LWP 5801)):
#0 0x00000032688cb696 in poll () from /lib64/libc.so.6
#1 0x00002aff97cc8c47 in XrdClientSock::RecvRaw(void*, int, int, int*) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/cms/cmssw/CMSSW_6_1_2/external/slc5_amd64_gcc472/lib/libXrdClient.so.1
#2 0x00002aff97cf018c in XrdClientPhyConnection::ReadRaw(void*, int, int, int*) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/cms/cmssw/CMSSW_6_1_2/external/slc5_amd64_gcc472/lib/libXrdClient.so.1
#3 0x00002aff97cf7cff in XrdClientMessage::ReadRaw(XrdClientPhyConnection*) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/cms/cmssw/CMSSW_6_1_2/external/slc5_amd64_gcc472/lib/libXrdClient.so.1
#4 0x00002aff97cf48f5 in XrdClientPhyConnection::BuildMessage(bool, bool) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/cms/cmssw/CMSSW_6_1_2/external/slc5_amd64_gcc472/lib/libXrdClient.so.1
#5 0x00002aff97cf6082 in SocketReaderThread(void*, XrdClientThread*) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/cms/cmssw/CMSSW_6_1_2/external/slc5_amd64_gcc472/lib/libXrdClient.so.1
#6 0x00002aff97c5967f in XrdSysThread_Xeq () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/cms/cmssw/CMSSW_6_1_2/external/slc5_amd64_gcc472/lib/libXrdUtils.so.1
#7 0x000000326940673d in start_thread () from /lib64/libpthread.so.0
#8 0x00000032688d44bd in clone () from /lib64/libc.so.6
#9 0x0000000000000000 in ?? ()

Thread 1 (Thread 0x2aff94b908d0 (LWP 5791)):
#0 0x000000326889a14f in waitpid () from /lib64/libc.so.6
#1 0x000000326883c481 in do_system () from /lib64/libc.so.6
#2 0x000000326883c7d7 in system () from /lib64/libc.so.6
#3 0x00002aff93a6423c in TUnixSystem::StackTrace() () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCore.so
#4 0x00002aff93a66a13 in TUnixSystem::DispatchSignals(ESignals) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCore.so
#5
#6 __cxxabiv1::__dynamic_cast (src_ptr=0x2aaaac64d5c0, src_type=0x2aff93f1e220 , dst_type=0x2aff969120c0 , src2dst=0) at …/…/…/…/libstdc+±v3/libsupc++/dyncast.cc:57
#7 0x00002aff967af64b in TProof::HandleOutputOptions(TString&, TString&, int) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libProof.so
#8 0x00002aff967b112b in TProof::Process(TDSet*, char const*, char const*, long long, long long) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libProof.so
#9 0x00002aff9678fd1b in TProof::Process(char const*, long long, char const*) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libProof.so
#10 0x00002aff955ada59 in runProof(char const*, char const*, int) () from /usr/rmt_share/scratch95/j/jha2/Work/PROOF/ProofTesting/tutorials/./proof/runProof_C.so
#11 0x00002aff955b1070 in G__runProof_C_ACLiC_dict__0_2558(G__value*, char const*, G__param*, int) () from /usr/rmt_share/scratch95/j/jha2/Work/PROOF/ProofTesting/tutorials/./proof/runProof_C.so
#12 0x00002aff9400c6f7 in Cint::G__ExceptionWrapper(int ()(G__value, char const*, G__param*, int), G__value*, char*, G__param*, int) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCint.so
#13 0x00002aff940b6811 in G__execute_call () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCint.so
#14 0x00002aff940b6bde in G__call_cppfunc () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCint.so
#15 0x00002aff9409ba43 in G__interpret_func () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCint.so
#16 0x00002aff940853f9 in G__getfunction () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCint.so
#17 0x00002aff94060672 in G__getitem () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCint.so
#18 0x00002aff94066bea in G__getexpr () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCint.so
#19 0x00002aff940e69ce in G__exec_statement () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCint.so
#20 0x00002aff9404c71b in G__exec_tempfile_core () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCint.so
#21 0x00002aff9404dc8e in G__exec_tempfile_fp () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCint.so
#22 0x00002aff940f2d1d in G__process_cmd () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCint.so
#23 0x00002aff93a30011 in TCint::ProcessLine(char const*, TInterpreter::EErrorCode*) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCore.so
#24 0x00002aff93996432 in TApplication::ProcessLine(char const*, bool, int*) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCore.so
#25 0x00002aff949cf669 in TRint::HandleTermInput() () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libRint.so
#26 0x00002aff93a659ad in TUnixSystem::CheckDescriptors() () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCore.so
#27 0x00002aff93a67148 in TUnixSystem::DispatchOneEvent(bool) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCore.so
#28 0x00002aff939ec766 in TSystem::InnerLoop() () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCore.so
#29 0x00002aff939ee344 in TSystem::Run() () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCore.so
#30 0x00002aff9399478f in TApplication::Run(bool) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libCore.so
#31 0x00002aff949d0347 in TRint::Run(bool) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libRint.so
#32 0x00000000004011bc in main ()

The lines below might hint at the cause of the crash.
If they do not help you then please submit a bug report at
root.cern.ch/bugs. Please post the ENTIRE stack trace
from above as an attachment in addition to anything else
that might help us fixing this issue.

#6 __cxxabiv1::__dynamic_cast (src_ptr=0x2aaaac64d5c0, src_type=0x2aff93f1e220 , dst_type=0x2aff969120c0 , src2dst=0) at …/…/…/…/libstdc+±v3/libsupc++/dyncast.cc:57
#7 0x00002aff967af64b in TProof::HandleOutputOptions(TString&, TString&, int) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libProof.so
#8 0x00002aff967b112b in TProof::Process(TDSet*, char const*, char const*, long long, long long) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libProof.so
#9 0x00002aff9678fd1b in TProof::Process(char const*, long long, char const*) () from /cvmfs/cms.cern.ch/slc5_amd64_gcc472/lcg/root/5.34.03-cms4/lib/libProof.so
#10 0x00002aff955ada59 in runProof(char const*, char const*, int) () from /usr/rmt_share/scratch95/j/jha2/Work/PROOF/ProofTesting/tutorials/./proof/runProof_C.so

Root >

“”"
Let me know if you need more infromation from my side.
Thanks,
Manoj

Hi,

Are you in a position to try with a more recent 5-34 version, eg. 5-34-05 or 5-34-09 ?
There were fixes in this part, though is difficult to say if they may cause your issue.

Any how, are sure that you command is just

runProof("simple")

?
It seems that you are using an external cluster at proof://proof-00:80 and asking for 5 workers; by default runProof runs locally with a number of workers equal to the number of cores of the machine.

The problem is that for some reason the histograms are not created (of course it should not crash; but that comes after …). Can you check the logs on the workers? See root.cern.ch/drupal/content/gett … flogviewer . Just restart ROOT after the crash and run TProof::LogViewer(“proof://proof-00:80”): you should get the logs of the last session.

G. Ganis