Dear Expert:
Sometimes PROOF reports errors like the following:
+++ Starting max 5 workers following the setting of PROOF_NWORKERS
Looking up for exact location of files: OK (851 files)
Validating files: OK (851 files)
0.33: caught exception triggered by signal ‘1’ while processing dset:‘physics’, file:‘root://valtical07.cern.ch//localdisk/xrootd/users/qing/mc12_p1067/user.qing.mc12_8TeV.107660.AlpgenJimmy_AUET2CTEQ6L1_ZmumuNp0.merge.NTUP_SMWZ.e1218_s1469_s1470_r3542_r3549_p1067_2LepSkim_v2/user.qing.000791._00006.skimmed.root’, event:0 - check logs for possible stacktrace
Worker ‘valtical07.cern.ch-0.33’ has been removed from the active list
+++ Message from top master at valtical.cern.ch:1093 : marking valtical07.cern.ch:1093 (0.33) as bad
+++ Reason: received kPROOF_FATAL
+++ Most likely your code crashed on worker 0.33 at valtical07.cern.ch:1093.
+++ Please check the session logs for error messages either using
+++ the ‘Show logs’ button or executing
+++
+++ root [] TProof::Mgr("valtical.cern.ch:1093")->GetSessionLogs()->Display("0.33",0)
0.13: caught exception triggered by signal ‘1’ while processing dset:‘physics’, file:‘root://valtical05.cern.ch//localdisk/xrootd/users/qing/mc12_p1067/user.qing.mc12_8TeV.107660.AlpgenJimmy_AUET2CTEQ6L1_ZmumuNp0.merge.NTUP_SMWZ.e1218_s1469_s1470_r3542_r3549_p1067_2LepSkim_v2/user.qing.000791._00010.skimmed.root’, event:0 - check logs for possible stacktrace
Worker ‘valtical05.cern.ch-0.13’ has been removed from the active list
+++ Message from top master at valtical.cern.ch:1093 : marking valtical05.cern.ch:1093 (0.13) as bad
+++ Reason: received kPROOF_FATAL
+++ Most likely your code crashed on worker 0.13 at valtical05.cern.ch:1093.
+++ Please check the session logs for error messages either using
+++ the ‘Show logs’ button or executing
+++
+++ root [] TProof::Mgr("valtical.cern.ch:1093")->GetSessionLogs()->Display("0.13",0)
Both files are good, so it seems to me that the two PROOF worker nodes (WNs) had a problem accessing them and were then marked as bad. My question is:
Will the two files be reprocessed after the two WNs are marked as bad?
Cheers,
Gang