Hi…
I am trying to use PoD in my analysis.
As a first step, I am just trying to set up a PoD cluster
out of 2 machines using ssh.
I installed PoD on my machine and then ran
pod-server start, which prints:
Starting PoD server...
updating xproofd configuration file...
starting xproofd...
starting PoD agent...
preparing PoD worker package...
selecting pre-compiled bins to be added to worker package...
PoD worker package: /home/chinmay/.PoD/wrk/PoDWorker.sh
------------------------
XPROOFD [23249] port: 21001
PoD agent [23281] port: 22001
PROOF connection string: chinmay@localhost.localdomain:21001
------------------------
Then I ran
pod-ssh -c pod_ssh.cf --debug submit (pod_ssh.cf is attached here), which prints:
** [Thu, 14 Sep 2017 14:33:02 +0530] preparing PoD worker package...
** [Thu, 14 Sep 2017 14:33:02 +0530] selecting pre-compiled bins to be added to worker package...
** [Thu, 14 Sep 2017 14:33:02 +0530] PoD worker package: /home/chinmay/.PoD/wrk/PoDWorker.sh
** [Thu, 14 Sep 2017 14:33:02 +0530] pod-ssh config contains an inline shell script. It will be injected it into wrk. package
** [Thu, 14 Sep 2017 14:33:02 +0530] preparing PoD worker package...
** [Thu, 14 Sep 2017 14:33:02 +0530] inline shell script is found and will be added to the package...
** [Thu, 14 Sep 2017 14:33:02 +0530] selecting pre-compiled bins to be added to worker package...
** [Thu, 14 Sep 2017 14:33:02 +0530] PoD worker package: /home/chinmay/.PoD/wrk/PoDWorker.sh
** [Thu, 14 Sep 2017 14:33:02 +0530] There are 5 threads in the tread-pool.
** [Thu, 14 Sep 2017 14:33:02 +0530] Number of PoD workers: 1
** [Thu, 14 Sep 2017 14:33:02 +0530] Number of PROOF workers: 4
** [Thu, 14 Sep 2017 14:33:02 +0530] Workers list:
** [Thu, 14 Sep 2017 14:33:02 +0530] [nilay] with 4 workers at chinmay@10.159.63.110:/home/chinmay/tmp/nilay
nilay [Thu, 14 Sep 2017 14:33:02 +0530] pod-ssh-submit-worker is started for chinmay@10.159.63.110 (dir: /home/chinmay/tmp/nilay, nworkers: 4, sshopt: -X)
** [Thu, 14 Sep 2017 14:33:03 +0530]
*******************
Successfully processed tasks: 1
Failed tasks: 0
*******************
After this, when I run root and try to open a PROOF session, I get the following error:
[chinmay@localhost 3.16]$ root -l
root [0] TProof *proof = TProof::Open("chinmay@localhost.localdomain:21001")
Starting master: opening connection ...
Starting master: OK
no resource currently available for this session: please retry later
Error in <TProof::StartSlaves>: no resources available or problems setting up workers (check logs)
Error in <TProof::Open>: new session could not be created
(TProof *) nullptr
The pod.agent.client.log file shows the following error:
2017-09-14 14:32:10.277 INF 0 [LOG singleton:thread-12171] LOG singleton has been initialized.
2017-09-14 14:32:10.277 INF 0 [PROOFAgent:thread-12171] pod-agent v.3.16
2017-09-14 14:32:10.277 INF 0 [CORE:thread-12171] Bringing >>> AgentClient <<< to life...
2017-09-14 14:32:10.277 INF 0 [AgentClient:thread-12171] Detected xpd [12119] on port 21001
2017-09-14 14:32:10.277 INF 0 [AgentClient:thread-12171] starting a monitor
2017-09-14 14:32:10.277 DBG 0 [AgentClient:thread-12171] Creating a PROOF configuration file...
2017-09-14 14:32:10.280 INF 0 [AgentClient:thread-12171] looking for PROOFAgent server to connect...
2017-09-14 14:32:10.280 ERR 1 [AgentClient:thread-12171] Can't connect to the server
Error on Socket<:55156>: Transport endpoint is not connected
2017-09-14 14:32:10.280 INF 0 [CORE:thread-12171] Shutting down >>> AgentClient <<<
2017-09-14 14:32:10.280 INF 0 [CORE:thread-12171] Shutting down >>> PROOFAgent <<<
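To rule out a plain network problem, the reachability of the PoD agent port can be checked from the worker machine. Below is a minimal sketch in Python, assuming the agent listens on port 22001 as reported by pod-server above. SERVER_HOST is a hypothetical placeholder and has to be replaced with the address of the pod-server machine as seen from the worker. The check only tests a raw TCP connect and says nothing about the PoD protocol itself.

```python
import socket


def port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unresolvable hosts.
        return False


if __name__ == "__main__":
    # Hypothetical values: replace SERVER_HOST with the address of the
    # machine running pod-server, as seen from the worker machine.
    SERVER_HOST = "localhost"
    AGENT_PORT = 22001  # PoD agent port taken from the pod-server output
    state = "reachable" if port_open(SERVER_HOST, AGENT_PORT) else "NOT reachable"
    print(f"{SERVER_HOST}:{AGENT_PORT} is {state}")
```

If the port turns out not to be reachable from the worker, the problem is probably at the network or firewall level rather than in the PoD configuration.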
Can someone help?