Tensorflow + LSF. Tensor flow distributed on an LSF cluster

advertisements

How to setup tensorflow to work with LSF job scheduler? I have almost no experience with LSF. tf.train.ClusterSpec needs ip addresses of workers and parameter servers. Is it possible to obtain them from the LSF environment? Are there any success stories of making them work together?

EDIT:

Found some explanations how to achieve similar goal on Slurm cluster Running TensorFlow on a Slurm Cluster?. Basically, i'm looking for something like this but for LSF job scheduler


There's a blog post and sample launch script for TensorFlow on LSF here.