Hi Kevin,

The command seems ok to me. But this log message screenshot doesn't consist
of many clues. Can you post the whole container logs?

BR,
Zhankun

On Thu, 13 Jun 2019 at 01:19, kevin su <[email protected]> wrote:

> Hi all,
>
> I use latest hadoop from github, compile by myself.
>
> try to run hadoop-submarine-all-0.3.0-SNAPSHOT-hadoop-3.1.jar to run
> tensorflow
> but i got Shell error output: Invalid arguments for cgroups resources:
> /opt/hadoop-3.3.0/logs/userlogs
>
> command = yarn jar hadoop-submarine-all-0.3.0-SNAPSHOT-hadoop-3.1.jar job
> run --name tf-job-006 --verbose --docker_image
> hadoopsubmarine/tf-1.8.0-gpu:0.0.1 --input_path
> hdfs://default/dataset/cifar-10-data --env
> DOCKER_JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64/jre/ --env
> DOCKER_HADOOP_HDFS_HOME=/hadoop-3.1.0 --num_workers 1 --worker_resources
> memory=2G,vcores=2 --worker_launch_cmd "cd
> /test/models/tutorials/image/cifar10_estimator && python cifar10_main.py
> --data-dir=%input_path% --job-dir=%checkpoint_path% --train-steps=10000
> --eval-batch-size=16 --train-batch-size=16  --sync" --tensorboard
> --tensorboard_docker_image wtan/tf-1.8.0-cpu:0.0.1
>
> my container-executor.cfg is in the link
>
> Thanks in advanced
>
> BR
> Kevin
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]

Reply via email to