-
Notifications
You must be signed in to change notification settings - Fork 162
Create parameter server in different thread #127
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
test/unit/test_training.py
Outdated
@@ -83,73 +84,58 @@ def test_single_machine(run_module, single_machine_training_env): | |||
single_machine_training_env.to_env_vars()) | |||
|
|||
|
|||
@pytest.mark.skipif(sys.version_info.major != 3, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This tests are flaky on py2 because of the dict key ordering. I think we should add the skipif back.
env.to_cmd_args(), env_vars, wait=False) | ||
logger.info('Running distributed training job with parameter servers') | ||
|
||
tf_config = _build_tf_config(hosts=env.hosts, current_host=env.current_host) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
should we just remove the ps_task parameter in _build_tf_config? It's not used anymore. Not critical. if you don't want to fiddle with the unit tests we can do it later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Let's wait for the tests to finish before merging.
Description of changes:
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.