Create parameter server in different thread #127

Closed
mvsusp wants to merge 14 commits

@mvsusp mvsusp commented Nov 26, 2018

Description of changes:

  • Create the parameter server in a separate thread instead of running the customer script twice.
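
A minimal sketch of the idea, assuming the parameter server is a blocking call that can run in a background thread while training runs in the main thread. `run_parameter_server`, `run_training`, and `train_with_parameter_server` are hypothetical stand-ins for illustration, not functions from this PR; in TensorFlow 1.x the blocking call would typically be something like `tf.train.Server(...).join()`.

```python
import threading


def run_parameter_server(stop_event):
    # Hypothetical stand-in for a blocking parameter server call.
    # Here it simply blocks until it is told to stop.
    stop_event.wait()


def run_training(results):
    # Hypothetical stand-in for running the customer script once.
    results.append('trained')


def train_with_parameter_server():
    results = []
    stop_event = threading.Event()

    # Start the parameter server in a separate thread. Marking it as a
    # daemon means it cannot keep the process alive after training ends.
    ps_thread = threading.Thread(target=run_parameter_server,
                                 args=(stop_event,))
    ps_thread.daemon = True
    ps_thread.start()

    # Run training exactly once, in the main thread.
    run_training(results)

    # Shut the stand-in server down and wait briefly for the thread.
    stop_event.set()
    ps_thread.join(timeout=5)
    return results, ps_thread.is_alive()
```

A real server may not offer a clean shutdown hook like `stop_event`; in that case the daemon flag alone lets the process exit once training finishes.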

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

@@ -83,73 +84,58 @@ def test_single_machine(run_module, single_machine_training_env):
single_machine_training_env.to_env_vars())


@pytest.mark.skipif(sys.version_info.major != 3,

These tests are flaky on py2 because of dict key ordering. I think we should add the skipif back.
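
For context, a sketch of how an assertion can be made independent of dict ordering, as a possible alternative to skipping on Python 2. `pair_cmd_args` is a hypothetical helper, and the flat `--key value` layout is an assumption about the argument list, not something confirmed by this PR.

```python
def pair_cmd_args(args):
    """Group a flat ['--key', 'value', ...] list into a set of pairs."""
    if len(args) % 2 != 0:
        raise ValueError('expected an even number of arguments')
    # Pair each flag with the value that follows it; a set ignores order.
    return set(zip(args[0::2], args[1::2]))


# The same arguments in two different orders, as dict iteration on
# Python 2 might produce them.
first = ['--epochs', '10', '--lr', '0.1']
second = ['--lr', '0.1', '--epochs', '10']

# A plain list comparison depends on ordering; the paired comparison does not.
lists_match = first == second
pairs_match = pair_cmd_args(first) == pair_cmd_args(second)
```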

env.to_cmd_args(), env_vars, wait=False)
logger.info('Running distributed training job with parameter servers')

tf_config = _build_tf_config(hosts=env.hosts, current_host=env.current_host)

Should we just remove the ps_task parameter in _build_tf_config? It's not used anymore. Not critical; if you don't want to fiddle with the unit tests, we can do it later.
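
A rough sketch of what a config builder without a `ps_task` parameter could look like. This is not the repository's `_build_tf_config`; `build_tf_config_sketch`, the port numbers, and the choice of the first host as master are all assumptions made for illustration. Only the general `cluster`/`task` shape of `TF_CONFIG` is taken from TensorFlow's conventions.

```python
def build_tf_config_sketch(hosts, current_host, port=2222):
    """Build a TF_CONFIG-style dict from hosts only (hypothetical helper)."""
    addresses = ['{}:{}'.format(host, port) for host in hosts]

    # Assumption: the first host acts as master, the rest as workers.
    cluster = {'master': addresses[:1]}
    if len(hosts) > 1:
        cluster['worker'] = addresses[1:]
        # Assumption: every host also runs a parameter server on port + 1.
        cluster['ps'] = ['{}:{}'.format(host, port + 1) for host in hosts]

    # The task is derived from the host's position, so no ps_task flag
    # is needed to describe the training process itself.
    index = hosts.index(current_host)
    if index == 0:
        task = {'type': 'master', 'index': 0}
    else:
        task = {'type': 'worker', 'index': index - 1}

    return {'cluster': cluster, 'task': task, 'environment': 'cloud'}
```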


@icywang86rui icywang86rui left a comment


LGTM. Let's wait for the tests to finish before merging.
