Hey there!
Just wanted to clarify something regarding the code for ps and worker. I've recently started working with this kind of distributed training, so pardon my silly queries.
From what I've learned so far, the ps serves parameters to the workers, while the latter fetch them. Aside from the difference in TF_CONFIG, I've noticed no code for fetching/serving parameters dedicated specifically to the ps or to the workers; both share the same code.
I wanted to know: how do they coordinate with one another?
Hi, as far as I know, the ParameterServerStrategy employs an underlying communication protocol (like gRPC) to coordinate the variable updates and synchronization. So, when using that strategy, the coordination between ps and workers is handled behind the scenes by TF's runtime, and you don't need to write explicit code to fetch or serve parameters.
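In case a concrete sketch helps: below is roughly what this looks like in TF 2.x. It's a minimal sketch, not the exact code from this repo; the model-building step is a placeholder, and the module paths (e.g. `tf.distribute.experimental.ParameterServerStrategy`) vary a bit across TF versions. The key point is that every process runs the same script, and only the `task_type` read from TF_CONFIG decides whether a process blocks as a server or drives training as the coordinator.

```python
import tensorflow as tf

# Every process in the cluster runs this same script. The TF_CONFIG
# environment variable tells each process which role (chief, worker,
# or ps) it plays; the resolver reads it.
cluster_resolver = tf.distribute.cluster_resolver.TFConfigClusterResolver()

if cluster_resolver.task_type in ("worker", "ps"):
    # Workers and parameter servers simply start a gRPC server and
    # block; the coordinator dispatches work to them remotely.
    server = tf.distribute.Server(
        cluster_resolver.cluster_spec(),
        job_name=cluster_resolver.task_type,
        task_index=cluster_resolver.task_id,
        protocol="grpc",
    )
    server.join()
else:
    # The chief/coordinator builds the strategy. Variables created
    # under its scope are placed on the ps tasks, and workers read
    # and update them automatically during each dispatched step.
    strategy = tf.distribute.experimental.ParameterServerStrategy(
        cluster_resolver
    )
    coordinator = tf.distribute.experimental.coordinator.ClusterCoordinator(
        strategy
    )
    with strategy.scope():
        model = ...  # build the model and training step here as usual
```

So the ps and worker branches never contain fetch/serve logic of their own: they just call `server.join()`, and the chief schedules steps onto the workers via the `ClusterCoordinator`. Variable reads and writes against the ps happen over gRPC inside each step, which is why both roles can share identical code.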