[Core feature] Supporting customized k8s client in various k8s plugins #2883

honnix · 2022-09-15T09:17:24Z

Motivation: Why do you think this is important?

Organizations may have different Flyte setup, where one could set up centrally managed k8s clusters to run different types of jobs: batch, ML, etc.; while another setup could be different teams managing their own k8s clusters for different purposes.

It would be great to be able to configure the core k8s plugins to do inter-cluster communication. Plain pod is necessary.

Goal: What should the final outcome look like, ideally?

Users can configure core k8s plugin to do inter-cluster communication, essentially submitting custom resource to a specified k8s cluster instead of the one flytepropeller runs on.

Describe alternatives you've considered

Alternatively, a dedicated flytepropeller could be set up in the target cluster. However the team managing the external k8s cluster may not have sufficient knowledge to operate flytepropeller.

Another alternative is to make those core k8s plugins extensible so users can override the plugin registration process and provide their own k8s client via https://github.com/flyteorg/flyteplugins/blob/9e00636f9b2fcb84b5bc2c8dcece2b9b2bd0935e/go/tasks/pluginmachinery/k8s/plugin.go#L32. For example making https://github.com/flyteorg/flyteplugins/blob/master/go/tasks/plugins/k8s/ray/ray.go#L31-L40 public and https://github.com/flyteorg/flyteplugins/blob/master/go/tasks/plugins/k8s/ray/ray.go#L370 in a separated package, of course in a backward compatible way.

Propose: Link/Inline OR Additional context

No response

Are you sure this issue hasn't been raised already?

Yes

Have you read the Code of Conduct?

Yes

The text was updated successfully, but these errors were encountered:

hamersaw · 2022-09-26T14:39:41Z

Adding a bit of context here. The Flyte Flink plugin exposes configuration for the kubeclient. This is then passed to FlytePropeller instance during the registration process, where FlytePropeller registers the plugin to use the configured kubeclient.

In this paradigm the Flink plugin exposes specific kubeclient configuration; which, if exists overrides the default FlytePropeller configuration - allowing inter-cluster communication. This is advantageous in deployments where FlytePropeller is in one cluster, Ray may be executed in another cluster, Spark in yet another, etc.

The proposal is to add these configuration options to all k8s plugins. This involves adding kubeclient configuration to each individual plugin and including this during the FlytePropeller plugin registration process (similar to how this is currently performed in the Flink plugin). The result is configuration for each plugin that may look something like:

plugins:
  ray:
    # existing configuration omitted
    kube:
      endpoint: "localhost:30082"
  spark:
    # existing configuration omitted
    kube:
      endpoint: "localhost:30083"

hamersaw · 2022-09-26T14:47:34Z

This should be a relatively easy change. However, this deployment model is something that we don't suggest. This complicates things like secret management, inter-cluster management, exposing multiple failure domains, etc. For these reasons it's likely this will not get implemented unless it comes as a community contribution. We are very happy to help with anything, feel free to reach out.

honnix · 2022-10-13T08:28:17Z

Sorry for replying back.

I understand why this is not a recommended deployment model, but in practice, it is not always feasible to have a flytepropeller running in a k8s cluster. As examples, the cluster could be dedicated to run a specific type of load and due to security reason nothing else is allowed to run there; and the cluster could even be owned by a 3rd-party.

I think having this capability built into Flyte will help users integrate with their existing infrastructure more easily, and help build Flyte as a platform (things being integrated into Flyte, not the other way around). Please let me know what you think. Thank you for looking into this.

github-actions · 2024-06-19T00:07:05Z

Hello 👋, this issue has been inactive for over 9 months. To help maintain a clean and focused backlog, we'll be marking this issue as stale and will engage on it to decide if it is still applicable.
Thank you for your contribution and understanding! 🙏

honnix added enhancement New feature or request untriaged This issues has not yet been looked at by the Maintainers labels Sep 15, 2022

hamersaw removed the untriaged This issues has not yet been looked at by the Maintainers label Sep 26, 2022

hamersaw added this to the 1.3.0-candidate milestone Sep 26, 2022

hamersaw added hacktoberfest plugins Plugins related labels (backend or frontend) labels Sep 26, 2022

samhita-alla mentioned this issue Oct 3, 2022

Flyte Hacktoberfest 2022: Issues, Ideas and Guidelines #2917

Closed

50 tasks

cosmicBboy removed this from the 1.3.0-candidate milestone Jan 23, 2023

aybidi mentioned this issue Mar 16, 2023

feat: add support for inter-cluster communication for Ray plugin flyteorg/flyteplugins#321

Merged

8 tasks

samhita-alla removed the hacktoberfest label Sep 22, 2023

github-actions bot added the stale label Jun 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Core feature] Supporting customized k8s client in various k8s plugins #2883

[Core feature] Supporting customized k8s client in various k8s plugins #2883

honnix commented Sep 15, 2022

hamersaw commented Sep 26, 2022

hamersaw commented Sep 26, 2022

honnix commented Oct 13, 2022

github-actions bot commented Jun 19, 2024

[Core feature] Supporting customized k8s client in various k8s plugins #2883

[Core feature] Supporting customized k8s client in various k8s plugins #2883

Comments

honnix commented Sep 15, 2022

Motivation: Why do you think this is important?

Goal: What should the final outcome look like, ideally?

Describe alternatives you've considered

Propose: Link/Inline OR Additional context

Are you sure this issue hasn't been raised already?

Have you read the Code of Conduct?

hamersaw commented Sep 26, 2022

hamersaw commented Sep 26, 2022

honnix commented Oct 13, 2022

github-actions bot commented Jun 19, 2024