Pytorch plugin usage example #11

igorvalko · 2020-06-12T09:27:28Z

No description provided.

kumare3

Couple small changes. Thank you

kumare3 · 2020-06-12T14:00:26Z

pytorch/README.rst

+
+Before running this make sure that
+    - pytorch plugin is enabled in flytepropeller's config
+    - `Kubeflow pytorch operator`_ is installed in your k8s cluster


Probably say that kustomize is configured to deploy the PyTorch operator.

kumare3 · 2020-06-12T14:04:04Z

pytorch/workflows/mnist.py

+    per_replica_gpu_limit="1",
+)
+def mnist_pytorch_job(workflow_params, no_cuda, batch_size, test_batch_size, epochs, learning_rate, sgd_momentum, seed, log_interval, save_model, dir, out):
+    backend_type = dist.Backend.GLOO


Eventually I want this boilerplate code to live in the PyTorch wrapper. Can you add a TODO: simplify by abstracting the boilerplate?

Should we leave TODOs in example/boilerplate code? Maybe it's better to create an issue to remember this idea?

An issue works too, either way.

An issue is great, and the TODO can point to the issue :)
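
For reference, here is one way the boilerplate discussed in this thread could be factored out. This is a minimal sketch, not the flytekit wrapper: `world_size`, `should_distribute`, and `pick_backend` are hypothetical helper names, and the sketch assumes the replica count is exposed through a `WORLD_SIZE` environment variable. Choosing NCCL for GPU runs is also an assumption of the sketch; the task in this PR sets Gloo. It returns plain backend names so that it runs without torch installed.

```python
import os


def world_size(env=None):
    """Number of replicas, read from WORLD_SIZE (assumed to be set for each replica)."""
    env = os.environ if env is None else env
    return int(env.get("WORLD_SIZE", 1))


def should_distribute(env=None):
    """Distributed setup is only needed when more than one replica runs."""
    return world_size(env) > 1


def pick_backend(use_cuda):
    """Return a torch.distributed backend name: NCCL for GPU runs, Gloo otherwise."""
    return "nccl" if use_cuda else "gloo"
```

In the task body, the chosen name could then be passed to `dist.init_process_group(backend=...)` when `should_distribute()` is true.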

kumare3 · 2020-06-12T14:05:06Z

pytorch/workflows/mnist.py

+        dir=dir
+    )
+
+    accuracies = Output(mnist_result.outputs.out, sdk_type=Types.String)


Shouldn't this be a list of floats?

https://github.com/lyft/flytesnacks/pull/11/files#diff-daa70d7f0f13b99d1e904cc0cb7fd7a7R152 I'm stringifying it here so that the UI only has to print the string I've provided.

We don't need to do that. Since it is just an array of floats, the UI can show it if we mark it as an array of floats. Do you want to change that?
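
To illustrate the point about typing the output, here is a small plain-Python sketch (no Flyte APIs involved). The values are illustrative only, not results from this PR. A stringified list has to be parsed back before anything can be computed on it, while a typed list of floats can be used directly.

```python
import ast

accuracies = [0.91, 0.95, 0.97]  # illustrative values, not results from this PR

# Stringified output: consumers receive one opaque string and must parse it back.
as_string = str(accuracies)
recovered = ast.literal_eval(as_string)

# Typed output: the list of floats is usable directly.
best = max(accuracies)
```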

kumare3 · 2020-06-12T20:49:53Z

pytorch/Dockerfile

+
+RUN pip install awscli
+
+RUN pip install tensorboardX==2.0.0 flytekit[pytorch]==0.8.1


How does TensorBoard work in this case?

https://github.com/kubeflow/pytorch-operator/blob/master/examples/mnist/mnist.py#L6

Hmm, we will need to run TensorBoard somewhere, I guess, right?

kumare3 · 2020-06-12T20:55:31Z

pytorch/workflows/mnist.py

+    log_interval=Types.Integer,
+    save_model=Types.Boolean,
+    dir=Types.String)
+@outputs(out=Types.String)


I think this needs some improvements like changing output to be the model. @wild-endeavor can you help?

Sorry, I don't understand. Currently it seems that the output out is a string, created from str(accuracies)

I think we should change the name from "out" to "accuracies", and I feel like we should make it the same type that's currently the output of []epoch_step(). I don't know what epoch_step produces - can you fill me in @igorvalko ?

This is a separate question from saving the model, though. Do you mean we want to save the model as a blob? As a demo, I don't know how useful that would be: one would have to download the model and use it in order to get numbers (like these accuracies) out of it, right? We can save the model file, sure, but I feel like the accuracies output makes more sense.

First of all, MNIST is a sort of 'hello world' in the PyTorch world (https://github.com/pytorch/examples/tree/master/mnist).

One can already store the model by supplying save_model=true. We could adjust the example to store it on S3 (for now it is the local FS), but I don't know the internals of the model object: whether it collects the distributed model state on the master or whether that has to be done manually.
An epoch is a model training iteration. Accuracy is obtained by evaluating the test dataset on the trained model after each iteration.

I considered accuracy good enough to demonstrate that the job was done, since:

- it implies both training and evaluation steps
- it is a succinct result to show
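
As a rough illustration of the per-epoch accuracy described above, here is a minimal plain-Python sketch. The `accuracy` helper and the prediction data are made up for the example and are not taken from mnist.py; in the real job the predictions would come from evaluating the model on the test set after each epoch.

```python
def accuracy(predictions, labels):
    """Fraction of predictions that match the labels."""
    correct = sum(1 for p, y in zip(predictions, labels) if p == y)
    return correct / len(labels)


labels = [7, 2, 1, 0]
# Hypothetical predictions after each of three epochs (made-up data).
predictions_per_epoch = [
    [7, 2, 0, 1],
    [7, 2, 1, 1],
    [7, 2, 1, 0],
]

# One accuracy value per epoch: a list of floats.
accuracies = [accuracy(p, labels) for p in predictions_per_epoch]
```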

@igorvalko it is possible to output a model as a blob object in Flyte. Here is an example: https://github.com/lyft/flytesnacks/blob/master/python/multi_step_linear/diabetes_xgboost.py#L104 The benefit is that you can then simply point your notebook to flyteadmin and load the model.

And let's rename out to accuracy?

Suggested change

-@outputs(out=Types.String)
+@outputs(accuracy=[Types.Float], model=Types.Blob)

kumare3 · 2020-06-16T15:47:28Z

@igorvalko thank you for all the patience and great work. There is just one small change requested. The reason is that examples are very important to get close to right, as they get replicated quickly.

wild-endeavor · 2020-06-16T17:39:37Z

@kumare3 I think this is fine, right?

kumare3 · 2020-06-16T18:43:35Z

Looks great to me, let's merge!

igorvalko added 2 commits June 12, 2020 12:26

pytorch plugin usage example

5d9bba8

cleanup and revert to original async dataset loader

528ba12

igorvalko requested review from kumare3, matthewphsmith and wild-endeavor as code owners June 12, 2020 09:27

kumare3 requested changes Jun 12, 2020

polishing readme

c4e8666

wild-endeavor requested a review from kumare3 June 12, 2020 20:47

kumare3 requested changes Jun 12, 2020

igorvalko added 2 commits June 16, 2020 20:04

outputs changed

0adc929

formatting

dcdc15a

kumare3 self-requested a review June 16, 2020 18:43

kumare3 approved these changes Jun 16, 2020

wild-endeavor merged commit 761426a into flyteorg:master Jun 16, 2020

kumare3 mentioned this pull request Jun 16, 2020

[Backend][Plugin] Kubeflow operators - Pytorch flyteorg/flyte#338

Closed

6 tasks




