-
Notifications
You must be signed in to change notification settings - Fork 120
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[RMP] Create a separate Merlin package for the dataloaders #394
Comments
I've created the repo under the name 'dataloader' primarily for seo and to make sure its got a narrow focus but we can rename later. |
@benfred @jperez999 , who is doing which part of this work ? Create a new repo from the Merlin repo template |
Pushing it to prioritization to check if we will continue with examples in 22.09 |
@benfred to create issues from the bullet point. |
Problem:
A number of customers only want to use our dataloaders. They're a thin wedge that we can use to get Merlin adoption amongst teams.
The PyTorch recommendations framework TorchRec would like to make the Merlin dataloader their default without depending on all of Merlin Models. They'd like to publish blog posts about the framework, which creates an opportunity to co-promote one part of the Merlin ecosystem.
The Spark team want to use our dataloaders to accelerate their workflows in TensorFlow and have coordinated with the horovod team to make it an optional dataloader that's natively included with horovod.
Goal:
Scope:
Publish Merlin dataloaders under a new 'dataloader' package
Constraints:
Blockers
Create a new repo from the Merlin repo templateNote: Julio is blocked on this. Ben has to create a new repo'dataloader' repo createdStarting Point:
v22.08
v22.09
The text was updated successfully, but these errors were encountered: