Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

performance: what to do if DVC is slow #4167

Open
jorgeorpinel opened this issue Dec 6, 2022 · 1 comment
Open

performance: what to do if DVC is slow #4167

jorgeorpinel opened this issue Dec 6, 2022 · 1 comment
Labels
A: docs Area: user documentation (gatsby-theme-iterative) C: guide Content of /doc/user-guide p2-nice-to-have Less of a priority at the moment. We don't usually deal with this immediately.

Comments

@jorgeorpinel
Copy link
Contributor

jorgeorpinel commented Dec 6, 2022

Generalise performance issues, e.g. based on this support case:

I try to dvc pull from my AWS bucket to an ubuntu machine and seems really slow although the internet connection should be fast... On my own mac the download works well... Seems like ubuntu is downloading data sequentially while MAC is able to do it in parallel?

Troubleshooting (1/2) GENERAL

Try same DVC version in both OS
If you do aws s3 cp it copies things faster?
Could you also please run dvc version on that Ubuntu machine in the repo
+ run first cprofile to check the results - https://github.com/iterative/dvc/wiki/Debugging,-Profiling-and-Benchmarking-DVC#profiling-dvc


Yesterday I moved the directory of data to the ubuntu computer manually (very quick - downloading took 10 s) and run dvc add on that directory to test - it was also incredibly slow. So this is clearly not related to AWS

attached is the cprofile file for reference (looks like a thread locking issue?)

Troubleshooting (2/2) SPECIFIC

is your repo/workspace located on a network mounted drive? (Yes) - you may need to configure state.dir and index.dir to be located in a local directory


that worked for dvc add ✅ but dvc pull ❌ still has the same issue as before...
Actually even dvc pull now ✅ works as expected.

@jorgeorpinel jorgeorpinel added A: docs Area: user documentation (gatsby-theme-iterative) C: guide Content of /doc/user-guide labels Dec 6, 2022
@jorgeorpinel
Copy link
Contributor Author

@dberenbaum @efiop any other common troubleshooting steps to check for DVC speed? I imagine this is usually a thing for indexing or transferring files, any other operations that sometimes bug down? Thanks

@dberenbaum dberenbaum added the p2-nice-to-have Less of a priority at the moment. We don't usually deal with this immediately. label Oct 16, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A: docs Area: user documentation (gatsby-theme-iterative) C: guide Content of /doc/user-guide p2-nice-to-have Less of a priority at the moment. We don't usually deal with this immediately.
Projects
None yet
Development

No branches or pull requests

2 participants