Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Core feature] Support cache overwrite flag at task level #6050

Open
2 tasks done
hebiao064 opened this issue Nov 26, 2024 · 3 comments
Open
2 tasks done

[Core feature] Support cache overwrite flag at task level #6050

hebiao064 opened this issue Nov 26, 2024 · 3 comments
Assignees
Labels
enhancement New feature or request

Comments

@hebiao064
Copy link

Motivation: Why do you think this is important?

Currently, when tasks need to bypass cache and force re-execution, developers need to modify the cache_version parameter. However, this is an internal implementation detail that shouldn't be exposed to end users. Users should have a more intuitive way to control cache behavior at runtime.

Goal: What should the final outcome look like, ideally?

Proposal

Add support for a cache_overwrite or force_recompute flag at the task level that can be set during execution. This would allow:

  1. Task definition remains unchanged
  2. Users can control cache behavior at runtime
  3. Clear separation between developer concerns (cache_version) and user control

Describe alternatives you've considered

  1. Workflow-Level Cache Control

    • Add cache control parameters at workflow level only
    • Pros: Simpler API surface, clearer execution tracking
    • Cons: Too coarse-grained, can't control individual tasks
  2. Using Cache Version

    • Current approach: Modify cache_version
    • Pros: Already implemented
    • Cons: Mixes developer and user concerns, requires code changes

Propose: Link/Inline OR Additional context

Example API could look like:

@task(cache=True, overwrite_cache=False)  # Default behavior
def my_task():
    pass

# At Runtime
wf.my_task(overwrite_cache=True)  # Force recompute

Implementation (rough idea):

  • Add one more field called overwrite_cache for Flyte Task Decorator
  • Add helper func: IsTaskCacheOverwrite for cache.go
  • In executor.go, check if IsTaskCacheOverwrite, if yes, return CatalogCacheStatus_CACHE_SKIPPED

Are you sure this issue hasn't been raised already?

  • Yes

Have you read the Code of Conduct?

  • Yes
@hebiao064 hebiao064 added enhancement New feature or request untriaged This issues has not yet been looked at by the Maintainers labels Nov 26, 2024
@kumare3
Copy link
Contributor

kumare3 commented Dec 3, 2024

the UX is harder than it looks. As you have to find the right task and replace it. the same task maybe repeated multiple times. If you want to simply overwrite the value - change the cache_version and it will achieve the same effect. This can be overriden in the workflow during the registration time. Today we do not have an easy way to pass it in during execution. Suggest a nice UX for a complex workflow

@hebiao064
Copy link
Author

hebiao064 commented Dec 5, 2024

@kumare3 Thanks for reply.

Regarding the cache_version solution, it would work for adhoc/exprimental workflow code (tho it requires some level of understanding of how caching works in Flyte, I know it's easy but however in our scenario)

In our company, we built a set of reusable workflows and tasks (as ML Infra Team), our users (AI Verticals like Feed Ranking) will be use the wrapper like this

# Code ML Infra Team in `trainer.py` from repo_A
@task(cache=False, cache_version="v1")
def trainer_task()
    pass

Code from User Team in workflow.py from repo_B

from repo_A.trainer import trainer_task

@workflow
def my_wf:
    trainer_task().with_overrides(cache=True, cache_version="v2")

So workflow.py will be shared by a user team, and multiple developers will be working on the same workflow code.
Hence different developer may have trouble to coordinate on the cache_version.

If we provide the overwrite_cache for tasks, user can simply mark overwrite_cache=True to make sure this run will be writing to cache instead of coordinate the cache_version with his/her teammates.

So I would suggest support overwrite_cache in the task decorator and I would like to contribute if the idea is not bad to you.

@kumare3
Copy link
Contributor

kumare3 commented Dec 5, 2024

Interesting- got it.
Cc @eapolinario

@eapolinario eapolinario self-assigned this Jan 16, 2025
@eapolinario eapolinario removed the untriaged This issues has not yet been looked at by the Maintainers label Jan 16, 2025
@davidmirror-ops davidmirror-ops moved this from Backlog to Assigned in Flyte Issues/PRs maintenance Jan 16, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
Status: Assigned
Development

No branches or pull requests

3 participants