Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Speculative Decoding] Medusa Implementation with Top-1 proposer #4978

Merged
merged 18 commits into from
Jul 10, 2024

Replacing Medusa models in test with smaller ones to prevent OOM

d9eb7ff
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Merged

[Speculative Decoding] Medusa Implementation with Top-1 proposer #4978

Replacing Medusa models in test with smaller ones to prevent OOM
d9eb7ff
Select commit
Loading
Failed to load commit list.

Annotations

2 warnings

This job succeeded