Pull requests: huggingface/transformers
[bugfix] Update modeling_llama.py so it skips keys correctly (#36289), opened Feb 19, 2025 by HDCharles
[modular] allow multiple modular files in the same model folder (#36287), opened Feb 19, 2025 by Cyrilvallez
[WIP] Set default processing device to best available device in fast image processors (#36268), opened Feb 18, 2025 by yonigozlan
Remove hardcoded slow image processor class in processors supporting fast ones (#36266), opened Feb 18, 2025 by yonigozlan
Upgrading torch version and cuda version in quantization docker (#36264), opened Feb 18, 2025 by MekkCyber
Add D-FINE Model into Transformers (#36261), opened Feb 18, 2025 by VladOS95-cyber (4 of 5 tasks)
[generate] remove legacy cache in t5 and whisper-based models (deprecated in v4.48) (#36238), opened Feb 17, 2025 by gante
Prevent Reinitialization of Resized LM Head When tie_word_embeddings is False #35141 (#36221), opened Feb 16, 2025 by sambhavnoobcoder
fix: prevent second save in the end of training if last step was saved already (#36219), opened Feb 16, 2025 by NosimusAI (2 of 5 tasks)
Improvements in attention_forward functions (#36218), opened Feb 15, 2025 by mseeger (3 of 5 tasks)
[WIP] Add a dedicated tokenizer for byte level transformers (#36216), opened Feb 15, 2025 by apehex