-
Notifications
You must be signed in to change notification settings - Fork 30.2k
Pull requests: huggingface/transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Making compute_loss_func always take priority in Trainer
#40626
opened Sep 2, 2025 by
Flakes342
Loading…
2 of 5 tasks
Use logical or for generate_masks_with_special_tokens_and_transfer_map in grounding dino models
#40625
opened Sep 2, 2025 by
lmarshall12
•
Draft
5 tasks
Fix: PIL image load in Processing utils apply_chat_template
#40622
opened Sep 2, 2025 by
abdokaseb
Loading…
1 of 5 tasks
[CP] Add attention_mask to the buffer when the mask is causal
#40619
opened Sep 2, 2025 by
kashif
Loading…
feat: err when unsupported attn impl is set w/
--continuous_batching
#40618
opened Sep 2, 2025 by
McPatate
Loading…
docs: update GPTBigCode model card with standardized usage section
#40615
opened Sep 2, 2025 by
FrankDannie
Loading…
5 tasks done
Fix issue of wrong number of tokens per GPUs affecting loss normalization in trainer.py
#40610
opened Sep 2, 2025 by
SamuelBarryCS
Loading…
Integrate colqwen2.5 using colqwen2 modelling code
#40600
opened Sep 1, 2025 by
sahil-kabir
•
Draft
2 of 5 tasks
feat(utils): add vision utils for embedding images and getting the hidden size
#40587
opened Sep 1, 2025 by
AmitMY
Loading…
4 of 5 tasks
Allow custom args in
custom_generate
Callables and unify generation args structure
#40586
opened Sep 1, 2025 by
manueldeprada
Loading…
feat(deepseek_v3): Add grouped GEMM kernel for faster MoE computation
#40583
opened Sep 1, 2025 by
bzantium
Loading…
5 tasks done
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.