-
-
Notifications
You must be signed in to change notification settings - Fork 2k
Issues: unslothai/unsloth
[FIXED]
attention_mask = attention_mask.to(torch.bool)
#1704
opened Feb 14, 2025 by
torahoang
Open
7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
AttributeError: 'PeftModelForCausalLM' object has no attribute '_unwrapped_old_generate'
#1749
opened Feb 18, 2025 by
stromyu520
Update preprocessor_config.json for Qwen2.5-VL models on huggingface
#1747
opened Feb 18, 2025 by
Tahirc1
How much impact does the system prompt have on the output results?
#1746
opened Feb 18, 2025 by
LaFeuilleMorte
I can't pip install "unsloth[colab-new]@git+https://github.com/unslothai/unsloth.git"
#1743
opened Feb 18, 2025 by
jiupinjiandingshi
Output deterioration with model.eval() (unsloth/Meta-Llama-3.1-8B)
#1742
opened Feb 18, 2025 by
yoakiyama
non-default argument follows default argument (UnslothGKDTrainer.py, line 613)
currently fixing
Am fixing now!
#1741
opened Feb 18, 2025 by
elvis324
Low-Quality and Repetitive Text Generation in FastLanguageModel instead of AutoModelForCausalLM
#1740
opened Feb 18, 2025 by
Hyfred
while training unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit why it is showning applying chat template .
#1728
opened Feb 17, 2025 by
SnehaKumari14
abnormal model output '!!!!!!!!!!!!' at new version
currently fixing
Am fixing now!
#1727
opened Feb 17, 2025 by
zuozhenLib
Getting error with loading Llama unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
#1726
opened Feb 16, 2025 by
SnehaKumari14
CalledProcessError: Command xxx returned non-zero exit status 2.
#1725
opened Feb 16, 2025 by
QiXingRan
AttributeError: _unwrapped_old_generate
currently fixing
Am fixing now!
#1723
opened Feb 16, 2025 by
ylwlf888
Feature Request: Finetune DeepSeek (and other MoEs) to use Pregate for predictive MoE offloading and fetching
feature request
Feature request pending on roadmap
#1719
opened Feb 15, 2025 by
Thomas-MMJ
RuntimeError: The size of tensor a (2048) must match the size of tensor b (4241) at non-singleton dimension 1
#1717
opened Feb 15, 2025 by
azzedineA
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-01-18.