Commit b141935

update run script
1 parent be9be09 commit b141935

1 file changed (+2 -0 lines changed)

training/llama3.1/run-llama.sh

Lines changed: 2 additions & 0 deletions
@@ -18,8 +18,10 @@ deepspeed --hostfile $HOSTFILE lora_script.py \
 --model_name_or_path meta-llama/Meta-Llama-3.1-405B \
 --tokenizer_name_or_path meta-llama/Meta-Llama-3.1-405B-Instruct \
 --save_steps 200 --max_steps 1000 --max_train_samples 8000 \
+--lora_r 64 --lora_alpha 64 \
 --quantize --bits 8 \
 --base_weight_sharding --offload --offload_ratio 0.75 \
 --gradient_checkpointing --activation_checkpointing \
 --learning_rate 2e-5 --lr_scheduler_type linear --warmup_ratio 0.02 \
 --auth-token $HF_AUTH_TOKEN
+
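
Note on the added flags: the diff does not show how lora_script.py uses `--lora_r` and `--lora_alpha`. Assuming it follows the common LoRA convention of scaling the low-rank update by alpha / r, the values added here give a scale of 1. A minimal sketch of that arithmetic follows; the variable names are hypothetical and are not part of run-llama.sh.

```shell
#!/bin/sh
# Hypothetical helper, not part of run-llama.sh.
# Values copied from the flags added in this commit.
LORA_R=64
LORA_ALPHA=64

# Assumption: the script scales the LoRA update by alpha / r,
# as in the usual LoRA formulation. The diff does not confirm this.
LORA_SCALE=$(awk -v a="$LORA_ALPHA" -v r="$LORA_R" 'BEGIN { printf "%.2f", a / r }')

echo "lora scale = $LORA_SCALE"
```

Under that assumption, raising `--lora_alpha` while keeping `--lora_r` fixed would increase the weight of the adapter update relative to the frozen base weights.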
