Skip to content

Commit eadfba3

Browse files
authored
Merge pull request Snowflake-Labs#31 from Snowflake-Labs/jeffra-patch-1
use public deepspeed in ft tutorial
2 parents ff2c9f0 + 91f25cd commit eadfba3

File tree

2 files changed

+3
-4
lines changed

2 files changed

+3
-4
lines changed

training/llama3.1/README.md

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -4,11 +4,10 @@ If you haven't already, please first read an overview of all the optimizations t
44

55
## Requirements
66

7-
In order to get started you must use our fork [DeepSpeed](https://github.com/Snowflake-Labs/DeepSpeed/tree/ds-llama), this branch is currently [pending review](https://github.com/microsoft/DeepSpeed/pull/5791) from the microsoft/DeepSpeed team and we are actively working with them to merge this as soon as possible. In the mean time please use this DeepSpeed fork. The FP8 kernels provided in DeepSpeed require `triton==2.3.x`. This tutorial has been tested with `transformers==4.43.3`, but should work with any >= 4.43 version that supports Llama 3.1.
7+
This tutorial has been tested with the following package versions, it should work with newer versions as well but if it doesn't please file an issue with any relevant details. The FP8 kernels provided in DeepSpeed require `triton==2.3.x`. This tutorial has been tested with `transformers==4.43.3`, but should work with any >= 4.43 version that supports Llama 3.1. The features required for this tutorial are available in DeepSpeed 0.14.5 and later.
88

99
```bash
10-
pip install git+https://github.com/Snowflake-Labs/DeepSpeed.git@ds-llama
11-
pip install triton==2.3.0 transformers==4.43.3 huggingface_hub[hf_transfer]
10+
pip install deepspeed==0.14.5 triton==2.3.0 transformers==4.43.3 huggingface_hub[hf_transfer]
1211
```
1312

1413
## Getting Started

training/llama3.1/requirements.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
git+https://github.com/Snowflake-Labs/DeepSpeed.git@ds-llama
1+
deepspeed==0.14.5
22
transformers==4.43.3
33
triton==2.3.0
44
huggingface_hub[hf_transfer]

0 commit comments

Comments
 (0)