Update README.md
Browse files
README.md
CHANGED
|
@@ -14,12 +14,12 @@ widget:
|
|
| 14 |
# Open-Assistant StableLM-7B SFT-7 Model
|
| 15 |
|
| 16 |
|
| 17 |
-
This is the
|
| 18 |
the [Open-Assistant](https://github.com/LAION-AI/Open-Assistant) project.
|
| 19 |
-
It is based on a
|
| 20 |
of assistant conversations collected through the
|
| 21 |
[https://open-assistant.io/](https://open-assistant.io/) human feedback web
|
| 22 |
-
app before
|
| 23 |
|
| 24 |
## Model Details
|
| 25 |
|
|
@@ -51,7 +51,7 @@ start generating the assistant reply.
|
|
| 51 |
- base model: [stabilityai/stablelm-base-alpha-7b](https://huggingface.co/stabilityai/stablelm-base-alpha-7b)
|
| 52 |
- checkpoint: 3 epochs (12000 steps)
|
| 53 |
|
| 54 |
-
command: `deepspeed trainer_sft.py --configs defaults
|
| 55 |
|
| 56 |
data:
|
| 57 |
```
|
|
|
|
| 14 |
# Open-Assistant StableLM-7B SFT-7 Model
|
| 15 |
|
| 16 |
|
| 17 |
+
This is the 7th iteration English supervised-fine-tuning (SFT) model of
|
| 18 |
the [Open-Assistant](https://github.com/LAION-AI/Open-Assistant) project.
|
| 19 |
+
It is based on a StableLM 7B that was fine-tuned on human demonstrations
|
| 20 |
of assistant conversations collected through the
|
| 21 |
[https://open-assistant.io/](https://open-assistant.io/) human feedback web
|
| 22 |
+
app before April 12, 2023.
|
| 23 |
|
| 24 |
## Model Details
|
| 25 |
|
|
|
|
| 51 |
- base model: [stabilityai/stablelm-base-alpha-7b](https://huggingface.co/stabilityai/stablelm-base-alpha-7b)
|
| 52 |
- checkpoint: 3 epochs (12000 steps)
|
| 53 |
|
| 54 |
+
command: `deepspeed trainer_sft.py --configs defaults stablelm-7b oasst-mix --cache_dir /home/ubuntu/data_cache --output_dir .saved/stable-lm-7b-1 --num_train_epochs 4 --deepspeed`
|
| 55 |
|
| 56 |
data:
|
| 57 |
```
|