This issue thread is for reporting the reproducibility steps and results of OLMo-3-Think-SFT using OLMo-core and open-instruct. We are targeting the 7B parameter model.
- Specifically, we start from the Hugging Face model and convert it into the OLMo-core format using this script.
- We also need to prepare the data before running the training script. Since OLMo trains on packed instances, we need to convert the data from the HF format.
- Then we can run the SFT using the script in the OLMo-core repository.
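To illustrate the data-preparation step, here is a minimal sketch of greedy sequence packing: tokenized examples are concatenated into fixed-length bins up to the maximum context length. This is an illustrative toy, not the actual open-instruct or OLMo-core packing implementation, and the function name and truncation policy are assumptions.

```python
def pack_sequences(examples, max_len):
    """Greedily pack tokenized examples into bins of at most max_len tokens.

    Illustrative sketch only — real packing code also tracks attention
    masks and per-example boundaries so loss is computed correctly.
    """
    packs, current = [], []
    for ex in examples:
        if len(ex) > max_len:
            ex = ex[:max_len]  # assumed policy: truncate overly long examples
        if len(current) + len(ex) > max_len:
            packs.append(current)
            current = []
        current = current + list(ex)
    if current:
        packs.append(current)
    return packs


# Tiny example with token IDs standing in for tokenized instances.
print(pack_sequences([[1, 2, 3], [4, 5], [6, 7, 8, 9]], max_len=6))
# → [[1, 2, 3, 4, 5], [6, 7, 8, 9]]
```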
The scripts for running it on SLURM can already be found over here.
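For reference, a SLURM submission for a run like this typically looks something like the fragment below. All resource numbers, file names, and the entry point are placeholders, not the actual scripts linked above.

```shell
#!/bin/bash
# Hypothetical sbatch sketch for a multi-node SFT run; adjust partition,
# node count, and paths to match your cluster and the linked scripts.
#SBATCH --job-name=olmo3-think-sft
#SBATCH --nodes=4
#SBATCH --gpus-per-node=8
#SBATCH --time=48:00:00

# Launch one training process per node; the entry point and config
# file names here are placeholders.
srun python train_sft.py --config olmo3_7b_sft.yaml
```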