Name: allenai/tulu-3-sft-olmo-2-mixture-0225
Creator: allenai
License: Unknown
Keywords: huggingface, size_categories:100K<n<1M, format:parquet, modality:text, library:datasets, library:dask, library:mlcroissant, library:polars, region:us

About allenai/tulu-3-sft-olmo-2-mixture-0225

Used to train OLMo 2 32B. From the blog post: Filtered out instructions from the SFT dataset and the chosen responses of the preference data that included mentions of a date cutoff from the synthetic data generation process. This resulted in a new version of the instruction dataset, Tulu 3 SFT Mixture 0225, and preference dataset, OLMo-2-32B-pref-mix-0325. We use majority voting to improve the quality of answers to our synthetic math questions. For our Persona MATH and Grade School Math datasets from Tülu 3, we only include prompts and completions where the model reaches a majority vote over 5 completions. New versions of the math and grade school math datasets are available.

allenai/tulu-3-sft-olmo-2-mixture-0225

Dataset details

About allenai/tulu-3-sft-olmo-2-mixture-0225