microsoft/phi-4
-------------------------------------------------------------------------------------------------------- Developers Microsoft Research Description phi-4 is a state-of-the-art open model built upon a blend of synthetic datasets, data from filtered public domain websites, and acquired academic books and Q&A datasets. Th
pip install mlforge-sdk && mlforge pull microsoft/phi-4
Model details
About microsoft/phi-4
-------------------------------------------------------------------------------------------------------- Developers Microsoft Research Description phi-4 is a state-of-the-art open model built upon a blend of synthetic datasets, data from filtered public domain websites, and acquired academic books and Q&A datasets. The goal of this approach was to ensure that small capable models were trained with data focused on high quality and advanced reasoning.phi-4 underwent a rigorous enhancement and alignment process, incorporating both supervised fine-tuning and direct preference optimization to ensure precise instruction adherence and robust safety measures Architecture 14B parameters, dense decoder-only Transformer model Inputs Text, best suited for prompts in the chat format Context length 16K tokens GPUs 1920 H100-80G Training time 21 days Training data 9.8T tokens Outputs Generated text in response to input Dates October 2024 – November 2024 Status Static model t