RedHatAI/Llama-3.2-1B-Instruct-FP8
Model Overview - Model Architecture: Llama-3 - Input: Text - Output: Text - Model Optimizations: - Activation quantization: FP8 - Weight quantization: FP8 - Intended Use Cases: Intended for commercial and research use multiple languages. Similarly to Llama-3.2-1B-Instruct, this models is intended for assistant-like chat. - Out-of-scope: Use in any manner that violates applicable laws or re
pip install mlforge-sdk && mlforge pull RedHatAI/Llama-3.2-1B-Instruct-FP8
Model details
About RedHatAI/Llama-3.2-1B-Instruct-FP8
Model Overview - Model Architecture: Llama-3 - Input: Text - Output: Text - Model Optimizations: - Activation quantization: FP8 - Weight quantization: FP8 - Intended Use Cases: Intended for commercial and research use multiple languages. Similarly to Llama-3.2-1B-Instruct, this models is intended for assistant-like chat. - Out-of-scope: Use in any manner that violates applicable laws or regulations (including trade compliance laws). - Release Date: 9/25/2024 - Version: 1.0 - License(s): Llama3.2 - Model Developers: Neural Magic