RedHatAI/Qwen2.5-1.5B-quantized.w8a8
pip install mlforge-sdk && mlforge pull RedHatAI/Qwen2.5-1.5B-quantized.w8a8
Model details
About RedHatAI/Qwen2.5-1.5B-quantized.w8a8
Model Overview
- Model Architecture: Qwen2
  - Input: Text
  - Output: Text
- Model Optimizations:
  - Activation quantization: INT8
  - Weight quantization: INT8
- Intended Use Cases: Intended for commercial and research use in multiple languages. Like Qwen2.5-1.5B, this model is intended for assistant-like chat.
- Out-of-scope: Use in any manner that violates applicable laws or regulations (including trade compliance laws).
- Release Date: 10/09/2024
- Version: 1.0
- License(s): apache-2.0
- Model Developers: Neural Magic
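
To make the "INT8 weights, INT8 activations" optimization above more concrete, here is a minimal sketch of symmetric per-tensor INT8 quantization in plain Python. It is a generic illustration only: the card does not state the granularity or calibration method used for this model, so the per-tensor scaling, the `quantize_int8` and `int8_dot` helper names, and the sample values are all assumptions made for this example.

```python
def quantize_int8(values):
    """Symmetric per-tensor INT8 quantization (illustrative assumption).

    Returns the integer codes in [-127, 127] and the floating-point scale
    needed to map them back to approximate real values.
    """
    max_abs = max((abs(v) for v in values), default=0.0)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def int8_dot(weights, activations):
    """Dot product computed on INT8 codes, then rescaled to floating point."""
    w_codes, w_scale = quantize_int8(weights)
    a_codes, a_scale = quantize_int8(activations)
    # Integer multiply-accumulate; real kernels keep this in a wider integer type.
    acc = sum(w * a for w, a in zip(w_codes, a_codes))
    return acc * w_scale * a_scale


# Hypothetical sample values, chosen only to show the rounding error.
weights = [0.5, -1.0, 0.25, 0.75]
activations = [2.0, 1.0, -4.0, 0.5]

exact = sum(w * a for w, a in zip(weights, activations))
approx = int8_dot(weights, activations)
print(f"exact={exact:.4f} approx={approx:.4f}")
```

The quantized result differs slightly from the floating-point one because both operands are rounded to 8-bit codes before the multiply-accumulate. That small loss of precision is the trade-off for smaller weights and integer arithmetic.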