nm-testing/SmolLM-1.7B-Instruct-quantized.w4a16
pip install mlforge-sdk && mlforge pull nm-testing/SmolLM-1.7B-Instruct-quantized.w4a16
Model details
About nm-testing/SmolLM-1.7B-Instruct-quantized.w4a16
Model Overview
- Model Architecture: SmolLM-135M-Instruct
  - Input: Text
  - Output: Text
- Model Optimizations:
  - Weight quantization: INT4
- Intended Use Cases: Intended for commercial and research use in English. Like SmolLM-135M-Instruct, this model is intended for assistant-like chat.
- Out-of-scope: Use in any manner that violates applicable laws or regulations (including trade compliance laws). Use in languages other than English.
- Release Date: 8/23/2024
- Version: 1.0
- License(s): Apache-2.0
- Model Developers: Neural Magic
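The "w4a16" suffix in the model name denotes 4-bit weights with 16-bit activations. To illustrate what INT4 weight quantization means in general, here is a minimal sketch of symmetric per-group quantization in plain Python. The scheme (symmetric, one scale per group), the group size, and the function names `quantize_int4` and `dequantize_int4` are assumptions made for illustration; the card does not state which scheme or tooling produced this checkpoint.

```python
def quantize_int4(weights, group_size=4):
    """Symmetric per-group quantization to signed 4-bit integers in [-8, 7].

    Illustrative only: the scheme and group size are assumptions,
    not the settings used for this model.
    """
    q_values, scales = [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        max_abs = max(abs(w) for w in group)
        # One scale per group; fall back to 1.0 for an all-zero group.
        scale = max_abs / 7 if max_abs > 0 else 1.0
        scales.append(scale)
        for w in group:
            q = round(w / scale)
            # Clamp to the signed 4-bit range.
            q_values.append(max(-8, min(7, q)))
    return q_values, scales


def dequantize_int4(q_values, scales, group_size=4):
    """Map 4-bit integers back to approximate floating-point weights."""
    return [q * scales[i // group_size] for i, q in enumerate(q_values)]


if __name__ == "__main__":
    weights = [0.7, -0.35, 0.1, 0.0, 1.4, -1.4, 0.2, 0.6]
    q, s = quantize_int4(weights)
    print("quantized:", q)
    print("scales:", s)
    print("restored:", dequantize_int4(q, s))
```

In this sketch each weight is stored as a 4-bit integer plus a shared per-group scale, so the reconstruction error for any weight is at most half of its group's scale.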