NuExtract-1.5-smol

Property	Value
Parameter Count	1.71B
Model Type	Text Generation
License	MIT
Tensor Type	BF16

What is NuExtract-1.5-smol?

NuExtract-1.5-smol is a specialized language model fine-tuned from SmolLM2-1.7B, designed specifically for structured information extraction tasks. This compact model offers multilingual capabilities while maintaining high performance, despite being less than half the size of its larger counterpart NuExtract-1.5 (1.7B vs 3.8B parameters).

Implementation Details

The model is implemented using the transformers library and operates with bfloat16 precision for efficient inference. It requires a specific input format consisting of a JSON template and input text, making it highly structured and predictable in its extraction capabilities.

Built on SmolLM2-1.7B architecture
Optimized for zero-shot performance across multiple languages
Implements efficient inference with bfloat16 precision
Supports sliding window attention for handling long sequences

Core Capabilities

Structured information extraction from text
Multilingual support with strong zero-shot performance
Pure extraction focus with high accuracy
Handling of arbitrary sequence lengths
Template-based extraction with JSON output

Frequently Asked Questions

Q: What makes this model unique?

The model's ability to perform structured information extraction while maintaining a relatively small parameter count (1.71B) sets it apart. It's specifically designed to prioritize pure extraction, meaning the generated text is typically present verbatim in the source material.

Q: What are the recommended use cases?

The model excels at extracting structured information from text using JSON templates. It's particularly useful for automated data extraction, document parsing, and information retrieval tasks across multiple languages. For optimal performance, it's recommended to use the model with a temperature setting at or very close to 0.