Teuken-7B-instruct-research-v0.4

Maintained By
openGPT-X

Property         Value
Parameter Count  7.45B
Model Type       Transformer-based decoder-only
Languages        24 EU languages
License          Other (Custom)
Paper            Research Paper

What is Teuken-7B-instruct-research-v0.4?

Teuken-7B-instruct-research-v0.4 is an instruction-tuned multilingual language model built for the 24 EU languages. It was developed by a consortium that includes Fraunhofer, Forschungszentrum Jülich, TU Dresden, and DFKI. The base model was pre-trained on 4 trillion tokens and then instruction-tuned to handle complex language tasks across all 24 languages.

Implementation Details

The model has 32 layers, a hidden size of 4096, and 32 attention heads. It uses grouped-query attention (GQA) with 2 query groups and the SwiGLU activation function. Training ran in bfloat16 precision on the JUWELS Booster supercomputer with NVIDIA A100 GPUs.

  • Sequence length: 4096 tokens
  • Rotary position embeddings
  • RMSNorm layer normalization
  • No dropout
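
To make the figures above concrete, the following is a minimal sketch of the shape arithmetic they imply. It is not code from the openGPT-X project. It assumes that "2 query groups" means two shared key/value heads, and that the key/value cache is stored in bfloat16 (2 bytes per value) like the training precision. The class name `TeukenShape` and its methods are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class TeukenShape:
    """Hypothetical container for the architecture figures listed above."""

    # Values stated in the text.
    num_layers: int = 32
    hidden_size: int = 4096
    num_attention_heads: int = 32
    seq_length: int = 4096
    # Assumption: "2 query groups" means 2 shared key/value heads.
    num_kv_heads: int = 2
    # Assumption: cached values are stored in bfloat16 (2 bytes each).
    bytes_per_value: int = 2

    @property
    def head_dim(self) -> int:
        # Each attention head works on an equal slice of the hidden size.
        return self.hidden_size // self.num_attention_heads

    @property
    def query_heads_per_kv_head(self) -> int:
        # Number of query heads that share one key/value head.
        return self.num_attention_heads // self.num_kv_heads

    def kv_cache_bytes(self, num_tokens: int, kv_heads: Optional[int] = None) -> int:
        """Bytes needed to cache keys and values for one sequence."""
        heads = self.num_kv_heads if kv_heads is None else kv_heads
        # Factor 2 accounts for storing both a key and a value per head.
        per_token_per_layer = 2 * heads * self.head_dim * self.bytes_per_value
        return per_token_per_layer * self.num_layers * num_tokens


shape = TeukenShape()
gqa = shape.kv_cache_bytes(shape.seq_length)
mha = shape.kv_cache_bytes(shape.seq_length, kv_heads=shape.num_attention_heads)

print("head dimension:", shape.head_dim)
print("query heads per KV head:", shape.query_heads_per_kv_head)
print("KV cache with 2 KV heads:", gqa // 2**20, "MiB")
print("KV cache with 32 KV heads:", mha // 2**20, "MiB")
```

Under these assumptions each head has dimension 128 and 16 query heads share one key/value head. A full 4096-token sequence then needs about 128 MiB of key/value cache, compared with 2048 MiB if every one of the 32 heads kept its own keys and values, which is the usual motivation for grouped-query attention.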

Core Capabilities

  • Multilingual understanding and generation across 24 EU languages
  • Strong performance on the multilingual benchmarks EU21-ARC, EU21-HeSw (HellaSwag), and EU21-TQA (TruthfulQA)
  • Specialized for European context and values
  • Instruction-following capabilities in multiple languages

Frequently Asked Questions

Q: What makes this model unique?

The model covers all 24 EU languages and is optimized specifically for European content. Unlike English-centric models, it provides more balanced performance across European languages and better reflects European cultural values.

Q: What are the recommended use cases?

The model is specifically designed for research applications involving multilingual tasks across EU languages. It excels in instruction-following scenarios but is not recommended for mathematical computations or coding tasks.
