bert2bert_shared-turkish-summarization

bert2bert_shared-turkish-summarization

mrm8488

Turkish BERT2BERT model specialized in text summarization with 140M parameters, trained on MLSUM dataset. Achieves 29.48% Rouge2 F-measure on test set.

PropertyValue
Parameter Count140M
Model TypeEncoder-Decoder
Base ArchitectureBERT (dbmdz/bert-base-turkish-cased)
Training DatasetMLSUM Turkish
PerformanceRouge2 F-measure: 29.48%

What is bert2bert_shared-turkish-summarization?

This is a specialized Turkish language model designed for text summarization tasks, built upon the BERT architecture with shared encoder-decoder components. Developed by Manuel Romero, it leverages the dbmdz/bert-base-turkish-cased checkpoint and has been fine-tuned on the MLSUM Turkish dataset, which is part of a larger multilingual summarization corpus.

Implementation Details

The model utilizes a BERT2BERT architecture with 140M parameters, implementing both encoding and decoding capabilities for Turkish text summarization. It employs PyTorch framework and supports inference endpoints for production deployment.

  • Achieves 32.41% Rouge2 precision and 28.65% Rouge2 recall on test sets
  • Supports maximum input length of 512 tokens
  • Implements efficient text generation with attention masking
  • Uses safetensors for model weight storage

Core Capabilities

  • Automatic summarization of Turkish news articles
  • Handle long-form text input with truncation
  • Generate concise, coherent summaries
  • Support for batch processing and GPU acceleration

Frequently Asked Questions

Q: What makes this model unique?

This model is specifically optimized for Turkish language summarization, using a shared BERT architecture that reduces model size while maintaining performance. It's one of the few specialized Turkish summarization models available with documented performance metrics.

Q: What are the recommended use cases?

The model is particularly well-suited for summarizing Turkish news articles, research papers, and long-form content. It's ideal for applications requiring automated content summarization in Turkish media monitoring, content aggregation, and information extraction systems.

Related Models

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026