ko-gpt-trinity-1.2B-v0.5

ko-gpt-trinity-1.2B-v0.5

skt

A 1.2B parameter Korean language model by SK Telecom, trained on Ko-DAT dataset for text generation. Excels at Korean text tasks with strong reasoning capabilities.

PropertyValue
Parameter Count1.2 Billion
Model TypeLanguage Model
LicenseCC-BY-NC-SA-4.0
Release DateMay 2021
Training Tokens35 Billion

What is ko-gpt-trinity-1.2B-v0.5?

Ko-GPT-Trinity 1.2B is an advanced Korean language model developed by SK Telecom, implementing a GPT-3-style architecture with 1.2 billion parameters. Trained on the proprietary Ko-DAT dataset, it represents a significant advancement in Korean language processing capabilities.

Implementation Details

The model underwent extensive training over 72,000 steps using a masked autoregressive approach with cross-entropy loss. It was specifically designed to handle Korean language tasks and demonstrates superior performance compared to existing models like KoElectra and KoBERT.

  • Trained on 35 billion tokens from Ko-DAT dataset
  • Implements GPT-3 architecture principles
  • Optimized for Korean language understanding and generation
  • Achieves state-of-the-art performance on multiple benchmarks

Core Capabilities

  • Text Generation: Excels at producing coherent Korean text from prompts
  • Reasoning Tasks: Achieves 71.77% on BoolQ, 68.66% on CoPA, and 78.73% on WiC
  • Language Understanding: Strong performance in various Korean NLP tasks
  • Contextual Processing: Advanced handling of Korean language nuances

Frequently Asked Questions

Q: What makes this model unique?

The model stands out for its specialized focus on Korean language processing, with benchmark results surpassing previous models like KoElectra and KoBERT. Its large-scale training on Ko-DAT makes it particularly effective for Korean language tasks.

Q: What are the recommended use cases?

The model is best suited for Korean text generation, classification, searching, and summarization tasks. However, users should be aware of its limitations regarding non-Korean languages and potential biases in generated content.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026