glm-10b-chinese

glm-10b-chinese

THUDM

A 10B parameter Chinese language model trained on WuDaoCorpora, featuring 48 transformer layers and specialized in autoregressive blank filling tasks.

PropertyValue
Architecture48 Transformer Layers
Hidden Size4096
Attention Heads64 per layer
Research PaperView Paper
Training DatasetWuDaoCorpora

What is glm-10b-chinese?

GLM-10B-Chinese is a sophisticated General Language Model specifically designed for Chinese language processing. Developed by THUDM, it represents a significant advancement in autoregressive blank-filling language models, trained on the comprehensive WuDaoCorpora dataset. The model employs a unique approach to language understanding and generation through its innovative mask token system.

Implementation Details

The model architecture consists of 48 transformer layers, each featuring 4096 hidden dimensions and 64 attention heads. It implements three distinct mask tokens: [MASK] for short blank filling, [sMASK] for sentence filling, and [gMASK] for left-to-right generation. The model can be easily integrated using the Transformers library and supports both CPU and GPU execution, with recommended half-precision (FP16) for optimal performance.

  • Comprehensive transformer architecture with 48 layers
  • Multiple masking strategies for different tasks
  • PyTorch-based implementation
  • Supports both Chinese understanding and generation

Core Capabilities

  • Autoregressive blank filling
  • Natural language understanding
  • Sequence-to-sequence tasks
  • Language modeling
  • Multi-task text generation

Frequently Asked Questions

Q: What makes this model unique?

The model's distinctive feature is its autoregressive blank-filling objective, combined with its massive scale (10B parameters) and specialized training for Chinese language processing. The three-type mask token system allows for versatile text generation and understanding tasks.

Q: What are the recommended use cases?

The model is particularly well-suited for Chinese language processing tasks including text completion, sentence generation, and natural language understanding. It excels in scenarios requiring context-aware text generation and blank filling in Chinese text.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026