codegeex2-6b

Maintained By
THUDM

CodeGeeX2-6B

Property: Value
Authors: THUDM
Paper: CodeGeeX: A Pre-Trained Model for Code Generation
Training Data: 600B code tokens
License: Apache-2.0 (code), Custom License (weights)

What is CodeGeeX2-6B?

CodeGeeX2-6B is a second-generation multilingual code generation model built on the ChatGLM2 architecture. It outperforms its predecessor, the first-generation CodeGeeX, while using fewer parameters. The model was further pre-trained on 600B tokens of code data and can understand and generate code in more than 100 programming languages.

Implementation Details

The model is built on the ChatGLM2-6B architecture and adds several technical improvements, including a maximum sequence length of 8192 tokens and support for INT4 quantization. When quantized, it runs in about 6GB of GPU memory, which makes local deployment practical.
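The 6GB figure can be sanity-checked with a back-of-the-envelope calculation. The sketch below estimates the memory needed for the weights alone at different precisions. It assumes "6B" means exactly 6 billion parameters and that every weight is stored at the stated bit width; `weight_memory_gb` and `NUM_PARAMS` are illustrative names, not part of any CodeGeeX tooling. The result is a lower bound: real usage also has to cover activations, the attention cache for long contexts, and any layers left unquantized.

```python
# Bits used to store one weight at each precision.
BITS_PER_WEIGHT = {"fp16": 16, "int8": 8, "int4": 4}


def weight_memory_gb(num_params: float, precision: str) -> float:
    """Return the weights-only footprint in GB (10^9 bytes)."""
    return num_params * BITS_PER_WEIGHT[precision] / 8 / 1e9


# Assumption: "6B" is taken as exactly 6 billion parameters.
NUM_PARAMS = 6e9

for precision in BITS_PER_WEIGHT:
    size = weight_memory_gb(NUM_PARAMS, precision)
    print(f"{precision}: ~{size:.1f} GB for weights alone")
```

Under these assumptions the INT4 weights take roughly 3 GB, which leaves headroom within a 6GB budget for runtime overhead.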

  • Supports both Chinese and English inputs
  • Implements context-aware code completion
  • Features cross-file completion capabilities
  • Provides integration with popular IDEs (VS Code and JetBrains)

Core Capabilities

  • Improves on the first-generation CodeGeeX across multiple languages on the HumanEval-X benchmark (Python +57%, C++ +71%, Java +54%, JavaScript +83%, Go +56%, Rust +321%)
  • Achieves a 35.9% Pass@1 rate on Python tasks, surpassing the larger StarCoder-15B
  • Supports interactive AI programming assistance through Ask CodeGeeX
  • Handles code summarization, translation, debugging, and documentation generation
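For readers unfamiliar with the Pass@1 metric quoted above, the following is a minimal sketch of the unbiased pass@k estimator commonly used for HumanEval-style benchmarks. This page does not say how the 35.9% figure was computed, so treating this estimator as the method is an assumption; the function names and the sample numbers in the usage note are hypothetical.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem.

    n: number of samples generated for the problem
    c: number of those samples that pass all unit tests
    k: number of attempts the metric allows
    """
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        # Fewer than k failing samples: any draw of k contains a pass.
        return 1.0
    # 1 minus the probability that all k drawn samples fail.
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average pass@k over problems, given one (n, c) pair per problem."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

For k = 1 the estimator reduces to the fraction of passing samples per problem, so `benchmark_pass_at_k([(10, 3), (10, 0)], k=1)` averages 0.3 and 0.0.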

Frequently Asked Questions

Q: What makes this model unique?

CodeGeeX2-6B stands out for surpassing larger models such as StarCoder-15B on Python with only 6B parameters. Its support for both Chinese and English inputs and for more than 100 programming languages makes it versatile in real-world development scenarios.

Q: What are the recommended use cases?

The model excels in code generation, completion, and translation across multiple programming languages. It's particularly useful for developers seeking AI assistance in IDEs, code documentation generation, and solving programming problems through interactive dialogue in either Chinese or English.
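A code generation call might look like the sketch below. It assumes the checkpoint is available on the Hugging Face Hub as `THUDM/codegeex2-6b`, that it loads through `transformers` with `trust_remote_code=True`, and that a CUDA GPU is present. The leading language-tag comment built by `build_prompt` is an assumed prompt convention, and both helper functions are hypothetical names introduced for illustration.

```python
def build_prompt(language: str, instruction: str, comment_prefix: str = "#") -> str:
    """Build a comment-style prompt with a leading language tag.

    The tag format is an assumed convention, not something this page specifies.
    """
    return f"{comment_prefix} language: {language}\n{comment_prefix} {instruction}\n"


def generate_code(prompt: str, model_id: str = "THUDM/codegeex2-6b",
                  max_length: int = 256) -> str:
    """Load the model and complete the prompt (downloads weights, needs a GPU)."""
    # Imported lazily so build_prompt works without transformers installed.
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
    model = AutoModel.from_pretrained(model_id, trust_remote_code=True)
    model = model.half().cuda().eval()

    inputs = tokenizer.encode(prompt, return_tensors="pt").to(model.device)
    outputs = model.generate(inputs, max_length=max_length)
    return tokenizer.decode(outputs[0])
```

A typical call would be `generate_code(build_prompt("Python", "write a bubble sort function"))`. For languages with a different comment syntax, pass a matching `comment_prefix`, for example `"//"` for C++.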
