OlympicCoder-7B

Maintained By
open-r1

OlympicCoder-7B

PropertyValue
Parameter Count7 Billion
Base ModelQwen2.5-Coder-7B-Instruct
LicenseApache-2.0
Primary LanguageEnglish
GitHub Repositoryhttps://github.com/huggingface/open-r1

What is OlympicCoder-7B?

OlympicCoder-7B is a specialized code generation model designed specifically for competitive programming tasks. It's a 7B parameter model fine-tuned on a carefully decontaminated version of the Codeforces dataset, with particular emphasis on handling complex algorithmic challenges similar to those found in programming olympiads.

Implementation Details

The model builds upon the Qwen2.5-Coder-7B-Instruct architecture and was trained using DeepSpeed Zero-3 optimization across 8 devices. The training process utilized careful hyperparameter tuning, including a learning rate of 4.0e-5 and a cosine learning rate scheduler with warmup.

  • Batch size: 16 (2 per device with 8 gradient accumulation steps)
  • Training duration: 10 epochs
  • Optimization: Adam optimizer with betas=(0.9,0.999)
  • Specialized chat template with token implementation

Core Capabilities

  • Strong performance on IOI'2024 benchmark problems
  • Effective solution generation for LiveCodeBench challenges
  • Primary focus on C++ programming solutions
  • Integrated chain-of-thought reasoning through token mechanism
  • Support for both competitive programming and algorithmic problem-solving

Frequently Asked Questions

Q: What makes this model unique?

OlympicCoder-7B stands out for its specialized focus on competitive programming challenges and its performance on prestigious benchmarks like the International Olympiad in Informatics. The model's unique chat template implementation with the token enables consistent chain-of-thought reasoning.

Q: What are the recommended use cases?

The model is particularly well-suited for competitive programming tasks, algorithmic problem-solving, and generating solutions for complex coding challenges. While primarily optimized for C++ solutions, it can handle Python code generation as well, though this might be considered partially out-of-domain.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.