notbad_v1_0_mistral_24b

notbad_v1_0_mistral_24b

notbadai

A 24B parameter reasoning model built on Mistral, specialized in math and Python coding with enhanced performance through RL training and cleaner outputs

PropertyValue
Base ModelMistral-Small-24B-Instruct-2501
Parameters24 Billion
Model URLHuggingFace Repository
AuthorNotBadAI

What is notbad_v1_0_mistral_24b?

Notbad v1.0 Mistral 24B is an advanced language model specifically enhanced for mathematical reasoning and Python coding tasks. Built upon the Mistral-Small-24B-Instruct-2501 architecture, this model has undergone specialized reinforcement learning to improve its performance in technical domains while maintaining concise and clean outputs.

Implementation Details

The model represents a significant advancement in AI reasoning capabilities, achieved through self-improvement rather than knowledge distillation. The implementation leverages reinforcement learning techniques similar to Dr. GRPO, with training conducted on open datasets.

  • Built on Mistral-Small-24B-Instruct-2501 architecture
  • Employs advanced reinforcement learning techniques
  • Focuses on producing shorter, cleaner reasoning outputs
  • Trained with support from Lambda and Deep Infra computing resources

Core Capabilities

  • Strong performance in mathematical reasoning (0.752 on math benchmarks)
  • Exceptional Python coding abilities (0.869 on HumanEval)
  • Competitive MMLU professional scores (0.642)
  • Efficient general knowledge reasoning (0.447 on GPQA)

Frequently Asked Questions

Q: What makes this model unique?

The model's distinctive feature is its specialized training in mathematical reasoning and coding tasks, combined with its ability to produce concise and clean outputs. It achieves this through self-improvement via reinforcement learning rather than traditional knowledge distillation methods.

Q: What are the recommended use cases?

This model is particularly well-suited for mathematical problem-solving, Python programming tasks, and technical reasoning applications. It shows strong performance in professional knowledge domains while maintaining efficient and clean output generation.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026