Arithmo-Mistral-7B

Property	Value
Base Model	Mistral-7B-v0.1
License	Apache 2.0
Training Approach	QLoRA Fine-tuning
Primary Task	Mathematical Reasoning
Language	English

What is Arithmo-Mistral-7B?

Arithmo-Mistral-7B is a specialized mathematical reasoning model that builds upon the Mistral-7B foundation model. Developed by Ashvini Kumar Jindal and Ankur Parikh, it represents a significant advancement in mathematical problem-solving capabilities, outperforming many larger models including 13B parameter versions. The model excels in both Chain-of-Thought (CoT) reasoning and Program-of-Thought (PoT) approaches, achieving impressive scores of 74.7% on GSM8K and 25.3% on MATH benchmarks.

Implementation Details

The model was fine-tuned using QLoRA on a single RTX 4090 GPU, making it an efficient and accessible implementation. It supports two primary interaction modes: Zero-Shot Chain-of-Thought for generating reasoning steps and answers, and Zero-Shot Program-of-Thought for generating executable Python code that solves mathematical problems.

Trained using QLoRA fine-tuning technique
Implements both CoT and PoT reasoning approaches
Optimized for single GPU training and inference
Compatible with text-generation-inference systems

Core Capabilities

Solve complex mathematical word problems with detailed reasoning
Generate executable Python code for mathematical solutions
Process both direct questions and step-by-step reasoning tasks
Achieve state-of-the-art performance among 7B parameter models
Handle various mathematical reasoning tasks including arithmetic, algebra, and word problems

Frequently Asked Questions

Q: What makes this model unique?

The model stands out for its ability to match or exceed the performance of much larger models while maintaining efficiency with a 7B parameter architecture. It's uniquely capable of both reasoning through problems and generating executable code solutions.

Q: What are the recommended use cases?

The model is ideal for educational applications, automated math tutoring, problem-solving assistance, and mathematical reasoning tasks. It can be used both for explaining solutions step-by-step and for generating programmatic solutions.