base-7b-v0.2

Maintained By
internistai

Internist.ai base-7b-v0.2

PropertyValue
Parameter Count7.24B
Base ModelMistral-7B-v0.1
Context Length4096 tokens
LicenseApache 2.0
Training Data2.3B tokens (Medical + General Domain)
PaperLink to Research Paper

What is base-7b-v0.2?

Internist.ai 7b represents a breakthrough in medical AI, being the first 7B parameter model to achieve above 60% pass threshold on MedQA (USMLE). Developed by medical professionals at UCLouvain and Cliniques Universitaires Saint-Luc, it demonstrates the effectiveness of a physician-in-the-loop approach to medical language model development.

Implementation Details

The model was trained using Axolotl on 4 NVIDIA A100 80GB GPUs for 450 GPU hours. It incorporates advanced techniques including FlashAttention and NEFTune, with a training corpus of 2.3B tokens carefully curated from medical guidelines, textbooks, and general domain knowledge.

  • Trained using BF16 precision with cosine learning rate scheduling
  • Utilizes sample packing and 4096 token sequence length
  • Implements the Alpaca chat template format
  • Incorporates both medical and general domain capabilities

Core Capabilities

  • Achieved 60.5% accuracy on MedQA (USMLE)
  • Scores 79.4% on PubMedQA benchmarks
  • Demonstrates strong performance across MMLU medical categories
  • Maintains general domain competency while excelling in medical tasks

Frequently Asked Questions

Q: What makes this model unique?

This is the first 7B parameter model to achieve a passing score on the USMLE benchmark, demonstrating that carefully curated training data and physician involvement can create more effective medical AI models than larger, less focused approaches.

Q: What are the recommended use cases?

The model is designed for medical professionals as an assistant in clinical decision support and documentation. It requires additional task-specific training and safety evaluation before deployment in real-world settings, and is not recommended for use by non-medical professionals.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.