DISC-MedLLM

Maintained By
Flmc

DISC-MedLLM

PropertyValue
Base ModelBaichuan-13B-Base
LanguageChinese
LicenseApache 2.0
Research PaperarXiv:2308.14346

What is DISC-MedLLM?

DISC-MedLLM is a specialized medical domain large language model developed by Fudan-DISC lab, designed specifically for conversational healthcare scenarios. Built upon the Baichuan-13B architecture, it has been trained on a comprehensive dataset of over 470,000 medical examples to bridge the gap between general language models and real-world medical consultations.

Implementation Details

The model leverages a goal-oriented training strategy using the DISC-Med-SFT dataset, which combines real-world conversations, knowledge graph-derived QA pairs, and human preference-aligned data. The implementation utilizes PyTorch and Transformers frameworks, supporting both full-parameter and quantized inference options.

  • Training dataset includes 420k real-world conversations, 50k knowledge graph QA pairs, and 2k human preference examples
  • Implements a specialized dialogue format with specific user and assistant tokens (195 and 196)
  • Supports both full-parameter and quantized (int8, int4) deployment options

Core Capabilities

  • Knowledge-intensive medical consultations and treatment inquiries
  • Multi-turn medical dialogue support
  • Human preference alignment in responses
  • Integration with medical knowledge graphs
  • Reliable and context-aware medical information delivery

Frequently Asked Questions

Q: What makes this model unique?

DISC-MedLLM stands out through its specialized focus on Chinese medical consultations, comprehensive training data incorporating real doctor-patient dialogues, and integration with medical knowledge graphs. Its goal-oriented training strategy ensures high-quality health support services while maintaining alignment with human preferences.

Q: What are the recommended use cases?

The model is specifically designed for research and testing in medical consultation scenarios, including patient inquiries, treatment discussions, and health support services. However, it should not be used as a replacement for professional medical advice, and all outputs should be critically assessed.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.