dolphin-2_6-phi-2

Maintained By
cognitivecomputations

Dolphin-2.6-Phi-2

PropertyValue
Parameter Count2.78B
LicenseMIT
Tensor TypeFP16
Training Time2 days on 4x A100s
Training MethodqLoRA and Axolotl

What is dolphin-2_6-phi-2?

Dolphin-2.6-Phi-2 is an advanced language model based on Microsoft's Phi-2 architecture, specifically designed to be uncensored and highly compliant while maintaining strong performance across various tasks. This iteration includes significant improvements in training configuration and carefully curated datasets for enhanced quality and capabilities.

Implementation Details

The model utilizes the ChatML prompt format and was trained for 3 epochs using qLoRA and Axolotl framework. It incorporates seven diverse datasets including Dolphin, Airoboros, Magicoder, and Capybara, creating a well-rounded knowledge base for various applications.

  • Benchmark Performance: 61.7% average across major metrics
  • Enhanced training configuration for improved quality
  • Integrated empathy data from Samantha-based datasets
  • Replaced certain datasets with Capybara for better performance

Core Capabilities

  • Strong performance in reasoning tasks (ARC: 59.81%)
  • Excellent common sense understanding (HellaSwag: 74.65%)
  • Robust mathematical reasoning (GSM8K: 58.07%)
  • Enhanced empathy and conversational abilities
  • Compliance with complex instructions

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its uncensored nature combined with high compliance, making it suitable for research and controlled environments. It's specifically designed to be adaptable to custom alignment layers while maintaining strong performance across various benchmarks.

Q: What are the recommended use cases?

The model excels in conversational AI, code generation, and general instruction-following tasks. However, due to its uncensored nature, it requires implementation of appropriate alignment layers before deployment in production environments.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.