oasst-sft-7-llama-30b-xor

Maintained By
OpenAssistant

OpenAssistant LLaMA 30B SFT 7

PropertyValue
Base ModelLLaMA 30B
LicenseOther
Research PaperOASST Dataset Paper
Training ApproachSupervised Fine-Tuning

What is oasst-sft-7-llama-30b-xor?

The oasst-sft-7-llama-30b-xor is a sophisticated language model based on Meta's LLaMA 30B architecture, fine-tuned by OpenAssistant using supervised learning techniques. Due to LLaMA's licensing restrictions, the model is distributed using an innovative XOR weights approach, requiring users to combine it with the original LLaMA weights.

Implementation Details

The model implementation requires a specific process using XOR decoding, with strict version requirements for dependencies including Python 3.10, PyTorch 1.13.1, and specific versions of transformers and other libraries. The model was trained using a combination of datasets including OASST export, Vicuna, Dolly15k, and others, with specific configuration parameters such as FP16 dtype and flash attention enabled.

  • Utilizes gradient checkpointing and custom sampling
  • Implements flash attention for improved performance
  • Trained with a learning rate of 1e-5 and zero weight decay
  • Maximum sequence length of 2048 tokens

Core Capabilities

  • Multilingual support across 20 languages
  • Handles various task types including instruction following
  • Optimized for both general conversation and specialized tasks
  • Supports code generation and mathematical reasoning

Frequently Asked Questions

Q: What makes this model unique?

This model uniquely combines the powerful LLaMA 30B architecture with OpenAssistant's supervised fine-tuning approach, distributed through an innovative XOR weights system to comply with licensing requirements.

Q: What are the recommended use cases?

The model is suitable for multilingual applications, instruction following, code generation, and mathematical problem-solving. It's particularly effective for tasks requiring understanding across multiple languages and domains.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.