palmyra-base

Maintained By
Writer

Palmyra Base

PropertyValue
Parameter Count5 Billion
Model TypeCausal Language Model
ArchitectureTransformer Decoder
LicenseApache 2.0
LanguageEnglish

What is palmyra-base?

Palmyra Base is a powerful 5B parameter language model developed by Writer, designed specifically for English text generation. Built on the transformer decoder architecture, it follows similar principles to GPT-3 and excels in various natural language processing tasks. The model was trained on Writer's custom dataset using a causal language modeling objective, making it particularly effective for text generation tasks.

Implementation Details

The model can be implemented using the Hugging Face Transformers library with AutoModelForCausalLM. It requires PyTorch and operates optimally with float16 precision on CUDA-enabled devices. The model uses a custom tokenizer that should be initialized with use_fast=False for optimal performance.

  • Transformer decoder architecture optimized for English language
  • Pre-trained using causal language modeling objective
  • Supports both inference and fine-tuning capabilities
  • Integrates seamlessly with the Hugging Face ecosystem

Core Capabilities

  • Text generation and completion
  • Sentiment classification
  • Summarization tasks
  • Strong performance on SuperGLUE benchmark tasks
  • Efficient processing with float16 precision

Frequently Asked Questions

Q: What makes this model unique?

Palmyra Base stands out for its balance of power and speed, featuring 5B parameters optimized for English language tasks. It demonstrates particularly strong performance in sentiment classification and summarization, while maintaining efficient processing capabilities.

Q: What are the recommended use cases?

The model excels in text generation tasks, sentiment analysis, and summarization. However, it's important to note that human oversight is recommended for output curation, and the model should not be relied upon for factually critical applications.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.