palmyra-base

palmyra-base

Writer

Palmyra-base is a 5B parameter transformer-based language model optimized for English text generation, featuring high performance and versatility in tasks like sentiment analysis and summarization.

PropertyValue
Parameter Count5 Billion
Model TypeCausal Language Model
ArchitectureTransformer Decoder
LicenseApache 2.0
LanguageEnglish

What is palmyra-base?

Palmyra Base is a powerful 5B parameter language model developed by Writer, designed specifically for English text generation. Built on the transformer decoder architecture, it follows similar principles to GPT-3 and excels in various natural language processing tasks. The model was trained on Writer's custom dataset using a causal language modeling objective, making it particularly effective for text generation tasks.

Implementation Details

The model can be implemented using the Hugging Face Transformers library with AutoModelForCausalLM. It requires PyTorch and operates optimally with float16 precision on CUDA-enabled devices. The model uses a custom tokenizer that should be initialized with use_fast=False for optimal performance.

  • Transformer decoder architecture optimized for English language
  • Pre-trained using causal language modeling objective
  • Supports both inference and fine-tuning capabilities
  • Integrates seamlessly with the Hugging Face ecosystem

Core Capabilities

  • Text generation and completion
  • Sentiment classification
  • Summarization tasks
  • Strong performance on SuperGLUE benchmark tasks
  • Efficient processing with float16 precision

Frequently Asked Questions

Q: What makes this model unique?

Palmyra Base stands out for its balance of power and speed, featuring 5B parameters optimized for English language tasks. It demonstrates particularly strong performance in sentiment classification and summarization, while maintaining efficient processing capabilities.

Q: What are the recommended use cases?

The model excels in text generation tasks, sentiment analysis, and summarization. However, it's important to note that human oversight is recommended for output curation, and the model should not be relied upon for factually critical applications.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026