fairseq-dense-13B-Janeway

Maintained By
KoboldAI

fairseq-dense-13B-Janeway

PropertyValue
Parameter Count13 Billion
Model TypeDense Language Model
ArchitectureFairseq MoE Dense
AuthorKoboldAI
PaperArtetxe et al. (2021)
HuggingFaceLink

What is fairseq-dense-13B-Janeway?

fairseq-dense-13B-Janeway is a specialized language model built on Fairseq's Mixture of Experts (MoE) architecture, fine-tuned specifically for creative writing with a focus on science fiction and fantasy genres. The model derives its name from Star Trek's Captain Janeway and has been trained on an extensive collection of over 2,200 ebooks.

Implementation Details

The model is implemented using Fairseq's dense architecture and can be easily integrated using the Hugging Face Transformers library. It features genre-aware training through specialized tags and maintains the same training dataset as its smaller counterpart, GPT-Neo-2.7B-Janeway.

  • Built on Fairseq's MoE dense architecture
  • 13 billion parameters for enhanced performance
  • Genre-specific training with tagged data
  • Optimized for creative text generation

Core Capabilities

  • Creative fiction writing, especially in sci-fi and fantasy genres
  • Genre-aware text generation
  • Natural dialogue generation
  • Context-aware responses
  • Character-consistent narrative generation

Frequently Asked Questions

Q: What makes this model unique?

The model combines the power of a 13B parameter architecture with specialized training on science fiction and fantasy literature, making it particularly adept at generating creative content in these genres. Its genre-aware training approach using explicit tags allows for more controlled and contextually appropriate outputs.

Q: What are the recommended use cases?

The model is best suited for creative writing applications, particularly in science fiction and fantasy contexts. It excels at generating character dialogue, story continuations, and genre-specific content. The model can be particularly useful for writers, game developers, and creative content creators working in these genres.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.