bloomz

Maintained By
bigscience

BLOOMZ

PropertyValue
Parameter Count176B
Licensebigscience-bloom-rail-1.0
PaperCrosslingual Generalization through Multitask Finetuning
Supported Languages46+
Training PrecisionBF16

What is BLOOMZ?

BLOOMZ is a state-of-the-art multilingual language model that represents a significant advancement in cross-lingual AI capabilities. Built upon the BLOOM architecture, it has been specifically fine-tuned on the xP3 dataset to enable instruction following across dozens of languages. This 176B parameter model demonstrates remarkable zero-shot generalization abilities across both languages and tasks.

Implementation Details

The model was trained using a sophisticated setup involving 288 A100 80GB GPUs, utilizing a combination of pipeline, tensor, and data parallelism. The training process involved 498 fine-tuning steps processing 2.09 billion tokens, implemented through Megatron-DeepSpeed and PyTorch frameworks.

  • Architecture based on BLOOM with bfloat16 precision
  • Leverages advanced parallelization techniques (72x pipeline, 1x tensor, 4x data parallel)
  • Implements sophisticated NLP capabilities across 46+ languages

Core Capabilities

  • Multilingual instruction following and task completion
  • Zero-shot cross-lingual generalization
  • Natural language understanding and generation
  • Complex task handling including translation, sentiment analysis, and story generation

Frequently Asked Questions

Q: What makes this model unique?

BLOOMZ stands out for its ability to perform tasks in dozens of languages without specific training in each language, leveraging cross-lingual transfer learning through its fine-tuning on the xP3 dataset.

Q: What are the recommended use cases?

The model excels at tasks expressed in natural language across multiple languages, including translation, sentiment analysis, question answering, and creative writing. It's particularly effective when given clear, well-structured prompts with explicit task instructions.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.