Llama-Primus-Base

trendmicro-ailab

Cybersecurity-focused LLM based on Llama-3.1-8B-Instruct, achieving 15.88% improvement across security benchmarks through specialized training

Property	Value
Base Model	Llama-3.1-8B-Instruct
Developer	Trend Micro AI Lab
License	MIT + Llama 3.1 Community License Agreement
Training Data	Primus-Seed (0.2B) + Primus-FineWeb (2.57B)

What is Llama-Primus-Base?

Llama-Primus-Base is a specialized cybersecurity language model that builds upon Llama-3.1-8B-Instruct through continued pre-training on cybersecurity-specific datasets. It represents a significant advancement in domain-specific AI, achieving a 15.88% improvement in aggregated cybersecurity benchmark scores compared to its base model.

Implementation Details

The model leverages two primary training datasets: Primus-Seed, a manually curated high-quality cybersecurity text dataset, and Primus-FineWeb, which contains 2.57B tokens of filtered cybersecurity content from Common Crawl. This specialized training enables enhanced understanding and processing of cybersecurity-related tasks.

Comprehensive benchmark improvements across CISSP, CTI-Bench, CyberMetric, and SecEval
Specialized architecture optimized for cybersecurity applications
Built on the robust foundation of Llama-3.1-8B-Instruct

Core Capabilities

Enhanced CVSS scoring with reduced Mean Absolute Deviation
Improved accuracy in cyber threat intelligence tasks
Superior performance in security certification exam-style questions
Advanced entity extraction in security contexts

Frequently Asked Questions

Q: What makes this model unique?

The model's uniqueness lies in its specialized cybersecurity training data and significant performance improvements across multiple security benchmarks. It's part of Trend Micro's pioneering family of open cybersecurity language models, designed specifically for security applications.

Q: What are the recommended use cases?

The model is particularly suited for cybersecurity applications including threat intelligence analysis, vulnerability assessment, security certification training, and general security knowledge tasks. It shows notable improvements in CISSP exam-style questions and CTI-bench evaluations.