VulBERTa-MLP-Draper

VulBERTa-MLP-Draper

claudios

A specialized code vulnerability detection model based on RoBERTa, achieving 64.71% accuracy in identifying security flaws in C/C++ code with 125M parameters.

PropertyValue
Parameter Count125M
LicenseMIT
PaperarXiv:2205.12424
MetricsAccuracy: 64.71%, F1: 56.93%, ROC-AUC: 71.02%

What is VulBERTa-MLP-Draper?

VulBERTa-MLP-Draper is an advanced deep learning model designed specifically for detecting security vulnerabilities in source code. Built on RoBERTa architecture with a Multi-Layer Perceptron (MLP) classification head, this model represents a significant advancement in automated code security analysis. It's particularly notable for its ability to process and analyze C/C++ code while maintaining high accuracy in vulnerability detection.

Implementation Details

The model utilizes a custom tokenization pipeline that includes automated comment removal and specialized code processing. It employs a RoBERTa-based architecture enhanced with an MLP classification head, optimized for understanding code syntax and semantics.

  • Custom tokenization requiring libclang integration
  • Pre-trained on real-world code from open-source C/C++ projects
  • Simplified processing pipeline for improved efficiency
  • F32 tensor type for precise computations

Core Capabilities

  • Binary and multi-class vulnerability detection
  • Code syntax and semantics understanding
  • High precision (64.80%) and ROC-AUC score (71.02%)
  • Efficient processing of complex code structures

Frequently Asked Questions

Q: What makes this model unique?

VulBERTa-MLP-Draper stands out for its conceptual simplicity while achieving state-of-the-art performance in vulnerability detection. It requires minimal training data and parameters compared to similar models, yet maintains competitive accuracy levels.

Q: What are the recommended use cases?

The model is ideal for automated security auditing of C/C++ codebases, continuous integration security checks, and research applications in code vulnerability detection. It's particularly effective when integrated into development pipelines for early vulnerability detection.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026