QVQ-72B-Preview

QVQ-72B-Preview

Qwen

QVQ-72B-Preview is a cutting-edge visual reasoning AI model achieving 70.3% on MMMU benchmark, specializing in mathematical and visual understanding tasks

PropertyValue
DeveloperQwen Team
Model Size72B parameters
Model TypeVisual-Language Model
GitHubHuggingFace Repository

What is QVQ-72B-Preview?

QVQ-72B-Preview is an experimental research model developed by the Qwen team, specifically designed to enhance visual reasoning capabilities. The model has demonstrated exceptional performance across various benchmarks, particularly in mathematical and visual understanding tasks.

Implementation Details

The model is implemented using the Transformers library and requires specific utilities for handling visual inputs. It supports various input formats including base64, URLs, and interleaved images, though it's currently limited to single-round dialogues and doesn't support video inputs.

  • Supports flexible visual token processing with customizable pixel ranges
  • Implements advanced visual reasoning capabilities
  • Utilizes the qwen-vl-utils toolkit for convenient handling of visual inputs

Core Capabilities

  • Achieved 70.3% on MMMU benchmark
  • Scored 71.4% on MathVista(mini)
  • Demonstrated 35.9% performance on MathVision(full)
  • Showed 20.4% capability on OlympiadBench

Frequently Asked Questions

Q: What makes this model unique?

QVQ-72B-Preview stands out for its enhanced visual reasoning capabilities and superior performance on mathematical and visual understanding tasks, surpassing many contemporary models in specific benchmarks.

Q: What are the recommended use cases?

The model is best suited for tasks involving complex visual reasoning, mathematical problem-solving, and multidisciplinary understanding. However, users should note its limitations regarding language mixing, recursive reasoning loops, and single-round dialogue constraints.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026