GeometryCrafter

GeometryCrafter

TencentARC

GeometryCrafter is a state-of-the-art model for estimating consistent geometry from open-world videos, running at 1.27-2.49 FPS with advanced point map generation capabilities.

PropertyValue
DeveloperTencentARC
Model TypeGeometry Estimation
Hardware Requirements40GB GPU (full resolution) / 22GB GPU (low resolution)
Repositoryhttps://github.com/TencentARC/GeometryCrafter

What is GeometryCrafter?

GeometryCrafter is an innovative AI model developed by TencentARC that estimates temporally consistent, high-quality point maps from open-world videos. This groundbreaking technology enables advanced 3D/4D reconstruction and depth-based video editing capabilities, making it a valuable tool for computer vision applications.

Implementation Details

The model operates at various performance levels depending on configuration: 1.27FPS for full resolution (1024x576) processing and up to 2.49FPS for low-resolution (384x640) processing. It employs diffusion priors to ensure consistent geometry estimation across video frames.

  • Supports multiple resolution modes for different hardware capabilities
  • Includes both standard and deterministic variant implementations
  • Features comprehensive visualization tools via Viser
  • Provides extensive dataset evaluation capabilities

Core Capabilities

  • High-quality point map generation from videos
  • Temporal consistency maintenance across frames
  • Scale-invariant point map estimation
  • Affine-invariant depth estimation
  • Support for various video resolutions

Frequently Asked Questions

Q: What makes this model unique?

GeometryCrafter stands out for its ability to maintain temporal consistency while processing open-world videos, offering both standard and deterministic variants for different use cases. Its diffusion priors enable robust geometry estimation across diverse scenarios.

Q: What are the recommended use cases?

The model is ideal for 3D/4D reconstruction tasks, depth-based video editing, and geometry estimation in open-world scenarios. It's particularly useful for applications requiring consistent point map generation from video content.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026