LGM

ashawkey

LGM is a fast Text-to-3D and Image-to-3D model using Gaussian Splatting, generating high-resolution 3D content in 5 seconds with 415M parameters.

Property	Value
Parameter Count	415M
License	MIT
Paper	arXiv:2402.05054
Training Data	80K subset of Objaverse

What is LGM?

LGM (Large Multi-View Gaussian Model) generates high-resolution 3D objects from either text descriptions or images. It is built on Gaussian Splatting and produces a result in about 5 seconds, which places it among the faster 3D generation models available.

Implementation Details

The model has 415M parameters, stored as F32 tensors. It was trained on a curated subset of approximately 80,000 3D objects from the Objaverse dataset.

Fast generation time (5 seconds per object)
Support for both text-to-3D and image-to-3D conversion
High-resolution output capability
Built on efficient Gaussian Splatting technology
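
The weight footprint implied by the parameter count and tensor type can be estimated with simple arithmetic. The sketch below is a back-of-the-envelope illustration, not code from the LGM repository: the function name weight_memory_bytes is hypothetical, 415M is treated as exactly 415,000,000 (the published figure is rounded), and the estimate covers the weights only, ignoring activations and runtime overhead.

```python
# Rough estimate of the memory needed to hold the model weights.
# Assumption: every parameter is a 4-byte F32 value, as stated in the card.

def weight_memory_bytes(param_count: int, bytes_per_param: int = 4) -> int:
    """Return the number of bytes needed to store param_count parameters."""
    return param_count * bytes_per_param


PARAMS = 415_000_000  # rounded figure from the card, treated as exact here

f32_bytes = weight_memory_bytes(PARAMS)
print(f"{f32_bytes / 1e9:.2f} GB")     # 1.66 GB
print(f"{f32_bytes / 2**30:.2f} GiB")  # 1.55 GiB
```

Under these assumptions the weights alone occupy roughly 1.66 GB. Passing bytes_per_param=2 gives the corresponding figure for a half-precision copy, although the card only lists F32.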

Core Capabilities

Rapid 3D object generation from text descriptions
High-fidelity 3D model creation from input images
Multi-view consistency in generated objects
Efficient processing, at about 5 seconds per generated object

Frequently Asked Questions

Q: What makes this model unique?

LGM combines speed (about 5 seconds per object) with high-resolution output, and it accepts both text and image inputs. It uses Gaussian Splatting as its underlying technology, which makes it suitable for a range of 3D content creation tasks.

Q: What are the recommended use cases?

The model is ideal for rapid prototyping, content creation for virtual environments, 3D asset generation for games and simulations, and any application requiring quick conversion from text or images to 3D models.
