selecsls42b.in1k
Property | Value |
---|---|
Parameter Count | 32.5M |
GMACs | 3.0 |
Input Size | 224 x 224 |
License | CC-BY-4.0 |
Paper | XNect Paper |
What is selecsls42b.in1k?
selecsls42b.in1k is an efficient convolutional neural network designed for real-time computer vision tasks, particularly multi-person 3D motion capture. Developed as part of the XNect framework, this model balances computational efficiency with accuracy, featuring 32.5M parameters and operating on 224x224 pixel images.
Implementation Details
The model architecture is optimized for real-time performance while maintaining high accuracy. It utilizes selective sparse connections and achieves efficient feature extraction with just 3.0 GMACs (Giga Multiply-Accumulate Operations).
- Efficient backbone architecture with 4.6M activations
- Support for multiple output modes: classification, feature extraction, and embeddings
- Pretrained on ImageNet-1k dataset
- Implements both classification head and feature extraction capabilities
Core Capabilities
- Image Classification with 1000-class ImageNet support
- Feature map extraction with multiple resolution outputs
- Embedding generation for transfer learning tasks
- Real-time performance suitable for motion capture applications
Frequently Asked Questions
Q: What makes this model unique?
The model's architecture is specifically designed for real-time applications while maintaining good accuracy. Its selective sparse connections and efficient parameter usage make it particularly suitable for motion capture and real-time vision tasks.
Q: What are the recommended use cases?
This model is ideal for real-time applications including multi-person 3D motion capture, image classification tasks, and as a backbone for feature extraction in computer vision pipelines. It's particularly useful when deployment requires balancing performance with computational resources.