ese_vovnet19b_dw.ra_in1k

ese_vovnet19b_dw.ra_in1k

timm

Efficient VoVNet variant with 6.55M params, trained on ImageNet-1k using RandAugment. Optimized for energy and GPU computation with depthwise separable convolutions.

PropertyValue
Parameter Count6.55M
Model TypeImage Classification / Feature Backbone
LicenseApache-2.0
Training DataImageNet-1k
Image Size224x224 (train) / 288x288 (test)

What is ese_vovnet19b_dw.ra_in1k?

This is an energy-efficient implementation of the VoVNet architecture, specifically designed for optimal GPU computation. It's a variant that uses depthwise separable convolutions (dw) and was trained using the RandAugment (ra) recipe on ImageNet-1k. The model achieves a balance between computational efficiency and accuracy, requiring only 1.3 GMACs while maintaining strong performance.

Implementation Details

The model implements the VoVNet-v2 architecture with several optimizations. It uses energy-efficient spatial excitation (ESE) modules and features a lightweight design with 6.5M parameters. The architecture maintains 8.2M activations and employs different image sizes for training (224x224) and testing (288x288).

  • Optimized backbone network for real-time object detection
  • Implements depthwise separable convolutions for efficiency
  • Trained using RandAugment augmentation strategy
  • Supports both classification and feature extraction modes

Core Capabilities

  • Image classification with 1000 classes (ImageNet)
  • Feature map extraction at multiple scales
  • Image embedding generation
  • Real-time inference capability

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its energy-efficient design while maintaining competitive performance. The combination of ESE modules, depthwise separable convolutions, and RandAugment training makes it particularly suitable for resource-constrained environments.

Q: What are the recommended use cases?

The model is well-suited for real-time applications requiring efficient image classification or feature extraction. It's particularly appropriate for mobile and edge devices where computational resources are limited but real-time performance is necessary.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026