PickScore_v1

PickScore_v1

yuvalkirstain

PickScore_v1: A 986M parameter CLIP-based model for scoring text-to-image generation quality. Trained on Pick-a-Pic dataset for human preference prediction.

PropertyValue
Parameter Count986M
Model TypeZero-Shot Image Classification
FrameworkPyTorch
PaperPick-a-Pic: An Open Dataset of User Preferences

What is PickScore_v1?

PickScore_v1 is a sophisticated scoring model designed to evaluate the quality of AI-generated images based on their corresponding text prompts. Developed by yuvalkirstain, this model leverages CLIP architecture and has been fine-tuned on the Pick-a-Pic dataset to accurately predict human preferences in text-to-image generation tasks.

Implementation Details

The model is built upon the CLIP-H architecture and implements a dual-encoder approach that processes both image and text inputs. It utilizes safetensors for efficient parameter storage and provides normalized embedding scores through cosine similarity calculations.

  • Employs CLIP-ViT-H-14 as the base architecture
  • Processes images and text through separate encoders
  • Outputs probability scores for image-text alignment
  • Supports batch processing of multiple images

Core Capabilities

  • Human preference prediction for generated images
  • Image quality scoring relative to text prompts
  • Model evaluation and benchmarking
  • Ranking multiple images for a given prompt

Frequently Asked Questions

Q: What makes this model unique?

PickScore_v1 stands out for its specific optimization for human preference prediction in text-to-image generation, trained on a comprehensive dataset of user preferences rather than just generic image-text alignment.

Q: What are the recommended use cases?

The model is ideal for evaluating text-to-image generation models, ranking multiple generated images, automated quality assessment in image generation pipelines, and research applications requiring human preference simulation.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026