ChemGPT-1.2B

ChemGPT-1.2B

ncfrey

ChemGPT-1.2B: A GPT-Neo based molecular modeling transformer, trained on PubChem10M for generative chemistry tasks. Specializes in SMILES/SELFIES molecular representation.

PropertyValue
Model TypeGenerative Transformer (GPT-Neo based)
Training DataPubChem10M Dataset
Primary UseMolecular Modeling
PaperNeural Scaling of Deep Chemical Models

What is ChemGPT-1.2B?

ChemGPT-1.2B is a specialized transformer model designed for generative molecular modeling, built on the GPT-Neo architecture. Developed by researchers including Nathan Frey and colleagues, it represents a significant advancement in computational chemistry, leveraging deep learning for molecular structure generation and analysis.

Implementation Details

The model was trained on PubChem10M, a comprehensive dataset of molecular structures represented as SMILES strings. A key technical aspect is the preprocessing step that converts SMILES to SELFIES using version 1.0.4 of the SELFIES library, ensuring more robust molecular representation.

  • Based on GPT-Neo architecture
  • Trained on PubChem10M dataset
  • Implements SELFIES molecular representation
  • Available through 🤗 Transformers library

Core Capabilities

  • Generation of novel molecular structures
  • Investigation of pre-training effects on chemical modeling
  • Support for both SMILES and SELFIES representations
  • Integration with downstream chemical datasets

Frequently Asked Questions

Q: What makes this model unique?

ChemGPT-1.2B stands out for its specialized focus on molecular modeling and its foundation in the GPT-Neo architecture, making it particularly suited for chemical structure generation and analysis. The use of SELFIES representation adds an extra layer of robustness to molecular generation tasks.

Q: What are the recommended use cases?

The model is primarily intended for research purposes, particularly in investigating the effects of pre-training and fine-tuning on downstream chemical datasets. While it can generate molecules, its main strength lies in academic and research applications rather than production environments.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026