Noam Shazeer

Co-author of the Transformer paper, co-founder of Character.AI, and now co-lead of Gemini at Google DeepMind.

Who is Noam Shazeer?

Noam Shazeer is a computer scientist and entrepreneur best known as a co-author of the Transformer paper, a co-founder of Character.AI, and a co-lead of Google Gemini at Google DeepMind. He is widely associated with some of the most influential ideas in modern LLM architecture. (noamshazeer.com)

Background and career

Shazeer started at Google in 2000, where he worked on early search infrastructure and later contributed to major language modeling and systems research. His public bio also notes that he helped shape ideas such as sparse mixture-of-experts layers, TensorFlow-era tooling, and the Transformer itself. (noamshazeer.com)

He later co-founded Character.AI, then returned to Google and became a technical co-lead on Gemini, which sits inside Google DeepMind’s broader model development effort. That trajectory makes him a useful case study in how research, products, and model infrastructure can reinforce each other over time. (noamshazeer.com)

Key facts about Noam Shazeer include:

  1. Current role: Co-leads Google Gemini and is listed on his public site as VP Engineering at Google. (noamshazeer.com)
  2. Best-known paper: He is an author of “Attention Is All You Need,” the 2017 paper that introduced the Transformer architecture. (research.google)
  3. Startup background: He co-founded Character.AI, a consumer AI chatbot platform. (noamshazeer.com)
  4. Google roots: He joined Google in 2000 and worked on search and ad-related systems early in his career. (noamshazeer.com)
  5. Research impact: His work spans attention, sparsity, and large-scale model systems. (noamshazeer.com)

Notable contributions

  1. Transformer architecture: Co-authored the landmark paper that made attention-based sequence modeling the default for modern LLMs. (research.google)
  2. Multi-head attention and residual design: His public bio says he personally designed key parts of the Transformer implementation. (noamshazeer.com)
  3. Sparse mixture of experts: He helped develop sparsely-gated MoE ideas that later influenced efficient scaling work. (noamshazeer.com)
  4. Character.AI: Co-founded a high-profile consumer chatbot company focused on interactive characters. (noamshazeer.com)
  5. Gemini leadership: Returned to Google to help lead Gemini, one of the company’s flagship model families. (noamshazeer.com)

Why they matter in AI today

  1. He represents core LLM architecture: The Transformer remains the backbone of most frontier models, so his work is directly relevant to builders. (research.google)
  2. He bridges research and product: His career shows how foundational papers can evolve into consumer applications and enterprise model stacks. (noamshazeer.com)
  3. He highlights scaling tradeoffs: His MoE and systems work points to practical ways teams think about cost, speed, and capacity. (noamshazeer.com)
  4. He is relevant to agentic products: Character.AI and Gemini both live in the world of interactive, user-facing model behavior. (noamshazeer.com)
  5. He helps set the agenda for modern AI labs: Leadership roles at Google DeepMind shape what gets built, measured, and shipped next. (deepmind.google)

Where to follow their work

The most direct source is his personal site, which summarizes his role and selected projects. Google Research also hosts the Transformer publication page, and Google DeepMind’s public pages provide context for the lab that now develops Gemini. (noamshazeer.com)

For ongoing updates, his site links to public profiles such as X, LinkedIn, and Google Scholar. Those are the best channels to watch if you want to track new papers, talks, or product leadership updates. (noamshazeer.com)

How PromptLayer connects with Noam Shazeer's work

Shazeer’s career is a reminder that strong model performance depends on both breakthrough ideas and careful iteration. That is where PromptLayer fits in, giving teams a place to manage prompts, track changes, and evaluate model behavior as they build with the same kinds of LLM systems his work helped define.

Ready to try it yourself? Sign up for PromptLayer and start managing your prompts in minutes.

Related Terms

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026