Research Papers
Published
Sep 29, 2024
Taming Overconfident AI: How Adaptive Temperature Scaling Calibrates LLMs
Johnathan Xie|Annie S. Chen|Yoonho Lee|Eric Mitchell|Chelsea Finn
Published
Sep 30, 2024
Can AI Forecast the Future? The ForecastBench Test
Ezra Karger|Houtan Bastani|Chen Yueh-Han|Zachary Jacobs|Danny Halawi|Fred Zhang|Philip E. Tetlock
Published
Sep 30, 2024
Unlocking Financial AI: Building Powerful LLMs Without Training Data
Masanori Hirano|Kentaro Imajo
Published
Sep 30, 2024
The Secret to Editing Multimodal AI: A Unified Approach
Kaihang Pan|Zhaoyu Fan|Juncheng Li|Qifan Yu|Hao Fei|Siliang Tang|Richang Hong|Hanwang Zhang|Qianru Sun
Published
Sep 30, 2024
Why AI Keeps Repeating Itself (And How to Fix It)
Huangyu Dai|Ben Chen|Kaidi Chen|Ying Han|Zihan Liang|Wen Jiang
Published
Sep 30, 2024
Unlocking Accent Adaptation: Supercharging AI Speech Recognition
Bingshen Mu|Kun Wei|Qijie Shao|Yong Xu|Lei Xie
Published
Sep 30, 2024
Unlocking AI’s Potential: How to Assemble LLMs for Superior Performance
Shuhao Chen|Weisen Jiang|Baijiong Lin|James T. Kwok|Yu Zhang
Published
Sep 30, 2024
TransAgent: AI Agents That Translate Code
Zhiqiang Yuan|Weitong Chen|Hanlin Wang|Kai Yu|Xin Peng|Yiling Lou
Published
Sep 30, 2024
How to Evaluate AI Summaries: A New Benchmark
Yuho Lee|Taewon Yun|Jason Cai|Hang Su|Hwanjun Song