Research Papers
Published
Oct 2, 2024
Can We Trust Text Anymore? A New Watermark for AI
Dara Bahri|John Wieting|Dana Alon|Donald Metzler
Published
Oct 2, 2024
Racing Thoughts: Why AI Can't Keep Up With Context
Michael A. Lepori|Michael Mozer|Asma Ghandeharioun
Published
Oct 3, 2024
Can LLMs Teach Themselves to Reason?
Xiangyu Peng|Congying Xia|Xinyi Yang|Caiming Xiong|Chien-Sheng Wu|Chen Xing
Published
Oct 3, 2024
Mind-Controlled Robots? How Brainwaves Could Command Humanoids
Yiqun Duan|Qiang Zhang|Jinzhao Zhou|Jingkai Sun|Xiaowei Jiang|Jiahang Cao|Jiaxu Wang|Yiqian Yang|Wen Zhao|Gang Han|Yijie Guo|Chin-Teng Lin
Published
Oct 3, 2024
Coding With AI: How to Supercharge Research
Tonghe Zhuang|Zhicheng Lin
Published
Oct 3, 2024
Can LLMs Plan? Putting OpenAI's "Strawberry" to the Test
Karthik Valmeekam|Kaya Stechly|Atharva Gundawar|Subbarao Kambhampati
Published
Oct 3, 2024
How Hackers Could Poison Your AI Search Results
Collin Zhang|Tingwei Zhang|Vitaly Shmatikov
Published
Oct 3, 2024
How Human Uncertainty Impacts AI Evaluation
Aparna Elangovan|Jongwoo Ko|Lei Xu|Mahsa Elyasi|Ling Liu|Sravan Bodapati|Dan Roth
Published
Oct 3, 2024
Can AI Judge Code? Introducing CodeJudge
Weixi Tong|Tianyi Zhang