Platform
Prompt Management
Evaluations
Observability
Dataset Management
Prompt Chaining
Docs
Blog
Case Studies
Careers
Contact Us
Log In
Research Papers
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Published
Jun 21, 2024
Can AI Decode Your Genes? A New Benchmark Puts LLMs to the Test
Haoyang Liu|Haohan Wang
Published
Jun 21, 2024
How AI and Student Feedback Create Super-Sticky Mnemonics
Nishant Balepur|Matthew Shu|Alexander Hoyle|Alison Robey|Shi Feng|Seraphina Goldfarb-Tarrant|Jordan Boyd-Graber
Published
Jun 21, 2024
Can AI Learn to Align Itself? SAILing Towards Self-Improving LLMs
Mucong Ding|Souradip Chakraborty|Vibhu Agrawal|Zora Che|Alec Koppel|Mengdi Wang|Amrit Bedi|Furong Huang
Published
Jun 21, 2024
Can AI Learn from Flawed Human Feedback?
Alexander Bukharin|Ilgee Hong|Haoming Jiang|Zichong Li|Qingru Zhang|Zixuan Zhang|Tuo Zhao
Published
Jun 21, 2024
Can We Spot AI-Written Text? Detecting the Digital Author
Kathleen C. Fraser|Hillary Dawkins|Svetlana Kiritchenko
Published
Jun 21, 2024
Breaking the Logic: How to Trick AI Into Ignoring the Rules
Anton Xue|Avishree Khare|Rajeev Alur|Surbhi Goel|Eric Wong
Published
Jun 21, 2024
Lost in Translation: Why AI Still Struggles with Low-Resource Languages
Sara Court|Micha Elsner
Published
Jun 21, 2024
Can LLMs Tell Truth from Fiction? Measuring AI Uncertainty
Roman Vashurin|Ekaterina Fadeeva|Artem Vazhentsev|Lyudmila Rvanova|Akim Tsvigun|Daniil Vasilev|Rui Xing|Abdelrahman Boda Sadallah|Kirill Grishchenkov|Sergey Petrakov|Alexander Panchenko|Timothy Baldwin|Preslav Nakov|Maxim Panov|Artem Shelmanov
Published
Jun 21, 2024
FIRST: Supercharging LLM Search with Single-Token Decoding
Revanth Gangi Reddy|JaeHyeok Doo|Yifei Xu|Md Arafat Sultan|Deevya Swain|Avirup Sil|Heng Ji
1
...
The first platform built for
prompt engineering
Start for free