Research Papers
Published
Aug 3, 2024
Can AI Understand Space? Testing LLMs' Spatial Reasoning
Alexey Tikhonov
Published
Aug 3, 2024
Unlocking Sub-1-Bit LLMs: How Structured Binarization Breaks Barriers
Peijie Dong|Lujun Li|Yuedong Zhong|Dayou Du|Ruibo Fan|Yuhan Chen|Zhenheng Tang|Qiang Wang|Wei Xue|Yike Guo|Xiaowen Chu
Published
Aug 3, 2024
Unlocking Opinions: How AI Decodes Stance in Text
Junxia Ma|Changjiang Wang|Hanwen Xing|Dongming Zhao|Yazhou Zhang
Published
Aug 3, 2024
Can Robots Trust Your Directions? TrustNavGPT and the Science of Doubt
Xingpeng Sun|Yiran Zhang|Xindi Tang|Amrit Singh Bedi|Aniket Bera
Published
Aug 3, 2024
Can AI Detect Hidden Drug Dangers? Meet MALADE
Jihye Choi|Nils Palumbo|Prasad Chalasani|Matthew M. Engelhard|Somesh Jha|Anivarya Kumar|David Page
Published
Aug 3, 2024
Unlocking AI’s Potential: Zero-Shot Tool Retrieval with Re-Invoke
Yanfei Chen|Jinsung Yoon|Devendra Singh Sachan|Qingze Wang|Vincent Cohen-Addad|Mohammadhossein Bateni|Chen-Yu Lee|Tomas Pfister
Published
Aug 4, 2024
Unlocking Automated End-to-End Testing: A Feature-Based Approach
Parsa Alian|Noor Nashid|Mobina Shahbandeh|Taha Shabani|Ali Mesbah
Published
Aug 4, 2024
Unlocking Scientific Secrets: How AI is Revolutionizing Knowledge Extraction
Balaji Muralidharan|Hayden Beadles|Reza Marzban|Kalyan Sashank Mupparaju
Published
Aug 4, 2024
Can AI Diagnose Illness Like a Doctor? A New Benchmark Reveals the Gap
Bowen Wang|Jiuyang Chang|Yiming Qian|Guoxin Chen|Junhao Chen|Zhouqiang Jiang|Jiahao Zhang|Yuta Nakashima|Hajime Nagahara