Research Papers
Published
Jul 23, 2024
Can AI Catch Crypto Bugs? Exploring LLMs for Secure Code
Yifan Xia|Zichen Xie|Peiyu Liu|Kangjie Lu|Yan Liu|Wenhai Wang|Shouling Ji
Published
Jul 23, 2024
The Shared Hallucinations of Large Language Models
Yilun Zhou|Caiming Xiong|Silvio Savarese|Chien-Sheng Wu
Published
Jul 23, 2024
Unlocking Meaning: How AI Masters the Art of Semantic Change
Jader Martins Camboim de Sá|Marcos Da Silveira|Cédric Pruski
Published
Jul 23, 2024
Can AI Fix Itself? Exploring LLM Course Correction
Rongwu Xu|Yishuo Cai|Zhenhong Zhou|Renjie Gu|Haiqin Weng|Yan Liu|Tianwei Zhang|Wei Xu|Han Qiu
Published
Jul 23, 2024
Exposing AI’s Weaknesses: Red Teaming with RedAgent
Huiyu Xu|Wenhui Zhang|Zhibo Wang|Feng Xiao|Rui Zheng|Yunhe Feng|Zhongjie Ba|Kui Ren
Published
Jul 23, 2024
OpenHands: Unleashing AI Agents to Code, Command, and Conquer the Web
Xingyao Wang|Boxuan Li|Yufan Song|Frank F. Xu|Xiangru Tang|Mingchen Zhuge|Jiayi Pan|Yueqi Song|Bowen Li|Jaskirat Singh|Hoang H. Tran|Fuqiang Li|Ren Ma|Mingzhang Zheng|Bill Qian|Yanjun Shao|Niklas Muennighoff|Yizhe Zhang|Binyuan Hui|Junyang Lin|Robert Brennan|Hao Peng|Heng Ji|Graham Neubig
Published
Jul 23, 2024
GPT-4V Jailbroken: How AI Could Leak Your Identity
Yuanwei Wu|Yue Huang|Yixin Liu|Xiang Li|Pan Zhou|Lichao Sun
Published
Jul 23, 2024
Can AI Grade Your Code? TAMIGO and the Future of Teaching
Anishka IIITD|Diksha Sethi|Nipun Gupta|Shikhar Sharma|Srishti Jain|Ujjwal Singhal|Dhruv Kumar
Published
Jul 23, 2024
RAG vs. Long-Context LLMs: A Showdown for AI's Future
Zhuowan Li|Cheng Li|Mingyang Zhang|Qiaozhu Mei|Michael Bendersky