Product
Prompt Management
Evaluations
Observability
Dataset Management
Prompt Chaining
Docs
Blog
Case Studies
Careers
Pricing
Contact Us
Log In
Contact Us
Log In
Research Papers
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Published
Jul 29, 2024
Beyond Benchmarks: Why LLM Evaluation is So Hard
Marco AF Pimentel|Clément Christophe|Tathagata Raha|Prateek Munjal|Praveen K Kanithi|Shadab Khan
Published
Jul 29, 2024
How AI is Revolutionizing Farming
Hongyan Zhu|Shuai Qin|Min Su|Chengzhi Lin|Anjie Li|Junfeng Gao
Published
Jul 29, 2024
Boosting Smaller AI Models for Medical Expertise
Jingwei Zhu|Minghuan Tan|Min Yang|Ruixue Li|Hamid Alinejad-Rokny
Published
Jul 29, 2024
Can AI Write Infinite Tests? This New Benchmark Says So
Marcel Zalmanovici|Orna Raz|Eitan Farchi|Iftach Freund
Published
Jul 29, 2024
Unlocking Scientific Posters: A New AI Dataset for Automated Layout
Shohei Tanaka|Hao Wang|Yoshitaka Ushiku
Published
Jul 29, 2024
Can LLMs Help AI Generalize to Unseen Domains?
Juhwan Choi|Junehyoung Kwon|JungMin Yun|Seunguk Yu|YoungBin Kim
Published
Jul 29, 2024
Merging AI Minds: How Cool-Fusion Combines LLMs Without Training
Cong Liu|Xiaojun Quan|Yan Pan|Liang Lin|Weigang Wu|Xu Chen
Published
Jul 29, 2024
Unlocking AI’s Potential: Self-Reasoning Retrieval for Smarter Language Models
Yuan Xia|Jingbo Zhou|Zhenhui Shi|Jun Chen|Haifeng Huang
Published
Jul 29, 2024
Unlocking Ancient Arabic Texts: The ATHAR Dataset
Mohammed Khalil|Mohammed Sabry
1
...
The first platform built for
prompt engineering
Start for free