Beval - very simple evals
Quick and dirty LLM-based evaluations for your AI product traces.
1
Add datasets
Upload CSV or JSON traces of user conversations with your AI product.
2
Create evals
Define what to evaluate — classify, score, or label each trace using a prompt.
3
Run
Hit run and get results in minutes, powered by LLMs.