
Alpaca Evals

In collaboration with the Alpaca team, we've loaded several submissions from the Alpaca leaderboard into Braintrust. There you can see the aggregate performance of each submission and also drill into individual models to better understand their strengths and weaknesses.

Check out the Alpaca Evals project on Braintrust to explore further. No login is required.
