alonso
About
Blog
Projects
Teaching
Talks
Consulting
Español
Blog
Mostly things I wrote about because I wanted to learn something new.
Categories
All
(1)
LLM
(1)
benchmark
(1)
evals
(1)
function-calling
(1)
structured-generation
(1)
Sign up using this form to receive an email whenever I post new content on my blog.
Structured Generation Benchmark
Testing LLM’s function-calling capabilities.
LLM
structured-generation
benchmark
function-calling
evals
This report originates from the Outlines community’s proposal to find a good dataset for evaluating structured generation. If you want to participate, join our Discord.
Apr 30, 2024
Alonso Astroza
No matching items