ScienceToStartup
State of Evaluation Frameworks
3 papers · avg viability 5.7
Top papers
GenArena: How Can We Achieve Human-Aligned Evaluation for Visual Generation Tasks? (7.0)
Evaluating LLMs When They Do Not Know the Answer: Statistical Evaluation of Mathematical Reasoning via Comparative Signals (5.0)