MiniAppBench Leaderboard
Submit MiniAppBench results and view the leaderboard
MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants
Human-AI interaction is evolving from static text responses to dynamic, interactive applications.
MiniAppBench is the first comprehensive benchmark designed to evaluate principle-driven, interactive application generation. Whereas traditional benchmarks focus on static layouts or algorithmic snippets, MiniAppBench targets MiniApps: HTML-based applications that require both visual rendering and complex interaction logic.