MiniAppBench

MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants

Human-AI interaction is evolving from static text responses to dynamic, interactive applications.

MiniAppBench is the first comprehensive benchmark designed to evaluate principle-driven, interactive application generation. While traditional benchmarks focus on static layouts or algorithmic snippets, MiniAppBench targets MiniApps: HTML-based applications that require both visual rendering and complex interaction logic.
