Conv-FinRe: A Conversational and Longitudinal Benchmark for Utility-Grounded Financial Recommendation Paper • 2602.16990 • Published 9 days ago • 10
Ebisu: Benchmarking Large Language Models in Japanese Finance Paper • 2602.01479 • Published 26 days ago • 17