k2-fsa/OmniVoice
Text-to-Speech • Updated • 234 • 63
Extract invoice details from images
Detect objects in images or videos
Extract text from images in multiple languages
Convert images to LaTeX code
OpenAI's Deep Research, but open
Segment document layouts into text, images, and tables
Analyze and visualize document layouts from images
Analyze scanned documents to detect and label content
Advanced AI-Powered ID Document Recognition Technology