Transform a simple sentence into a detailed JSON-style prompt
Extract a stick-figure pose video and 3D pose JSON from a clip
Generate narrated audio from text or documents with custom voices
Generate detailed captions for any uploaded image
Florence-2-large / Florence-2-base