Multimodal (text + image + video + audio) embedding models aligned with jina-embeddings-v5-text-*. Two sizes, four task variants each.
-
jinaai/jina-embeddings-v5-omni-nano
Sentence Similarity • Updated • 14.4k -
jinaai/jina-embeddings-v5-omni-nano-retrieval
Sentence Similarity • 0.9B • Updated • 54.2k -
jinaai/jina-embeddings-v5-omni-nano-classification
Sentence Similarity • 0.9B • Updated • 18.6k • 1 -
jinaai/jina-embeddings-v5-omni-nano-clustering
Sentence Similarity • 0.9B • Updated • 11k
