Company

jina-embeddings-v2-base-en

Jina AI

jina-embeddings-v2-base-en is an open-source English text embedding model developed by Jina AI, a company seed-funded by Yunqi Capital . Released in 2023 as part of the jina-embeddings-v2 family, it was billed as the world's first open-source vector model supporting 8,000-token context length — matching a capability that at the time was only available from OpenAI's proprietary text-embedding-ada-002 . The base-en variant carries 137 million parameters with 768-dimensional output, pitched for high-precision large tasks, and was distributed via Hugging Face alongside a smaller 33-million-parameter sibling .

The model's architecture emphasized efficiency: its 768-dimension output (and 512 for the small variant) offered lower compute and storage costs than OpenAI's 1536-dimension alternative, while benchmark tests showed it outperforming text-embedding-ada-002 on text classification, retrieval, reranking, and summarization tasks . Jina AI positioned it particularly for retrieval-augmented generation (RAG) pipelines and long-document applications like legal analysis and medical literature review .

Jina AI was acquired by Elastic in October 2025, with founder and former CEO Xiao Han joining as Elastic's Vice President of AI; the Jina AI team and its models, including the embedding line, were integrated into Elastic's search AI infrastructure .

AI-generated — may contain errors, please verify.

jina-embeddings-v2-base-enCompany
Jina AI
渲染中…
Mentioned in 32 articles

Coverage