jina-embeddings-v2-base-en
Jina AI
jina-embeddings-v2-base-en is an open-source English text embedding model developed by Jina AI, a company seed-funded by Yunqi Capital . Released in 2023 as part of the jina-embeddings-v2 family, it was billed as the world's first open-source vector model supporting 8,000-token context length — matching a capability that at the time was only available from OpenAI's proprietary text-embedding-ada-002 . The base-en variant carries 137 million parameters with 768-dimensional output, pitched for high-precision large tasks, and was distributed via Hugging Face alongside a smaller 33-million-parameter sibling .
The model's architecture emphasized efficiency: its 768-dimension output (and 512 for the small variant) offered lower compute and storage costs than OpenAI's 1536-dimension alternative, while benchmark tests showed it outperforming text-embedding-ada-002 on text classification, retrieval, reranking, and summarization tasks . Jina AI positioned it particularly for retrieval-augmented generation (RAG) pipelines and long-document applications like legal analysis and medical literature review .
Jina AI was acquired by Elastic in October 2025, with founder and former CEO Xiao Han joining as Elastic's Vice President of AI; the Jina AI team and its models, including the embedding line, were integrated into Elastic's search AI infrastructure .
AI-generated — may contain errors, please verify.
Coverage
Yunqi Capital Quarterly | Upward, the Consistent Answer
New Growth, New Gains
云启资本·Elastic Acquires Yunqi Capital Seed-Backed Jina AI to Jointly Accelerate Search AI Infrastructure Evolution | Yunqi Capital
Driving Open-Source AI Infrastructure Toward a Global Ecosystem
云启资本·Manycore Tech's Huang Xiaohuang: The "Hammer" and the "Stars" of 14 Years Building a Company | Yunqi Capital Doers
From GPU Rendering to Spatial Intelligence
云启资本·Yunqi Capital Quarterly | Spring Winds Ask Nothing of the Traveler; We Chart Our Own Course
A Closer Look at Yunqi Capital's Recent Moves
云启资本·Deep Dive: The DeepSeek Boom — Who Hit the Jackpot? Who Took the Hit? | Yunqi Capital Attent!on Podcast
DeepSeek's open-source push keeps accelerating, and the chain reactions it set off continue to ripple through the industry. As a major focus of tech entrepreneurship and venture investment in recent years, how will AI's venture narrative shift? What changes are entrepreneurs and investors on the ground actually sensing?
云启资本·Vol.06 2025 AI VC Roundtable: DeepSeek Goes Viral — Who Hit the Jackpot? Who Took the Hit?
云启资本·Yunqi Capital | PingCAP Overtakes TOP Spot, Becomes the Fastest-Growing Vendor in Global Database Management Systems Market
The market is growing at a rate of nearly 100%.
云启资本·Yunqi Capital Spring-Summer Issue | In the Bright Season, Innovation Grows with Wanwu Capital
AGI Practice, Business Breakthroughs, and Reflections This Spring and Summer
云启资本·Yunqi Capital Year-End Roundup | Annual Deep-Dive Report: AI + Open Source + Commercialization = ?
Three Years Running: The New Trends in Open Source + AI
云启资本·Interview with a Pioneer of Open-Source Commercialization: How Programmers Question Sales, Understand Sales, and Become Sales
Now is the best time to pay attention to "open-source commercialization."
云启资本·Vol.02 Conversations with People Making Money from Open Source: AGI Is Coming — Is It Too Late to "Outward-Compete"?
云启资本·What Are the 5 Critical Questions for Making Open Source Commercialization Work? | Yunqi Capital × China Open Source Conference
AGI and Globalization Are Changing Open Source — Here's How
云启资本·Jina AI Launches World's First Open-Source 8K Vector Model | Cloud News · Open Source
Currently, only two AI technology companies — OpenAI and Jina AI — have released 8k embedding models.
云启资本·First "Global Open Source Contribution Ranking" Released, Three PingCAP Co-Founders Make the List | Cloud News · Open Source
Ranked solely by contribution.
云启资本·"Inside Google": During 10 Hours of Deep Conversation Across Silicon Valley and Shanghai, What New AGI Trends Did We Discuss? | FutureScope
Six Emerging Trends in AGI from a Global Perspective
云启资本·Exploring Forward in a World with Open-Source Communities: There Will Be Challenges, But It Will Be Worth It | Riding the AGI+ Wave
More, Bigger, Better, More Valuable Open-Source Communities
云启资本·Yunqi AGI × WAIC2023 | Three New Opportunities for Large Model Deployment ## 01 In 2023, large language models have become the hottest topic in tech and investment circles. At this year's World Artificial Intelligence Conference (WAIC), discussions about large models were everywhere. From B2B to B2C, from infrastructure to applications, from text to multimodal — the entire ecosystem is being reshaped. But beneath the excitement, a critical question looms: where exactly are the real opportunities for large model deployment? At the WAIC Yunqi Capital AGI Forum, we invited entrepreneurs and investors at the forefront of large model development to share their perspectives. Through in-depth conversations, we identified three emerging opportunities that deserve attention. ## 02 Opportunity One: Vertical Industry Models The consensus among panelists was clear: general-purpose large models are important, but vertical industry models represent the more immediate commercial opportunity. Why? Because general models, while capable of many tasks, often lack the depth required for professional scenarios. In fields like healthcare, finance, and legal services, domain expertise — specialized knowledge, compliance requirements, workflow integration — creates significant barriers to entry. As one entrepreneur noted: "A general model might score 60 points on a
And, Yunqi Capital's AGI+ portfolio has added these new products —
云启资本·AGI in Practice, Innovation Breakthroughs, and Frontier Insights This Spring | Yunqi Capital Quarterly · Spring Issue
New thinking and new practice in AI from Yunqi Capital and its portfolio companies.
云启资本·High Tech, High Growth! 16 Yunqi Capital Portfolio Companies Earn 21 Accolades | Yunqi Capital News
Meeting Challenges Together, Making the Future's Trajectory More Within Reach
云启资本·Yunqi's Chen Yu: ChatGPT's Qualitative Leap Is More Than a Technical Shift | The Paper X Yunqi Capital ChatGPT Special
When it comes to large language model R&D, startups have more flexibility than big companies.
云启资本·"Jina AI" Xiao Han: ChatGPT Is Disrupting Search, SEO Will Become Meaningless | Yunqi Capital ChatGPT Special
SEO Is Dead, Long Live LLMO
云启资本·"PingCAP" Huang Dongxu: A Deep Dive into New Database Development Trends in 10,000 Words | Yunqi Tech π
"Before ChatGPT came out, I always thought there was an element of hype around AI."
云启资本·Open Source Annual Report! Yunqi Capital Partners Again with Open Source Society to Release *2022 China Open Source Annual Report* | Yunqi News
Open Source Is Entering Its Next Phase of Development
云启资本·What Are We Actually Reviewing When We Talk About "Reviewing"? | Yunjian · Zhengyuan
After the "Cloud Sight" Series Concludes
云启资本·China's First: PingCAP's Distributed Database TiDB Passes CAICT HTAP Database Basic Capability Assessment | Yunqi Tech π
Leading the frontier of databases
云启资本·AIGC Pioneer "Jina AI": Decoding the Paradigm Shift in Multimodal AI | Yunqi Capital
The Frontier of a New AI Era
云启资本·"Zilliz": Eight Predictions for the Vector Database Industry in 2023 | Yunqi Capital
The Future Outlook for Vector Databases
云启资本·The Only Chinese Company Named a "Strong Performer" by Forrester, PingCAP Leads HTAP Frontier Trends | Yunqi Capital
Leading the "Invisible Technology" Era
云启资本·Go Discover, Go Challenge! PingCAP DevCon 2022 Goes Live Tomorrow | Yunqi Capital X PingCAP
From Cloud Native to Serverless: Reflections and Takeaways
云启资本·




























