jina-embeddings-v2

updated 16 days ago

The v2 family of Jina Embeddings models encodes long documents, supporting sequence lengths of up to 8192 tokens (8k).

  • Jina Embeddings 2: 8192-Token General-Purpose Text Embeddings for Long Documents

    Paper • 2310.19923 • Published Oct 30, 2023 • 14

  • Multi-Task Contrastive Learning for 8192-Token Bilingual Text Embeddings

    Paper • 2402.17016 • Published Feb 26, 2024 • 5

  • jinaai/jina-embeddings-v2-base-en

    Feature Extraction • 0.1B • Updated Jan 6 • 185k • 726

  • jinaai/jina-embeddings-v2-base-zh

    Feature Extraction • 0.2B • Updated Jan 6 • 21k • 243

  • jinaai/jina-embeddings-v2-small-en

    Feature Extraction • 0.0B • Updated Jan 6 • 688k • 137

  • jinaai/jina-embeddings-v2-base-de

    Feature Extraction • 0.2B • Updated Jan 6 • 24.5k • 80

  • jinaai/jina-embeddings-v2-base-es

    Feature Extraction • 0.2B • Updated Jan 6 • 16.7k • 34

  • jinaai/jina-embeddings-v2-base-code

    Feature Extraction • 0.2B • Updated Jan 6 • 88.3k • 115
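
As a rough illustration of how one of the checkpoints listed above might be used for long-document encoding, here is a minimal sketch. It assumes the sentence-transformers package is installed and that the checkpoint must be loaded with trust_remote_code=True; neither detail is stated on this page. The helper names load_encoder and cosine_similarity are hypothetical and exist only for this example.

```python
import math
from typing import Sequence

# One of the checkpoints listed in this collection.
MODEL_ID = "jinaai/jina-embeddings-v2-base-en"
# Sequence length stated in the collection description (8k tokens).
MAX_TOKENS = 8192


def load_encoder(model_id: str = MODEL_ID):
    """Load a checkpoint via sentence-transformers (assumed installed).

    The import is local so the similarity helper below works without it.
    trust_remote_code=True is an assumption about how these checkpoints
    are packaged, not something this page confirms.
    """
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(model_id, trust_remote_code=True)
    model.max_seq_length = MAX_TOKENS  # inputs longer than this are truncated
    return model


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two equal-length, non-zero vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return float(dot / (norm_a * norm_b))


# Hypothetical usage (downloads the model, so it is not run here):
#   model = load_encoder()
#   doc_vec, query_vec = model.encode([long_document, query])
#   score = cosine_similarity(doc_vec, query_vec)
```

Setting max_seq_length to a smaller value is a common way to trade context for memory and speed when documents are short; the value 8192 here simply mirrors the limit given in the description.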