`embed-english-v3.0`, `embed-english-light-v3.0`, and `embed-multilingual-v3.0`, as well as `BAAI/bge-small-en-v1.5` and `mixedbread-ai/mxbai-embed-large-v1`.
You can request that a different model be used. The following models are currently supported:
Model | Dimensions | Description | License | Size (GB) |
---|---|---|---|---|
cohere/embed-english-v3.0 | 1024 | A model that allows for text to be classified or turned into embeddings. English only. | Commercial | - |
cohere/embed-english-light-v3.0 | 384 | A smaller, faster version of embed-english-v3.0. Almost as capable, but a lot faster. English only. | Commercial | - |
cohere/embed-multilingual-v3.0 | 1024 | Provides multilingual classification and embedding support. See supported languages here. | Commercial | - |
cohere/embed-multilingual-light-v3.0 | 384 | A smaller, faster version of embed-multilingual-v3.0. Almost as capable, but a lot faster. Supports multiple languages. | Commercial | - |
BAAI/bge-small-en-v1.5 | 384 | Text embeddings, Unimodal (text), English, 512… | MIT | 0.067 |
BAAI/bge-small-zh-v1.5 | 512 | Text embeddings, Unimodal (text), Chinese, 512… | MIT | 0.090 |
snowflake/snowflake-arctic-embed-xs | 384 | Text embeddings, Unimodal (text), English, 512… | Apache-2.0 | 0.090 |
sentence-transformers/all-MiniLM-L6-v2 | 384 | Text embeddings, Unimodal (text), English, 256… | Apache-2.0 | 0.090 |
jinaai/jina-embeddings-v2-small-en | 512 | Text embeddings, Unimodal (text), English, 819… | Apache-2.0 | 0.120 |
BAAI/bge-small-en | 384 | Text embeddings, Unimodal (text), English, 512… | MIT | 0.130 |
snowflake/snowflake-arctic-embed-s | 384 | Text embeddings, Unimodal (text), English, 512… | Apache-2.0 | 0.130 |
nomic-ai/nomic-embed-text-v1.5-Q | 768 | Text embeddings, Multimodal (text, image), Eng… | Apache-2.0 | 0.130 |
BAAI/bge-base-en-v1.5 | 768 | Text embeddings, Unimodal (text), English, 512… | MIT | 0.210 |
sentence-transformers/paraphrase-multilingual-… | 384 | Text embeddings, Unimodal (text), Multilingual… | Apache-2.0 | 0.220 |
Qdrant/clip-ViT-B-32-text | 512 | Text embeddings, Multimodal (text&image), Engl… | MIT | 0.250 |
jinaai/jina-embeddings-v2-base-de | 768 | Text embeddings, Unimodal (text), Multilingual… | Apache-2.0 | 0.320 |
BAAI/bge-base-en | 768 | Text embeddings, Unimodal (text), English, 512… | MIT | 0.420 |
snowflake/snowflake-arctic-embed-m | 768 | Text embeddings, Unimodal (text), English, 512… | Apache-2.0 | 0.430 |
nomic-ai/nomic-embed-text-v1.5 | 768 | Text embeddings, Multimodal (text, image), Eng… | Apache-2.0 | 0.520 |
jinaai/jina-embeddings-v2-base-en | 768 | Text embeddings, Unimodal (text), English, 819… | Apache-2.0 | 0.520 |
nomic-ai/nomic-embed-text-v1 | 768 | Text embeddings, Multimodal (text, image), Eng… | Apache-2.0 | 0.520 |
snowflake/snowflake-arctic-embed-m-long | 768 | Text embeddings, Unimodal (text), English, 204… | Apache-2.0 | 0.540 |
mixedbread-ai/mxbai-embed-large-v1 | 1024 | Text embeddings, Unimodal (text), English, 512… | Apache-2.0 | 0.640 |
jinaai/jina-embeddings-v2-base-code | 768 | Text embeddings, Unimodal (text), Multilingual… | Apache-2.0 | 0.640 |
sentence-transformers/paraphrase-multilingual-… | 768 | Text embeddings, Unimodal (text), Multilingual… | Apache-2.0 | 1.000 |
snowflake/snowflake-arctic-embed-l | 1024 | Text embeddings, Unimodal (text), English, 512… | Apache-2.0 | 1.020 |
thenlper/gte-large | 1024 | Text embeddings, Unimodal (text), English, 512… | MIT | 1.200 |
BAAI/bge-large-en-v1.5 | 1024 | Text embeddings, Unimodal (text), English, 512… | MIT | 1.200 |
intfloat/multilingual-e5-large | 1024 | Text embeddings, Unimodal (text), Multilingual… | MIT | 2.240 |
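
As a rough illustration of how the table can be used to shortlist a model, the sketch below copies a handful of rows by hand and filters them by license and download size. The `EmbeddingModel` class and the `candidates` and `raw_vector_bytes` helpers are hypothetical names introduced here, not part of any library. The storage estimate assumes each dimension is stored as a 4-byte float32 and ignores index overhead, which may not match how a given database stores vectors.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Set


@dataclass(frozen=True)
class EmbeddingModel:
    """One row of the table (hypothetical helper type, not a library class)."""
    name: str
    dimensions: int
    license: str
    size_gb: Optional[float]  # None where the table shows "-"


# A hand-copied subset of the table above; extend it as needed.
MODELS = [
    EmbeddingModel("cohere/embed-english-v3.0", 1024, "Commercial", None),
    EmbeddingModel("cohere/embed-english-light-v3.0", 384, "Commercial", None),
    EmbeddingModel("BAAI/bge-small-en-v1.5", 384, "MIT", 0.067),
    EmbeddingModel("sentence-transformers/all-MiniLM-L6-v2", 384, "Apache-2.0", 0.090),
    EmbeddingModel("BAAI/bge-base-en-v1.5", 768, "MIT", 0.210),
    EmbeddingModel("mixedbread-ai/mxbai-embed-large-v1", 1024, "Apache-2.0", 0.640),
    EmbeddingModel("intfloat/multilingual-e5-large", 1024, "MIT", 2.240),
]


def candidates(models: Iterable[EmbeddingModel],
               max_size_gb: float,
               licenses: Set[str]) -> List[EmbeddingModel]:
    """Return models with a listed size within budget and an allowed license,
    ordered from smallest to largest download size."""
    fits = [
        m for m in models
        if m.size_gb is not None
        and m.size_gb <= max_size_gb
        and m.license in licenses
    ]
    return sorted(fits, key=lambda m: m.size_gb)


def raw_vector_bytes(dimensions: int, n_vectors: int, bytes_per_value: int = 4) -> int:
    """Approximate raw storage for n_vectors embeddings.

    Assumes float32 values (4 bytes each) and excludes any index overhead.
    """
    return dimensions * bytes_per_value * n_vectors


if __name__ == "__main__":
    for m in candidates(MODELS, max_size_gb=0.5, licenses={"MIT", "Apache-2.0"}):
        mib = raw_vector_bytes(m.dimensions, 1_000_000) / 2**20
        print(f"{m.name}: {m.dimensions} dims, ~{mib:.0f} MiB per million vectors")
```

The dimension count drives storage and search cost more than the model's download size does: under the float32 assumption, a 1024-dimensional model needs roughly 2.7 times the vector storage of a 384-dimensional one for the same number of documents.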