embedding_provider
: The embedding provider to use while doing embedding. Available values include bedrock
, azure-openai
, huggingface
, mixedbread-ai
, octoai
, openai
, togetherai
, vertexai
, and voyageai
.
embedding_api_key
: The API key to use, if one is required to generate the embeddings through an API service, such as OpenAI.
embedding_aws_access_key_id
: The AWS access key ID to be used for AWS-based embedders, such as Amazon Bedrock.
embedding_aws_region
: The AWS Region ID to be used for AWS-based embedders, such as Amazon Bedrock.
embedding_aws_secret_access_key
: The AWS secret access key to be used for AWS-based embedders, such as Amazon Bedrock.
embedding_model_name
: The specific model to use for the embedding provider, if necessary.
embedding_model_name
values unless otherwise specified are:
bedrock
: None
azure-openai
: text-embedding-ada-002
, with 1536 dimensions
huggingface
: sentence-transformers/all-MiniLM-L6-v2
, with 384 dimensions
mixedbread-ai
: mixedbread-ai/mxbai-embed-large-v1
, with 1024 dimensions
octoai
: thenlper/gte-large
, with 1024 dimensions
openai
: text-embedding-ada-002
, with 1536 dimensions
togetherai
: togethercomputer/m2-bert-80M-8k-retrieval
, with 768 dimensions
vertexai
: textembedding-gecko@001
, with 768 dimensions
voyageai
: None