U<short-workflow-id>_<short-embedding-model-name>_<number-of-dimensions>.
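As a rough illustration of this naming pattern, the following sketch assembles a collection name from its parts. The function name, its arguments, and the sample values are hypothetical. Unstructured derives the short workflow ID and the short embedding model name itself, and how it shortens them is not covered here.

```python
from typing import Optional


def autocreated_collection_name(
    short_workflow_id: str,
    short_embedding_model_name: Optional[str] = None,
    number_of_dimensions: Optional[int] = None,
) -> str:
    """Hypothetical helper that mirrors the documented naming pattern."""
    name = f"U{short_workflow_id}"
    # The model name and dimension count are appended only when
    # embeddings are generated.
    if short_embedding_model_name is not None and number_of_dimensions is not None:
        name += f"_{short_embedding_model_name}_{number_of_dimensions}"
    return name


# Placeholder values, not real workflow IDs or model names.
print(autocreated_collection_name("a1b2c3d", "examplemodel", 1536))
# prints Ua1b2c3d_examplemodel_1536
```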
If Unstructured does not generate embeddings, the new collection’s name will be U<short-workflow-id>.
Unstructuredautocreated.
<collection-name> with the name of the new collection, and replace <property-name> with the name of each additional available property that you want to return results for, such as text, type, element_id, record_id, and so on. The embeddings will be returned in the vector property.
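To make the placeholder substitution concrete, here is a minimal sketch that fills in a Weaviate GraphQL Get query from a collection name and a list of property names. The helper name is hypothetical. The sketch assumes the vector is requested through Weaviate's _additional { vector } field, which may differ from the exact query form this article refers to.

```python
from typing import Sequence


def build_get_query(collection_name: str, property_names: Sequence[str]) -> str:
    """Build a Weaviate GraphQL Get query that also requests each object's vector."""
    props = "\n".join(f"      {name}" for name in property_names)
    return (
        "{\n"
        "  Get {\n"
        f"    {collection_name} {{\n"
        f"{props}\n"
        # Assumption: the embeddings are requested as an additional property.
        "      _additional { vector }\n"
        "    }\n"
        "  }\n"
        "}"
    )


print(build_get_query("Unstructuredautocreated", ["text", "type", "element_id", "record_id"]))
```

The resulting string can be pasted into any GraphQL-capable Weaviate query tool. Sending it over the network is left out on purpose.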
record_id property, as follows:
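One way to express a filter on record_id is a GraphQL where argument. The sketch below builds such a query as a string. The helper name is hypothetical, and the use of valueText assumes that record_id is stored as a text property, which this article does not confirm.

```python
import json


def build_record_id_query(collection_name: str, record_id: str) -> str:
    """Build a Get query restricted to objects whose record_id equals the given value."""
    # json.dumps quotes and escapes the value so it is a valid string literal.
    value = json.dumps(record_id)
    return (
        "{ Get { "
        f"{collection_name}("
        f'where: {{ path: ["record_id"], operator: Equal, valueText: {value} }}'
        ") { text record_id _additional { vector } } } }"
    )


# "example-record-id" is a placeholder, not a real record ID.
print(build_record_id_query("Unstructuredautocreated", "example-record-id"))
```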
WEAVIATE_COLLECTION - The name of the target collection in the instance, represented by --collection (CLI) or collection (Python).
WEAVIATE_HOST - The connection URL to the instance, represented by --hostname (CLI) or hostname (Python).
WEAVIATE_COLLECTION - The name of the target collection in the instance, represented by --collection (CLI) or collection (Python). If no value is provided, see the beginning of this article for the behavior at run time.
WEAVIATE_CLUSTER_URL - The REST endpoint for the Weaviate database cluster, represented by --cluster-url (CLI) or cluster_url (Python).
WEAVIATE_API_KEY - The API key for the database cluster, represented by --api-key (CLI) or api_key (Python).
For the CLI, the --api-key option here is part of the weaviate-cloud command. For Python, the api_key parameter here is part of the CloudWeaviateAccessConfig object.
WEAVIATE_COLLECTION - The name of the target collection in the database, represented by --collection (CLI) or collection (Python). If no value is provided, see the beginning of this article for the behavior at run time.
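The environment variables for a Weaviate database cluster can be gathered in one place before they are handed to the connector. The following is a minimal sketch using only the standard library. The function name is hypothetical, and treating the cluster URL and API key as required is an assumption of this sketch. The dictionary keys reuse the Python parameter names listed above.

```python
import os
from typing import Dict, Mapping


def weaviate_cloud_settings(env: Mapping[str, str] = os.environ) -> Dict[str, str]:
    """Collect Weaviate database cluster settings from environment variables."""
    # Assumption: both of these must be set for a cloud connection.
    required = ("WEAVIATE_CLUSTER_URL", "WEAVIATE_API_KEY")
    missing = [name for name in required if not env.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    settings = {
        "cluster_url": env["WEAVIATE_CLUSTER_URL"],
        "api_key": env["WEAVIATE_API_KEY"],
    }
    # WEAVIATE_COLLECTION is optional. When it is unset, the run-time
    # behavior described at the beginning of this article applies.
    collection = env.get("WEAVIATE_COLLECTION")
    if collection:
        settings["collection"] = collection
    return settings
```

Failing early on a missing variable keeps a misconfigured run from getting as far as the upload step.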
Use the --partition-by-api option (CLI) or partition_by_api (Python) parameter to specify where files are processed:
To process files locally, omit --partition-by-api (CLI) or partition_by_api (Python), or explicitly specify partition_by_api=False (Python).
Local file processing does not use an Unstructured API key or API URL, so you can also omit the following, if they appear:
--api-key $UNSTRUCTURED_API_KEY (CLI) or api_key=os.getenv("UNSTRUCTURED_API_KEY") (Python)
--partition-endpoint $UNSTRUCTURED_API_URL (CLI) or partition_endpoint=os.getenv("UNSTRUCTURED_API_URL") (Python)
UNSTRUCTURED_API_KEY and UNSTRUCTURED_API_URL
To send files to Unstructured for processing, specify --partition-by-api (CLI) or partition_by_api=True (Python).
Unstructured also requires an Unstructured API key and API URL, which you provide by adding the following:
--api-key $UNSTRUCTURED_API_KEY (CLI) or api_key=os.getenv("UNSTRUCTURED_API_KEY") (Python)
--partition-endpoint $UNSTRUCTURED_API_URL (CLI) or partition_endpoint=os.getenv("UNSTRUCTURED_API_URL") (Python)
UNSTRUCTURED_API_KEY and UNSTRUCTURED_API_URL, representing your API key and API URL, respectively.
https://api.unstructuredapp.io/general/v0/general, which is the API URL for the Unstructured Partition Endpoint.
If you do not have an API key, get one now.
If the Unstructured API is self-hosted, the process for generating Unstructured API keys, and the Unstructured API URL that you use, are different. For details, contact Unstructured Sales at sales@unstructured.io.
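The choice between local processing and processing by Unstructured can be sketched as a small helper that assembles the relevant Python parameters. The function name is hypothetical. The dictionary keys reuse the parameter names given above, and passing the result to a specific configuration class is left out because that class is not shown in this section.

```python
import os
from typing import Any, Dict


def partitioner_settings(partition_by_api: bool) -> Dict[str, Any]:
    """Assemble partitioning parameters for local or Unstructured-side processing."""
    settings: Dict[str, Any] = {"partition_by_api": partition_by_api}
    if partition_by_api:
        # Only processing by Unstructured needs the API key and API URL.
        settings["api_key"] = os.getenv("UNSTRUCTURED_API_KEY")
        settings["partition_endpoint"] = os.getenv("UNSTRUCTURED_API_URL")
    return settings


# Local processing: no API key or API URL is read at all.
print(partitioner_settings(partition_by_api=False))
# prints {'partition_by_api': False}
```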