I’m trying to create search index in azure ai search. I have json documents in blob storage.
But I’m facing some errors which I’m not able to resolve.
This is the error that I can see in the debugging session.
There’s a mismatch in vector dimensions. The vector field ‘vector’, with dimension of ‘1536’, expects a length of ‘1536’. However, the provided vector has a length of ‘0’. Please ensure that the vector length matches the expected length of the vector field. Read the following documentation for more details
Seems like some of the chunks from output of text splitter is empty, but I have validated that input to splitter is not empty.
I’m pasting the skills in skillset json file.
“skills”: [
{
"@odata.type": "#Microsoft.Skills.Text.SplitSkill",
"name": "SplitTextSkill",
"description": "Split combined text into chunks of 4096 characters",
"context": "/document",
"defaultLanguageCode": "en",
"textSplitMode": "pages",
"maximumPageLength": 4096,
"pageOverlapLength": 0,
"maximumPagesToTake": 0,
"inputs": [
{
"name": "text",
"source": "/document/combined_text"
}
],
"outputs": [
{
"name": "textItems",
"targetName": "chunks"
}
]
},
{
"@odata.type": "#Microsoft.Skills.Text.AzureOpenAIEmbeddingSkill",
"name": "Embeddings generation",
"description": "Azure OpenAI Embedding Skill",
"context": "/document/chunks/*",
"resourceUri": "endpointOfAzureOpenaiService",
"apiKey": "<redacted>",
"deploymentId": "akm-aml-embeddings",
"dimensions": 1536,
"modelName": "text-embedding-ada-002",
"inputs": [
{
"name": "text",
"source": "/document/chunks/*"
}
],
"outputs": [
{
"name": "embedding",
"targetName": "vector"
}
],
"authIdentity": null
}
]
user25708898 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.