I hope you’re having a good one.
I’m currently implementing RAG on Azure using OpenAI and Azure AI Search, formerly known as Cognitive Services. I have around 50-65 JSON files that I need to search on my enterprise data. It turns out that in the referencing of the chatbot, I’m only getting the text “citation” and I’m trying to retrieve the DOI, which is the URL to the document online, and the title of the scientific article. this files are saved as .txt
I have formatted my JSON file in this manner where the keys ‘content’ and ‘title’ are the only ones I want to perform a semantic search on and also make retrievable, while I just want the DOI (URL) to be retrievable.
{
"content": "The human eye is a complex organ responsible for vision, capturing light and converting it into neural signals for the brain to interpret. It consists of multiple parts, including the cornea, lens, and retina, each playing a vital role in the process of seeing.",
"date": "2023-07-15",
"Title": "The Magic of Vision",
"editorial_house": "MIT Research Meds and Public Health",
"doi": "https://doi.org/10.1234",
"author": "Dr. John Mayer"
}
Nontheless When im on the azure ai search page I never get my other fileds to be selected in metadata
As you can see, only ‘content’ appears and I still get this unappealing citation in the foot references of my searches. I would be so thankful if you could please show me the way to correctly make my data retrievable in the way I want.
As I’m not using code to do this, only the Azure Studio web, I’m not sure if the only way to do that is by using code. If so, I’ll be happy if someone can cite a repository as I have searched the documentation without any luck. I’m such an attentive and active user, I’ll be very much looking out for your help, guys.
My desired output is something like this
Is this possible? is it possible using the Azure studio or just doing code? Thanks a million!