I have a dataset with 3 features: Version, Model and Brand of cars. How do I apply fuzzy search (using Python) to see if what the user typed matches with the version, model, brand or some/all of these features.? I don’t know how to score this when it have more than one feature/column… Could you help me?
I want to extract what features the user typed (e.g: model, model and brand, model or brand and version). I’m using the fuzzywuzzy lib.
Follow dataset example bellow:
data = {
'Version': ['hb20 1.0 comfort', 'hb20 1.6 premium', 'hr-v 1.8 exl cvt', '1.0 Turbo LTZ (Aut)'],
'Model': ['HB20', 'HB20', 'HR-V', 'Onix'],
'Brand': ['Hyundai', 'Hyundai', 'Honda', 'Chevrolet']
}
Example of output:
user_input = "Hyundai HB20":
output=
{Brand match: 'Hyundai',
Model match: 'HB20'
}
Another example of output:
user_input = "Onix 1.0 turbo":
output=
{Model match: 'Onix',
Version match: '1.0 Turbo LTZ (Aut)'}