I’m trying to actually extract information from a resume in French to be able to filter a list of resume by something for example by skills or by date. the problem i have is the extraction of the first name and last name cause i want to actually show the results after filtering as cards that contains the name of the candidates and his skills, work, age…
i was able to extract skills and the year , mail , number, linkedin since it easy using only regex, the problem is the first name and the last name of the person. knowing that the transformation from pdf to text doesn’t keep the same order of things in resume so the name isn’t actually the first thing and this result is different from one resume to another. I’ve already tried to use spacy but still i can’t seem to get the right answer. Leading me to think that i should just force the user to name the pdf like firstname_lastname.pdf. any better way to approach it ? & Thanks for your answers in advance