I want to download a dictionary with definitions & CEFR levels for a hobby project I am working on. I want to be able to process PDF files down to a “sub-dictionary” with the words used in the book, and label them with their corresponding CEFR levels like in the Cambridge Dictionary (see the screenshot). This way I can filter words with C2 level for example, and only work on them.
I have searched this forum and GitHub, and found various resources for downloading dictionaries with definitions, pronunciations etc., but I couldn’t find a way to extract dictionaries with that information. Cambridge has an API, but I don’t think I can afford that. Is there a way to achieve this without paying for an API? I am also open to different kinds of filtering that might give the same result. I mainly work on Python.