I have a large dataset with 100,000 words with meaningful/unmeaningful words like dgksrt, flower, sdfgnv etc). I want to extract only meaningful words using python program.
one way is to iterate the dataset with dictionary words which is time consuming and not effective. is there a python library to achieve this?
I tried iterating with dictionary word list by having 2 dataframes – 1 for my dataset and another 1 for dictionary words which has more than 5,000,000 words. this is time consuming as every word in my dataset need to iterate through 5,000,000 words every time.