Everybody; I am relatively new to scraping and studied it for around 4 Hours.
I am trying to scrap data from a tour-selling website (Getyourguide.com) just to have something fun and challenging.
Based on what I know, some websites don’t share their information with bots, so we have to use other tools. In addition, some may use Captchas, and I am aware of some libraries that may be able to handle them.
Do you have any suggestions on how to handle such an issue?
I was thinking about recording the web page and performing OCR, but if there is an easier way that may become useful later on, I would rather go that way.
Also, if you have any idea for screen recording to perform OCR, it would be appreciated.
P.S.: I am not trying to use this data for any particular reason or even for marketing.
I am thinking about maybe later on I can make an RSS finder so I can use this experience to develop one.
Also I have to mention since I am familiar with Python I do prefer python but I can work with JS a little bit.