Last summer, I created code for scraping Apple podcast reviews based on the official documentation provided for app-store-scraper:
https://pypi.org/project/app-store-scraper/
Here is the relevant section from my longer Jupyter notebook:
# import packages
import pandas as pd
import numpy as np
import csv
from app_store_scraper import AppStore
# app_name = podcast name from URL
# app_id = podcast id from URL
podcast = AppStore(country='us', app_name='black-girl-gone-a-true-crime-podcast', app_id='1556267741')
podcast.review(how_many=7000)  # change number if necessary
reviews = podcast.reviews
print("This podcast has", len(reviews), "reviews.")  # print() already separates arguments with a space
df = pd.DataFrame()
# columns in each review = date, review, rating, isEdited, userName, title
podcastdf = pd.DataFrame(np.array(podcast.reviews),columns=['review'])
display(podcastdf)
Unfortunately, running the same code now returns a 401 error. Have there been changes to the Apple API that prevent me from collecting the data?
Or could this be a dependency issue caused by package updates?
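To help narrow down the dependency hypothesis, here is a minimal sketch for recording which package versions are installed in the notebook's environment, so they can be compared with the versions that worked last summer. The helper name `installed_versions` is hypothetical, and including `requests` and `urllib3` in the list is an assumption that the scraper relies on them for its HTTP calls.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(package_names):
    """Map each distribution name to its installed version, or None if it is not installed."""
    versions = {}
    for name in package_names:
        try:
            versions[name] = version(name)
        except PackageNotFoundError:
            # The package is not installed in this environment
            versions[name] = None
    return versions


# Assumption: requests and urllib3 are listed because the scraper is
# presumed to use them for HTTP; adjust the list to the actual environment.
packages = ["app-store-scraper", "requests", "urllib3", "pandas", "numpy"]
for name, ver in installed_versions(packages).items():
    print(f"{name}: {ver if ver is not None else 'not installed'}")
```

If the versions match the ones from last summer, a change on Apple's side becomes the more likely explanation for the 401.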