I want to scrape from a Uniprot webpage like this http://www.uniprot.org/uniprot/Q4DQV8 the strings that starts with “Tc00” (in this case “Tc00.1047053511911.60”) using R. I’ve tried the following but the function read_html() doesn’t retrieve me any data I can like that.
library(tidyverse)
library(rvest)
page <- read_html(url)
page_text <- page %>% html_text()
extracted_string <- str_extract(page_text, "Tc00\S*")
print(extracted_string)
Can anyone help me, please?