By default, get_species_page_summary will fetch the species' ECOS page given the URL, but there may be times when the page has already been fetched. In those cases, the page can be specified to save time.

get_species_page_summary(url, species, pause = TRUE)

Arguments

url
The url of the species page
species
The scientific name of the species
pause
Pause for 0.5-3s during scraping [default = TRUE]

Value

A data_frame with four variables:

  • Species
  • Page
  • Scrape_Date
  • Page_Text_MD5

Note

Apparently, FWS does not serve the same version of a species' page up twice in a row. Instead, the same information will be presented in different orders. We have to use strsplit along with str_trim to get clean lines, then sort before doing the MD5 hash.