Recap: Web Scraping in R

Web Scraping in R

Timo Grossenbacher

Instructor

Concepts covered

  • Chapter 1: Introduction to HTML and Web Scraping
  • Chapter 2: Navigation and Selection with CSS
    • Fundamental web technologies and how to exploit them for scraping
    • The rvest package
  • Chapter 3: Advanced Selection with XPATH
    • XPATH functions like position() or text()
    • Node selection based on surrounding nodes (e.g. children)
  • Chapter 4: Scraping Best Practices
    • Behind everything: HTTP (and the httr package)
    • Best practices like throttling and identifying user agents
Web Scraping in R

What to do with the scraped data?

Web Scraping in R

Happy scraping!

Web Scraping in R

Preparing Video For Download...