Code as a Liberal Art, Spring 2022

Unit 2, Tutorial 4 lesson — Wednesday, March 30

More advanced web scraping with DOM trees

Today in class we will focus primarily on the material from Tutorial 3 that we have not yet covered.

We will start with the following sections of the Tutorial 3 table of contents:

  1. Navigating a parse tree
  2. More advanced techniques with BeautifulSoup
  3. Searching the DOM
  4. An example, NYTimes.com
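
As a preview of the first three topics, here is a minimal sketch of navigating and searching a parse tree with BeautifulSoup. The HTML snippet, including its tag structure, the "stories" id, and the "headline" class, is made up for illustration and is not taken from NYTimes.com; the sketch also assumes the bs4 package is installed.

```python
from bs4 import BeautifulSoup

# Hypothetical HTML, invented for this example (not real NYTimes.com markup)
html = """
<html><body>
  <div id="stories">
    <h2 class="headline"><a href="/a">First story</a></h2>
    <h2 class="headline"><a href="/b">Second story</a></h2>
  </div>
</body></html>
"""

# Parse the text into a tree using Python's built-in HTML parser
soup = BeautifulSoup(html, "html.parser")

# Navigating the parse tree: move down, up, and sideways from one node
first = soup.find("h2")                 # first <h2> in the document
link = first.a                          # down: the <a> tag inside that <h2>
parent_id = first.parent["id"]          # up: the id of the enclosing <div>
second = first.find_next_sibling("h2")  # sideways: the next <h2> at the same level

# Searching the DOM: collect every tag that matches a filter
headlines = [h.get_text() for h in soup.find_all("h2", class_="headline")]
hrefs = [a["href"] for a in soup.find_all("a")]

print(headlines)
print(hrefs)
```

Navigation follows the relationships between nodes (child, parent, sibling), while find_all searches the whole tree for matching tags. In practice the two are often combined: search for a container first, then navigate within it.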