Day 3 – More on Python

Wednesday, June 16

9 am–1 pm Pacific Time
Connection Details: Lesson Room Link

There are no extra materials required for day 3, but some sessions will have you working in a virtual jupyter environment, we will provide you with login information at the beginning.

Room 1: Topic Modeling (9:00–11:00 am)
Room 2: Web Scraping (9:00–11:00 am)

Break (11:00–11:15 am)

Room 1: NLP (11:15 am–1:00 pm)
Room 2: API Querying (11:15 am–1:00 pm)

Lesson Plan


Lesson Notes, except NLP

Lesson Notes and details for NLP

Some special topics have materials, such as codebooks, but no specific notes

Topic Modeling (9:00–11:00 am)

  • What is Topic modeling?
  • Reading other peopl's code
  • Formatting data
  • Building a model
  • Coherence and choosing a model
  • Visualization

Web Scraping (9:00–11:00 am)

  • What is web scraping?
  • Introduction to Beautiful Soup
  • Parsing and saving data
  • Using Regular Expressions
  • To scrape or not to scrape?

Break (11:00–11:15 am)

Natural Language Processing (11:15 am–1:00 pm)

API Querying (11:15 am–1:00 pm)

  • What is an API?
  • The wide world of Web-API's
  • Getting Setup
  • Endpoints
  • Querying
  • Formatting and extracting data
  • Authentication