Code and data for experimenting with and analyzing tv guest appearances on late-night comedy shows.
- scrapy python package
- rdflib python package
- wikipedia python package
Here's an example of how to crawl wikipedia for episode data.
Possible spider names are: tds, colbert, fallon
cd scrapy_project
scrapy crawl name -o name_items.json -t json