Data Planes

Here you can find data and reproducible code for some of my projects. Feel free to use the code as it is or modify (and improve) it for your own purposes.

Please do not hesitate to get in touch if you encounter any errors.

Accident analyses

Aviation accidents (replication data; script for projecting, mapping, and analyzing crashes in Florida 2014 with point pattern analyses) [slightly outdated]

Road accidents (script for geocoding, mapping, and descriptively analyzing crashes in South Australia 2016)

Amazon data

Amazon scraper (scraping functions for 1. customer reviews from URLs and 2. product information of best sellers)

Customer reviews (script for automated scraping of all customer reviews for one or more Amazon products, with product ID/ASIN as input)

Crime analyses

Death penalties (data; script for scraping, wrangling, mapping, and visualizing historical data on executions in the United States, 1801-1900)

Serial killers (data; scripts for scraping, geocoding, visualizing, and mapping Wikipedia data on international and US American serial killers)

Network data

Renaissance Florence (relational and attribute data on Florentine families, 1426-34)

Spatial data

Airline flight routes (script for mapping airline routes using OpenFlights data in combination with NASA's night lights images)

Spatial gravity models (replication data; scripts for building dyadic data sets and conducting predictive spatial gravity analyses)

Text mining

Classic literature (script for processing and analyzing public domain works using the example of Bram Stoker's Dracula)

Movie scripts (script for processing and analyzing PDF files using the screenplay of The Room)

Music lyrics (script for processing, analyzing, and visualizing song lyrics using STARSET albums)

Reddit threads (script for scraping, processing, and mining user comments)

Twitter data

Mapping locations (script for scraping tweets with #MeToo, extracting and geocoding user locations using Google Maps API, and mapping locations)

Timeline analysis (script for scraping, cleaning, and analyzing tweets from Donald Trump's timeline, e.g., via sentiment analysis and publication statistics)

YouTube data

Viewer engagement (replication code for analyzing YouTube comments and video statistics on the Florida High School Shooting in February 2018)