NYC Taxi Complaint Data

Elliot Ramos

Refusing to see the truth at refusals

In October 2013 and January 2014, I obtained a series of files from TLC, 311 and DoITT. The agencies collaborated to provide an extensive data set that included key fields not found on New York City's open data portal. Specifically, these are the “descriptor” fields, which include TLC’s categorization of taxi complaints, as well as the verbatim narrative field, which is filled out by a 311 dispatcher or via online form submission by residents.

The data set is extensive and at some points required months of manual work. For the purposes of the class project, I’m focusing on analyzing and slicing the data using pandas.

Here is the data provided by the city of New York. At first, the city provided two files: complaints with summonses and complaints without summonses. Specific locations were not provided, but Service Request numbers were. The data goes back to January 2010 by incident date; a handful of earlier records were included in the set and were excluded from the overall analysis as noise.

Excel files: Using Excel, those files were stacked on top of each other, with each record originally flagged TRUE if it resulted in a summons.

Subsequent requests were made for additional data that included both Service Request numbers and the Open Data portal Unique IDs. This allowed the data to be merged, using csvkit, with the Open Data set, which includes location data such as x, y coordinates.

Open Data taxi complaint set: https://data.cityofnewyork.us/Social-Services/311-Taxi-Complaints/uppf-z66u

Subsequent requests were made to fill...
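The stacking and merging steps described above were done in Excel and csvkit, but the same idea can be sketched in pandas. This is a minimal illustration under stated assumptions, not the actual workflow: the column names (sr_number, unique_key, x_coordinate, y_coordinate) and every value in the tiny tables are hypothetical stand-ins, since the real file layouts are not shown here.

```python
import pandas as pd

# Hypothetical stand-ins for the two files the city first provided.
# Column names and values are invented for illustration only.
with_summons = pd.DataFrame({
    "sr_number": ["SR-001", "SR-002"],
    "descriptor": ["Driver Complaint", "Fare Dispute"],
})
without_summons = pd.DataFrame({
    "sr_number": ["SR-003"],
    "descriptor": ["Driver Complaint"],
})

# Flag each record by the file it came from, then stack the two tables.
with_summons["summons"] = True
without_summons["summons"] = False
complaints = pd.concat([with_summons, without_summons], ignore_index=True)

# Hypothetical crosswalk from the later request: Service Request number
# paired with the Open Data portal Unique ID.
crosswalk = pd.DataFrame({
    "sr_number": ["SR-001", "SR-003"],
    "unique_key": ["101", "103"],
})

# Hypothetical extract of the Open Data set, which carries the coordinates.
open_data = pd.DataFrame({
    "unique_key": ["101", "103"],
    "x_coordinate": [987654.0, 1001234.0],
    "y_coordinate": [212345.0, 198765.0],
})

# Left joins keep every complaint, even those with no location match.
merged = (
    complaints
    .merge(crosswalk, on="sr_number", how="left")
    .merge(open_data, on="unique_key", how="left")
)

print(merged)
```

A left join is used here so that complaints without a matching Unique ID stay in the table with empty coordinates rather than silently dropping out, which makes it easier to count how many records could not be located.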