13. Analyzing NYC Taxi Data#

13.1. Introduction#

13.2. Learning Objectives#

13.3. About the Dataset#

13.4. Installation#

13.5. Library Import#

13.6. Installing and Loading Extensions#

13.7. Loading Taxi Data#

13.7.1. Inspecting the Data#

13.8. Temporal Analysis#

13.8.1. Trips by Hour of Day#

13.8.2. As-Of Joins (Time-Aware Joins)#

13.8.3. Trips by Day of Week#

13.8.4. Peak vs Off-Peak Analysis#

13.9. Spatial Analysis#

13.9.1. Pickup Location Hotspots#

13.9.2. Trip Distance Distribution#

13.9.3. Geographic Distribution Using H3#

13.10. Trip Flow Analysis#

13.10.1. Top Origin-Destination Pairs#

13.10.2. Airport Trips Analysis#

13.11. Payment and Economic Analysis#

13.11.1. Payment Type Analysis#

13.11.2. Tipping Patterns#

13.11.3. Fare per Mile Analysis#

13.12. Passenger Behavior Analysis#

13.13. Multi-Month Analysis#

13.14. Visualization#

13.15. Performance Optimization Tips#

13.15.1. Filter Early#

13.15.2. Use Parquet Partitioning#

13.15.3. Sample for Exploration#

13.16. Key Takeaways#

13.17. Exercises#