Super Forecasters: Project Proposal Pitch
Super Forecasters: Project Proposal Pitch
Problem/Goal
Data Collection/Manipulation
Analysis/Forecasting/Visualization
Timeline
The Problem/The Goal
Crime rates:
heterogeneous
Goal: Find out the set of variables most closely correlated with crime,
and predict with a certain level of accuracy where and when
next crime will take place.
Data Collection
Sample data collection sources:
Predicted Predicting 1)Crime data: monthly/municipality level:
variables variables National System of Crime Data
Sample
Independent 2)Demographic information:
-Economic
Variables yearly/municipality level:
activity National Statistics Bureau (NSB)
-Cost of
living 3) Employment rates: yearly/municipality
-Population level: National Insurance Institute
-Education 4) Geo. Attributes: municipality: NSB
Crime rates rates 5) Economic activity/Cost of living:
-Employment
rate quarterly municipality:
-Geo. Economic Ministry
Attributes Data cleaning Procedure:
-Time trends 1) Scrapping data
2) Standardize data to same units
3) Design the database/tables (SQL)
4) Sort/merge/link (relational database)
Analysis
Covariate Analysis
Heat Map
Visualization/Dashboard
Gather data
4th – 6th Week Clean collected data
Standardize and merge data