For this week, I wanted to identify the focus of my analysis of the GTD dataset. This will most likely change in the future, but it serves as a good start.
USA
- Determine if a relationship exists between population and number of attacks. Not yet sure how to test this relationship statistically (linear / logistic regression?).
- First gather each city's population for the year of the attack from population data
- Map the cities and use marker(s) to denote population number and/or number of attacks
- Determine if a relationship exists between population and severity of attacks. Not yet sure how to test this relationship statistically (linear / logistic regression?).
- First gather each city's population for the year of the attack from population data
- Map the cities and use marker(s) to denote population number and/or severity of attack
- Define what severity of attack means (deaths, injuries, destruction, etc.)
- Form clusters to see areas where attacks happen
- Mapping the cities suggests there are patches of the USA where attacks haven’t occurred. Possibly research background info on why.
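As a rough starting point for the population questions above, here is a minimal Python sketch that counts attacks per city and year, sums one possible severity score, and computes a Pearson correlation against population. Everything in it is an assumption for illustration: the toy rows are invented, the field names `nkill` and `nwound` are assumed rather than taken from these notes, and "deaths plus injuries" is only one candidate definition of severity. Pearson correlation only captures linear association, so it may not be the right tool in the end.

```python
from collections import Counter
from math import sqrt

# Invented toy rows standing in for GTD records (not real data).
# The field names nkill / nwound are assumptions for this sketch.
attacks = [
    {"city": "Smallville", "year": 2001, "nkill": 0, "nwound": 1},
    {"city": "Midtown", "year": 2001, "nkill": 1, "nwound": 0},
    {"city": "Midtown", "year": 2001, "nkill": 0, "nwound": 3},
    {"city": "Bigcity", "year": 2001, "nkill": 2, "nwound": 4},
    {"city": "Bigcity", "year": 2001, "nkill": 0, "nwound": None},
    {"city": "Bigcity", "year": 2001, "nkill": 1, "nwound": 2},
]

# Hypothetical population lookup keyed by (city, year); values are made up.
population = {
    ("Smallville", 2001): 100_000,
    ("Midtown", 2001): 500_000,
    ("Bigcity", 2001): 900_000,
}


def severity(record):
    """One possible severity definition: deaths plus injuries (missing = 0)."""
    return (record["nkill"] or 0) + (record["nwound"] or 0)


def pearson(xs, ys):
    """Pearson correlation coefficient; None if either series is constant."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    if spread_x == 0 or spread_y == 0:
        return None
    return cov / (spread_x * spread_y)


def summarize(attacks, population):
    """Correlate population with attack count and with total severity."""
    counts, total_severity = Counter(), Counter()
    for rec in attacks:
        key = (rec["city"], rec["year"])
        counts[key] += 1
        total_severity[key] += severity(rec)
    # Only keep city/year pairs for which a population figure is available.
    keys = [k for k in counts if k in population]
    pops = [population[k] for k in keys]
    return {
        "attacks_vs_population": pearson(pops, [counts[k] for k in keys]),
        "severity_vs_population": pearson(pops, [total_severity[k] for k in keys]),
    }


result = summarize(attacks, population)
print(result)
```

Keying both tables by (city, year) keeps the population figure matched to the year of the attack, which is the join the bullets above describe.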
Iraq
- Study the difference in number of attacks for the years 1970-2017
- Research background info on why the number of attacks increased substantially in the 2000s
- Compare severity of attacks in the 1900s (1970-1999 in this dataset) and the 2000s
- Is there a difference? If so, research possibilities on why
- Find where terrorist groups in the country are based
- A correlation between terrorist group locations and the number / severity of attacks is possible
- Form clusters to see where attacks happen
- Mapping the attacks suggests there are patches where attacks haven’t occurred. Possibly research background info on why.
- If population info can be found, do the same population analysis as for the USA
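The yearly comparison for Iraq could be sketched along these lines: count attacks per year, then compare average severity before and after a split year. This is only a sketch under assumptions: the rows are invented, the field names `country_txt`, `iyear`, `nkill`, and `nwound` are assumed, the split at 2000 is my reading of "1900s vs 2000s", and severity again uses the tentative "deaths plus injuries" definition.

```python
from collections import Counter
from statistics import mean

# Invented toy rows standing in for GTD records (not real data).
# Field names are assumptions for this sketch.
records = [
    {"country_txt": "Iraq", "iyear": 1985, "nkill": 1, "nwound": 0},
    {"country_txt": "Iraq", "iyear": 1995, "nkill": 0, "nwound": 2},
    {"country_txt": "Iraq", "iyear": 2005, "nkill": 3, "nwound": 4},
    {"country_txt": "Iraq", "iyear": 2005, "nkill": 5, "nwound": 6},
    {"country_txt": "Iraq", "iyear": 2010, "nkill": 2, "nwound": None},
    {"country_txt": "United States", "iyear": 2005, "nkill": 1, "nwound": 1},
]


def attacks_per_year(records, country):
    """Number of recorded attacks in `country` for each year."""
    return Counter(r["iyear"] for r in records if r["country_txt"] == country)


def mean_severity_by_period(records, country, split_year=2000):
    """Average severity (deaths + injuries) before vs. from `split_year` on."""
    before, after = [], []
    for r in records:
        if r["country_txt"] != country:
            continue
        score = (r["nkill"] or 0) + (r["nwound"] or 0)  # missing counts as 0
        (before if r["iyear"] < split_year else after).append(score)
    return {
        "before": mean(before) if before else None,
        "after": mean(after) if after else None,
    }


yearly = attacks_per_year(records, "Iraq")
periods = mean_severity_by_period(records, "Iraq")
print(sorted(yearly.items()))
print(periods)
```

Treating missing casualty counts as zero is a simplification; dropping those rows instead would be an equally defensible choice and could change the averages.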