Rare events

Summary

Rare or extreme events are events that occur with low frequency, and often refers to infrequent events that have a widespread effect and which might destabilize systems (for example, stock markets,[1] ocean wave intensity[2] or optical fibers[3] or society[4]). Rare events encompass natural phenomena (major earthquakes, tsunamis, hurricanes, floods, asteroid impacts, solar flares, etc.), anthropogenic hazards (warfare and related forms of violent conflict, acts of terrorism, industrial accidents, financial and commodity market crashes, etc.), as well as phenomena for which natural and anthropogenic factors interact in complex ways (epidemic disease spread, global warming-related changes in climate and weather, etc.).

Overview edit

Rare or extreme events are discrete occurrences of infrequently observed events. Despite being statistically improbable, such events are plausible insofar as historical instances of the event (or a similar event) have been documented.[5] Scholarly and popular analyses of rare events often focus on those events that could be reasonably expected to have a substantial negative effect on a society—either economically[6] or in terms of human casualties[7] (typically, both). Examples of such events might include an 8.0+ Richter magnitude earthquake, a nuclear incident that kills thousands of people, or a 10%+ single-day change in the value of a stock market index.[8][9][10]

Modeling and analysis edit

Rare event modeling (REM) refers to efforts to characterize the statistical distribution parameters, generative processes, or dynamics that govern the occurrence of statistically rare events, including but not limited to highly influential natural or human-made catastrophes. Such “modeling” may include a wide range of approaches, including, most notably, statistical models for analyzing historical event data[11][12] and computational software models that attempt to simulate rare event processes and dynamics.[13] REM also encompasses efforts to forecast the occurrence of similar events over some future time horizon, which may be of interest for both scholarly and applied purposes (e.g., risk mitigation and planning).[14] Novel data collection techniques can be used for learning about rare events data.[15]

Relevant data sets edit

In many cases, rare and catastrophic events can be regarded as extreme-magnitude instances of more mundane phenomena. For example, seismic activity, stock market fluctuations, and acts of organized violence all occur along a continuum of extremity, with more extreme-magnitude cases being statistically more infrequent.[16] Therefore, rather than viewing rare event data as its own class of information, data concerning "rare" events often exists as a subset of data within a broader parent event class (e.g., a seismic activity data set would include instances of extreme earthquakes, as well as data on much lower-intensity seismic events).

The following is a list of data sets focusing on domains that are of broad scholarly and policy interest, and where "rare" (extreme-magnitude) cases may be of particularly keen interest due to their potentially devastating consequences. Descriptions of the data sets are extracted from the source websites or providers.

Conflicts edit

  • Armed Conflict Database:[17] The Armed Conflict Database (ACD) monitors armed conflicts worldwide, focusing on political, military and humanitarian trends in current conflicts, whether they are local rebellions, long-term insurgencies, civil wars or inter-state conflicts. In addition to the comprehensive historical background for each conflict, the weekly timelines and the monthly updates, the statistics, data and reports in the ACD date back to 1997.
  • Armed Conflict Location & Event Data Project:[18] The Armed Conflict data set covers events occurring in Africa from 1997 to present. This data set includes the event date, longitude, latitude, and fatality magnitude scale.
  • Militarized Interstate Disputes[19] The Militarized Interstate Disputes (MID) data set "provides information about conflicts in which one or more states threaten, display, or use force against one or more other states between 1816 and 2010."
  • Political Instability Task Force (PITF) State Failure Problem Set, 1955–2013:[20] The Political Instability Task Force (PITF), State Failure Problem Set is part of a larger armed conflict database produced by the Center for Systemic Peace from open source data. Data in PITF are available on various subsets: ethnic war, revolutionary war, adverse regime change, and genocide or politicide.
  • Rand Database ofcon Worldwide Terrorism Incidents:[21] The Rand Database of Worldwide Terrorism Incidents data set covers terrorism incidents worldwide from 1968 through 2009 but is not currently active. The data set includes a date, location (city, country), perpetrator, detailed description, and number of injuries and fatalities.
  • Major Episodes of Political Violence:[22] The Major Episodes of Political Violence data set is part of a larger armed conflict database produced by the Center for Systemic Peace. Political Violence data include annual, cross-national, time-series data on interstate, societal, and communal warfare magnitude scores (independence, interstate, ethnic, and civil; violence and warfare) for all countries.

Natural disasters edit

  • Advanced National Seismic System (ANSS) Comprehensive Earthquake Catalog (ComCat):[23] The ANSS Comprehensive Catalog (ComCat) contains earthquake source parameters (e.g. hypocenters, magnitudes, phase picks and amplitudes) and other products (e.g. moment tensor solutions, macroseismic information, tectonic summaries, maps) produced by contributing seismic networks.
  • Dartmouth Flood Observatory:[24] Dartmouth Flood Observatory uses "Space-based Measurement and Modeling of Surface Water" to track floods and uses news reporting to validate the results. This data set includes the country, start date, end date, affected square km, and cause of the flood. Additionally, this data set includes many magnitude scales, such as: dead, displaced, severity, damage, and flood magnitude.
  • U.S. National Flood Insurance Program:[25] The U.S. National Flood Insurance Program data set contains a data table detailing flooding events with 1,500 or more paid losses from 1978 to the current month and year. The table includes the name and year of the event, the number of paid losses, the total amount paid and the average payment per loss.
  • FAOSTAT (Famine):[26] The FAOSTAT data set was developed by the Statistics Division of the Food and Agricultural Organization of the United Nations (FAO). It is an active, global data set that covers famine events from 1990–2013.
  • Global Volcanism Program:[27] "Volcanoes of the World is a database describing the physical characteristics of volcanoes and their eruptions." The data contain a start date, end date, volcano name (which can be used to look up the location) and VEI magnitude scale.
  • International Disaster Database:[28] EM-DAT contains essential core data on the occurrence and effects of over 18,000 mass disasters in the world from 1900 to present. The database is compiled from various sources, including UN agencies, non-governmental organizations, insurance companies, research institutes and press agencies.
  • NOAA Natural Hazards:[29] The Natural Hazards dataset is part of the National Geophysical Data Center run by the U.S. National Oceanic and Atmospheric Administration (NOAA). The National Geophysical Data Center archives and assimilates tsunami, earthquake and volcano data to support research, planning, response and mitigation. Long-term data, including photographs, can be used to establish the history of natural hazard occurrences and help mitigate against future events.

Diseases edit

  • FluView:[30] FluView is produced by the U.S. Centers for Disease Control (CDC) and provides weekly influenza surveillance information in the United States by census area and includes the number of people tested and number of positive cases.
  • Global Health Atlas:[31] The Global Health Atlas contains data on four communicable diseases: cholera, influenza, polio, and yellow fever. It is an active, global data set that covers number of cases and fatalities due to these infectious diseases.

Others edit

  • Aviation Safety Database:[32] The Aviation Safety Database covers aviation safety incidents around the world. Every incident reports the location of the incident, the departing and arriving airports, number of fatalities and type of Airplane involved in the incident.
  • Database of Radiological Incidents and Related Events:[33] The Database of Radiological Incidents and Related Events covers events that resulted in acute radiation exposures to humans sufficient enough to cause casualties. The database includes the date, location, number of deaths, number of injuries and highest radiation dose recorded.

See also edit

References edit

  1. ^ Sornette, Didier (2017). Why stock markets crash : critical events in complex financial systems. Princeton University Press. ISBN 9781400885091.
  2. ^ Dysthe, Kristian; Krogstad, Harald E.; Müller, Peter (January 2008). "Oceanic Rogue Waves". Annual Review of Fluid Mechanics. 40 (1): 287–310. Bibcode:2008AnRFM..40..287D. doi:10.1146/annurev.fluid.40.111406.102203.
  3. ^ Dudley, John M.; Dias, Frédéric; Erkintalo, Miro; Genty, Goëry (28 September 2014). "Instabilities, breathers and rogue waves in optics". Nature Photonics. 8 (10): 755–764. arXiv:1410.3071. Bibcode:2014NaPho...8..755D. doi:10.1038/nphoton.2014.220. S2CID 53349599.
  4. ^ King, Gary; Zeng, Langche (2001). "Logistic Regression in Rare Events Data". Political Analysis. 9 (2): 137–163. doi:10.1093/oxfordjournals.pan.a004868.
  5. ^ Morio, J., Balesdent, M. (2015). Estimation of Rare Event Probabilities in Complex Aerospace and Other Systems. Elsevier Science. http://store.elsevier.com/product.jsp?isbn=9780081000915&pagename=search
  6. ^ Sanders, D. (2002). The management of losses arising from extreme events. Paper presented at General Insurance Convention. http://www.actuaries.org.uk/research-and-resources/documents/management-losses-arising-extreme-events
  7. ^ Clauset, Aaron; Woodard, Ryan (2013). "Estimating the historical and future probabilities of large terrorist events". The Annals of Applied Statistics. 7 (4): 1838–1865. arXiv:1209.0089. doi:10.1214/12-AOAS614. S2CID 3088917.
  8. ^ Ghil, M.; Yiou, P.; Hallegatte, S.; Malamud, B. D.; Naveau, P.; Soloviev, A.; Friederichs, P.; Keilis-Borok, V.; Kondrashov, D.; Kossobokov, V.; Mestre, O.; Nicolis, C.; Rust, H. W.; Shebalin, P.; Vrac, M.; Witt, A.; Zaliapin, I. (2011). "Extreme events: Dynamics, statistics and prediction". Nonlinear Processes in Geophysics. 18 (3): 295–350. Bibcode:2011NPGeo..18..295G. doi:10.5194/npg-18-295-2011.
  9. ^ Sharma, A. S.; Bunde, A.; Dimri, V.P.; Baker, D.N. (6 May 2013). Extreme events and natural hazards: The complexity perspective. Wiley. ISBN 9781118672235.
  10. ^ Watkins, N. W. (2013). "Bunched black (And grouped grey) swans: Dissipative and non-dissipative models of correlated extreme fluctuations in complex geosystems" (PDF). Geophysical Research Letters. 40 (2): 402–410. Bibcode:2013GeoRL..40..402W. doi:10.1002/grl.50103.
  11. ^ King, Gary; Zeng, Langche (2001). "Logistic Regression in Rare Events Data". Political Analysis. 9 (2): 137–163. doi:10.1093/oxfordjournals.pan.a004868. ISSN 1047-1987. JSTOR 25791637.
  12. ^ King, Gary; Zeng, Langche (2001). "Explaining Rare Events in International Relations". International Organization. 55 (3): 693–715. doi:10.1162/00208180152507597. ISSN 0020-8183. JSTOR 3078661. S2CID 17865688.
  13. ^ Klüppelberg, Claudia (1997). Modelling Extremal Events. doi:10.1007/978-3-642-33483-2. ISBN 978-3-642-08242-9.
  14. ^ Goodwin, Paul; Wright, George (2010). "The limits of forecasting methods in anticipating rare events" (PDF). Technological Forecasting and Social Change. 77 (3): 355–368. doi:10.1016/j.techfore.2009.10.008.
  15. ^ King, Gary; Zeng, Langche (2002-05-30). "Estimating risk and rate levels, ratios and differences in case-control studies". Statistics in Medicine. 21 (10): 1409–1427. doi:10.1002/sim.1032. ISSN 0277-6715. PMID 12185893. S2CID 11387977.
  16. ^ Clauset, Aaron; Shalizi, Cosma Rohilla; Newman, M. E. J. (2009). "Power-Law Distributions in Empirical Data". SIAM Review. 51 (4): 661–703. arXiv:0706.1062. Bibcode:2009SIAMR..51..661C. doi:10.1137/070710111. S2CID 9155618.
  17. ^ https://acd.iiss.org/
  18. ^ http://www.acleddata.com/data/
  19. ^ https://web.archive.org/web/20141219135756/http://www.correlatesofwar.org/COW2%20Data/MIDs/MID40.html
  20. ^ http://www.systemicpeace.org/inscrdata.html
  21. ^ https://www.rand.org/nsrd/projects/terrorism-incidents.html
  22. ^ http://www.systemicpeace.org/inscrdata.html
  23. ^ https://earthquake.usgs.gov/earthquakes/search/
  24. ^ http://floodobservatory.colorado.edu/
  25. ^ http://www.fema.gov/policy-claim-statistics-flood-insurance/policy-claim-statistics-flood-insurance/policy-claim-13
  26. ^ http://faostat.fao.org/
  27. ^ http://www.volcano.si.edu/search_eruption.cfm
  28. ^ http://www.emdat.be/
  29. ^ http://www.ngdc.noaa.gov/hazard/
  30. ^ http://gis.cdc.gov/grasp/fluview/fluportaldashboard.html
  31. ^ [1]
  32. ^ http://aviation-safety.net/database/
  33. ^ http://www.johnstonsarchive.net/nuclear/radevents/