[Home ] [Archive]   [ فارسی ]  
:: Main :: About :: Current Issue :: Archive :: Search :: Submit :: Contact ::
:: Volume 12, Issue 1 (9-2022) ::
JGST 2022, 12(1): 63-79 Back to browse issues page
Analyzing Spatial-Temporal Distribution of Natural Hazard Events in Iran (1390-1400 SH) Automatically Extracted from News Stories
M. Shakhesi *, A. A. Alesheikh
Abstract:   (331 Views)
Analyzing natural hazards due to economic, environmental, and social effects is necessary for crisis management. However, good analysis comes from good data. In this case, natural hazard databases or inventories that contain all spatial, temporal, and other relevant information for each event are required. To realize that, news and social media, which provide detailed information about the hazards, are two precious resources. Nevertheless, most works manually extracted events from these text resources in the inventory development process. This paper presents a framework for extracting natural hazard events automatically from news stories by leveraging text mining techniques. By implementing the framework for the study region of Iran, we analyzed the spatial-temporal distribution of natural hazards.
Materials and Methods
According to spatial and temporal coverage of Mehr news agency, we selected this website as the main resource, and for training machine-learning-based models, we used ISNA's news articles. The process starts by mining web pages. All irrelevant records such as the news stories about maneuvers, conferences, and contradicts are removed automatically. Then, standardizing the texts of news articles should be accomplished. In the next stage, the text classification technique determines whether a news story is about a newly occurred event or a current event, in which context about the event the news story is published, and which natural hazards are pointed out in the text. For the first and the second text classification task, we used machine-learning-based models which achieved 0.875 and 0.716 for F-score, respectively. For the third task, we developed leveraged a rule-based model. The results of text classifications are used in the proceeding steps, including toponym recognition and resolution, information extraction, and topic detection and tracking.
Discussion and Results
The results show that although most natural hazards have a specific temporal distribution, the highest total frequency is for a Solar Hijri year's initial and final months. Spatially, the storm occurs more in eastern than western provinces. The diversity of other meteorological hazards, except dust, in northern provinces, is more than in southern ones. Regardless of the fatalities, the highest frequency of reported earthquake events is for southern provinces.
This paper presents a framework for automatically extracting natural hazard events from news stories. The framework leverages several text mining techniques such as text classification and information extraction to develop an inventory of the hazards. We implemented the framework to analyze the natural hazards of Iran from 1390 to 1400 Solar Hijri. Comparing the number of extracted events with the thematic maps published by the National Disaster Management Organization shows that the differences vary with the province. Based on that, the frequency of extracted events for some provinces equals the official statistics; hence the analysis for mined events is generalizable to real-world situations.
Article number: 5
Keywords: Natural Hazard, Spatial-Temporal Distribution, News Stories, Text Mining, Information Extraction
Full-Text [PDF 1955 kb]   (120 Downloads)    
Type of Study: Research | Subject: GIS
Send email to the article author

Add your comments about this article
Your username or Email:


XML   Persian Abstract   Print

Download citation:
BibTeX | RIS | EndNote | Medlars | ProCite | Reference Manager | RefWorks
Send citation to:

Shakhesi M, Alesheikh A A. Analyzing Spatial-Temporal Distribution of Natural Hazard Events in Iran (1390-1400 SH) Automatically Extracted from News Stories. JGST 2022; 12 (1) :63-79
URL: http://jgst.issge.ir/article-1-1098-en.html

Rights and permissions
Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Volume 12, Issue 1 (9-2022) Back to browse issues page
نشریه علمی علوم و فنون نقشه برداری Journal of Geomatics Science and Technology