A Review of Unstructured Data Analysis and Parsing Methods

Shubham Jain, Amy De Buitleir, Enda Fallon

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

15 Citations (Scopus)

Abstract

Computer applications generate an enormous amount of data every day through their logs, system-generated files or other reports. This generated data depicts the state of the running system and contains abundant information that can be used for system diagnostics and monitoring. Network monitoring systems produce a wide variety of unstructured information, so there is a need for an automated way to extract the relevant data, which currently requires multitude of custom parsers. Developing and testing custom parsers can be time-consuming. Instead, data can be automatically processed and parsed into a machine-readable format, building a generic model for standard or vendor-specific data, and generating insights for analytics, anomaly detection, intrusion detection, node failures and various other applications. This paper reviews some existing approaches for unstructured data mining and parsing and discusses the challenges in information extraction, creation of knowledge bases and presents a generic framework for automatic parsing.

Original languageEnglish
Title of host publication2020 International Conference on Emerging Smart Computing and Informatics, ESCI 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages164-169
Number of pages6
ISBN (Electronic)9781728152639
ISBN (Print)9781728152639
DOIs
Publication statusPublished - Mar 2020
Event2nd IEEE International Conference on Emerging Smart Computing and Informatics, ESCI 2020 - Pune, India
Duration: 12 Mar 202014 Mar 2020

Publication series

Name2020 International Conference on Emerging Smart Computing and Informatics, ESCI 2020

Conference

Conference2nd IEEE International Conference on Emerging Smart Computing and Informatics, ESCI 2020
Country/TerritoryIndia
CityPune
Period12/03/2014/03/20

Keywords

  • Data Mining
  • Information Extraction
  • Knowledge base
  • NLP
  • Similarity

Fingerprint

Dive into the research topics of 'A Review of Unstructured Data Analysis and Parsing Methods'. Together they form a unique fingerprint.

Cite this