A framework for distributed cleaning of data streams

Saul Gill, Brian Lee

Research output: Contribution to journalConference articlepeer-review

12 Citations (Scopus)

Abstract

Vast and ever increasing quantities of data are produced by sensors in the Internet of Things (IoT). The quality of this data can be very variable due to problems with sensors, incorrect calibration etc. Data quality can be greatly enhanced by cleaning the data before it reaches its end user. This paper reports on the construction of a distributed cleaning system (DCS) to clean data streams in real-time for an environmental case-study. A combination of declarative and statistical model based cleaning methods are applied and initial results are reported.

Original languageEnglish
Pages (from-to)1186-1191
Number of pages6
JournalProcedia Computer Science
Volume52
Issue number1
DOIs
Publication statusPublished - 2015
EventThe International Conference on Ambient Systems, Networks and Technologies, ANT-2015, the International Conference on Sustainable Energy Information Technology, SEIT-2015 - London, United Kingdom
Duration: 2 Jun 20155 Jun 2015

Keywords

  • Data stream cleaning
  • Declarative cleaning
  • Distributed cleaning
  • Internet of things
  • Model based cleaning
  • Regression
  • Sensor data modelling

Fingerprint

Dive into the research topics of 'A framework for distributed cleaning of data streams'. Together they form a unique fingerprint.

Cite this