Abstract
Vast and ever increasing quantities of data are produced by sensors in the Internet of Things (IoT). The quality of this data can be very variable due to problems with sensors, incorrect calibration etc. Data quality can be greatly enhanced by cleaning the data before it reaches its end user. This paper reports on the construction of a distributed cleaning system (DCS) to clean data streams in real-time for an environmental case-study. A combination of declarative and statistical model based cleaning methods are applied and initial results are reported.
Original language | English |
---|---|
Pages (from-to) | 1186-1191 |
Number of pages | 6 |
Journal | Procedia Computer Science |
Volume | 52 |
Issue number | 1 |
DOIs | |
Publication status | Published - 2015 |
Event | The International Conference on Ambient Systems, Networks and Technologies, ANT-2015, the International Conference on Sustainable Energy Information Technology, SEIT-2015 - London, United Kingdom Duration: 2 Jun 2015 → 5 Jun 2015 |
Keywords
- Data stream cleaning
- Declarative cleaning
- Distributed cleaning
- Internet of things
- Model based cleaning
- Regression
- Sensor data modelling