An ensemble machine learning method for microplastics identification with FTIR spectrum

Research output: Contribution to journalArticlepeer-review

37 Citations (Scopus)

Abstract

Microplastics (MPs) (size < 5 mm) marine pollution have been investigated and monitored by many researchers and found in many coasts around the world. These toxic chemicals make their way into human diet through food chain when aquatic organisms ingest MPs. Attenuated Total Reflection Fourier transform infrared spectroscopy (ATR-FTIR) is a very effective method to detect MPs. To provide the automatic detecting method for MPs, Numerous studies have proposed Machine Learning (ML) based methods, such as Support Vector Machines, K-Nearest Neighbours, and Random Forests, for identification and classification of MPs through using the ATR-FTIR data. The evaluations of these ML based methods primarily focus on the average scores across all types of MPs. However, the existing FTIR datasets are normally imbalanced. Furthermore, some MPs contain the identical functional group, and some MPs may be fouled or contaminated, which will reduce the quality of FTIR data samples (e.g. lacking of peaks or creating noises). These factors will interfere the ML classification algorithms and cause the algorithms to perform differently while identifying different MPs. Hence, this work proposes an ensemble learning algorithm to exploit the advantage of different ML algorithms based on a systematic evaluation of the existing ML based MP identification approaches. A neural network is employed to fuse the outputs of chosen ML algorithms to improve the overall metrics. The evaluation results show that the proposed algorithm outperforms existing single ML based approaches.

Original languageEnglish
Article number108130
JournalJournal of Environmental Chemical Engineering
Volume10
Issue number4
DOIs
Publication statusPublished - Aug 2022

Keywords

  • ANN Artificial Neural Network
  • ATR Attenuated Total Reflection
  • Abbreviations MP Microplastic
  • EPR Ethylene propylene rubber
  • FTIR Fourier transform infrared spectroscopy
  • KNN K-Nearest Neighbours
  • LDA Linear Discriminant Analysis
  • ML Machine Learning
  • MLP Multilayer Perceptron
  • PCA Principal Component Analysis
  • PLSDA Partial Least Squares Discriminant Analysis
  • RF Random Forests
  • SIMCA Soft Independent Modelling of Class Analogies
  • SVM Support Vector Machines

Fingerprint

Dive into the research topics of 'An ensemble machine learning method for microplastics identification with FTIR spectrum'. Together they form a unique fingerprint.

Cite this