TY - GEN
T1 - Multi-Resolution Pre-Processing for Pattern Recognition in Images and Audio Signals
AU - Mansor, Noha
AU - Flynn, Ronan
AU - Daly, Mark
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/6/10
Y1 - 2021/6/10
N2 - With the rapid growth of technology and the proliferation of data in this digital age, current image and audio applications require greater resolution, higher data transmission rates and better data compression techniques to meet the ever increasing demands placed on them. The research presented here investigates the impact of data compression in the automatic recognition of handwritten digit images and spoken digit audio. A Haar wavelet transform (HWT) is used to compress the original image and audio data, which is input to an artificial neural network (ANN) where the automatic digit recognition is performed. The HWT generates a signature, or fingerprint, for the data by removing redundant data using a cut-off function, a number of which are investigated. This reduced data signature enables the ANN-based recogniser to be simplified and computationally more efficient. Experimental results show that for handwritten digit images, the recognition accuracy is 94.3% with compression ratios of 80%; for spoken audio digits, the recognition accuracy is 98.8% with compression ratios of 82%.
AB - With the rapid growth of technology and the proliferation of data in this digital age, current image and audio applications require greater resolution, higher data transmission rates and better data compression techniques to meet the ever increasing demands placed on them. The research presented here investigates the impact of data compression in the automatic recognition of handwritten digit images and spoken digit audio. A Haar wavelet transform (HWT) is used to compress the original image and audio data, which is input to an artificial neural network (ANN) where the automatic digit recognition is performed. The HWT generates a signature, or fingerprint, for the data by removing redundant data using a cut-off function, a number of which are investigated. This reduced data signature enables the ANN-based recogniser to be simplified and computationally more efficient. Experimental results show that for handwritten digit images, the recognition accuracy is 94.3% with compression ratios of 80%; for spoken audio digits, the recognition accuracy is 98.8% with compression ratios of 82%.
KW - Haar wavelet transform
KW - artificial neural network
KW - audio
KW - compression
KW - image
UR - http://www.scopus.com/inward/record.url?scp=85114442221&partnerID=8YFLogxK
U2 - 10.1109/ISSC52156.2021.9467861
DO - 10.1109/ISSC52156.2021.9467861
M3 - Conference contribution
AN - SCOPUS:85114442221
T3 - 2021 32nd Irish Signals and Systems Conference, ISSC 2021
BT - 2021 32nd Irish Signals and Systems Conference, ISSC 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 32nd Irish Signals and Systems Conference, ISSC 2021
Y2 - 10 June 2021 through 11 June 2021
ER -