Reducing bandwidth for robust distributed speech recognition in conditions of packet loss

Ronan Flynn, Edward Jones

Research output: Contribution to journalArticlepeer-review

7 Citations (Scopus)

Abstract

This paper proposes a method to reduce the bandwidth requirements for a distributed speech recognition (DSR) system, with minimal impact on recognition performance. Bandwidth reduction is achieved by applying a wavelet decomposition to feature vectors extracted from speech using an auditory-based front-end. The resulting vectors undergo vector quantisation and are then combined in pairs for transmission over a statistically modeled channel that is subject to packet burst loss. Recognition performance is evaluated in the presence of both background noise and packet loss. When there is no packet loss, results show that the proposed method can reduce the bandwidth required to 50% of the bandwidth required for the system in which the proposed method is not used, without compromising recognition performance. The bandwidth can be further reduced to 25% of the baseline for a slight decrease in recognition performance. Furthermore, in the presence of packet loss, the proposed method for bandwidth reduction, when combined with a suitable redundancy scheme, gives a 29% reduction in bandwidth, when compared to the recognition performance of an established packet loss mitigation technique.

Original languageEnglish
Pages (from-to)881-892
Number of pages12
JournalSpeech Communication
Volume54
Issue number7
DOIs
Publication statusPublished - Sep 2012

Keywords

  • Auditory front-end
  • Bandwidth reduction
  • Packet loss
  • Robust distributed speech recognition
  • Wavelet

Fingerprint

Dive into the research topics of 'Reducing bandwidth for robust distributed speech recognition in conditions of packet loss'. Together they form a unique fingerprint.

Cite this