Small and large vocabulary speech recognition of MP3 data under real-word conditions: Experimental study

Petr Pollak, Michal Borsky

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review

2 Citations (Scopus)

Abstract

This paper presents a study of speech recognition accuracy for both small- and large-vocabulary tasks with respect to different levels of MP3 compression of the processed data. The motivation behind the work was to evaluate the use of an ASR system for off-line automatic transcription of recordings collected from standard present-day MP3 devices under different levels of background noise and channel distortion. Although MP3 may not be an optimal compression algorithm, the experiments performed have proved that it does not distort the speech signal significantly for higher compression rates. The experiments also showed that the accuracy of speech recognition (both small- and large-vocabulary) decreased very slowly for bit-rates of 24 kbps and higher. However, a slightly different setup of speech feature computation is necessary for MP3 speech data; in particular, PLP features give significantly better results than MFCC.

Original language: English
Title of host publication: E-Business and Telecommunications
Subtitle of host publication: International Joint Conference, ICETE 2011 Seville, Spain, July 18-21, 2011 Revised Selected Papers
Editors: Jose L. Sevillano, Joaquim Filipe
Pages: 409-419
Number of pages: 11
Publication status: Published - 2012
Event: 8th International Joint Conference on e-Business and Telecommunications, ICETE 2011 - Seville, Spain
Duration: 18 Jul 2011 - 21 Jul 2011

Publication series

Name: Communications in Computer and Information Science
Volume: 314
ISSN (Print): 1865-0929

Conference

Conference: 8th International Joint Conference on e-Business and Telecommunications, ICETE 2011
Country/Territory: Spain
City: Seville
Period: 18/07/11 - 21/07/11

Other keywords

  • Large vocabulary
  • LVCSR
  • MP3
  • MPEG compression
  • Noise robustness
  • Small vocabulary
  • Speech recognition
