Estimation of glottal closure instants in voiced speech using the DYPSA algorithm

Patrick A. Naylor, Anastasis Kounoudes, Jon Gudnason, Mike Brookes

Research output: Contribution to journalArticlepeer-review

260 Citations (Scopus)

Abstract

We present the Dynamic Programming Projected Phase-Slope Algorithm (DYPSA) for automatic estimation of glottal closure instants (GCIs) in voiced speech. Accurate estimation of GCIs is an important tool that can be applied to a wide range of speech processing tasks including speech analysis, synthesis and coding. DYPSA is automatic and operates using the speech signal alone without the need for an EGG signal. The algorithm employs the phase-slope function and a novel phase-slope projection technique for estimating GCI candidates from the speech signal. The most likely candidates are then selected using a dynamic programming technique to minimize a cost function that we define. We review and evaluate three existing methods of GCI estimation and compare the new DYPSA algorithm to them. Results are presented for the APLAWD and SAM databases for which 95.7% and 93.1% of GCIs are correctly identified.

Original languageEnglish
Article number4032783
Pages (from-to)34-43
Number of pages10
JournalIEEE Transactions on Audio, Speech and Language Processing
Volume15
Issue number1
DOIs
Publication statusPublished - Jan 2007

Bibliographical note

Funding Information:
Manuscript received September 24, 2004; revised January 27, 2006. This work was supported by the Engineering and Physical Sciences Research Council, U.K., under Grant GR/N01569. The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Rainer Martin..

Other keywords

  • Closed-phase
  • Glottal closure
  • Speech processing
  • Speech segmentation

Fingerprint

Dive into the research topics of 'Estimation of glottal closure instants in voiced speech using the DYPSA algorithm'. Together they form a unique fingerprint.

Cite this