Disfluency detection and reconstruction of spontaneous speech transcripts

HKUST Electronic Theses

Disfluency detection and reconstruction of spontaneous speech transcripts

by Linlin Wang

THESIS 2016

M.Phil. Electronic and Computer Engineering

xii, 63 pages : illustrations (some color) ; 30 cm

Abstract

Disfluencies in spoken language remain a challenge to both language processing applications and human perception. Disfluency identification and removal are therefore beneficial steps to improve performance of spoken language understanding tasks. In particular, it is important for close captioning video conferences.

We investigated different approaches for disfluency identification and removal, ranging from rule-based, translation models and supervised classification using either Conditional Random Fields (CRF) or Deep Neural Networks (DNN).

As supervised classifier in our task requires huge amount of human annotation and labeling, the rule-based approach and translation model allow us to use less human labeling than supervised classification.

In the rule-based approach, we used...[ Read more ]

View Copyrighted to the author. Reproduction is prohibited without the author’s prior written consent.

Details

Collection HKUST Electronic Theses Degree M.Phil. Department Electronic and Computer Engineering Authors Wang, Linlin Subjects Automatic speech recognition Natural language processing (Computer science) Additional titles Title on signature page: Disfluencies detection and reconstruction of spontaneous speech transcripts Language English Call number Thesis ECED 2016 WangL DOI 10.14711/thesis-b1626114

Full record

Disfluency detection and reconstruction of spontaneous speech transcripts

by Linlin Wang

Post a Comment Cancel reply