Speech and prosodic processing for assistive technology

Lalita Narupiyakul; Vlado Keselj; Nick Cercone; Booncharoen Sirinaovakul

dc.contributor.author	Lalita Narupiyakul	en_US
dc.contributor.author	Vlado Keselj	en_US
dc.contributor.author	Nick Cercone	en_US
dc.contributor.author	Booncharoen Sirinaovakul	en_US
dc.contributor.other	Mahidol University	en_US
dc.contributor.other	Dalhousie University	en_US
dc.contributor.other	York University	en_US
dc.contributor.other	King Mongkut's University of Technology Thonburi	en_US
dc.date.accessioned	2018-10-19T04:50:48Z
dc.date.available	2018-10-19T04:50:48Z
dc.date.issued	2013-12-01	en_US
dc.description.abstract	A speaker's utterance may convey different meanings to a hearer than what the speaker intended. Such ambiguities can be resolved by emphasizing accents at different positions. In human communication, the utterances are emphasized at a focus part to distinguish the important content and reduce ambiguity in the utterance. In our Focus-to-Emphasize Tone (FET) system, we determine how the speaker's utterances are influenced by focus and speaker's intention. The relationships of focus information, speaker's intention and prosodic phenomena are investigated to recognize the intonation patterns and annotate the sentence with prosodic marks. We propose using the Focus-to-Emphasize Tone (FET) analysis, which includes: (i) generating the constraints for foci, speaker's intention and prosodic features, (ii) defining the intonation patterns, and (iii) labelling a set of prosodic marks for a sentence. We also design the FET structure to support our analysis and to contain focus, speaker's intention and prosodic components. An implementation of the system is described and the evaluation results on the CMU Communicator (CMU-COM) dataset are presented. © 2013 The authors and IOS Press. All rights reserved.	en_US
dc.identifier.citation	Frontiers in Artificial Intelligence and Applications. Vol.253, (2013), 36-48	en_US
dc.identifier.doi	10.3233/978-1-61499-258-5-36	en_US
dc.identifier.issn	09226389	en_US
dc.identifier.other	2-s2.0-84894597328	en_US
dc.identifier.uri	https://repository.li.mahidol.ac.th/handle/123456789/31604
dc.rights	Mahidol University	en_US
dc.rights.holder	SCOPUS	en_US
dc.source.uri	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84894597328&origin=inward	en_US
dc.subject	Computer Science	en_US
dc.title	Speech and prosodic processing for assistive technology	en_US
dc.type	Article	en_US
dspace.entity.type	Publication
mu.datasource.scopus	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84894597328&origin=inward	en_US

Collections

Scopus 2011-2015
