Publication:
Speech and prosodic processing for assistive technology

dc.contributor.authorLalita Narupiyakulen_US
dc.contributor.authorVlado Keseljen_US
dc.contributor.authorNick Cerconeen_US
dc.contributor.authorBooncharoen Sirinaovakulen_US
dc.contributor.otherMahidol Universityen_US
dc.contributor.otherDalhousie Universityen_US
dc.contributor.otherYork Universityen_US
dc.contributor.otherKing Mongkuts University of Technology Thonburien_US
dc.date.accessioned2018-10-19T04:50:48Z
dc.date.available2018-10-19T04:50:48Z
dc.date.issued2013-12-01en_US
dc.description.abstractA speaker's utterance may convey different meanings to a hearer than what the speaker intended. Such ambiguities can be resolved by emphasizing accents at different positions. In human communication, the utterances are emphasized at a focus part to distinguish the important content and reduce ambiguity in the utterance. In our Focus-to-Emphasize Tone (FET) system, we determine how the speaker's utterances are influenced by focus and speaker's intention. The relationships of focus information, speaker's intention and prosodic phenomena are investigated to recognize the intonation patterns and annotate the sentence with prosodic marks. We propose using the Focus to Emphasize Tone (FET) analysis, which includes: (i) generating the constraints for foci, speaker's intention and prosodic features, (ii) defining the intonation patterns, and (iii) labelling a set of prosodic marks for a sentence. We also design the FET structure to support our analysis and to contain focus, speaker's intention and prosodic components. An implementation of the system is described and the evaluation results on the CMU Communicator (CMU-COM) dataset are presented. © 2013 The authors and IOS Press. All rights reserved.en_US
dc.identifier.citationFrontiers in Artificial Intelligence and Applications. Vol.253, (2013), 36-48en_US
dc.identifier.doi10.3233/978-1-61499-258-5-36en_US
dc.identifier.issn09226389en_US
dc.identifier.other2-s2.0-84894597328en_US
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/31604
dc.rightsMahidol Universityen_US
dc.rights.holderSCOPUSen_US
dc.source.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84894597328&origin=inwarden_US
dc.subjectComputer Scienceen_US
dc.titleSpeech and prosodic processing for assistive technologyen_US
dc.typeArticleen_US
dspace.entity.typePublication
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84894597328&origin=inwarden_US

Files

Collections