Show simple item record

dc.contributor.authorSchaeffler, Felixen
dc.contributor.authorEichner, Matthiasen
dc.contributor.authorBeck, Janet M.en
dc.date.accessioned2019-12-05T11:34:01Z
dc.date.available2019-12-05T11:34:01Z
dc.date.issued2019-03-09
dc.identifier.citationSchaeffler, F., Eichner, M. & Beck, J. (2019) Towards ordinal classification of voice quality features with acoustic parameters. In: Proceedings of The Conference on Electronic Speech Signal Processing, TU Dresden, 6-8 March 2019. ESSV, pp. 288-295.en
dc.identifier.urihttps://eresearch.qmu.ac.uk/handle/20.500.12289/10224
dc.identifier.urihttp://www.essv.de/pdf/2019_288_295.pdf
dc.descriptionFelix Schaeffler - ORCID 0000-0002-2764-7635 https://orcid.org/0000-0002-2764-7635en
dc.descriptionItem not available in this repository.
dc.description.abstractThe human voice is capable of fine-grained variation that results in listener attributions of various psychological, social and biological factors. The complexity of this process is reflected in the number and richness of terms that are used to describe human voices. In this paper we argue that any application that attempts a mapping of the acoustic voice signal onto voice descriptor labels would benefit from an intermediate auditory-phonetic level. As a point of departure we explore the relationships between acoustic parameters and some specific perceptual features derived from Vocal Profile Analysis (VPA), a phonetically motivated voice quality analysis scheme. Perceptual analysis of voice samples from 133 speakers was carried out using VPA for three key phonation features (creakiness, whisperiness, harshness). We extracted eleven acoustic parameters from the samples and used stepwise linear regression to identify acoustic parameters with predictive value. Samples from female speakers were used to derive regression equations which were then used to predict VPA ratings of male voices. Results show significant predictors for all three phonation features and indicate that predictions for the three phonation types rely mainly on different parameters. If a tolerance of ± 1 scalar degree for the perceptual analysis is accepted, then classification accuracy lies at or above 90% for all three phonation features.en
dc.description.urihttp://www.essv.de/pdf/2019_288_295.pdfen
dc.description.urihttp://www.essv.de/archive.php?year=2019en
dc.description.urihttp://www.essv.de/essv2019/en
dc.format.extent288-295en
dc.language.isoenen
dc.publisherESSVen
dc.relation.ispartofProceedings of The Conference on Electronic Speech Signal Processing, TU Dresden, 6-8 March 2019en
dc.titleTowards ordinal classification of voice quality features with acoustic parametersen
dc.typeArticleen
dcterms.accessRightsnone
dc.description.ispublishedpub
rioxxterms.typeConference Paper/Proceeding/Abstracten
rioxxterms.publicationdate2019-03-09
refterms.dateFCD2019-12-05
refterms.depositExceptionNAen
refterms.accessExceptionNAen
refterms.technicalExceptionNAen
refterms.panelUnspecifieden
qmu.authorSchaeffler, Felixen
qmu.authorBeck, Janet M.en
qmu.centreCASLen
dc.description.statuspub
refterms.versionNAen
refterms.dateDeposit2019-12-05


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record