sh.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Classification of affect in speech using normalized time-frequency cepstra
Stockholms universitet, Psykologiska institutionen.
2010 (English)In: Speech Prosody 2010 Conference Proceedings, 2010, 100071-1-4 p.Conference paper, (Refereed)
Abstract [en]

Subtle temporal and spectral differences between categorical realizations of para-linguistic phenomena (e.g., affective vocal expressions) are hard to capture and describe. In this paper we present a signal representation based on Time Varying Constant-Q Cepstral Coeffcients (TVCQCC) derived for this purpose. A method which utilizes the special properties of the constant Q-transform for mean F0 estimation and normalization is described. The coeffcients are invariant to segment length, and as a special case, a representation for prosody is considered. Speaker independent classifcation results using v-SVM with the Berlin EMO-DB and two closed sets of basic (anger, disgust, fear, happiness, sadness, neutral) and social/interpersonal (affection, pride, shame) emotions recorded by forty professional actors from two English dialect areas are reported. The accuracy for the Berlin EMO-DB is 71.2 %, and the accuracies for the first set including basic emotions was 44.6% and for the second set including basic and social emotions the accuracy was 31.7% . It was found that F0 normalization boosts the performance and a combined feature set shows the best performance.

Place, publisher, year, edition, pages
2010. 100071-1-4 p.
National Category
Psychology
Identifiers
URN: urn:nbn:se:sh:diva-26870ISBN: 978-0-557-51931-6 (print)OAI: oai:DiVA.org:sh-26870DiVA: diva2:802556
Conference
Speech Prosody, 5th International Conference, Chicago Illinois, May 11-14, 2010.
Note

This work was partly funded by the Swedish Research Council under contract 2006-1360.

Available from: 2010-12-01 Created: 2015-04-10 Last updated: 2015-04-13Bibliographically approved

Open Access in DiVA

No full text

Other links

Fulltext [PDF]

Search in DiVA

By author/editor
Laukka, Petri
Psychology

Search outside of DiVA

GoogleGoogle Scholar

Total: 31 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf