Optimization of an image-based talking head system

Show simple item record

dc.identifier.uri http://dx.doi.org/10.15488/1743
dc.identifier.uri http://www.repo.uni-hannover.de/handle/123456789/1768
dc.contributor.author Liu, Kang
dc.contributor.author Ostermann, Jörn
dc.date.accessioned 2017-07-19T07:41:50Z
dc.date.available 2017-07-19T07:41:50Z
dc.date.issued 2009
dc.identifier.citation Liu, K.; Ostermann, J.: Optimization of an image-based talking head system. In: Eurasip Journal on Audio, Speech, and Music Processing 2009 (2009), 174192. DOI: https://doi.org/10.1155/2009/174192
dc.description.abstract This paper presents an image-based talking head system, which includes two parts: analysis and synthesis. The audiovisual analysis part creates a face model of a recorded human subject, which is composed of a personalized 3D mask as well as a large database of mouth images and their related information. The synthesis part generates natural looking facial animations from phonetic transcripts of text. A critical issue of the synthesis is the unit selection which selects and concatenates these appropriate mouth images from the database such that they match the spoken words of the talking head. Selection is based on lip synchronization and the similarity of consecutive images. The unit selection is refined in this paper, and Pareto optimization is used to train the unit selection. Experimental results of subjective tests show that most people cannot distinguish our facial animations from real videos. Copyright © 2009 K. Liu and J. Ostermann. eng
dc.description.sponsorship EC/FP6/511568
dc.language.iso eng
dc.publisher Heidelberg : Springer Verlag
dc.relation.ispartofseries Eurasip Journal on Audio, Speech, and Music Processing 2009 (2009)
dc.rights CC BY 4.0
dc.rights.uri https://creativecommons.org/licenses/by/4.0/
dc.subject Image-Based Talking Head eng
dc.subject Pareto Optimization eng
dc.subject.ddc 621,3 | Elektrotechnik, Elektronik ger
dc.title Optimization of an image-based talking head system
dc.type article
dc.type Text
dc.relation.issn 1687-4714
dc.relation.doi https://doi.org/10.1155/2009/174192
dc.bibliographicCitation.volume 2009
dc.bibliographicCitation.firstPage 174192
dc.description.version publishedVersion
tib.accessRights frei zug�nglich


Files in this item

The following license files are associated with this item:

This item appears in the following Collection(s):

Show simple item record

 

Search the repository


Browse

My Account

Usage Statistics