Conversion of Speaker's Face Image Using PCA and Animation Unit for Video Chatting

Yuki Saito, Takashi Nose, Takahiro Shinozaki, Akinori Ito

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

Video chat is a good way of personal communication, however, there is a privacy issue in the video chat because we need to disclose one's identity such as face or voice when chatting. In this paper, we propose two methods by which face image of a speaker is converted into that of different person to conceal the speaker's identity. In the first method, we first prepare the speech and video data of the original and target speakers for training the conversion model. The face image features are calculated using the PCA to the whole pixels of the image. In the second method, the animation units extracted by Kinect are used as an intermediate feature, and we train a model that converts the animation unit to the target speaker's face image. In both methods, we used a neural network as the conversion model. We conducted experiments, and the first method could convert the whole shape of the speakers, while small movements such as mouth movement cannot be converted. The second method could convert both the whole shape of the face and mouth movement, however, the quality of face image was deteriorated.

Original languageEnglish
Title of host publicationProceedings - 2015 International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2015
EditorsJeng-Shyang Pan, Ching-Yu Yang, Hsiang-Cheh Huang, Ivan Lee
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages433-436
Number of pages4
ISBN (Electronic)9781509001880
DOIs
Publication statusPublished - 2016 Feb 19
Event11th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2015 - Adelaide, Australia
Duration: 2015 Sept 232015 Sept 25

Publication series

NameProceedings - 2015 International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2015

Conference

Conference11th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2015
Country/TerritoryAustralia
CityAdelaide
Period15/9/2315/9/25

Keywords

  • Face conversion
  • Kinect v2
  • Neural network
  • Principal component analysis
  • Speaker conversion

Fingerprint

Dive into the research topics of 'Conversion of Speaker's Face Image Using PCA and Animation Unit for Video Chatting'. Together they form a unique fingerprint.

Cite this