Kshirsagar et al., 2001 - Google Patents

Personalized face and speech communication over the internet

Kshirsagar et al., 2001

Document ID: 11952866989911425199
Author: Kshirsagar S; Joslin C; Lee W; Magnenat-Thalmann N
Publication year: 2001
Publication venue: Proceedings IEEE Virtual Reality 2001

External Links

Cited by

Snippet

We present our system for personalized face and speech communication over the Internet. The overall system consists of three parts: the cloning of real human faces to use as the representative avatars; the Networked Virtual Environment System performing the basic task …

Continue reading at www.academia.edu (PDF) (other versions)

238000004891 communication 0 title abstract description 25

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—3D [Three Dimensional] animation
- G06T13/40—3D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
- G10L2021/105—Synthesis of the lips movements from speech, e.g. for talking heads
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—3D [Three Dimensional] animation
- G06T13/205—3D [Three Dimensional] animation driven by audio data
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/10—Geometric effects
- G06T15/20—Perspective computation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/50—Lighting effects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T19/00—Manipulating 3D models or images for computer graphics
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/011—Arrangements for interaction with the human body, e.g. for user immersion in virtual reality

Similar Documents

Publication	Publication Date	Title
CN115004236B (en)	2025-05-13	Photorealistic talking faces from audio
Hong et al.	2002	Real-time speech-driven face animation with expressions using neural networks
Doenges et al.	1997	MPEG-4: Audio/video and synthetic graphics/audio for mixed media
Lee et al.	1999	MPEG-4 compatible faces from orthogonal photos
WO2021248473A1 (en)	2021-12-16	Personalized speech-to-video with three-dimensional (3d) skeleton regularization and expressive body poses
Brand	1999	Voice puppetry
US7027054B1 (en)	2006-04-11	Do-it-yourself photo realistic talking head creation system and method
US6919892B1 (en)	2005-07-19	Photo realistic talking head creation system and method
Gutierrez-Osuna et al.	2005	Speech-driven facial animation with realistic dynamics
Thalmann et al.	2002	Face to virtual face
Cosatto et al.	2003	Lifelike talking faces for interactive services
Escher et al.	1997	Automatic 3D cloning and real-time animation of a human face
King et al.	2005	Creating speech-synchronized animation
Ostermann et al.	2004	Talking faces-technologies and applications
CN119343703A (en)	2025-01-21	Create images, meshes, and speech animations from mouth shape data
Gachery et al.	2001	Designing MPEG-4 facial animation tables for web applications
King	2001	A facial model and animation techniques for animated speech
Kshirsagar et al.	2001	Personalized face and speech communication over the internet
CN119729145A (en)	2025-03-28	Digital human video generation method and device, electronic equipment and storage medium
Tang et al.	2008	Real-time conversion from a single 2D face image to a 3D text-driven emotive audio-visual avatar
Ekmen et al.	2019	From 2D to 3D real-time expression transfer for facial animation
Perng et al.	1998	Image talk: a real time synthetic talking head using one single image with chinese text-to-speech capability
Schreer et al.	2008	Real-time vision and speech driven avatars for multimedia applications
Du et al.	2002	Realistic mouth synthesis based on shape appearance dependence mapping
Thalmann	2000	The virtual human as a multimodal interface