Hao Tang

Daniel Tretter

Qian Lin

Proceedings of the Imaging and Printing in a Web 2.0 World IV, 2013

2012

Partially Supervised Speaker Clustering.

IEEE Trans. Pattern Anal. Mach. Intell., 2012

TouchPaper: making print interactive.

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Ontological Inference Framework with Joint Ontology Construction and Learning for Image Understanding.

Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

AH-SIFT: Augmented Histogram based SIFT descriptor.

Feng Tang

Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Object-aware saliency detection for consumer images.

Proceedings of the 19th IEEE International Conference on Image Processing, 2012

2011

Expression recognition from 3D dynamic faces using robust spatio-temporal shape features.

Vuong Le

Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

2010

Human-Centered Face Computing in Multimedia Interaction and Communication.

Proceedings of the Intelligent Multimedia Communication: Techniques and Applications, 2010

One-vector representations of stochastic signals for pattern recognition.

PhD thesis, 2010

A Novel Vector Representation of Stochastic Signals Based on Adapted Ergodic HMMs.

IEEE Signal Process. Lett., 2010

Novel Gaussianized vector representation for improved natural scene categorization.

Xi Zhou

Xiaodan Zhuang

Pattern Recognit. Lett., 2010

Non-frontal view facial expression recognition based on ergodic hidden Markov model supervectors.

Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Robust license plate detection using image saliency.

Kai-Hsiang Lin

Proceedings of the International Conference on Image Processing, 2010

Accurate and efficient reconstruction of 3D faces from stereo images.

Proceedings of the International Conference on Image Processing, 2010

Toward robust learning of the Gaussian mixture state emission densities for hidden Markov models.

Proceedings of the IEEE International Conference on Acoustics, 2010

Emotion Recognition from Arbitrary View Facial Images.

Proceedings of the Computer Vision - ECCV 2010, 2010

2009

Sensitive Talking Heads [Applications Corner].

Mark A. Hasegawa-Johnson

Zhihong Zeng

IEEE Signal Process. Mag., 2009

Spherical Discriminant Analysis in Semi-supervised Speaker Clustering.

Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Emotion recognition from speech via boosted Gaussian mixture models.

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Locality preserving speaker clustering.

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

A novel approach to expression recognition from non-frontal face images.

Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Generative model-based speaker clustering via mixture of von Mises-Fisher distributions.

Proceedings of the IEEE International Conference on Acoustics, 2009

Fishervoice and semi-supervised speaker clustering.

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Humanoid Audio-Visual Avatar With Emotive Text-to-Speech Synthesis.

Yun Fu

Jilin Tu

IEEE Trans. Multim., 2008

EAVA: A 3D Emotive Audio-Visual Avatar.

Proceedings of the 9th IEEE Workshop on Applications of Computer Vision (WACV 2008), 2008

Two-stage prosody prediction for emotional text-to-speech synthesis.

Xi Zhou

Matthias Odisio

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

A novel Gaussianized vector representation for natural scene categorization.

Xi Zhou

Xiaodan Zhuang

Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Boosting Gaussian mixture models via discriminant analysis.

Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Comparison of Algorithms for Speaker Identification under Adverse Far-Field Recording Conditions with Extremely Short Utterances.

Zhixiong Chen

Proceedings of the IEEE International Conference on Networking, Sensing and Control, 2008

Real-time conversion from a single 2D face image to a 3D text-driven emotive audio-visual avatar.

Yuxiao Hu

Yun Fu

Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Face hallucination via sparse coding.

Proceedings of the International Conference on Image Processing, 2008

MPEG4 performance-driven avatar via robust facial motion tracking.

Proceedings of the International Conference on Image Processing, 2008

Camera and microphone array for 3D audiovisual face data collection.

Yuxiao Hu

Proceedings of the IEEE International Conference on Acoustics, 2008

3D facial expression recognition based on properties of line segments connecting facial feature points.

Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

3D facial expression recognition based on automatically selected features.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2008

2007

HMM-Based Acoustic Event Detection with AdaBoost Feature Selection.

Proceedings of the Multimodal Technologies for Perception of Humans, 2007

2006

A spectral clustering approach to speaker diarization.

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Improved Graphical Model for Audiovisual Object Tracking.
