Hao Tang

Affiliations:
  • Hewlett-Packard Labs., Multimedia Interaction & Understanding Laboratory, Palo Alto, CA, USA
  • University of Illinois at Urbana-Champaign, Urbana, IL, USA (PhD 2010)
  • Rutgers University, USA (former)
  • University of Science and Technology of China, China (former)


According to our database1, Hao Tang authored at least 39 papers between 2006 and 2013.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2013
Content-aware automatic cropping for consumer photos.
Proceedings of the Imaging and Printing in a Web 2.0 World IV, 2013

2012
Partially Supervised Speaker Clustering.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

TouchPaper: making print interactive.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Ontological Inference Framework with Joint Ontology Construction and Learning for Image Understanding.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

AH-SIFT: Augmented Histogram based SIFT descriptor.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Object-aware saliency detection for consumer images.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

2011
Expression recognition from 3D dynamic faces using robust spatio-temporal shape features.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

2010
Human-Centered Face Computing in Multimedia Interaction and Communication.
Proceedings of the Intelligent Multimedia Communication: Techniques and Applications, 2010

One-vector representations of stochastic signals for pattern recognition
PhD thesis, 2010

A Novel Vector Representation of Stochastic Signals Based on Adapted Ergodic HMMs.
IEEE Signal Process. Lett., 2010

Novel Gaussianized vector representation for improved natural scene categorization.
Pattern Recognit. Lett., 2010

Non-frontal view facial expression recognition based on ergodic hidden Markov model supervectors.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Robust license plate detection using image saliency.
Proceedings of the International Conference on Image Processing, 2010

Accurate and efficient reconstruction of 3D faces from stereo images.
Proceedings of the International Conference on Image Processing, 2010

Toward robust learning of the Gaussian mixture state emission densities for hidden Markov models.
Proceedings of the IEEE International Conference on Acoustics, 2010

Emotion Recognition from Arbitrary View Facial Images.
Proceedings of the Computer Vision - ECCV 2010, 2010

2009
Sensitive Talking Heads [Applications Corner].
IEEE Signal Process. Mag., 2009

Spherical Discriminant Analysis in Semi-supervised Speaker Clustering.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Emotion recognition from speech VIA boosted Gaussian mixture models.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Locality preserving speaker clustering.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

A novel approach to expression recognition from non-frontal face images.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Generative model-based speaker clustering via mixture of von Mises-Fisher distributions.
Proceedings of the IEEE International Conference on Acoustics, 2009

Fishervoice and semi-supervised speaker clustering.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Humanoid Audio-Visual Avatar With Emotive Text-to-Speech Synthesis.
IEEE Trans. Multim., 2008

EAVA: A 3D Emotive Audio-Visual Avatar.
Proceedings of the 9th IEEE Workshop on Applications of Computer Vision (WACV 2008), 2008

Two-stage prosody prediction for emotional text-to-speech synthesis.
Proceedings of the INTERSPEECH 2008, 2008

A novel Gaussianized vector representation for natural scene categorization.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Boosting Gaussian mixture models via discriminant analysis.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Comparison of Algorithms for Speaker Identification under Adverse Far-Field Recording Conditions with Extremely Short Utterances.
Proceedings of the IEEE International Conference on Networking, Sensing and Control, 2008

Real-time conversion from a single 2D face image to a 3D text-driven emotive audio-visual avatar.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Face hallucination VIA sparse coding.
Proceedings of the International Conference on Image Processing, 2008

MPEG4 performance-driven avatar via robust facial motion tracking.
Proceedings of the International Conference on Image Processing, 2008

Camera and microphone array for 3D audiovisual face data collection.
Proceedings of the IEEE International Conference on Acoustics, 2008

3D facial expression recognition based on properties of line segments connecting facial feature points.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

3D facial expression recognition based on automatically selected features.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2008

2007
HMM-Based Acoustic Event Detection with AdaBoost Feature Selection.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

2006
A spectral clustering approach to speaker diarization.
Proceedings of the INTERSPEECH 2006, 2006

Improved Graphical Model for Audiovisual Object Tracking.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Person Identification Based on Multichannel and Multimodality Fusion.
Proceedings of the Multimodal Technologies for Perception of Humans, 2006


  Loading...