Cha Zhang

Diana Marculescu

CoRR, 2020

Multimodal Active Speaker Detection and Virtual Cinematography for Video Conferencing.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Towards Efficient Model Compression via Learned Global Ranking.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

LeGR: Filter Pruning via Learned Global Ranking.

CoRR, 2019

RePr: Improved Training of Convolutional Filters.

Aaditya Prakash

James A. Storer

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Layer-compensated Pruning for Resource-constrained Convolutional Neural Networks.

Ting-Wu Chin

Diana Marculescu

CoRR, 2018

2017

Orthogonal and Idempotent Transformations for Learning Deep Neural Networks.

CoRR, 2017

Deep Learning for Intelligent Video Analysis.

Tao Mei

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Automatic speech emotion recognition using recurrent neural networks with local attention.

Seyedmahdad Mirsamadi

Emad Barsoum

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Addressing bias in machine learning algorithms: A pilot study on emotion recognition for intelligent systems.

Ayanna M. Howard

Eric Horvitz

Proceedings of the 2017 IEEE Workshop on Advanced Robotics and its Social Impacts, 2017

2016

Image Bit-Depth Enhancement via Maximum A Posteriori Estimation of AC Signal.

IEEE Trans. Image Process., 2016

Training deep networks for facial expression recognition with crowd-sourced label distribution.

Emad Barsoum

Cristian Canton-Ferrer

Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

Emotion recognition in the wild from videos using images.

Sarah Adel Bargal

Emad Barsoum

Cristian Canton-Ferrer

Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

2015

Precision Enhancement of 3-D Surfaces from Compressed Multiview Depth Maps.

IEEE Signal Process. Lett., 2015

A survey on face detection in the wild: Past, present and future.

Stefanos Zafeiriou

Comput. Vis. Image Underst., 2015

Image based Static Facial Expression Recognition with Multiple Deep Network Learning.

Zhiding Yu

Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

2014

Rate-Constrained 3D Surface Estimation From Noise-Corrupted Multiview Depth Videos.

Wenxiu Sun

IEEE Trans. Image Process., 2014

A robust optical/inertial data fusion system for motion tracking of the robot manipulator.

J. Zhejiang Univ. Sci. C, 2014

Iterative transductive learning for automatic image segmentation and matting with RGB-D data.

Bei He

Guijin Wang

J. Vis. Commun. Image Represent., 2014

Precision Enhancement of 3D Surfaces from Multiple Compressed Depth Maps.

CoRR, 2014

Improving multiview face detection with multi-task deep convolutional neural networks.

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Immersive 3D Communication.

Wanmin Wu

Bernardino Romera-Paredes

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Video face beautification.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Facial expression tracking from head-mounted, partially observing cameras.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Point cloud attribute compression with graph transform.

Charles T. Loop

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Image bit-depth enhancement via maximum-a-posteriori estimation of graph AC component.

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013

Analyzing the Optimality of Predictive Transform Coding Using Graph-Based Models.

IEEE Signal Process. Lett., 2013

Viewport: A Distributed, Immersive Teleconferencing System with Infrared Dot Pattern.

Ricardo Martin-Brualla

IEEE Multim., 2013

3D Imaging Techniques and Multimedia Applications [Guest editor's introduction].

IEEE Multim., 2013

Precision enhancement of 3D surfaces from multiple quantized depth maps.

Proceedings of the 11th IVMSP Workshop: 3D Image/Video Technologies and Applications, 2013

Rate-distortion optimized 3D reconstruction from noise-corrupted multiview depth videos.

Wenxiu Sun

Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Robust part-based face matching with multiple templates.

Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

Real-Time High-Resolution Sparse Voxelization with Application to Image-Based Modeling.

Charles T. Loop

Proceedings of the High-Performance Graphics 2013, 2013

Video Enhancement of People Wearing Polarized Glasses: Darkening Reversal and Reflection Reduction.

Mao Ye

Ruigang Yang

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Wide-Baseline Hair Capture Using Strand-Based Refinement.

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012

Geometrically Constrained Room Modeling With Compact Microphone Arrays.

IEEE Trans. Speech Audio Process., 2012

Automatic Real-Time Video Matting Using Time-of-Flight Camera and Multichannel Poisson Equations.

Int. J. Comput. Vis., 2012

Virtual View Reconstruction Using Temporal Information.

Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

See-through Image Enhancement through Sensor Fusion.

Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

3D scene reconstruction by multiple structured-light based commodity depth cameras.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

An Interactive 3-D Audio System With Loudspeakers.

Myung-Suk Song

Hong-Goo Kang

IEEE Trans. Multim., 2011

Improving Immersive Experiences in Telecommunication with Motion Parallax [Applications Corner].

IEEE Signal Process. Mag., 2011

Low-complexity, near-lossless coding of depth maps from kinect-like depth cameras.

Proceedings of the IEEE 13th International Workshop on Multimedia Signal Processing (MMSP 2011), 2011

Calibration between depth and color sensors for commodity depth cameras.

Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

A novel see-through screen based on weave fabrics.

Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

CROWDMOS: An approach for crowdsourcing mean opinion score studies.

Michael L. Seltzer

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Boosting-Based Face Detection and Adaptation

Synthesis Lectures on Computer Vision, Morgan & Claypool Publishers, ISBN: 978-3-031-01809-1, 2010

Using Reverberation to Improve Range and Elevation Discrimination for Small Array Sound Source Localization.

IEEE Trans. Speech Audio Process., 2010

Joint tracking and multiview video compression.

Dinei Florêncio

Proceedings of the Visual Communications and Image Processing 2010, 2010

Enhancing loudspeaker-based 3D audio with room modeling.

Myung-Suk Song

Hong-Goo Kang

Proceedings of the 2010 IEEE International Workshop on Multimedia Signal Processing, 2010

Personal 3D audio system with loudspeakers.

Myung-Suk Song

Hong-Goo Kang

Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Turning enemies into friends: Using reflections to improve sound source localization.

Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

L1 regularized room modeling with compact microphone arrays.

Proceedings of the IEEE International Conference on Acoustics, 2010

3D Deformable Face Tracking with a Commodity Depth Camera.

Proceedings of the Computer Vision, 2010

2009

Improving depth perception with motion parallax and its application in teleconferencing.

Zhaozheng Yin

Proceedings of the 2009 IEEE International Workshop on Multimedia Signal Processing, 2009

ACM 2009 workshop on ambient media computing (AMC'09) overview.

Abdulmotaleb El Saddik

K. Selçuk Candan

Irene Cheng

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Multiview video compression and streaming based on predicted viewer position.

Proceedings of the IEEE International Conference on Acoustics, 2009

Boosted multi-task learning for face verification with applications to web image and video search.

Xiaogang Wang

Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Efficient Scale-Space Spatiotemporal Saliency Tracking for Distortion-Free Video Retargeting.

Proceedings of the Computer Vision, 2009

2008

An automated end-to-end lecture capture and broadcasting system.

ACM Trans. Multim. Comput. Commun. Appl., 2008

Boosting-Based Multimodal Speaker Detection for Distributed Meeting Videos.

IEEE Trans. Multim., 2008

Maximum Likelihood Sound Source Localization and Beamforming for Directional Microphone Arrays in Distributed Meetings.

IEEE Trans. Multim., 2008

Active Multicamera Networks: From Rendering to Surveillance.

Brian A. Stancil

IEEE J. Sel. Top. Signal Process., 2008

Multimedia Immersive Technologies and Networking.

Athanasios V. Vasilakos

Adv. Multim., 2008

Semantic saliency driven camera control for personal remote collaboration.

Proceedings of the International Workshop on Multimedia Signal Processing, 2008

Requirements and recommendations for an enhanced meeting viewing experience.

Proceedings of the 16th International Conference on Multimedia 2008, 2008

Why does PHAT work well in lownoise, reverberative environments?

Proceedings of the IEEE International Conference on Acoustics, 2008

Taylor expansion based classifier adaptation: Application to person detection.

Raffay Hamid

Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007

Active Rearranged Capturing of Image-Based Rendering Scenes-Theory and Practice.

IEEE Trans. Multim., 2007

Multiview Imaging and 3DTV.

IEEE Signal Process. Mag., 2007

Multiple-Instance Pruning For Learning Efficient Cascade Detectors.

Paul A. Viola

Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Learning-Based Perceptual Image Quality Improvement for Video Conferencing.

Zicheng Liu

Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Enhanced MVDR Beamforming for Arrays of Directional Microphones.

Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Maximum Likelihood Sound Source Localization for Multiple Directional Microphones.

Proceedings of the IEEE International Conference on Acoustics, 2007

2006

Light Field Sampling

Synthesis Lectures on Image, Video, and Multimedia Processing, Morgan & Claypool Publishers, ISBN: 978-3-031-02241-8, 2006

Boosting-Based Multimodal Speaker Detection for Distributed Meetings.

Proceedings of the IEEE 8th Workshop on Multimedia Signal Processing, 2006

Robust Visual Tracking via Pixel Classification and Integration.

Yong Rui

Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

A Three-Layer Virtual Director Model for Supporting Automated Multi-Site Distributed Education.

Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Light Weight Background Blurring for Video Conferencing Applications.

Yong Rui

Li-wei He

Proceedings of the International Conference on Image Processing, 2006

2005

On the compression and streaming of concentric mosaic data for free wandering in a realistic environment over the Internet.

IEEE Trans. Multim., 2005

Multiple Instance Boosting for Object Detection.

Paul A. Viola

John C. Platt

Proceedings of the Advances in Neural Information Processing Systems 18, 2005

An automated end-to-end lecture capturing and broadcasting system.

Proceedings of the 13th ACM International Conference on Multimedia, 2005

Hybrid speaker tracking in an automated lecture room.

Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Light field capturing with lensless cameras.

Proceedings of the 2005 International Conference on Image Processing, 2005

2004

A survey on image-based rendering - representation, sampling and compression.

Signal Process. Image Commun., 2004

A self-reconfigurable camera array.

Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2004

Non-Uniform Sampling for Image-Based Rendering: Convergence of Image, Vision, and Graphic.

Proceedings of the 10th International Multimedia Modeling Conference (MMM 2004), 2004

Distributed hosting of Web content with erasure coding and unequal weight assignment.

Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Semantic propagation from relevance feedbacks.

Hoon Yul Bang

Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Security analysis for key generation systems using face images.

Wende Zhang

Proceedings of the 2004 International Conference on Image Processing, 2004

View-dependent non-uniform sampling for image-based rendering.

Proceedings of the 2004 International Conference on Image Processing, 2004

2003

Spectral analysis for sampling image-based rendering data.

IEEE Trans. Circuits Syst. Video Technol., 2003

Color image sharpening based on collective time-evolution of simultaneous nonlinear reaction-diffusion.

Proceedings of the Visual Communications and Image Processing 2003, 2003

Nonuniform sampling of image-based rendering data with the position-interval-error (PIE) function.

Proceedings of the Visual Communications and Image Processing 2003, 2003

A system for active image-based rendering.

Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Annotating retrieval database with active learning.

Proceedings of the 2003 International Conference on Image Processing, 2003

Surface plenoptic function: a tool for the sampling analysis of image-based rendering.

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

On generalized sampling for image-based rendering data.

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002

An active learning framework for content-based information retrieval.

IEEE Trans. Multim., 2002

Smart rebinning for the compression of concentric mosaic.

Yunnan Wu

IEEE Trans. Multim., 2002

Towards optimal least square filters using the eigenfilter approach.

Proceedings of the IEEE International Conference on Acoustics, 2002

2001

Interactive browsing of 3D environment over the Internet.

Proceedings of the Visual Communications and Image Processing 2001, 2001

Indexing and retrieval of 3D models aided by active learning.

Proceedings of the 9th ACM International Conference on Multimedia 2001, Ottawa, Ontario, Canada, September 30, 2001

Efficient feature extraction for 2D/3D objects in mesh representation.

Proceedings of the 2001 International Conference on Image Processing, 2001

2000

Compression and rendering of concentric mosaics with reference block codec (RBC).

Proceedings of the Visual Communications and Image Processing 2000, 2000

Smart rebinning for compression of concentric mosaics.

Proceedings of the 8th ACM International Conference on Multimedia 2000, Los Angeles, CA, USA, October 30, 2000

Compression of Lumigraph with Multiple Reference Frame (MRF) Prediction and Just-in-Time Rendering.
