Yong Rui

Affiliations:
  • Lenovo Group
  • Microsoft Research (former)


According to our database, Yong Rui authored at least 244 papers between 1996 and 2024.

Awards

ACM Fellow

ACM Fellow 2017, "For contributions to image, video and multimedia analysis, understanding and retrieval".

IEEE Fellow

IEEE Fellow 2010, "For contributions to image and video analysis, indexing and retrieval".

Bibliography

2024
Toward Egocentric Compositional Action Anticipation with Adaptive Semantic Debiasing.
ACM Trans. Multim. Comput. Commun. Appl., May, 2024

2023
Graph Attention Transformer Network for Multi-label Image Classification.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Balanced masking strategy for multi-label image classification.
Neurocomputing, 2023

A Survey on Video Moment Localization.
ACM Comput. Surv., 2023

Hybrid Representation Learning via Epistemic Graph.
CoRR, 2023

Learning From Biased Soft Labels.
CoRR, 2023

Learning From Biased Soft Labels.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
msr-vtt.
Dataset, December, 2022

Hierarchical User Intent Graph Network for Multimedia Recommendation.
IEEE Trans. Multim., 2022

Hierarchical Deep Click Feature Prediction for Fine-Grained Image Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Lenovo Schedules Laptop Manufacturing Using Deep Reinforcement Learning.
INFORMS J. Appl. Anal., 2022

Knowledge Mining: A Cross-disciplinary Survey.
Int. J. Autom. Comput., 2022

Self-Supervised Graph Neural Network for Multi-Source Domain Adaptation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Delving Globally into Texture and Structure for Image Inpainting.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

A Noise-robust Locality Transformer for Fine-grained Food Image Retrieval.
Proceedings of the 5th IEEE International Conference on Multimedia Information Processing and Retrieval, 2022

Semi-Supervised 3D Medical Image Segmentation Via Boundary-Aware Consistent Hidden Representation Learning.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

2021
Hierarchical User Intent Graph Network for Multimedia Recommendation.
CoRR, 2021

HoloBoard: a Large-format Immersive Teaching Board based on pseudo HoloGraphics.
Proceedings of the UIST '21: The 34th Annual ACM Symposium on User Interface Software and Technology, 2021

MMPT'21: International Joint Workshop on Multi-Modal Pre-Training for Multimedia Understanding.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

What If We Could Not See? Counterfactual Analysis for Egocentric Action Anticipation.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

2020
Self-Supervised Agent Learning for Unsupervised Cross-Domain Person Re-Identification.
IEEE Trans. Image Process., 2020

CDbin: Compact Discriminative Binary Descriptor Learned With Efficient Neural Network.
IEEE Trans. Circuits Syst. Video Technol., 2020

An Egocentric Action Anticipation Framework via Fusing Intuition and Analysis.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Selecting Useful Knowledge from Previous Tasks for Future Learning in a Single Network.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Label Distribution Learning on Auxiliary Label Space Graphs for Facial Expression Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Learning Click-Based Deep Structure-Preserving Embeddings with Visual Attention.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Unified Spatio-Temporal Attention Networks for Action Recognition in Videos.
IEEE Trans. Multim., 2019

Image Recognition by Predicted User Click Feature With Multidomain Multitask Transfer Deep Network.
IEEE Trans. Image Process., 2019

Toward efficient indexing structure for scalable content-based music retrieval.
Multim. Syst., 2019

AI-Oriented Large-Scale Video Management for Smart City: Technologies, Standards, and Beyond.
IEEE Multim., 2019

A Survey on Food Computing.
ACM Comput. Surv., 2019

A Distributed Approach towards Discriminative Distance Metric Learning.
CoRR, 2019

2018
Image Similarity.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

User-Click-Data-Based Fine-Grained Image Recognition via Weakly Supervised Metric Learning.
ACM Trans. Multim. Comput. Commun. Appl., 2018

You Are What You Eat: Exploring Rich Recipe Information for Cross-Region Food Analysis.
IEEE Trans. Multim., 2018

Scalable Content-Aware Collaborative Filtering for Location Recommendation.
IEEE Trans. Knowl. Data Eng., 2018

Multitask Autoencoder Model for Recovering Human Poses.
IEEE Trans. Ind. Electron., 2018

Multimodal Deep Embedding via Hierarchical Grounded Compositional Semantics.
IEEE Trans. Circuits Syst. Video Technol., 2018

Automatic Generation of Social Event Storyboard From Image Click-Through Data.
IEEE Trans. Circuits Syst. Video Technol., 2018

Hierarchical semantic image matching using CNN feature pyramid.
Comput. Vis. Image Underst., 2018

Sequence-to-Sequence Learning via Shared Latent Representation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Saliency Detection on Light Field: A Multi-Cue Approach.
ACM Trans. Multim. Comput. Commun. Appl., 2017

Enhancing Person Re-identification in a Self-Trained Subspace.
ACM Trans. Multim. Comput. Commun. Appl., 2017

Search by Screenshots for Universal Article Clipping in Mobile Apps.
ACM Trans. Inf. Syst., 2017

Robust Spammer Detection in Microblogs: Leveraging User Carefulness.
ACM Trans. Intell. Syst. Technol., 2017

LEGO-MM: LEarning Structured Model by Probabilistic loGic Ontology Tree for MultiMedia.
IEEE Trans. Image Process., 2017

Learning hierarchical video representation for action recognition.
Int. J. Multim. Inf. Retr., 2017

Changes on the Horizon for the Multimedia Community.
IEEE Multim., 2017

Best Paper and Best Department Article Unveiled.
IEEE Multim., 2017

From Artificial Intelligence to Augmented Intelligence.
IEEE Multim., 2017

Beyond the Words: Predicting User Personality from Heterogeneous Information.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

Multi-level Attention Networks for Visual Question Answering.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Automatic Generation of Visual-Textual Presentation Layout.
ACM Trans. Multim. Comput. Commun. Appl., 2016

Monet: A System for Reliving Your Memories by Theme-Based Photo Storytelling.
IEEE Trans. Multim., 2016

Learning of Multimodal Representations With Random Walks on the Click Graph.
IEEE Trans. Image Process., 2016

Building Hierarchical Representations for Oracle Character and Sketch Recognition.
IEEE Trans. Image Process., 2016

Recognizing Exceptional Contributions.
IEEE Multim., 2016

Understanding Multimedia.
IEEE Multim., 2016

Working with the Domain Experts.
IEEE Multim., 2016

UniClip: Leveraging Web Search for Universal Clipping of Articles on Mobile.
Data Sci. Eng., 2016

Predicting Social Status via Social Networks: A Case Study on University, Occupation, and Region.
CoRR, 2016

Exploiting Dining Preference for Restaurant Recommendation.
Proceedings of the 25th International Conference on World Wide Web, 2016

Who Will Reply to/Retweet This Tweet?: The Dynamics of Intimacy from Online Social Interactions.
Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, 2016

Image2Text: A Multimodal Image Captioner.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Share-and-Chat: Achieving Human-Level Video Commenting by Search and Multi-View Embedding.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Video ChatBot: Triggering Live Social Interactions by Automatic Video Commenting.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Action Recognition by Learning Deep Multi-Granular Spatio-Temporal Video Representation.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Deep Semantic-Preserving and Ranking-Based Hashing for Image Retrieval.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Learning Deep Intrinsic Video Representation by Exploring Temporal Coherence and Graph Structure.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Semi-Supervised Multimodal Deep Learning for RGB-D Object Recognition.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Network Morphism.
Proceedings of the 33rd International Conference on Machine Learning, 2016

Improve dog recognition by mining more information from both click-through logs and pre-trained models.
Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

Joint Multiview Segmentation and Localization of RGB-D Images Using Depth-Induced Silhouette Consistency.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Highlight Detection with Pairwise Deep Ranking for First-Person Video Summarization.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

MSR-VTT: A Large Video Description Dataset for Bridging Video and Language.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Jointly Modeling Embedding and Translation to Bridge Video and Language.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
A Distributed Approach Toward Discriminative Distance Metric Learning.
IEEE Trans. Neural Networks Learn. Syst., 2015

Retargeting Semantically-Rich Photos.
IEEE Trans. Multim., 2015

Learning Cross Space Mapping via DNN Using Large Scale Click-Through Logs.
IEEE Trans. Multim., 2015

Mining Latent Attributes From Click-Through Logs for Image Recognition.
IEEE Trans. Multim., 2015

Partial-Duplicate Clustering and Visual Pattern Discovery on Web Scale Image Database.
IEEE Trans. Multim., 2015

Super Fast Event Recognition in Internet Videos.
IEEE Trans. Multim., 2015

Where2Stand: A Human Position Recommendation System for Souvenir Photography.
ACM Trans. Intell. Syst. Technol., 2015

Learning to Rank Using User Clicks and Visual Features for Image Retrieval.
IEEE Trans. Cybern., 2015

Image Tag Refinement With View-Dependent Concept Representations.
IEEE Trans. Circuits Syst. Video Technol., 2015

Multi-order visual phrase for scalable partial-duplicate visual search.
Multim. Syst., 2015

Establishing Best Papers for IEEE MultiMedia.
IEEE Multim., 2015

Multimedia Goes Beyond Content.
IEEE Multim., 2015

Mining Location-based Social Networks: A Predictive Perspective.
IEEE Data Eng. Bull., 2015

Leveraging Careful Microblog Users for Spammer Detection.
Proceedings of the 24th International Conference on World Wide Web Companion, 2015

Tagging Personal Photos with Transfer Deep Learning.
Proceedings of the 24th International Conference on World Wide Web, 2015

Resorting Relevance Evidences to Cumulative Citation Recommendation for Knowledge Base Acceleration.
Proceedings of the Web-Age Information Management - 16th International Conference, 2015

Predicting Smartphone Adoption in Social Networks.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2015

EMIF: Towards a Scalable and Effective Indexing Framework for Large Scale Music Retrieval.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Regularity and Conformity: Location Prediction Using Heterogeneous Mobility Data.
Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Offline Sketch Parsing via Shapeness Estimation.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

On the selection of trending image from the web.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Content-Aware Collaborative Filtering for Location Recommendation Based on Human Mobility Data.
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015

MeshStereo: A Global Stereo Model with Mesh Alignment Regularization for View Interpolation.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Relaxing from Vocabulary: Robust Weakly-Supervised Deep Learning for Vocabulary-Free Image Tagging.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Query Adaptive Similarity Measure for RGB-D Object Recognition.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Mining consumer impulsivity from offline and online behavior.
Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2015

Automatically Solving Number Word Problems by Semantic Parsing and Reasoning.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Scalable Visual Instance Mining with Instance Graph.
Proceedings of the British Machine Vision Conference 2015, 2015

2014
Up-Fusion: An Evolving Multimedia Fusion Method.
ACM Trans. Multim. Comput. Commun. Appl., 2014

Bilateral Correspondence Model for Words-and-Pictures Association in Multimedia-Rich Microblogs.
ACM Trans. Multim. Comput. Commun. Appl., 2014

Exploiting Click Constraints and Multi-view Features for Image Re-ranking.
IEEE Trans. Multim., 2014

Topic-Sensitive Influencer Mining in Interest-Based Social Media Networks via Hypergraph Learning.
IEEE Trans. Multim., 2014

USB: Ultrashort Binary Descriptor for Fast Visual Matching and Retrieval.
IEEE Trans. Image Process., 2014

Cascade Category-Aware Visual Search.
IEEE Trans. Image Process., 2014

Click Prediction for Web Image Reranking Using Multimodal Sparse Coding.
IEEE Trans. Image Process., 2014

High-Order Distance-Based Multiview Stochastic Learning in Image Classification.
IEEE Trans. Cybern., 2014

Preface: Internet multimedia computing and service.
Multim. Tools Appl., 2014

Deep Neural Networks: Another Tool for Multimedia Computing.
IEEE Multim., 2014

Big Data and Image Search.
IEEE Multim., 2014

IEEE MultiMedia Forges Ahead.
IEEE Multim., 2014

Embedding Multi-Order Spatial Clues for Scalable Visual Matching and Retrieval.
IEEE J. Emerg. Sel. Topics Circuits Syst., 2014

Multimedia search reranking: A literature survey.
ACM Comput. Surv., 2014

Visualizing and Comparing Convolutional Neural Networks.
CoRR, 2014

Indigenization of Urban Mobility.
CoRR, 2014

Learning to personalize trending image search suggestion.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Click-through-based cross-view learning for image search.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

SmartVisio: Interactive Sketch Recognition with Natural Correction and Editing.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

CeleBrowser: An example of browsing big data on small device.
Proceedings of the International Conference on Multimedia Retrieval, 2014

GeoMF: joint geographical modeling and matrix factorization for point-of-interest recommendation.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Large-margin Weakly Supervised Dimensionality Reduction.
Proceedings of the 31st International Conference on Machine Learning, 2014

Unsupervised Template Mining for Semantic Category Understanding.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

As-Rigid-As-Possible Stereo under Second Order Smoothness Priors.
Proceedings of the Computer Vision - ECCV 2014, 2014

DNN Flow: DNN Feature Pyramid based Image Matching.
Proceedings of the British Machine Vision Conference, 2014

What Visual Attributes Characterize an Object Class?
Proceedings of the Computer Vision - ACCV 2014, 2014

Sketch Recognition with Natural Correction and Editing.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Learning Word Representation Considering Proximity and Ambiguity.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Towards decrypting attractiveness via multi-modality cues.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Large-scale multilabel propagation based on efficient sparse graph construction.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Image search - from thousands to billions in 20 years.
ACM Trans. Multim. Comput. Commun. Appl., 2013

View-Based Discriminative Probabilistic Modeling for 3D Object Retrieval and Recognition.
IEEE Trans. Image Process., 2013

Hierarchical affective content analysis in arousal and valence dimensions.
Signal Process., 2013

Pairwise constraints based multiview features fusion for scene classification.
Pattern Recognit., 2013

Cross-media semantic representation via bi-directional learning to rank.
Proceedings of the ACM Multimedia Conference, 2013

Clickage: towards bridging semantic and intent gaps via mining click logs of search engines.
Proceedings of the ACM Multimedia Conference, 2013

Multi-order visual phrase for scalable image search.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013

Multimedia LEGO: Learning Structured Model by Probabilistic Logic Ontology Tree.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

Efficient 2D-to-3D Correspondence Filtering for Scalable 3D Object Recognition.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Cross-Domain Human Action Recognition.
IEEE Trans. Syst. Man Cybern. Part B, 2012

Sparse transfer learning for interactive video search reranking.
ACM Trans. Multim. Comput. Commun. Appl., 2012

Location Discriminative Vocabulary Coding for Mobile Landmark Search.
Int. J. Comput. Vis., 2012

Annotating web images using NOVA: NOn-conVex group spArsity.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Towards indexing representative images on the web.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Sense beauty via face, dressing, and/or voice.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

PartBook for image parsing.
Proceedings of the 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2012

2011
Up-fusion: an evolving multimedia decision fusion method.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Towards low bit rate mobile visual search with multiple-channel coding.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Towards multi-semantic image annotation with graph regularized exclusive group lasso.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Towards cross-category knowledge propagation for learning visual concepts.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Image Classification With Kernelized Spatial-Context.
IEEE Trans. Multim., 2010

Unified tag analysis with multi-edge graph.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Video based 3D reconstruction using spatio-temporal attention analysis.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

2009
Image Similarity.
Proceedings of the Encyclopedia of Database Systems, 2009

Event Tactic Analysis Based on Broadcast Sports Video.
IEEE Trans. Multim., 2009

Two-Dimensional Multilabel Active Learning with an Efficient Online Adaptation Model for Image Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Learning concepts by modeling relationships.
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

2008
Content-Based Multimedia Retrieval.
Proceedings of the Wiley Encyclopedia of Computer Science and Engineering, 2008

An automated end-to-end lecture capture and broadcasting system.
ACM Trans. Multim. Comput. Commun. Appl., 2008

Correlative multilabel video annotation with temporal kernels.
ACM Trans. Multim. Comput. Commun. Appl., 2008

Boosting-Based Multimodal Speaker Detection for Distributed Meeting Videos.
IEEE Trans. Multim., 2008

Using Webcast Text for Semantic Event Detection in Broadcast Sports Video.
IEEE Trans. Multim., 2008

Application Potential of Multimedia Information Retrieval.
Proc. IEEE, 2008

Web video topic discovery and tracking via bipartite graph reinforcement model.
Proceedings of the 17th International Conference on World Wide Web, 2008

Topic mining on web-shared videos.
Proceedings of the IEEE International Conference on Acoustics, 2008

A joint appearance-spatial distance for kernel-based image categorization.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Two-Dimensional Active Learning for image classification.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
Guest Editors' Introduction: Advances in Multimedia Computing.
IEEE Multim., 2007

Trajectory based event tactics analysis in broadcast sports video.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Correlative multi-label video annotation.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Learning Concepts by Modeling Relationships.
Proceedings of the Multimedia Content Analysis and Mining, International Workshop, 2007

Semantic Event Extraction from Basketball Games using Multi-Modal Analysis.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Concurrent Multiple Instance Learning for Image Categorization.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Read, write, and navigation awareness in realistic multi-view collaborations.
Proceedings of the 3rd International Conference on Collaborative Computing: Networking, 2007

2006
Direct Kernel Biased Discriminant Analysis: A New Content-Based Image Retrieval Relevance Feedback Algorithm.
IEEE Trans. Multim., 2006

Semantic retrieval of video - review of research on video retrieval in meetings, movies and broadcast news, and sports.
IEEE Signal Process. Mag., 2006

Multicue HMM-UKF for Real-Time Contour Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2006

Boosting-Based Multimodal Speaker Detection for Distributed Meetings.
Proceedings of the IEEE 8th Workshop on Multimedia Signal Processing, 2006

Robust Visual Tracking via Pixel Classification and Integration.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

A Three-Layer Virtual Director Model for Supporting Automated Multi-Site Distributed Education.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Recognizing Faces in Recorded Meetings via MRC-Boosting.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

PASS: Peer-Aware Silence Suppression for Internet Voice Conferences.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

PING: a Group-to-Individual Distributed Meeting System.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Light Weight Background Blurring for Video Conferencing Applications.
Proceedings of the International Conference on Image Processing, 2006

2005
An automated end-to-end lecture capturing and broadcasting system.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

What is the state of our community?
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Hybrid speaker tracking in an automated lecture room.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Sound source localization for circular arrays of directional microphones.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Characters or Faces: A User Study on Ease of Use for HIPs.
Proceedings of the Human Interactive Proofs, Second International Workshop, 2005

2004
Real-time speaker tracking using particle filter sensor fusion.
Proc. IEEE, 2004

ARTiFACIAL: Automated Reverse Turing test using FACIAL features.
Multim. Syst., 2004

Automating lecture capture and broadcast: technology and videography.
Multim. Syst., 2004

Improving Retrieval Performance by Region Constraints and Relevance Feedback.
J. Comput. Sci. Technol., 2004

Constraint Based Region Matching for Image Retrieval.
Int. J. Comput. Vis., 2004

Breaking the clock face HIP.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

A portable solution for automatic lecture room camera management.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Time delay estimation in the presence of correlated noise and reverberation.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Adaptive tree similarity learning for image retrieval.
Multim. Syst., 2003

Excuse me, but are you human?
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

New direct approaches to robust sound source localization.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Videography for telepresentations.
Proceedings of the 2003 Conference on Human Factors in Computing Systems, 2003

Exploration of Visual Data.
The International Series in Video Computing 7, Springer, ISBN: 978-1-4615-0497-9, 2003

2002
Distributed meetings: a meeting capture and broadcasting system.
Proceedings of the 10th ACM International Conference on Multimedia 2002, 2002

Parametric contour tracking using unscented Kalman filter.
Proceedings of the 2002 International Conference on Image Processing, 2002

Mode-based Multi-Hypothesis Head Tracking Using Parametric Contours.
Proceedings of the 5th IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2002), 2002

2001
Relevance Feedback Techniques in Image Retrieval.
Proceedings of the Principles of Visual Information Retrieval, 2001

Building an intelligent camera management system.
Proceedings of the 9th ACM International Conference on Multimedia 2001, Ottawa, Ontario, Canada, September 30, 2001

Optimal radial contour tracking by dynamic programming.
Proceedings of the 2001 International Conference on Image Processing, 2001

Optimal Adaptive Learning for Image Retrieval.
Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), 2001

Better Proposal Distributions: Object Tracking Using Unscented Particle Filter.
Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), 2001

JPDAF Based HMM for Real-Time Contour Tracking.
Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), 2001

Viewing meeting captured by an omni-directional camera.
Proceedings of the CHI 2001 Conference on Human Factors in Computing Systems, Seattle, WA, USA, March 31, 2001

Automating camera management for lecture room environments.
Proceedings of the CHI 2001 Conference on Human Factors in Computing Systems, Seattle, WA, USA, March 31, 2001

2000
Automatically extracting highlights for TV Baseball programs.
Proceedings of the 8th ACM International Conference on Multimedia 2000, Los Angeles, CA, USA, October 30, 2000

Optimizing Learning in Image Retrieval.
Proceedings of the 2000 Conference on Computer Vision and Pattern Recognition (CVPR 2000), 2000

Segmenting Visual Actions Based on Spatio-Temporal Motion Patterns.
Proceedings of the 2000 Conference on Computer Vision and Pattern Recognition (CVPR 2000), 2000

Browsing digital video.
Proceedings of the CHI 2000 Conference on Human Factors in Computing Systems, 2000

A Framework for Garment Shopping over the Internet.
Proceedings of the Handbook on Electronic Commerce, 2000

1999
Efficient Indexing, Browsing and Retrieval of Image/Video Content.
PhD thesis, 1999

Constructing Table-of-Content for Videos.
Multim. Syst., 1999

Information Retrieval Beyond the Text Document.
Libr. Trends, 1999

Image Retrieval: Current Techniques, Promising Directions, and Open Issues.
J. Vis. Commun. Image Represent., 1999

Video key frame extraction by unsupervised clustering and feedback adjustment.
J. Comput. Sci. Technol., 1999

A novel relevance feedback technique in image retrieval.
Proceedings of the 7th ACM International Conference on Multimedia '99, Orlando, FL, USA, October 30, 1999

Efficient Access to Video Content in a Unified Framework.
Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1999

Water-Filling: A Novel Way for Image Structural Feature Extraction.
Proceedings of the 1999 International Conference on Image Processing, 1999

Video Sequence Learning and Recognition Via Dynamic SOM.
Proceedings of the 1999 International Conference on Image Processing, 1999

1998
A Modified Fourier Descriptor for Shape Matching in MARS.
Proceedings of the Image Databases and Multi-Media Search, 1998

A Region-Based Representation of Images in MARS.
J. VLSI Signal Process., 1998

Supporting Ranked Boolean Similarity Queries in MARS.
IEEE Trans. Knowl. Data Eng., 1998

Relevance feedback: a power tool for interactive content-based image retrieval.
IEEE Trans. Circuits Syst. Video Technol., 1998

Relevance Feedback Techniques in Interactive Content-Based Image Retrieval.
Proceedings of the Storage and Retrieval for Image and Video Databases VI, 1998

Browsing and retrieving video content in a unified framework.
Proceedings of the Second IEEE Workshop on Multimedia Signal Processing, 1998

Exploring Video Structure Beyond the Shots.
Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1998

Adaptive Key Frame Extraction using Unsupervised Clustering.
Proceedings of the 1998 IEEE International Conference on Image Processing, 1998

Digital image/video library and MPEG-7: standardization and research issues.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Supporting Similarity Queries in MARS.
Proceedings of the Fifth ACM International Conference on Multimedia '97, 1997

Supporting Content-based Queries over Images in MARS.
Proceedings of the International Conference on Multimedia Computing and Systems, 1997

Content-Based Image Retrieval with Relevance Feedback in MARS.
Proceedings of the 1997 International Conference on Image Processing, 1997

1996
Automated region segmentation using attraction-based grouping in spatial-color-texture space.
Proceedings of the 1996 International Conference on Image Processing, 1996

