Yong Rui

Min-Ling Zhang

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Graph Attention Transformer Network for Multi-label Image Classification.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2023

Balanced masking strategy for multi-label image classification.

[BibT_eX]

[DOI]

Neurocomputing, 2023

A Survey on Video Moment Localization.

[BibT_eX]

[DOI]

ACM Comput. Surv., 2023

Hybrid Representation Learning via Epistemic Graph.

[BibT_eX]

[DOI]

CoRR, 2023

Learning From Biased Soft Labels.

[BibT_eX]

[DOI]

CoRR, 2023

Learning From Biased Soft Labels.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022

msr-vtt.

[BibT_eX]

[DOI]

Dataset, December, 2022

Hierarchical User Intent Graph Network for Multimedia Recommendation.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2022

Hierarchical Deep Click Feature Prediction for Fine-Grained Image Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Lenovo Schedules Laptop Manufacturing Using Deep Reinforcement Learning.

[BibT_eX]

[DOI]

INFORMS J. Appl. Anal., 2022

Knowledge Mining: A Cross-disciplinary Survey.

[BibT_eX]

[DOI]

Vicente Iván Sánchez Carmona

Int. J. Autom. Comput., 2022

Self-Supervised Graph Neural Network for Multi-Source Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Delving Globally into Texture and Structure for Image Inpainting.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

A Noise-robust Locality Transformer for Fine-grained Food Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 5th IEEE International Conference on Multimedia Information Processing and Retrieval, 2022

Semi-Supervised 3D Medical Image Segmentation Via Boundary-Aware Consistent Hidden Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

2021

Hierarchical User Intent Graph Network forMultimedia Recommendation.

[BibT_eX]

[DOI]

CoRR, 2021

HoloBoard: a Large-format Immersive Teaching Board based on pseudo HoloGraphics.

[BibT_eX]

[DOI]

Proceedings of the UIST '21: The 34th Annual ACM Symposium on User Interface Software and Technology, 2021

MMPT'21: International Joint Workshop on Multi-Modal Pre-Training for Multimedia Understanding.

[BibT_eX]

[DOI]

Alexander G. Hauptmann

Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

What If We Could Not See? Counterfactual Analysis for Egocentric Action Anticipation.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

2020

Self-Supervised Agent Learning for Unsupervised Cross-Domain Person Re-Identification.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

CDbin: Compact Discriminative Binary Descriptor Learned With Efficient Neural Network.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2020

An Egocentric Action Anticipation Framework via Fusing Intuition and Analysis.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Selecting Useful Knowledge from Previous Tasks for Future Learning in a Single Network.

[BibT_eX]

[DOI]

Proceedings of the 25th International Conference on Pattern Recognition, 2020

Label Distribution Learning on Auxiliary Label Space Graphs for Facial Expression Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Learning Click-Based Deep Structure-Preserving Embeddings with Visual Attention.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2019

Unified Spatio-Temporal Attention Networks for Action Recognition in Videos.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2019

Image Recognition by Predicted User Click Feature With Multidomain Multitask Transfer Deep Network.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2019

Toward efficient indexing structure for scalable content-based music retrieval.

[BibT_eX]

[DOI]

Multim. Syst., 2019

AI-Oriented Large-Scale Video Management for Smart City: Technologies, Standards, and Beyond.

[BibT_eX]

[DOI]

IEEE Multim., 2019

A Survey on Food Computing.

[BibT_eX]

[DOI]

ACM Comput. Surv., 2019

A Distributed Approach towards Discriminative Distance Metric Learning.

[BibT_eX]

[DOI]

CoRR, 2019

2018

Image Similarity.

[BibT_eX]

[DOI]

Tao Mei

Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

User-Click-Data-Based Fine-Grained Image Recognition via Weakly Supervised Metric Learning.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2018

You Are What You Eat: Exploring Rich Recipe Information for Cross-Region Food Analysis.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2018

Scalable Content-Aware Collaborative Filtering for Location Recommendation.

[BibT_eX]

[DOI]

IEEE Trans. Knowl. Data Eng., 2018

Multitask Autoencoder Model for Recovering Human Poses.

[BibT_eX]

[DOI]

IEEE Trans. Ind. Electron., 2018

Multimodal Deep Embedding via Hierarchical Grounded Compositional Semantics.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2018

Automatic Generation of Social Event Storyboard From Image Click-Through Data.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2018

Hierarchical semantic image matching using CNN feature pyramid.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2018

Sequence-to-Sequence Learning via Shared Latent Representation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Saliency Detection on Light Field: A Multi-Cue Approach.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2017

Enhancing Person Re-identification in a Self-Trained Subspace.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2017

Search by Screenshots for Universal Article Clipping in Mobile Apps.

[BibT_eX]

[DOI]

ACM Trans. Inf. Syst., 2017

Robust Spammer Detection in Microblogs: Leveraging User Carefulness.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2017

LEGO-MM: LEarning Structured Model by Probabilistic loGic Ontology Tree for MultiMedia.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2017

Learning hierarchical video representation for action recognition.

[BibT_eX]

[DOI]

Int. J. Multim. Inf. Retr., 2017

Changes on the Horizon for the Multimedia Community.

[BibT_eX]

[DOI]

IEEE Multim., 2017

Best Paper and Best Department Article Unveiled.

[BibT_eX]

[DOI]

IEEE Multim., 2017

From Artificial Intelligence to Augmented Intelligence.

[BibT_eX]

[DOI]

IEEE Multim., 2017

Beyond the Words: Predicting User Personality from Heterogeneous Information.

[BibT_eX]

[DOI]

Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

Multi-level Attention Networks for Visual Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Automatic Generation of Visual-Textual Presentation Layout.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2016

Monet: A System for Reliving Your Memories by Theme-Based Photo Storytelling.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2016

Learning of Multimodal Representations With Random Walks on the Click Graph.

[BibT_eX]

[DOI]

Zhongfei (Mark) Zhang

Yueting Zhuang

IEEE Trans. Image Process., 2016

Building Hierarchical Representations for Oracle Character and Sketch Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2016

Recognizing Exceptional Contributions.

[BibT_eX]

[DOI]

IEEE Multim., 2016

Understanding Multimedia.

[BibT_eX]

[DOI]

IEEE Multim., 2016

Working with the Domain Experts.

[BibT_eX]

[DOI]

IEEE Multim., 2016

UniClip: Leveraging Web Search for Universal Clipping of Articles on Mobile.

[BibT_eX]

[DOI]

Data Sci. Eng., 2016

Predicting Social Status via Social Networks: A Case Study on University, Occupation, and Region.

[BibT_eX]

[DOI]

CoRR, 2016

Exploiting Dining Preference for Restaurant Recommendation.

[BibT_eX]

[DOI]

Proceedings of the 25th International Conference on World Wide Web, 2016

Who Will Reply to/Retweet This Tweet?: The Dynamics of Intimacy from Online Social Interactions.

[BibT_eX]

[DOI]

Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, 2016

Image2Text: A Multimodal Image Captioner.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Share-and-Chat: Achieving Human-Level Video Commenting by Search and Multi-View Embedding.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Video ChatBot: Triggering Live Social Interactions by Automatic Video Commenting.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Action Recognition by Learning Deep Multi-Granular Spatio-Temporal Video Representation.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Deep Semantic-Preserving and Ranking-Based Hashing for Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Learning Deep Intrinsic Video Representation by Exploring Temporal Coherence and Graph Structure.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Semi-Supervised Multimodal Deep Learning for RGB-D Object Recognition.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Network Morphism.

[BibT_eX]

[DOI]

Proceedings of the 33nd International Conference on Machine Learning, 2016

Improve dog recognition by mining more information from both click-through logs and pre-trained models.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

Joint Multiview Segmentation and Localization of RGB-D Images Using Depth-Induced Silhouette Consistency.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Highlight Detection with Pairwise Deep Ranking for First-Person Video Summarization.

[BibT_eX]

[DOI]

Ting Yao

Tao Mei

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

MSR-VTT: A Large Video Description Dataset for Bridging Video and Language.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Jointly Modeling Embedding and Translation to Bridge Video and Language.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

A Distributed Approach Toward Discriminative Distance Metric Learning.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2015

Retargeting Semantically-Rich Photos.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2015

Learning Cross Space Mapping via DNN Using Large Scale Click-Through Logs.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2015

Mining Latent Attributes From Click-Through Logs for Image Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2015

Partial-Duplicate Clustering and Visual Pattern Discovery on Web Scale Image Database.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2015

Super Fast Event Recognition in Internet Videos.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2015

Where2Stand: A Human Position Recommendation System for Souvenir Photography.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2015

Learning to Rank Using User Clicks and Visual Features for Image Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Cybern., 2015

Image Tag Refinement With View-Dependent Concept Representations.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2015

Multi-order visual phrase for scalable partial-duplicate visual search.

[BibT_eX]

[DOI]

Multim. Syst., 2015

Establishing Best Papers for IEEE MultiMedia.

[BibT_eX]

[DOI]

IEEE Multim., 2015

Multimedia Goes Beyond Content.

[BibT_eX]

[DOI]

IEEE Multim., 2015

Mining Location-based Social Networks: A Predictive Perspective.

[BibT_eX]

[DOI]

IEEE Data Eng. Bull., 2015

Leveraging Careful Microblog Users for Spammer Detection.

[BibT_eX]

[DOI]

Hao Fu

Xing Xie

Proceedings of the 24th International Conference on World Wide Web Companion, 2015

Tagging Personal Photos with Transfer Deep Learning.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on World Wide Web, 2015

Resorting Relevance Evidences to Cumulative Citation Recommendation for Knowledge Base Acceleration.

[BibT_eX]

[DOI]

Proceedings of the Web-Age Information Management - 16th International Conference, 2015

Predicting Smartphone Adoption in Social Networks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Knowledge Discovery and Data Mining, 2015

EMIF: Towards a Scalable and Effective Indexing Framework for Large Scale Music Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Regularity and Conformity: Location Prediction Using Heterogeneous Mobility Data.

[BibT_eX]

[DOI]

Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Offline Sketch Parsing via Shapeness Estimation.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

On the selection of trending image from the web.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Content-Aware Collaborative Filtering for Location Recommendation Based on Human Mobility Data.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Data Mining, 2015

MeshStereo: A Global Stereo Model with Mesh Alignment Regularization for View Interpolation.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Relaxing from Vocabulary: Robust Weakly-Supervised Deep Learning for Vocabulary-Free Image Tagging.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Query Adaptive Similarity Measure for RGB-D Object Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Mining consumer impulsivity from offline and online behavior.

[BibT_eX]

[DOI]

Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2015

Automatically Solving Number Word Problems by Semantic Parsing and Reasoning.

[BibT_eX]

[DOI]

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Scalable Visual Instance Mining with Instance Graph.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference 2015, 2015

2014

Up-Fusion: An Evolving Multimedia Fusion Method.

[BibT_eX]

[DOI]

Xiangyu Wang

Mohan S. Kankanhalli

ACM Trans. Multim. Comput. Commun. Appl., 2014

Bilateral Correspondence Model for Words-and-Pictures Association in Multimedia-Rich Microblogs.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2014

Exploiting Click Constraints and Multi-view Features for Image Re-ranking.

[BibT_eX]

[DOI]

Jun Yu

Bo Chen

IEEE Trans. Multim., 2014

Topic-Sensitive Influencer Mining in Interest-Based Social Media Networks via Hypergraph Learning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2014

USB: Ultrashort Binary Descriptor for Fast Visual Matching and Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

Cascade Category-Aware Visual Search.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

Click Prediction for Web Image Reranking Using Multimodal Sparse Coding.

[BibT_eX]

[DOI]

Jun Yu

Dacheng Tao

IEEE Trans. Image Process., 2014

High-Order Distance-Based Multiview Stochastic Learning in Image Classification.

[BibT_eX]

[DOI]

IEEE Trans. Cybern., 2014

Preface: Internet multimedia computing and service.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2014

Deep Neural Networks: Another Tool for Multimedia Computing.

[BibT_eX]

[DOI]

IEEE Multim., 2014

Big Data and Image Search.

[BibT_eX]

[DOI]

IEEE Multim., 2014

IEEE MultiMedia Forges Ahead.

[BibT_eX]

[DOI]

IEEE Multim., 2014

Embedding Multi-Order Spatial Clues for Scalable Visual Matching and Retrieval.

[BibT_eX]

[DOI]

IEEE J. Emerg. Sel. Topics Circuits Syst., 2014

Multimedia search reranking: A literature survey.

[BibT_eX]

[DOI]

ACM Comput. Surv., 2014

Visualizing and Comparing Convolutional Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2014

Indigenization of Urban Mobility.

[BibT_eX]

[DOI]

CoRR, 2014

Learning to personalize trending image search suggestion.

[BibT_eX]

[DOI]

Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Click-through-based cross-view learning for image search.

[BibT_eX]

[DOI]

Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

SmartVisio: Interactive Sketch Recognition with Natural Correction and Editing.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

CeleBrowser: An example of browsing big data on small device.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Multimedia Retrieval, 2014

GeoMF: joint geographical modeling and matrix factorization for point-of-interest recommendation.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Large-margin Weakly Supervised Dimensionality Reduction.

[BibT_eX]

[DOI]

Proceedings of the 31th International Conference on Machine Learning, 2014

Unsupervised Template Mining for Semantic Category Understanding.

[BibT_eX]

[DOI]

Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

As-Rigid-As-Possible Stereo under Second Order Smoothness Priors.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2014, 2014

DNN Flow: DNN Feature Pyramid based Image Matching.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference, 2014

What Visual Attributes Characterize an Object Class?

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2014, 2014

Sketch Recognition with Natural Correction and Editing.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Learning Word Representation Considering Proximity and Ambiguity.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013

Towards decrypting attractiveness via multi-modality cues.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2013

Large-scale multilabel propagation based on efficient sparse graph construction.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2013

Image search - from thousands to billions in 20 years.

[BibT_eX]

[DOI]

Lei Zhang

ACM Trans. Multim. Comput. Commun. Appl., 2013

View-Based Discriminative Probabilistic Modeling for 3D Object Retrieval and Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2013

Hierarchical affective content analysis in arousal and valence dimensions.

[BibT_eX]

[DOI]

Signal Process., 2013

Pairwise constraints based multiview features fusion for scene classification.

[BibT_eX]

[DOI]

Pattern Recognit., 2013

Cross-media semantic representation via bi-directional learning to rank.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

Clickage: towards bridging semantic and intent gaps via mining click logs of search engines.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

Multi-order visual phrase for scalable image search.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013

Multimedia LEGO: Learning Structured Model by Probabilistic Logic Ontology Tree.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

Efficient 2D-to-3D Correspondence Filtering for Scalable 3D Object Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012

Cross-Domain Human Action Recognition.

[BibT_eX]

[DOI]

Wei Bian

Dacheng Tao

IEEE Trans. Syst. Man Cybern. Part B, 2012

Sparse transfer learning for interactive video search reranking.

[BibT_eX]

[DOI]

Xinmei Tian

Dacheng Tao

ACM Trans. Multim. Comput. Commun. Appl., 2012

Location Discriminative Vocabulary Coding for Mobile Landmark Search.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2012

Annotating web images using NOVA: NOn-conVex group spArsity.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Towards indexing representative images on the web.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Sense beauty via face, dressing, and/or voice.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

PartBook for image parsing.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2012

2011

Up-fusion: an evolving multimedia decision fusion method.

[BibT_eX]

[DOI]

Xiangyu Wang

Mohan S. Kankanhalli

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Towards low bit rate mobile visual search with multiple-channel coding.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Towards multi-semantic image annotation with graph regularized exclusive group lasso.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Towards cross-category knowledge propagation for learning visual concepts.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010

Image Classification With Kernelized Spatial-Context.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2010

Unified tag analysis with multi-edge graph.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Multimedia 2010, 2010

Video based 3D reconstruction using spatio-temporal attention analysis.

[BibT_eX]

[DOI]

Xian Xiao

Changsheng Xu

Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

2009

Image Similarity.

[BibT_eX]

[DOI]

Tao Mei

Proceedings of the Encyclopedia of Database Systems, 2009

Event Tactic Analysis Based on Broadcast Sports Video.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2009

Two-Dimensional Multilabel Active Learning with an Efficient Online Adaptation Model for Image Classification.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2009

Learning concepts by modeling relationships.

[BibT_eX]

[DOI]

Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

2008

Content-Based Multimedia Retrieval.

[BibT_eX]

[DOI]

Xian-Sheng Hua

Proceedings of the Wiley Encyclopedia of Computer Science and Engineering, 2008

An automated end-to-end lecture capture and broadcasting system.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2008

Correlative multilabel video annotation with temporal kernels.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2008

Boosting-Based Multimodal Speaker Detection for Distributed Meeting Videos.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2008

Using Webcast Text for Semantic Event Detection in Broadcast Sports Video.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2008

Application Potential of Multimedia Information Retrieval.

[BibT_eX]

[DOI]

Mohan S. Kankanhalli

Proc. IEEE, 2008

Web video topic discovery and tracking via bipartite graph reinforcement model.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on World Wide Web, 2008

Topic mining on web-shared videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

A joint appearance-spatial distance for kernel-based image categorization.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Two-Dimensional Active Learning for image classification.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007

Guest Editors' Introduction: Advances in Multimedia Computing.

[BibT_eX]

[DOI]

Ketan Mayer-Patel

Wolfgang Klas

IEEE Multim., 2007

Trajectory based event tactics analysis in broadcast sports video.

[BibT_eX]

[DOI]

Proceedings of the 15th International Conference on Multimedia 2007, 2007

Correlative multi-label video annotation.

[BibT_eX]

[DOI]

Proceedings of the 15th International Conference on Multimedia 2007, 2007

Learning Concepts by Modeling Relationships.

[BibT_eX]

[DOI]

Guo-Jun Qi

Proceedings of the Multimedia Content Analysis and Mining, International Workshop, 2007

Semantic Event Extraction from Basketball Games using Multi-Modal Analysis.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Concurrent Multiple Instance Learning for Image Categorization.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Read, write, and navigation awareness in realistic multi-view collaborations.

[BibT_eX]

[DOI]

Sasa Junuzovic

Prasun Dewan

Proceedings of the 3rd International Conference on Collaborative Computing: Networking, 2007

2006

Direct Kernel Biased Discriminant Analysis: A New Content-Based Image Retrieval Relevance Feedback Algorithm.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2006

Semantic retrieval of video - review of research on video retrieval in meetings, movies and broadcast news, and sports.

[BibT_eX]

[DOI]

IEEE Signal Process. Mag., 2006

Multicue HMM-UKF for Real-Time Contour Tracking.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2006

Boosting-Based Multimodal Speaker Detection for Distributed Meetings.

[BibT_eX]

[DOI]

Proceedings of the IEEE 8th Workshop on Multimedia Signal Processing, 2006

Robust Visual Tracking via Pixel Classification and Integration.

[BibT_eX]

[DOI]

Cha Zhang

Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

A Three-Layer Virtual Director Model for Supporting Automated Multi-Site Distributed Education.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Recognizing Faces in Recorded Meetings via MRC-Boosting.

[BibT_eX]

[DOI]

Xun Xu

Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

PASS: Peer-Aware Silence Suppression for Internet Voice Conferences.

[BibT_eX]

[DOI]

Xun Xu

Li-wei He

Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

PING: a Group-to-Individual Distributed Meeting System.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Light Weight Background Blurring for Video Conferencing Applications.

[BibT_eX]

[DOI]

Cha Zhang

Li-wei He

Proceedings of the International Conference on Image Processing, 2006

2005

An automated end-to-end lecture capturing and broadcasting system.

[BibT_eX]

[DOI]

Proceedings of the 13th ACM International Conference on Multimedia, 2005

What is the state of our community?

[BibT_eX]

[DOI]

Proceedings of the 13th ACM International Conference on Multimedia, 2005

Hybrid speaker tracking in an automated lecture room.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Sound source localization for circular arrays of directional microphones.

[BibT_eX]

[DOI]

Warren Lam

Jinyan Su

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Characters or Faces: A User Study on Ease of Use for HIPs.

[BibT_eX]

[DOI]

Proceedings of the Human Interactive Proofs, Second International Workshop, 2005

2004

Real-time speaker tracking using particle filter sensor fusion.

[BibT_eX]

[DOI]

Proc. IEEE, 2004

ARTiFACIAL: Automated Reverse Turing test using FACIAL features.

[BibT_eX]

[DOI]

Zicheng Liu

Multim. Syst., 2004

Automating lecture capture and broadcast: technology and videography.

[BibT_eX]

[DOI]

Multim. Syst., 2004

Improving Retrieval Performance by Region Constraints and Relevance Feedback.

[BibT_eX]

[DOI]

Tao Wang

Jia-Guang Sun

J. Comput. Sci. Technol., 2004

Constraint Based Region Matching for Image Retrieval.

[BibT_eX]

[DOI]

Tao Wang

Jia-Guang Sun

Int. J. Comput. Vis., 2004

Breaking the clock face HIP.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

A portable solution for automatic lecture room camera management.

[BibT_eX]

[DOI]

Michael N. Wallick

Li-wei He

Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Time delay estimation in the presence of correlated noise and reverberation.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Adaptive tree similarity learning for image retrieval.

[BibT_eX]

[DOI]

Multim. Syst., 2003

Excuse me, but are you human?

[BibT_eX]

[DOI]

Zicheng Liu

Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

New direct approaches to robust sound source localization.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Videography for telepresentations.

[BibT_eX]

[DOI]

Anoop Gupta

Jonathan Grudin

Proceedings of the 2003 Conference on Human Factors in Computing Systems, 2003

Exploration of Visual Data.

[BibT_eX]

[DOI]

Xiang Sean Zhou

The International Series in Video Computing 7, Springer, ISBN: 978-1-4615-0497-9, 2003

2002

Distributed meetings: a meeting capture and broadcasting system.

[BibT_eX]

[DOI]

Proceedings of the 10th ACM International Conference on Multimedia 2002, 2002

Parametric contour tracking using unscented Kalman filter.

[BibT_eX]

[DOI]

Proceedings of the 2002 International Conference on Image Processing, 2002

Mode-based Multi-Hypothesis Head Tracking Using Parametric Contours.

[BibT_eX]

[DOI]

Proceedings of the 5th IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2002), 2002

2001

Relevance Feedback Techniques in Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Principles of Visual Information Retrieval, 2001

Building an intelligent camera management system.

[BibT_eX]

[DOI]

Proceedings of the 9th ACM International Conference on Multimedia 2001, Ottawa, Ontario, Canada, September 30, 2001

Optimal radial contour tracking by dynamic programming.

[BibT_eX]

[DOI]

Proceedings of the 2001 International Conference on Image Processing, 2001

Optimal Adaptive Learning for Image Retrieval.

[BibT_eX]

[DOI]

Tao Wang

Shi-Min Hu

Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), 2001

Better Proposal Distributions: Object Tracking Using Unscented Particle Filter.

[BibT_eX]

[DOI]

Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), 2001

JPDAF Based HMM or Real-Time Contour Tracking.

[BibT_eX]

[DOI]

Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), 2001

Viewing meeting captured by an omni-directional camera.

[BibT_eX]

[DOI]

Anoop Gupta

Jonathan J. Cadiz

Proceedings of the CHI 2001 Conference on Human Factors in Computing Systems, Seattle, WA, USA, March 31, 2001

Automating camera management for lecture room environments.

[BibT_eX]

[DOI]

Proceedings of the CHI 2001 Conference on Human Factors in Computing Systems, Seattle, WA, USA, March 31, 2001

2000

Automatically extracting highlights for TV Baseball programs.

[BibT_eX]

[DOI]

Anoop Gupta

Alex Acero

Proceedings of the 8th ACM International Conference on Multimedia 2000, Los Angeles, CA, USA, October 30, 2000

Optimizing Learning in Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2000 Conference on Computer Vision and Pattern Recognition (CVPR 2000), 2000

Segmenting Visual Actions Based on Spatio-Temporal Motion Patterns.

[BibT_eX]

[DOI]

P. Anandan

Proceedings of the 2000 Conference on Computer Vision and Pattern Recognition (CVPR 2000), 2000

Browsing digital video.

[BibT_eX]

[DOI]

Proceedings of the CHI 2000 Conference on Human factors in computing systems, 2000

A Framework for Garment Shopping over the Internet.

[BibT_eX]

[DOI]

Proceedings of the Handbook on Electronic Commerce, 2000

1999

Efficient Indexing, Browsing and Retrieval of Image/Video Content

[BibT_eX]

[DOI]

PhD thesis, 1999

Constructing Table-of-Content for Videos.

[BibT_eX]

[DOI]

Multim. Syst., 1999

Information Retrieval Beyond the Text Document.

[BibT_eX]

[DOI]

Libr. Trends, 1999

Image Retrieval: Current Techniques, Promising Directions, and Open Issues.

[BibT_eX]

[DOI]

Shih-Fu Chang

J. Vis. Commun. Image Represent., 1999

Video key frame extraction by unsupervised clustering and feedback adjustment.

[BibT_eX]

[DOI]

Yueting Zhuang

J. Comput. Sci. Technol., 1999

A novel relevance feedback technique in image retrieval.

[BibT_eX]

[DOI]

Proceedings of the 7th ACM International Conference on Multimedia '99, Orlando, FL, USA, October 30, 1999

Efficient Access to Video Content in a Unified Framework.

[BibT_eX]

[DOI]

Xiang Sean Zhou

Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1999

Water-Filling: A Novel Way for Image Structural Feature Extraction.

[BibT_eX]

[DOI]

Xiang Sean Zhou

Proceedings of the 1999 International Conference on Image Processing, 1999

Video Sequence Learning and Recognition Via Dynamic Som.

[BibT_eX]

[DOI]

Proceedings of the 1999 International Conference on Image Processing, 1999

1998

A Modified Fourier Descriptor for Shape Matching in MARS.

[BibT_eX]

[DOI]

Alfred C. She

Proceedings of the Image Databases and Multi-Media Search, 1998

A Region-Based Representation of Images in MARS.

[BibT_eX]

[DOI]

J. VLSI Signal Process., 1998

Supporting Ranked Boolean Similarity Queries in MARS.

[BibT_eX]

[DOI]

IEEE Trans. Knowl. Data Eng., 1998

Relevance feedback: a power tool for interactive content-based image retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 1998

Relevance Feedback Techniques in Interactive Content-Based Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Storage and Retrieval for Image and Video Databases VI, 1998

Browsing and retrieving video content in a unified framework.

[BibT_eX]

[DOI]

Proceedings of the Second IEEE Workshop on Multimedia Signal Processing, 1998

Exploring Video Structure Beyond the Shots.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1998

Adaptive Key Frame Extraction using Unsupervised Clustering.

[BibT_eX]

[DOI]

Proceedings of the 1998 IEEE International Conference on Image Processing, 1998

Digital image/video library and MPEG-7: standardization and research issues.

[BibT_eX]

[DOI]

Shih-Fu Chang

Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997

Supporting Similarity Queries in MARS.

[BibT_eX]

[DOI]

Proceedings of the Fifth ACM International Conference on Multimedia '97, 1997

Supporting Content-based Queries over Images in MARS.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Multimedia Computing and Systems, 1997

Content-Based Image Retrieval with Relevance Feedback in MARS.

[BibT_eX]

[DOI]

Proceedings of the Proceedings 1997 International Conference on Image Processing, 1997

1996

Automated region segmentation using attraction-based grouping in spatial-color-texture space.

[BibT_eX]

[DOI]

Alfred C. She