Ting Yao

Abdulmotaleb El-Saddik

ACM Trans. Multim. Comput. Commun. Appl., 2019

Editorial to Special Issue on Deep Learning for Intelligent Multimedia Analytics.

[BibT_eX]

[DOI]

Wei Zhang

Abdulmotaleb El-Saddik

ACM Trans. Multim. Comput. Commun. Appl., 2019

Learning Click-Based Deep Structure-Preserving Embeddings with Visual Attention.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2019

Unified Spatio-Temporal Attention Networks for Action Recognition in Videos.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2019

See and chat: automatically generating viewer-level comments on images.

[BibT_eX]

[DOI]

Jingwen Chen

Hongyang Chao

Multim. Tools Appl., 2019

Vision and Language: from Visual Perception to Content Creation.

[BibT_eX]

[DOI]

Wei Zhang

CoRR, 2019

Multi-Source Domain Adaptation and Semi-Supervised Domain Adaptation with Focus on Visual Domain Adaptation Challenge 2019.

[BibT_eX]

[DOI]

CoRR, 2019

Scheduled Differentiable Architecture Search for Visual Recognition.

[BibT_eX]

[DOI]

CoRR, 2019

vireoJD-MM at Activity Detection in Extended Videos.

[BibT_eX]

[DOI]

CoRR, 2019

Trimmed Action Recognition, Dense-Captioning Events in Videos, and Spatio-temporal Action Localization with Focus on ActivityNet Challenge 2019.

[BibT_eX]

[DOI]

CoRR, 2019

VireoJD-MM @ TRECVid 2019: Activities in Extended Video (ActEV).

[BibT_eX]

[DOI]

Proceedings of the 2019 TREC Video Retrieval Evaluation, 2019

daBNN: A Super Fast Inference Framework for Binary Neural Networks on ARM devices.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Long Short-Term Relation Networks for Video Action Detection.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Animating Your Life: Real-Time Video-to-Animation Translation.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Mocycle-GAN: Unpaired Video-to-Video Translation.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Convolutional Auto-encoding of Sentence Topics for Image Paragraph Generation.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Deep Learning for Video Captioning: A Review.

[BibT_eX]

[DOI]

Shaoxiang Chen

Yu-Gang Jiang

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Hierarchy Parsing for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Relation Distillation Networks for Video Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Customizable Architecture Search for Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning Spatio-Temporal Representation With Local and Global Diffusion.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Transferrable Prototypical Networks for Unsupervised Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Gaussian Temporal Awareness Networks for Action Localization.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Pointing Novel Objects in Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Exploring Object Relation in Mean Teacher for Cross-Domain Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Learning Deep Spatio-Temporal Dependence for Semantic Video Segmentation.

[BibT_eX]

[DOI]

Zhaofan Qiu

IEEE Trans. Multim., 2018

Exploiting Web Images for Video Highlight Detection With Triplet Deep Ranking.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2018

Boosting image sentiment analysis with visual attention.

[BibT_eX]

[DOI]

Neurocomputing, 2018

Deep Domain Adaptation Hashing with Adversarial Learning.

[BibT_eX]

[DOI]

Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Greedy Layer-Wise Training of Long Short Term Memory Networks.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Multimedia & Expo Workshops, 2018

Exploring Visual Relationship for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Recurrent Tubelet Proposal and Recognition Networks for Action Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Fully Convolutional Adaptation Networks for Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Jointly Localizing and Describing Events for Dense Video Captioning.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Memory Matching Networks for One-Shot Image Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Deep learning for video classification and captioning.

[BibT_eX]

[DOI]

Proceedings of the Frontiers of Multimedia Research, 2018

2017

Detecting shot boundary with sparse coding for video summarization.

[BibT_eX]

[DOI]

Neurocomputing, 2017

Learning hierarchical video representation for action recognition.

[BibT_eX]

[DOI]

Int. J. Multim. Inf. Retr., 2017

Deep Semantic Hashing with Generative Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Seeing Bot.

[BibT_eX]

[DOI]

Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Learning Multimodal Attention LSTM Networks for Video Captioning.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

To Create What You Tell: Generating Videos from Captions.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Boosting Image Captioning with Attributes.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks.

[BibT_eX]

[DOI]

Zhaofan Qiu

Proceedings of the IEEE International Conference on Computer Vision, 2017

Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Deep Quantization: Encoding Convolutional Activations with Deep Generative Model.

[BibT_eX]

[DOI]

Zhaofan Qiu

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Video Captioning with Transferred Semantic Attributes.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Deep Learning for Video Classification and Captioning.

[BibT_eX]

[DOI]

CoRR, 2016

Multi-Scale Triplet CNN for Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Share-and-Chat: Achieving Human-Level Video Commenting by Search and Multi-View Embedding.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Video ChatBot: Triggering Live Social Interactions by Automatic Video Commenting.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Action Recognition by Learning Deep Multi-Granular Spatio-Temporal Video Representation.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Deep Semantic-Preserving and Ranking-Based Hashing for Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Learning Deep Intrinsic Video Representation by Exploring Temporal Coherence and Graph Structure.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Highlight Detection with Pairwise Deep Ranking for First-Person Video Summarization.

[BibT_eX]

[DOI]

Yong Rui

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

MSR-VTT: A Large Video Description Dataset for Bridging Video and Language.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Jointly Modeling Embedding and Translation to Bridge Video and Language.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

You Lead, We Exceed: Labor-Free Video Concept Learning by Jointly Exploiting Web Videos and Images.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

Click-boosting multi-modality graph-based reranking for image search.

[BibT_eX]

[DOI]

Multim. Syst., 2015

Semi-supervised Hashing with Semantic Confidence for Large Scale Visual Search.

[BibT_eX]

[DOI]

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

Learning Query and Image Similarities with Ranking Canonical Correlation Analysis.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Semi-supervised Domain Adaptation with Subspace Learning for visual recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014

VIREO-TNO @ TRECVID 2014: Multimedia Event Detection and Recounting (MED and MER).

[BibT_eX]

[DOI]

John G. M. Schavemaker

Klamer Schutte

Wessel Kraaij

Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

VIREO @ TRECVID 2014: Instance Search and Semantic Indexing.

[BibT_eX]

[DOI]

Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Click-through-based cross-view learning for image search.

[BibT_eX]

[DOI]

Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Click-through-based Subspace Learning for Image Search.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

2013

Circular Reranking for Visual Search.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2013

Unified entity search in social media community.

[BibT_eX]

[DOI]

Proceedings of the 22nd International World Wide Web Conference, 2013

VIREO/ECNU @ TRECVID 2013: A Video Dance of Detection, Recounting and Search with Motion Relativity and Concept Learning from Wild.

[BibT_eX]

[DOI]

Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

Annotation for free: video tagging by mining user search behavior.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

Image search by graph-based label propagation with image representation from DNN.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

Video concept detection by learning from web images: A case study on cross domain learning.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Click-boosting random walk for image search reranking.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013

2012

VIREO @ TRECVID 2012: Searching with Topology, Recounting will Small Concepts, Learning with Free Examples.

[BibT_eX]

[DOI]

Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

Predicting domain adaptivity: redo or recycle?

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

2011

VIREO @ TRECVID 2011: Instance Search, Semantic Indexing, Multimedia Event Detection and Known-Item Search.

[BibT_eX]

[DOI]

Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

Context-based friend suggestion in online photo-sharing community.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

2010

Co-reranking by mutual reinforcement for image search.

[BibT_eX]

[DOI]