Yu-Gang Jiang

IEEE Trans. Multim., 2021

Co-Attention Memory Network for Multimodal Microblog's Hashtag Recommendation.

[BibT_eX]

[DOI]

IEEE Trans. Knowl. Data Eng., 2021

A Study of Multi-Task and Region-Wise Deep Learning for Food Ingredient Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

Predicting Content Similarity via Multimodal Modeling for Video-In-Video Advertising.

[BibT_eX]

[DOI]

Xue Song

IEEE Trans. Circuits Syst. Video Technol., 2021

Pixel2Mesh: 3D Mesh Model Generation via Image Guided Deformation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2021

DB-LSTM: Densely-connected Bi-directional LSTM for human action recognition.

[BibT_eX]

[DOI]

Neurocomputing, 2021

A Coarse-to-Fine Framework for Resource Efficient Video Recognition.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2021

Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation.

[BibT_eX]

[DOI]

CoRR, 2021

Efficient Video Transformers with Spatial-Temporal Token Selection.

[BibT_eX]

[DOI]

CoRR, 2021

M2TR: Multi-modal Multi-scale Transformers for Deepfake Detection.

[BibT_eX]

[DOI]

CoRR, 2021

HMS: Hierarchical Modality Selection for Efficient Video Recognition.

[BibT_eX]

[DOI]

CoRR, 2021

What Do Deep Nets Learn? Class-wise Patterns Revealed in the Input Space.

[BibT_eX]

[DOI]

CoRR, 2021

A Multimodal Framework for Video Ads Understanding.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Visual Co-Occurrence Alignment Learning for Weakly-Supervised Video Moment Retrieval.

[BibT_eX]

[DOI]

Zheng Wang

Jingjing Chen

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Two-stage Visual Cues Enhancement Network for Referring Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data.

[BibT_eX]

[DOI]

Yuqian Fu

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Can Action be Imitated? Learn to Reconstruct and Transfer Human Dynamics from Videos.

[BibT_eX]

[DOI]

Yuqian Fu

Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Bag of Tricks for Building an Accurate and Slim Object Detector for Embedded Applications.

[BibT_eX]

[DOI]

Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Revisiting Adversarial Robustness Distillation: Robust Soft Labels Make Student Better.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

VideoLT: Large-scale Long-tailed Video Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Motion Guided Region Message Passing for Video Captioning.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Towards Bridging Event Captioner and Sentence Localizer for Weakly Supervised Dense Event Captioning.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

A Multi-Task Neural Approach for Emotion Attribution, Classification, and Summarization.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2020

Re-Caption: Saliency-Enhanced Image Captioning Through Two-Phase Learning.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Pose-Guided Person Image Synthesis in the Non-Iconic Views.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Learning Layer-Skippable Inference Network.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Deep Ranking for Image Zero-Shot Multi-Label Classification.

[BibT_eX]

[DOI]

Timothy M. Hospedales

IEEE Trans. Image Process., 2020

Learning to Score Figure Skating Sport Videos.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2020

Matching Image and Sentence With Multi-Faceted Representations.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2020

Object Detection from Scratch with Deep Supervision.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2020

Leader-Based Multi-Scale Attention Deep Architecture for Person Re-Identification.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2020

Vocabulary-Informed Zero-Shot and Open-Set Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2020

Extreme vocabulary learning.

[BibT_eX]

[DOI]

Frontiers Comput. Sci., 2020

Colonoscopy Polyp Detection: Domain Adaptation From Medical Report Images to Real-time Videos.

[BibT_eX]

[DOI]

CoRR, 2020

Imbalanced Gradients: A New Cause of Overestimated Adversarial Robustness.

[BibT_eX]

[DOI]

CoRR, 2020

Learning to Augment Expressions for Few-shot Fine-grained Facial Expression Recognition.

[BibT_eX]

[DOI]

CoRR, 2020

Recurrent Memory Reasoning Network for Expert Finding in Community Question Answering.

[BibT_eX]

[DOI]

Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

Instance Image Retrieval with Generative Adversarial Training.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

WildDeepfake: A Challenging Real-World Dataset for Deepfake Detection.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Video Relation Detection via Multiple Hypothesis Association.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multi-modal Cooking Workflow Construction for Food Recipes.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Person-level Action Recognition in Complex Events via TSD-TSM Networks.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Visual Relations Augmented Cross-modal Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

Learning Modality Interaction for Temporal Sentence Localization and Event Captioning in Videos.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Clean-Label Backdoor Attacks on Video Recognition Models.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

FM2u-Net: Face Morphological Multi-Branch Network for Makeup-Invariant Face Verification.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Hyperbolic Visual Embedding Learning for Zero-Shot Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Sketch-BERT: Learning Sketch Bidirectional Encoder Representation From Transformers by Self-Supervised Learning of Sketch Gestalt.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Long-Term Cloth-Changing Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Heuristic Black-Box Adversarial Attacks on Video Recognition Models.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Feature Deformation Meta-Networks in Image Captioning of Novel Objects.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Visual Content Recognition by Exploiting Semantic Feature Map with Attention and Multi-task Learning.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2019

Dense Dilated Network for Video Action Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2019

Multi-Level Semantic Feature Augmentation for One-Shot Learning.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2019

Social Anchor-Unit Graph Regularized Tensor Completion for Large-Scale Image Retagging.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2019

Reformulating natural language queries using sequence-to-sequence models.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2019

FDU Participation in TRECVID 2019 VTT Task.

[BibT_eX]

[DOI]

Proceedings of the 2019 TREC Video Retrieval Evaluation, 2019

Hot Topic-Aware Retweet Prediction with Masked Self-attentive Model.

[BibT_eX]

[DOI]

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

LiteEval: A Coarse-to-Fine Framework for Resource Efficient Video Recognition.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Comp-GAN: Compositional Generative Adversarial Network in Synthesizing and Recognizing Facial Expression.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

TC-Net for iSBIR: Triplet Classification Network for Instance-level Sketch Based Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Black-box Adversarial Attacks on Video Recognition Models.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Towards Optimal CNN Descriptors for Large-Scale Image Retrieval.

[BibT_eX]

[DOI]

Yinzheng Gu

Chuanpeng Li

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Embodied One-Shot Video Recognition: Learning from Actions of a Virtual Embodied Agent.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Sparse Temporal Causal Convolution for Efficient Action Modeling.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

TC-GAN: Triangle Cycle-Consistent GANs for Face Frontalization with Facial Features Preserved.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Take Goods from Shelves: A Dataset for Class-Incremental Object Detection.

[BibT_eX]

[DOI]

Yu Hao

Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

CNN-Based Chinese NER with Lexicon Rethinking.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Deep Learning for Video Captioning: A Review.

[BibT_eX]

[DOI]

Ting Yao

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Smart Advertising in Videos Based on Comprehensive Content Analytics.

[BibT_eX]

[DOI]

Yi Zhang

Fan Luan

Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

An End-to-End Architecture for Class-Incremental Object Detection with Knowledge Distillation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Composite Binary Decomposition Networks.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Trainable Undersampling for Class-Imbalance Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Semantic Proposal for Activity Localization in Videos via Sentence Query.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Motion Guided Spatial Attention for Video Captioning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Image Block Augmentation for One-Shot Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

DeepProduct: Mobile Product Search With Portable Deep Features.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2018

Editorial IEEE Transactions on Multimedia Special Section on Video Analytics: Challenges, Algorithms, and Applications.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2018

Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2018

NAIS: Neural Attentive Item Similarity Model for Recommendation.

[BibT_eX]

[DOI]

IEEE Trans. Knowl. Data Eng., 2018

Hookworm Detection in Wireless Capsule Endoscopy Images With Deep Learning.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2018

Image Classification With Tailored Fine-Grained Dictionaries.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2018

Heterogeneous Knowledge Transfer in Video Emotion Recognition, Attribution and Summarization.

[BibT_eX]

[DOI]

IEEE Trans. Affect. Comput., 2018

Recent Advances in Zero-Shot Recognition: Toward Data-Efficient Understanding of Visual Content.

[BibT_eX]

[DOI]

IEEE Signal Process. Mag., 2018

Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2018

Stacked multichannel autoencoder - an efficient way of learning from synthetic data.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2018

Learning part-based mid-level representation for visual recognition.

[BibT_eX]

[DOI]

Neurocomputing, 2018

Learning to Separate Domains in Generalized Zero-Shot and Open Set Learning: a probabilistic perspective.

[BibT_eX]

[DOI]

CoRR, 2018

Semantic Feature Augmentation in Few-shot Learning.

[BibT_eX]

[DOI]

CoRR, 2018

Learning to score and summarize figure skating sport videos.

[BibT_eX]

[DOI]

CoRR, 2018

Dense Dilated Network for Few Shot Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval, 2018

Harnessing Synthesized Abstraction Images to Improve Facial Attribute Recognition.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Non-local NetVLAD Encoding for Video Classification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Pose-Normalized Image Generation for Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Recurrent Fusion Network for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Dual Skipping Networks.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Generating Keyword Queries for Natural Language Queries to Alleviate Lexical Chasm Problem.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Cross-Domain Sentiment Classification with Target Domain Specific Information.

[BibT_eX]

[DOI]

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Deep learning for video classification and captioning.

[BibT_eX]

[DOI]

Proceedings of the Frontiers of Multimedia Research, 2018

2017

The THUMOS challenge on action recognition for videos "in the wild".

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2017

Left-Right Skip-DenseNets for Coarse-to-Fine Object Categorization.

[BibT_eX]

[DOI]

CoRR, 2017

Recent Advances in Zero-shot Recognition.

[BibT_eX]

[DOI]

CoRR, 2017

Aggregating Frame-level Features for Large-Scale Video Classification.

[BibT_eX]

[DOI]

CoRR, 2017

Learning Semantic Feature Map for Visual Content Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Learning to Generate and Edit Hairstyles.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

LSVC2017: Large-Scale Video Classification Challenge.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

VSCC'2017: Visual Analysis for Smart and Connected Communities.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Sketch Recognition with Deep Visual-Sequential Fusion Model.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Adaptively Weighted Multi-task Deep Network for Person Attribute Classification.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Learning Fashion Compatibility with Bidirectional LSTMs.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Multi-task Deep Neural Network for Joint Face Recognition and Facial Attribute Prediction.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Frame-Transformer Emotion Classification Network.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Iterative object and part transfer for fine-grained recognition.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

DSOD: Learning Deeply Supervised Object Detectors from Scratch.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Multi-scale Deep Learning Architectures for Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Weakly Supervised Dense Video Captioning.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Adaptive Proximal Average Approximation for Composite Convex Minimization.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

Hierarchical Visualization of Video Search Results for Topic-Based Browsing.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2016

Partial Copy Detection in Videos: A Benchmark and an Evaluation of Popular Methods.

[BibT_eX]

[DOI]

Jiajun Wang

IEEE Trans. Big Data, 2016

Flexible multi-task learning with latent task grouping.

[BibT_eX]

[DOI]

Neurocomputing, 2016

Multiple task learning with flexible structure regularization.

[BibT_eX]

[DOI]

Neurocomputing, 2016

A Bayesian Hashing approach and its application to face recognition.

[BibT_eX]

[DOI]

Neurocomputing, 2016

Web video categorization using category-predictive classifiers and category-specific concept classifiers.

[BibT_eX]

[DOI]

Neurocomputing, 2016

Fast Summarization of User-Generated Videos: Exploiting Semantic, Emotional, and Quality Clues.

[BibT_eX]

[DOI]

Xi Wang

IEEE Multim., 2016

Deep Learning for Video Classification and Captioning.

[BibT_eX]

[DOI]

CoRR, 2016

NTTFudan Team @ TRECVID 2016: Multimedia Event Detection.

[BibT_eX]

[DOI]

Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016

Multi-Stream Multi-Class Fusion of Deep Networks for Video Classification.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Exploiting Objects with LSTMs for Video Categorization.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Binary Optimized Hashing.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Emotion in Context: Deep Semantic Feature Fusion for Video Emotion Recognition.

[BibT_eX]

[DOI]

Chen Chen

Zuxuan Wu

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Video Emotion Recognition with Transferred Deep Feature Encodings.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Matching User Photos to Online Products with Robust Deep Features.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

BigVid at MediaEval 2016: Predicting Interestingness in Images and Videos.

[BibT_eX]

[DOI]

Proceedings of the Working Notes Proceedings of the MediaEval 2016 Workshop, 2016

On Stochastic Primal-Dual Hybrid Gradient Approach for Compositely Regularized Minimization.

[BibT_eX]

[DOI]

Proceedings of the ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands, 2016

Harnessing Object and Scene Semantics for Large-Scale Video Understanding.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Regional Gating Neural Networks for Multi-label Image Classification.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference 2016, 2016

2015

Super Fast Event Recognition in Internet Videos.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2015

Human Action Recognition in Unconstrained Videos by Explicit Motion Modeling.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2015

CHCF: A Cloud-Based Heterogeneous Computing Framework for Large-Scale Image Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2015

GPU-based MapReduce for large-scale near-duplicate video retrieval.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2015

A relative similarity based method for interactive patient risk prediction.

[BibT_eX]

[DOI]

Data Min. Knowl. Discov., 2015

Fusing Multi-Stream Deep Networks for Video Classification.

[BibT_eX]

[DOI]

CoRR, 2015

Fudan at TRECVID 2015: Adaptive Feature Fusion for Multimedia Event Detection in Videos.

[BibT_eX]

[DOI]

Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015

NTT-Fudan Team @ TRECVID 2015: Multimedia Event Detection.

[BibT_eX]

[DOI]

Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015

Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

ASM'15: The 1st International Workshop on Affect and Sentiment in Multimedia.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Evaluating Two-Stream CNN for Video Classification.

[BibT_eX]

[DOI]

Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Fudan-Huawei at MediaEval 2015: Detecting Violent Scenes and Affective Impact in Movies with Deep Learning.

[BibT_eX]

[DOI]

Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

Portfolio Choices with Orthogonal Bandit Learning.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Optimal Bayesian Hashing for Efficient Face Recognition.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

VSD2014: A dataset for violent scenes detection in hollywood movies and web videos.

[BibT_eX]

[DOI]

Proceedings of the 13th International Workshop on Content-Based Multimedia Indexing, 2015

Categorizing Big Video Data on the Web: Challenges and Opportunities.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

2014

Placing Videos on a Semantic Hierarchy for Search Result Navigation.

[BibT_eX]

[DOI]

Song Tan

ACM Trans. Multim. Comput. Commun. Appl., 2014

Video Event Detection Using Motion Relativity and Feature Selection.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2014

Guest Editorial Special Section on Socio-Mobile Media Analysis and Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2014

Learning Multiple Relative Attributes With Humans in the Loop.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

Special issue on Multimedia Event Detection.

[BibT_eX]

[DOI]

Mach. Vis. Appl., 2014

Discovering joint audio-visual codewords for video event detection.

[BibT_eX]

[DOI]

Mach. Vis. Appl., 2014

Name-Face Association in Web Videos: A Large-Scale Dataset, Baselines, and Open Issues.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 2014

A Framework of Video Coding for Compressing Near-Duplicate Videos.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014

Exploring Inter-feature and Inter-class Relationships with Deep Neural Networks for Video Classification.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Organizing Video Search Results to Adapted Semantic Hierarchies for Topic-based Browsing.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Real-time summarization of user-generated videos based on semantic recognition.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

The MediaEval 2014 Affect Task: Violent Scenes Detection.

[BibT_eX]

[DOI]

Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Fudan-NJUST at MediaEval 2014: Violent Scenes Detection Using Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Challenge Huawei challenge: Fusing multimodal features with deep neural networks for Mobile Video Annotation.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

News Credibility Evaluation on Microblog with a Hierarchical Propagation Model.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

Which Looks Like Which: Exploring Inter-class Relationships in Fine-Grained Visual Categorization.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2014, 2014

VCDB: A Large-Scale Database for Partial Copy Detection in Videos.

[BibT_eX]

[DOI]

Yudong Jiang

Jiajun Wang

Proceedings of the Computer Vision - ECCV 2014, 2014

Benchmarking Violent Scenes Detection in movies.

[BibT_eX]

[DOI]

Proceedings of the 12th International Workshop on Content-Based Multimedia Indexing, 2014

Predicting Emotions in User-Generated Videos.

[BibT_eX]

[DOI]

Xiangyang Xue

Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013

Query-Adaptive Image Search With Hash Codes.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2013

High-level event recognition in unconstrained videos.

[BibT_eX]

[DOI]

Subhabrata Bhattacharya

Mubarak Shah

Int. J. Multim. Inf. Retr., 2013

Strong geometrical consistency in large scale partial-duplicate image search.

[BibT_eX]

[DOI]

Junqiang Wang

Jinhui Tang

Proceedings of the ACM Multimedia Conference, 2013

Beauty is here: evaluating aesthetics in videos using multimodal features and free training data.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

The MediaEval 2013 Affect Task: Violent Scenes Detection.

[BibT_eX]

[DOI]

Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

Fudan at MediaEval 2013: Violent Scenes Detection Using Motion Features and Part-Level Attributes.

[BibT_eX]

[DOI]

Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

Multiple Task Learning Using Iteratively Reweighted Least Square.

[BibT_eX]

[DOI]

Proceedings of the IJCAI 2013, 2013

Learning Hash Codes with Listwise Supervision.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2013

Understanding and Predicting Interestingness of Videos.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012

Sampling and Ontologically Pooling Web Images for Visual Concept Learning.

[BibT_eX]

[DOI]

Shiai Zhu

IEEE Trans. Multim., 2012

Fast Semantic Diffusion for Large-Scale Context-Based Image and Video Annotation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2012

A fast video event recognition system and its application to video search.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Joint audio-visual bi-modal codewords for video event detection.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Multimedia Retrieval, 2012

SUPER: towards real-time event recognition in internet videos.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Multimedia Retrieval, 2012

The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Trajectory-based Features.

[BibT_eX]

[DOI]

Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

Learning Hybrid Part Filters for Scene Recognition.

[BibT_eX]

[DOI]

Yingbin Zheng

Xiangyang Xue

Proceedings of the Computer Vision - ECCV 2012, 2012

Trajectory-Based Modeling of Human Actions with Motion Reference Points.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2012, 2012

Supervised hashing with kernels.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011

Concept-Driven Multi-Modality Fusion for Video Search.

[BibT_eX]

[DOI]

Xiao-Yong Wei

IEEE Trans. Circuits Syst. Video Technol., 2011

Modeling Scene and Object Contexts for Human Action Retrieval With Few Examples.

[BibT_eX]

[DOI]

Zhenguo Li

IEEE Trans. Circuits Syst. Video Technol., 2011

The MediaMill TRECVID 2011 Semantic Video Search Engine.

[BibT_eX]

[DOI]

Cees G. M. Snoek

Koen E. A. van de Sande

Arnold W. M. Smeulders

Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

On the pooling of positive examples with ontology for visual concept learning.

[BibT_eX]

[DOI]

Shiai Zhu

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Towards textually describing complex video contents with audio-visual concept classifiers.

[BibT_eX]

[DOI]

Chun Chet Tan

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Consumer video understanding: a benchmark database and an evaluation of human and machine performance.

[BibT_eX]

[DOI]

Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

Lost in binarization: query-adaptive ranking for similar image search with compact codes.

[BibT_eX]

[DOI]

Jun Wang

Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

Noise resistant graph ranking for improved web image search.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010

Representations of Keypoint-Based Semantic Concept Detection: A Comprehensive Study.

[BibT_eX]

[DOI]

Jun Yang

Alexander G. Hauptmann

IEEE Trans. Multim., 2010

Columbia-UCF TRECVID2010 Multimedia Event Detection: Combining Multiple Modalities, Contextual Concepts, and Temporal Matching.

[BibT_eX]

[DOI]

Subhabrata Bhattacharya

Mubarak Shah

Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

On the sampling of web images for learning visual concept classifiers.

[BibT_eX]

[DOI]

Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

2009

Visual word proximity and linguistics for semantic video indexing and near-duplicate retrieval.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2009

VIREO/DVMM at TRECVID 2009: High-Level Feature Extraction, Automatic Video Search, and Content-Based Copy Detection.

[BibT_eX]

[DOI]

Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

Brain state decoding for rapid image retrieval.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Semantic context transfer across heterogeneous sources for domain adaptive video search.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Domain adaptive semantic diffusion for large scale context-based video annotation.

[BibT_eX]

[DOI]

Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Label diagnosis through self tuning forweb image search.

[BibT_eX]

[DOI]

Jun Wang

Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Exploring inter-concept relationship with context space for semantic video indexing.

[BibT_eX]

[DOI]

Xiao-Yong Wei

Proceedings of the 8th ACM International Conference on Image and Video Retrieval, 2009

2008

Selection of Concept Detectors for Video Search by Ontology-Enriched Semantic Spaces.

[BibT_eX]

[DOI]

Xiao-Yong Wei

IEEE Trans. Multim., 2008

Beyond Semantic Search: What You Observe May Not Be What You Think.

[BibT_eX]

[DOI]

Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

Columbia University/VIREO-CityU/IRIT TRECVID2008 High-Level Feature Extraction and Interactive Video Search.

[BibT_eX]

[DOI]

Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

Bag-of-visual-words expansion using visual relatedness for video indexing.

[BibT_eX]

[DOI]

Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

Video event detection using motion relativity and visual relatedness.

[BibT_eX]

[DOI]

Feng Wang

Proceedings of the 16th International Conference on Multimedia 2008, 2008

Ontology-based visual word matching for near-duplicate retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

2007

Experimenting VIREO-374: Bag-of-Visual-Words and Visual-Based Ontology for Semantic Video Indexing and search.

[BibT_eX]

[DOI]

Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

Evaluating bag-of-visual-words representations in scene classification.

[BibT_eX]

[DOI]

Jun Yang

Alexander G. Hauptmann

Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

Towards optimal bag-of-features for object categorization and semantic video retrieval.

[BibT_eX]

[DOI]

Jun Yang

Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

2006

Modeling Local Interest Points for Semantic Detection and Video Search at TRECVID 2006.

[BibT_eX]

[DOI]

Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Fast tracking of near-duplicate keyframes in broadcast domain with transitivity propagation.

[BibT_eX]

[DOI]

Wanlei Zhao

Proceedings of the 14th ACM International Conference on Multimedia, 2006

Keyframe Retrieval by Keypoints: Can Point-to-Point Matching Help?.

[BibT_eX]

[DOI]

Wanlei Zhao