Xiaoshuai Sun

According to our database1, Xiaoshuai Sun authored at least 130 papers between 2008 and 2020.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2020
Similarity-Preserving Linkage Hashing for Online Image Retrieval.
IEEE Trans. Image Process., 2020

Deep Saliency Hashing for Fine-Grained Retrieval.
IEEE Trans. Image Process., 2020

TVENet: Temporal variance embedding network for fine-grained action representation.
Pattern Recognit., 2020

What is damaged: a benchmark dataset for abnormal traffic object classification.
Multim. Tools Appl., 2020

Actionness-pooled Deep-convolutional Descriptor for fine-grained action recognition.
Neurocomputing, 2020

Multi-Task Collaborative Network for Joint Referring Expression Comprehension and Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

SSAH: Semi-Supervised Adversarial Deep Hashing with Self-Paced Hard Sample Generation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Discovering Latent Discriminative Patterns for Multi-Mode Event Representation.
IEEE Trans. Multimedia, 2019

Correntropy-Induced Robust Low-Rank Hypergraph.
IEEE Trans. Image Process., 2019

Gradual recovery based occluded digit images recognition.
Multim. Tools Appl., 2019

Action recognition with multi-scale trajectory-pooled 3D convolutional descriptors.
Multim. Tools Appl., 2019

Robust ℓ2-Hypergraph and its applications.
Inf. Sci., 2019

Unsupervised semantic deep hashing.
Neurocomputing, 2019

A Real-time Global Inference Network for One-stage Referring Expression Comprehension.
CoRR, 2019

SSAH: Semi-supervised Adversarial Deep Hashing with Self-paced Hard Sample Generation.
CoRR, 2019

Hadamard Codebook Based Deep Hashing.
CoRR, 2019

Toward 3D Object Reconstruction from Stereo Images.
CoRR, 2019

Sketch-Specific Data Augmentation for Freehand Sketch Recognition.
CoRR, 2019

Deep Semantic Parsing of Freehand Sketches with Homogeneous Transformation, Soft-Weighted Loss, and Staged Learning.
CoRR, 2019

Semantic-aware Image Deblurring.
CoRR, 2019

Scene-based Factored Attention for Image Captioning.
CoRR, 2019

Semi-Supervised Adversarial Monocular Depth Estimation.
CoRR, 2019

Supervised Online Hashing via Similarity Distribution Learning.
CoRR, 2019

Hadamard Matrix Guided Online Hashing.
CoRR, 2019

Pix2Vox: Context-aware 3D Reconstruction from Single and Multi-view Images.
CoRR, 2019

Social Media Based Topic Modeling for Smart Campus: A Deep Topical Correlation Analysis Method.
IEEE Access, 2019

Information Competing Process for Learning Diversified Representations.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Variational Structured Semantic Inference for Diverse Image Captioning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Multi-modal Multi-layer Fusion Network with Average Binary Center Loss for Face Anti-spoofing.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Hypergraph Induced Convolutional Manifold Networks.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

A Video Post-Filter Deblocking Method Based on Temporal Boosting Residual Networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Pix2Vox: Context-Aware 3D Reconstruction From Single and Multi-View Images.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Towards Cross-modality Topic Modelling via Deep Topical Correlation Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2019

Dynamic Capsule Attention for Visual Question Answering.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Free VQA Models from Knowledge Inertia by Pairwise Inconformity Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Towards Optimal Fine Grained Retrieval via Decorrelated Centralized Loss with Normalize-Scale Layer.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Towards Optimal Discrete Online Hashing with Balanced Similarity.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Two-Stream 3-D convNet Fusion for Action Recognition in Videos With Arbitrary Size and Length.
IEEE Trans. Multimedia, 2018

Distinctive action sketch for human action recognition.
Signal Process., 2018

Event patches: Mining effective parts for event detection and understanding.
Signal Process., 2018

Exploring part-aware segmentation for fine-grained visual categorization.
Multim. Tools Appl., 2018

Rediscover flowers structurally.
Multim. Tools Appl., 2018

Hierarchical semantic image matching using CNN feature pyramid.
Comput. Vis. Image Underst., 2018

Semantic and Contrast-Aware Saliency.
CoRR, 2018

The Effectiveness of Instance Normalization: a Strong Baseline for Single Image Dehazing.
CoRR, 2018

Centralized Ranking Loss with Weakly Supervised Localization for Fine-Grained Object Retrieval.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Add: Actionness-Pooled Deep-Convolutional Descriptor.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Cycle-Consistency Based Hierarchical Dense Semantic Correspondence.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Illustrate your travel notes: web-based story visualization.
Proceedings of the 10th International Conference on Internet Multimedia Computing and Service, 2018

Weighted voxel: a novel voxel representation for 3D reconstruction.
Proceedings of the 10th International Conference on Internet Multimedia Computing and Service, 2018

Restricted Boltzmann Machine Based Active Learning for Sparse Recommendation.
Proceedings of the Database Systems for Advanced Applications, 2018

GroupCap: Group-Based Image Captioning With Structured Relevance and Diversity Constraints.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Strong Baseline for Single Image Dehazing with Deep Features and Instance Normalization.
Proceedings of the British Machine Vision Conference 2018, 2018

2017
Dancelets Mining for Video Recommendation Based on Dance Styles.
IEEE Trans. Multimedia, 2017

Hierarchical Latent Concept Discovery for Video Event Detection.
IEEE Trans. Image Process., 2017

Breaking video into pieces for action recognition.
Multim. Tools Appl., 2017

Anomaly detection based on spatio-temporal sparse representation and visual attention analysis.
Multim. Tools Appl., 2017

Exploiting the complementary strengths of multi-layer CNN features for image retrieval.
Neurocomputing, 2017

Actor identification via mining representative actions.
Neurocomputing, 2017

Shallow and Deep Model Investigation for Distinguishing Corn and Weeds.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Object Discovery and Cosegmentation Based on Dense Correspondences.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Multi-scale Discriminative Patches for Fined-Grained Visual Categorization.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Trajectory-Pooled 3D Convolutional Descriptors for Action Recognition.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Gated additive skip context connection for object detection.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Dancing like a superstar: Action guidance based on pose estimation and conditional pose alignment.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

SPTF: A Scalable Probabilistic Tensor Factorization Model for Semantic-Aware Behavior Prediction.
Proceedings of the 2017 IEEE International Conference on Data Mining, 2017

An Integrated Model for Effective Saliency Prediction.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Web-Based Semantic Fragment Discovery for On-Line Lingual-Visual Similarity.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Robust spatial-temporal deep model for multimedia event detection.
Neurocomputing, 2016

Unsupervised discovery of crowd activities by saliency-based clustering.
Neurocomputing, 2016

Quartet-net Learning for Visual Instance Retrieval.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Mining representative actions for actor identification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
深度学习中的自编码器的表达能力研究 (Representation Ability Research of Auto-encoders in Deep Learning).
计算机科学, 2015

Strategy for dynamic 3D depth data matching towards robust action retrieval.
Neurocomputing, 2015

Strategy for aesthetic photography recommendation via collaborative composition model.
IET Comput. Vis., 2015

Part-Aware Segmentation for Fine-Grained Categorization.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

"Clustering of Dancelets": Towards Video Recommendation Based on Dance Styles.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Distinctive action sketch.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Predicting discrete probability distribution of image emotions.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Dual-mode video stabilization based on adaptive motion clustering.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

Boost sparse coding based abnormal event detection via explicitly applying temporal continuity constraint.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

2014
Toward Statistical Modeling of Saccadic Eye-Movement and Visual Saliency.
IEEE Trans. Image Process., 2014

Where should I stand? Learning based human position recommendation for mobile photographing.
Multim. Tools Appl., 2014

Using Label Propagation to Get Confidence Map for Segmentation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014

Exploring Principles-of-Art Features For Image Emotion Recognition.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Exploring covert attention for generic boosting of saliency models.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Structure-aware multi-object discovery for weakly supervised tracking.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

"Clustering by saliency" - Unsupervised discovery of crowd activities.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Discriminative Features for Bird Species Classification.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

2013
Bidirectional-isomorphic manifold learning at image semantic understanding & representation.
Multim. Tools Appl., 2013

Visual attention modeling based on short-term environmental adaption.
J. Vis. Commun. Image Represent., 2013

Video classification and recommendation based on affective analysis of viewers.
Neurocomputing, 2013

Flexible Presentation of Videos Based on Affective Content Analysis.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

On dense sampling size.
Proceedings of the IEEE International Conference on Image Processing, 2013

Exploring Implicit Image Statistics for Visual Representativeness Modeling.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Context-Aware Semi-Local Feature Detector.
ACM Trans. Intell. Syst. Technol., 2012

Task-Dependent Visual-Codebook Compression.
IEEE Trans. Image Process., 2012

Action retrieval based on generalized dynamic depth data matching.
Proceedings of the 2012 Visual Communications and Image Processing, 2012

Action Segmentation in Dance Videos.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

Real-Time Viewfinder Composition Assessment and Recommendation to Mobile Photographing.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

Memorable basis: towards human-centralized sparse representation.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Aesthetic composition represetation for portrait photographing recommendation.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

What are we looking for: Towards statistical modeling of saccadic eye movements and visual saliency.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Actor-independent action search using spatiotemporal vocabulary with appearance hashing.
Pattern Recognit., 2011

Video indexing and recommendation based on affective analysis of viewers.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Unsupervised fast anomaly detection in crowds.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Learning heterogeneous data for hierarchical web video classification.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Sparse representation based visual element analysis.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Video stabilization based on saliency driven SIFT matching and discriminative RANSAC.
Proceedings of the ICIMCS 2011, 2011

Contextual dictionaries for image super resolution.
Proceedings of the ICIMCS 2011, 2011

A spatiotemporal context phrase description for general dynamic texture.
Proceedings of the ICIMCS 2011, 2011

Affective Video Classification Based on Spatio-temporal Feature Fusion.
Proceedings of the Sixth International Conference on Image and Graphics, 2011

Saliency Detection: A Self-Adaption Sparse Representation Approach.
Proceedings of the Sixth International Conference on Image and Graphics, 2011

2010
A rotation and scale invariant texture description approach.
Proceedings of the Visual Communications and Image Processing 2010, 2010

Saliency detection based on short-term sparse representation.
Proceedings of the International Conference on Image Processing, 2010

Visual saliency as sequential eye fixation probability.
Proceedings of the International Conference on Image Processing, 2010

A robust texture descriptor using multifractal analysis with Gabor filter.
Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010

Visual topic model for web image annotation.
Proceedings of the Second International Conference on Internet Multimedia Computing and Service, 2010

Mining actor correlations with hierarchical concurrence parsing.
Proceedings of the IEEE International Conference on Acoustics, 2010

Towards semantic embedding in visual vocabulary.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
Visual and textual fusion for semantically supervised region-based retrieval.
Multimedia Syst., 2009

Photo assessment based on computational visual attention model.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

What is a complete set of keywords for image description & annotation on the web.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

VisualCor system: search actor correlations in TV series.
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

2008
Vision-Based Semi-supervised Homecare with Spatial Constraint.
Proceedings of the Advances in Multimedia Information Processing, 2008

Attention-driven action retrieval with DTW-based 3d descriptor matching.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Place retrieval with graph-based place-view model.
Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008

Cross-media manifold learning for image retrieval & annotation.
Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008

Directional correlation analysis of local Haar binary pattern for text detection.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Text Particles Multi-band Fusion for Robust Text Detection.
Proceedings of the Image Analysis and Recognition, 5th International Conference, 2008


  Loading...