Wu Liu

Orcid: 0000-0003-1633-7575

Affiliations:
  • University of Science and Technology of China, Heifei, Anhui, China


According to our database1, Wu Liu authored at least 178 papers between 2012 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
CyberJurors: A Multi-Agent Simulation Task for E-Commerce Disputes Verdict.
CoRR, May, 2026

Rotation-Invariant Spherical Watermarking via Third-Order SO(3) Representation Coupling.
CoRR, May, 2026

GASim: A Graph-Accelerated Hybrid Framework for Social Simulation.
CoRR, May, 2026

Heterogeneous Consensus-Progressive Reasoning for Efficient Multi-Agent Debate.
CoRR, April, 2026

EMS: Multi-Agent Voting via Efficient Majority-then-Stopping.
CoRR, April, 2026

A Paradigm Shift: Fully End-to-End Training for Temporal Sentence Grounding in Videos.
CoRR, April, 2026

Bitscaling: Streamlining neural network compression via predictive multi-scale growth of mixed-precision networks.
Neural Networks, 2026

AdaptiveMamba: Comprehensive visual representation learning with adaptive semantic perception.
Knowl. Based Syst., 2026

FHNet: Frequency guided hierarchical optimization network for few-shot fine-grained classification of spatially dispersed and aggregated data.
Inf. Fusion, 2026

Multi-Granularity Multi-Modal Knowledge Graph Representation Learning via Subgraph-Aware Adaptive Fusion and Hierarchical Relation Modeling.
Proceedings of the ACM Web Conference 2026, 2026

Enhancing Federated Class-Incremental Learning via Spatial-Temporal Statistics Aggregation.
Proceedings of the ACM Web Conference 2026, 2026

GUI-Eyes: Tool-Augmented Perception for Visual Grounding in GUI Agents.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Region-Constraint In-Context Generation for Instructional Video Editing.
CoRR, December, 2025

PopSim: Social Network Simulation for Social Media Popularity Prediction.
CoRR, December, 2025

ELMM: Efficient Lightweight Multimodal Large Language Models for Multimodal Knowledge Graph Completion.
CoRR, October, 2025

RumorSphere: A Framework for Million-scale Agent-based Dynamic Simulation of Rumor Propagation.
CoRR, September, 2025

SSFMamba: Symmetry-driven Spatial-Frequency Feature Fusion for 3D Medical Image Segmentation.
CoRR, August, 2025

Single-Frame Supervision for Spatio-Temporal Video Grounding.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2025

DynamiX: Large-Scale Dynamic Social Network Simulator.
CoRR, July, 2025

From Global to Hybrid: A Review of Supervised Deep Learning for 2-D Image Feature Representation.
IEEE Trans. Artif. Intell., June, 2025

STSA: Federated Class-Incremental Learning via Spatial-Temporal Statistics Aggregation.
CoRR, June, 2025

HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation.
CoRR, March, 2025

DRC: Discrete Representation Classifier With Salient Features via Fixed-Prototype.
IEEE Trans. Circuits Syst. Video Technol., January, 2025

TalkingAvatar: Learning 3D talking human avatar via NeRF.
Neurocomputing, 2025

LSF-Animation: Label-Free Speech-Driven Facial Animation via Implicit Feature Representation.
Proceedings of the SIGGRAPH Asia 2025 Conference Papers, 2025

Asymmetric Pre-aligned Anchor Contrastive Enhanced Diffusion Hashing Model for Incomplete Multimodal Retrieval.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Dynamic Self-adaptive Multiscale Distillation from Pre-trained Multimodal Large Model for Efficient Cross-modal Retrieval.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Edit-by-Example: Adaptive Exemplar-Based Image Editing.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

High-Fidelity Object Removal through Boosting Diffusion Processes.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2025

PlugMark: A Plug-In Zero-Watermarking Framework for Diffusion Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

MotionPro: A Precise Motion Controller for Image-to-Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

A Semantic Knowledge Complementarity based Decoupling Framework for Semi-supervised Class-imbalanced Medical Image Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
SigFormer: Sparse Signal-guided Transformer for Multi-modal Action Segmentation.
ACM Trans. Multim. Comput. Commun. Appl., August, 2024

Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

A Generative-Based Image Fusion Strategy for Visible-Infrared Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., January, 2024

Learning Monocular Regression of 3D People in Crowds via Scene-Aware Blending and De-Occlusion.
IEEE Trans. Multim., 2024

CDKM: Common and Distinct Knowledge Mining Network With Content Interaction for Dense Captioning.
IEEE Trans. Multim., 2024

OmniPrism: Learning Disentangled Visual Concept for Image Generation.
CoRR, 2024

T-SVG: Text-Driven Stereoscopic Video Generation.
CoRR, 2024

LMAgent: A Large-scale Multimodal Agents Society for Multi-user Simulation.
CoRR, 2024

It Takes Two: Accurate Gait Recognition in the Wild via Cross-granularity Alignment.
CoRR, 2024

Motion Capture from Inertial and Vision Sensors.
CoRR, 2024

Prompting Industrial Anomaly Segment with Large Vision-Language Models.
Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024

Watermarking Vision-Language Models.
Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024

It Takes Two: Accurate Gait Recognition in the Wild via Cross-granularity Alignment.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

ADDG: An Adaptive Domain Generalization Framework for Cross-Plane MRI Segmentation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

CLaM: An Open-Source Library for Performance Evaluation of Text-driven Human Motion Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

HCMA'24: The 5th International Workshop on Human-centric Multimedia Analysis Summary.
Proceedings of the 5th International Workshop on Human-centric Multimedia Analysis, 2024

AnimateAnywhere: Context-Controllable Human Video Generation with ID-Consistent One-shot Learning.
Proceedings of the 5th International Workshop on Human-centric Multimedia Analysis, 2024

Norma: A Noise Robust Memory-Augmented Framework for Whole Slide Image Classification.
Proceedings of the Computer Vision - ECCV 2024, 2024

ChatVTG: Video Temporal Grounding via Chat with Video Dialogue Large Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HumanNeRF-SE: A Simple yet Effective Approach to Animate HumanNeRF with Diverse Poses.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Learning a Dynamic Neural Human via Poses Guided Dual Spaces Feature.
Proceedings of the IEEE International Conference on Advanced Video and Signal Based Surveillance, 2024

2023
Bird-Count: a multi-modality benchmark and system for bird population counting in the wild.
Multim. Tools Appl., December, 2023

Introduction to the Special Issue on Trustworthy Multimedia Computing and Applications in Urban Scenes.
ACM Trans. Multim. Comput. Commun. Appl., November, 2023

Semantic Embedding Guided Attention with Explicit Visual Feature Fusion for Video Captioning.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Cross-Modality Transformer With Modality Mining for Visible-Infrared Person Re-Identification.
IEEE Trans. Multim., 2023

Recent Advances of Monocular 2D and 3D Human Pose Estimation: A Deep Learning Perspective.
ACM Comput. Surv., 2023

SigFormer: Sparse Signal-Guided Transformer for Multi-Modal Human Action Segmentation.
CoRR, 2023

RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open Environments.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Parsing is All You Need for Accurate Gait Recognition in the Wild.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Factorized Omnidirectional Representation based Vision GNN for Anisotropic 3D Multimodal MR Image Segmentation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

HCMA '23: 4th International Workshop on Human-Centric Multimedia Analysis.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

FastReID: A Pytorch Toolbox for General Instance Re-identification.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

CoCa: A Connectivity-Aware Cascade Framework for Histology Gland Segmentation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Multimedia Cognition and Evaluation in Open Environments.
Proceedings of the 1st International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice, 2023

TRACE: 5D Temporal Regression of Avatars with Dynamic Cameras in 3D Environments.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning to Segment Every Referring Object Point by Point.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
FasterPose: A Faster Simple Baseline for Human Pose Estimation.
ACM Trans. Multim. Comput. Commun. Appl., 2022

TICNet: A Target-Insight Correlation Network for Object Tracking.
IEEE Trans. Cybern., 2022

Gait Recognition in the Wild with Multi-hop Temporal Switch.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

HCMA'22: 3rd International Workshop on Human-Centric Multimedia Analysis.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

REMOT: A Region-to-Whole Framework for Realistic Human Motion Transfer.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Delving into the Frequency: Temporally Consistent Human Motion Transfer in the Fourier Space.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

WOC: A Handy Webcam-based 3D Online Chatroom.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Towards Causality Inference for Very Important Person Localization.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Keypoint-Guided Modality-Invariant Discriminative Learning for Visible-Infrared Person Re-identification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MAPLE: Masked Pseudo-Labeling autoEncoder for Semi-supervised Point Cloud Action Recognition.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Part-level Action Parsing via a Pose-guided Coarse-to-Fine Framework.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2022

Learning Monocular Mesh Recovery of Multiple Body Parts Via Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2022

Genre-Conditioned Long-Term 3D Dance Generation Driven by Music.
Proceedings of the IEEE International Conference on Acoustics, 2022

CAViT: Contextual Alignment Vision Transformer for Video Object Re-identification.
Proceedings of the Computer Vision - ECCV 2022, 2022

SiRi: A Simple Selective Retraining Mechanism for Transformer-Based Visual Grounding.
Proceedings of the Computer Vision - ECCV 2022, 2022

Gait Recognition in the Wild with Dense 3D Representations and A Benchmark.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Putting People in their Place: Monocular Regression of 3D People in Depth.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DPIT: Dual-Pipeline Integrated Transformer for Human Pose Estimation.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

2021
Correlation Discrepancy Insight Network for Video Re-identification.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Hierarchical Soft Quantization for Skeleton-Based Human Action Recognition.
IEEE Trans. Multim., 2021

Universal Cross-Domain 3D Model Retrieval.
IEEE Trans. Multim., 2021

Pose-Guided Tracking-by-Detection: Robust Multi-Person Pose Tracking.
IEEE Trans. Multim., 2021

A Real-Time Action Representation With Temporal Encoding and Deep Compression.
IEEE Trans. Circuits Syst. Video Technol., 2021

Coupled-dynamic learning for vision and language: Exploring Interaction between different tasks.
Pattern Recognit., 2021

Hierarchical multi-view context modelling for 3D object classification and retrieval.
Inf. Sci., 2021

CMTR: Cross-modality Transformer for Visible-infrared Person Re-identification.
CoRR, 2021

A Baseline Framework for Part-level Action Parsing and Action Recognition.
CoRR, 2021

Semi-Supervised Domain Generalizable Person Re-Identification.
CoRR, 2021

AttriMeter: An Attribute-guided Metric Interpreter for Person Re-Identification.
CoRR, 2021

Boosting End-to-end Multi-Object Tracking and Person Search via Knowledge Distillation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Consistency-Constancy Bi-Knowledge Learning for Pedestrian Detection in Night Surveillance.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

HUMA'21: 2nd International Workshop on Human-centric Multimedia Analysis.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

MSO: Multi-Feature Space Joint Optimization Network for RGB-Infrared Person Re-Identification.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

TraND: Transferable Neighborhood Discovery for Unsupervised Cross-Domain Gait Recognition.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2021

Neural Architecture Search for Joint Human Parsing and Pose Estimation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Monocular, One-stage, Regression of Multiple 3D People.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Explainable Person Re-Identification with Attribute-guided Metric Distillation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Group-aware Label Transfer for Domain Adaptive Person Re-identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Listen, Look, and Find the One: Robust Person Search with Multimodality Index.
ACM Trans. Multim. Comput. Commun. Appl., 2020

MetaSearch: Incremental Product Search via Deep Meta-Learning.
IEEE Trans. Image Process., 2020

Synthetic Training for Monocular Human Mesh Recovery.
CoRR, 2020

CenterHMR: a Bottom-up Single-shot Method for Multi-person 3D Mesh Recovery from a Single Image.
CoRR, 2020

Hierarchical Gumbel Attention Network for Text-based Person Search.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Black Re-ID: A Head-shoulder Descriptor for the Challenging Problem of Person Re-Identification.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multi-Features Fusion and Decomposition for Age-Invariant Face Recognition.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Beyond the Parts: Learning Multi-view Cross-part Correlation for Vehicle Re-identification.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

HUMA'20: 1st International Workshop on Human-Centric Multimedia Analysis.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

PyAnomaly: A Pytorch-based Toolkit for Video Anomaly Detection.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

A Cross-modality and Progressive Person Search System.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Pose-native Network Architecture Search for Multi-person Human Pose Estimation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Effective and Efficient: Toward Open-world Instance Re-identification.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

KTAN: Knowledge Transfer Adversarial Network.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

When Pedestrian Detection Meets Nighttime Surveillance: A New Benchmark.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

2019
Generalized zero-shot learning for action recognition with web-scale video data.
World Wide Web, 2019

Attentive Spatial-Temporal Summary Networks for Feature Learning in Irregular Gait Recognition.
IEEE Trans. Multim., 2019

DELTA: A deep dual-stream network for multi-label image classification.
Pattern Recognit., 2019

A common subgraph correspondence mining framework for map search services.
Multim. Tools Appl., 2019

Foreground-aware Pyramid Reconstruction for Alignment-free Occluded Person Re-identification.
CoRR, 2019

WIDER Face and Pedestrian Challenge 2018: Methods and Results.
CoRR, 2019

PVSS: A Progressive Vehicle Search System for Video Surveillance Networks.
CoRR, 2019

POINet: Pose-Guided Ovonic Insight Network for Multi-Person Pose Tracking.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

BraidNet: Braiding Semantics and Details for Accurate Human Parsing.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Fine-grained Cross-media Representation Learning with Deep Quantization Attention Network.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Learnable Aggregating Net with Diversity Learning for Video Question Answering.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Deep Recurrent Quantization for Generating Sequential Binary Codes.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Multi-Granularity Reasoning for Social Relation Recognition From Images.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Human Mesh Recovery From Monocular Images via a Skeleton-Disentangled Representation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Foreground-Aware Pyramid Reconstruction for Alignment-Free Occluded Person Re-Identification.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Social Relation Recognition From Videos via Multi-Scale Spatial-Temporal Reasoning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Structured Two-Stream Attention Network for Video Question Answering.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
PROVID: Progressive and Multimodal Vehicle Reidentification for Large-Scale Urban Surveillance.
IEEE Trans. Multim., 2018

Learning Efficient Spatial-Temporal Gait Features with Deep Learning for Human Identification.
Neuroinformatics, 2018

Third-Eye: A Mobilephone-Enabled Crowdsensing System for Air Quality Monitoring.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2018

A Progressive Search Paradigm for the Internet of Things.
IEEE Multim., 2018

KTAN: Knowledge Transfer Adversarial Network.
CoRR, 2018

Multi-stream Fusion Model for Social Relation Recognition from Videos.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

LOCO: Local Context Based Faster R-CNN for Small Traffic Sign Detection.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Beyond View Transformation: Cycle-Consistent Global and Partial Perception Gan for View-Invariant Gait Recognition.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Joint License Plate Super-Resolution and Recognition in One Multi-Task Gan Framework.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A Progressive Vehicle Search System for Video Surveillance Networks.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Automatic Generation of Textual Advertisement for Video Advertising.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Appearance and Gait-Based Progressive Person Re-Identification for Surveillance Systems.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

T-C3D: Temporal Convolutional 3D Network for Real-Time Action Recognition.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Deep learning based basketball video analysis for intelligent arena application.
Multim. Tools Appl., 2017

Multi-modal tag localization for mobile video search.
Multim. Syst., 2017

Online multi-objective optimization for live video forwarding across video data centers.
J. Vis. Commun. Image Represent., 2017

Minimizing Resource Cost for Camera Stream Scheduling in Video Data Center.
J. Comput. Sci. Technol., 2017

Deep learning hashing for mobile visual search.
EURASIP J. Image Video Process., 2017

Deep Learning Based Intelligent Basketball Arena with Energy Image.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Multi-attribute Based Fire Detection in Diverse Surveillance Videos.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Multi-feature Fusion for Predicting Social Media Popularity.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Beyond Human-level License Plate Super-resolution with Progressive Vehicle Search and Domain Priori GAN.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Weighted sequence loss based spatial-temporal deep learning framework for human body orientation estimation.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

An efficient deep learning hashing neural network for mobile visual search.
Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017

Query from Sketch: A Common Subgraph Correspondence Mining Framework.
Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

2016
Large-scale vehicle re-identification in urban surveillance videos.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Siamese neural network based gait recognition for human identification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

A Deep Learning-Based Approach to Progressive Vehicle Re-identification for Urban Surveillance.
Proceedings of the Computer Vision - ECCV 2016, 2016

A discriminative null space based deep learning approach for person re-identification.
Proceedings of the 4th International Conference on Cloud Computing and Intelligence Systems, 2016

Cost Optimal Resource Provisioning for Live Video Forwarding Across Video Data Centers.
Proceedings of the Big Data Computing and Communications - Second International Conference, 2016

2015
Multimodal tag localization based on deep learning.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

Multi-task deep visual-semantic embedding for video thumbnail selection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Instant Mobile Video Search With Layered Audio-Video Indexing and Progressive Transmission.
IEEE Trans. Multim., 2014

2013
Accurate Estimation of Human Body Orientation From RGB-D Sensors.
IEEE Trans. Cybern., 2013

LAVES: an instant mobile video search system based on layered audio-video indexing.
Proceedings of the ACM Multimedia Conference, 2013

Listen, look, and gotcha: instant video search with mobile phones by layered audio-video indexing.
Proceedings of the ACM Multimedia Conference, 2013

2012
Web Video Geolocation by Geotagged Social Resources.
IEEE Trans. Multim., 2012

RGB-D Based Multi-attribute People Search in Intelligent Visual Surveillance.
Proceedings of the Advances in Multimedia Modeling - 18th International Conference, 2012


  Loading...