Wu Liu

Orcid: 0000-0003-1633-7575

Affiliations:

University of Science and Technology of China, Heifei, Anhui, China

According to our database¹, Wu Liu authored at least 178 papers between 2012 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

CyberJurors: A Multi-Agent Simulation Task for E-Commerce Disputes Verdict.

[BibT_eX]

[DOI]

CoRR, May, 2026

Rotation-Invariant Spherical Watermarking via Third-Order SO(3) Representation Coupling.

[BibT_eX]

[DOI]

CoRR, May, 2026

GASim: A Graph-Accelerated Hybrid Framework for Social Simulation.

[BibT_eX]

[DOI]

CoRR, May, 2026

Heterogeneous Consensus-Progressive Reasoning for Efficient Multi-Agent Debate.

[BibT_eX]

[DOI]

CoRR, April, 2026

EMS: Multi-Agent Voting via Efficient Majority-then-Stopping.

[BibT_eX]

[DOI]

CoRR, April, 2026

A Paradigm Shift: Fully End-to-End Training for Temporal Sentence Grounding in Videos.

[BibT_eX]

[DOI]

CoRR, April, 2026

Bitscaling: Streamlining neural network compression via predictive multi-scale growth of mixed-precision networks.

[BibT_eX]

[DOI]

Neural Networks, 2026

AdaptiveMamba: Comprehensive visual representation learning with adaptive semantic perception.

[BibT_eX]

[DOI]

Knowl. Based Syst., 2026

FHNet: Frequency guided hierarchical optimization network for few-shot fine-grained classification of spatially dispersed and aggregated data.

[BibT_eX]

[DOI]

Inf. Fusion, 2026

Multi-Granularity Multi-Modal Knowledge Graph Representation Learning via Subgraph-Aware Adaptive Fusion and Hierarchical Relation Modeling.

[BibT_eX]

[DOI]

Proceedings of the ACM Web Conference 2026, 2026

Enhancing Federated Class-Incremental Learning via Spatial-Temporal Statistics Aggregation.

[BibT_eX]

[DOI]

Proceedings of the ACM Web Conference 2026, 2026

GUI-Eyes: Tool-Augmented Perception for Visual Grounding in GUI Agents.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Region-Constraint In-Context Generation for Instructional Video Editing.

[BibT_eX]

[DOI]

CoRR, December, 2025

PopSim: Social Network Simulation for Social Media Popularity Prediction.

[BibT_eX]

[DOI]

CoRR, December, 2025

ELMM: Efficient Lightweight Multimodal Large Language Models for Multimodal Knowledge Graph Completion.

[BibT_eX]

[DOI]

CoRR, October, 2025

RumorSphere: A Framework for Million-scale Agent-based Dynamic Simulation of Rumor Propagation.

[BibT_eX]

[DOI]

CoRR, September, 2025

SSFMamba: Symmetry-driven Spatial-Frequency Feature Fusion for 3D Medical Image Segmentation.

[BibT_eX]

[DOI]

CoRR, August, 2025

Single-Frame Supervision for Spatio-Temporal Video Grounding.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., July, 2025

DynamiX: Large-Scale Dynamic Social Network Simulator.

[BibT_eX]

[DOI]

CoRR, July, 2025

From Global to Hybrid: A Review of Supervised Deep Learning for 2-D Image Feature Representation.

[BibT_eX]

[DOI]

IEEE Trans. Artif. Intell., June, 2025

STSA: Federated Class-Incremental Learning via Spatial-Temporal Statistics Aggregation.

[BibT_eX]

[DOI]

CoRR, June, 2025

HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation.

[BibT_eX]

[DOI]

CoRR, March, 2025

DRC: Discrete Representation Classifier With Salient Features via Fixed-Prototype.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., January, 2025

TalkingAvatar: Learning 3D talking human avatar via NeRF.

[BibT_eX]

[DOI]

Neurocomputing, 2025

LSF-Animation: Label-Free Speech-Driven Facial Animation via Implicit Feature Representation.

[BibT_eX]

[DOI]

Proceedings of the SIGGRAPH Asia 2025 Conference Papers, 2025

Asymmetric Pre-aligned Anchor Contrastive Enhanced Diffusion Hashing Model for Incomplete Multimodal Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Dynamic Self-adaptive Multiscale Distillation from Pre-trained Multimodal Large Model for Efficient Cross-modal Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Edit-by-Example: Adaptive Exemplar-Based Image Editing.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

High-Fidelity Object Removal through Boosting Diffusion Processes.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2025

PlugMark: A Plug-In Zero-Watermarking Framework for Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

MotionPro: A Precise Motion Controller for Image-to-Video Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

A Semantic Knowledge Complementarity based Decoupling Framework for Semi-supervised Class-imbalanced Medical Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

SigFormer: Sparse Signal-guided Transformer for Multi-modal Action Segmentation.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., August, 2024

Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., August, 2024

A Generative-Based Image Fusion Strategy for Visible-Infrared Person Re-Identification.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., January, 2024

Learning Monocular Regression of 3D People in Crowds via Scene-Aware Blending and De-Occlusion.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2024

CDKM: Common and Distinct Knowledge Mining Network With Content Interaction for Dense Captioning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2024

OmniPrism: Learning Disentangled Visual Concept for Image Generation.

[BibT_eX]

[DOI]

CoRR, 2024

T-SVG: Text-Driven Stereoscopic Video Generation.

[BibT_eX]

[DOI]

CoRR, 2024

LMAgent: A Large-scale Multimodal Agents Society for Multi-user Simulation.

[BibT_eX]

[DOI]

CoRR, 2024

It Takes Two: Accurate Gait Recognition in the Wild via Cross-granularity Alignment.

[BibT_eX]

[DOI]

CoRR, 2024

Motion Capture from Inertial and Vision Sensors.

[BibT_eX]

[DOI]

CoRR, 2024

Prompting Industrial Anomaly Segment with Large Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024

Watermarking Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024

It Takes Two: Accurate Gait Recognition in the Wild via Cross-granularity Alignment.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

ADDG: An Adaptive Domain Generalization Framework for Cross-Plane MRI Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

CLaM: An Open-Source Library for Performance Evaluation of Text-driven Human Motion Generation.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

HCMA'24: The 5th International Workshop on Human-centric Multimedia Analysis Summary.

[BibT_eX]

[DOI]

Proceedings of the 5th International Workshop on Human-centric Multimedia Analysis, 2024

AnimateAnywhere: Context-Controllable Human Video Generation with ID-Consistent One-shot Learning.

[BibT_eX]

[DOI]

Proceedings of the 5th International Workshop on Human-centric Multimedia Analysis, 2024

Norma: A Noise Robust Memory-Augmented Framework for Whole Slide Image Classification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

ChatVTG: Video Temporal Grounding via Chat with Video Dialogue Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HumanNeRF-SE: A Simple yet Effective Approach to Animate HumanNeRF with Diverse Poses.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Learning a Dynamic Neural Human via Poses Guided Dual Spaces Feature.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Advanced Video and Signal Based Surveillance, 2024

2023

Bird-Count: a multi-modality benchmark and system for bird population counting in the wild.

[BibT_eX]

[DOI]

Multim. Tools Appl., December, 2023

Introduction to the Special Issue on Trustworthy Multimedia Computing and Applications in Urban Scenes.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., November, 2023

Semantic Embedding Guided Attention with Explicit Visual Feature Fusion for Video Captioning.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2023

Cross-Modality Transformer With Modality Mining for Visible-Infrared Person Re-Identification.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Recent Advances of Monocular 2D and 3D Human Pose Estimation: A Deep Learning Perspective.

[BibT_eX]

[DOI]

ACM Comput. Surv., 2023

SigFormer: Sparse Signal-Guided Transformer for Multi-Modal Human Action Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open Environments.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Parsing is All You Need for Accurate Gait Recognition in the Wild.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Factorized Omnidirectional Representation based Vision GNN for Anisotropic 3D Multimodal MR Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

HCMA '23: 4th International Workshop on Human-Centric Multimedia Analysis.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

FastReID: A Pytorch Toolbox for General Instance Re-identification.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

CoCa: A Connectivity-Aware Cascade Framework for Histology Gland Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Multimedia Cognition and Evaluation in Open Environments.

[BibT_eX]

[DOI]

Proceedings of the 1st International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice, 2023

TRACE: 5D Temporal Regression of Avatars with Dynamic Cameras in 3D Environments.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning to Segment Every Referring Object Point by Point.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

FasterPose: A Faster Simple Baseline for Human Pose Estimation.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2022

TICNet: A Target-Insight Correlation Network for Object Tracking.

[BibT_eX]

[DOI]

IEEE Trans. Cybern., 2022

Gait Recognition in the Wild with Multi-hop Temporal Switch.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

HCMA'22: 3rd International Workshop on Human-Centric Multimedia Analysis.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

REMOT: A Region-to-Whole Framework for Realistic Human Motion Transfer.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Delving into the Frequency: Temporally Consistent Human Motion Transfer in the Fourier Space.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

WOC: A Handy Webcam-based 3D Online Chatroom.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Towards Causality Inference for Very Important Person Localization.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Keypoint-Guided Modality-Invariant Discriminative Learning for Visible-Infrared Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MAPLE: Masked Pseudo-Labeling autoEncoder for Semi-supervised Point Cloud Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Part-level Action Parsing via a Pose-guided Coarse-to-Fine Framework.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2022

Learning Monocular Mesh Recovery of Multiple Body Parts Via Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Genre-Conditioned Long-Term 3D Dance Generation Driven by Music.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

CAViT: Contextual Alignment Vision Transformer for Video Object Re-identification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

SiRi: A Simple Selective Retraining Mechanism for Transformer-Based Visual Grounding.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Gait Recognition in the Wild with Dense 3D Representations and A Benchmark.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Putting People in their Place: Monocular Regression of 3D People in Depth.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DPIT: Dual-Pipeline Integrated Transformer for Human Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

2021

Correlation Discrepancy Insight Network for Video Re-identification.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2021

Hierarchical Soft Quantization for Skeleton-Based Human Action Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2021

Universal Cross-Domain 3D Model Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2021

Pose-Guided Tracking-by-Detection: Robust Multi-Person Pose Tracking.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2021

A Real-Time Action Representation With Temporal Encoding and Deep Compression.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2021

Coupled-dynamic learning for vision and language: Exploring Interaction between different tasks.

[BibT_eX]

[DOI]

Pattern Recognit., 2021

Hierarchical multi-view context modelling for 3D object classification and retrieval.

[BibT_eX]

[DOI]

Inf. Sci., 2021

CMTR: Cross-modality Transformer for Visible-infrared Person Re-identification.

[BibT_eX]

[DOI]

CoRR, 2021

A Baseline Framework for Part-level Action Parsing and Action Recognition.

[BibT_eX]

[DOI]

CoRR, 2021

Semi-Supervised Domain Generalizable Person Re-Identification.

[BibT_eX]

[DOI]

CoRR, 2021

AttriMeter: An Attribute-guided Metric Interpreter for Person Re-Identification.

[BibT_eX]

[DOI]

CoRR, 2021

Boosting End-to-end Multi-Object Tracking and Person Search via Knowledge Distillation.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Consistency-Constancy Bi-Knowledge Learning for Pedestrian Detection in Night Surveillance.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

HUMA'21: 2nd International Workshop on Human-centric Multimedia Analysis.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

MSO: Multi-Feature Space Joint Optimization Network for RGB-Infrared Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

TraND: Transferable Neighborhood Discovery for Unsupervised Cross-Domain Gait Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2021

Neural Architecture Search for Joint Human Parsing and Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Monocular, One-stage, Regression of Multiple 3D People.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Explainable Person Re-Identification with Attribute-guided Metric Distillation.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Group-aware Label Transfer for Domain Adaptive Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Listen, Look, and Find the One: Robust Person Search with Multimodality Index.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2020

MetaSearch: Incremental Product Search via Deep Meta-Learning.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Synthetic Training for Monocular Human Mesh Recovery.

[BibT_eX]

[DOI]

CoRR, 2020

CenterHMR: a Bottom-up Single-shot Method for Multi-person 3D Mesh Recovery from a Single Image.

[BibT_eX]

[DOI]

CoRR, 2020

Hierarchical Gumbel Attention Network for Text-based Person Search.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Black Re-ID: A Head-shoulder Descriptor for the Challenging Problem of Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multi-Features Fusion and Decomposition for Age-Invariant Face Recognition.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Beyond the Parts: Learning Multi-view Cross-part Correlation for Vehicle Re-identification.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

HUMA'20: 1st International Workshop on Human-Centric Multimedia Analysis.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

PyAnomaly: A Pytorch-based Toolkit for Video Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

A Cross-modality and Progressive Person Search System.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Pose-native Network Architecture Search for Multi-person Human Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Effective and Efficient: Toward Open-world Instance Re-identification.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

KTAN: Knowledge Transfer Adversarial Network.

[BibT_eX]

[DOI]

Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

When Pedestrian Detection Meets Nighttime Surveillance: A New Benchmark.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

2019

Generalized zero-shot learning for action recognition with web-scale video data.

[BibT_eX]

[DOI]

World Wide Web, 2019

Attentive Spatial-Temporal Summary Networks for Feature Learning in Irregular Gait Recognition.

[BibT_eX]

[DOI]

Shuangqun Li

Wu Liu

Huadong Ma

IEEE Trans. Multim., 2019

DELTA: A deep dual-stream network for multi-label image classification.

[BibT_eX]

[DOI]

Pattern Recognit., 2019

A common subgraph correspondence mining framework for map search services.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2019

Foreground-aware Pyramid Reconstruction for Alignment-free Occluded Person Re-identification.

[BibT_eX]

[DOI]

CoRR, 2019

WIDER Face and Pedestrian Challenge 2018: Methods and Results.

[BibT_eX]

[DOI]

CoRR, 2019

PVSS: A Progressive Vehicle Search System for Video Surveillance Networks.

[BibT_eX]

[DOI]

CoRR, 2019

POINet: Pose-Guided Ovonic Insight Network for Multi-Person Pose Tracking.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

BraidNet: Braiding Semantics and Details for Accurate Human Parsing.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Fine-grained Cross-media Representation Learning with Deep Quantization Attention Network.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Learnable Aggregating Net with Diversity Learning for Video Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Deep Recurrent Quantization for Generating Sequential Binary Codes.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Multi-Granularity Reasoning for Social Relation Recognition From Images.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Human Mesh Recovery From Monocular Images via a Skeleton-Disentangled Representation.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Foreground-Aware Pyramid Reconstruction for Alignment-Free Occluded Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Social Relation Recognition From Videos via Multi-Scale Spatial-Temporal Reasoning.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Structured Two-Stream Attention Network for Video Question Answering.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

PROVID: Progressive and Multimodal Vehicle Reidentification for Large-Scale Urban Surveillance.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2018

Learning Efficient Spatial-Temporal Gait Features with Deep Learning for Human Identification.

[BibT_eX]

[DOI]

Neuroinformatics, 2018

Third-Eye: A Mobilephone-Enabled Crowdsensing System for Air Quality Monitoring.

[BibT_eX]

[DOI]

Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2018

A Progressive Search Paradigm for the Internet of Things.

[BibT_eX]

[DOI]

Huadong Ma

Wu Liu

IEEE Multim., 2018

KTAN: Knowledge Transfer Adversarial Network.

[BibT_eX]

[DOI]

CoRR, 2018

Multi-stream Fusion Model for Social Relation Recognition from Videos.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

LOCO: Local Context Based Faster R-CNN for Small Traffic Sign Detection.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Beyond View Transformation: Cycle-Consistent Global and Partial Perception Gan for View-Invariant Gait Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Joint License Plate Super-Resolution and Recognition in One Multi-Task Gan Framework.

[BibT_eX]

[DOI]

Minghui Zhang

Wu Liu

Huadong Ma

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A Progressive Vehicle Search System for Video Surveillance Networks.

[BibT_eX]

[DOI]

Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Automatic Generation of Textual Advertisement for Video Advertising.

[BibT_eX]

[DOI]

Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Appearance and Gait-Based Progressive Person Re-Identification for Surveillance Systems.

[BibT_eX]

[DOI]

Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

T-C3D: Temporal Convolutional 3D Network for Real-Time Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Deep learning based basketball video analysis for intelligent arena application.

[BibT_eX]

[DOI]

Wu Liu

Chenggang Clarence Yan

Jiangyu Liu

Huadong Ma

Multim. Tools Appl., 2017

Multi-modal tag localization for mobile video search.

[BibT_eX]

[DOI]

Multim. Syst., 2017

Online multi-objective optimization for live video forwarding across video data centers.

[BibT_eX]

[DOI]

J. Vis. Commun. Image Represent., 2017

Minimizing Resource Cost for Camera Stream Scheduling in Video Data Center.

[BibT_eX]

[DOI]

Yihong Gao

Huadong Ma

Wu Liu

J. Comput. Sci. Technol., 2017

Deep learning hashing for mobile visual search.

[BibT_eX]

[DOI]

EURASIP J. Image Video Process., 2017

Deep Learning Based Intelligent Basketball Arena with Energy Image.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Multi-attribute Based Fire Detection in Diverse Surveillance Videos.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Multi-feature Fusion for Predicting Social Media Popularity.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Beyond Human-level License Plate Super-resolution with Progressive Vehicle Search and Domain Priori GAN.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Weighted sequence loss based spatial-temporal deep learning framework for human body orientation estimation.

[BibT_eX]

[DOI]

Peiye Liu

Wu Liu

Huadong Ma

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

An efficient deep learning hashing neural network for mobile visual search.

[BibT_eX]

[DOI]

Heng Qi

Wu Liu

Liang Liu

Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017

Query from Sketch: A Common Subgraph Correspondence Mining Framework.

[BibT_eX]

[DOI]

Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

2016

Large-scale vehicle re-identification in urban surveillance videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Siamese neural network based gait recognition for human identification.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

A Deep Learning-Based Approach to Progressive Vehicle Re-identification for Urban Surveillance.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2016, 2016

A discriminative null space based deep learning approach for person re-identification.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Cloud Computing and Intelligence Systems, 2016

Cost Optimal Resource Provisioning for Live Video Forwarding Across Video Data Centers.

[BibT_eX]

[DOI]

Proceedings of the Big Data Computing and Communications - Second International Conference, 2016

2015

Multimodal tag localization based on deep learning.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

Multi-task deep visual-semantic embedding for video thumbnail selection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014

Instant Mobile Video Search With Layered Audio-Video Indexing and Progressive Transmission.

[BibT_eX]

[DOI]

Wu Liu

Tao Mei

Yongdong Zhang

IEEE Trans. Multim., 2014

2013

Accurate Estimation of Human Body Orientation From RGB-D Sensors.

[BibT_eX]

[DOI]

IEEE Trans. Cybern., 2013

LAVES: an instant mobile video search system based on layered audio-video indexing.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

Listen, look, and gotcha: instant video search with mobile phones by layered audio-video indexing.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

2012

Web Video Geolocation by Geotagged Social Resources.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2012

RGB-D Based Multi-attribute People Search in Intelligent Visual Surveillance.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Modeling - 18th International Conference, 2012

Wu Liu

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...