Hanli Wang

Orcid: 0000-0002-9999-4871

According to our database1, Hanli Wang authored at least 172 papers between 2004 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Transductive Learning With Prior Knowledge for Generalized Zero-Shot Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., January, 2024

Multi-Modal Structure-Embedding Graph Transformer for Visual Commonsense Reasoning.
IEEE Trans. Multim., 2024

Glow in the Dark: Low-Light Image Enhancement With External Memory.
IEEE Trans. Multim., 2024

Self-Supervised Video Representation Learning by Serial Restoration With Elastic Complexity.
IEEE Trans. Multim., 2024

Misalignment-Robust Frequency Distribution Loss for Image Transformation.
CoRR, 2024

ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
High Dynamic Range Image Quality Assessment Based on Frequency Disparity.
IEEE Trans. Circuits Syst. Video Technol., August, 2023

Knowledge-Enriched Attention Network With Group-Wise Semantic for Visual Storytelling.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Task-Driven Video Compression for Humans and Machines: Framework Design and Optimization.
IEEE Trans. Multim., 2023

Perceptually Weighted Rate Distortion Optimization for Video-Based Point Cloud Compression.
IEEE Trans. Image Process., 2023

CSformer: Bridging Convolution and Transformer for Compressive Sensing.
IEEE Trans. Image Process., 2023

Multi-Level Content-Aware Boundary Detection for Temporal Action Proposal Generation.
IEEE Trans. Image Process., 2023

Joint Graph Attention and Asymmetric Convolutional Neural Network for Deep Image Compression.
IEEE Trans. Circuits Syst. Video Technol., 2023

Coherent Visual Storytelling via Parallel Top-Down Visual and Topic Attention.
IEEE Trans. Circuits Syst. Video Technol., 2023

Pseudo 3D Perception Transformer with Multi-level Confidence Optimization for Visual Commonsense Reasoning.
CoRR, 2023

Structure-Aware Generative Adversarial Network for Text-to-Image Generation.
Proceedings of the IEEE International Conference on Image Processing, 2023

Adaptive Token Excitation with Negative Selection for Video-Text Retrieval.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2023, 2023

2022
Emotion Expression With Fact Transfer for Video Description.
IEEE Trans. Multim., 2022

Meta-Learning-Based Incremental Few-Shot Object Detection.
IEEE Trans. Circuits Syst. Video Technol., 2022

Multiscale Conditional Relationship Graph Network for Referring Relationships in Images.
IEEE Trans. Cogn. Dev. Syst., 2022

Contrastive semantic similarity learning for image captioning evaluation.
Inf. Sci., 2022

Learning temporal information and object relation for zero-shot action recognition.
Displays, 2022

Taking an Emotional Look at Video Paragraph Captioning.
CoRR, 2022

Two-stream Hierarchical Similarity Reasoning for Image-text Matching.
CoRR, 2022

Generalized Visual Quality Assessment of GAN-Generated Face Images.
CoRR, 2022

Cycle-Interactive Generative Adversarial Network for Robust Unsupervised Low-Light Enhancement.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Multi-concept Mining for Video Captioning Based on Multiple Tasks.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2022

Spatio-temporal Super-resolution Network: Enhance Visual Representations for Video Captioning.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2022

Discretized Gaussian Mixture Hyperprior for Learned Image Compression with Mask Module.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Learned Image Compression with Multi-Scale Spatial and Contextual Information Fusion.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

2021
Perceptual Image Compression with Block-Level Just Noticeable Difference Prediction.
ACM Trans. Multim. Comput. Commun. Appl., 2021

CaptionNet: A Tailor-made Recurrent Neural Network for Generating Image Descriptions.
IEEE Trans. Multim., 2021

MABAN: Multi-Agent Boundary-Aware Network for Natural Language Moment Retrieval.
IEEE Trans. Image Process., 2021

Person Reidentification by Multiscale Feature Representation Learning With Random Batch Feature Mask.
IEEE Trans. Cogn. Dev. Syst., 2021

Visual Storytelling with Hierarchical BERT Semantic Guidance.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

2020
Affective Video Content Analysis With Adaptive Fusion Recurrent Network.
IEEE Trans. Multim., 2020

Perceptually Weighted Mean Squared Error Based Rate-Distortion Optimization for HEVC.
IEEE Trans. Broadcast., 2020

Just Noticeable Difference Level Prediction for Perceptual Image Compression.
IEEE Trans. Broadcast., 2020

RepeatPadding: Balancing words and sentence length for language comprehension in visual question answering.
Inf. Sci., 2020

Evolutionary recurrent neural network for image captioning.
Neurocomputing, 2020

Structural Pyramid Network for Cascaded Optical Flow Estimation.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Wonderful Clips of Playing Basketball: A Database for Localizing Wonderful Actions.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Patch assembly for real-time instance segmentation.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Random Occlusion Recovery with Noise Channel for Person Re-identification.
Proceedings of the Intelligent Computing Theories and Application, 2020

Joint Transmit Power and Trajectory Optimization for Two-Way Multi-Hop UAV Relaying Networks.
Proceedings of the 2020 IEEE International Conference on Communications Workshops, 2020

Enhanced Action Tubelet Detector for Spatio-Temporal Video Action Detection.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Rich Visual and Language Representation with Complementary Semantics for Video Captioning.
ACM Trans. Multim. Comput. Commun. Appl., 2019

A Multi-Grained Parallel Solution for HEVC Encoding on Heterogeneous Platforms.
IEEE Trans. Multim., 2019

WLDISR: Weighted Local Sparse Representation-Based Depth Image Super-Resolution for 3D Video System.
IEEE Trans. Image Process., 2019

Multi-modal learning for affective content analysis in movies.
Multim. Tools Appl., 2019

Multi-Dilation Network for Crowd Counting.
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

Data Driven Regularization for Convolutional Neural Networks on Image Classification.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2019

Swell-and-Shrink: Decomposing Image Captioning by Transformation and Summarization.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Perceptual Video Coding with Block-Level Staircase Just Noticeable Distortion.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

P3D-CTN: Pseudo-3D Convolutional Tube Network for Spatio-Temporal Action Detection in Videos.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

2018
Motion keypoint trajectory and covariance descriptor for human action recognition.
Vis. Comput., 2018

A Collaborative Scheduling-Based Parallel Solution for HEVC Encoding on Multicore Platforms.
IEEE Trans. Multim., 2018

Real-Time Action Recognition With Deeply Transferred Motion Vector CNNs.
IEEE Trans. Image Process., 2018

Maximal granularity structure and generalized multi-view discriminant analysis for person re-identification.
Pattern Recognit., 2018

Looking deeper and transferring attention for image captioning.
Multim. Tools Appl., 2018

Deep sequential fusion LSTM network for image description.
Neurocomputing, 2018

Large-scale video compression: recent advances and challenges.
Frontiers Comput. Sci., 2018

Three-Stream Action Tubelet Detector for Spatiotemporal Action Detection in Videos.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Deeper Spatial Pyramid Network with Refined Up-Sampling for Optical Flow Estimation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

CNN Features for Emotional Impact of Movies Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2018 Workshop, 2018

Refining Attention: A Sequential Attention Model for Image Captioning.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Image Captioning with Word Level Attention.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Object-oriented video prediction with pixel-level attention.
Proceedings of the 10th International Conference on Internet Multimedia Computing and Service, 2018

Categorizing Concepts With Basic Level for Vision-to-Language.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Highly Parallel Acceleration of HEVC Encoding on ARM Platform.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Light Field Image Coding with Disparity Correlation Based Prediction.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Static Correlative Filter Based Convolutional Neural Network for Visual Question Answering.
Proceedings of the 2018 IEEE International Conference on Big Data and Smart Computing, 2018

2017
Joint Compression of Near-Duplicate Videos.
IEEE Trans. Multim., 2017

Image Quality Assessment Based on Local Linear Information and Distortion-Specific Compensation.
IEEE Trans. Image Process., 2017

Building Correlations Between Filters in Convolutional Neural Networks.
IEEE Trans. Cybern., 2017

Objective Video Quality Assessment Based on Perceptually Weighted Mean Squared Error.
IEEE Trans. Circuits Syst. Video Technol., 2017

Improving feature matching strategies for efficient image retrieval.
Signal Process. Image Commun., 2017

Learning correlations for human action recognition in videos.
Multim. Tools Appl., 2017

G-MS2F: GoogLeNet based multi-stage feature fusion of deep CNN for scene recognition.
Neurocomputing, 2017

Richer feature for image classification with super and sub kernels based on deep convolutional neural network.
Comput. Electr. Eng., 2017

Richer Semantic Visual and Language Representation for Video Captioning.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

MIC-TJU in MediaEval 2017 Emotional Impact of Movies Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2017 Workshop co-located with the Conference and Labs of the Evaluation Forum (CLEF 2017), 2017

Image captioning with deep LSTM based on sequential residual.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Regularization of convolutional neural networks using ShuffleNode.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Predicting Interestingness of Visual Content.
Proceedings of the Visual Content Indexing and Retrieval with Psycho-Visual Models, 2017

2016
Bayesian rule based fast TU depth decision algorithm for high efficiency video coding.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

Optimal stopping theory based fast coding tree unit decision for high efficiency video coding.
Proceedings of the 2016 Visual Communications and Image Processing, 2016

Accelerating Large-Scale Human Action Recognition with GPU-Based Spark.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Improving Image Retrieval by Local Feature Reselection with Query Expansion.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

MediaEval 2016 Predicting Media Interestingness Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2016 Workshop, 2016

Distortion recognition for image quality assessment with convolutional neural network.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Screen content image quality assessment via convolutional neural network.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Blind image quality assessment for multiply distorted images via convolutional neural networks.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Real-Time Action Recognition with Enhanced Motion Vector CNNs.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Efficient coding modulation and seamless rate adaptation for visible light communications.
IEEE Wirel. Commun., 2015

Compressed Image Quality Metric Based on Perceptually Weighted Distortion.
IEEE Trans. Image Process., 2015

CHCF: A Cloud-Based Heterogeneous Computing Framework for Large-Scale Image Retrieval.
IEEE Trans. Circuits Syst. Video Technol., 2015

GPU-based MapReduce for large-scale near-duplicate video retrieval.
Multim. Tools Appl., 2015

A new network-based algorithm for human activity recognition in video.
CoRR, 2015

Encoding scale into fisher vector for human action recognition.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Tracking Salient Keypoints for Human Action Recognition.
Proceedings of the 2015 IEEE International Conference on Systems, 2015

Accelerating Support Vector Machine Learning with GPU-Based MapReduce.
Proceedings of the 2015 IEEE International Conference on Systems, 2015

Correlative Filters for Convolutional Neural Networks.
Proceedings of the 2015 IEEE International Conference on Systems, 2015

Large-scale human action recognition with spark.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

Human Action Recognition With Trajectory Based Covariance Descriptor In Unconstrained Videos.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Accelerating Large-scale Image Retrieval on Heterogeneous Architectures with Spark.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Twin Feature and Similarity Maximal Matching for Image Retrieval.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

MIC-TJU in MediaEval 2015 Affective Impact of Movies Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

The MediaEval 2015 Affective Impact of Movies Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

Effectively compressing Near-Duplicate Videos in a joint way.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Improving bag-of-words representation with efficient twin feature integration.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

A GPU-based MapReduce framework for MSR-Bing Image Retrieval Challenge.
Proceedings of the 10th International Conference on Communications and Networking in China, 2015

2014
A New Network-Based Algorithm for Human Activity Recognition in Videos.
IEEE Trans. Circuits Syst. Video Technol., 2014

Early detection of all-zero 4×4 blocks in High Efficiency Video Coding.
J. Vis. Commun. Image Represent., 2014

MIC_TJ at TRECVID 2014.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Mixed Pooling for Convolutional Neural Networks.
Proceedings of the Rough Sets and Knowledge Technology - 9th International Conference, 2014

A Framework of Video Coding for Compressing Near-Duplicate Videos.
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014

MIC-TJU at MediaEval Violent Scenes Detection (VSD) 2014.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Predicting zero coefficients for High Efficiency Video Coding.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Optimal stopping theory based algorithm for coding unit size decision in HEVC.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
Multiview Coding Mode Decision With Hybrid Optimal Stopping Model.
IEEE Trans. Image Process., 2013

Rate control for consistent visual quality of H.264/AVC encoding.
Signal Process. Image Commun., 2013

Semi-supervised multi-label image classification based on nearest neighbor editing.
Neurocomputing, 2013

MIC_TJ at TRECVID 2013: Instance Search Task.
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

Parallel time-space processing model based fast <i>N</i>-body simulation on GPUs.
Proceedings of the 2013 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, 2013

Fast Mode Decision Based on Optimal Stopping Theory for Multiview Video Coding.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

2012
Joint Demosaicing and Subpixel-Based Down-Sampling for Bayer Images: A Fast Frequency-Domain Analysis Approach.
IEEE Trans. Multim., 2012

H.264/SVC Mode Decision Based on Optimal Stopping Theory.
IEEE Trans. Image Process., 2012

Adaptive Quantization-Parameter Clip Scheme for Smooth Quality in H.264/AVC.
IEEE Trans. Image Process., 2012

Novel Rate-Quantization Model-Based Rate Control With Adaptive Initialization for Spatial Scalable Video Coding.
IEEE Trans. Ind. Electron., 2012

A Universal Rate Control Scheme for Video Transcoding.
IEEE Trans. Circuits Syst. Video Technol., 2012

Novel 2-D MMSE Subpixel-Based Image Down-Sampling.
IEEE Trans. Circuits Syst. Video Technol., 2012

Towards reliable self-clustering Mobile Ad Hoc Networks.
Comput. Electr. Eng., 2012

Adaptive Rate-Distortion Prediction for Multiple Reference Selection and Inter-mode Decision.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

Analytical study of RGB vertical stripe and RGBX square-shaped subpixel arrangements.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Quality assessment for color images with tucker decomposition.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Large-scale multimedia data mining using MapReduce framework.
Proceedings of the 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings, 2012

2011
Rate Control Optimization for Temporal-Layer Scalable Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2011

Efficient Multi-Reference Frame Selection Algorithm for Hierarchical B Pictures in Multiview Video Coding.
IEEE Trans. Broadcast., 2011

Hierarchical B-picture mode decision in H.264/SVC.
J. Vis. Commun. Image Represent., 2011

Combining interpretable fuzzy rule-based classifiers via multi-objective hierarchical evolutionary algorithm.
Proceedings of the IEEE International Conference on Systems, 2011

2010
Fast Mode Decision Based on Mode Adaptation.
IEEE Trans. Circuits Syst. Video Technol., 2010

Fast Inter-Mode Decision Based on Rate-Distortion Cost Characteristics.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

Image thresholding based on Random spatial sampling and Majority voting.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2010

Frame level rate control for H.264/AVC with novel Rate-Quantization model.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Probability-based coding mode prediction for H.264/AVC.
Proceedings of the International Conference on Image Processing, 2010

Fast inter-layer mode decision in Scalable Video Coding.
Proceedings of the International Conference on Image Processing, 2010

2009
Early Determination of Zero-Quantized 8 , <sup>×</sup>, 8 DCT Coefficients.
IEEE Trans. Circuits Syst. Video Technol., 2009

Resampling-based selective clustering ensembles.
Pattern Recognit. Lett., 2009

Decision-based median filter using k-nearest noise-free pixels.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Prediction of Zero Quantized DCT Coefficients in H.264/AVC Using Hadamard Transformed Information.
IEEE Trans. Circuits Syst. Video Technol., 2008

Rate-Distortion Optimization of Rate Control for H.264 With Adaptive Initial Quantization Parameter Determination.
IEEE Trans. Circuits Syst. Video Technol., 2008

Two-Dimensional Map Based Fast Mode Decision for H.264/AVC.
Proceedings of the Advances in Multimedia Information Processing, 2008

A Novel Approach for Service Capabilities Representation Based on Statistical Study on WSDL.
Proceedings of the Third International Conference on Internet and Web Applications and Services, 2008

SVPCGA: Selection on virtual population based compact genetic algorithm.
Proceedings of the IEEE Congress on Evolutionary Computation, 2008

Probabilistic and Graphical Model based Genetic Algorithm Driven Clustering with Instance-level Constraints.
Proceedings of the IEEE Congress on Evolutionary Computation, 2008

2007
An Efficient Mode Decision Algorithm for H.264/AVC Encoding Optimization.
IEEE Trans. Multim., 2007

Hybrid Model to Detect Zero Quantized DCT Coefficients in H.264.
IEEE Trans. Multim., 2007

Genetic-fuzzy rule mining approach and evaluation of feature selection techniques for anomaly intrusion detection.
Pattern Recognit., 2007

Efficient predictive model of zero quantized DCT coefficients for fast video encoding.
Image Vis. Comput., 2007

A Rate-Distortion Optimization Algorithm for Rate Control in H.264.
Proceedings of the IEEE International Conference on Acoustics, 2007

Gaussian Model Based Approach to Reducing DCT Computations in H.264.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Agent Based Multi-Objective Approach to Generating Interpretable Fuzzy Systems.
Proceedings of the Multi-Objective Machine Learning, 2006

Efficient prediction algorithm of integer DCT coefficients for H.264/AVC optimization.
IEEE Trans. Circuits Syst. Video Technol., 2006

Novel quantized DCT for video encoder optimization.
IEEE Signal Process. Lett., 2006

Fast video coding based on Gaussian model of DCT coefficients.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006

Analytical Model of Zero Quantized DCT Coefficients for Video Encoder Optimization.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Effectively Detecting All-Zero DCT Blocks for H.264 Optimization.
Proceedings of the International Conference on Image Processing, 2006

2005
Agent-based evolutionary approach for interpretable rule-based knowledge extraction.
IEEE Trans. Syst. Man Cybern. Part C, 2005

Multi-objective hierarchical genetic algorithm for interpretable fuzzy rule-based knowledge extraction.
Fuzzy Sets Syst., 2005

Anomaly Intrusion Detection Using Multi-Objective Genetic Fuzzy System and Agent-Based Evolutionary Computation Framework.
Proceedings of the 5th IEEE International Conference on Data Mining (ICDM 2005), 2005

2004
Optimization of Gaussian Mixture Model Parameters for Speaker Identification.
Proceedings of the Genetic and Evolutionary Computation, 2004


  Loading...