Haowei Liu

This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.

Bibliography

2026
Text2Scenes: Language-Guided Synthesis of Complex Indoor Scenes.
Int. J. Comput. Vis., June, 2026

Exploring speech clues for Chinese aspect-based sentiment analysis.
Frontiers Comput. Sci., April, 2026

InnoAds-Composer: Efficient Condition Composition for E-Commerce Poster Generation.
CoRR, March, 2026

Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents.
CoRR, February, 2026

MSFI: Multi-timescale spatio-temporal features integration in spiking neural networks.
Neural Networks, 2026

Adaptive hypergraph contrastive learning with structure-conditioned diffusion for robust fault diagnosis.
Expert Syst. Appl., 2026

GCRA-FWVAE: Anomaly detection for IIoT univariate time series using time-frequency domain analysis.
Digit. Commun. Networks, 2026

RAGAR: Retrieval Augmented Personalized Image Generation Guided by Recommendation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

MoFu: Scale-Aware Modulation and Fourier Fusion for Multi-Subject Video Generation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

OTI: A Model-free and Visually Interpretable Measure of Image Attackability.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Hierarchical radial basis functions method for large eddy simulation.
Eng. Comput., December, 2025

A comparative study of several classes of meshfree methods for solving the Helmholtz equation.
Eng. Comput., December, 2025

Hierarchical radial basis functions method for the coupling of Stokes and Darcy flow.
Eng. Comput., October, 2025

Mobile-Agent-v3: Fundamental Agents for GUI Automation.
CoRR, August, 2025

Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation.
CoRR, June, 2025

A Survey on Feedback-based Multi-step Reasoning for Large Language Models on Mathematics.
CoRR, February, 2025

PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC.
CoRR, February, 2025

Let Images Speak More: An Efficient Method for Detecting Image Manipulation History.
IEEE Trans. Circuits Syst. Video Technol., 2025

Novel fusion architecture of multi-location blood flow sounds for arteriovenous fistula stenosis diagnosis.
Comput. Methods Programs Biomed., 2025

A Sub-domain Index System for Network Security Situation Assessment.
Proceedings of the Wireless Artificial Intelligent Computing Systems and Applications, 2025

RaCT: Ranking-aware Chain-of-Thought Optimization for LLMs.
Proceedings of the 2025 Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region, 2025

A Unified Reinforcement Learning Framework for Comprehensive UAV Railway Inspection.
Proceedings of the Machine Learning and Artificial Intelligence, 2025

mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Rethinking the Effect of LoRA in Foundation Models for Long-Tailed Recognition.
Proceedings of the ECAI 2025 - 28th European Conference on Artificial Intelligence, 25-30 October 2025, Bologna, Italy, 2025

Smart Contract Reentrancy Vulnerability Localization Using Explainable Graph Neural Networks.
Proceedings of the 49th IEEE Annual Computers, Software, and Applications Conference, 2025

2024
ChainRank-DPO: Chain Rank Direct Preference Optimization for LLM Rankers.
CoRR, 2024

CLERF: Contrastive LEaRning for Full Range Head Pose Estimation.
CoRR, 2024

Full-range Head Pose Geometric Data Augmentations.
CoRR, 2024

NFT1000: A Visual Text Dataset For Non-Fungible Token Retrieval.
CoRR, 2024

NFT1000: A Cross-Modal Dataset For Non-Fungible Token Retrieval.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MIBench: Evaluating Multimodal Large Language Models over Multiple Images.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Using My Artistic Style? You Must Obtain My Authorization.
Proceedings of the Computer Vision - ECCV 2024, 2024

mPLUG-OwI2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

CriticBench: Benchmarking LLMs for Critique-Correct Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Meshfree methods for nonlinear equilibrium radiation diffusion equation with jump coefficient.
Comput. Math. Appl., October, 2023

Body Weight Estimation for Pigs Based on 3D Hybrid Filter and Convolutional Neural Network.
Sensors, September, 2023

TranSkeleton: Hierarchical Spatial-Temporal Transformer for Skeleton-Based Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., August, 2023

mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration.
CoRR, 2023

Learning Semantics-Grounded Vocabulary Representation for Video-Text Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Research on behavior recognition algorithms in classroom scenarios.
Proceedings of the 2023 4th International Conference on Computing, 2023

2022
Interaction-Aware Spatio-Temporal Pyramid Attention Networks for Action Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Performance Comparison of Seven Pretrained Models on a text classification task.
Proceedings of the 2022 5th International Conference on Signal Processing and Machine Learning, 2022

Tweet Sentiment Extraction Using Byte Level Pretrained Language Model∗.
Proceedings of the ICMLC 2022: 14th International Conference on Machine Learning and Computing, Guangzhou, China, February 18, 2022

Exploring Motion Information for Distractor Suppression in Visual Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
Temperature-Insensitive Label-Free Sensors for Human IgG Based on S-Tapered Optical Fiber Sensors.
IEEE Access, 2021

2019
Intelligent monitoring of indoor surveillance video based on deep learning.
Proceedings of the 21st International Conference on Advanced Communication Technology, 2019

TED: A Tetrominoes based edge descriptor.
Proceedings of the 21st International Conference on Advanced Communication Technology, 2019

2014
Automatic objects segmentation with RGB-D cameras.
J. Vis. Commun. Image Represent., 2014

Recognizing object manipulation activities using depth and visual cues.
J. Vis. Commun. Image Represent., 2014

2012
Automatic object segmentation with 3-D cameras.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Recognizing object manipulation activities using depth and visual cues.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Automatic Video Activity Recognition.
Proceedings of the Multimedia Analysis, Processing and Communications, 2011

Robust Detection of Abandoned and Removed Objects in Complex Surveillance Videos.
IEEE Trans. Syst. Man Cybern. Part C, 2011

Automatic video activity detection using compressed domain motion trajectories for H.264 videos.
J. Vis. Commun. Image Represent., 2011

Benchmarking Datasets for Human Activity Recognition.
Proceedings of the Visual Analysis of Humans - Looking at People., 2011

2010
Video activity detection using compressed domain motion trajectories for H.264 videos.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010

Unsupervised action classification using space-time link analysis.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010


  Loading...