Xiawu Zheng

Orcid: 0000-0002-6855-5403

According to our database1, Xiawu Zheng authored at least 112 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
An Information Theory-Inspired Strategy for Automated Network Pruning.
Int. J. Comput. Vis., August, 2025

Training-Free Multimodal Large Language Model Orchestration.
CoRR, August, 2025

UI-AGILE: Advancing GUI Agents with Effective Reinforcement Learning and Precise Inference-Time Grounding.
CoRR, July, 2025

MCP-Zero: Active Tool Discovery for Autonomous LLM Agents.
CoRR, June, 2025

Benchmarking Abstract and Reasoning Abilities Through A Theoretical Perspective.
CoRR, May, 2025

Zooming from Context to Cue: Hierarchical Preference Optimization for Multi-Image MLLMs.
CoRR, May, 2025

Adaptive Fuzzy Positive Learning for Annotation-Scarce Semantic Segmentation.
Int. J. Comput. Vis., March, 2025

QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension.
CoRR, March, 2025

Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective.
CoRR, February, 2025

Towards Efficient Automatic Self-Pruning of Large Language Models.
CoRR, February, 2025

Training-free Anomaly Event Detection via LLM-guided Symbolic Pattern Discovery.
CoRR, February, 2025

Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy.
CoRR, February, 2025

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction.
CoRR, January, 2025

Learning Interleaved Image-Text Comprehension in Vision-Language Large Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Multimodal Quantitative Language for Generative Recommendation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Dynamic Low-Rank Sparse Adaptation for Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Distilling Spatially-Heterogeneous Distortion Perception for Blind Image Quality Assessment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Learning Transition Patterns by Large Language Models for Sequential Recommendation.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Automated Fine-Grained Mixture-of-Experts Quantization.
Proceedings of the Findings of the Association for Computational Linguistics, 2025


Feature Denoising Diffusion Model for Blind Image Quality Assessment.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Dynamic Clustering Convolutional Neural Network.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Aligning Instance Brownian Bridge with Texts for Open-Vocabulary Video Instance Segmentation.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Training-Free Transformer Architecture Search With Zero-Cost Proxy Guided Evolution.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2024

Uncovering the Over-Smoothing Challenge in Image Super-Resolution: Entropy-Based Quantification and Contrastive Optimization.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2024

ARLP: Automatic multi-agent transformer reinforcement learning pruner for one-shot neural network pruning.
Knowl. Based Syst., 2024

Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension.
CoRR, 2024

VITA: Towards Open-Source Interactive Omni Multimodal LLM.
CoRR, 2024

Local Manifold Learning for No-Reference Image Quality Assessment.
CoRR, 2024

Depth-Guided Semi-Supervised Instance Segmentation.
CoRR, 2024

VEGA: Learning Interleaved Image-Text Comprehension in Vision-Language Large Models.
CoRR, 2024

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis.
CoRR, 2024

Multi-Modal Prompt Learning on Blind Image Quality Assessment.
CoRR, 2024

Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization.
CoRR, 2024

Data Interpreter: An LLM Agent For Data Science.
CoRR, 2024

EBFT: Effective and Block-Wise Fine-Tuning for Sparse LLMs.
CoRR, 2024

Feature Denoising Diffusion Model for Blind Image Quality Assessment.
CoRR, 2024

Instance Brownian Bridge as Texts for Open-vocabulary Video Instance Segmentation.
CoRR, 2024

A Hybrid Performance Estimation Strategy for Optimizing Neural Architecture Search.
Proceedings of the Advances in Computational Intelligence Systems, 2024

Multimodal Inplace Prompt Tuning for Open-set Object Detection.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Motion-aware Latent Diffusion Models for Video Frame Interpolation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Cantor: Inspiring Multimodal Chain-of-Thought of MLLM.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Efficient Event Stream Super-Resolution with Recursive Multi-Branch Fusion.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Outlier-aware Slicing for Post-Training Quantization in Vision Transformer.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Integrating Global Context Contrast and Local Sensitivity for Blind Image Quality Assessment.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise Sensitivity.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Interaction-based Retrieval-augmented Diffusion Models for Protein-specific 3D Molecule Generation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

AffineQuant: Affine Transformation Quantization for Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Protein-Ligand Interaction Prior for Binding-aware 3D Molecule Diffusion Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Functionally Similar Multi-Label Knowledge Distillation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Multi-branch Collaborative Learning Network for 3D Visual Grounding.
Proceedings of the Computer Vision - ECCV 2024, 2024

Textual Grounding for Open-Vocabulary Visual Information Extraction in Layout-Diversified Documents.
Proceedings of the Computer Vision - ECCV 2024, 2024

GraCo: Granularity-Controllable Interactive Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Bilateral Event Mining and Complementary for Event Stream Super-Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

RepAn: Enhanced Annealing through Re-parameterization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Solving the Catastrophic Forgetting Problem in Generalized Category Discovery.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Semi-Supervised Blind Image Quality Assessment through Knowledge Distillation and Incremental Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Binding-Adaptive Diffusion Models for Structure-Based Drug Design.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
DDPNAS: Efficient Neural Architecture Search via Dynamic Distribution Pruning.
Int. J. Comput. Vis., May, 2023

Adaptive Feature Selection for No-Reference Image Quality Assessment using Contrastive Mitigating Semantic Noise Sensitivity.
CoRR, 2023

Less is More: Learning Reference Knowledge Using No-Reference Image Quality Assessment.
CoRR, 2023

DLIP: Distilling Language-Image Pre-training.
CoRR, 2023

A Unified Framework for 3D Point Cloud Visual Grounding.
CoRR, 2023

MetaGPT: Meta Programming for Multi-Agent Collaborative Framework.
CoRR, 2023

MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models.
CoRR, 2023

Probabilistic Modeling: Proving the Lottery Ticket Hypothesis in Spiking Neural Network.
CoRR, 2023

Two-Stage Deep Learning Segmentation for Tiny Brain Regions.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

GLViG: Global and Local Vision GNN May Be What You Need for Vision.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Discover and Align Taxonomic Context Priors for Open-world Semi-Supervised Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Knowledge Prompt-tuning for Sequential Recommendation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Event-Diffusion: Event-Based Image Reconstruction and Restoration with Diffusion Models.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

LocLoc: Low-level Cues and Local-area Guides for Weakly Supervised Object Localization.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Adaptive PromptNet for Auxiliary Glioma Diagnosis Without Contrast-Enhanced MRI.
Proceedings of the 20th IEEE International Symposium on Biomedical Imaging, 2023

Iterative Data Refinement for Self-Supervised Learning MR Image Reconstruction.
Proceedings of the 20th IEEE International Symposium on Biomedical Imaging, 2023

Digest: Deeply Supervised Knowledge Transfer Network Learning for Brain Tumor Segmentation with Incomplete Multi-Modal MRI Scans.
Proceedings of the 20th IEEE International Symposium on Biomedical Imaging, 2023

A Unified Framework for Soft Threshold Pruning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Sparse Neural Networks with Identity Layers.
Proceedings of the Image and Graphics - 12th International Conference, 2023

AutoDiffusion: Training-Free Optimization of Time Steps and Architectures for Automated Diffusion Model Acceleration.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Automatic Network Pruning via Hilbert-Schmidt Independence Criterion Lasso under Information Bottleneck Principle.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Meta Architecture for Point Cloud Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

A Novel Neighbor Aggregation Function for Medical Point Cloud Analysis.
Proceedings of the Advances in Computer Graphics, 2023

Data-Efficient Image Quality Assessment with Attention-Panel Decoder.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

OMPQ: Orthogonal Mixed Precision Quantization.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Meta Architecure for Point Cloud Analysis.
CoRR, 2022

Iterative Data Refinement for Self-Supervised MR Image Reconstruction.
CoRR, 2022

What Hinders Perceptual Quality of PSNR-oriented Methods?
CoRR, 2022

Searching Lightweight Neural Network for Image Signal Processing.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Training-free Transformer Architecture Search.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Neural Architecture Search with Representation Mutual Information.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Aggregating Global and Local Visual Representation for Vehicle Re-IDentification.
IEEE Trans. Multim., 2021

Evolving Fully Automated Machine Learning via Life-Long Knowledge Anchors.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

MIGO-NAS: Towards Fast and Generalizable Neural Architecture Search.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Binarized Neural Architecture Search for Efficient Object Recognition.
Int. J. Comput. Vis., 2021

OMPQ: Orthogonal Mixed Precision Quantization.
CoRR, 2021

An Information Theory-inspired Strategy for Automatic Network Pruning.
CoRR, 2021

You Only Compress Once: Towards Effective and Elastic BERT Compression via Exploit-Explore Stochastic Nature Gradient.
CoRR, 2021

On Evolving Attention Towards Domain Adaptation.
CoRR, 2021

CDP: Towards Optimal Filter Pruning via Class-wise Discriminative Power.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

EC-DARTS: Inducing Equalized and Consistent Optimization into DARTS.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
PAMS: Quantized Super-Resolution via Parameterized Max Scale.
CoRR, 2020

PAMS: Quantized Super-Resolution via Parameterized Max Scale.
Proceedings of the Computer Vision - ECCV 2020, 2020

Rethinking Performance Estimation in Neural Architecture Search.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Binarized Neural Architecture Search.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Dynamic Distribution Pruning for Efficient Network Architecture Search.
CoRR, 2019

Multinomial Distribution Learning for Effective Neural Architecture Search.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Towards Optimal Fine Grained Retrieval via Decorrelated Centralized Loss with Normalize-Scale Layer.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Centralized Ranking Loss with Weakly Supervised Localization for Fine-Grained Object Retrieval.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018


  Loading...