Keze Wang

Orcid: 0000-0002-7817-8306

According to our database¹, Keze Wang authored at least 131 papers between 2013 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

How Do Document Parsers Break? Auditing Structural Vulnerability in Document Intelligence.

[BibT_eX]

[DOI]

CoRR, May, 2026

LASAR: Towards Spatio-temporal Reasoning with Latent Cognitive Map.

[BibT_eX]

[DOI]

CoRR, May, 2026

Exploring Talking Head Models with Adjacent Frame Prior for Speech-Preserving Facial Expression Manipulation.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., April, 2026

The Fourth Challenge on Image Super-Resolution (⨉4) at NTIRE 2026: Benchmark Results and Method Overview.

[BibT_eX]

[DOI]

et al.

CoRR, April, 2026

ChunQiuTR: Time-Keyed Temporal Retrieval in Classical Chinese Annals.

[BibT_eX]

[DOI]

CoRR, April, 2026

TAG: Target-Agnostic Guidance for Stable Object-Centric Inference in Vision-Language-Action Models.

[BibT_eX]

[DOI]

CoRR, March, 2026

DreamSAC: Learning Hamiltonian World Models via Symmetry Exploration.

[BibT_eX]

[DOI]

CoRR, March, 2026

A Scalable Curiosity-Driven Game-Theoretic Framework for Long-Tail Multi-Label Learning in Data Mining.

[BibT_eX]

[DOI]

Jing Yang

Keze Wang

CoRR, February, 2026

AgriWorld:A World Tools Protocol Framework for Verifiable Agricultural Reasoning with Code-Executing LLM Agents.

[BibT_eX]

[DOI]

CoRR, February, 2026

RADAR: Benchmarking Vision-Language-Action Generalization via Real-World Dynamics, Spatial-Physical Intelligence, and Autonomous Evaluation.

[BibT_eX]

[DOI]

CoRR, February, 2026

Process-of-Thought Reasoning for Videos.

[BibT_eX]

[DOI]

CoRR, February, 2026

Spectral Gating Networks.

[BibT_eX]

[DOI]

CoRR, February, 2026

Rational ANOVA Networks.

[BibT_eX]

[DOI]

CoRR, February, 2026

Why Keep Your Doubts to Yourself? Trading Visual Uncertainties in Multi-Agent Bandit Systems.

[BibT_eX]

[DOI]

CoRR, January, 2026

ResAgent: Entropy-based Prior Point Discovery and Visual Reasoning for Referring Expression Segmentation.

[BibT_eX]

[DOI]

CoRR, January, 2026

Weather-R1: Logically Consistent Reinforcement Fine-Tuning for Multimodal Reasoning in Meteorology.

[BibT_eX]

[DOI]

CoRR, January, 2026

3D-Agent:Tri-Modal Multi-Agent Collaboration for Scalable 3D Object Annotation.

[BibT_eX]

[DOI]

CoRR, January, 2026

Stable Language Guidance for Vision-Language-Action Models.

[BibT_eX]

[DOI]

CoRR, January, 2026

Exploiting Temporal Audio-Visual Correlation Embedding for Audio-Driven One-Shot Talking Head Animation.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2026

Toward Top-Down Reasoning: An Explainable Multi-Agent Approach for Visual Question Answering.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2026

Stable Language Guidance for Vision-Language-Action Models.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

Reinforcement Learning for Diffusion LLMs via Energy-Based Gibbs Alignment.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

Provably Safe Offline-to-Online RL: Decoupling Learning from Data-Driven Safety Enforcement.

[BibT_eX]

[DOI]

Kaitong Cai

Jusheng Zhang

Keze Wang

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

LLM-CAS: Dynamic Neuron Perturbation for Real-Time Hallucination Correction.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Top-Down Semantic Refinement for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

ORACLE: Optimizing Reasoning Abilities of Large Language Models via Constraint-Led Synthetic Data Elicitation.

[BibT_eX]

[DOI]

Zhuojie Yang

Wentao Wan

Keze Wang

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

HiVA: Self-organized Hierarchical Variable Agent via Goal-driven Semantic-Topological Evolution.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

3DAlign-DAER: Dynamic Attention Policy and Efficient Retrieval Strategy for Fine-grained 3D-Text Alignment at Scale.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Cost-Effective Communication: An Auction-based Method for Language Agent Interaction.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

RaCoT: Plug-and-Play Contrastive Example Generation Mechanism for Enhanced LLM Reasoning Reliability.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Self-Rewarded Multimodal Coherent Reasoning Across Diverse Visual Domains.

[BibT_eX]

[DOI]

CoRR, December, 2025

CoAgent: Collaborative Planning and Consistency Agent for Coherent Video Generation.

[BibT_eX]

[DOI]

CoRR, December, 2025

RevFFN: Memory-Efficient Full-Parameter Fine-Tuning of Mixture-of-Experts LLMs with Reversible Blocks.

[BibT_eX]

[DOI]

CoRR, December, 2025

FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models.

[BibT_eX]

[DOI]

CoRR, December, 2025

SirenPose: Dynamic Scene Reconstruction via Geometric Supervision.

[BibT_eX]

[DOI]

CoRR, December, 2025

LLM-CAS: Dynamic Neuron Perturbation for Real-Time Hallucination Correction.

[BibT_eX]

[DOI]

CoRR, December, 2025

PTTA: A Pure Text-to-Animation Framework for High-Quality Creation.

[BibT_eX]

[DOI]

CoRR, December, 2025

Reflective Confidence: Correcting Reasoning Flaws via Online Self-Correction.

[BibT_eX]

[DOI]

Qinglin Zeng

Jing Yang

Keze Wang

CoRR, December, 2025

GTMA: Dynamic Representation Optimization for OOD Vision-Language Models.

[BibT_eX]

[DOI]

Jensen Zhang

Ningyuan Liu

Keze Wang

CoRR, December, 2025

Adaptive-VoCo: Complexity-Aware Visual Token Compression for Vision-Language Models.

[BibT_eX]

[DOI]

Xiaoyang Guo

Keze Wang

CoRR, December, 2025

Large Language Models as Discounted Bayesian Filters.

[BibT_eX]

[DOI]

Jensen Zhang

Jing Yang

Keze Wang

CoRR, December, 2025

STORM: Search-Guided Generative World Models for Robotic Manipulation.

[BibT_eX]

[DOI]

CoRR, December, 2025

Massive Editing for Large Language Models Based on Dynamic Weight Generation.

[BibT_eX]

[DOI]

CoRR, December, 2025

Enhancing Visual Programming for Visual Reasoning via Probabilistic Graphs.

[BibT_eX]

[DOI]

CoRR, December, 2025

HybridToken-VLM: Hybrid Token Compression for Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, December, 2025

MM-CoT:A Benchmark for Probing Visual Chain-of-Thought Reasoning in Multimodal Models.

[BibT_eX]

[DOI]

CoRR, December, 2025

PhyDetEx: Detecting and Explaining the Physical Plausibility of T2V Models.

[BibT_eX]

[DOI]

Zeqing Wang

Keze Wang

Lei Zhang

CoRR, December, 2025

Causal Invariance and Counterfactual Learning Driven Cooperative Game for Multi-Label Classification.

[BibT_eX]

[DOI]

CoRR, December, 2025

ℰ<sub>0</sub>: Enhancing Generalization and Fine-Grained Control in VLA Models via Continuized Discrete Diffusion.

[BibT_eX]

[DOI]

CoRR, November, 2025

MM-OPERA: Benchmarking Open-ended Association Reasoning for Large Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

Guardian: Decoupling Exploration from Safety in Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, October, 2025

Agent-GSPO: Communication-Efficient Multi-Agent Systems via Group Sequence Policy Optimization.

[BibT_eX]

[DOI]

CoRR, October, 2025

Backward-Friendly Optimization: Training Large Language Models with Approximate Gradients under Memory Constraints.

[BibT_eX]

[DOI]

CoRR, October, 2025

Learning Dynamics of VLM Finetuning.

[BibT_eX]

[DOI]

CoRR, October, 2025

Failure-Driven Workflow Refinement.

[BibT_eX]

[DOI]

CoRR, October, 2025

VideoVerse: How Far is Your T2V Generator from a World Model?

[BibT_eX]

[DOI]

CoRR, October, 2025

CF-VLM:CounterFactual Vision-Language Fine-tuning.

[BibT_eX]

[DOI]

CoRR, June, 2025

From Motion to Behavior: Hierarchical Modeling of Humanoid Generative Behavior Control.

[BibT_eX]

[DOI]

CoRR, June, 2025

Continuous Value Assignment: A Doubly Robust Data Augmentation for Off-Policy Learning.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., May, 2025

TimeCausality: Evaluating the Causal Ability in Time Dimension for Vision Language Models.

[BibT_eX]

[DOI]

CoRR, May, 2025

Exploiting Temporal Audio-Visual Correlation Embedding for Audio-Driven One-Shot Talking Head Animation.

[BibT_eX]

[DOI]

CoRR, April, 2025

Kolmogorov-Arnold Fourier Networks.

[BibT_eX]

[DOI]

CoRR, February, 2025

SQLNet: Scale-Modulated Query and Localization Network for Few-Shot Class-Agnostic Counting.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2025

GAM-Agent: Game-Theoretic and Uncertainty-Aware Collaboration for Complex Visual Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

MAT-Agent: Adaptive Multi-Agent Training Optimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Robust Egocentric Referring Video Object Segmentation via Dual-Modal Causal Intervention.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

DART: Dual Adaptive Refinement Transfer for Open-Vocabulary Multi-Label Recognition.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

AlphaAgent: LLM-Driven Alpha Mining with Regularized Exploration to Counteract Alpha Decay.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

High-Fidelity Face Swapping via Fine-grained Attribute Control with Diffusion Models.

[BibT_eX]

[DOI]

Zikang Zhou

Keze Wang

Proceedings of the International Joint Conference on Neural Networks, 2025

KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

OSC: Cognitive Orchestration through Dynamic Knowledge Alignment in Multi-Agent LLM Collaboration.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

DrDiff: Dynamic Routing Diffusion with Hierarchical Attention for Breaking the Efficiency-Quality Trade-off.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Towards More Efficient Post-training via Fourier Domain Adapter Framework.

[BibT_eX]

[DOI]

Yijia Fan

Jusheng Zhang

Keze Wang

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

CCG: Rare-Label Prediction via Neural SEM-Driven Causal Game.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Is this Generated Person Existed in Real-world? Fine-grained Detecting and Calibrating Abnormal Human-body.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Reproducible Vision-Language Models Meet Concepts Out of Pre-Training.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SR-FoT: A Syllogistic-Reasoning Framework of Thought for Large Language Models Tackling Knowledge-based Reasoning Tasks.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Improving Network Interpretability via Explanation Consistency Evaluation.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2024

Multi-Person 3D Pose Estimation With Occlusion Reasoning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2024

Category-Adaptive Cross-Modal Semantic Refinement and Transfer for Open-Vocabulary Multi-Label Recognition.

[BibT_eX]

[DOI]

CoRR, 2024

On Training Data Influence of GPT Models.

[BibT_eX]

[DOI]

CoRR, 2024

Gesture Generation Via Diffusion Model with Attention Mechanism.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Video Super-Resolution Transformer with Masked Inter&Intra-Frame Attention.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Adaptive Prompt Routing for Arbitrary Text Style Transfer with Pre-trained Language Models.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Mimic: Speaking Style Disentanglement for Speech-Driven 3D Facial Animation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Towards Causality-Aware Inferring: A Sequential Discriminative Approach for Medical Diagnosis.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Towards Top-Down Reasoning: An Explainable Multi-Agent Approach for Visual Question Answering.

[BibT_eX]

[DOI]

CoRR, 2023

VisualProg Distiller: Learning to Fine-tune Non-differentiable Visual Programming Frameworks.

[BibT_eX]

[DOI]

CoRR, 2023

Towards CausalGPT: A Multi-Agent Approach for Faithful Knowledge Reasoning via Promoting Causal Consistency in LLMs.

[BibT_eX]

[DOI]

CoRR, 2023

FIRE: Fine Implicit Reconstruction Enhancement with Detailed Body Part Labels and Geometric Features.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Interactive Learning for Interpretable Visual Recognition via Semantic-Aware Self-Teaching Framework.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

2022

Knowledge-Routed Visual Question Reasoning: Challenges for Deep Representation Embedding.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2022

TCGL: Temporal Contrastive Graph for Self-Supervised Video Representation Learning.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Enhancing Prototypical Few-Shot Learning By Leveraging The Local-Level Strategy.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Semantics-Aware Adaptive Knowledge Distillation for Sensor-to-Vision Action Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

CX-ToM: Counterfactual Explanations with Theory-of-Mind for Enhancing Human Trust in Image Recognition Models.

[BibT_eX]

[DOI]

CoRR, 2021

Temporal Contrastive Graph for Self-supervised Video Representation Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Continuous Transition: Improving Sample Efficiency for Continuous Control Problems via MixUp.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Solving Inefficiency of Self-supervised Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Linguistically Routing Capsule Network for Out-of-distribution Visual Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Mind the Context: The Impact of Contextualization in Neural Module Networks for Grounding Visual Referring Expressions.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020

3D Human Pose Machines with Self-Supervised Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2020

Face Hallucination by Attentive Sequence Optimization with Reinforcement Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2020

Linguistically Driven Graph Capsule Network for Visual Question Reasoning.

[BibT_eX]

[DOI]

CoRR, 2020

Learning Reinforced Agents with Counterfactual Simulation for Medical Automatic Diagnosis.

[BibT_eX]

[DOI]

CoRR, 2020

Grammatically Recognizing Images with Tree Convolution.

[BibT_eX]

[DOI]

Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

2019

Cost-Effective Object Detection: Active Sample Mining With Switchable Selection Criteria.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2019

Instance-aware representation learning and association for online multi-person tracking.

[BibT_eX]

[DOI]

Pattern Recognit., 2019

Adaptively Connected Neural Networks.

[BibT_eX]

[DOI]

Guangrun Wang

Keze Wang

Liang Lin

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Deep Co-Space: Sample Mining Across Feature Transformation for Semi-Supervised Learning.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2018

Active Self-Paced Learning for Cost-Effective and Progressive Face Identification.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2018

Embedding Temporally Consistent Depth Recovery for Real-time Dense Mapping in Visual-inertial Odometry.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Convolutional Memory Blocks for Depth Data Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Towards Human-Machine Cooperation: Self-Supervised Sample Mining for Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Flow Guided Recurrent Neural Encoder for Video Salient Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Structure-Preserving Image Super-Resolution via Contextualized Multitask Learning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2017

Cost-Effective Active Learning for Deep Image Classification.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2017

Structure-Preserving Image Super-resolution via Contextualized Multi-task Learning.

[BibT_eX]

[DOI]

CoRR, 2017

Image Retrieval with Attribute-Associated Auxiliary References.

[BibT_eX]

[DOI]

Proceedings of the 2017 International Conference on Digital Image Computing: Techniques and Applications, 2017

Fine-Grained Butterfly Recognition with Deep Residual Networks: A New Baseline and Benchmark.

[BibT_eX]

[DOI]

Proceedings of the 2017 International Conference on Digital Image Computing: Techniques and Applications, 2017

Recurrent 3D Pose Sequence Machines.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Face Recognition via Heuristic Deep Active Learning.

[BibT_eX]

[DOI]

Proceedings of the Biometric Recognition - 12th Chinese Conference, 2017

2016

A Deep Structured Model with Radius-Margin Bound for 3D Human Activity Recognition.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2016

Human Pose Estimation from Depth Images via Inference Embedded Multi-task Learning.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Learning a lightweight deep convolutional network for joint age and gender recognition.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Local- and holistic-structure preserving image super resolution via deep joint component learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Dictionary Pair Classifier Driven Convolutional Neural Networks for Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

PISA: Pixelwise Image Saliency by Aggregating Complementary Appearance Contrast Measures With Edge-Preserving Coherence.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2015

2014

3D Human Activity Recognition with Reconfigurable Convolutional Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

2013

PISA: Pixelwise Image Saliency by Aggregating Complementary Appearance Contrast Measures with Spatial Priors.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Keze Wang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...