Yaxiong Wang

Orcid: 0000-0001-6596-8117

According to our database¹, Yaxiong Wang authored at least 81 papers between 2017 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

REVEAL: Reference-Grounded Reasoning for Multimodal Manipulation Detection.

[BibT_eX]

[DOI]

CoRR, May, 2026

OmniVL-Guard Pro: A Tool-Augmented Agent for Omnibus Vision-Language Forensics.

[BibT_eX]

[DOI]

CoRR, May, 2026

CanonSLR: Canonical-View Guided Multi-View Continuous Sign Language Recognition.

[BibT_eX]

[DOI]

CoRR, April, 2026

Pretrain-then-Adapt: Uncertainty-Aware Test-Time Adaptation for Text-based Person Search.

[BibT_eX]

[DOI]

CoRR, April, 2026

Can Video Diffusion Models Predict Past Frames? Bidirectional Cycle Consistency for Reversible Interpolation.

[BibT_eX]

[DOI]

CoRR, April, 2026

Look, Compare and Draw: Differential Query Transformer for Automatic Oil Painting.

[BibT_eX]

[DOI]

CoRR, March, 2026

Process Over Outcome: Cultivating Forensic Reasoning for Generalizable Multimodal Manipulation Detection.

[BibT_eX]

[DOI]

CoRR, March, 2026

OmniVL-Guard: Towards Unified Vision-Language Forgery Detection and Grounding via Balanced RL.

[BibT_eX]

[DOI]

CoRR, February, 2026

Tears or Cheers? Benchmarking LLMs via Culturally Elicited Distinct Affective Responses.

[BibT_eX]

[DOI]

CoRR, January, 2026

LEViT: Locally Enhanced Vision Transformer for Efficient Object Re-Identification.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2026

Dynamic Correlation-Guided Disentanglement and Contrastive Learning for RGB-D Cross-Modal Re-Identification.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Forensics Secur., 2026

Minimizing the pretraining gap: Domain-aligned text-based person retrieval.

[BibT_eX]

[DOI]

Pattern Recognit., 2026

Multi-level Style Preference Optimization: An Adaptive Detection Framework for Human-Machine Hybrid Text.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Towards Unified Multimodal Misinformation Detection in Social Media: A Benchmark Dataset and Baseline.

[BibT_eX]

[DOI]

CoRR, September, 2025

TDEdit: A Unified Diffusion Framework for Text-Drag Guided Image Manipulation.

[BibT_eX]

[DOI]

CoRR, September, 2025

FakeSV-VLM: Taming VLM for Detecting Fake Short-Video News via Progressive Mixture-Of-Experts Adapter.

[BibT_eX]

[DOI]

CoRR, August, 2025

Motion is the Choreographer: Learning Latent Pose Dynamics for Seamless Sign Language Generation.

[BibT_eX]

[DOI]

CoRR, August, 2025

Towards Scalable Video Anomaly Retrieval: A Synthetic Video-Text Benchmark.

[BibT_eX]

[DOI]

CoRR, June, 2025

TIGeR: Text-Instructed Generation and Refinement for Template-Free Hand-Object Interaction.

[BibT_eX]

[DOI]

CoRR, June, 2025

SSAM: Self-Supervised Association Modeling for Test-Time Adaption.

[BibT_eX]

[DOI]

CoRR, June, 2025

The Coherence Trap: When MLLM-Crafted Narratives Exploit Manipulated Visual Contexts.

[BibT_eX]

[DOI]

CoRR, May, 2025

Every Painting Awakened: A Training-free Framework for Painting-to-Animation Generation.

[BibT_eX]

[DOI]

CoRR, March, 2025

Text-Driven Diffusion Model for Sign Language Production.

[BibT_eX]

[DOI]

CoRR, March, 2025

Scale Up Composed Image Retrieval Learning via Modification Text Generation.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2025

Towards Geometric-Photometric Joint Alignment for facial mesh registration.

[BibT_eX]

[DOI]

Xizhi Wang

Yaxiong Wang

Mengjian Li

Comput. Graph., 2025

Multimodal learning for non-small cell lung cancer prognosis.

[BibT_eX]

[DOI]

Biomed. Signal Process. Control., 2025

MORE'25 Multimedia Object Re-ID: Advancements, Challenges, and Opportunities.

[BibT_eX]

[DOI]

Proceedings of the Companion Proceedings of the ACM on Web Conference 2025, 2025

Beyond General Alignment: Fine-Grained Entity-Centric Image-Text Matching with Multimodal Attentive Experts.

[BibT_eX]

[DOI]

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

A Survey for Point Prompt of Segment Anything Model.

[BibT_eX]

[DOI]

Proceedings of the MMAsia '25 Workshops: Proceedings of the 7th ACM International Conference on Multimedia in Asia, 2025

Beyond Artificial Misalignment: Detecting and Grounding Semantic-Coordinated Multimodal Manipulations.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Domain-Agnostic Neural Oil Painting via Normalization Affine Test-Time Adaptation.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Unsupervised Cross-Modal Person Search via Progressive Diverse Text Generation.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Towards Fine-Grained Emotion Understanding via Skeleton-Based Micro-Gesture Recognition.

[BibT_eX]

[DOI]

Proceedings of IJCAI-2025 Workshop & Challenge on Human Behavior Analysis for Emotion Understanding (MiGA 2025), 2025

Towards Micro-Action Recognition with Limited Annotations: An Asynchronous Pseudo Labeling and Training Approach.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Knowledge Swapping via Learning and Unlearning.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Consistency-Aware Fake Videos Detection on Short Video Platforms.

[BibT_eX]

[DOI]

Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

Beyond Walking: A Large-Scale Image-Text Benchmark for Text-Based Person Anomaly Search.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

FakeSV-VLM: Taming VLM for Detecting Fake Short-Video News via Progressive Mixture-Of-Experts Adapter.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SLRTP2025 Sign Language Production Challenge: Methodology, Results and Future Work.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Distilling Structured Rationale from Large Language Models to Small Language Models for Abstractive Summarization.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Diversity-Learning Block: Conquer Feature Homogenization of Multibranch.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., June, 2024

ReGO: Reference-Guided Outpainting for Scenery Image.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2024

A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization.

[BibT_eX]

[DOI]

CoRR, 2024

Neighborhood Commonality-aware Evolution Network for Continuous Generalized Category Discovery.

[BibT_eX]

[DOI]

CoRR, 2024

EntityCLIP: Entity-Centric Image-Text Matching via Multimodal Attentive Contrastive Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Knowledge Adaptation Network for Few-Shot Class-Incremental Learning.

[BibT_eX]

[DOI]

CoRR, 2024

CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval.

[BibT_eX]

[DOI]

CoRR, 2024

FedHPL: Efficient Heterogeneous Federated Learning with Prompt Tuning and Logit Distillation.

[BibT_eX]

[DOI]

CoRR, 2024

Beyond Known Clusters: Probe New Prototypes for Efficient Generalized Class Discovery.

[BibT_eX]

[DOI]

CoRR, 2024

CaLa: Complementary Association Learning for Augmenting Comoposed Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

MORE'24 Multimedia Object Re-ID: Advancements, Challenges, and Opportunities.

[BibT_eX]

[DOI]

Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

2023

PicassoNet: Searching Adaptive Architecture for Efficient Facial Landmark Localization.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., December, 2023

Learning to complement: Relation complementation network for few-shot class-incremental learning.

[BibT_eX]

[DOI]

Knowl. Based Syst., December, 2023

SCGNet: Shifting and Cascaded Group Network.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., September, 2023

Generative label fused network for image-text matching.

[BibT_eX]

[DOI]

Knowl. Based Syst., March, 2023

Diff attention: A novel attention scheme for person re-identification.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., February, 2023

Optimal Noise pursuit for Augmenting Text-to-Video Generation.

[BibT_eX]

[DOI]

CoRR, 2023

DiverseMotion: Towards Diverse Human Motion Generation via Discrete Diffusion.

[BibT_eX]

[DOI]

CoRR, 2023

Knowledge Transfer-Driven Few-Shot Class-Incremental Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Tea Buds Detection in Complex Background Based on Improved YOLOv7.

[BibT_eX]

[DOI]

IEEE Access, 2023

Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022

An Improved Apple Object Detection Method Based on Lightweight YOLOv4 in Complex Backgrounds.

[BibT_eX]

[DOI]

Chenxi Zhang

Feng Kang

Yaxiong Wang

Remote. Sens., 2022

Branch Identification and Junction Points Location for Apple Trees Based on Deep Learning.

[BibT_eX]

[DOI]

Remote. Sens., 2022

Scale adaption-guided human face detection.

[BibT_eX]

[DOI]

Knowl. Based Syst., 2022

Multimodal Learning for Non-small Cell Lung Cancer Prognosis.

[BibT_eX]

[DOI]

CoRR, 2022

Learning a Condensed Frame for Memory-Efficient Video Class-Incremental Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021

PFAN++: Bi-Directional Image-Text Retrieval With Position Focused Attention Network.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2021

Sketch-Guided Scenery Image Outpainting.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

Canopy Parameter Estimation of Citrus grandis var. Longanyou Based on LiDAR 3D Point Clouds.

[BibT_eX]

[DOI]

Remote. Sens., 2021

Social image retrieval based on topic diversity.

[BibT_eX]

[DOI]

Yaxiong Wang

Li Zhu

Xueming Qian

Multim. Tools Appl., 2021

Generating Superpixels for High-resolution Images with Decoupled Patch Calibration.

[BibT_eX]

[DOI]

CoRR, 2021

AINet: Association Implantation for Superpixel Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Unsupervised Fuzzy Neural Network for Image Clustering.

[BibT_eX]

[DOI]

Proceedings of the 30th IEEE International Conference on Fuzzy Systems, 2021

2020

Attentive Stacked Denoising Autoencoder With Bi-LSTM for Personalized Context-Aware Citation Recommendation.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

DONet: Dual Objective Networks for Skin Lesion Segmentation.

[BibT_eX]

[DOI]

CoRR, 2020

Semantic Gated Network for Efficient News Representation.

[BibT_eX]

[DOI]

Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

2019

Adaptive Robust Control of Multi-Axle Vehicle Electro-Hydraulic Power Steering System With Uncertain Tire Steering Resistance Moment.

[BibT_eX]

[DOI]

IEEE Access, 2019

Position Focused Attention Network for Image-Text Matching.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

2018

Joint Hypergraph Learning for Tag-Based Image Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2018

2017

Image Re-Ranking Based on Topic Diversity.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2017

Yaxiong Wang

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...