Yaxiong Wang

Orcid: 0000-0001-6596-8117

According to our database1, Yaxiong Wang authored at least 81 papers between 2017 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
REVEAL: Reference-Grounded Reasoning for Multimodal Manipulation Detection.
CoRR, May, 2026

OmniVL-Guard Pro: A Tool-Augmented Agent for Omnibus Vision-Language Forensics.
CoRR, May, 2026

CanonSLR: Canonical-View Guided Multi-View Continuous Sign Language Recognition.
CoRR, April, 2026

Pretrain-then-Adapt: Uncertainty-Aware Test-Time Adaptation for Text-based Person Search.
CoRR, April, 2026

Can Video Diffusion Models Predict Past Frames? Bidirectional Cycle Consistency for Reversible Interpolation.
CoRR, April, 2026

Look, Compare and Draw: Differential Query Transformer for Automatic Oil Painting.
CoRR, March, 2026

Process Over Outcome: Cultivating Forensic Reasoning for Generalizable Multimodal Manipulation Detection.
CoRR, March, 2026

OmniVL-Guard: Towards Unified Vision-Language Forgery Detection and Grounding via Balanced RL.
CoRR, February, 2026

Tears or Cheers? Benchmarking LLMs via Culturally Elicited Distinct Affective Responses.
CoRR, January, 2026

LEViT: Locally Enhanced Vision Transformer for Efficient Object Re-Identification.
IEEE Trans. Multim., 2026

Dynamic Correlation-Guided Disentanglement and Contrastive Learning for RGB-D Cross-Modal Re-Identification.
IEEE Trans. Inf. Forensics Secur., 2026

Minimizing the pretraining gap: Domain-aligned text-based person retrieval.
Pattern Recognit., 2026

Multi-level Style Preference Optimization: An Adaptive Detection Framework for Human-Machine Hybrid Text.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Towards Unified Multimodal Misinformation Detection in Social Media: A Benchmark Dataset and Baseline.
CoRR, September, 2025

TDEdit: A Unified Diffusion Framework for Text-Drag Guided Image Manipulation.
CoRR, September, 2025

FakeSV-VLM: Taming VLM for Detecting Fake Short-Video News via Progressive Mixture-Of-Experts Adapter.
CoRR, August, 2025

Motion is the Choreographer: Learning Latent Pose Dynamics for Seamless Sign Language Generation.
CoRR, August, 2025

Towards Scalable Video Anomaly Retrieval: A Synthetic Video-Text Benchmark.
CoRR, June, 2025

TIGeR: Text-Instructed Generation and Refinement for Template-Free Hand-Object Interaction.
CoRR, June, 2025

SSAM: Self-Supervised Association Modeling for Test-Time Adaption.
CoRR, June, 2025

The Coherence Trap: When MLLM-Crafted Narratives Exploit Manipulated Visual Contexts.
CoRR, May, 2025

Every Painting Awakened: A Training-free Framework for Painting-to-Animation Generation.
CoRR, March, 2025

Text-Driven Diffusion Model for Sign Language Production.
CoRR, March, 2025

Scale Up Composed Image Retrieval Learning via Modification Text Generation.
IEEE Trans. Multim., 2025

Towards Geometric-Photometric Joint Alignment for facial mesh registration.
Comput. Graph., 2025

Multimodal learning for non-small cell lung cancer prognosis.
Biomed. Signal Process. Control., 2025

MORE'25 Multimedia Object Re-ID: Advancements, Challenges, and Opportunities.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2025, 2025

Beyond General Alignment: Fine-Grained Entity-Centric Image-Text Matching with Multimodal Attentive Experts.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

A Survey for Point Prompt of Segment Anything Model.
Proceedings of the MMAsia '25 Workshops: Proceedings of the 7th ACM International Conference on Multimedia in Asia, 2025

Beyond Artificial Misalignment: Detecting and Grounding Semantic-Coordinated Multimodal Manipulations.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Domain-Agnostic Neural Oil Painting via Normalization Affine Test-Time Adaptation.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Unsupervised Cross-Modal Person Search via Progressive Diverse Text Generation.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Towards Fine-Grained Emotion Understanding via Skeleton-Based Micro-Gesture Recognition.
Proceedings of IJCAI-2025 Workshop & Challenge on Human Behavior Analysis for Emotion Understanding (MiGA 2025), 2025

Towards Micro-Action Recognition with Limited Annotations: An Asynchronous Pseudo Labeling and Training Approach.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Knowledge Swapping via Learning and Unlearning.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Consistency-Aware Fake Videos Detection on Short Video Platforms.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

Beyond Walking: A Large-Scale Image-Text Benchmark for Text-Based Person Anomaly Search.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

FakeSV-VLM: Taming VLM for Detecting Fake Short-Video News via Progressive Mixture-Of-Experts Adapter.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SLRTP2025 Sign Language Production Challenge: Methodology, Results and Future Work.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Distilling Structured Rationale from Large Language Models to Small Language Models for Abstractive Summarization.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Diversity-Learning Block: Conquer Feature Homogenization of Multibranch.
IEEE Trans. Neural Networks Learn. Syst., June, 2024

ReGO: Reference-Guided Outpainting for Scenery Image.
IEEE Trans. Image Process., 2024

A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization.
CoRR, 2024

Neighborhood Commonality-aware Evolution Network for Continuous Generalized Category Discovery.
CoRR, 2024

EntityCLIP: Entity-Centric Image-Text Matching via Multimodal Attentive Contrastive Learning.
CoRR, 2024

Knowledge Adaptation Network for Few-Shot Class-Incremental Learning.
CoRR, 2024

CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval.
CoRR, 2024

FedHPL: Efficient Heterogeneous Federated Learning with Prompt Tuning and Logit Distillation.
CoRR, 2024

Beyond Known Clusters: Probe New Prototypes for Efficient Generalized Class Discovery.
CoRR, 2024

CaLa: Complementary Association Learning for Augmenting Comoposed Image Retrieval.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

MORE'24 Multimedia Object Re-ID: Advancements, Challenges, and Opportunities.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

2023
PicassoNet: Searching Adaptive Architecture for Efficient Facial Landmark Localization.
IEEE Trans. Neural Networks Learn. Syst., December, 2023

Learning to complement: Relation complementation network for few-shot class-incremental learning.
Knowl. Based Syst., December, 2023

SCGNet: Shifting and Cascaded Group Network.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

Generative label fused network for image-text matching.
Knowl. Based Syst., March, 2023

Diff attention: A novel attention scheme for person re-identification.
Comput. Vis. Image Underst., February, 2023

Optimal Noise pursuit for Augmenting Text-to-Video Generation.
CoRR, 2023

DiverseMotion: Towards Diverse Human Motion Generation via Discrete Diffusion.
CoRR, 2023

Knowledge Transfer-Driven Few-Shot Class-Incremental Learning.
CoRR, 2023

Tea Buds Detection in Complex Background Based on Improved YOLOv7.
IEEE Access, 2023

Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022
An Improved Apple Object Detection Method Based on Lightweight YOLOv4 in Complex Backgrounds.
Remote. Sens., 2022

Branch Identification and Junction Points Location for Apple Trees Based on Deep Learning.
Remote. Sens., 2022

Scale adaption-guided human face detection.
Knowl. Based Syst., 2022

Multimodal Learning for Non-small Cell Lung Cancer Prognosis.
CoRR, 2022

Learning a Condensed Frame for Memory-Efficient Video Class-Incremental Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
PFAN++: Bi-Directional Image-Text Retrieval With Position Focused Attention Network.
IEEE Trans. Multim., 2021

Sketch-Guided Scenery Image Outpainting.
IEEE Trans. Image Process., 2021

Canopy Parameter Estimation of Citrus grandis var. Longanyou Based on LiDAR 3D Point Clouds.
Remote. Sens., 2021

Social image retrieval based on topic diversity.
Multim. Tools Appl., 2021

Generating Superpixels for High-resolution Images with Decoupled Patch Calibration.
CoRR, 2021

AINet: Association Implantation for Superpixel Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Unsupervised Fuzzy Neural Network for Image Clustering.
Proceedings of the 30th IEEE International Conference on Fuzzy Systems, 2021

2020
Attentive Stacked Denoising Autoencoder With Bi-LSTM for Personalized Context-Aware Citation Recommendation.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

DONet: Dual Objective Networks for Skin Lesion Segmentation.
CoRR, 2020

Semantic Gated Network for Efficient News Representation.
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

2019
Adaptive Robust Control of Multi-Axle Vehicle Electro-Hydraulic Power Steering System With Uncertain Tire Steering Resistance Moment.
IEEE Access, 2019

Position Focused Attention Network for Image-Text Matching.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

2018
Joint Hypergraph Learning for Tag-Based Image Retrieval.
IEEE Trans. Image Process., 2018

2017
Image Re-Ranking Based on Topic Diversity.
IEEE Trans. Image Process., 2017


  Loading...