Yaxiong Wang

Orcid: 0000-0001-6596-8117

According to our database1, Yaxiong Wang authored at least 61 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Motion is the Choreographer: Learning Latent Pose Dynamics for Seamless Sign Language Generation.
CoRR, August, 2025

Minimizing the Pretraining Gap: Domain-aligned Text-Based Person Retrieval.
CoRR, July, 2025

Towards Fine-Grained Emotion Understanding via Skeleton-Based Micro-Gesture Recognition.
CoRR, June, 2025

Towards Scalable Video Anomaly Retrieval: A Synthetic Video-Text Benchmark.
CoRR, June, 2025

TIGeR: Text-Instructed Generation and Refinement for Template-Free Hand-Object Interaction.
CoRR, June, 2025

SSAM: Self-Supervised Association Modeling for Test-Time Adaption.
CoRR, June, 2025

The Coherence Trap: When MLLM-Crafted Narratives Exploit Manipulated Visual Contexts.
CoRR, May, 2025

Towards Micro-Action Recognition with Limited Annotations: An Asynchronous Pseudo Labeling and Training Approach.
CoRR, April, 2025

Scale Up Composed Image Retrieval Learning via Modification Text Generation.
CoRR, April, 2025

Every Painting Awakened: A Training-free Framework for Painting-to-Animation Generation.
CoRR, March, 2025

Text-Driven Diffusion Model for Sign Language Production.
CoRR, March, 2025

Knowledge Swapping via Learning and Unlearning.
CoRR, February, 2025

Towards Geometric-Photometric Joint Alignment for facial mesh registration.
Comput. Graph., 2025

Multimodal learning for non-small cell lung cancer prognosis.
Biomed. Signal Process. Control., 2025

MORE'25 Multimedia Object Re-ID: Advancements, Challenges, and Opportunities.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2025, 2025

Beyond General Alignment: Fine-Grained Entity-Centric Image-Text Matching with Multimodal Attentive Experts.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

Consistency-Aware Fake Videos Detection on Short Video Platforms.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SLRTP2025 Sign Language Production Challenge: Methodology, Results and Future Work.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Distilling Structured Rationale from Large Language Models to Small Language Models for Abstractive Summarization.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Diversity-Learning Block: Conquer Feature Homogenization of Multibranch.
IEEE Trans. Neural Networks Learn. Syst., June, 2024

ReGO: Reference-Guided Outpainting for Scenery Image.
IEEE Trans. Image Process., 2024

A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization.
CoRR, 2024

Neighborhood Commonality-aware Evolution Network for Continuous Generalized Category Discovery.
CoRR, 2024

Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search.
CoRR, 2024

EntityCLIP: Entity-Centric Image-Text Matching via Multimodal Attentive Contrastive Learning.
CoRR, 2024

Knowledge Adaptation Network for Few-Shot Class-Incremental Learning.
CoRR, 2024

CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval.
CoRR, 2024

FedHPL: Efficient Heterogeneous Federated Learning with Prompt Tuning and Logit Distillation.
CoRR, 2024

Beyond Known Clusters: Probe New Prototypes for Efficient Generalized Class Discovery.
CoRR, 2024

CaLa: Complementary Association Learning for Augmenting Comoposed Image Retrieval.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

MORE'24 Multimedia Object Re-ID: Advancements, Challenges, and Opportunities.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

2023
PicassoNet: Searching Adaptive Architecture for Efficient Facial Landmark Localization.
IEEE Trans. Neural Networks Learn. Syst., December, 2023

Learning to complement: Relation complementation network for few-shot class-incremental learning.
Knowl. Based Syst., December, 2023

SCGNet: Shifting and Cascaded Group Network.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

Generative label fused network for image-text matching.
Knowl. Based Syst., March, 2023

Diff attention: A novel attention scheme for person re-identification.
Comput. Vis. Image Underst., February, 2023

Optimal Noise pursuit for Augmenting Text-to-Video Generation.
CoRR, 2023

DiverseMotion: Towards Diverse Human Motion Generation via Discrete Diffusion.
CoRR, 2023

Knowledge Transfer-Driven Few-Shot Class-Incremental Learning.
CoRR, 2023

Tea Buds Detection in Complex Background Based on Improved YOLOv7.
IEEE Access, 2023

Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022
An Improved Apple Object Detection Method Based on Lightweight YOLOv4 in Complex Backgrounds.
Remote. Sens., 2022

Branch Identification and Junction Points Location for Apple Trees Based on Deep Learning.
Remote. Sens., 2022

Scale adaption-guided human face detection.
Knowl. Based Syst., 2022

Multimodal Learning for Non-small Cell Lung Cancer Prognosis.
CoRR, 2022

Learning a Condensed Frame for Memory-Efficient Video Class-Incremental Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
PFAN++: Bi-Directional Image-Text Retrieval With Position Focused Attention Network.
IEEE Trans. Multim., 2021

Sketch-Guided Scenery Image Outpainting.
IEEE Trans. Image Process., 2021

Canopy Parameter Estimation of Citrus grandis var. Longanyou Based on LiDAR 3D Point Clouds.
Remote. Sens., 2021

Social image retrieval based on topic diversity.
Multim. Tools Appl., 2021

Generating Superpixels for High-resolution Images with Decoupled Patch Calibration.
CoRR, 2021

AINet: Association Implantation for Superpixel Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Unsupervised Fuzzy Neural Network for Image Clustering.
Proceedings of the 30th IEEE International Conference on Fuzzy Systems, 2021

2020
Attentive Stacked Denoising Autoencoder With Bi-LSTM for Personalized Context-Aware Citation Recommendation.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

DONet: Dual Objective Networks for Skin Lesion Segmentation.
CoRR, 2020

Semantic Gated Network for Efficient News Representation.
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

2019
Adaptive Robust Control of Multi-Axle Vehicle Electro-Hydraulic Power Steering System With Uncertain Tire Steering Resistance Moment.
IEEE Access, 2019

Position Focused Attention Network for Image-Text Matching.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

2018
Joint Hypergraph Learning for Tag-Based Image Retrieval.
IEEE Trans. Image Process., 2018

2017
Image Re-Ranking Based on Topic Diversity.
IEEE Trans. Image Process., 2017


  Loading...