Xinyu Xiao

Orcid: 0000-0003-4895-632X

According to our database, Xinyu Xiao authored at least 51 papers between 2017 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
PA-Net: Precipitation-Adaptive Mixture-of-Experts for Long-Tail Rainfall Nowcasting.
CoRR, March, 2026

MeTok: An Efficient Meteorological Tokenization with Hyper-Aligned Group Learning for Precipitation Nowcasting.
CoRR, March, 2026

UniVid: Pyramid Diffusion Model for High Quality Video Generation.
CoRR, March, 2026

Fast-Slow Efficient Training for Multimodal Large Language Models via Visual Token Pruning.
CoRR, February, 2026

PruneRAG: Confidence-Guided Query Decomposition Trees for Efficient Retrieval-Augmented Generation.
CoRR, January, 2026

Hummingbird: SLO-Oriented GPU Preemption at Microsecond-scale.
CoRR, January, 2026

Dual Feature Fusion for Incomplete Multi-View Multi-Label Learning.
IEEE Trans. Multim., 2026

Uncertainty-aware mixture of experts for robust multimodal sentiment analysis.
Pattern Recognit., 2026

PruneRAG: Confidence-Guided Query Decomposition Trees for Efficient Retrieval-Augmented Generation.
Proceedings of the ACM Web Conference 2026, 2026

AWMA-MoE: Attention-Guided Watermark Adapter with MoE for Latent Diffusion Models.
Proceedings of the ACM Web Conference 2026, 2026

2025
Deep learning models for asphalt material behavior analysis under smart city IoT infrastructures.
Discov. Internet Things, December, 2025

RayFusion: Ray Fusion Enhanced Collaborative Visual Perception.
CoRR, October, 2025

Ming-Omni: A Unified Multimodal Model for Perception and Generation.
CoRR, June, 2025

Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction.
CoRR, May, 2025

Exploring Fine-Grained Image-Text Alignment for Referring Remote Sensing Image Segmentation.
IEEE Trans. Geosci. Remote. Sens., 2025

Multi-faceted Complementary Learning for Incomplete Multi-view Multi-label Classification.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Unified Visual Generation via Next-Set Prediction in Continuous Domain.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Merge then Realign: Simple and Effective Modality-Incremental Continual Learning for Multimodal LLMs.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Towards Building Human-like Smart Agents in Modern 3D Video Games (Student Abstract).
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Privacy-Preserving V2X Collaborative Perception Integrating Unknown Collaborators.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning.
IEEE Trans. Image Process., 2024

SpatioTemporal Inference Network for Precipitation Nowcasting With Multimodal Fusion.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2024

Preformer: Simple and Efficient Design for Precipitation Nowcasting With Transformers.
IEEE Geosci. Remote. Sens. Lett., 2024

GTPAN: Global Target Preference Attention Network for session-based recommendation.
Expert Syst. Appl., 2024

Exploring Fine-Grained Image-Text Alignment for Referring Remote Sensing Image Segmentation.
CoRR, 2024

Local-to-Global Self-Consistency Learning for Temporal Action Localization.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Improved Loss Minimization Control Based on Time-Harmonic Equivalent Circuit for Linear Induction Motors Adopted to Linear Metro.
IEEE Trans. Veh. Technol., July, 2023

WiDFF-ID: Device-Free Fast Person Identification Using Commodity WiFi.
IEEE Trans. Cogn. Commun. Netw., February, 2023

LSIAN: Exploiting interval interests for session-based recommendation via sparse attention network.
Inf. Sci., 2023

CALM: Contrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis.
CoRR, 2023

Adaptive Base-class Suppression and Prior Guidance Network for One-Shot Object Detection.
CoRR, 2023

Programmable Pressure Pneumatic System for Soft Robots.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2023

2022
Multi-interaction fusion collaborative filtering for social recommendation.
Expert Syst. Appl., 2022

CALM: Contrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Improving Graph Neural Network For Session-based Recommendation System Via Time Sessions.
Proceedings of the International Joint Conference on Neural Networks, 2022

Relational Graph Reasoning Transformer for Image Captioning.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Spatiotemporal Contextual Consistency Network for Precipitation Nowcasting.
Proceedings of the IEEE International Conference on Data Mining, 2022

2021
Extracting Effective Image Attributes with Refined Universal Detection.
Sensors, 2021

Relational Attention with Textual Enhanced Transformer for Image Captioning.
Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021

Reinforcement Stacked Learning with Semantic-Associated Attention for Visual Question Answering.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Comprehensive Efficiency Optimization of Linear Induction Motors for Urban Transit.
IEEE Trans. Veh. Technol., 2020

2019
Deep Hierarchical Encoder-Decoder Network for Image Captioning.
IEEE Trans. Multim., 2019

Dense semantic embedding network for image captioning.
Pattern Recognit., 2019

Precipitation Forecasting via Multi-Scale Deconstructed ConvLSTM.
CoRR, 2019

DetNAS: Backbone Search for Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Guiding the Flowing of Semantics: Interpretable Video Captioning via POS Tag.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

What and Where the Themes Dominate in Image.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Accelerating the Optimal Shape Design of Linear Machines by Transient Simulation Using Mesh Deformation and Mesh Connection Techniques.
IEEE Trans. Ind. Electron., 2018

2017
Important User Group Based Web Service Recommendation.
Proceedings of the 6th IIAI International Congress on Advanced Applied Informatics, 2017

PUED: A Social Spammer Detection Method Based on PU Learning and Ensemble Learning.
Proceedings of the Collaborative Computing: Networking, Applications and Worksharing, 2017

