Xiyao Wang
This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.
Bibliography
2025
CaughtCheating: Is Your MLLM a Good Cheating Detective? Exploring the Boundary of Visual Perception and Reasoning.
CoRR, July, 2025
CoRR, June, 2025
ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs.
CoRR, June, 2025
What makes Reasoning Models Different? Follow the Reasoning Leader for Efficient Decoding.
CoRR, June, 2025
MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning.
CoRR, June, 2025
DISCO Balances the Scales: Adaptive Domain- and Difficulty-Aware Reinforcement Learning on Imbalanced Data.
CoRR, May, 2025
SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement.
CoRR, April, 2025
Quantifying Emotional Responses to Immutable Data Characteristics and Designer Choices in Data Visualizations.
IEEE Trans. Vis. Comput. Graph., January, 2025
Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025
Emojis Decoded: Leveraging ChatGPT for Enhanced Understanding in Social Media Communications.
Proceedings of the Nineteenth International AAAI Conference on Web and Social Media, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension.
CoRR, 2024
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning.
CoRR, 2024
Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement.
CoRR, 2024
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences.
CoRR, 2024
Lane-Changing Tracking Control of Automated Vehicle Platoon Based on MA-DDPG and Adaptive MPC.
IEEE Access, 2024
Proceedings of the IEEE Visualization and Visual Analytics, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Int. J. Circuit Theory Appl., September, 2023
Wideband Millimeter-Wave SPST Switch in 100-nm GaN-on-Si Using Strong Mutual Coupling.
IEEE Trans. Circuits Syst. II Express Briefs, June, 2023
Int. J. Hum. Comput. Stud., April, 2023
Equal Long-term Benefit Rate: Adapting Static Fairness Notions to Sequential Decision Making.
CoRR, 2023
TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the International Conference on Machine Learning, 2023
Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
2022
IEEE Trans. Vis. Comput. Graph., 2022
Understanding differences between combinations of 2D and 3D input and output devices for 3D data visualization.
Int. J. Hum. Comput. Stud., 2022
SimCLR-Unet: An ECG Feature wave segmentation algorithm based on a self-supervised learning strategy.
Proceedings of the 4th International Conference on Robotics, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering, 2022
2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021
2020
Augmented reality environments for the interactive exploration of 3D data. (Environnements de réalité augmentée pour l'exploration interactive de données 3D).
PhD thesis, 2020
Planning with Exploration: Addressing Dynamics Bottleneck in Model-based Reinforcement Learning.
CoRR, 2020
Towards an Understanding of Augmented Reality Extensions for Existing 3D Data Analysis Tools.
Proceedings of the CHI '20: CHI Conference on Human Factors in Computing Systems, 2020
2019
Comput. Graph. Forum, 2019
Proceedings of the HCI International 2019 - Late Breaking Posters, 2019