Kaiyan Zhao

Orcid: 0009-0004-7112-4163

According to our database1, Kaiyan Zhao authored at least 32 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
ANO: A Principled Approach to Robust Policy Optimization.
CoRR, May, 2026

Anon: Extrapolating Adaptivity Beyond SGD and Adam.
CoRR, May, 2026

E<sup>2</sup>DT: Efficient and Effective Decision Transformer with Experience-Aware Sampling for Robotic Manipulation.
CoRR, May, 2026

C<sup>2</sup>T: Captioning-Structure and LLM-Aligned Common-Sense Reward Learning for Traffic-Vehicle Coordination.
CoRR, April, 2026

Heuristic Self-Paced Learning for Domain Adaptive Semantic Segmentation under Adverse Conditions.
CoRR, March, 2026

When Attention Betrays: Erasing Backdoor Attacks in Robotic Policies by Reconstructing Visual Tokens.
CoRR, February, 2026

Decouple Searching from Training: Scaling Data Mixing via Model Merging for Large Language Model Pre-training.
CoRR, February, 2026

Benchmarking Machine Translation on Chinese Social Media Texts.
CoRR, January, 2026

NeoAMT: Neologism-Aware Agentic Machine Translation with Reinforcement Learning.
CoRR, January, 2026

EComStage: Stage-wise and Orientation-specific Benchmarking for Large Language Models in E-commerce.
CoRR, January, 2026

FedPAD: Aggregation-free federated learning with prototype-based adaptive distillation.
Knowl. Based Syst., 2026

Latent State-Predictive Exploration for Deep Reinforcement Learning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

DSAP: Enhancing Generalization in Goal-Conditioned Reinforcement Learning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Explore to Learn: Latent Exploration Through Disentangled Synergy Patterns for Reinforcement Learning in Overactuated Control.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

RGMP: Recurrent Geometric-prior Multimodal Policy for Generalizable Humanoid Robot Manipulation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Improving Multimodal Contrastive Learning of Sentence Embeddings with Object-Phrase Alignment.
CoRR, August, 2025

BiCAM: A Bidirectional Contextualized Attentive Model for Analyzing the Correlation of Heterogeneous Security Events.
IEEE Trans. Reliab., June, 2025

Efficient Diversity-based Experience Replay for Deep Reinforcement Learning.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

BILE: An Effective Behavior-based Latent Exploration Scheme for Deep Reinforcement Learning.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

HeMoRa: Unsupervised Heuristic Consensus Sampling for Robust Point Cloud Registration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Team-wise effective communication in multi-agent reinforcement learning.
Auton. Agents Multi Agent Syst., December, 2024

CAG-Malconv: A Byte-Level Malware Detection Method With CBAM and Attention-GRU.
IEEE Trans. Netw. Serv. Manag., October, 2024

Direct Quantized Training of Language Models with Stochastic Rounding.
CoRR, 2024

Enhancing LLM Agents for Code Generation with Possibility and Pass-rate Prioritized Experience Replay.
CoRR, 2024

Improving Arithmetic Reasoning Ability of Large Language Models through Relation Tuples, Verification and Dynamic Feedback.
CoRR, 2024

Rethinking Exploration in Reinforcement Learning with Effective Metric-Based Exploration Bonus.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Enhancing Cross-lingual Sentence Embedding for Low-resource Languages with Word Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

AARR-Net: An Attention Assistance Feature Fusion and Model Recursive Recovery Network for Category-Level 6D Object Pose Estimation.
Proceedings of the Neural Information Processing - 31st International Conference, 2024

Leveraging Multi-lingual Positive Instances in Contrastive Learning to Improve Sentence Embedding.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

2023
Barycentric interpolation collocation algorithm to solve fractional differential equations.
Math. Comput. Simul., 2023

Regulating confidence by corner discrepancy and center score in corner-based object detection methods.
J. Intell. Fuzzy Syst., 2023

2022
LIBKDV: A Versatile Kernel Density Visualization Library for Geospatial Analytics.
Proc. VLDB Endow., 2022


  Loading...