Landslide Hazard Prediction Based on Small Baseline Subset-Interferometric Synthetic-Aperture Radar Technology Combined with Land-Use Dynamic Change and Hydrological Conditions (Sichuan, China).

[BibT_eX]

[DOI]

Hongyi Guo

Antonio Miguel Martínez-Graña

Remote. Sens., August, 2024

Diverse randomized value functions: A provably pessimistic approach for offline reinforcement learning.

[BibT_eX]

[DOI]

Inf. Sci., 2024

Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards.

[BibT_eX]

[DOI]

CoRR, 2024

Can Large Language Models Play Games? A Case Study of A Self-Play Approach.

[BibT_eX]

[DOI]

CoRR, 2024

Measuring and Reducing LLM Hallucination without Gold-Standard Answers via Expertise-Weighting.

[BibT_eX]

[DOI]

CoRR, 2024

Human-Instruction-Free LLM Self-Alignment with Limited Samples.

[BibT_eX]

[DOI]

CoRR, 2024

Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Reason for Future, Act for Now: A Principled Architecture for Autonomous LLM Agents.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency.

[BibT_eX]

[DOI]

CoRR, 2023

Behavior Contrastive Learning for Unsupervised Skill Discovery.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

2022

Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

2021

Policy Learning Using Weak Supervision.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Decentralized Single-Timescale Actor-Critic on Zero-Sum Two-Player Stochastic Games.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Signal Instructed Coordination in Cooperative Multi-agent Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Distributed Artificial Intelligence - Third International Conference, 2021

2020

An improved localization method in cyber-social environments with obstacles.

[BibT_eX]

[DOI]

Comput. Electr. Eng., 2020

Peer Loss Functions: Learning from Noisy Labels without Knowing Noise Rates.

[BibT_eX]

[DOI]

Yang Liu

Hongyi Guo

Proceedings of the 37th International Conference on Machine Learning, 2020

2019

Signal Instructed Coordination in Team Competition.

[BibT_eX]

[DOI]

CoRR, 2019

Life Assistants for the Elderly Based on Mobile Devices.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE Intl Conf on Dependable, 2019

2017

ChildGuard: A Child-Safety Monitoring System.

[BibT_eX]

[DOI]

IEEE Multim., 2017

2016

Automatic Threshold Calculation Based Label Propagation Algorithm for Overlapping Community.

[BibT_eX]

[DOI]

Proceedings of the IEEE First International Conference on Data Science in Cyberspace, 2016

Hongyi Guo

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...