Yihao Feng

According to our database1, Yihao Feng authored at least 33 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability.
CoRR, 2024

AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning.
CoRR, 2024

Text2Data: Low-Resource Data Generation with Textual Control.
CoRR, 2024

Demand Prediction by Incorporating Internet-of-Things Data: A Case of Automobile Repair and Maintenance Service.
Proceedings of the 57th Hawaii International Conference on System Sciences, 2024

2023
BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents.
CoRR, 2023

Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization.
CoRR, 2023

REX: Rapid Exploration and eXploitation for AI Agents.
CoRR, 2023

FAMO: Fast Adaptive Multitask Optimization.
CoRR, 2023

LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning.
CoRR, 2023

HIVE: Harnessing Human Feedback for Instructional Visual Editing.
CoRR, 2023

Preference-grounded Token-level Guidance for Language Model Fine-tuning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

FAMO: Fast Adaptive Multitask Optimization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Metric Residual Network for Sample Efficient Goal-Conditioned Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Metric Residual Networks for Sample Efficient Goal-conditioned Reinforcement Learning.
CoRR, 2022

A Regularized Implicit Policy for Offline Reinforcement Learning.
CoRR, 2022

Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement Learning.
CoRR, 2022

A Unified Framework for Alternating Offline Model Training and Policy Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

2021
Incremental Few-shot Text Classification with Multi-round New Classes: Formulation, Dataset and System.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds.
Proceedings of the 9th International Conference on Learning Representations, 2021

Unsupervised Out-of-Domain Detection via Pre-trained Transformers.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Off-Policy Interval Estimation with Lipschitz Value Iteration.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Accountable Off-Policy Evaluation With Kernel Bellman Statistics.
Proceedings of the 37th International Conference on Machine Learning, 2020

Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
A Kernel Loss for Solving the Bellman Equation.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
Knowledge-guided Semantic Computing Network.
CoRR, 2018

Action-dependent Control Variates for Policy Optimization via Stein Identity.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
ARShop: A Cloud-based Augmented Reality System for Shopping.
Proc. VLDB Endow., 2017

Sample-efficient Policy Optimization with Stein Control Variate.
CoRR, 2017

Learning to Draw Samples with Amortized Stein Variational Gradient Descent.
Proceedings of the Thirty-Third Conference on Uncertainty in Artificial Intelligence, 2017


  Loading...