Wei Shen

Affiliations:
  • ByteDance Inc., China
  • Baidu, China (2023 - 2024)
  • Nanjing University, China (former)


According to our database1, Wei Shen authored at least 20 papers between 2016 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Reward-Driven Interaction: Enhancing Proactive Dialogue Agents through User Satisfaction Prediction.
CoRR, May, 2025

LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs.
CoRR, April, 2025

What Do Latent Action Models Actually Learn?
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

HPSERec: A Hierarchical Partitioning and Stepwise Enhancement Framework for Long-tailed Sequential Recommendation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Policy Filtration for RLHF to Mitigate Noise in Reward Models.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Policy Filtration in RLHF to Fine-Tune LLM for Code Generation.
CoRR, 2024

LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models.
CoRR, 2024

Leveraging Web-Crawled Data for High-Quality Fine-Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
RePreM: Representation Pre-training with Masked Model for Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets.
Proceedings of the IEEE International Conference on Data Mining, 2022

A Transformer-Based User Satisfaction Prediction for Proactive Interaction Mechanism in DuerOS.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Imitation Learning to Outperform Demonstrators by Directly Extrapolating Demonstrations.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2021
Fair and size-scalable participant selection framework for large-scale mobile crowdsensing.
J. Syst. Archit., 2021

The Medical Segmentation Decathlon.
CoRR, 2021

Inductive Matrix Completion Using Graph Autoencoder.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

2020
Auxiliary-task Based Deep Reinforcement Learning for Participant Selection Problem in Mobile Crowdsourcing.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

2016
Combined Cloud: A Mixture of Voluntary Cloud and Reserved Instance Marketplace.
J. Comput. Sci. Technol., 2016

A Participant Selection Method for Crowdsensing Under an Incentive Mechanism.
Proceedings of the Collaborate Computing: Networking, Applications and Worksharing, 2016


  Loading...