June Yang

Orcid: 0009-0008-3059-7027

According to our database1, June Yang authored at least 10 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Guess-Verify-Refine: Data-Aware Top-K for Sparse-Attention Decoding on Blackwell via Temporal Correlation.
CoRR, April, 2026

DWDP: Distributed Weight Data Parallelism for High-Performance LLM Inference on NVL72.
CoRR, April, 2026

Scalable Training of Mixture-of-Experts Models with Megatron Core.
CoRR, March, 2026

2025
BroRL: Scaling Reinforcement Learning via Broadened Exploration.
CoRR, October, 2025

Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training.
CoRR, July, 2025

MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core.
CoRR, April, 2025

2024
Llama 3 Meets MoE: Efficient Upcycling.
CoRR, 2024

2023
Aligning Language Models with Offline Reinforcement Learning from Human Feedback.
CoRR, 2023

HyperGef: A Framework Enabling Efficient Fusion for Hypergraph Neural Network on GPUs.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

FastDimeNet++: Training DimeNet++ in 22 minutes.
Proceedings of the 52nd International Conference on Parallel Processing, 2023


  Loading...