Hanyu Zhao

This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.

Bibliography

2025
Scaling Towards the Information Boundary of Instruction Set: InfinityInstruct-Subject Technical Report.
CoRR, July, 2025

CI-VID: A Coherent Interleaved Text-Video Dataset.
CoRR, July, 2025

Infinity Instruct: Scaling Instruction Selection and Synthesis to Enhance Language Models.
CoRR, June, 2025

PerfTracker: Online Performance Troubleshooting for Large-scale Model Training in Production.
CoRR, June, 2025

Beyond the Lens: Quantifying the Impact of Scientific Documentaries through Amazon Reviews.
CoRR, February, 2025

Memory Offloading for Large Language Model Inference with Latency SLO Guarantees.
CoRR, February, 2025

Conformal Uncertainty Indicator for Continual Test-Time Adaptation.
CoRR, February, 2025

Beyond the Lens: Quantifying the Impact of Scientific Documentaries through Amazon Reviews.
Proceedings of the 17th ACM Web Science Conference 2025, 2025

Evolution of Aegis: Fault Diagnosis for AI Model Training Service in Production.
Proceedings of the 22nd USENIX Symposium on Networked Systems Design and Implementation, 2025

CaliEX: A Disk-Based Large-Scale GNN Training System with Joint Design of Caching and Execution.
Proceedings of the 41st IEEE International Conference on Data Engineering, 2025

Enhancing Large-Scale AI Training Efficiency: The C4 Solution for Real-Time Anomaly Detection and Communication Optimization.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2025

Personalized Label Inference Attack in Federated Transfer Learning via Contrastive Meta Learning.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Beyond IID: Optimizing Instruction Finetuning from the Perspective of Instruction Interaction and Dependency.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
FlexNF: Flexible Network Function Orchestration for Scalable On-Path Service Chain Serving.
IEEE/ACM Trans. Netw., June, 2024

Class Hierarchy-Guided Generalized Few-Shot Ship Detection in Remote Sensing Images.
IEEE Geosci. Remote. Sens. Lett., 2024

Modifying the one-hot encoding technique can enhance the adversarial robustness of the visual model for symbol recognition.
Expert Syst. Appl., 2024

CCI3.0-HQ: a large-scale Chinese dataset of high quality designed for pre-training large language models.
CoRR, 2024

Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging.
CoRR, 2024

Beyond IID: Optimizing Instruction Learning from the Perspective of Instruction Interaction and Dependency.
CoRR, 2024

Rubick: Exploiting Job Reconfigurability for Deep Learning Cluster Scheduling.
CoRR, 2024

AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies.
CoRR, 2024

Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach.
CoRR, 2024

Variational Continual Test-Time Adaptation.
CoRR, 2024

Infinite-LLM: Efficient LLM Service for Long Context with DistAttention and Distributed KVCache.
CoRR, 2024

GeoExpert: Geometry Problem Solving Based on RoBERTa and Graph Attention Networks.
Proceedings of the 36th International Conference on Software Engineering and Knowledge Engineering, 2024

Llumnix: Dynamic Scheduling for Large Language Model Serving.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

Computing services oriented evolutionary architecture and key technologies of electric power communication network.
Proceedings of the 24th IEEE International Conference on Communication Technology, 2024

UniSpLLM: An Integrated Approach for Enhancing Reasoning and Education with Large Language Models.
Proceedings of the Proceeding of the 32nd International Conference on Computers in Education, 2024

Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
A Class of Optimal Control Problems of Forward-Backward Systems with Input Constraint.
J. Optim. Theory Appl., December, 2023

Deep-Reinforcement-Learning-Based NOMA-Aided Slotted ALOHA for LEO Satellite IoT Networks.
IEEE Internet Things J., October, 2023

GoldMiner: Elastic Scaling of Training Data Pre-Processing Pipelines for Deep Learning.
Proc. ACM Manag. Data, 2023

ROAM: memory-efficient large DNN training via optimized operator ordering and memory layout.
CoRR, 2023

EasyScale: Elastic Training with Consistent Accuracy and Improved Utilization on GPUs.
Proceedings of the International Conference for High Performance Computing, 2023

SiloD: A Co-design of Caching and Scheduling for Deep Learning Clusters.
Proceedings of the Eighteenth European Conference on Computer Systems, 2023

Knowledgeable Parameter Efficient Tuning Network for Commonsense Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Artificial Intelligence Security Competition (AISC).
CoRR, 2022

EasyScale: Accuracy-consistent Elastic Training for Deep Learning.
CoRR, 2022

Instance-wise Prompt Tuning for Pretrained Language Models.
CoRR, 2022

Deep Reinforcement Learning for the Joint AoI and Throughput Optimization of the Random Access System.
Proceedings of the 14th International Conference on Wireless Communications and Signal Processing, 2022

Zoomer: Boosting Retrieval on Web-scale Graphs by Regions of Interest.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Deep Learning-Based Joint Modulation and Coding Scheme Recognition for 5G New Radio Protocols.
Proceedings of the 22nd IEEE International Conference on Communication Technology, 2022

2021
WuDaoCorpora: A super large-scale Chinese corpora for pre-training language models.
AI Open, 2021

FlexNF: Flexible Network Function Orchestration on the Programmable Data Plane.
Proceedings of the 29th IEEE/ACM International Symposium on Quality of Service, 2021

A Chinese Machine Reading Comprehension Dataset Automatic Generated Based on Knowledge Graph.
Proceedings of the Chinese Computational Linguistics - 20th China National Conference, 2021

A Federated Adversarial Learning Method for Biomedical Named Entity Recognition.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021

2020
PASCAL: a pseudo cascade learning framework for breast cancer treatment entity normalization in Chinese clinical text.
BMC Medical Informatics Decis. Mak., 2020

HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees.
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

A Topology-Aware Performance Prediction Model for Distributed Deep Learning on GPU Clusters.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

2019
MSD: Multi-Self-Distillation Learning via Multi-classifiers within Deep Neural Networks.
CoRR, 2019

A Word Segmentation Method of Ancient Chinese Based on Word Alignment.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

SCHED²: Scheduling Deep Learning Training via Deep Reinforcement Learning.
Proceedings of the 2019 IEEE Global Communications Conference, 2019

Chinese Historical Term Translation Pairs Extraction Using Modern Chinese as a Pivot Language.
Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

2018
Gandiva: Introspective Cluster Scheduling for Deep Learning.
Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

Building efficient and available distributed transaction with Paxos-based coding consensus.
Proceedings of the IEEE INFOCOM 2018, 2018

Term Translation Extraction from Historical Classics Using Modern Chinese Explanation.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2018

SDPaxos: Building Efficient Semi-Decentralized Geo-replicated State Machines.
Proceedings of the ACM Symposium on Cloud Computing, 2018

Scheduling CPU for GPU-based Deep Learning Jobs.
Proceedings of the ACM Symposium on Cloud Computing, 2018


  Loading...