Mao Yang

This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.

Bibliography

2024
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens.
CoRR, 2024

ConvStencil: Transform Stencil Computation to Matrix Multiplication on Tensor Cores.
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024

LitePred: Transferable and Scalable Latency Prediction for Hardware-Aware Neural Architecture Search.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

2023
Boosting LLM Reasoning: Push the Limits of Few-shot Learning with Reinforced In-Context Pruning.
CoRR, 2023

Compresso: Structured Pruning with Collaborative Prompting Learns Compact Large Language Models.
CoRR, 2023

Model-enhanced Vector Index.
CoRR, 2023

Accelerating In-Browser Deep Learning Inference on Diverse Edge Clients through Just-in-Time Kernel Optimizations.
CoRR, 2023

Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference.
CoRR, 2023

An Adaptive Hybrid Channel Reservation Medium Access Control Protocol for Differentiated QoS.
CoRR, 2023

Accurate and Structured Pruning for Efficient Automatic Speech Recognition.
CoRR, 2023

Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models.
CoRR, 2023

An Adaptive Channel Reservation MAC Protocol Based on Forwarding Traffic of Key Nodes.
CoRR, 2023

IRGen: Generative Modeling for Image Retrieval.
CoRR, 2023

Gamify Stencil Dwarf on Cloud for Democratizing Scientific Computing.
CoRR, 2023

LUT-NN: Towards Unified Neural Network Inference by Table Lookup.
CoRR, 2023

SparDA: Accelerating Dynamic Sparse Deep Neural Networks via Sparse-Dense Transformation.
CoRR, 2023

SuperScaler: Supporting Flexible DNN Parallelization via a Unified Abstraction.
CoRR, 2023

PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

SPFresh: Incremental In-Place Update for Billion-Scale Vector Search.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

VBASE: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

Cocktailer: Analyzing and Optimizing Dynamic Control Flow in Deep Learning.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

On Modular Learning of Distributed Systems for Predicting End-to-End Latency.
Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023

Model-enhanced Vector Index.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

NN-Stretch: Automatic Neural Network Branching for Parallel Inference on Heterogeneous Multi-Processors.
Proceedings of the 21st Annual International Conference on Mobile Systems, 2023

LUT-NN: Empower Efficient Neural Network Inference with Centroid Learning and Table Lookup.
Proceedings of the 29th Annual International Conference on Mobile Computing and Networking, 2023

Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

A System Verilog Based Networked Verification and Testing Method for Wireless Network Protocols on Chip.
Proceedings of the IEEE International Conference on Signal Processing, 2023

An Empirical Study on Quality Issues of Deep Learning Platform.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023

Runtime Performance Prediction for Deep Learning Models with Graph Neural Network.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023

SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FrozenHot Cache: Rethinking Cache Management for Modern Hardware.
Proceedings of the Eighteenth European Conference on Computer Systems, 2023

2022
Estimation of Above-Ground Biomass of Winter Wheat Based on Consumer-Grade Multi-Spectral UAV.
Remote. Sens., 2022

LordNet: Learning to Solve Parametric Partial Differential Equations without Simulated Data.
CoRR, 2022

Tutel: Adaptive Mixture-of-Experts at Scale.
CoRR, 2022

A Neural Corpus Indexer for Document Retrieval.
CoRR, 2022

Towards efficient vision transformer inference: a first study of transformers on mobile devices.
Proceedings of the HotMobile '22: The 23rd International Workshop on Mobile Computing Systems and Applications, Tempe, Arizona, USA, March 9, 2022

A Two-Level Adaptive Resource Allocation Algorithm for Quality of Service Guarantee in Home WiFi Networks.
Proceedings of the Smart Grid and Internet of Things - 6th EAI International Conference, 2022

Research on Backbone Routing Protocol of Ad Hoc Network Based on SDN.
Proceedings of the Smart Grid and Internet of Things - 6th EAI International Conference, 2022

Non-uniform Time Slice Parallel Simulation Method Based on Offline Learning for IEEE 802.11ax.
Proceedings of the Smart Grid and Internet of Things - 6th EAI International Conference, 2022

A Coexistence Method of Short-Range Heterogeneous Network Based on Cell Cooperation.
Proceedings of the Smart Grid and Internet of Things - 6th EAI International Conference, 2022

A Joint Optimization Method for Scheduling and Random Access Based on the Idea of Particle-Based Access in IEEE 802.11ax.
Proceedings of the Smart Grid and Internet of Things - 6th EAI International Conference, 2022

Bidirectional Scanning Based Medium Access Control Algorithm in Directional Aviation Relay Network with Multiple Air Nodes.
Proceedings of the Smart Grid and Internet of Things - 6th EAI International Conference, 2022

A Channel Reservation Mechanism in IEEE 802.11be for Multi-cell Scenarios.
Proceedings of the Smart Grid and Internet of Things - 6th EAI International Conference, 2022

An Adaptive Beamtracking Method for the Next Generation mmWave WLAN.
Proceedings of the Smart Grid and Internet of Things - 6th EAI International Conference, 2022

Edge Station Throughput Enhancement Method Based on Energy Detection Threshold and Transmission Power Joint Dynamic Adjustment.
Proceedings of the Smart Grid and Internet of Things - 6th EAI International Conference, 2022

An Uplink OFDMA Access Method for Low Latency in Next-Generation WLANs.
Proceedings of the Smart Grid and Internet of Things - 6th EAI International Conference, 2022

A Collision Aware Multi-link Operation for Next Generation WLAN.
Proceedings of the Smart Grid and Internet of Things - 6th EAI International Conference, 2022

ROLLER: Fast and Efficient Tensor Compilation for Deep Learning.
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022

SparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute.
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022

A Neural Corpus Indexer for Document Retrieval.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

REFTY: Refinement Types for Valid Deep Learning Models.
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

SwiftPruner: Reinforced Evolutionary Pruning for Efficient Ad Relevance.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2021
SPANN: Highly-efficient Billion-scale Approximate Nearest Neighbor Search.
CoRR, 2021

Argus: A Fully Transparent Incentive System for Anti-Piracy Campaigns (Extended Version).
CoRR, 2021

Match Plan Generation in Web Search with Parameterized Action Reinforcement Learning.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Argus: A Fully Transparent Incentive System for Anti-Piracy Campaigns.
Proceedings of the 40th International Symposium on Reliable Distributed Systems, 2021

WRENCH: A Comprehensive Benchmark for Weak Supervision.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

SPANN: Highly-efficient Billion-scale Approximate Nearest Neighborhood Search.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Resource-Guided Configuration Space Reduction for Deep Learning Models.
Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering, 2021

OpEvo: An Evolutionary Method for Tensor Operator Optimization.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
OpEvo: An Evolutionary Method for Tensor Operator Optimization.
CoRR, 2020

Research on Ultra-Short-Term Wind Power Prediction Considering Source Relevance.
IEEE Access, 2020

AutoSys: The Design and Operation of Learning-Augmented Systems.
Proceedings of the 2020 USENIX Annual Technical Conference, 2020

Enhancing the interoperability between deep learning frameworks by model conversion.
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020

Estimating GPU memory consumption of deep learning models.
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020

HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees.
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

Retiarii: A Deep Learning Exploratory-Training Framework.
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

An empirical study on program failures of deep learning jobs.
Proceedings of the ICSE '20: 42nd International Conference on Software Engineering, Seoul, South Korea, 27 June, 2020

TextNAS: A Neural Architecture Search Space Tailored for Text Representation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
The Case for Learning-and-System Co-design.
ACM SIGOPS Oper. Syst. Rev., 2019

Time-Series Anomaly Detection Service at Microsoft.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

2018
Ultra-Short-Term Prediction of Photovoltaic Power Based on Periodic Extraction of PV Energy and LSH Algorithm.
IEEE Access, 2018

Ultra-Short-Term Multistep Wind Power Prediction Based on Improved EMD and Reconstruction Method Using Run-Length Analysis.
IEEE Access, 2018

2017
Topology-Transparent Scheduling in Mobile Ad Hoc Networks with Unidirectional Links.
Proceedings of the Communications and Networking, 2017

2016
Research of Recognition System of Web Intrusion Detection Based on Storm.
Proceedings of the Fifth International Conference on Network, Communication and Computing, 2016

2015
Software-Defined and Virtualized Future Mobile and Wireless Networks: A Survey.
Mob. Networks Appl., 2015

2014
Opportunistic sharing scheme for spectrum allocation in wireless virtualization.
Soft Comput., 2014

Opportunistic spectrum sharing for wireless virtualization.
Proceedings of the IEEE Wireless Communications and Networking Conference, 2014

Rex: replication at the speed of multi-core.
Proceedings of the Ninth Eurosys Conference 2014, 2014

Network Performance Aware Graph Partitioning for Large Graph Processing Systems in the Cloud.
Proceedings of the Large Scale and Big Data - Processing and Management., 2014

2013
OpenRAN: a software-defined ran architecture via virtualization.
Proceedings of the ACM SIGCOMM 2013 Conference, 2013

Opportunistic Spectrum Sharing Based Resource Allocation for Wireless Virtualization.
Proceedings of the Seventh International Conference on Innovative Mobile and Internet Services in Ubiquitous Computing, 2013

KuaFu: Closing the parallelism gap in database replication.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

Failure Recovery: When the Cure Is Worse Than the Disease.
Proceedings of the 14th Workshop on Hot Topics in Operating Systems, 2013

2012
Karnaugh-map like online embedding algorithm of wireless virtualization.
Proceedings of the 15th International Symposium on Wireless Personal Multimedia Communications, 2012

Improving large graph processing on partitioned graphs in the cloud.
Proceedings of the ACM Symposium on Cloud Computing, SOCC '12, 2012

2011
G2: A Graph Processing System for Diagnosing Distributed Systems.
Proceedings of the 2011 USENIX Annual Technical Conference, 2011

2010
Large graph processing in the cloud.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Comet: batched stream processing for data intensive distributed computing.
Proceedings of the 1st ACM Symposium on Cloud Computing, 2010

2009
SAD effect on the return volatility and dynamic relevance analysis among the unexpected returns in China's market indices.
Int. J. Netw. Virtual Organisations, 2009

FiLM: A Runtime Monitoring Tool for Distributed Systems.
Proceedings of the Third IEEE International Conference on Secure Software Integration and Reliability Improvement, 2009

MODIST: Transparent Model Checking of Unmodified Distributed Systems.
Proceedings of the 6th USENIX Symposium on Networked Systems Design and Implementation, 2009

Distributed optimal control based on internal average kinetic energy for multi-robot system.
Proceedings of the 4th International Conference on Autonomous Robots and Agents, 2009

Wave Computing in the Cloud.
Proceedings of HotOS'09: 12th Workshop on Hot Topics in Operating Systems, 2009

The Control Based on Internal Average Kinetic Energy in Complex Environment for Multi-robot System.
Proceedings of the Complex Sciences, 2009

2008
Robust incentives via multi-level Tit-for-Tat.
Concurr. Comput. Pract. Exp., 2008

Behavior Analysis of Swarm Robot Systems Based on Vicsek Model.
Proceedings of the Fourth International Conference on Natural Computation, 2008

2007
A Multi-dimensional Reputation System Combined with Trust and Incentive Mechanisms in P2P File Sharing Systems.
Proceedings of the 27th International Conference on Distributed Computing Systems Workshops (ICDCS 2007 Workshops), 2007

An Empirical Study of Collusion Behavior in the Maze P2P File-Sharing System.
Proceedings of the 27th IEEE International Conference on Distributed Computing Systems (ICDCS 2007), 2007

2006
Measurement study and application of social network in the Maze P2P file-sharing system.
Proceedings of the 1st International Conference on Scalable Information Systems, 2006

Analyzing Peer-to-Peer Traffic's Impact on Large Scale Networks.
Proceedings of the Computational Science, 2006

Understanding the Session Durability in Peer-to-Peer Storage System.
Proceedings of the Computational Science, 2006

Analyze the Impact of User Search Behavior on DHT-based P2P File Sharing System.
Proceedings of the Grid and Cooperative Computing Workshops, 2006

Bring Reputation System to Social Network in the Maze P2P File-Sharing System.
Proceedings of the 2006 International Symposium on Collaborative Technologies and Systems, 2006

2005
Characterization of P2P File-Sharing System.
Proceedings of the Parallel and Distributed Processing and Applications, 2005

An Empirical Study of Free-Riding Behavior in the Maze P2P File-Sharing System.
Proceedings of the Peer-to-Peer Systems IV, 4th International Workshop, 2005

2004
Deployment of a Large-scale Peer-to-Peer Social Network.
Proceedings of the First USENIX Workshop on Real, Large Distributed Systems, 2004


  Loading...