Yang Wang

Orcid: 0000-0001-7322-4062

Affiliations:
  • University of Electronic Science and Technology of China, MOE Key Laboratory of Optical Fiber Sensing and Communications, Chengdu, China


According to our database1, Yang Wang authored at least 28 papers between 2008 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs.
CoRR, June, 2025

DSTC: Dual-Side Sparse Tensor Core for DNNs Acceleration on Modern GPU Architectures.
IEEE Trans. Computers, February, 2025

LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator.
CoRR, January, 2025

Reduction Fusion for Optimized Distributed Data-Parallel Computations via Inverse Recomputation.
Proceedings of the 33rd ACM International Conference on the Foundations of Software Engineering, 2025

PipeThreader: Software-Defined Pipelining for Efficient DNN Execution.
Proceedings of the 19th USENIX Symposium on Operating Systems Design and Implementation, 2025

LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2025

2024
Toward CXL-Native Memory Tiering via Device-Side Profiling.
CoRR, 2024

Anubis: Towards Reliable Cloud AI Infrastructure via Proactive Validation.
CoRR, 2024

SuperBench: Improving Cloud AI Infrastructure Reliability with Proactive Validation.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering.
Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024

VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
LUT-NN: Towards Unified Neural Network Inference by Table Lookup.
CoRR, 2023

LUT-NN: Empower Efficient Neural Network Inference with Centroid Learning and Table Lookup.
Proceedings of the 29th Annual International Conference on Mobile Computing and Networking, 2023

2022
FlexMon: A flexible and fine-grained traffic monitor for programmable networks.
J. Netw. Comput. Appl., 2022

Joint optimization of dynamic resource allocation and packet scheduling for virtual switches in cognitive internet of vehicles.
EURASIP J. Adv. Signal Process., 2022

Towards efficient vision transformer inference: a first study of transformers on mobile devices.
Proceedings of the HotMobile '22: The 23rd International Workshop on Mobile Computing Systems and Applications, Tempe, Arizona, USA, March 9, 2022

SparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute.
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022

Romou: rapidly generate high-performance tensor kernels for mobile GPUs.
Proceedings of the ACM MobiCom '22: The 28th Annual International Conference on Mobile Computing and Networking, Sydney, NSW, Australia, October 17, 2022

2021
Dual-side Sparse Tensor Core.
Proceedings of the 48th ACM/IEEE Annual International Symposium on Computer Architecture, 2021

NeuralMon: Graph Neural Network for Flow Measurement Allocation.
Proceedings of the IEEE Global Communications Conference, 2021

2020
LadaBERT: Lightweight Adaptation of BERT through Hybrid Model Compression.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
Low Complexity Hierarchical Scheduling for Diverse Datacenter Jobs.
IEEE Commun. Lett., 2019

2018
MOSC: a method to assign the outsourcing of service function chain across multiple clouds.
Comput. Networks, 2018

2016
Towards optimal outsourcing of service function chain across multiple clouds.
Proceedings of the 2016 IEEE International Conference on Communications, 2016

2009
Products of Mealy-type fuzzy finite state machines.
Fuzzy Sets Syst., 2009

2008
Lattice Minimal Automata and Lattice Reduced Automata.
Proceedings of the Fuzzy Information and Engineering, 2008

Equivalence between Mizumoto Lattice Finite Automata.
Proceedings of the Fuzzy Information and Engineering, 2008


  Loading...