Shuo Yang

Orcid: 0000-0003-1638-9623

Affiliations:
  • Tsinghua University, Tsinghua Shenzhen International Graduate School, China


According to our database1, Shuo Yang authored at least 78 papers between 2017 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
LATTE: Forecasting Peer Anchored Preference Trajectories for Personalized LLM Generation.
CoRR, May, 2026

Beyond the Target: From Imitation to Collaboration in Speculative Decoding.
CoRR, May, 2026

Belief-Guided Inference Control for Large Language Model Services via Verifiable Observations.
CoRR, April, 2026

OCR-Memory: Optical Context Retrieval for Long-Horizon Agent Memory.
CoRR, April, 2026

Pythia: Toward Predictability-Driven Agent-Native LLM Serving.
CoRR, April, 2026

DR-Venus: Towards Frontier Edge-Scale Deep Research Agents with Only 10K Open Data.
CoRR, April, 2026

Well Begun is Half Done: Training-Free and Model-Agnostic Semantically Guaranteed User Representation Initialization for Multimodal Recommendation.
CoRR, April, 2026

Flash-KMeans: Fast and Memory-Efficient Exact K-Means.
CoRR, March, 2026

SVG-EAR: Parameter-Free Linear Compensation for Sparse Video Generation via Error-aware Routing.
CoRR, March, 2026

CAMMSR: Category-Guided Attentive Mixture of Experts for Multimodal Sequential Recommendation.
CoRR, March, 2026

DGGVAE: Dual-Granularity Graph Variational Auto-Encoder for Group Recommendation.
ACM Trans. Inf. Syst., February, 2026

Reinforced Curriculum Pre-Alignment for Domain-Adaptive VLMs.
CoRR, February, 2026

Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization.
CoRR, February, 2026

Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow.
CoRR, January, 2026

Blind Radio Map Construction via Topology-Guided Manifold Learning.
IEEE Internet Things J., 2026

RAMA: Retrieval-Augmented Multi-Agent Framework for Misinformation Detection in Multimodal Fact-Checking.
Proceedings of the Companion Proceedings of the ACM Web Conference 2026, 2026

CoLD: Collaborative Label Denoising Framework for Network Intrusion Detection.
Proceedings of the 33rd Annual Network and Distributed System Security Symposium, 2026

VI-MMRec: Similarity-Aware Training Cost-free Virtual User-Item Interactions for Multimodal Recommendation.
Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026

Learning and Editing Universal Graph Prompt Tuning via Reinforcement Learning.
Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026

BlendServe: Optimizing Offline Inference with Resource-Aware Batching.
Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2026

Multi-modal Dynamic Proxy Learning for Personalized Multiple Clustering.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Careful Queries, Credible Results: Teaching RAG Models Advanced Web Search Tools with Reinforcement Learning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Let the Barbarians In: How AI Can Accelerate Systems Performance Research.
CoRR, December, 2025

Enhancing Robustness and Generalization Capability for Multimodal Recommender Systems via Sharpness-Aware Minimization.
IEEE Trans. Knowl. Data Eng., November, 2025

Training-Free Loosely Speculative Decoding: Accepting Semantically Correct Drafts Beyond Exact Match.
CoRR, November, 2025

StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation.
CoRR, November, 2025

Barbarians at the Gate: How AI is Upending Systems Research.
CoRR, October, 2025

vAttention: Verified Sparse Attention.
CoRR, October, 2025

Toward Edge General Intelligence via Large Language Models: Opportunities and Challenges.
IEEE Netw., September, 2025

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention.
CoRR, September, 2025

Beyond Classification: Evaluating LLMs for Fine-Grained Automatic Malware Behavior Auditing.
CoRR, September, 2025

Atom-Searcher: Enhancing Agentic Deep Research via Fine-Grained Atomic Thought Reward.
CoRR, August, 2025

Large Language Models for Network Intrusion Detection Systems: Foundations, Implementations, and Future Directions.
CoRR, July, 2025

Generative AI for Vulnerability Detection in 6G Wireless Networks: Advances, Case Study, and Future Directions.
CoRR, June, 2025

Radial Attention: O(n log n) Sparse Attention with Energy Decay for Long Video Generation.
CoRR, June, 2025

Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving.
CoRR, May, 2025

An Extensible Software Transport Layer for GPU Networking.
CoRR, April, 2025

WorldModelBench: Judging Video Generation Models As World Models.
CoRR, February, 2025

A Survey on Multimodal Recommender Systems: Recent Advances and Future Directions.
CoRR, February, 2025

Learning Temporal Invariance in Android Malware Detectors.
CoRR, February, 2025

Twilight: Adaptive Attention Sparsity with Hierarchical Top-<i>p</i> Pruning.
CoRR, February, 2025

Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity.
CoRR, February, 2025

Self-Supervised Adaptation Method to Concept Drift for Network Intrusion Detection.
IEEE Trans. Dependable Secur. Comput., 2025

LAMD: Context-Driven Android Malware Detection and Classification with LLMs.
Proceedings of the 2025 IEEE Security and Privacy, 2025

NLGCL: Naturally Existing Neighbor Layers Graph Contrastive Learning for Recommendation.
Proceedings of the Nineteenth ACM Conference on Recommender Systems, 2025

Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Twilight: Adaptive Attention Sparsity with Hierarchical Top-$p$ Pruning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

RealFactBench: A Benchmark for Evaluating Large Language Models in Real-World Fact-Checking.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

The Best is Yet to Come: Graph Convolution in the Testing Phase for Multimodal Recommendation.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

MDVT: Enhancing Multimodal Recommendation with Model-Agnostic Multimodal-Driven Virtual Triplets.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

Sparse Video-Gen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

HashAttention: Semantic Sparsity for Faster Inference.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Mitigating Statistical Heterogeneity in Intrusion Detection Systems within Federated Learning.
Proceedings of the 2025 IEEE Global Communications Conference, 2025

Hypercomplex Prompt-aware Multimodal Recommendation.
Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025

Enhancing Graph Collaborative Filtering with FourierKAN Feature Transformation.
Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025

MENTOR: Multi-level Self-supervised Learning for Multimodal Recommendation.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
An Efficient and Adaptive Content Delivery System Based on Hybrid Network.
ACM Trans. Knowl. Discov. Data, February, 2024

HashAttention: Semantic Sparsity for Faster Inference.
CoRR, 2024

BlendServe: Optimizing Offline Inference for Auto-regressive Large Models with Resource-aware Batching.
CoRR, 2024

Towards Edge General Intelligence via Large Language Models: Opportunities and Challenges.
CoRR, 2024

Post-Training Sparse Attention with Double Sparsity.
CoRR, 2024

FourierKAN-GCF: Fourier Kolmogorov-Arnold Network - An Effective and Efficient Feature Transformation for Graph Collaborative Filtering.
CoRR, 2024

STCA: Stacked Token-based Continuous Authentication Protocol for Zero Trust IoT.
Proceedings of the IEEE Wireless Communications and Networking Conference, 2024

An Efficient (t, n) Threshold Authentication Scheme for Vehicular Ad Hoc Networks.
Proceedings of the IEEE Wireless Communications and Networking Conference, 2024

SLoRA: Scalable Serving of Thousands of LoRA Adapters.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024

ReCDA: Concept Drift Adaptation with Representation Enhancement for Network Intrusion Detection.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

AlignGroup: Learning and Aligning Group Consensus with Member Preferences for Group Recommendation.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Multi-Scale Contrastive Attention Representation Learning for Encrypted Traffic Classification.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

2023
IBA: A secure and efficient device-to-device interaction-based authentication scheme for Internet of Things.
Comput. Commun., February, 2023

Rethinking Benchmark and Contamination for Language Models with Rephrased Samples.
CoRR, 2023

S-LoRA: Serving Thousands of Concurrent LoRA Adapters.
CoRR, 2023

A Reliable and Decentralized Trust Management Model for Fog Computing in Industrial IoT.
Proceedings of the NOMS 2023, 2023

SF-IDS: An Imbalanced Semi-Supervised Learning Framework for Fine-Grained Intrusion Detection.
Proceedings of the IEEE International Conference on Communications, 2023

A Lightweight Approach for Network Intrusion Detection Based on Self-Knowledge Distillation.
Proceedings of the IEEE International Conference on Communications, 2023

Rethink Long-Tailed Recognition with Vision Transforms.
Proceedings of the IEEE International Conference on Acoustics, 2023

Towards Efficient Blockchain-Based Cross-Domain Trust Reputation Management for LS-HetNet.
Proceedings of the IEEE Global Communications Conference, 2023

Learning Imbalanced Data with Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2017
Algorithm-Directed Crash Consistence in Non-volatile Memory for HPC.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017


  Loading...