Yihua Cheng

Orcid: 0000-0003-1353-9817

According to our database1, Yihua Cheng authored at least 56 papers between 2018 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Enhancing Gaze Reasoning in Vision Foundation Models for Gaze Following.
CoRR, May, 2026

FoSS: Modeling Long Range Dependencies and Multimodal Uncertainty in Trajectory Prediction via Fourier State Space Integration.
CoRR, March, 2026

Efficient Driving Behavior Narration and Reasoning on Edge Device Using Large Language Models.
IEEE Trans. Veh. Technol., January, 2026

DroidSpeak: KV Cache Sharing Across Fine-tuned Model Variants.
Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation, 2026

RTGaze: Real-Time 3D-Aware Gaze Redirection from a Single Image.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Force-Aware 3D Contact Modeling for Stable Grasp Generation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
VL4Gaze: Unleashing Vision-Language Models for Gaze Following.
CoRR, December, 2025

EVICPRESS: Joint KV-Cache Compression and Eviction for Efficient LLM Serving.
CoRR, December, 2025

Behavior-Aware Knowledge-Embedded Model for Driver Attention Prediction.
IEEE Trans. Circuits Syst. Video Technol., October, 2025

LMCache: An Efficient KV Cache Layer for Enterprise-Scale LLM Inference.
CoRR, October, 2025

AdaptCache: KV Cache Native Storage Hierarchy for Low-Delay and High-Quality Language Model Serving.
CoRR, September, 2025

Excavate the potential of Single-Scale Features: A Decomposition Network for Water-Related Optical Image Enhancement.
CoRR, August, 2025

HyperRAG: Enhancing Quality-Efficiency Tradeoffs in Retrieval-Augmented Generation with Reranker KV-Cache Reuse.
CoRR, April, 2025

Learning Velocity and Acceleration: Self-Supervised Motion Consistency for Pedestrian Trajectory Prediction.
CoRR, March, 2025

Towards More Economical Context-Augmented LLM Generation by Reusing Stored KV Cache.
CoRR, March, 2025

Excavate the Potential of Single-Scale Features: A Decomposition Network for Water-Related Optical Image Enhancement.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2025

Meta-Learning Enables Complex Cluster-Specific Few-Shot Binding Affinity Prediction for Protein-Protein Interactions.
J. Chem. Inf. Model., 2025

PrefillOnly: An Inference Engine for Prefill-only Workloads in Large Language Model Applications.
Proceedings of the ACM SIGOPS 31st Symposium on Operating Systems Principles, 2025

Multi-Hypothesis 3D Hand Mesh Recovering from a Single Blurry Image.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion.
Proceedings of the Twentieth European Conference on Computer Systems, 2025

PersonaBooth: Personalized Text-to-Motion Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Trajectory Mamba: Efficient Attention-Mamba Forecasting Model Based on Selective SSM.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

3D Prior Is All You Need: Cross-Task Few-shot 2D Gaze Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Earth+: On-Board Satellite Imagery Compression Leveraging Historical Earth Observations.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

Single-view Image to Novel-view Generation for Hand-Object Interactions.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Appearance-Based Gaze Estimation With Deep Learning: A Review and Benchmark.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Do Large Language Models Need a Content Delivery Network?
CoRR, 2024

Earth+: on-board satellite imagery compression leveraging historical earth observations.
CoRR, 2024

Chatterbox: Robust Transport for LLM Token Streaming under Unstable Network.
CoRR, 2024

Multi-Modal Gaze Following in Conversational Scenarios.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving.
Proceedings of the ACM SIGCOMM 2024 Conference, 2024

GRACE: Loss-Resilient Real-Time Video through Neural Codecs.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

Eloquent: A More Robust Transmission Scheme for LLM Token Streaming.
Proceedings of the 2024 SIGCOMM Workshop on Networks for AI Computing, 2024

TextGaze: Gaze-Controllable Face Generation with Natural Language.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model.
Proceedings of the Computer Vision - ECCV 2024, 2024

What Do You See in Vehicle? Comprehensive Vision Solution for In-Vehicle Gaze Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Enabling Perception-Driven Optimization in Networking.
SIGMETRICS Perform. Evaluation Rev., September, 2023

CacheGen: Fast Context Loading for Language Model Applications.
CoRR, 2023

Grace++: Loss-Resilient Real-Time Video Communication under High Network Latency.
CoRR, 2023

Optimizing Real-Time Video Experience with Data Scalable Codec.
Proceedings of the 2023 Workshop on Emerging Multimedia Systems, 2023

POLYCORN: Data-driven Cross-layer Multipath Networking for High-speed Railway through Composable Schedulerlets.
Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023

DVGaze: Dual-View Gaze Estimation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Online Profiling and Adaptation of Quality Sensitivity for Internet Video.
Proceedings of the 2023 ACM Symposium on Cloud Computing, SoCC 2023, 2023

Raising the Level of Abstraction for Time-State Analytics With the Timeline Framework.
Proceedings of the 13th Conference on Innovative Data Systems Research, 2023

High-Fidelity Eye Animatable Neural Radiance Fields for Human Face.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
GRACE: Loss-Resilient Real-Time Video Communication Using Data-Scalable Autoencoder.
CoRR, 2022

Gaze Estimation using Transformer.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

PureGaze: Purifying Gaze Feature for Generalizable Gaze Estimation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Critique of "Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility" by SCC Team From Peking University.
IEEE Trans. Parallel Distributed Syst., 2021

2020
Gaze Estimation by Exploring Two-Eye Asymmetry.
IEEE Trans. Image Process., 2020

A First Look at Disconnection-Centric TCP Performance on High-Speed Railways.
IEEE J. Sel. Areas Commun., 2020

Adaptive Feature Fusion Network for Gaze Tracking in Mobile Tablets.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

A Coarse-to-Fine Adaptive Network for Appearance-Based Gaze Estimation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
An Active-Passive Measurement Study of TCP Performance over LTE on High-speed Rails.
Proceedings of the 25th Annual International Conference on Mobile Computing and Networking, 2019

2018
Student Cluster Competition 2017, Team Peking University: Reproducing vectorization of the Tersoff multi-body potential on the Intel Broadwell architecture.
Parallel Comput., 2018

Appearance-Based Gaze Estimation via Evaluation-Guided Asymmetric Regression.
Proceedings of the Computer Vision - ECCV 2018, 2018


  Loading...