Shiyao Wang

Orcid: 0000-0001-5291-4945

Affiliations:
  • KuaiShou Inc., Beijing, China
  • Tsinghua University, Department of Computer Science, Tsinghua National Laboratory for Information Science and Technology, State Key Laboratory of Intelligent Technology and Systems, Beijing, China


According to our database1, Shiyao Wang authored at least 26 papers between 2016 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
MISS: Multi-Modal Tree Indexing and Searching with Lifelong Sequential Behavior for Retrieval Recommendation.
CoRR, August, 2025

Kwai Keye-VL Technical Report.
CoRR, July, 2025

Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation.
CoRR, June, 2025

OneRec Technical Report.
CoRR, June, 2025

OneRec: Unifying Retrieve and Rank with Generative Recommender and Iterative Preference Alignment.
CoRR, February, 2025

2024
QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou.
CoRR, 2024

Moment&Cross: Next-Generation Real-Time Cross-Domain CTR Prediction for Live-Streaming Recommendation at Kuaishou.
CoRR, 2024

MMBee: Live Streaming Gift-Sending Recommendations via Multi-Modal Fusion and Behaviour Expansion.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

A Multimodal Transformer for Live Streaming Highlight Prediction.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

2023
ContentCTR: Frame-level Live Streaming Click-Through Rate Prediction with Multimodal Transformer.
CoRR, 2023

2022
A High-resolution Image-based Virtual Try-on System in Taobao E-commerce Scenario.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

CreaGAN: An Automatic Creative Generation Framework for Display Advertising.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Self-Supervised Text Erasing with Controllable Image Synthesis.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2021
A Hybrid Bandit Model with Visual Priors for Creative Ranking in Display Advertising.
Proceedings of the WWW '21: The Web Conference 2021, 2021

2019
Fast Object Detection in Compressed Video.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Fast Object Detection in Compressed Video.
CoRR, 2018

Densely Connected CNN with Multi-scale Feature Attention for Text Classification.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Masked Label Learning for Optical Flow Regression.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Fully Motion-Aware Network for Video Object Detection.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
Augmented Reality as a Telemedicine Platform for Remote Procedural Training.
Sensors, 2017

Tightly-coupled convolutional neural network with spatial-temporal memory for text classification.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

2016
An Accurate GPS-IMU/DR Data Fusion Method for Driverless Car Based on a Set of Predictive Models and Grid Constraints.
Sensors, 2016

SAM: A rethinking of prominent convolutional neural network architectures for visual object recognition.
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

Accelerating Convolutional Neural Networks with Dominant Convolutional Kernel and Knowledge Pre-regression.
Proceedings of the Computer Vision - ECCV 2016, 2016

Stochastic Area Pooling for Generic Convolutional Neural Network.
Proceedings of the ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands, 2016

Collaborative Learning Network for Face Attribute Prediction.
Proceedings of the Computer Vision - ACCV 2016, 2016


  Loading...