Shuai Yang

Orcid: 0000-0002-5576-8629

Affiliations:
  • Nanyang Technological University, Singapore
  • Peking University, Institute of Computer Science and Technology, Beijing, China (former)


According to our database1, Shuai Yang authored at least 90 papers between 2015 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
RAPO++: Cross-Stage Prompt Optimization for Text-to-Video Generation via Data Alignment and Test-Time Scaling.
CoRR, October, 2025

InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy.
CoRR, October, 2025

E<sup>3</sup>DGE: Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion.
Int. J. Comput. Vis., September, 2025

LINR Bridge: Vector Graphic Animation via Neural Implicits and Video Diffusion Priors.
CoRR, September, 2025

ANYPORTAL: Zero-Shot Consistent Video Background Replacement.
CoRR, September, 2025

STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer.
CoRR, August, 2025

InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation.
CoRR, July, 2025

TokensGen: Harnessing Condensed Tokens for Long Video Generation.
CoRR, July, 2025

CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation.
CoRR, June, 2025

Video World Models with Long-term Spatial Memory.
CoRR, June, 2025

WORLDMEM: Long-term Consistent World Simulation with Memory.
CoRR, April, 2025

Language-based Image Colorization: A Benchmark and Beyond.
CoRR, March, 2025

Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space.
CoRR, March, 2025

Balanced Image Stylization with Style Matching Score.
CoRR, March, 2025

Splice, Focus and Relife: High-Resolution Periodic Pattern Generation.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2025

Trajectory attention for fine-grained video motion control.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

GaussianAnything: Interactive Point Cloud Flow Matching for 3D Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Imagine360: Immersive 360 Video Generation from Perspective Anchor.
CoRR, 2024

GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation.
CoRR, 2024

LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation.
CoRR, 2024

Intelligent Artistic Typography: A Comprehensive Review of Artistic Text Design and Generation.
CoRR, 2024

Self-Supervised Skeleton Action Representation Learning: A Benchmark and Beyond.
CoRR, 2024

Grounded 3D-LLM with Referent Tokens.
CoRR, 2024

PRIME: Protect Your Videos From Malicious Editing.
CoRR, 2024

Video Diffusion Models are Training-free Motion Interpreter and Controller.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

CoolColor: Text-guided COherent OLd film COLORization.
Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024

COCO-LC: Colorfulness Controllable Language-based Colorization.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

GroupDiff: Diffusion-Based Group Portrait Editing.
Proceedings of the Computer Vision - ECCV 2024, 2024

Fresco: Spatial-Temporal Correspondence for Zero-Shot Video Translation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

VideoBooth: Diffusion-based Video Generation with Image Prompts.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
GP-UNIT: Generative Prior for Versatile Unsupervised Image-to-Image Translation.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Intelligent Typography: Artistic Text Style Transfer for Complex Texture and Structure.
IEEE Trans. Multim., 2023

DeformToon3D: Deformable 3D Toonification from Neural Radiance Fields.
CoRR, 2023

Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation.
Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image.
Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

DeformToon3d: Deformable Neural Radiance Fields for 3D Toonification.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Text2Performer: Text-Driven Human Video Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
VToonify: Controllable High-Resolution Portrait Video Style Transfer.
ACM Trans. Graph., 2022

Text2Human: text-driven controllable human image generation.
ACM Trans. Graph., 2022

CLAST: Contrastive Learning for Arbitrary Style Transfer.
IEEE Trans. Image Process., 2022

Artistic Text Style Transfer: An overview of state-of-the-art methods and datasets [SP Forum].
IEEE Signal Process. Mag., 2022

Shape-Matching GAN++: Scale Controllable Dynamic Artistic Text Style Transfer.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Unsupervised Image-to-Image Translation with Generative Prior.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Towards Coding for Human and Machine Vision: Scalable Face Image Coding.
IEEE Trans. Multim., 2021

Controllable Sketch-to-Image Translation for Robust Face Synthesis.
IEEE Trans. Image Process., 2021

TE141K: Artistic Text Benchmark for Text Effect Transfer.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Mask-guided GAN for robust text editing in the scene.
Neurocomputing, 2021

Edit Like A Designer: Modeling Design Workflows for Unaligned Fashion Editing.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Instance-Aware Coherent Video Style Transfer for Chinese Ink Wash Painting.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

2020
Consistent Video Style Transfer via Relaxation and Regularization.
IEEE Trans. Image Process., 2020

From Design Draft to Real Attire: Unaligned Fashion Image Translation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multitask Attentive Network For Text Effects Quality Assessment.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Towards Coding For Human And Machine Vision: A Scalable Image Coding Approach.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Deep Plastic Surgery: Robust and Controllable Image Editing with Human-Drawn Sketches.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Scale-Free Single Image Deraining Via Visibility-Enhanced Recurrent Wavelet Learning.
IEEE Trans. Image Process., 2019

Context-Aware Text-Based Binary Image Stylization and Synthesis.
IEEE Trans. Image Process., 2019

D3R-Net: Dynamic Routing Residue Recurrent Network for Video Rain Removal.
IEEE Trans. Image Process., 2019

Selfie retoucher: subject-oriented self-portrait enhancement.
Multim. Tools Appl., 2019

TE141K: Artistic Text Benchmark for Text Effects Transfer.
CoRR, 2019

Artistic Text Stylization for Visual-Textual Presentation Synthesis.
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

Controllable Artistic Text Style Transfer via Shape-Matching GAN.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Typography With Decor: Intelligent Text Style Transfer.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

TET-GAN: Text Effects Transfer via Stylization and Destylization.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Structure-Guided Image Inpainting Using Homography Transformation.
IEEE Trans. Multim., 2018

Joint-Feature Guided Depth Map Super-Resolution With Face Priors.
IEEE Trans. Cybern., 2018

Automatic portrait oil painter: joint domain stylization for portrait images.
Multim. Tools Appl., 2018

Text effects transfer via distribution-aware texture synthesis.
Comput. Vis. Image Underst., 2018

Context-Aware Unsupervised Text Stylization.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Soft Decoding of Light Field Images Using Pocs and Fast Graph Spectrayl Filters.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Erase or Fill? Deep Joint Recurrent Rain Removal and Reconstruction in Videos.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Joint-domain unsupervised stylization for portraits.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2017

Soft segmentation-guided bipartite graph image stylization.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

1+N fusion: Cascaded self-portrait enhancement.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Awesome Typography: Statistics-Based Text Effects Transfer.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Facial depth map enhancement via neighbor embedding.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Structure-guided image completion via regularity statistics.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Novel self-portrait enhancement via multi-photo fusing.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

No-reference quality assessment for image sharpness and noise.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Hierarchical oil painting stylization with limited reference via sparse representation.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

Novel autoregressive model based on adaptive window-extension and patch-geodesic distance for image interpolation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Perspective distorted video restoration and stabilization for mobile devices.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015


  Loading...