Shuai Yang

Orcid: 0000-0002-5576-8629

Affiliations:

Nanyang Technological University, Singapore
Peking University, Institute of Computer Science and Technology, Beijing, China (former)

According to our database¹, Shuai Yang authored at least 90 papers between 2015 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2025

RAPO++: Cross-Stage Prompt Optimization for Text-to-Video Generation via Data Alignment and Test-Time Scaling.

CoRR, October, 2025

InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy.

CoRR, October, 2025

E³DGE: Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion.

Int. J. Comput. Vis., September, 2025

LINR Bridge: Vector Graphic Animation via Neural Implicits and Video Diffusion Priors.

CoRR, September, 2025

ANYPORTAL: Zero-Shot Consistent Video Background Replacement.

Wenshuo Gao

Xicheng Lan

Shuai Yang

CoRR, September, 2025

STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer.

CoRR, August, 2025

InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation.

CoRR, July, 2025

TokensGen: Harnessing Condensed Tokens for Long Video Generation.

CoRR, July, 2025

CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation.

CoRR, June, 2025

Video World Models with Long-term Spatial Memory.

CoRR, June, 2025

WORLDMEM: Long-term Consistent World Simulation with Memory.

CoRR, April, 2025

Language-based Image Colorization: A Benchmark and Beyond.

Yifan Li

Shuai Yang

Jiaying Liu

CoRR, March, 2025

Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space.

CoRR, March, 2025

Balanced Image Stylization with Style Matching Score.

CoRR, March, 2025

Splice, Focus and Relife: High-Resolution Periodic Pattern Generation.

Proceedings of the IEEE International Symposium on Circuits and Systems, 2025

Trajectory attention for fine-grained video motion control.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

GaussianAnything: Interactive Point Cloud Flow Matching for 3D Generation.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model.

Xiang Gao

Shuai Yang

Jiaying Liu

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Imagine360: Immersive 360 Video Generation from Perspective Anchor.

CoRR, 2024

GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation.

CoRR, 2024

LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation.

CoRR, 2024

Intelligent Artistic Typography: A Comprehensive Review of Artistic Text Design and Generation.

CoRR, 2024

Self-Supervised Skeleton Action Representation Learning: A Benchmark and Beyond.

CoRR, 2024

Grounded 3D-LLM with Referent Tokens.

CoRR, 2024

PRIME: Protect Your Videos From Malicious Editing.

CoRR, 2024

Video Diffusion Models are Training-free Motion Interpreter and Controller.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

CoolColor: Text-guided COherent OLd film COLORization.

Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024

COCO-LC: Colorfulness Controllable Language-based Colorization.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation.

Proceedings of the Computer Vision - ECCV 2024, 2024

GroupDiff: Diffusion-Based Group Portrait Editing.

Proceedings of the Computer Vision - ECCV 2024, 2024

Fresco: Spatial-Temporal Correspondence for Zero-Shot Video Translation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

VideoBooth: Diffusion-based Video Generation with Image Prompts.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

GP-UNIT: Generative Prior for Versatile Unsupervised Image-to-Image Translation.

IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Intelligent Typography: Artistic Text Style Transfer for Complex Texture and Structure.

IEEE Trans. Multim., 2023

DeformToon3D: Deformable 3D Toonification from Neural Radiance Fields.

CoRR, 2023

Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation.

Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image.

Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

DeformToon3d: Deformable Neural Radiance Fields for 3D Toonification.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Text2Performer: Text-Driven Human Video Generation.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

VToonify: Controllable High-Resolution Portrait Video Style Transfer.

ACM Trans. Graph., 2022

Text2Human: text-driven controllable human image generation.

ACM Trans. Graph., 2022

CLAST: Contrastive Learning for Arbitrary Style Transfer.

IEEE Trans. Image Process., 2022

Artistic Text Style Transfer: An overview of state-of-the-art methods and datasets [SP Forum].

IEEE Signal Process. Mag., 2022

Shape-Matching GAN++: Scale Controllable Dynamic Artistic Text Style Transfer.

Shuai Yang

Zhangyang Wang

Jiaying Liu

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Unsupervised Image-to-Image Translation with Generative Prior.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Towards Coding for Human and Machine Vision: Scalable Face Image Coding.

IEEE Trans. Multim., 2021

Controllable Sketch-to-Image Translation for Robust Face Synthesis.

IEEE Trans. Image Process., 2021

TE141K: Artistic Text Benchmark for Text Effect Transfer.

Shuai Yang

Wenjing Wang

Jiaying Liu

IEEE Trans. Pattern Anal. Mach. Intell., 2021

Mask-guided GAN for robust text editing in the scene.

Neurocomputing, 2021

Edit Like A Designer: Modeling Design Workflows for Unaligned Fashion Editing.

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Instance-Aware Coherent Video Style Transfer for Chinese Ink Wash Painting.

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

2020

Consistent Video Style Transfer via Relaxation and Regularization.

IEEE Trans. Image Process., 2020

From Design Draft to Real Attire: Unaligned Fashion Image Translation.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multitask Attentive Network For Text Effects Quality Assessment.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Towards Coding For Human And Machine Vision: A Scalable Image Coding Approach.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Deep Plastic Surgery: Robust and Controllable Image Editing with Human-Drawn Sketches.

Proceedings of the Computer Vision - ECCV 2020, 2020

2019

Scale-Free Single Image Deraining Via Visibility-Enhanced Recurrent Wavelet Learning.

IEEE Trans. Image Process., 2019

Context-Aware Text-Based Binary Image Stylization and Synthesis.

IEEE Trans. Image Process., 2019

D3R-Net: Dynamic Routing Residue Recurrent Network for Video Rain Removal.

IEEE Trans. Image Process., 2019

Selfie retoucher: subject-oriented self-portrait enhancement.

Sifeng Xia

Shuai Yang

Jiaying Liu

Multim. Tools Appl., 2019

TE141K: Artistic Text Benchmark for Text Effects Transfer.

Shuai Yang

Wenjing Wang

Jiaying Liu

CoRR, 2019

Artistic Text Stylization for Visual-Textual Presentation Synthesis.

Shuai Yang

Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

Controllable Artistic Text Style Transfer via Shape-Matching GAN.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Typography With Decor: Intelligent Text Style Transfer.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

TET-GAN: Text Effects Transfer via Stylization and Destylization.

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Structure-Guided Image Inpainting Using Homography Transformation.

IEEE Trans. Multim., 2018

Joint-Feature Guided Depth Map Super-Resolution With Face Priors.

IEEE Trans. Cybern., 2018

Automatic portrait oil painter: joint domain stylization for portrait images.

Multim. Tools Appl., 2018

Text effects transfer via distribution-aware texture synthesis.

Comput. Vis. Image Underst., 2018

Context-Aware Unsupervised Text Stylization.

Proceedings of the 2018 ACM Multimedia Conference, 2018

Soft Decoding of Light Field Images Using POCS and Fast Graph Spectral Filters.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Erase or Fill? Deep Joint Recurrent Rain Removal and Reconstruction in Videos.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Joint-domain unsupervised stylization for portraits.

Proceedings of the IEEE International Symposium on Circuits and Systems, 2017

Soft segmentation-guided bipartite graph image stylization.

Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

1+N fusion: Cascaded self-portrait enhancement.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Awesome Typography: Statistics-Based Text Effects Transfer.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Facial depth map enhancement via neighbor embedding.

Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Structure-guided image completion via regularity statistics.

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Novel self-portrait enhancement via multi-photo fusing.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

No-reference quality assessment for image sharpness and noise.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015

Hierarchical oil painting stylization with limited reference via sparse representation.

Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

Novel autoregressive model based on adaptive window-extension and patch-geodesic distance for image interpolation.

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Perspective distorted video restoration and stabilization for mobile devices.

Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015
