Yiren Song

Orcid: 0000-0002-7028-3347

According to our database1, Yiren Song authored at least 38 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
WordCon: Word-level Typography Control in Scene Text Rendering.
CoRR, June, 2025

MCA-Bench: A Multimodal Benchmark for Evaluating CAPTCHA Robustness Against VLM-based Attacks.
CoRR, June, 2025

RelationAdapter: Learning and Transferring Visual Relation with Diffusion Transformers.
CoRR, June, 2025

Autoregressive Images Watermarking through Lexical Biasing: An Approach Resistant to Regeneration Attack.
CoRR, June, 2025

EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering.
CoRR, May, 2025

DiffDecompose: Layer-Wise Decomposition of Alpha-Composited Images via Diffusion Transformers.
CoRR, May, 2025

GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning Chains.
CoRR, May, 2025

OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data.
CoRR, May, 2025

FocusedAD: Character-centric Movie Audio Description.
CoRR, April, 2025

TransAnimate: Taming Layer Diffusion to Generate RGBA Video.
CoRR, March, 2025

EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer.
CoRR, March, 2025

PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data.
CoRR, February, 2025

MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation.
CoRR, February, 2025

LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer.
CoRR, February, 2025

Any2AnyTryon: Leveraging Adaptive Position Embeddings for Versatile Virtual Clothing Tasks.
CoRR, January, 2025

Two-dimensional normalized knowledge distillation leveraging class relations.
J. Vis. Commun. Image Represent., 2025

Image Watermarks are Removable using Controllable Regeneration from Clean Noise.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Stable-Hair: Real-World Hair Transfer via Diffusion Model.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
DiffSim: Taming Diffusion Models for Evaluating Visual Similarity.
CoRR, 2024

GridShow: Omni Visual Generation.
CoRR, 2024

Anti-Reference: Universal and Immediate Defense Against Reference-Based Generation.
CoRR, 2024

FonTS: Text Rendering with Typography and Style Controls.
CoRR, 2024

Stable-Hair: Real-World Hair Transfer via Diffusion Model.
CoRR, 2024

Steganalysis on Digital Watermarking: Is Your Defense Truly Impervious?
CoRR, 2024

WMAdapter: Adding WaterMark Control to Latent Diffusion Models.
CoRR, 2024

ProcessPainter: Learn Painting Process from Sequence Data.
CoRR, 2024

Fast Personalized Text-to-Image Syntheses With Attention Injection.
CoRR, 2024

Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model.
CoRR, 2024

ProcessPainter: Learning to draw from sequence data.
Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024

Can Simple Averaging Defeat Modern Watermarks?
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Fast Personalized Text to Image Synthesis with Attention Injection.
Proceedings of the IEEE International Conference on Acoustics, 2024

RingID: Rethinking Tree-Ring Watermarking for Enhanced Multi-key Identification.
Proceedings of the Computer Vision - ECCV 2024, 2024

SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
CLIPVG: Text-Guided Image Manipulation Using Differentiable Vector Graphics.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Multi-sensor measurement fusion based on minimum mixture error entropy with non-Gaussian measurement noise.
Digit. Signal Process., 2022

CLIPTexture: Text-Driven Texture Synthesis.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

CLIPFont: Text Guided Vector WordArt Generation.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022


  Loading...