Sucheng Ren

Orcid: 0000-0003-4730-8435

According to our database¹, Sucheng Ren authored at least 46 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Autoregressive Video Generation beyond Next Frames Prediction.

[BibT_eX]

[DOI]

CoRR, September, 2025

HResFormer: Hybrid Residual Transformer for Volumetric Medical Image Segmentation.

[BibT_eX]

[DOI]

Sucheng Ren

Xiaomeng Li

IEEE Trans. Neural Networks Learn. Syst., June, 2025

Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers.

[BibT_eX]

[DOI]

CoRR, May, 2025

Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation.

[BibT_eX]

[DOI]

CoRR, February, 2025

ARVideo: Autoregressive Pretraining for Self-Supervised Video Representation Learning.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2025

DeepMIM: Deep Supervision for Masked Image Modeling.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Autoregressive Pretraining with Mamba in Vision.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Adventurer: Optimizing Vision Mamba Architecture Designs for Efficiency.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Mamba-Reg: Vision Mamba Also Needs Registers.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Unifying Global-Local Representations in Salient Object Detection With Transformers.

[BibT_eX]

[DOI]

IEEE Trans. Emerg. Top. Comput. Intell., August, 2024

FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching.

[BibT_eX]

[DOI]

CoRR, 2024

HResFormer: Hybrid Residual Transformer for Volumetric Medical Image Segmentation.

[BibT_eX]

[DOI]

Sucheng Ren

Xiaomeng Li

CoRR, 2024

M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation.

[BibT_eX]

[DOI]

CoRR, 2024

Causal Image Modeling for Efficient Visual Understanding.

[BibT_eX]

[DOI]

CoRR, 2024

What If We Recaption Billions of Web Images with LLaMA-3?

[BibT_eX]

[DOI]

CoRR, 2024

Medical Vision Generalist: Unifying Medical Imaging Tasks in Context.

[BibT_eX]

[DOI]

CoRR, 2024

Mamba-R: Vision Mamba ALSO Needs Registers.

[BibT_eX]

[DOI]

CoRR, 2024

Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapolation.

[BibT_eX]

[DOI]

CoRR, 2024

Glance to Count: Learning to Rank with Anchors for Weakly-supervised Crowd Counting.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Rejuvenating image-GPT as Strong Visual Representation Learners.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

Reducing Spatial Labeling Redundancy for Active Semi-Supervised Crowd Counting.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Edge Distraction-aware Salient Object Detection.

[BibT_eX]

[DOI]

IEEE Multim., 2023

Compress & Align: Curating Image-Text Data with Human Knowledge.

[BibT_eX]

[DOI]

CoRR, 2023

DeepMIM: Deep Supervision for Masked Image Modeling.

[BibT_eX]

[DOI]

CoRR, 2023

NPF-200: A Multi-Modal Eye Fixation Dataset and Method for Non-Photorealistic Videos.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Fine-grained Domain Adaptive Crowd Counting via Point-derived Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Self-supervision through Random Segments with Autoregressive Coding (RandSAC).

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

SG-Former: Self-guided Transformer with Evolving Token Reallocation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

TinyMIM: An Empirical Study of Distilling MIM Pre-trained Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

The Modality Focusing Hypothesis: On the Blink of Multimodal Knowledge Distillation.

[BibT_eX]

[DOI]

CoRR, 2022

Training-Free Robust Multimodal Learning via Sample-Wise Jacobian Regularization.

[BibT_eX]

[DOI]

CoRR, 2022

Self-supervision through Random Segments with Autoregressive Coding (RandSAC).

[BibT_eX]

[DOI]

CoRR, 2022

DynaST: Dynamic Sparse Transformer for Exemplar-Guided Image Generation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Learning from Multiple Annotator Noisy Labels via Sample-Wise Label Fusion.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision, 2022

Shunted Self-Attention via Multi-Scale Token Aggregation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

A Simple Data Mixing Prior for Improving Self-Supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Co-advise: Cross Inductive Bias Distillation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Reducing Spatial Labeling Redundancy for Semi-supervised Crowd Counting.

[BibT_eX]

[DOI]

CoRR, 2021

Unifying Global-Local Representations in Salient Object Detection with Transformer.

[BibT_eX]

[DOI]

CoRR, 2021

Multimodal Knowledge Expansion.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

On Feature Decorrelation in Self-Supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Reciprocal Transformations for Unsupervised Video Object Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning From the Master: Distilling Cross-Modal Advanced Knowledge for Lip Reading.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Delving Deep Into Many-to-Many Attention for Few-Shot Video Object Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

TENet: Triple Excitation Network for Video Salient Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Sucheng Ren

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...