Peng-Tao Jiang

ORCID: 0000-0002-1786-4943

According to our database, Peng-Tao Jiang authored at least 57 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

AgeBooth: Controllable Facial Aging and Rejuvenation via Diffusion Models.

CoRR, October, 2025

Realism Control One-step Diffusion for Real-World Image Super-Resolution.

CoRR, September, 2025

RED: Robust Event-Guided Motion Deblurring with Modality-Specific Disentangled Representation.

CoRR, September, 2025

Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution.

CoRR, August, 2025

A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models.

CoRR, August, 2025

SDMatte: Grafting Diffusion Models for Interactive Matting.

CoRR, August, 2025

Q-Ponder: A Unified Training Pipeline for Reasoning-based Visual Quality Assessment.

CoRR, June, 2025

HyperMotion: DiT-Based Pose-Guided Human Image Animation of Complex Motions.

CoRR, May, 2025

Any-to-Bokeh: One-Step Video Bokeh via Multi-Plane Image Guided Diffusion.

CoRR, May, 2025

MagicTryOn: Harnessing Diffusion Transformer for Garment-Preserving Video Virtual Try-on.

CoRR, May, 2025

Photography Perspective Composition: Towards Aesthetic Perspective Recommendation.

CoRR, May, 2025

Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning.

CoRR, May, 2025

DSDNet: Raw Domain Demoiréing via Dual Color-Space Synergy.

CoRR, April, 2025

M2N2V2: Multi-Modal Unsupervised and Training-free Interactive Segmentation.

CoRR, March, 2025

Towards Training-Free Open-World Segmentation via Image Prompt Foundation Models.

Int. J. Comput. Vis., January, 2025

DepthMaster: Taming Diffusion Models for Monocular Depth Estimation.

CoRR, January, 2025

Learning Adaptive Lighting via Channel-Aware Guidance.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

A Temporal Modeling Framework for Video Pre-Training on Video Instance Segmentation.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Multi-Task Dense Predictions via Unleashing the Power of Diffusion.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Boosting Vision State Space Model with Fractal Scanning.

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Advancing Comprehensive Aesthetic Insight with Multi-Scale Text-Guided Self-Supervised Learning.

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

RDNeRF: relative depth guided NeRF for dense free view synthesis.

Vis. Comput., March, 2024

Advancing Comprehensive Aesthetic Insight with Multi-Scale Text-Guided Self-Supervised Learning.

CoRR, 2024

Learning Adaptive Lighting via Channel-Aware Guidance.

CoRR, 2024

Learning Differential Pyramid Representation for Tone Mapping.

CoRR, 2024

CPA: Camera-pose-awareness Diffusion Transformer for Video Generation.

CoRR, 2024

ClearSR: Latent Low-Resolution Image Embeddings Help Diffusion-Based Real-World Super Resolution Models See Clearer.

CoRR, 2024

ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution.

CoRR, 2024

Towards Natural Image Matting in the Wild via Real-Scenario Prior.

CoRR, 2024

Scalable Visual State Space Model with Fractal Scanning.

CoRR, 2024

Empowering Segmentation Ability to Multi-modal Large Language Models.

CoRR, 2024

Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Non-uniform Timestep Sampling: Towards Faster Diffusion Model Training.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Chain of Visual Perception: Harnessing Multimodal Large Language Models for Zero-shot Camouflaged Object Detection.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Improving Adversarial Energy-Based Model via Diffusion Process.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Beta-Tuned Timestep Diffusion Model.

Proceedings of the Computer Vision - ECCV 2024, 2024

Revisiting Single Image Reflection Removal in the Wild.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Multi-Task Dense Prediction via Mixture of Low-Rank Experts.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Traffic Scene Parsing Through the TSP6K Dataset.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Deeply Explain CNN Via Hierarchical Decomposition.

Int. J. Comput. Vis., May, 2023

Decoupling Degradation and Content Processing for Adverse Weather Image Restoration.

CoRR, 2023

Generalization and Hallucination of Large Vision-Language Models through a Camouflaged Lens.

CoRR, 2023

Towards Training-free Open-world Segmentation via Image Prompting Foundation Models.

CoRR, 2023

PGformer: Proxy-Bridged Game Transformer for Multi-Person Extremely Interactive Motion Prediction.

CoRR, 2023

Segment Anything is A Good Pseudo-label Generator for Weakly Supervised Semantic Segmentation.

Peng-Tao Jiang, Yuqi Yang

CoRR, 2023

Looking Through the Glass: Neural Surface Reconstruction Against High Specular Reflections.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Online Attention Accumulation for Weakly Supervised Semantic Segmentation.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Attention mechanisms in computer vision: A survey.

Comput. Vis. Media, 2022

L2G: A Simple Local-to-Global Knowledge Transfer Framework for Weakly Supervised Semantic Segmentation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Delving Deep Into Label Smoothing.

IEEE Trans. Image Process., 2021

LayerCAM: Exploring Hierarchical Class Activation Maps for Localization.

IEEE Trans. Image Process., 2021

Personalized Image Semantic Segmentation.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2019

Integral Object Mining via Online Attention Accumulation.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018

Semantic Edge Detection with Diverse Deep Supervision.

CoRR, 2018

Self-Erasing Network for Integral Object Attention.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

DEL: Deep Embedding Learning for Efficient Image Segmentation.

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
