Peng-Tao Jiang

Orcid: 0000-0002-1786-4943

According to our database¹, Peng-Tao Jiang authored at least 77 papers between 2018 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Towards Photorealistic and Efficient Bokeh Rendering via Diffusion Framework.

[BibT_eX]

[DOI]

CoRR, May, 2026

SEMat: Semantic Enhanced Natural Image Interactive Matting.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., April, 2026

SmartPhotoCrafter: Unified Reasoning, Generation and Optimization for Automatic Photographic Image Editing.

[BibT_eX]

[DOI]

CoRR, April, 2026

TS-Attn: Temporal-wise Separable Attention for Multi-Event Video Generation.

[BibT_eX]

[DOI]

CoRR, April, 2026

Anchor Forcing: Anchor Memory and Tri-Region RoPE for Interactive Streaming Video Diffusion.

[BibT_eX]

[DOI]

CoRR, March, 2026

MedMASLab: A Unified Orchestration Framework for Benchmarking Multimodal Medical Multi-Agent Systems.

[BibT_eX]

[DOI]

CoRR, March, 2026

C<sup>2</sup>FG: Control Classifier-Free Guidance via Score Discrepancy Analysis.

[BibT_eX]

[DOI]

CoRR, March, 2026

FlowConsist: Make Your Flow Consistent with Real Trajectory.

[BibT_eX]

[DOI]

CoRR, February, 2026

Trust but Verify: Adaptive Conditioning for Reference-Based Diffusion Super-Resolution via Implicit Reference Correlation Modeling.

[BibT_eX]

[DOI]

CoRR, February, 2026

Bidirectional Beta-Tuned Diffusion Model.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., January, 2026

Bidirectional Noise Injection: Enhancing Diffusion Models via Coordinated Input-Output Perturbation.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Realism Control One-step Diffusion for Real-world Image Super Resolution.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

CameraMaster: Unified Camera Semantic-Parameter Control for Photography Retouching.

[BibT_eX]

[DOI]

CoRR, November, 2025

MagicWorld: Interactive Geometry-driven Video World Exploration.

[BibT_eX]

[DOI]

CoRR, November, 2025

FeRA: Frequency-Energy Constrained Routing for Effective Diffusion Adaptation Fine-Tuning.

[BibT_eX]

[DOI]

CoRR, November, 2025

VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, November, 2025

AgeBooth: Controllable Facial Aging and Rejuvenation via Diffusion Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

RED: Robust Event-Guided Motion Deblurring with Modality-Specific Disentangled Representation.

[BibT_eX]

[DOI]

CoRR, September, 2025

Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution.

[BibT_eX]

[DOI]

CoRR, August, 2025

A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, August, 2025

Q-Ponder: A Unified Training Pipeline for Reasoning-based Visual Quality Assessment.

[BibT_eX]

[DOI]

CoRR, June, 2025

HyperMotion: DiT-Based Pose-Guided Human Image Animation of Complex Motions.

[BibT_eX]

[DOI]

CoRR, May, 2025

Any-to-Bokeh: One-Step Video Bokeh via Multi-Plane Image Guided Diffusion.

[BibT_eX]

[DOI]

CoRR, May, 2025

MagicTryOn: Harnessing Diffusion Transformer for Garment-Preserving Video Virtual Try-on.

[BibT_eX]

[DOI]

CoRR, May, 2025

Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, May, 2025

M2N2V2: Multi-Modal Unsupervised and Training-free Interactive Segmentation.

[BibT_eX]

[DOI]

CoRR, March, 2025

Towards Training-Free Open-World Segmentation via Image Prompt Foundation Models.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., January, 2025

DepthMaster: Taming Diffusion Models for Monocular Depth Estimation.

[BibT_eX]

[DOI]

CoRR, January, 2025

SE-GUI: Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Photography Perspective Composition: Towards Aesthetic Perspective Recommendation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Learning Differential Pyramid Representation for Tone Mapping.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

DSDNet: Raw Domain Demoiréing via Dual Color-Space Synergy.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Learning Adaptive Lighting via Channel-Aware Guidance.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

A Temporal Modeling Framework for Video Pre-Training on Video Instance Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Multi-Task Dense Predictions via Unleashing the Power of Diffusion.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MIPI 2025 Challenge on Deblurring for Hybrid EVS Camera: Methods and Results.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, ICCV 2025, 2025

MOERL: When Mixture-Of-Experts Meet Reinforcement Learning for Adverse Weather Image Restoration.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

SDMATTE: Grafting Diffusion Models for Interactive Matting.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Proxy-Bridged Game Transformer for Interactive Extreme Motion Prediction.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Boosting Vision State Space Model with Fractal Scanning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Advancing Comprehensive Aesthetic Insight with Multi-Scale Text-Guided Self-Supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

RDNeRF: relative depth guided NeRF for dense free view synthesis.

[BibT_eX]

[DOI]

Vis. Comput., March, 2024

Advancing Comprehensive Aesthetic Insight with Multi-Scale Text-Guided Self-Supervised Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Learning Adaptive Lighting via Channel-Aware Guidance.

[BibT_eX]

[DOI]

CoRR, 2024

Learning Differential Pyramid Representation for Tone Mapping.

[BibT_eX]

[DOI]

CoRR, 2024

CPA: Camera-pose-awareness Diffusion Transformer for Video Generation.

[BibT_eX]

[DOI]

CoRR, 2024

ClearSR: Latent Low-Resolution Image Embeddings Help Diffusion-Based Real-World Super Resolution Models See Clearer.

[BibT_eX]

[DOI]

CoRR, 2024

ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution.

[BibT_eX]

[DOI]

CoRR, 2024

Towards Natural Image Matting in the Wild via Real-Scenario Prior.

[BibT_eX]

[DOI]

CoRR, 2024

Scalable Visual State Space Model with Fractal Scanning.

[BibT_eX]

[DOI]

CoRR, 2024

Empowering Segmentation Ability to Multi-modal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Non-uniform Timestep Sampling: Towards Faster Diffusion Model Training.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Chain of Visual Perception: Harnessing Multimodal Large Language Models for Zero-shot Camouflaged Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Improving Adversarial Energy-Based Model via Diffusion Process.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Beta-Tuned Timestep Diffusion Model.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Revisiting Single Image Reflection Removal in the Wild.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Multi-Task Dense Prediction via Mixture of Low-Rank Experts.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Traffic Scene Parsing Through the TSP6K Dataset.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Deeply Explain CNN Via Hierarchical Decomposition.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., May, 2023

Decoupling Degradation and Content Processing for Adverse Weather Image Restoration.

[BibT_eX]

[DOI]

CoRR, 2023

Generalization and Hallucination of Large Vision-Language Models through a Camouflaged Lens.

[BibT_eX]

[DOI]

CoRR, 2023

Towards Training-free Open-world Segmentation via Image Prompting Foundation Models.

[BibT_eX]

[DOI]

CoRR, 2023

PGformer: Proxy-Bridged Game Transformer for Multi-Person Extremely Interactive Motion Prediction.

[BibT_eX]

[DOI]

CoRR, 2023

Segment Anything is A Good Pseudo-label Generator for Weakly Supervised Semantic Segmentation.

[BibT_eX]

[DOI]

Peng-Tao Jiang

Yuqi Yang

CoRR, 2023

Looking Through the Glass: Neural Surface Reconstruction Against High Specular Reflections.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Online Attention Accumulation for Weakly Supervised Semantic Segmentation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Attention mechanisms in computer vision: A survey.

[BibT_eX]

[DOI]

Comput. Vis. Media, 2022

L2G: A Simple Local-to-Global Knowledge Transfer Framework for Weakly Supervised Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Delving Deep Into Label Smoothing.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

LayerCAM: Exploring Hierarchical Class Activation Maps for Localization.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

Personalized Image Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2019

Integral Object Mining via Online Attention Accumulation.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018

Semantic Edge Detection with Diverse Deep Supervision.

[BibT_eX]

[DOI]

CoRR, 2018

Self-Erasing Network for Integral Object Attention.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

DEL: Deep Embedding Learning for Efficient Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Peng-Tao Jiang

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...