Peng-Tao Jiang

ORCID: 0000-0002-1786-4943

According to our database, Peng-Tao Jiang authored at least 52 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models.
CoRR, August, 2025

SDMatte: Grafting Diffusion Models for Interactive Matting.
CoRR, August, 2025

Q-Ponder: A Unified Training Pipeline for Reasoning-based Visual Quality Assessment.
CoRR, June, 2025

HyperMotion: DiT-Based Pose-Guided Human Image Animation of Complex Motions.
CoRR, May, 2025

Any-to-Bokeh: One-Step Video Bokeh via Multi-Plane Image Guided Diffusion.
CoRR, May, 2025

MagicTryOn: Harnessing Diffusion Transformer for Garment-Preserving Video Virtual Try-on.
CoRR, May, 2025

Photography Perspective Composition: Towards Aesthetic Perspective Recommendation.
CoRR, May, 2025

Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning.
CoRR, May, 2025

DSDNet: Raw Domain Demoiréing via Dual Color-Space Synergy.
CoRR, April, 2025

A Temporal Modeling Framework for Video Pre-Training on Video Instance Segmentation.
CoRR, March, 2025

M2N2V2: Multi-Modal Unsupervised and Training-free Interactive Segmentation.
CoRR, March, 2025

Towards Training-Free Open-World Segmentation via Image Prompt Foundation Models.
Int. J. Comput. Vis., January, 2025

DepthMaster: Taming Diffusion Models for Monocular Depth Estimation.
CoRR, January, 2025

High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Multi-Task Dense Predictions via Unleashing the Power of Diffusion.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Boosting Vision State Space Model with Fractal Scanning.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Advancing Comprehensive Aesthetic Insight with Multi-Scale Text-Guided Self-Supervised Learning.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
RDNeRF: relative depth guided NeRF for dense free view synthesis.
Vis. Comput., March, 2024

Advancing Comprehensive Aesthetic Insight with Multi-Scale Text-Guided Self-Supervised Learning.
CoRR, 2024

Learning Adaptive Lighting via Channel-Aware Guidance.
CoRR, 2024

Learning Differential Pyramid Representation for Tone Mapping.
CoRR, 2024

CPA: Camera-pose-awareness Diffusion Transformer for Video Generation.
CoRR, 2024

ClearSR: Latent Low-Resolution Image Embeddings Help Diffusion-Based Real-World Super Resolution Models See Clearer.
CoRR, 2024

ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution.
CoRR, 2024

Towards Natural Image Matting in the Wild via Real-Scenario Prior.
CoRR, 2024

Scalable Visual State Space Model with Fractal Scanning.
CoRR, 2024

Empowering Segmentation Ability to Multi-modal Large Language Models.
CoRR, 2024

Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Non-uniform Timestep Sampling: Towards Faster Diffusion Model Training.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Chain of Visual Perception: Harnessing Multimodal Large Language Models for Zero-shot Camouflaged Object Detection.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Improving Adversarial Energy-Based Model via Diffusion Process.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Beta-Tuned Timestep Diffusion Model.
Proceedings of the Computer Vision - ECCV 2024, 2024

Revisiting Single Image Reflection Removal in the Wild.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Multi-Task Dense Prediction via Mixture of Low-Rank Experts.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Traffic Scene Parsing Through the TSP6K Dataset.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Deeply Explain CNN Via Hierarchical Decomposition.
Int. J. Comput. Vis., May, 2023

Decoupling Degradation and Content Processing for Adverse Weather Image Restoration.
CoRR, 2023

Generalization and Hallucination of Large Vision-Language Models through a Camouflaged Lens.
CoRR, 2023

Towards Training-free Open-world Segmentation via Image Prompting Foundation Models.
CoRR, 2023

PGformer: Proxy-Bridged Game Transformer for Multi-Person Extremely Interactive Motion Prediction.
CoRR, 2023

Segment Anything is A Good Pseudo-label Generator for Weakly Supervised Semantic Segmentation.
CoRR, 2023

Looking Through the Glass: Neural Surface Reconstruction Against High Specular Reflections.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Online Attention Accumulation for Weakly Supervised Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Attention mechanisms in computer vision: A survey.
Comput. Vis. Media, 2022

L2G: A Simple Local-to-Global Knowledge Transfer Framework for Weakly Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Delving Deep Into Label Smoothing.
IEEE Trans. Image Process., 2021

LayerCAM: Exploring Hierarchical Class Activation Maps for Localization.
IEEE Trans. Image Process., 2021

Personalized Image Semantic Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2019
Integral Object Mining via Online Attention Accumulation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Semantic Edge Detection with Diverse Deep Supervision.
CoRR, 2018

Self-Erasing Network for Integral Object Attention.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

DEL: Deep Embedding Learning for Efficient Image Segmentation.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
