Jinheng Xie

Orcid: 0000-0001-5678-4500

According to our database1, Jinheng Xie authored at least 41 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2026
Open-world Weakly-Supervised Object Localization.
Pattern Recognit., 2026

2025
CLIMS++: Cross Language Image Matching with Automatic Context Discovery for Weakly Supervised Semantic Segmentation.
Int. J. Comput. Vis., August, 2025

DisFaceRep: Representation Disentanglement for Co-occurring Facial Components in Weakly Supervised Face Parsing.
CoRR, August, 2025

FineMotion: A Dataset and Benchmark with both Spatial and Temporal Annotation for Fine-grained Motion Generation and Editing.
CoRR, July, 2025

Show-o2: Improved Native Unified Multimodal Models.
CoRR, June, 2025

OpenFACADES: An Open Framework for Architectural Caption and Attribute Data Enrichment via Street View Imagery.
CoRR, April, 2025

Progressive Pseudo Labeling for Multi-Dataset Detection Over Unified Label Space.
IEEE Trans. Multim., 2025

Faster Diffusion Through Temporal Attention Decomposition.
Trans. Mach. Learn. Res., 2025

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

A Simple Data Augmentation for Feature Distribution Skewed Federated Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Anomaly detection via gating highway connection for retinal fundus images.
Pattern Recognit., April, 2024

WMAdapter: Adding WaterMark Control to Latent Diffusion Models.
CoRR, 2024

Learning Long-form Video Prior via Generative Pre-Training.
CoRR, 2024

Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models.
CoRR, 2024

Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation.
CoRR, 2024

Towards Highly Realistic Artistic Style Transfer via Stable Diffusion with Step-aware and Layer-aware Prompt.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Learning Video Context as Interleaved Multimodal Sequences.
Proceedings of the Computer Vision - ECCV 2024, 2024

Tune-an-Ellipse: CLIP Has Potential to Find what you Want.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Weakly Supervised Pedestrian Segmentation for Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., March, 2023

Dynamically Masked Discriminator for Generative Adversarial Networks.
CoRR, 2023

VisorGPT: Learning Visual Prior via Generative Pre-Training.
CoRR, 2023

Open-World Weakly-Supervised Object Localization.
CoRR, 2023

Dynamically Masked Discriminator for GANs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Visual Prior via Generative Pre-Training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

QA-CLIMS: Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

TCEIP: Text Condition Embedded Regression Network for Dental Implant Position Prediction.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

TCSloT: Text Guided 3D Context and Slope Aware Triple Network for Dental Implant Position Prediction.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2023

2022
Decoupled Mixup for Generalized Visual Recognition.
CoRR, 2022

A Benchmark for Weakly Semi-Supervised Abnormality Localization in Chest X-Rays.
CoRR, 2022

Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation.
CoRR, 2022

Cross Language Image Matching for Weakly Supervised Semantic Segmentation.
CoRR, 2022

Point Beyond Class: A Benchmark for Weakly Semi-supervised Abnormality Localization in Chest X-Rays.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Decoupled Mixup for Out-of-Distribution Visual Recognition.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

C<sup>2</sup> AM: Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CLIMS: Cross Language Image Matching for Weakly Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Frequency-driven Imperceptible Adversarial Attack on Semantic Similarity.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Think About Boundary: Fusing Multi-level Boundary Information for Landmark Heatmap Regression.
Proceedings of the International Joint Conference on Neural Networks, 2021

Online Refinement of Low-level Feature Based Activation Map for Weakly Supervised Object Localization.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021


  Loading...