Jiedong Zhuang

Orcid: 0000-0003-0551-5911

According to our database, Jiedong Zhuang authored at least 18 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
PHiD: Preserving human identity in pose-guided character animation.
Neurocomputing, 2025

ST3: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments.
IEEE Trans. Image Process., 2024

ST3: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming.
CoRR, 2024

Enhancing Facial Consistency in Conditional Video Generation via Facial Landmark Transformation.
CoRR, 2024

Data-Free Quantization of Vision Transformers Through Perturbation-Aware Image Synthesis.
Proceedings of the PRICAI 2024: Trends in Artificial Intelligence, 2024

Zero-Shot Referring Image Segmentation with Hierarchical Prompts and Frequency Domain Fusion.
Proceedings of the PRICAI 2024: Trends in Artificial Intelligence, 2024

Trans-DONeRF for Transparent Object Rendering with Mixed Depth Prior.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Mitigating Hallucination in Visual-Language Models via Re-balancing Contrastive Decoding.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Perceptual Image Compression with Text-Guided Multi-level Fusion.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

FashionR2R: Texture-preserving Rendered-to-Real Image Translation with Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
TeG-DG: Textually Guided Domain Generalization for Face Anti-Spoofing.
CoRR, 2023

2022
A Transformer-Based Feature Segmentation and Region Alignment Method for UAV-View Geo-Localization.
IEEE Trans. Circuits Syst. Video Technol., 2022

Vision-Based UAV Localization System in Denial Environments.
CoRR, 2022

A Semantic Guidance and Transformer-Based Matching Method for UAVs and Satellite Images for UAV Geo-Localization.
IEEE Access, 2022

2021
A Faster and More Effective Cross-View Matching Method of UAV and Satellite Images for UAV Geolocalization.
Remote. Sens., 2021

