Zhendong Mao
Orcid: 0000-0001-5739-8126Affiliations:
- University of Science and Technology of China, School of Cyberspace Science and Technology, Hefei, China
- Chinese Academy of Sciences, Institute of Computing Technology, Beijing, China (PhD 2014)
According to our database1,
Zhendong Mao
authored at least 152 papers
between 2009 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
CoRR, July, 2025
CoRR, June, 2025
Pro3D-Editor : A Progressive-Views Perspective for Consistent and Precise 3D Editing.
CoRR, June, 2025
MIRROR: Multi-agent Intra- and Inter-Reflection for Optimized Reasoning in Tool Learning.
CoRR, May, 2025
Training LLM-Based Agents with Synthetic Self-Reflected Trajectories and Partial Masking.
CoRR, May, 2025
Leveraging Importance Sampling to Detach Alignment Modules from Large Language Models.
CoRR, May, 2025
CoRR, May, 2025
HDGlyph: A Hierarchical Disentangled Glyph-Based Framework for Long-Tail Text Rendering in Diffusion Models.
CoRR, May, 2025
DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization.
CoRR, May, 2025
Automated Creativity Evaluation for Large Language Models: A Reference-Based Approach.
CoRR, April, 2025
CoRR, April, 2025
CoRR, April, 2025
RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models.
CoRR, March, 2025
Exploiting Pre-Trained Language Models for Black-Box Attack against Knowledge Graph Embeddings.
ACM Trans. Knowl. Discov. Data, January, 2025
Improving Video Summarization by Exploring the Coherence Between Corresponding Captions.
IEEE Trans. Image Process., 2025
Rethinking Pseudo Word Learning in Zero-Shot Composed Image Retrieval: From an Object-Aware Perspective.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025
Proceedings of the MultiMedia Modeling, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
A4A: Adapter for Adapter Transfer via All-for-All Mapping for Cross-Architecture Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
SVLTA: Benchmarking Vision-Language Temporal Alignment via Synthetic Video Situation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
CMI-AIGCX at GenAI Detection Task 2: Leveraging Multilingual Proxy LLMs for Machine-Generated Text Detection in Academic Essays.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
Rationales Are Not Silver Bullets: Measuring the Impact of Rationales on Model Performance and Reliability.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
From Real to Synthetic: Synthesizing Millions of Diversified and Complicated User Instructions with Attributed Grounding.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Alleviating Hallucinations in Large Language Models via Truthfulness-driven Rank-adaptive LoRA.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
M-RangeDetector: Enhancing Generalization in Machine-Generated Text Detection through Multi-Range Attention Masks.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Improve Safety Training of Large Language Models with Safety-Critical Singular Vectors Localization.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Graph Mixture of Experts and Memory-augmented Routers for Multivariate Time Series Anomaly Detection.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
CustomContrast: A Multilevel Contrastive Perspective for Subject-Driven Text-to-Image Customization.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
ACM Trans. Inf. Syst., November, 2024
Exploring Visual Relationships via Transformer-based Graphs for Enhanced Image Captioning.
ACM Trans. Multim. Comput. Commun. Appl., May, 2024
Sentiment-Oriented Transformer-Based Variational Autoencoder Network for Live Video Commenting.
ACM Trans. Multim. Comput. Commun. Appl., April, 2024
IEEE Trans. Circuits Syst. Video Technol., April, 2024
Improving Image-Text Matching With Bidirectional Consistency of Cross-Modal Alignment.
IEEE Trans. Circuits Syst. Video Technol., 2024
IEEE Trans. Circuits Syst. Video Technol., 2024
Fast, Accurate, and Lightweight Memory-Enhanced Embedding Learning Framework for Image-Text Retrieval.
IEEE Trans. Circuits Syst. Video Technol., 2024
Curriculum Learning Driven Domain Adaptation for Low-Resource Machine Reading Comprehension.
IEEE Signal Process. Lett., 2024
CoRR, 2024
USTC-BUPT at SemEval-2024 Task 8: Enhancing Machine-Generated Text Detection via Domain Adversarial Neural Networks and LLM Embeddings.
Proceedings of the 18th International Workshop on Semantic Evaluation, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
Improving Radiology Report Generation with D<sup>2</sup>-Net: When Diffusion Meets Discriminator.
Proceedings of the IEEE International Conference on Acoustics, 2024
FlipGuard: Defending Preference Alignment against Update Regression with Constrained Optimization.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Benchmarking and Improving Compositional Generalization of Multi-aspect Controllable Text Generation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Knowledge Context Modeling with Pre-trained Language Models for Contrastive Knowledge Graph Completion.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Chain-of-Question: A Progressive Question Decomposition Approach for Complex Knowledge Base Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
RESEMO: A Benchmark Chinese Dataset for Studying Responsive Emotion from Social Media Content.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Identification of Necessary Semantic Undertakers in the Causal View for Image-Text Matching.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
Gradual Residuals Alignment: A Dual-Stream Framework for GAN Inversion and Image Attribute Editing.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
Benchmarking Large Language Models on Controllable Generation under Diversified Instructions.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
DreamIdentity: Enhanced Editability for Efficient Face-Identity Preserved Image Generation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023
GH-DDM: the generalized hybrid denoising diffusion model for medical image generation.
Multim. Syst., June, 2023
Multi-task hourglass network for online automatic diagnosis of developmental dysplasia of the hip.
World Wide Web (WWW), March, 2023
Unified Adaptive Relevance Distinguishable Attention Network for Image-Text Matching.
IEEE Trans. Multim., 2023
IEEE Trans. Multim., 2023
DreamIdentity: Improved Editability for Efficient Face-identity Preserved Image Generation.
CoRR, 2023
CoRR, 2023
kNN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference.
CoRR, 2023
Unlocking the Power of Cross-Dimensional Semantic Dependency for Image-Text Matching.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023
$k$NN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Inductive Relation Prediction from Relational Paths and Context with Hierarchical Transformers.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the Artificial Neural Networks and Machine Learning, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Air-Decoding: Attribute Distribution Reconstruction for Decoding-Time Controllable Text Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Random Entity Quantization for Parameter-Efficient Compositional Knowledge Graph Representation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
IAEval: A Comprehensive Evaluation of Instance Attribution on Natural Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
S2ynRE: Two-stage Self-training with Synthetic data for Low-resource Relation Extraction.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
IEEE Trans. Multim., 2022
IEEE Trans. Circuits Syst. Video Technol., 2022
IEEE Trans. Circuits Syst. Video Technol., 2022
Joint Local Correlation and Global Contextual Information for Unsupervised 3D Model Retrieval and Classification.
IEEE Trans. Circuits Syst. Video Technol., 2022
IEEE Trans. Circuits Syst. Video Technol., 2022
EmRel: Joint Representation of Entities and Embedded Relations for Multi-triple Extraction.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Background Layout Generation and Object Knowledge Transfer for Text-to-Image Generation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Weakly Supervised Pediatric Bone Age Assessment Using Ultrasonic Images via Automatic Anatomical RoI Detection.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
UniRel: Unified Representation and Interaction for Joint Relational Triple Extraction.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Improving Chinese Spelling Check by Character Pronunciation Prediction: The Effects of Adaptivity and Granularity.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
GroupDiff: Exploring A Unified Graph Structure and High-order Interactions for Group Recommendation.
Proceedings of the 8th International Conference on Big Data Computing and Communications, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
M-GCN: Multi-Branch Graph Convolution Network for 2D Image-based on 3D Model Retrieval.
IEEE Trans. Multim., 2021
IEEE Trans. Image Process., 2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Evolution of ICTs-empowered-identification: A general re-ranking method for person re-identification.
Pattern Recognit. Lett., 2021
Multim. Tools Appl., 2021
Hierarchical multi-view context modelling for 3D object classification and retrieval.
Inf. Sci., 2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Entity Structure Within and Throughout: Modeling Mention Dependencies for Document-Level Relation Extraction.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
Misshapen Pelvis Landmark Detection With Local-Global Feature Learning for Diagnosing Developmental Dysplasia of the Hip.
IEEE Trans. Medical Imaging, 2020
Multim. Tools Appl., 2020
Multim. Tools Appl., 2020
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020
Overcoming Language Priors with Self-supervised Learning for Visual Question Answering.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
2019
IEEE Trans. Multim., 2019
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching.
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
Pulmonary Vessel Segmentation via Stage-Wise Convolutional Networks With Orientation-Based Region Growing Optimization.
IEEE Access, 2018
Proceedings of the IEEE Visual Communications and Image Processing, 2018
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
2017
IEEE Trans. Knowl. Data Eng., 2017
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
2015
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015
2014
Inf. Sci., 2014
2013
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013
Proceedings of the ACM Multimedia Conference, 2013
2012
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
2009
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009
Proceedings of the CSIE 2009, 2009 WRI World Congress on Computer Science and Information Engineering, March 31, 2009
Proceedings of the IEEE Symposium on Computational Intelligence and Data Mining, 2009