Zhendong Mao

Orcid: 0000-0001-5739-8126

Affiliations:

University of Science and Technology of China, School of Cyberspace Science and Technology, Hefei, China
Chinese Academy of Sciences, Institute of Computing Technology, Beijing, China (PhD 2014)

According to our database¹, Zhendong Mao authored at least 159 papers between 2009 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

MoGA: Mixture-of-Groups Attention for End-to-End Long Video Generation.

[BibT_eX]

[DOI]

CoRR, October, 2025

Matryoshka Learning With Metric Transfer for Image-Text Matching.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., September, 2025

Fully Semantic Gap Recovery for End-to-End Image Captioning.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., September, 2025

Mitigating Biases in Language Models via Bias Unlearning.

[BibT_eX]

[DOI]

CoRR, September, 2025

MCP-AgentBench: Evaluating Real-World Language Agent Performance with MCP-Mediated Tools.

[BibT_eX]

[DOI]

CoRR, September, 2025

Video-LevelGauge: Investigating Contextual Positional Bias in Large Video Language Models.

[BibT_eX]

[DOI]

CoRR, August, 2025

LongAnimation: Long Animation Generation with Dynamic Global-Local Memory.

[BibT_eX]

[DOI]

CoRR, July, 2025

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents.

[BibT_eX]

[DOI]

CoRR, June, 2025

Pro3D-Editor : A Progressive-Views Perspective for Consistent and Precise 3D Editing.

[BibT_eX]

[DOI]

CoRR, June, 2025

Training LLM-Based Agents with Synthetic Self-Reflected Trajectories and Partial Masking.

[BibT_eX]

[DOI]

CoRR, May, 2025

Leveraging Importance Sampling to Detach Alignment Modules from Large Language Models.

[BibT_eX]

[DOI]

CoRR, May, 2025

CL-RAG: Bridging the Gap in Retrieval-Augmented Generation with Curriculum Learning.

[BibT_eX]

[DOI]

CoRR, May, 2025

HDGlyph: A Hierarchical Disentangled Glyph-Based Framework for Long-Tail Text Rendering in Diffusion Models.

[BibT_eX]

[DOI]

CoRR, May, 2025

DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization.

[BibT_eX]

[DOI]

CoRR, May, 2025

Automated Creativity Evaluation for Large Language Models: A Reference-Based Approach.

[BibT_eX]

[DOI]

CoRR, April, 2025

D<sup>2</sup>iT: Dynamic Diffusion Transformer for Accurate Image Generation.

[BibT_eX]

[DOI]

CoRR, April, 2025

Leveraging Robust Optimization for LLM Alignment under Distribution Shifts.

[BibT_eX]

[DOI]

CoRR, April, 2025

RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models.

[BibT_eX]

[DOI]

CoRR, March, 2025

Exploiting Pre-Trained Language Models for Black-Box Attack against Knowledge Graph Embeddings.

[BibT_eX]

[DOI]

ACM Trans. Knowl. Discov. Data, January, 2025

Improving Video Summarization by Exploring the Coherence Between Corresponding Captions.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2025

Rethinking Pseudo Word Learning in Zero-Shot Composed Image Retrieval: From an Object-Aware Perspective.

[BibT_eX]

[DOI]

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

Skin-Adapter: Fine-Grained Skin-Color Preservation for Text-to-Image Generation.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling, 2025

PromptMetric: Prompt Recipe as an Automatic Metric for Evaluating Open-domain Question Answering Systems.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2025

MIRROR: Multi-agent Intra- and Inter-Reflection for Optimized Reasoning in Tool Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

On-the-fly Preference Alignment via Principle-Guided Decoding.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Multi-Prototype Grouping for Continual Learning in Visual Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

DETCP: Self-Detoxifying Language Models With Contrastive Pairs.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

A4A: Adapter for Adapter Transfer via All-for-All Mapping for Cross-Architecture Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Dragin3D: Image Editing by Dragging in 3D Space.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

FeedEdit: Text-Based Image Editing with Dynamic Feedback Regulation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SVLTA: Benchmarking Vision-Language Temporal Alignment via Synthetic Video Situation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

CMI-AIGCX at GenAI Detection Task 2: Leveraging Multilingual Proxy LLMs for Machine-Generated Text Detection in Academic Essays.

[BibT_eX]

[DOI]

Proceedings of the 31st International Conference on Computational Linguistics, 2025

Rationales Are Not Silver Bullets: Measuring the Impact of Rationales on Model Performance and Reliability.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

From Real to Synthetic: Synthesizing Millions of Diversified and Complicated User Instructions with Attributed Grounding.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Alleviating Hallucinations in Large Language Models via Truthfulness-driven Rank-adaptive LoRA.

[BibT_eX]

[DOI]

Jiahao Li

Zhendong Mao

Quan Wang

Proceedings of the Findings of the Association for Computational Linguistics, 2025

M-RangeDetector: Enhancing Generalization in Machine-Generated Text Detection through Multi-Range Attention Masks.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Fine-grained Knowledge Enhancement for Retrieval-Augmented Generation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Improve Safety Training of Large Language Models with Safety-Critical Singular Vectors Localization.

[BibT_eX]

[DOI]

Peijian Gu

Quan Wang

Zhendong Mao

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Graph Mixture of Experts and Memory-augmented Routers for Multivariate Time Series Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

CustomContrast: A Multilevel Contrastive Perspective for Subject-Driven Text-to-Image Customization.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

ELDER: Enhancing Lifelong Model Editing with Mixture-of-LoRA.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Document-level Relation Extraction with Progressive Self-distillation.

[BibT_eX]

[DOI]

ACM Trans. Inf. Syst., November, 2024

Exploring Visual Relationships via Transformer-based Graphs for Enhanced Image Captioning.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., May, 2024

Sentiment-Oriented Transformer-Based Variational Autoencoder Network for Live Video Commenting.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., April, 2024

Enhanced Semantic Similarity Learning Framework for Image-Text Matching.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., April, 2024

Improving Image-Text Matching With Bidirectional Consistency of Cross-Modal Alignment.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2024

Cascade Semantic Prompt Alignment Network for Image Captioning.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2024

Fast, Accurate, and Lightweight Memory-Enhanced Embedding Learning Framework for Image-Text Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2024

Curriculum Learning Driven Domain Adaptation for Low-Resource Machine Reading Comprehension.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2024

Enhance Lifelong Model Editing with Continuous Data-Adapter Association.

[BibT_eX]

[DOI]

CoRR, 2024

RealCustom++: Representing Images as Real-Word for Real-Time Customization.

[BibT_eX]

[DOI]

CoRR, 2024

USTC-BUPT at SemEval-2024 Task 8: Enhancing Machine-Generated Text Detection via Domain Adversarial Neural Networks and LLM Embeddings.

[BibT_eX]

[DOI]

Proceedings of the 18th International Workshop on Semantic Evaluation, 2024

Homology Consistency Constrained Efficient Tuning for Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Dual-path Collaborative Generation Network for Emotional Video Captioning.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Neighborhood-Adaptive Context Enhancement Learning For Scene Graph Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Improving Radiology Report Generation with D<sup>2</sup>-Net: When Diffusion Meets Discriminator.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

FlipGuard: Defending Preference Alignment against Update Regression with Constrained Optimization.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

KNN-Instruct: Automatic Instruction Construction with K Nearest Neighbor Deduction.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Linguistic-Aware Patch Slimming Framework for Fine-Grained Cross-Modal Alignment.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

IDEATE: Detecting AI-Generated Text Using Internal and External Factual Structures.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Visual-Linguistic Dependency Encoding for Image-Text Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

LIRE: listwise reward enhancement for preference alignment.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Benchmarking and Improving Compositional Generalization of Multi-aspect Controllable Text Generation.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Knowledge Context Modeling with Pre-trained Language Models for Contrastive Knowledge Graph Completion.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Chain-of-Question: A Progressive Question Decomposition Approach for Complex Knowledge Base Question Answering.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Feature-Adaptive and Data-Scalable In-Context Learning.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

RESEMO: A Benchmark Chinese Dataset for Studying Responsive Emotion from Social Media Content.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Disentangled Learning with Synthetic Parallel Data for Text Style Transfer.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Identification of Necessary Semantic Undertakers in the Causal View for Image-Text Matching.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Gradual Residuals Alignment: A Dual-Stream Framework for GAN Inversion and Image Attribute Editing.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Benchmarking Large Language Models on Controllable Generation under Diversified Instructions.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

DreamIdentity: Enhanced Editability for Efficient Face-Identity Preserved Image Generation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

GH-DDM: the generalized hybrid denoising diffusion model for medical image generation.

[BibT_eX]

[DOI]

Multim. Syst., June, 2023

Multi-task hourglass network for online automatic diagnosis of developmental dysplasia of the hip.

[BibT_eX]

[DOI]

World Wide Web (WWW), March, 2023

Unified Adaptive Relevance Distinguishable Attention Network for Image-Text Matching.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Intra-Class Adaptive Augmentation With Neighbor Correction for Deep Metric Learning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

DreamIdentity: Improved Editability for Efficient Face-identity Preserved Image Generation.

[BibT_eX]

[DOI]

CoRR, 2023

ExpertPrompting: Instructing Large Language Models to be Distinguished Experts.

[BibT_eX]

[DOI]

CoRR, 2023

kNN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference.

[BibT_eX]

[DOI]

CoRR, 2023

Unlocking the Power of Cross-Dimensional Semantic Dependency for Image-Text Matching.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Difference-Aware Iterative Reasoning Network for Key Relation Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

$k$NN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

SADE: A Self-Adaptive Expert for Multi-Dataset Question Answering.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Inductive Relation Prediction from Relational Paths and Context with Hierarchical Transformers.

[BibT_eX]

[DOI]

Jiaang Li

Quan Wang

Zhendong Mao

Proceedings of the IEEE International Conference on Acoustics, 2023

Contour-Augmented Concept Prediction Network for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the Artificial Neural Networks and Machine Learning, 2023

On the Calibration of Large Language Models and Alignment.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Air-Decoding: Attribute Distribution Reconstruction for Decoding-Time Controllable Text Generation.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Improving Image Captioning via Predicting Structured Concepts.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Grammatical Error Correction via Mixed-Grained Weighted Training.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Random Entity Quantization for Parameter-Efficient Compositional Knowledge Graph Representation.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

IAEval: A Comprehensive Evaluation of Instance Attribution on Natural Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

E-CORE: Emotion Correlation Enhanced Empathetic Dialogue Generation.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Crossing the Gap: Domain Generalization for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Semantic Relationship among Instances for Image-Text Matching.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

S2ynRE: Two-stage Self-training with Synthetic data for Low-resource Relation Extraction.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Text Style Transfer with Contrastive Transfer Pattern Mining.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Focus Your Attention: A Focal Attention for Multimodal Learning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2022

Semantically Similarity-Wise Dual-Branch Network for Scene Graph Generation.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2022

Task-Adaptive Attention for Image Captioning.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2022

Joint Local Correlation and Global Contextual Information for Unsupervised 3D Model Retrieval and Classification.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2022

Self-Supervised Synthesis Ranking for Deep Metric Learning.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2022

EmRel: Joint Representation of Entities and Embedded Relations for Multi-triple Extraction.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Fine-tuning with Multi-modal Entity Prompts for News Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Background Layout Generation and Object Knowledge Transfer for Text-to-Image Generation.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Weakly Supervised Pediatric Bone Age Assessment Using Ultrasonic Images via Automatic Anatomical RoI Detection.

[BibT_eX]

[DOI]

Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

UniRel: Unified Representation and Interaction for Joint Relational Triple Extraction.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Improving Chinese Spelling Check by Character Pronunciation Prediction: The Effects of Adaptivity and Granularity.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Negative-Aware Attention Framework for Image-Text Matching.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

GroupDiff: Exploring A Unified Graph Structure and High-order Interactions for Group Recommendation.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Big Data Computing and Communications, 2022

Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

M-GCN: Multi-Branch Graph Convolution Network for 2D Image-based on 3D Model Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2021

Multi-Scale Structure-Aware Network for Weakly Supervised Temporal Action Detection.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

Review and Arrange: Curriculum Learning for Natural Language Understanding.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Evolution of ICTs-empowered-identification: A general re-ranking method for person re-identification.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2021

Object-difference drived graph convolutional networks for visual question answering.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2021

Hierarchical multi-view context modelling for 3D object classification and retrieval.

[BibT_eX]

[DOI]

Inf. Sci., 2021

Mask and Predict: Multi-step Reasoning for Scene Graph Generation.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Lesion-Aware Transformers for Diabetic Retinopathy Grading.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Entity Structure Within and Throughout: Modeling Mention Dependencies for Document-Level Relation Extraction.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Image Captioning with Context-Aware Auxiliary Guidance.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Deep Metric Learning with Self-Supervised Ranking.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Misshapen Pelvis Landmark Detection With Local-Global Feature Learning for Diagnosing Developmental Dysplasia of the Hip.

[BibT_eX]

[DOI]

IEEE Trans. Medical Imaging, 2020

Context propagation embedding network for weakly supervised semantic segmentation.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2020

SP-VITON: shape-preserving image-based virtual try-on network.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2020

Compact Position-Aware Attention Network for Image Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

A Feature Generalization Framework for Social Media Popularity Prediction.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Domain-Specific Alignment Network for Multi-Domain Image-Based 3D Object Retrieval.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Learning Rich Attention for Pediatric Bone Age Assessment.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Overcoming Language Priors with Self-supervised Learning for Visual Question Answering.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Graph Structured Network for Image-Text Matching.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Curriculum Learning for Natural Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Double-Bit Quantization and Index Hashing for Nearest Neighbor Search.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2019

MMJN: Multi-Modal Joint Networks for 3D Shape Recognition.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

A Neighbor-aware Approach for Image-text Matching.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Pulmonary Vessel Segmentation via Stage-Wise Convolutional Networks With Orientation-Based Region Growing Optimization.

[BibT_eX]

[DOI]

IEEE Access, 2018

Stacked Fully Convolutional Networks for Pulmonary Vessel Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Visual Communications and Image Processing, 2018

Post Tuned Hashing: A New Approach to Indexing High-dimensional Data.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

2017

Knowledge Graph Embedding: A Survey of Approaches and Applications.

[BibT_eX]

[DOI]

IEEE Trans. Knowl. Data Eng., 2017

Uyghur Language Text Detection in Complex Background Images Using Enhanced MSERs.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Double-bit quantization and weighting for nearest neighbor search.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2015

Hierarchical Encoding of Binary Descriptors for Image Matching.

[BibT_eX]

[DOI]

Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

What is the next step of binary features?

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

2014

Salient region detection for complex background images using integrated features.

[BibT_eX]

[DOI]

Inf. Sci., 2014

2013

COGE: A Novel Binary Feature Descriptor Exploring Anisotropy and Non-uniformity.

[BibT_eX]

[DOI]

Zhendong Mao

Yongdong Zhang

Qi Tian

Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

What are the distance metrics for local features?

[BibT_eX]

[DOI]

Zhendong Mao

Yongdong Zhang

Qi Tian

Proceedings of the ACM Multimedia Conference, 2013

2012

Geometric context-preserving progressive transmission in mobile visual search.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

A method for detecting salient regions using integrated features.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

2009

TRECVID 2009 of MCG-ICT-CAS.

[BibT_eX]

[DOI]

Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

C3M: A Classification Model for Multivariate Motion Time Series.

[BibT_eX]

[DOI]

Proceedings of the CSIE 2009, 2009 WRI World Congress on Computer Science and Information Engineering, March 31, 2009

An adaptive ensemble classifier for concept drifting stream.

[BibT_eX]

[DOI]

Proceedings of the IEEE Symposium on Computational Intelligence and Data Mining, 2009

Zhendong Mao

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...