Zhendong Mao

Orcid: 0000-0001-5739-8126

According to our database, Zhendong Mao authored at least 108 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Exploring Visual Relationships via Transformer-based Graphs for Enhanced Image Captioning.
ACM Trans. Multim. Comput. Commun. Appl., May, 2024

Sentiment-Oriented Transformer-Based Variational Autoencoder Network for Live Video Commenting.
ACM Trans. Multim. Comput. Commun. Appl., April, 2024

Enhanced Semantic Similarity Learning Framework for Image-Text Matching.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

LIRE: listwise reward enhancement for preference alignment.
CoRR, 2024

Feature-Adaptive and Data-Scalable In-Context Learning.
CoRR, 2024

Benchmarking and Improving Compositional Generalization of Multi-aspect Controllable Text Generation.
CoRR, 2024

RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization.
CoRR, 2024

IDEATE: Detecting AI-Generated Text Using Internal and External Factual Structures.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Visual-Linguistic Dependency Encoding for Image-Text Retrieval.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Identification of Necessary Semantic Undertakers in the Causal View for Image-Text Matching.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Gradual Residuals Alignment: A Dual-Stream Framework for GAN Inversion and Image Attribute Editing.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Benchmarking Large Language Models on Controllable Generation under Diversified Instructions.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

DreamIdentity: Enhanced Editability for Efficient Face-Identity Preserved Image Generation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

GH-DDM: the generalized hybrid denoising diffusion model for medical image generation.
Multim. Syst., June, 2023

Multi-task hourglass network for online automatic diagnosis of developmental dysplasia of the hip.
World Wide Web (WWW), March, 2023

Unified Adaptive Relevance Distinguishable Attention Network for Image-Text Matching.
IEEE Trans. Multim., 2023

Intra-Class Adaptive Augmentation With Neighbor Correction for Deep Metric Learning.
IEEE Trans. Multim., 2023

On the Calibration of Large Language Models and Alignment.
CoRR, 2023

Random Entity Quantization for Parameter-Efficient Compositional Knowledge Graph Representation.
CoRR, 2023

DreamIdentity: Improved Editability for Efficient Face-identity Preserved Image Generation.
CoRR, 2023

ExpertPrompting: Instructing Large Language Models to be Distinguished Experts.
CoRR, 2023

kNN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference.
CoRR, 2023

Unlocking the Power of Cross-Dimensional Semantic Dependency for Image-Text Matching.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Difference-Aware Iterative Reasoning Network for Key Relation Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

kNN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

SADE: A Self-Adaptive Expert for Multi-Dataset Question Answering.
Proceedings of the IEEE International Conference on Acoustics, 2023

Inductive Relation Prediction from Relational Paths and Context with Hierarchical Transformers.
Proceedings of the IEEE International Conference on Acoustics, 2023

On the Calibration of Large Language Models and Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Air-Decoding: Attribute Distribution Reconstruction for Decoding-Time Controllable Text Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Improving Image Captioning via Predicting Structured Concepts.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Grammatical Error Correction via Mixed-Grained Weighted Training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Random Entity Quantization for Parameter-Efficient Compositional Knowledge Graph Representation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

IAEval: A Comprehensive Evaluation of Instance Attribution on Natural Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

E-CORE: Emotion Correlation Enhanced Empathetic Dialogue Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Crossing the Gap: Domain Generalization for Image Captioning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Semantic Relationship among Instances for Image-Text Matching.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

S2ynRE: Two-stage Self-training with Synthetic data for Low-resource Relation Extraction.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Text Style Transfer with Contrastive Transfer Pattern Mining.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Focus Your Attention: A Focal Attention for Multimodal Learning.
IEEE Trans. Multim., 2022

Semantically Similarity-Wise Dual-Branch Network for Scene Graph Generation.
IEEE Trans. Circuits Syst. Video Technol., 2022

Task-Adaptive Attention for Image Captioning.
IEEE Trans. Circuits Syst. Video Technol., 2022

Joint Local Correlation and Global Contextual Information for Unsupervised 3D Model Retrieval and Classification.
IEEE Trans. Circuits Syst. Video Technol., 2022

Self-Supervised Synthesis Ranking for Deep Metric Learning.
IEEE Trans. Circuits Syst. Video Technol., 2022

Joint Channel Estimation and Active-User Detection for Massive Access in Internet of Things - A Deep Learning Approach.
IEEE Internet Things J., 2022

Channel Estimation for Intelligent Reflecting Surface Assisted Massive MIMO Systems - A Deep Learning Approach.
IEEE Commun. Lett., 2022

EmRel: Joint Representation of Entities and Embedded Relations for Multi-triple Extraction.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Fine-tuning with Multi-modal Entity Prompts for News Image Captioning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Background Layout Generation and Object Knowledge Transfer for Text-to-Image Generation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Weakly Supervised Pediatric Bone Age Assessment Using Ultrasonic Images via Automatic Anatomical RoI Detection.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

UniRel: Unified Representation and Interaction for Joint Relational Triple Extraction.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Improving Chinese Spelling Check by Character Pronunciation Prediction: The Effects of Adaptivity and Granularity.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Negative-Aware Attention Framework for Image-Text Matching.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

GroupDiff: Exploring A Unified Graph Structure and High-order Interactions for Group Recommendation.
Proceedings of the 8th International Conference on Big Data Computing and Communications, 2022

Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
M-GCN: Multi-Branch Graph Convolution Network for 2D Image-based on 3D Model Retrieval.
IEEE Trans. Multim., 2021

Multi-Scale Structure-Aware Network for Weakly Supervised Temporal Action Detection.
IEEE Trans. Image Process., 2021

Review and Arrange: Curriculum Learning for Natural Language Understanding.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Evolution of ICTs-empowered-identification: A general re-ranking method for person re-identification.
Pattern Recognit. Lett., 2021

Object-difference drived graph convolutional networks for visual question answering.
Multim. Tools Appl., 2021

Hierarchical multi-view context modelling for 3D object classification and retrieval.
Inf. Sci., 2021

Mask and Predict: Multi-step Reasoning for Scene Graph Generation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Lesion-Aware Transformers for Diabetic Retinopathy Grading.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Entity Structure Within and Throughout: Modeling Mention Dependencies for Document-Level Relation Extraction.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Image Captioning with Context-Aware Auxiliary Guidance.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Deep Metric Learning with Self-Supervised Ranking.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Misshapen Pelvis Landmark Detection With Local-Global Feature Learning for Diagnosing Developmental Dysplasia of the Hip.
IEEE Trans. Medical Imaging, 2020

Context propagation embedding network for weakly supervised semantic segmentation.
Multim. Tools Appl., 2020

SP-VITON: shape-preserving image-based virtual try-on network.
Multim. Tools Appl., 2020

Compact Position-Aware Attention Network for Image Semantic Segmentation.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

A Feature Generalization Framework for Social Media Popularity Prediction.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Domain-Specific Alignment Network for Multi-Domain Image-Based 3D Object Retrieval.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Learning Rich Attention for Pediatric Bone Age Assessment.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Overcoming Language Priors with Self-supervised Learning for Visual Question Answering.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Graph Structured Network for Image-Text Matching.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Curriculum Learning for Natural Language Understanding.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Double-Bit Quantization and Index Hashing for Nearest Neighbor Search.
IEEE Trans. Multim., 2019

MMJN: Multi-Modal Joint Networks for 3D Shape Recognition.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

A Neighbor-aware Approach for Image-text Matching.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Channel Matrix Sparsity With Imperfect Channel State Information in Cloud Radio Access Networks.
IEEE Trans. Veh. Technol., 2018

Pulmonary Vessel Segmentation via Stage-Wise Convolutional Networks With Orientation-Based Region Growing Optimization.
IEEE Access, 2018

Stacked Fully Convolutional Networks for Pulmonary Vessel Segmentation.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

Post Tuned Hashing: A New Approach to Indexing High-dimensional Data.
Proceedings of the 2018 ACM Multimedia Conference, 2018

On the Design of OFDM-Based Simultaneous Wireless Information and Power Transfer in Fog-Radio Access Networks.
Proceedings of the IEEE/CIC International Conference on Communications in China, 2018

Performance Analysis of Outage and Average Sum Rate of Sparse Code Division Multiple Access in Fog Radio Access Networks.
Proceedings of the IEEE/CIC International Conference on Communications in China, 2018

2017
Knowledge Graph Embedding: A Survey of Approaches and Applications.
IEEE Trans. Knowl. Data Eng., 2017

Uyghur Language Text Detection in Complex Background Images Using Enhanced MSERs.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Double-bit quantization and weighting for nearest neighbor search.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Training Design for Channel Estimation in Uplink Cloud Radio Access Networks.
IEEE Trans. Signal Process., 2016

Recent Advances in Cloud Radio Access Networks: System Architectures, Key Techniques, and Open Issues.
IEEE Commun. Surv. Tutorials, 2016

Joint Design of Iterative Training-Based Channel Estimation and Cluster Formation in Cloud-Radio Access Networks.
IEEE Access, 2016

2015
Low-Complexity Segment Training Channel Estimation in Cloud Radio Access Networks.
Proceedings of the IEEE 82nd Vehicular Technology Conference, 2015

Hierarchical Encoding of Binary Descriptors for Image Matching.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

What is the next step of binary features?
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

2014
Salient region detection for complex background images using integrated features.
Inf. Sci., 2014

2013
COGE: A Novel Binary Feature Descriptor Exploring Anisotropy and Non-uniformity.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

What are the distance metrics for local features?
Proceedings of the ACM Multimedia Conference, 2013

2012
Geometric context-preserving progressive transmission in mobile visual search.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

A method for detecting salient regions using integrated features.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

2009
TRECVID 2009 of MCG-ICT-CAS.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

C3M: A Classification Model for Multivariate Motion Time Series.
Proceedings of the CSIE 2009, 2009 WRI World Congress on Computer Science and Information Engineering, March 31, 2009

An adaptive ensemble classifier for concept drifting stream.
Proceedings of the IEEE Symposium on Computational Intelligence and Data Mining, 2009

