Zhongang Qi

ORCID: 0000-0001-8298-4063

According to our database, Zhongang Qi authored at least 52 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

DropConn: Dropout Connection Based Random GNNs for Molecular Property Prediction.
IEEE Trans. Knowl. Data Eng., February, 2024

SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model.
CoRR, 2024

RecDCL: Dual Contrastive Learning for Recommendation.
CoRR, 2024

SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Task-Aware Dual-Representation Network for Few-Shot Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., October, 2023

PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding.
CoRR, 2023

CustomNet: Zero-shot Object Customization with Variable-Viewpoints in Text-to-Image Diffusion Models.
CoRR, 2023

StyleAdapter: A Single-Pass LoRA-Free Model for Stylized Image Generation.
CoRR, 2023

Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation.
CoRR, 2023

Sticker820K: Empowering Interactive Retrieval with Stickers.
CoRR, 2023

T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models.
CoRR, 2023

Exploiting Contextual Objects and Relations for 3D Visual Grounding.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

VTLayout: A Multi-Modal Approach for Video Text Layout.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Toward Human Perception-Centric Video Thumbnail Generation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Do We Really Need Temporal Convolutions in Action Segmentation?
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Order-Prompted Tag Sequence Generation for Video Tagging.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ERBNet: An Effective Representation Based Network for Unbiased Scene Graph Generation.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

LayoutDiffusion: Controllable Diffusion Model for Layout-to-Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ViLEM: Visual-Language Error Modeling for Image-Text Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Accelerating the Training of Video Super-resolution Models.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Weakly-supervised Action Localization via Hierarchical Mining.
CoRR, 2022

Efficient U-Transformer with Boundary-Aware Loss for Action Segmentation.
CoRR, 2022

CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation.
CoRR, 2022

Convolutional Transformer with Similarity-based Boundary Prediction for Action Segmentation.
Proceedings of the 34th IEEE International Conference on Tools with Artificial Intelligence, 2022

BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
From Heatmaps to Structural Explanations of Image Classifiers.
CoRR, 2021

Stochastic Block-ADMM for Training Deep Networks.
CoRR, 2021

A Generic Object Re-identification System for Short Videos.
CoRR, 2021

Embedding deep networks into visual explanations.
Artif. Intell., 2021

Finding Discriminative Filters for Specific Degradations in Blind Super-Resolution.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Semantic-Guided Relation Propagation Network for Few-shot Action Recognition.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

TransFusion: Multi-Modal Fusion for Video Tag Inference via Translation-based Knowledge Embedding.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Open-Book Video Captioning With Retrieve-Copy-Generate Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Visualizing point cloud classifiers by curvature smoothing.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

Visualizing Deep Networks by Optimizing with Integrated Gradients.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

ScaleNet - Improve CNNs through Recursively Rescaling Objects.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Interactive Naming for Explaining Deep Neural Networks: A Formative Study.
Joint Proceedings of the ACM IUI 2019 Workshops co-located with the 24th ACM Conference on Intelligent User Interfaces (ACM IUI 2019), 2019

PointConv: Deep Convolutional Networks on 3D Point Clouds.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Deep Air Learning: Interpolation, Prediction, and Feature Analysis of Fine-Grained Air Quality.
IEEE Trans. Knowl. Data Eng., 2018

Multi-Task Medical Concept Normalization Using Multi-View Convolutional Neural Network.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Embedding Deep Networks into Visual Explanations.
CoRR, 2017

2016
Bayesian Multi-Task Relationship Learning with Link Structure.
IEEE Trans. Knowl. Data Eng., 2016

2013
Learning with limited and noisy tagging.
Proceedings of the ACM Multimedia Conference, 2013

Characterizing and Comparing User Location Preference in an Urban Mobile Network.
Proceedings of the Trustworthy Computing and Services, 2013

2012
Multi-view learning from imperfect tagging.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Mining noisy tagging from multi-label space.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
Mining partially annotated images.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

