Donghuo Zeng

Orcid: 0000-0002-6425-6270

According to our database, Donghuo Zeng authored at least 40 papers between 2016 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Hierarchical Semantic Correlation-Aware Masked Autoencoder for Unsupervised Audio-Visual Representation Learning.
CoRR, April, 2026

PLATO-JDS: Enhancing Japanese Dialogue Systems Through Topic-Switch Adaptation.
New Gener. Comput., February, 2026

Variance & Greediness: A comparative study of metric-learning losses.
CoRR, January, 2026

Personality-Aware Reinforcement Learning for Persuasive Dialogue with LLM-Driven Simulation.
Proceedings of the Persuasive Technology - 21st International Conference, 2026

Learning Audio-Visual Embeddings with Inferred Latent Interaction Graphs.
Proceedings of the Advances in Information Retrieval, 2026

Dialogue Control and Its Consequences: Grounding, Policy, and User Perception in Persuasive Chatbots.
Proceedings of the Extended Abstracts of the 2026 CHI Conference on Human Factors in Computing Systems, 2026

2025
Comparing Contrastive and Triplet Loss: Variance Analysis and Optimization Behavior.
CoRR, October, 2025

Causal Discovery and Counterfactual Reasoning to Optimize Persuasive Dialogue Policies.
CoRR, March, 2025

Generative Framework for Personalized Persuasion: Inferring Causal, Counterfactual, and Latent Knowledge.
Proceedings of the 33rd ACM Conference on User Modeling, Adaptation and Personalization, 2025

Metric Learning with Progressive Self-Distillation for Audio-Visual Embedding Learning.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Learning Hidden Causal Factors from Psychometrics Data Using Distributional Information.
Proceedings of the 47th Annual Meeting of the Cognitive Science Society, 2025

2024
Top-down Activity Representation Learning for Video Question Answering.
CoRR, 2024

Multi-object event graph representation learning for Video Question Answering.
CoRR, 2024

Counterfactual Reasoning Using Predicted Latent Personality Dimensions for Optimizing Persuasion Outcome.
Proceedings of the Persuasive Technology - 19th International Conference, 2024

Identifying Latent State-Transition Processes for Individualized Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Anchor-aware Deep Metric Learning for Audio-visual Retrieval.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

2023
Learning Explicit and Implicit Dual Common Subspaces for Audio-visual Cross-modal Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Two-Stage Triplet Loss Training with Curriculum Augmentation for Audio-Visual Retrieval.
CoRR, 2023

TV-watching partner robot: Analysis of User's Experience.
CoRR, 2023

Topic-switch adapted Japanese Dialogue System based on PLATO-2.
CoRR, 2023

VideoAdviser: Video Knowledge Distillation for Multimodal Transfer Learning.
IEEE Access, 2023

Triplet Loss with Curriculum Learning for Audio-Visual Retrieval.
Proceedings of the IEEE International Symposium on Multimedia, 2023

Do I Have Your Attention: A Large Scale Engagement Prediction Dataset and Baselines.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

EmotiW 2023: Emotion Recognition in the Wild Challenge.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

2022
Complete Cross-triplet Loss in Label Space for Audio-visual Cross-modal Retrieval.
Proceedings of the IEEE International Symposium on Multimedia, 2022

2021
Learning Explicit and Implicit Latent Common Spaces for Audio-Visual Cross-Modal Retrieval.
CoRR, 2021

TV-watching Companion Robot Supported by Open-domain Chatbot "KACTUS".
Proceedings of the MUM 2021: 20th International Conference on Mobile and Ubiquitous Multimedia, Leuven, Belgium, December 5, 2021

SHECS: A Local Smart Hands-free Elderly Care Support System on Smart AR Glasses with AI Technology.
Proceedings of the IEEE International Symposium on Multimedia, 2021

2020
Deep Alignment Representation Learning for Multimodal Information Retrieval.
PhD thesis, 2020

Deep Triplet Neural Networks with Cluster-CCA for Audio-Visual Cross-Modal Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2020

MTM Dataset for Joint Representation Learning among Sheet Music, Lyrics, and Musical Audio?
CoRR, 2020

Unsupervised Generative Adversarial Alignment Representation for Sheet music, Audio and Lyrics.
Proceedings of the 6th IEEE International Conference on Multimedia Big Data, 2020

2019
Learning Joint Embedding for Cross-Modal Retrieval.
CoRR, 2019

Audio-Visual Embedding for Cross-Modal MusicVideo Retrieval through Supervised Deep CCA.
CoRR, 2019

Personalized Music Recommendation with Triplet Network.
CoRR, 2019

Learning Joint Embedding for Cross-Modal Retrieval.
Proceedings of the 2019 International Conference on Data Mining Workshops, 2019

2018
Audio-Visual Embedding for Cross-Modal Music Video Retrieval through Supervised Deep CCA.
Proceedings of the 2018 IEEE International Symposium on Multimedia, 2018

Deep Learning of Human Perception in Audio Event Classification.
Proceedings of the 2018 IEEE International Symposium on Multimedia, 2018

2017
LSTM-CRF for Drug-Named Entity Recognition.
Entropy, 2017

2016
Enlarging drug dictionary with semi-supervised learning for Drug Entity Recognition.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2016
