Haonan Cheng
Orcid: 0000-0003-3407-4318
According to our database1,
Haonan Cheng authored at least 53 papers
between 2017 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
EnvTriCascade: An Environment-Aware Tri-Stage Cascaded Framework for ESDD2 2026 Challenge.
CoRR, May, 2026
CoRR, April, 2026
Implement Referring Expression Comprehension by Extending Auto-focus Lens to Locked Vision Model.
ACM Trans. Multim. Comput. Commun. Appl., February, 2026
Towards Explicit Acoustic Evidence Perception in Audio LLMs for Speech Deepfake Detection.
CoRR, January, 2026
Interpretable All-Type Audio Deepfake Detection with Audio LLMs via Frequency-Time Reinforcement Learning.
CoRR, January, 2026
Anchor-Based Multimodal Verification: A Dynamic Query Framework for Fake News Forensics in Short Videos.
IEEE Trans. Inf. Forensics Secur., 2026
Inf. Fusion, 2026
Expert Syst. Appl., 2026
Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory Perception.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
CoRR, December, 2025
Signal Image Video Process., November, 2025
CoRR, January, 2025
Vis. Intell., 2025
IEEE Trans. Inf. Forensics Secur., 2025
Pattern Recognit., 2025
Visual primitives as words: Alignment and interaction for compositional zero-shot learning.
Pattern Recognit., 2025
Generalization enhancement strategy based on ensemble learning for open domain image manipulation detection.
J. Vis. Commun. Image Represent., 2025
Inf. Process. Manag., 2025
FG-Midiformer: A Symbolic Music Understanding Model towards Fine-Grained Learning of Multi-Attributes.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
Pop-Diffuseq: Controllable Symbolic Music Multi-Instrument Infilling and Accompaniment Generation with Long-Axis Attention.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025
Proceedings of the IEEE International Conference on Multimedia and Expo, ICME 2025 - Workshops, Nantes, France, June 30, 2025
Look Around Before Locating: Considering Content and Structure Information for Visual Grounding.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025
2024
Multim. Syst., February, 2024
IEEE Trans. Inf. Forensics Secur., 2024
MusicECAN: An Automatic Denoising Network for Music Recordings With Efficient Channel Attention.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Visual-guided scene-aware audio generation method based on hierarchical feature codec and rendering decision.
Displays, 2024
Comput. Vis. Image Underst., 2024
Temporal Variability and Multi-Viewed Self-Supervised Representations to Tackle the ASVspoof5 Deepfake Challenge.
CoRR, 2024
The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio.
CoRR, 2024
EnvFake: An Initial Environmental-Fake Audio Dataset for Scene-Consistency Detection.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024
Generalized Source Tracing: Detecting Novel Audio Deepfake Algorithm with Real Emphasis and Fake Dispersion Strategy.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
An Efficient Temporary Deepfake Location Approach Based Embeddings for Partially Spoofed Audio Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024
Binauralmusic: A Diverse Dataset for Improving Cross-Modal Binaural Audio Generation.
Proceedings of the IEEE International Conference on Acoustics, 2024
DNIT: Enhancing Day-Night Image-to-Image Translation through Fine-Grained Feature Handling (Student Abstract).
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Displays, September, 2023
PQG-A2SA: Performance Quantification Guided Audio-to-Score Alignment for Orchestral Music.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, 2023
Proceedings of the IEEE Conference Virtual Reality and 3D User Interfaces, 2023
RD-FGFS: A Rule-Data Hybrid Framework for Fine-Grained Footstep Sound Synthesis from Visual Guidance.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Learning A Self-Supervised Domain-Invariant Feature Representation for Generalized Audio Deepfake Detection.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
MABC-Net: Multimodal Mixed Attentional Network with Balanced Class for Temporal Forgery Localization.
Proceedings of the Digital Multimedia Communications, 2023
CACEE: Computational Aesthetic Classification of Expressive Effects Based on Emotional Consistency.
Proceedings of the 4th International Workshop on Human-centric Multimedia Analysis, 2023
Proceedings of the Workshop on Deepfake Audio Detection and Analysis co-located with 32th International Joint Conference on Artificial Intelligence (IJCAI 2023), 2023
2022
IEEE Trans. Circuits Syst. Video Technol., 2022
Emotional Acceptance Measure (EAM): An Objective Evaluation Method Towards Information Communication Effect.
Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2022
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022
Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2022
2019
Haptic Force Guided Sound Synthesis in Multisensory Virtual Reality (VR) Simulation for Rigid-Fluid Interaction.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces, 2019
2017
Proceedings of the 2017 IEEE Virtual Reality, 2017