Zhongjiang He
Orcid: 0009-0000-1835-9271
According to our database, Zhongjiang He authored at least 47 papers between 2023 and 2025.
Bibliography
2025
Animal-CLIP: A Dual-Prompt Enhanced Vision-Language Model for Animal Action Recognition.
Int. J. Comput. Vis., August, 2025
CoRR, August, 2025
GOAT-SLM: A Spoken Language Model with Paralinguistic and Speaker Characteristic Awareness.
CoRR, July, 2025
TELEVAL: A Dynamic Benchmark Designed for Spoken Language Models in Chinese Interactive Scenarios.
CoRR, July, 2025
CoRR, July, 2025
FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models.
CoRR, July, 2025
IEEE Trans. Image Process., 2025
Enhancing math reasoning ability of large language models via computation logic graphs.
Knowl. Based Syst., 2025
Knowl. Based Syst., 2025
ViCo: A Multitask Video-enhanced and Cognition-preserving Modality Alignment Training Framework.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
UCS-SQL: Uniting Content and Structure for Enhanced Semantic Bridging In Text-to-SQL.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
INT: Establishing Information Transfer for Multilingual Intent Detection and Slot Filling.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
BoViLA: Bootstrapping Video-Language Alignment via LLM-Based Self-Questioning and Answering.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Proceedings of the ISWC 2024 Posters, 2024
Mixture-of-Hand-Experts: Repainting the Deformed Hand Images Generated by Diffusion Models.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024
Proceedings of the Natural Language Processing and Chinese Computing, 2024
Animal-Bench: Benchmarking Multimodal Video Models for Animal-centric Video Understanding.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
AutoGraph: Enabling Visual Context via Graph Alignment in Open Domain Multi-Modal Dialogue Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Towards Robustness and Diversity: Continual Learning in Dialog Generation with Text-Mixup and Batch Nuclear-Norm Maximization.
Proceedings of the International Joint Conference on Neural Networks, 2024
Proceedings of the International Joint Conference on Neural Networks, 2024
Improving Pointer Network based Dialogue State Tracking via Dual Hierarchical Selective Augmentation.
Proceedings of the International Joint Conference on Neural Networks, 2024
Towards Generalization beyond Pointwise Learning: A Unified Information-theoretic Perspective.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Augmented Self-Mask Attention Transformer for Naturalistic Driving Action Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the Computer Vision - ACCV 2024, 2024
Proceedings of the Computer Vision - ACCV 2024, 2024
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation.
CoRR, 2023
A Baseline Investigation: Transformer-based Cross-view Baseline for Text-based Person Search.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
An Effective Motorcycle Helmet Object Detection Framework for Intelligent Traffic Safety.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023