Zhongjiang He

Orcid: 0009-0000-1835-9271

According to our database1, Zhongjiang He authored at least 47 papers between 2023 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Animal-CLIP: A Dual-Prompt Enhanced Vision-Language Model for Animal Action Recognition.
Int. J. Comput. Vis., August, 2025

Never Compromise to Vulnerabilities: A Comprehensive Survey on AI Governance.
CoRR, August, 2025

GOAT-SLM: A Spoken Language Model with Paralinguistic and Speaker Characteristic Awareness.
CoRR, July, 2025

TELEVAL: A Dynamic Benchmark Designed for Spoken Language Models in Chinese Interactive Scenarios.
CoRR, July, 2025

Technical Report of TeleChat2, TeleChat2.5 and T1.
CoRR, July, 2025

BoSS: Beyond-Semantic Speech.
CoRR, July, 2025

Emotional Support with LLM-based Empathetic Dialogue Generation.
CoRR, July, 2025

TableReasoner: Advancing Table Reasoning Framework with Large Language Models.
CoRR, July, 2025

FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models.
CoRR, July, 2025

Reserve to Adapt: Mining Inter-Class Relations for Open-Set Domain Adaptation.
IEEE Trans. Image Process., 2025

Enhancing math reasoning ability of large language models via computation logic graphs.
Knowl. Based Syst., 2025

RAICL-DSC: Retrieval-Augmented In-Context Learning for Dialogue State Correction.
Knowl. Based Syst., 2025

ViCo: A Multitask Video-enhanced and Cognition-preserving Modality Alignment Training Framework.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

UCS-SQL: Uniting Content and Structure for Enhanced Semantic Bridging In Text-to-SQL.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

INT: Establishing Information Transfer for Multilingual Intent Detection and Slot Filling.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Trusted Unified Feature-Neighborhood Dynamics for Multi-View Classification.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Detailed Object Description with Controllable Dimensions.
CoRR, 2024

BoViLA: Bootstrapping Video-Language Alignment via LLM-Based Self-Questioning and Answering.
CoRR, 2024

Trusted Unified Feature-Neighborhood Dynamics for Multi-View Classification.
CoRR, 2024

Disentangle and denoise: Tackling context misalignment for video moment retrieval.
CoRR, 2024

52B to 1T: Lessons Learned via Tele-FLM Series.
CoRR, 2024

Tele-FLM Technical Report.
CoRR, 2024

TeleChat Technical Report.
CoRR, 2024

Scalable Database-Driven KGs can help Text-to-SQL.
Proceedings of the ISWC 2024 Posters, 2024

Mixture-of-Hand-Experts: Repainting the Deformed Hand Images Generated by Diffusion Models.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Enhancing Chinese Argument Mining with Large Language Model.
Proceedings of the Natural Language Processing and Chinese Computing, 2024

Animal-Bench: Benchmarking Multimodal Video Models for Animal-centric Video Understanding.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

AutoGraph: Enabling Visual Context via Graph Alignment in Open Domain Multi-Modal Dialogue Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

GOAL: Grounded text-to-image Synthesis with Joint Layout Alignment Tuning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Towards Robustness and Diversity: Continual Learning in Dialog Generation with Text-Mixup and Batch Nuclear-Norm Maximization.
Proceedings of the International Joint Conference on Neural Networks, 2024

Graph-based Dynamic Domain Selection for Dialogue State Tracking.
Proceedings of the International Joint Conference on Neural Networks, 2024

Improving Pointer Network based Dialogue State Tracking via Dual Hierarchical Selective Augmentation.
Proceedings of the International Joint Conference on Neural Networks, 2024

Towards Generalization beyond Pointwise Learning: A Unified Information-theoretic Perspective.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

ProTA: Probabilistic Token Aggregation for Text-Video Retrieval.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Domain-Slot Aware Contrastive Learning for Improved Dialogue State Tracking.
Proceedings of the IEEE International Conference on Acoustics, 2024

Augmented Self-Mask Attention Transformer for Naturalistic Driving Action Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Dual Prompt Tuning based Contrastive Learning for Hierarchical Text Classification.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Hierarchical Prompting for Diffusion Classifiers.
Proceedings of the Computer Vision - ACCV 2024, 2024

Class-Aware Contrastive Learning for Fine-Grained Skeleton-Based Action Recognition.
Proceedings of the Computer Vision - ACCV 2024, 2024

Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation.
CoRR, 2023

A Baseline Investigation: Transformer-based Cross-view Baseline for Text-based Person Search.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Mask to Reconstruct: Cooperative Semantics Completion for Video-text Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

LLaViLo: Boosting Video Moment Retrieval via Adapter-Based Multimodal Modeling.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Alignment and Generation Adapter for Efficient Video-text Understanding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Multi-Attention Transformer for Naturalistic Driving Action Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

An Effective Motorcycle Helmet Object Detection Framework for Intelligent Traffic Safety.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023


  Loading...