Zhongjiang He

Orcid: 0009-0005-5929-2567

According to our database, Zhongjiang He authored at least 50 papers between 2023 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2025

MR-UIE: Multi-Perspective Reasoning with Reinforcement Learning for Universal Information Extraction.

CoRR, September, 2025

TableZoomer: A Collaborative Agent Framework for Large-scale Table Question Answering.

CoRR, September, 2025

Animal-CLIP: A Dual-Prompt Enhanced Vision-Language Model for Animal Action Recognition.

Int. J. Comput. Vis., August, 2025

Never Compromise to Vulnerabilities: A Comprehensive Survey on AI Governance.

CoRR, August, 2025

GOAT-SLM: A Spoken Language Model with Paralinguistic and Speaker Characteristic Awareness.

CoRR, July, 2025

TELEVAL: A Dynamic Benchmark Designed for Spoken Language Models in Chinese Interactive Scenarios.

CoRR, July, 2025

Technical Report of TeleChat2, TeleChat2.5 and T1.

CoRR, July, 2025

BoSS: Beyond-Semantic Speech.

CoRR, July, 2025

Emotional Support with LLM-based Empathetic Dialogue Generation.

CoRR, July, 2025

TableReasoner: Advancing Table Reasoning Framework with Large Language Models.

CoRR, July, 2025

FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models.

CoRR, July, 2025

Detailed Object Description With Controllable Dimensions.

IEEE Trans. Multim., 2025

Reserve to Adapt: Mining Inter-Class Relations for Open-Set Domain Adaptation.

IEEE Trans. Image Process., 2025

Enhancing math reasoning ability of large language models via computation logic graphs.

Knowl. Based Syst., 2025

RAICL-DSC: Retrieval-Augmented In-Context Learning for Dialogue State Correction.

Knowl. Based Syst., 2025

SelectVision: Adaptive Vision Resolution Selection for Visual Document Understanding.

Proceedings of the Document Analysis and Recognition - ICDAR 2025, 2025

ViCo: A Multitask Video-enhanced and Cognition-preserving Modality Alignment Training Framework.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

UCS-SQL: Uniting Content and Structure for Enhanced Semantic Bridging In Text-to-SQL.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

INT: Establishing Information Transfer for Multilingual Intent Detection and Slot Filling.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Trusted Unified Feature-Neighborhood Dynamics for Multi-View Classification.

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

BoViLA: Bootstrapping Video-Language Alignment via LLM-Based Self-Questioning and Answering.

CoRR, 2024

Trusted Unified Feature-Neighborhood Dynamics for Multi-View Classification.

CoRR, 2024

Disentangle and denoise: Tackling context misalignment for video moment retrieval.

CoRR, 2024

52B to 1T: Lessons Learned via Tele-FLM Series.

CoRR, 2024

Tele-FLM Technical Report.

CoRR, 2024

TeleChat Technical Report.

CoRR, 2024

Scalable Database-Driven KGs can help Text-to-SQL.

Proceedings of the ISWC 2024 Posters, 2024

Mixture-of-Hand-Experts: Repainting the Deformed Hand Images Generated by Diffusion Models.

Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Enhancing Chinese Argument Mining with Large Language Model.

Proceedings of the Natural Language Processing and Chinese Computing, 2024

Animal-Bench: Benchmarking Multimodal Video Models for Animal-centric Video Understanding.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

AutoGraph: Enabling Visual Context via Graph Alignment in Open Domain Multi-Modal Dialogue Generation.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

GOAL: Grounded text-to-image Synthesis with Joint Layout Alignment Tuning.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Towards Robustness and Diversity: Continual Learning in Dialog Generation with Text-Mixup and Batch Nuclear-Norm Maximization.

Proceedings of the International Joint Conference on Neural Networks, 2024

Graph-based Dynamic Domain Selection for Dialogue State Tracking.

Proceedings of the International Joint Conference on Neural Networks, 2024

Improving Pointer Network based Dialogue State Tracking via Dual Hierarchical Selective Augmentation.

Proceedings of the International Joint Conference on Neural Networks, 2024

Towards Generalization beyond Pointwise Learning: A Unified Information-theoretic Perspective.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

ProTA: Probabilistic Token Aggregation for Text-Video Retrieval.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Domain-Slot Aware Contrastive Learning for Improved Dialogue State Tracking.

Proceedings of the IEEE International Conference on Acoustics, 2024

Augmented Self-Mask Attention Transformer for Naturalistic Driving Action Recognition.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Dual Prompt Tuning based Contrastive Learning for Hierarchical Text Classification.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Hierarchical Prompting for Diffusion Classifiers.

Proceedings of the Computer Vision - ACCV 2024, 2024

Class-Aware Contrastive Learning for Fine-Grained Skeleton-Based Action Recognition.

Proceedings of the Computer Vision - ACCV 2024, 2024

Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation.

CoRR, 2023

A Baseline Investigation: Transformer-based Cross-view Baseline for Text-based Person Search.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Mask to Reconstruct: Cooperative Semantics Completion for Video-text Retrieval.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

LLaViLo: Boosting Video Moment Retrieval via Adapter-Based Multimodal Modeling.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Alignment and Generation Adapter for Efficient Video-text Understanding.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Multi-Attention Transformer for Naturalistic Driving Action Recognition.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

An Effective Motorcycle Helmet Object Detection Framework for Intelligent Traffic Safety.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
