Ji Qi

Orcid: 0009-0002-8829-309X

Affiliations:
  • Tsinghua University, Beijing, China


According to our database1, Ji Qi authored at least 28 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning.
CoRR, July, 2025

TextVidBench: A Benchmark for Long Video Scene Text Understanding.
CoRR, June, 2025

Hard Negative Contrastive Learning for Fine-Grained Geometric Understanding in Large Multimodal Models.
CoRR, May, 2025

An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes.
CoRR, April, 2025

ExpLLM: Towards Chain of Thought for Facial Expression Recognition.
IEEE Trans. Multim., 2025

CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
CogVLM2: Visual Language Models for Image and Video Understanding.
CoRR, 2024

LVBench: An Extreme Long Video Understanding Benchmark.
CoRR, 2024

CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations.
CoRR, 2024

CogVLM: Visual Expert for Pretrained Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024


MM-MATH: Advancing Multimodal Math Evaluation with Process Evaluation and Fine-grained Classification.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

LongAlign: A Recipe for Long Context Alignment of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
CogVLM: Visual Expert for Pretrained Language Models.
CoRR, 2023

Mastering the Task of Open Information Extraction with Large Language Models and Consistent Reasoning Environment.
CoRR, 2023

BiLL-VTG: Bridging Large Language Models and Lightweight Visual Tools for Video-based Texts Generation.
CoRR, 2023

Preserving Knowledge Invariance: Rethinking Robustness Evaluation of Open Information Extraction.
CoRR, 2023

GOAL: A Challenging Knowledge-grounded Video Captioning Benchmark for Real-time Soccer Commentary Generation.
CoRR, 2023

Preserving Knowledge Invariance: Rethinking Robustness Evaluation of Open Information Extraction.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

GOAL: A Challenging Knowledge-grounded Video Captioning Benchmark for Real-time Soccer Commentary Generation.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

2022
ConstGCN: Constrained Transmission-based Graph Convolutional Networks for Document-level Relation Extraction.
CoRR, 2022

Syntactically Robust Training on Partially-Observed Data for Open Information Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

ParaMac: A General Unsupervised Paraphrase Generation Framework Leveraging Semantic Constraints and Diversifying Mechanisms.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Deep Gaussian Mixture Model on Multiple Interpretable Features of Fetal Heart Rate for Pregnancy Wellness.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2021

DiaKG: An Annotated Diabetes Dataset for Medical Knowledge Graph Construction.
Proceedings of the Knowledge Graph and Semantic Computing: Knowledge Graph Empowers New Infrastructure Construction, 2021

2019
Using GAN to Generate Sport News from Live Game Stats.
Proceedings of the Cognitive Computing - ICCC 2019, 2019

2018
Building Corpus with Emoticons for Sentiment Analysis.
Proceedings of the Natural Language Processing and Chinese Computing, 2018

A Self-Attentive Model with Gate Mechanism for Spoken Language Understanding.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018


  Loading...