Linjie Li
Orcid: 0000-0003-0867-8863
According to our database1,
Linjie Li
authored at least 121 papers
between 2016 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
STITCH: Simultaneous Thinking and Talking with Chunked Reasoning for Spoken Language Models.
CoRR, July, 2025
GLIMPSE: Do Large Vision-Language Models Truly Think With Videos or Just Glimpse at Them?
CoRR, July, 2025
Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers.
CoRR, June, 2025
CoRR, June, 2025
MoTE: Mixture of Task-specific Experts for Pre-Trained ModelBased Class-incremental Learning.
CoRR, June, 2025
ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs.
CoRR, June, 2025
CoRR, June, 2025
Seeing is Not Reasoning: MVPBench for Graph-based Evaluation of Multi-path Visual Physical CoT.
CoRR, May, 2025
Are Unified Vision-Language Models Necessary: Generalization Across Understanding and Generation.
CoRR, May, 2025
Point-RFT: Improving Multimodal Reasoning with Visually Grounded Reinforcement Finetuning.
CoRR, May, 2025
CoRR, May, 2025
CoRR, May, 2025
RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning.
CoRR, April, 2025
SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement.
CoRR, April, 2025
V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models.
CoRR, April, 2025
Beyond Words: Advancing Long-Text Image Generation via Multimodal Autoregressive Models.
CoRR, March, 2025
ImageGen-CoT: Enhancing Text-to-Image In-context Learning with Chain-of-Thought Reasoning.
CoRR, March, 2025
CoRR, February, 2025
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback.
CoRR, January, 2025
CoRR, January, 2025
MoTE: Mixture of task-specific experts for pre-trained model-based Class-incremental learning.
Knowl. Based Syst., 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
CertainlyUncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities.
Dataset, December, 2024
Found. Trends Comput. Graph. Vis., 2024
An Iterative Resampling Deep Decoupling Domain Adaptation method for class-imbalance bearing fault diagnosis under variant working conditions.
Expert Syst. Appl., 2024
Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension.
CoRR, 2024
MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities.
CoRR, 2024
Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness.
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Enhancing Human-to-Robot Skill Transfer: A Framework Integrating Movement and Variable Impedance Based on EMG.
Proceedings of the IEEE International Conference on Industrial Technology, 2024
IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024
Idea2Img: Iterative Self-refinement with GPT-4V for Automatic Image Design and Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation.
CoRR, 2023
CoRR, 2023
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation.
CoRR, 2023
CoRR, 2023
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
Genom. Proteom. Bioinform., August, 2022
Trans. Mach. Learn. Res., 2022
Found. Trends Comput. Graph. Vis., 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the 10th International Workshop on Signal Design and Its Applications in Communications, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
PREVAIL: Pre-trained Variational Adversarial Active Learning for Molecular Property Prediction.
Proceedings of the 8th IEEE International Conference on Cloud Computing and Intelligent Systems, 2022
Proceedings of the Computer Vision - ACCV 2024, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
CoRR, 2021
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021
LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
A Fault Diagnostic Scheme Based on Capsule Network for Rolling Bearing under Different Rotational Speeds.
Sensors, 2020
CoRR, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the AIAM2020: 2nd International Conference on Artificial Intelligence and Advanced Manufacture, 2020
2019
Configuration Design and Simulation of Novel Petal Tooth Nutation Joint Drive for Robot.
Proceedings of the Intelligent Robotics and Applications - 12th International Conference, 2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
2017
Proceedings of the 39th Annual Meeting of the Cognitive Science Society, 2017
2016
Proceedings of the 38th Annual Meeting of the Cognitive Science Society, 2016
Proceedings of the 38th Annual Meeting of the Cognitive Science Society, 2016