Jian Li

Orcid: 0000-0002-0242-6481

Affiliations:
  • YouTu Lab, Tencent, Shanghai, China
  • Nanjing University of Science and Technology, Nanjing, China


According to our database, Jian Li authored at least 57 papers between 2006 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
GUI Agents with Reinforcement Learning: Toward Digital Inhabitants.
CoRR, April, 2026

Beyond Frame-Wise Tracking: A Trajectory-Based Paradigm for Efficient Point Cloud Tracking.
IEEE Robotics Autom. Lett., March, 2026

AD-Copilot: A Vision-Language Assistant for Industrial Anomaly Detection via Visual In-context Comparison.
CoRR, March, 2026

Improving Search Agent with One Line of Code.
CoRR, March, 2026

SE-Search: Self-Evolving Search Agent via Memory and Dense Reward.
CoRR, March, 2026

AdaMARP: An Adaptive Multi-Agent Interaction Framework for General Immersive Role-Playing.
CoRR, January, 2026

Disco-RAG: Discourse-Aware Retrieval-Augmented Generation.
CoRR, January, 2026

Resource Allocation Based on Topological Rotation Symmetry and Subnet Division for Entanglement Distribution Networks.
IEEE Trans. Netw. Sci. Eng., 2026

Active Dataset Distillation via Dual-Space Informative Matching.
IEEE Trans. Image Process., 2026

Towards fine-grained vision-language alignment for few-shot anomaly detection.
Pattern Recognit., 2026

LLM-Oriented Token-Adaptive Knowledge Distillation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
RoleRMBench & RoleRM: Towards Reward Modeling for Profile-Based Role Play in Dialogue Systems.
CoRR, December, 2025

SMART: Shot-Aware Multimodal Video Moment Retrieval with Audio-Enhanced MLLM.
CoRR, November, 2025

LLM-Oriented Token-Adaptive Knowledge Distillation.
CoRR, October, 2025

Optimized Qubit-Based Synchronization in Quantum Key Distribution.
IEEE Commun. Lett., May, 2025

Swin DiT: Diffusion Transformer using Pseudo Shifted Windows.
CoRR, May, 2025

VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model.
CoRR, May, 2025

Efficient multimodal large language models: a survey.
Vis. Intell., 2025

TAPCNet: Tactile-Assisted Point Cloud Completion Network via Iterative Fusion Strategy.
IET Comput. Vis., 2025

MAP: Parameter-Efficient Tuning for Referring Expression Comprehension via Multi-Modal Adaptive Positional Encoding.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

BoxSeg: Quality-Aware and Peer-Assisted Learning for Box-supervised Instance Segmentation.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Towards Universal Dataset Distillation via Task-Driven Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Unveiling the Ignorance of MLLMs: Seeing Clearly, Answering Incorrectly.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Enhancing the performance of point cloud completion assisted with tactile information.
EURASIP J. Image Video Process., December, 2024

Multi-protocol updating for seamless key negotiation in quantum metropolitan networks.
J. Opt. Commun. Netw., 2024

LLaVA-MR: Large Language-and-Vision Assistant for Video Moment Retrieval.
CoRR, 2024

Spider: Any-to-Many Multimodal LLM.
CoRR, 2024

MMAD: The First-Ever Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection.
CoRR, 2024

VI3DRM: Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis.
CoRR, 2024

Efficient Multimodal Large Language Models: A Survey.
CoRR, 2024

Topological Rotation Symmetry-Based Wavelength Allocation for Entanglement Distribution Networks.
Proceedings of the Optical Fiber Communications Conference and Exhibition, 2024

Fetch and Forge: Efficient Dataset Condensation for Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

2023
Highly efficient twin-field quantum key distribution with neural networks.
Sci. China Inf. Sci., August, 2023

Rethinking Mobile Block for Efficient Neural Models.
CoRR, 2023

PVG: Progressive Vision Graph for Vision Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Rethinking Mobile Block for Efficient Attention-based Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning with Noisy labels via Self-supervised Adversarial Noisy Masking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning from Noisy Labels with Decoupled Meta Label Purifier.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Passive Light Source Monitoring for Sending or Not Sending Twin-Field Quantum Key Distribution.
Entropy, 2022

SCSNet: An Efficient Paradigm for Learning Simultaneously Image Colorization and Super-resolution.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Experimental verification of NP-complete problems via linear optics.
Proceedings of the 13th International Conference on Wireless Communications and Signal Processing, 2021

Analogous to Evolutionary Algorithm: Designing a Unified Sequence Model.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

ASFD: Automatic and Scalable Face Detector.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

LSTC: Boosting Atomic Action Detection with Long-Short-Term Context.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020
ACFD: Asymmetric Cartoon Face Detector.
CoRR, 2020

ASFD: Automatic and Scalable Face Detector.
CoRR, 2020

Learning Hierarchical Graph for Occluded Pedestrian Detection.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Fast Learning of Temporal Action Proposal via Dense Boundary Generator.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Anti-Confusing: Region-Aware Network for Human Pose Estimation.
CoRR, 2019

DSFD: Dual Shot Face Detector.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2017
Parameter optimization in biased decoy-state quantum key distribution with both source errors and statistical fluctuations.
Quantum Inf. Process., 2017

An improved proposal on the practical quantum key distribution with biased basis.
Quantum Inf. Process., 2017

Object detection via feature fusion based single network.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

2008
Optical Communications Evaluation of GMPLS controlled Multi-granularity Automatically Switched Optical Network.
Eur. Trans. Telecommun., 2008

2006
A Simple and Fast Wavelength Reservation Protocol for Dynamic Traffic in Large Scale Wavelength-routed Networks.
Proceedings of the Fifth International Conference on Networking and the International Conference on Systems (ICN / ICONS / MCL 2006), 2006

