Li Yuan
Orcid: 0000-0002-2120-5588
Affiliations:
- Peking University, School of Electronic and Computer Engineering, Beijing, China
- National University of Singapore, Singapore
According to our database, Li Yuan authored at least 155 papers between 2018 and 2025.
Bibliography
2025
IEEE Trans. Pattern Anal. Mach. Intell., September, 2025
CoRR, August, 2025
IEEE Trans. Circuits Syst. Video Technol., July, 2025
IEEE Trans. Cogn. Dev. Syst., June, 2025
CoRR, June, 2025
UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation.
CoRR, June, 2025
Beyond Chemical QA: Evaluating LLM's Chemical Reasoning with Modular Chemical Operations.
CoRR, May, 2025
OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation.
CoRR, May, 2025
CoRR, May, 2025
BioProBench: Comprehensive Dataset and Benchmark in Biological Protocol Understanding and Reasoning.
CoRR, May, 2025
CoRR, April, 2025
How to Detect and Defeat Molecular Mirage: A Metric-Driven Benchmark for Hallucination in LLM-based Molecular Comprehension.
CoRR, April, 2025
CoRR, April, 2025
IEEE Trans. Pattern Anal. Mach. Intell., March, 2025
Decoupled peak property learning for efficient and interpretable electronic circular dichroism spectrum prediction.
Nat. Comput. Sci., March, 2025
NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representations.
CoRR, March, 2025
SwapAnyone: Consistent and Realistic Video Synthesis for Swapping Any Person into Any Video.
CoRR, March, 2025
CoRR, March, 2025
CoRR, March, 2025
Learning Box Regression and Mask Segmentation Under Long-Tailed Distribution with Gradient Transfusing.
Int. J. Comput. Vis., February, 2025
AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger Scene.
CoRR, January, 2025
Sci. China Inf. Sci., 2025
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the 31st International Conference on Computational Linguistics, 2025
Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger Scenes.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
IEEE Trans. Neural Networks Learn. Syst., December, 2024
IEEE Trans. Medical Imaging, December, 2024
Full Transformer Framework for Robust Point Cloud Registration With Deep Information Interaction.
IEEE Trans. Neural Networks Learn. Syst., October, 2024
IEEE Trans. Circuits Syst. Video Technol., June, 2024
Efficient Long-Short Temporal Attention network for unsupervised Video Object Segmentation.
Pattern Recognit., February, 2024
Fully Transformer-Equipped Architecture for end-to-end Referring Video Object Segmentation.
Inf. Process. Manag., January, 2024
World Sci. Annu. Rev. Artif. Intell., 2024
Neural Networks, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
Multi-granularity Score-based Generative Framework Enables Efficient Inverse Design of Complex Organics.
CoRR, 2024
CoRR, 2024
OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model.
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
VLMimic: Vision Language Models are Visual Imitation Learner for Fine-grained Actions.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the IEEE International Conference on Robotics and Automation, 2024
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2024, 2024
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2024, 2024
LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Repaint123: Fast and High-Quality One Image to 3D Generation with Progressive Controllable Repainting.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Towards Better Seach Query Classification with Distribution-Diverse Multi-Expert Knowledge Distillation in JD Ads Search.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023
Truncated attention-aware proposal networks with multi-scale dilation for temporal action detection.
Pattern Recognit., October, 2023
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023
IEEE Trans. Pattern Anal. Mach. Intell., 2023
Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting.
CoRR, 2023
Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models.
CoRR, 2023
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding.
CoRR, 2023
Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs.
CoRR, 2023
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment.
CoRR, 2023
CoRR, 2023
ChatLaw: Open-Source Legal Large Language Model with Integrated External Knowledge Bases.
CoRR, 2023
CoRR, 2023
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Proceedings of the IEEE International Conference on Robotics and Automation, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Image and Graphics - 12th International Conference, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Learning with Fantasy: Semantic-Aware Virtual Contrastive Constraint for Few-Shot Class-Incremental Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision, 2022
Proceedings of the Computer Vision, 2022
2021
Exploring global diverse attention via pairwise temporal relation for video summarization.
Pattern Recognit., 2021
Token Labeling: Training a 85.4% Top-1 Accuracy Vision Transformer with 56M Parameters on ImageNet.
CoRR, 2021
CoRR, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
IEEE Trans. Multim., 2020
CoRR, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Cycle-SUM: Cycle-Consistent Adversarial LSTM Networks for Unsupervised Video Summarization.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018