Yanwei Li

This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.

Bibliography

2026
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond.
CoRR, April, 2026

Mini-Gemini: Mining the Potential of Multi-Modality Vision Language Models.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2026

Block-Wise Random Diffusion Encryption Using 4-D Dynamic Feedback Memristive Chaotic System.
IEEE Trans. Consumer Electron., February, 2026

One Loss to Rule Them All: Marked Time-to-Event for Structured EHR Foundation Models.
CoRR, February, 2026

2025
Visual Reasoning Tracer: Object-Level Grounded Reasoning Benchmark.
CoRR, December, 2025

Visual Spatial Tuning.
CoRR, November, 2025

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs.
CoRR, October, 2025

A Novel Secure Key Stream Generator Based on Chaotic Multistate Cellular Automata.
IEEE Internet Things J., September, 2025

How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective.
CoRR, September, 2025

Aligning Effective Tokens with Video Anomaly in Large Language Models.
CoRR, August, 2025

DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World.
CoRR, June, 2025

Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models.
CoRR, May, 2025

FoMoH: A clinically meaningful foundation model evaluation for structured electronic health records.
CoRR, May, 2025

Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding.
CoRR, April, 2025

LLaVA-OneVision: Easy Visual Task Transfer.
Trans. Mach. Learn. Res., 2025

Optimization and Validation of Wafer Surface Defect Detection Algorithm Based on RT-DETR.
IEEE Access, 2025

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

LYRA: An Efficient and Speech-Centric Framework for Omni-Cognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Aligning Effective Tokens with Video Anomaly in Large Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Dynamic catastrophe analysis of deepwater mooring platform/riser/wellhead coupled system under ISW.
Reliab. Eng. Syst. Saf., 2024

Virtual element discretization method to optimal control problem governed by Stokes equations with pointwise control constraint on arbitrary polygonal meshes.
J. Comput. Appl. Math., 2024

Beyond Pixels: Text Enhances Generalization in Real-World Image Restoration.
CoRR, 2024

LLaVA-OneVision: Easy Visual Task Transfer.
CoRR, 2024

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis.
CoRR, 2024

RL-GPT: Integrating Reinforcement Learning and Code-as-policy.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Research Trends of Human Machine Interaction Studies in Mechanical Equipment Design: A Bibliometric Review.
Proceedings of the HCI International 2024 Posters, 2024

Internal Versus External Forces: Which Dominates in Driving the Use of Open Government Data.
Proceedings of the Electronic Government - 23rd IFIP WG 8.5 International Conference, 2024

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Unveiling the Mechanisms Behind Open Government Data Use: The Interplay of Internal Resources and Institutional Pressures.
Proceedings of the 25th Annual International Conference on Digital Government Research, 2024

LISA: Reasoning Segmentation via Large Language Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Adaptive Virtual Element Method for Optimal Control Problem Governed by Stokes Equations.
J. Sci. Comput., December, 2023

Fully Convolutional Networks for Panoptic Segmentation With Point-Based Supervision.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

Investigating the Validity and Reliability of a Comprehensive Essay Evaluation Model of Integrating Manual Feedback and Intelligent Assistance.
Int. J. Emerg. Technol. Learn., February, 2023

Scale-Aware Automatic Augmentations for Object Detection With Dynamic Training.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Democratizing Pathological Image Segmentation with Lay Annotators via Molecular-empowered Learning.
CoRR, 2023

GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Democratizing Pathological Image Segmentation with Lay Annotators via Molecular-Empowered Learning.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Analysis of the Seismic Damage Evolution Process in Surrounding Rock of Underground Caverns at a Hydropower Station in Tibet.
Proceedings of the International Conference on Mathematics and Machine Learning, 2023

End-to-end 3D Tracking with Decoupled Queries.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
An Evaluation Model of Online Autonomous English Learning Efficiency Using an Artificial Neural Network.
Int. J. Emerg. Technol. Learn., April, 2022

A Self-Driven Microfluidic Chip for Ricin and Abrin Detection.
Sensors, 2022

Unifying Voxel-based Representation with Transformer for 3D Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

The Multiple Effects of Digital Archaeology in the Future Space.
Proceedings of the Cross-Cultural Design. Applications in Learning, Arts, Cultural Heritage, Creative Industries, and Virtual Reality, 2022

Achieving a Blockchain-based Privacy-preserving Quality-aware Knowledge Marketplace in Crowdsensing.
Proceedings of the 20th IEEE International Conference on Embedded and Ubiquitous Computing, 2022

Attention-Aware Learning for Hyperparameter Prediction in Image Processing Pipelines.
Proceedings of the Computer Vision - ECCV 2022, 2022

Diversified Dynamic Routing for Vision Tasks.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Voxel Field Fusion for 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Focal Sparse Convolutional Networks for 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Evaluating Phenotype Classification Using Synthesized Online Content.
Proceedings of the AMIA 2022, 2022

Improving Biomedical Informatics Graduate Student Recruitment through a Structured Undergraduate Summer Internship Team-Based Experience.
Proceedings of the AMIA 2022, 2022

2021
Design of LoRa/NB-IoT Gateway for Intelligent Infusion System Based on Binary Exponential Backoff Algorithm.
Proceedings of the 2021 International Conference on Control, 2021

Human Kinematics Modeling and Simulation Based on OpenSim.
Proceedings of the 2021 International Conference on Control, 2021

Multi-Scale Aligned Distillation for Low-Resolution Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Fully Convolutional Networks for Panoptic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Scale-Aware Automatic Augmentation for Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Stitcher: Feedback-driven Data Provider for Object Detection.
CoRR, 2020

An efficient and privacy-preserving truth discovery scheme in crowdsensing applications.
Comput. Secur., 2020

Fine-Grained Dynamic Head for Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Rethinking Learnable Tree Filter for Generic Feature Transform.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Learning Dynamic Routing for Semantic Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Reducing the site survey using fingerprint refinement for cost-efficient indoor location.
Wirel. Networks, 2019

FastPose: Towards Real-time Pose Estimation and Tracking via Scale-normalized Multi-task Networks.
CoRR, 2019

Process Extraction from Texts via Multi-Task Architecture.
CoRR, 2019

Design of Industrial Field Intelligent Temperature Acquisition System Based on Timestamped Anti-Interference Algorithm.
Proceedings of the IEEE Symposium Series on Computational Intelligence, 2019

Design of Intelligent Unmanned Vehicle Handling Simulation System.
Proceedings of the IEEE Symposium Series on Computational Intelligence, 2019

Design of Wearable Human Monitoring System Based on Internet of Things.
Proceedings of the IEEE Symposium Series on Computational Intelligence, 2019

Emergency Communication System Based on Tethered Unmanned Aerial Vehicle.
Proceedings of the IEEE Symposium Series on Computational Intelligence, 2019

Learnable Tree Filter for Structure-preserving Feature Transform.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

State-Aware Re-Identification Feature for Multi-Target Multi-Camera Tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Attention-Guided Unified Network for Panoptic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
A New Approach to Security Analysis of Smart Home Authentication Systems.
Fundam. Informaticae, 2018

The Governance of Risks in Ridesharing: A Revelatory Case from Singapore.
CoRR, 2018

Identity-Enhanced Network for Facial Expression Recognition.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
Design of low power 4×40 Gb/s laser diode driver for parallel optical transmission systems.
Sci. China Inf. Sci., 2017

2016
Stratify Mobile App Reviews: E-LDA Model Based on Hot "Entity" Discovery.
Proceedings of the 12th International Conference on Signal-Image Technology & Internet-Based Systems, 2016

2015
The multi-attribute group decision-making method based on the interval grey uncertain linguistic generalized hybrid averaging operator.
Neural Comput. Appl., 2015

2013
Fair and Efficient Spectrum Splitting for Unlicensed Secondary Users in Cooperative Cognitive Radio Networks.
Wirel. Pers. Commun., 2013

A Cluster-Based Consensus Algorithm in a Wireless Sensor Network.
Int. J. Distributed Sens. Networks, 2013

Game Theory Based Hybrid Access for Macrocell-Edge Users in a Macro-Femto Network.
Proceedings of the 77th IEEE Vehicular Technology Conference, 2013

Availability analytical model for permanent dedicated path protection in service differentiated WDM networks.
Proceedings of the 2013 Optical Fiber Communication Conference and Exposition and the National Fiber Optic Engineers Conference (OFC/NFOEC), 2013

2012
Dynamic Traffic Grooming in Flexible Multi-Layer IP/Optical Networks.
IEEE Commun. Lett., 2012

Availability Analytical Model for Permanent Dedicated Path Protection in WDM Networks.
IEEE Commun. Lett., 2012

An Adaptive Blind Single Antenna Interference Cancellation Algorithm for 4G LTE Systems.
Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012


  Loading...