Shaobo Wang

Orcid: 0000-0001-8156-7081

Affiliations:
  • Shanghai Jiao Tong University, Shanghai, China


According to our database1, Shaobo Wang authored at least 40 papers between 2021 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
The Missing Piece in Pre-trained Model Evaluation: Reward-Guided Decoding Unlocks Task-Oriented Behavior Without Parameter Updates.
CoRR, May, 2026

DISA: Offline Importance Sampling for Distribution-Matching LLM-RL.
CoRR, May, 2026

Breaking Dual Bottlenecks: Evolving Unified Multimodal Models into Self-Adaptive Interleaved Visual Reasoners.
CoRR, May, 2026

Bridging Visual Representation and Reinforcement Learning from Verifiable Rewards in Large Vision-Language Models.
CoRR, March, 2026

Flash-Unified: A Training-Free and Task-Aware Acceleration Framework for Native Unified Models.
CoRR, March, 2026

Towards Principled Dataset Distillation: A Spectral Distribution Perspective.
CoRR, March, 2026

Credit Where It is Due: Cross-Modality Connectivity Drives Precise Reinforcement Learning for MLLM Reasoning.
CoRR, February, 2026

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration.
CoRR, February, 2026

Socratic-Geo: Synthetic Data Generation and Geometric Reasoning via Multi-Agent Interaction.
CoRR, February, 2026

Agentic Proposing: Enhancing Large Language Model Reasoning via Compositional Skill Synthesis.
CoRR, February, 2026

Grounding and Enhancing Informativeness and Utility in Dataset Distillation.
CoRR, January, 2026

Bridging Visual Dynamics and Narrative Reasoning: Multimodal Large Language Models for Short Drama Quality Assessment.
Proceedings of the ACM Web Conference 2026, 2026

UNSEEN: Enhancing Dataset Pruning from a Generalization Perspective.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

ImageBindDC: Compressing Multi-modal Data with ImageBind-based Condensation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
VideoCompressa: Data-Efficient Video Understanding via Joint Temporal Compression and Spatial Reconstruction.
CoRR, November, 2025

Diffusion LLM with Native Variable Generation Lengths: Let [EOS] Lead the Way.
CoRR, October, 2025

CircuitSeer: Mining High-Quality Data by Probing Mathematical Reasoning Circuits in LLMs.
CoRR, October, 2025

Rethinking LLM Evaluation: Can We Evaluate LLMs with 200x Less Data?
CoRR, October, 2025

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation.
CoRR, October, 2025

Socratic-Zero : Bootstrapping Reasoning via Data-Free Agent Co-evolution.
CoRR, September, 2025

Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning.
CoRR, September, 2025

dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching.
CoRR, June, 2025

Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs.
CoRR, June, 2025

Shifting AI Efficiency From Model-Centric to Data-Centric Compression.
CoRR, May, 2025

KO: Kinetics-inspired Neural Optimizer with PDE Simulation Approaches.
CoRR, May, 2025

DD-Ranking: Rethinking the Evaluation of Dataset Distillation.
CoRR, May, 2025

Compute Only 16 Tokens in One Timestep: Accelerating Diffusion Transformers with Cluster-Driven Feature Caching.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

SpeCa: Accelerating Diffusion Transformers with Speculative Feature Caching.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Transformers.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Stop Looking for "Important Tokens" in Multimodal Language Models: Duplication Matters More.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Dataset Distillation with Neural Characteristic Function: A Minmax Perspective.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models.
CoRR, 2024

DRUPI: Dataset Reduction Using Privileged Information.
CoRR, 2024

Not All Samples Should Be Utilized Equally: Towards Understanding and Improving Dataset Distillation.
CoRR, 2024

Think2Drive: Efficient Reinforcement Learning by Thinking in Latent World Model for Quasi-Realistic Autonomous Driving (in CARLA-v2).
CoRR, 2024

Think2Drive: Efficient Reinforcement Learning by Thinking with Latent World Model for Autonomous Driving (in CARLA-V2).
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Unified Batch Normalization: Identifying and Alleviating the Feature Condensation in Batch Normalization and a Unified Framework.
CoRR, 2023

2021
Trap of Feature Diversity in the Learning of MLPs.
CoRR, 2021

Visualizing the Emergence of Intermediate Visual Patterns in DNNs.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021


  Loading...