Shaohui Lin

Orcid: 0009-0004-9264-9241

According to our database1, Shaohui Lin authored at least 88 papers between 2016 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
WaveMamba: Wavelet-Driven Mamba Fusion for RGB-Infrared Object Detection.
CoRR, July, 2025

RealSR-R1: Reinforcement Learning for Real-World Image Super-Resolution with Vision-Language Chain-of-Thought.
CoRR, June, 2025

CompBench: Benchmarking Complex Instruction-guided Image Editing.
CoRR, May, 2025

TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation.
CoRR, April, 2025

IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval.
CoRR, April, 2025

LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition?
CoRR, March, 2025

Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models.
CoRR, March, 2025

Autoregressive Image Generation Guided by Chains of Thought.
CoRR, February, 2025

LIPT: Latency-Aware Image Processing Transformer.
IEEE Trans. Image Process., 2025

DCS-RISR: Dynamic channel splitting for efficient real-world image super-resolution.
Neural Networks, 2025

Complete Chess Games Enable LLM Become A Chess Master.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

AugKD: Ingenious Augmentations Empower Knowledge Distillation for Image Super-Resolution.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Knowledge Distillation with Multi-granularity Mixture of Priors for Image Super-Resolution.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Knowledge Transfer Across Modalities for Weakly Supervised Point Cloud Semantic Segmentation.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Weakly Supervised Semantic Segmentation via Progressive Confidence Region Expansion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SET: Spectral Enhancement for Tiny Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Dynamic Contrastive Knowledge Distillation for Efficient Image Restoration.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Probability-Density-aware Semi-supervised Learning.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Online Management for Edge-Cloud Collaborative Continuous Learning: A Two-Timescale Approach.
IEEE Trans. Mob. Comput., December, 2024

Improving rare relation inferring for scene graph generation using bipartite graph network.
Comput. Vis. Image Underst., February, 2024

A closer look at branch classifiers of multi-exit architectures.
Comput. Vis. Image Underst., February, 2024

Dynamic image super-resolution via progressive contrastive self-distillation.
Pattern Recognit., 2024

Class-imbalanced semi-supervised learning for large-scale point cloud semantic segmentation via decoupling optimization.
Pattern Recognit., 2024

SegCFT: Context-aware Fourier Transform for efficient semantic segmentation.
Neurocomputing, 2024

Hi-Mamba: Hierarchical Mamba for Efficient Image Super-Resolution.
CoRR, 2024

HUWSOD: Holistic Self-training for Unified Weakly Supervised Object Detection.
CoRR, 2024

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis.
CoRR, 2024

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report.
CoRR, 2024

Fusion-Mamba for Cross-modality Object Detection.
CoRR, 2024

LIPT: Latency-aware Image Processing Transformer.
CoRR, 2024

Rethinking Centered Kernel Alignment in Knowledge Distillation.
CoRR, 2024

CLIP in Mirror: Disentangling text from visual images through reflection.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Rethinking Centered Kernel Alignment in Knowledge Distillation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Aligning and Prompting Everything All at Once for Universal Visual Perception.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

A General and Efficient Training for Transformer via Token Expansion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CLIP-Driven Open-Vocabulary 3D Scene Graph Generation via Cross-Modality Contrastive Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

AQ-DETR: Low-Bit Quantized Detection Transformer with Auxiliary Queries.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Weakly Supervised Open-Vocabulary Object Detection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

SPD-DDPM: Denoising Diffusion Probabilistic Models in the Symmetric Positive Definite Space.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Kumaraswamy Wavelet for Heterophilic Scene Graph Generation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Hybrid knowledge distillation from intermediate layers for efficient Single Image Super-Resolution.
Neurocomputing, October, 2023

MISSU: 3D Medical Image Segmentation via Self-Distilling TransUNet.
IEEE Trans. Medical Imaging, September, 2023

Compressing convolutional neural networks with cheap convolutions and online distillation.
Displays, July, 2023

A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise.
CoRR, 2023

Filter Pruning for Efficient CNNs via Knowledge-driven Differential Filter Sampler.
CoRR, 2023

EdgeC3: Online Management for Edge-Cloud Collaborative Continuous Learning.
Proceedings of the 20th Annual IEEE International Conference on Sensing, 2023

Data-Free Low-Bit Quantization via Dynamic Multi-teacher Knowledge Distillation.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

MVP-SEG: Multi-view Prompt Learning for Open-Vocabulary Semantic Segmentation.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Classifier Decoupled Training for Black-Box Unsupervised Domain Adaptation.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Prior Knowledge-driven Dynamic Scene Graph Generation with Causal Inference.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Latent Feature Regularization based Adversarial Network for Brain Tumor Anomaly Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

AttriCLIP: A Non-Incremental Learner for Incremental Knowledge Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

RAGT: Learning Robust Features for Occluded Human Pose and Shape Estimation with Attention-Guided Transformer.
Proceedings of the Computer-Aided Design and Computer Graphics, 2023

An Online Control Approach of Collaborative Federated Learning with Constrained Resources.
Proceedings of the 7th Asia-Pacific Workshop on Networking, 2023

Adaptive Hierarchy-Branch Fusion for Online Knowledge Distillation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Explicit Invariant Feature Induced Cross-Domain Crowd Counting.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
DCS-RISR: Dynamic Channel Splitting for Efficient Real-world Image Super-Resolution.
CoRR, 2022

Self-supervised Models are Good Teaching Assistants for Vision Transformers.
Proceedings of the International Conference on Machine Learning, 2022

DisCo: Remedying Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

HybridCR: Weakly-Supervised 3D Point Cloud Semantic Segmentation via Hybrid Contrastive Regularization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Comprehensive Regularization in a Bi-directional Predictive Network for Video Anomaly Detection.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Towards Compact Single Image Super-Resolution via Contrastive Self-distillation.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Learn from Concepts: Towards the Purified Memory for Few-shot Learning.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Novelty Detection via Contrastive Learning with Negative Data Augmentation.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Self-supervised Compressed Video Action Recognition via Temporal-Consistent Sampling.
Proceedings of the Neural Information Processing - 28th International Conference, 2021

Contrastive Learning for Compact Single Image Dehazing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Farewell to Mutual Information: Variational Distillation for Cross-Modal Person Re-Identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Towards Compact CNNs via Collaborative Compression.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Toward Compact ConvNets via Structure-Sparsity Regularized Filter Pruning.
IEEE Trans. Neural Networks Learn. Syst., 2020

PAMS: Quantized Super-Resolution via Parameterized Max Scale.
CoRR, 2020

Towards deep neural network compression via learnable wavelet transforms.
CoRR, 2020

Neural Network Compression via Learnable Wavelet Transforms.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2020, 2020

PAMS: Quantized Super-Resolution via Parameterized Max Scale.
Proceedings of the Computer Vision - ECCV 2020, 2020

Interpretable Neural Network Decoupling.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Holistic CNN Compression via Low-Rank Decomposition with Knowledge Transfer.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Training convolutional neural networks with cheap convolutions and online distillation.
CoRR, 2019

Dynamic Neural Network Decoupling.
CoRR, 2019

Towards Compact ConvNets via Structure-Sparsity Regularized Filter Pruning.
CoRR, 2019

Pruning Blocks for CNN Compression and Acceleration via Online Ensemble Distillation.
IEEE Access, 2019

Towards Optimal Structured CNN Pruning via Generative Adversarial Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Exploiting Kernel Sparsity and Entropy for Interpretable CNN Compression.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Accelerating Convolutional Networks via Global & Dynamic Filter Pruning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

2017
ESPACE: Accelerating Convolutional Neural Networks via Eliminating Spatial and Channel Redundancy.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Masked face detection via a modified LeNet.
Neurocomputing, 2016

Towards Convolutional Neural Networks Compression via Global Error Reconstruction.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016


  Loading...