Dachuan Shi

Orcid: 0000-0002-9296-7213

According to our database, Dachuan Shi authored at least 23 papers between 2020 and 2026.

Bibliography

2026
CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning.
CoRR, May, 2026

Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding.
CoRR, March, 2026

MetaState: Persistent Working Memory Enhances Reasoning in Discrete Diffusion Language Models.
CoRR, March, 2026

Behavior Knowledge Merge in Reinforced Agentic Models.
CoRR, January, 2026

Cognitive digital twins for capability matching toward reconfigurable manufacturing: Leveraging asset administration shells and large language models.
Robotics Comput. Integr. Manuf., 2026

2025
SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs.
CoRR, October, 2025

Mitigating Forgetting Between Supervised and Reinforcement Learning Yields Stronger Reasoners.
CoRR, October, 2025

Superficial Self-Improved Reasoners Benefit from Model Merging.
CoRR, March, 2025

Dual data mapping with fine-tuned large language models and asset administration shells toward interoperable knowledge representation.
Robotics Comput. Integr. Manuf., 2025

Fine-tuning large language models with contrastive margin ranking loss for selective entity matching in product data integration.
Adv. Eng. Informatics, 2025

LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Superficial Self-Improved Reasoners Benefit from Model Merging.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Asset Administration Shell-based Flexible Manufacturing System.
Proceedings of the 6th IEEE International Conference on Industrial Cyber-Physical Systems, 2023

UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.
Proceedings of the International Conference on Machine Learning, 2023

2022
Heuristic Dropout: An Efficient Regularization Method for Medical Image Segmentation Models.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Masked Generative Distillation.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Visual Measurement System for Wheel-Rail Lateral Position Evaluation.
Sensors, 2021

Deep Learning based Virtual Point Tracking for Real-Time Target-less Dynamic Displacement Measurement in Railway Applications.
CoRR, 2021

Multi-Encoder Parse-Decoder Network for Sequential Medical Image Segmentation.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

PTeacher: a Computer-Aided Personalized Pronunciation Training System with Exaggerated Audio-Visual Corrective Feedback.
Proceedings of the CHI '21: CHI Conference on Human Factors in Computing Systems, 2021

2020
Empirical Study on Robustness of Machine Learning Approaches for Fault Diagnosis under Railway Operational Conditions.
Proceedings of the 23rd IEEE International Conference on Intelligent Transportation Systems, 2020

