Shuzheng Si

According to our database1, Shuzheng Si authored at least 24 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
MENTOR: Efficient Multimodal-Conditioned Tuning for Autoregressive Vision Generation Models.
CoRR, July, 2025

SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications.
CoRR, June, 2025

Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning.
CoRR, May, 2025

UltraIF: Advancing Instruction Following from the Wild.
CoRR, February, 2025

Document Segmentation Matters for Retrieval-Augmented Generation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

GLTW: Joint Improved Graph Transformer and LLM via Three-Word Language for Knowledge Graph Completion.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Looking Beyond Text: Reducing Language bias in Large Vision-Language Models via Multimodal Dual-Attention and Soft-Image Guidance.
CoRR, 2024

Selecting Influential Samples for Long Context Alignment via Homologous Models' Guidance and Contextual Awareness Measurement.
CoRR, 2024

Rethinking Semantic Parsing for Large Language Models: Enhancing LLM Performance with Semantic Hints.
CoRR, 2024

UltraEdit: Instruction-based Fine-Grained Image Editing at Scale.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Mitigating Language-Level Performance Disparity in mPLMs via Teacher Language Selection and Cross-lingual Self-Distillation.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

UniPCM: Universal Pre-trained Conversation Model with Task-aware Automatic Prompt.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Improving the Robustness of Distantly-Supervised Named Entity Recognition via Uncertainty-Aware Teacher Learning and Student-Student Collaborative Learning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

One-Shot Learning as Instruction Data Prospector for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
One Shot Learning as Instruction Data Prospector for Large Language Models.
CoRR, 2023

ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks.
CoRR, 2023

Distantly-Supervised Named Entity Recognition with Uncertainty-aware Teacher Learning and Student-student Collaborative Learning.
CoRR, 2023

SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue in Multiple Domains.
CoRR, 2023

SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Mining Clues from Incomplete Utterance: A Query-enhanced Network for Incomplete Utterance Rewriting.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER.
Proceedings of the 29th International Conference on Computational Linguistics, 2022


  Loading...