Ziyin Zhang

Orcid: 0009-0001-5137-8797

According to our database1, Ziyin Zhang authored at least 38 papers between 2009 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Beyond Retrieval: A Multitask Benchmark and Model for Code Search.
CoRR, May, 2026

TingIS: Real-time Risk Event Discovery from Noisy Customer Incidents at Enterprise Scale.
CoRR, April, 2026

F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World.
CoRR, March, 2026


2025
C2LLM Technical Report: A New Frontier in Code Retrieval via Adaptive Cross-Attention Pooling.
CoRR, December, 2025

SHRP: Specialized Head Routing and Pruning for Efficient Encoder Compression.
CoRR, December, 2025

F2LLM Technical Report: Matching SOTA Embedding Performance with 6 Million Open-Source Data.
CoRR, October, 2025

CodeFuse-CR-Bench: A Comprehensiveness-aware Benchmark for End-to-End Code Review Evaluation in Python Projects.
CoRR, September, 2025

CMHG: A Dataset and Benchmark for Headline Generation of Minority Languages in China.
CoRR, September, 2025

From Black Box to Transparency: Enhancing Automated Interpreting Assessment with Explainable AI in College Classrooms.
CoRR, August, 2025

DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning.
CoRR, May, 2025

Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks.
CoRR, May, 2025

T-CPAD: A Transformer-Based Approach for Crowd Flow Prediction and Anomaly Detection.
IEEE Access, 2025

Draft Model Knows When to Stop: Self-Verification Speculative Decoding for Long-Form Generation.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

CMHG: A Dataset and Benchmark for Headline Generation of Minority Languages in China.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Multilingual Encoder Knows more than You Realize: Shared Weights Pretraining for Extremely Low-Resource Languages.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Unifying the Perspectives of NLP and Software Engineering: A Survey on Language Models for Code.
Trans. Mach. Learn. Res., 2024

Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding.
CoRR, 2024

GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding.
CoRR, 2024

Multiple-Choice Questions are Efficient and Robust LLM Evaluators.
CoRR, 2024

Is Cognition and Action Consistent or Not: Investigating Large Language Model's Personality.
CoRR, 2024

Can ChatGPT Rival Neural Machine Translation? A Comparative Study.
CoRR, 2024

MELA: Multilingual Evaluation of Linguistic Acceptability.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Self-Distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Distinguishing Translations by Human, NMT, and ChatGPT: A Linguistic and Statistical Approach.
CoRR, 2023

A Survey on Language Models for Code.
CoRR, 2023

Revisiting Acceptability Judgements.
CoRR, 2023

Hedges in Bidirectional Translations of Publicity-Oriented Documents.
CoRR, 2023

ArguGPT: evaluating, understanding and identifying argumentative essays generated by GPT models.
CoRR, 2023

2022
MUST Augment: Efficient Augmentation with Multi-stage Stochastic Strategy.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2022, 2022

2021
Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer.
CoRR, 2021

Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

CATNet: Scene Text Recognition Guided by Concatenating Augmented Text Features.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

2020
A New Caching Algorithm for Boosting Edge Computing Performance.
Proceedings of the 11th IEEE Annual Ubiquitous Computing, 2020

2018
用户非对称信任关系的推荐算法 (Recommendation Algorithm Combining User's Asymmetric Trust Relationships).
计算机科学, 2018

Development of a new cloudlet content caching algorithm based on web mining.
Proceedings of the IEEE 8th Annual Computing and Communication Workshop and Conference, 2018

2009
Local Planning of AUV Based on Fuzzy-Q Learning in Strong Sea Flow Field.
Proceedings of the Second International Joint Conference on Computational Sciences and Optimization, 2009


  Loading...