Shimin Tao

Orcid: 0000-0002-2795-6921

According to our database1, Shimin Tao authored at least 78 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
From Handcrafted Features to LLMs: A Brief Survey for Machine Translation Quality Estimation.
CoRR, 2024

Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation.
CoRR, 2024

DeMPT: Decoding-enhanced Multi-phase Prompt Tuning for Making LLMs Be Better Context-aware Translators.
CoRR, 2024

Knowledge-Prompted Estimator: A Novel Approach to Explainable Machine Translation Assessment.
Proceedings of the 26th International Conference on Advanced Communications Technology, 2024

Translate Meanings, Not Just Words: IdiomKB's Role in Optimizing Idiomatic Translation with Language Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
LogSummary: Unstructured Log Summarization for Software Systems.
IEEE Trans. Netw. Serv. Manag., September, 2023

Exploiting Spatial-Temporal Behavior Patterns for Fraud Detection in Telecom Networks.
IEEE Trans. Dependable Secur. Comput., 2023

P-Transformer: Towards Better Document-to-Document Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Automatic Instruction Optimization for Open-source LLM Instruction Tuning.
CoRR, 2023

NJUNLP's Participation for the WMT2023 Quality Estimation Shared Task.
CoRR, 2023

Translate Meanings, Not Just Words: IdiomKB's Role in Optimizing Idiomatic Translation with Language Models.
CoRR, 2023

LogPrompt: Prompt Engineering Towards Zero-Shot and Interpretable Log Analysis.
CoRR, 2023

Collective Human Opinions in Semantic Textual Similarity.
CoRR, 2023

Knowledge-Prompted Estimator: A Novel Approach to Explainable Machine Translation Assessment.
CoRR, 2023

Implicit Cross-Lingual Word Embedding Alignment for Reference-Free Machine Translation Evaluation.
IEEE Access, 2023

Weakly Supervised Entity Alignment with Positional Inspiration.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

HW-TSC's Participation in the WMT 2023 Automatic Post Editing Shared Task.
Proceedings of the Eighth Conference on Machine Translation, 2023

Empowering a Metric with LLM-assisted Named Entity Annotation: HW-TSC's Submission to the WMT23 Metrics Shared Task.
Proceedings of the Eighth Conference on Machine Translation, 2023

Unify Word-level and Span-level Tasks: NJUNLP's Participation for the WMT2023 Quality Estimation Shared Task.
Proceedings of the Eighth Conference on Machine Translation, 2023

Multi-order Matched Neighborhood Consistent Graph Alignment in a Union Vector Space.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

LogDAPT: Log Data Anomaly Detection with Domain-Adaptive Pretraining (industry track).
Proceedings of the 24th International Middleware Conference Industrial Track, 2023

The HW-TSC's Speech-to-Speech Translation System for IWSLT 2023.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Biglog: Unsupervised Large-scale Pre-training for a Unified Log Representation.
Proceedings of the 31st IEEE/ACM International Symposium on Quality of Service, 2023

CONFPILOT: A Pilot for Faster Configuration by Learning from Device Manuals.
Proceedings of the 43rd IEEE International Conference on Distributed Computing Systems, 2023

Zephyr: Zero-Shot Punctuation Restoration.
Proceedings of the IEEE International Conference on Acoustics, 2023

UCorrect: An Unsupervised Framework for Automatic Speech Recognition Error Correction.
Proceedings of the IEEE International Conference on Acoustics, 2023

TeacherSim: Cross-lingual Machine Translation Evaluation with Monolingual Embedding as Teacher.
Proceedings of the 25th International Conference on Advanced Communication Technology, 2023

Chinese ASR and NER Improvement Based on Whisper Fine-Tuning.
Proceedings of the 25th International Conference on Advanced Communication Technology, 2023

SmartSpanNER: Making SpanNER Robust in Low Resource Scenarios.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Improved Pseudo Data for Machine Translation Quality Estimation with Constrained Beam Search.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

DA-Parser: A Pre-trained Domain-aware Parsing Framework for Heterogeneous Log Analysis.
Proceedings of the 47th IEEE Annual Computers, Software, and Applications Conference, 2023

Knowledge Prompt for Whisper: An ASR Entity Correction Approach with Knowledge Base.
Proceedings of the IEEE International Conference on Big Data, 2023

Incorporating Pinyin into Pipeline Named Entity Recognition from Chinese Speech.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Lexical Translation Inconsistency-Aware Document-Level Translation Repair.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Denoising Pre-training for Machine Translation Quality Estimation with Curriculum Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
LogStamp: Automatic Online Log Parsing Based on Sequence Labelling.
SIGMETRICS Perform. Evaluation Rev., 2022

CrossQE: HW-TSC 2022 Submission for the Quality Estimation Shared Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

HW-TSC's Submission for the WMT22 Efficiency Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

Partial Could Be Better than Whole. HW-TSC 2022 Submission for the Metrics Shared Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

NJUNLP's Participation for the WMT2022 Quality Estimation Shared Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

Exploring Robustness of Machine Translation Metrics: A Study of Twenty-Two Automatic Metrics in the WMT22 Metric Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

HW-TSC at SemEval-2022 Task 7: Ensemble Model Based on Pretrained Models for Identifying Plausible Clarifications.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

WRS: Workflow Retrieval System for Cloud Automatic Remediation.
Proceedings of the 2022 IEEE/IFIP Network Operations and Management Symposium, 2022

Neighbors Are Not Strangers: Improving Non-Autoregressive Translation under Low-Frequency Lexical Constraints.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

CCDC: A Chinese-Centric Cross Domain Contrastive Learning Framework.
Proceedings of the Knowledge Science, Engineering and Management, 2022

The HW-TSC's Offline Speech Translation System for IWSLT 2022 Evaluation.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022

The HW-TSC's Simultaneous Speech Translation System for IWSLT 2022 Evaluation.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022

The HW-TSC's Speech to Speech Translation System for IWSLT 2022 Evaluation.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022

Part Represents Whole: Improving the Evaluation of Machine Translation System Using Entropy Enhanced Metrics.
Proceedings of the Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022, 2022

Modeling Consistency Preference via Lexical Chains for Document-level Neural Machine Translation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Diformer: Directional Transformer for Neural Machine Translation.
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022

Target-Side Language Model for Reference-Free Machine Translation Evaluation.
Proceedings of the Machine Translation - 18th China Conference, 2022

PEACook: Post-editing Advancement Cookbook.
Proceedings of the Machine Translation - 18th China Conference, 2022

CCMT 2022 Translation Quality Estimation Task.
Proceedings of the Machine Translation - 18th China Conference, 2022

Incorporating Multilingual Knowledge Distillation into Machine Translation Evaluation.
Proceedings of the Knowledge Graph and Semantic Computing: Knowledge Graph Empowers the Digital Economy, 2022

EntityRank: Unsupervised Mining of Bilingual Named Entity Pairs from Parallel Corpora for Neural Machine Translation.
Proceedings of the IEEE International Conference on Big Data, 2022

HwTscSU's Submissions on WAT 2022 Shared Task.
Proceedings of the 9th Workshop on Asian Translation, 2022

Capture Human Disagreement Distributions by Calibrated Networks for Natural Language Inference.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Deep graph alignment network.
Neurocomputing, 2021

Joint-training on Symbiosis Networks for Deep Nueral Machine Translation models.
CoRR, 2021

Self-Distillation Mixup Training for Non-autoregressive Neural Machine Translation.
CoRR, 2021

UniLog: Deploy One Model and Specialize it for All Log Analysis Tasks.
CoRR, 2021

The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation.
CoRR, 2021

HW-TSC's Participation in the WMT 2021 Efficiency Shared Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

HW-TSC's Participation at WMT 2021 Quality Estimation Shared Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

Make the Blind Translator See The World: A Novel Transfer Learning Solution for Multimodal Machine Translation.
Proceedings of the 18th Biennial Machine Translation Summit - Volume 1: Research Track, 2021

HI-CMLM: Improve CMLM with Hybrid Decoder Input.
Proceedings of the 14th International Conference on Natural Language Generation, 2021

Prefix-Graph: A Versatile Log Parsing Approach Merging Prefix Tree with Probabilistic Graph.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Incorporating Complete Syntactical Knowledge for Spoken Language Understanding.
Proceedings of the Knowledge Graph and Semantic Computing: Knowledge Graph Empowers New Infrastructure Construction, 2021

How Length Prediction Influence the Performance of Non-Autoregressive Translation?
Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2021

2020
Summarizing Unstructured Logs in Online Services.
CoRR, 2020

HW-TSC's Participation at WMT 2020 Automatic Post Editing Shared Task.
Proceedings of the Fifth Conference on Machine Translation, 2020

HW-TSC's Participation at WMT 2020 Quality Estimation Shared Task.
Proceedings of the Fifth Conference on Machine Translation, 2020

LogParse: Making Log Parsing Adaptive through Word Classification.
Proceedings of the 29th International Conference on Computer Communications and Networks, 2020

2019
LogAnomaly: Unsupervised Detection of Sequential and Quantitative Anomalies in Unstructured Logs.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

2018
FUNNEL: Assessing Software Changes in Web-Based Services.
IEEE Trans. Serv. Comput., 2018

2017
Segmentation of Time Series Based on Kinetic Characteristics for Storage Consumption Prediction.
Proceedings of the 37th IEEE International Conference on Distributed Computing Systems, 2017

2015
Rapid and robust impact assessment of software changes in large internet-based services.
Proceedings of the 11th ACM Conference on Emerging Networking Experiments and Technologies, 2015


  Loading...