Shujian Huang

ORCID: 0000-0003-4869-0832

According to our database, Shujian Huang authored at least 160 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
EDT: Improving Large Language Models' Generation by Entropy-based Dynamic Temperature Sampling.
CoRR, 2024

MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation.
CoRR, 2024

Measuring Meaning Composition in the Human Brain with Composition Scores from Large Language Models.
CoRR, 2024

Diffusion Language Models Are Versatile Protein Learners.
CoRR, 2024

Cobra Effect in Reference-Free Image Captioning Metrics.
CoRR, 2024

Question Translation Training for Better Multilingual Reasoning.
CoRR, 2024

MAPO: Advancing Multilingual Reasoning through Multilingual Alignment-as-Preference Optimization.
CoRR, 2024

Multi-Candidate Speculative Decoding.
CoRR, 2024

Lost in the Source Language: How Large Language Models Evaluate the Quality of Machine Translation.
CoRR, 2024

kNN-BOX: A Unified Framework for Nearest Neighbor Generation.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

2023
A Wolf in Sheep's Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language Models Easily.
CoRR, 2023

Exploring the Dialogue Comprehension Ability of Large Language Models.
CoRR, 2023

Dynamic Demonstrations Controller for In-Context Learning.
CoRR, 2023

NJUNLP's Participation for the WMT2023 Quality Estimation Shared Task.
CoRR, 2023

Extrapolating Large Language Models to Non-English by Aligning Languages.
CoRR, 2023

Eliciting the Translation Ability of Large Language Models via Multilingual Finetuning with Translation Instructions.
CoRR, 2023

Multilingual Machine Translation with Large Language Models: Empirical Results and Analysis.
CoRR, 2023

Unify Word-level and Span-level Tasks: NJUNLP's Participation for the WMT2023 Quality Estimation Shared Task.
Proceedings of the Eighth Conference on Machine Translation, 2023

Food-500 Cap: A Fine-Grained Food Caption Benchmark for Evaluating Vision-Language Models.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Only 5% Attention Is All You Need: Efficient Long-range Document-level Neural Machine Translation.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

Addressing Linguistic Bias through a Contrastive Analysis of Academic Writing in the NLP Domain.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

IMTLab: An Open-Source Platform for Building, Evaluating, and Diagnosing Interactive Machine Translation Systems.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Improved Pseudo Data for Machine Translation Quality Estimation with Constrained Beam Search.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Roles of Scaling and Instruction Tuning in Language Perception: Model vs. Human Attention.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

INK: Injecting kNN Knowledge in Nearest Neighbor Machine Translation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

What Knowledge Is Needed? Towards Explainable Memory for kNN-MT Domain Adaptation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Local Interpretation of Transformer Based on Linear Decomposition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

BLEURT Has Universal Translations: An Analysis of Automatic Metrics by Minimum Risk Training.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

CoP: Factual Inconsistency Detection by Controlling the Preference.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Selective Knowledge Distillation for Non-Autoregressive Neural Machine Translation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Denoising Pre-training for Machine Translation Quality Estimation with Curriculum Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Better Datastore, Better Translation: Generating Datastores from Pre-Trained Models for Nearest Neural Machine Translation.
CoRR, 2022

DeepE: a deep neural network for knowledge graph embedding.
CoRR, 2022

Structure-Unified M-Tree Coding Solver for MathWord Problem.
CoRR, 2022

Zero-shot Domain Adaptation for Neural Machine Translation with Retrieved Phrase-level Prompts.
CoRR, 2022

A Numerical Reasoning Question Answering System with Fine-grained Retriever and the Ensemble of Multiple Generators for FinQA.
CoRR, 2022

CrossQE: HW-TSC 2022 Submission for the Quality Estimation Shared Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

NJUNLP's Participation for the WMT2022 Quality Estimation Shared Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

Analyzing the Intensity of Complaints on Social Media.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

FGraDA: A Dataset and Benchmark for Fine-Grained Domain Adaptation in Machine Translation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Helping the Weak Makes You Strong: Simple Multi-Task Learning Improves Non-Autoregressive Translators.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Structure-Unified M-Tree Coding Solver for Math Word Problem.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Probing Cross-modal Semantics Alignment Capability from the Textual Perspective.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Learning from Adjective-Noun Pairs: A Knowledge-enhanced Framework for Target-Oriented Multimodal Sentiment Classification.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Alleviating the Inequality of Attention Heads for Neural Machine Translation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Towards Multi-label Unknown Intent Detection.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

NJUNLP's Submission for CCMT 2022 Quality Estimation Task.
Proceedings of the Machine Translation - 18th China Conference, 2022

BiTIIMT: A Bilingual Text-infilling Method for Interactive Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Rethinking Document-level Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

latent-GLAT: Glancing at Latent Variables for Parallel Text Generation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Non-parametric Online Learning from Human Feedback for Neural Machine Translation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Integrating heterogeneous thesauruses for Chinese synonyms.
Frontiers Comput. Sci., 2021

Non-Parametric Online Learning from Human Feedback for Neural Machine Translation.
CoRR, 2021

Dual Side Deep Context-aware Modulation for Social Recommendation.
CoRR, 2021

Dual Side Deep Context-aware Modulation for Social Recommendation.
Proceedings of the WWW '21: The Web Conference 2021, 2021

HW-TSC's Participation at WMT 2021 Quality Estimation Shared Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

Duplex Sequence-to-Sequence Learning for Reversible Machine Translation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Non-Autoregressive Translation by Learning Target Categorical Codes.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Non-Parametric Unsupervised Domain Adaptation for Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Meta-LMTC: Meta-Learning for Large-Scale Multi-Label Text Classification.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Learning Kernel-Smoothed Machine Translation with Retrieved Examples.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Adaptive Nearest Neighbor Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Energy-based Unknown Intent Detection with Data Manipulation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

When is Char Better Than Subword: A Systematic Study of Segmentation Algorithms for Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Automated Cross-prompt Scoring of Essay Traits.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

DirectQE: Direct Pretraining for Machine Translation Quality Estimation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Improving Self-Attention Networks With Sequential Relations.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

FDMT: A Benchmark Dataset for Fine-grained Domain Adaptation in Machine Translation.
CoRR, 2020

Transformer-based Multi-Aspect Modeling for Multi-Aspect Multi-Sentiment Analysis.
CoRR, 2020

Opinion Transmission Network for Jointly Improving Aspect-oriented Opinion Words Extraction and Sentiment Classification.
CoRR, 2020

Capturing Longer Context for Document-level Neural Machine Translation: A Multi-resolutional Approach.
CoRR, 2020

Prompt Agnostic Essay Scorer: A Domain Generalization Approach to Cross-prompt Automated Essay Scoring.
CoRR, 2020

Toward Making the Most of Context in Neural Machine Translation.
CoRR, 2020

An Improved Label Propagation Algorithm-Based Method to Develop Sectionalizing Strategies for Parallel Power System Restoration.
IEEE Access, 2020

NJU's submission to the WMT20 QE Shared Task.
Proceedings of the Fifth Conference on Machine Translation, 2020

MSGE: A Multi-step Gated Model for Knowledge Graph Completion.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2020

Opinion Transmission Network for Jointly Improving Aspect-Oriented Opinion Words Extraction and Sentiment Classification.
Proceedings of the Natural Language Processing and Chinese Computing, 2020

Transformer-Based Multi-aspect Modeling for Multi-aspect Multi-sentiment Analysis.
Proceedings of the Natural Language Processing and Chinese Computing, 2020

Learning to Generate Personalized Query Auto-Completions via a Multi-View Multi-Task Attentive Approach.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Towards Making the Most of Context in Neural Machine Translation.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Mirror-Generative Neural Machine Translation.
Proceedings of the 8th International Conference on Learning Representations, 2020

A Simple and Effective Approach to Robust Unsupervised Bilingual Dictionary Induction.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Enhance Prototypical Network with Text Descriptions for Few-shot Relation Classification.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

NJUNLP's Machine Translation System for CCMT-2020 Uighur → Chinese Translation Task.
Proceedings of the Machine Translation - 16th China Conference, 2020

NJUNLP's Submission for CCMT20 Quality Estimation Task.
Proceedings of the Machine Translation - 16th China Conference, 2020

A Reinforced Generation of Adversarial Examples for Neural Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

RPD: A Distance Function Between Word Embeddings.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2020

Dialogue State Tracking with Explicit Slot Connection Modeling.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Explicit Semantic Decomposition for Definition Generation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Latent Opinions Transfer Network for Target-Oriented Opinion Words Extraction.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Acquiring Knowledge from Pre-Trained Model to Neural Machine Translation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

GRET: Global Representation Enhanced Transformer.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Generating Diverse Translation by Manipulating Multi-Head Attention.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Non-autoregressive Transformer by Position Learning.
CoRR, 2019

A Reinforced Generation of Adversarial Samples for Neural Machine Translation.
CoRR, 2019

Multi-Perspective Inferrer: Reasoning Sentences Relationship from Holistic Perspective.
CoRR, 2019

Improving Neural Machine Translation with Pre-trained Representation.
CoRR, 2019

Correct-and-Memorize: Learning to Translate from Interactive Revisions.
CoRR, 2019

Exploiting Noisy Data in Distant Supervision Relation Classification.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Online Distilling from Checkpoints for Neural Machine Translation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Target-oriented Opinion Words Extraction with Target-fused Neural Sequence Labeling.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Utilizing Non-Parallel Text for Style Transfer by Making Partial Comparisons.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Correct-and-Memorize: Learning to Translate from Interactive Revisions.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Dynamic Past and Future for Neural Machine Translation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Fine-grained Knowledge Fusion for Sequence Labeling Domain Adaptation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Improving Bilingual Lexicon Induction on Distant Language Pairs.
Proceedings of the Machine Translation - 15th China Conference, 2019

CCMT 2019 Machine Translation Evaluation Report.
Proceedings of the Machine Translation - 15th China Conference, 2019

Learning Representation Mapping for Relation Detection in Knowledge Base Question Answering.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Generating Sentences from Disentangled Syntactic and Semantic Spaces.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Collaborative Filtering with Topic and Social Latent Factors Incorporating Implicit Feedback.
ACM Trans. Knowl. Discov. Data, 2018

Modeling Past and Future for Neural Machine Translation.
Trans. Assoc. Comput. Linguistics, 2018

Learning to Discriminate Noises for Incorporating External Information in Neural Machine Translation.
CoRR, 2018

Collaborative Filtering with Topic and Social Latent Factors Incorporating Implicit Feedback.
CoRR, 2018

Improving Aspect Identification with Reviews Segmentation.
Proceedings of the Natural Language Processing and Chinese Computing, 2018

Combining Character and Word Information in Neural Machine Translation Using a Multi-Level Attention.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Dynamic Oracle for Neural Machine Translation in Decoding Phase.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Unsupervised Bilingual Lexicon Induction via Latent Variable Models.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Improving Review Representations With User Attention and Product Attention for Sentiment Classification.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
PQAC-WN: constructing a wordnet for Pre-Qin ancient Chinese.
Lang. Resour. Evaluation, 2017

A Neural Probabilistic Structured-Prediction Method for Transition-Based Natural Language Processing.
J. Artif. Intell. Res., 2017

Modeling Past and Future for Neural Machine Translation.
CoRR, 2017

Findings of the 2017 Conference on Machine Translation (WMT17).
Proceedings of the Second Conference on Machine Translation, 2017

AGRA: An Analysis-Generation-Ranking Framework for Automatic Abbreviation from Paper Titles.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Deep Matrix Factorization Models for Recommender Systems.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Compressing Neural Networks by Applying Frequent Item-Set Mining.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2017, 2017

Word-Context Character Embeddings for Chinese Word Segmentation.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Neural Machine Translation with Word Predictions.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

RGraph: Generating Reference Graphs for Better Machine Translation Evaluation.
Proceedings of the Machine Translation - 13th China Workshop, 2017

Top-Rank Enhanced Listwise Optimization for Statistical Machine Translation.
Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), 2017

A Multi-view Clustering Model for Event Detection in Twitter.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2017

Chunk-Based Bi-Scale Decoder for Neural Machine Translation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Improved Neural Machine Translation with a Syntax-Aware Encoder and Decoder.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Enhancing Shift-Reduce Constituent Parsing with Action N-Gram Model.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2016

Adaptation of Language Models for SMT Using Neural Networks with Topic Information.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2016

PRIMT: A Pick-Revise Framework for Interactive Machine Translation.
Proceedings of the NAACL HLT 2016, 2016

Evaluating a Deterministic Shift-Reduce Neural Parser for Constituent Parsing.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Tagging Chinese microblogger via sparse feature selection.
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

Tree-State Based Rule Selection Models for Hierarchical Phrase-Based Machine Translation.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

A Search-Based Dynamic Reranking Model for Dependency Parsing.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Non-linear Learning for Statistical Machine Translation.
CoRR, 2015

Resolving Coordinate Structures for Chinese Constituent Parsing.
Proceedings of the Natural Language Processing and Chinese Computing - 4th CCF Conference, 2015

Word Segmentation of Micro Blogs with Bagging.
Proceedings of the Natural Language Processing and Chinese Computing - 4th CCF Conference, 2015

A Synthetic Approach for Recommendation: Combining Ratings, Social Relations, and Reviews.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Graph-Based Collective Lexical Selection for Statistical Machine Translation.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Academic Paper Recommendation Based on Heterogeneous Graph.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2015

Sentiment Classification with Graph Sparsity Regularization.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2015

A Unified Framework for Jointly Learning Distributed Representations of Word and Attributes.
Proceedings of The 7th Asian Conference on Machine Learning, 2015

A Neural Probabilistic Structured-Prediction Model for Transition-Based Dependency Parsing.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Non-linear Learning for Statistical Machine Translation.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Structured Sparsity with Group-Graph Regularization.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Learning word embeddings from dependency relations.
Proceedings of the 2014 International Conference on Asian Language Processing, 2014

An Investigation on Statistical Machine Translation with Neural Language Models.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2014

2013
Forgetting Word Segmentation in Chinese Text Classification with L1-Regularized Logistic Regression.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2013

2012
Enhancing Statistical Machine Translation with Character Alignment.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

Adapting Conventional Chinese Word Segmenter for Segmenting Micro-blog Text: Combining Rule-based and Statistic-based Approaches.
Proceedings of the Second CIPS-SIGHAN Joint Conference on Chinese Language Processing, 2012

2011
Language Model Weight Adaptation Based on Cross-entropy for Statistical Machine Translation.
Proceedings of the 25th Pacific Asia Conference on Language, Information and Computation, 2011

Dealing with Spurious Ambiguity in Learning ITG-based Word Alignment.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2010
Improving Word Alignment by Semi-Supervised Ensemble.
Proceedings of the Fourteenth Conference on Computational Natural Language Learning, 2010

2009
Combining ILP and MLN for Coreference Resolution.
Proceedings of the 2009 International Conference on Asian Language Processing, 2009

Segmenting Long Sentence Pairs for Statistical Machine Translation.
Proceedings of the 2009 International Conference on Asian Language Processing, 2009

