Linjun Shou

Orcid: 0000-0002-1050-7708

According to our database1, Linjun Shou authored at least 66 papers between 2019 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Hypertext Entity Extraction in Webpage.
CoRR, 2024

2023
Improving Readability for Automatic Speech Recognition Transcription.
ACM Trans. Asian Low Resour. Lang. Inf. Process., May, 2023

Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers.
CoRR, 2023

Coherent Entity Disambiguation via Modeling Topic and Categorical Dependency.
CoRR, 2023

Instructed Language Models with Retrievers Are Powerful Entity Linkers.
CoRR, 2023

Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations.
CoRR, 2023

TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs.
CoRR, 2023

Bridge the Gap between Language models and Tabular Understanding.
CoRR, 2023

Augmenting Passage Representations with Query Generation for Enhanced Cross-Lingual Dense Retrieval.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Typos-aware Bottlenecked Pre-Training for Robust Dense Retrieval.
Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region, 2023

Large Language Models are Diverse Role-Players for Summarization Evaluation.
Proceedings of the Natural Language Processing and Chinese Computing, 2023

Coherent Entity Disambiguation via Modeling Topic and Categorical Dependency.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Instructed Language Models with Retrievers Are Powerful Entity Linkers.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

RUEL: Retrieval-Augmented User Representation with Edge Browser Logs for Sequential Recommendation.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Structural Contrastive Pretraining for Cross-Lingual Comprehension.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Alleviating Over-smoothing for Unsupervised Sentence Representation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

A Graph Fusion Approach for Cross-Lingual Machine Reading Comprehension.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

WIERT: Web Information Extraction via Render Tree.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Graph Fusion Network for Text Classification.
Knowl. Based Syst., 2022

Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation.
CoRR, 2022

Negative Sampling for Contrastive Representation Learning: A Review.
CoRR, 2022

Multi-level Contrastive Learning for Cross-lingual Spoken Language Understanding.
CoRR, 2022

Transformer-Empowered Content-Aware Collaborative Filtering.
Proceedings of the Fourth Knowledge-aware and Conversational Recommender Systems Workshop co-located with 16th ACM Conference on Recommender Systems (RecSys 2022), 2022

Combining Unstructured Content and Knowledge Graphs into Recommendation Datasets.
Proceedings of the Fourth Knowledge-aware and Conversational Recommender Systems Workshop co-located with 16th ACM Conference on Recommender Systems (RecSys 2022), 2022

Bridging the Gap between Language Models and Cross-Lingual Sequence Labeling.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Unsupervised Context Aware Sentence Representation Pretraining for Multi-lingual Dense Retrieval.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Empowering Dual-Encoder with Query Generator for Cross-Lingual Dense Retrieval.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Lexicon-Enhanced Self-Supervised Training for Multilingual Dense Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Label-aware Multi-level Contrastive Learning for Cross-lingual Spoken Language Understanding.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Tiger: Transferable Interest Graph Embedding for Domain-Level Zero-Shot Recommendation.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

From Good to Best: Two-Stage Training for Cross-Lingual Machine Reading Comprehension.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
From Good to Best: Two-Stage Training for Cross-lingual Machine Reading Comprehension.
CoRR, 2021

A Joint and Domain-Adaptive Approach to Spoken Language Understanding.
CoRR, 2021

CalibreNet: Calibration Networks for Multilingual Sequence Labeling.
Proceedings of the WSDM '21, 2021

CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Language Scaling: Applications, Challenges and Approaches.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Reinforced Iterative Knowledge Distillation for Cross-Lingual Named Entity Recognition.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Improving Zero-shot Neural Machine Translation on Language-specific Encoders- Decoders.
Proceedings of the International Joint Conference on Neural Networks, 2021

Generating Human Readable Transcript for Automatic Speech Recognition with Pre-Trained Language Model.
Proceedings of the IEEE International Conference on Acoustics, 2021

WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Learning from Multiple Noisy Augmented Data Sets for Better Cross-Lingual Spoken Language Understanding.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Syntax-Enhanced Pre-trained Model.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Retrieval Enhanced Model for Commonsense Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

GLGE: A New General Language Generation Evaluation Benchmark.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

CoSQA: 20, 000+ Web Queries for Code Search and Question Answering.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Reinforced Multi-Teacher Selection for Knowledge Distillation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Tag and Correct: Question aware Open Information Extraction with Two-stage Decoding.
CoRR, 2020

Pre-training Text Representations as Meta Learning.
CoRR, 2020

Inferential Text Generation with Multiple Knowledge Sources and Meta-Learning.
CoRR, 2020

XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation.
CoRR, 2020

Model Compression with Two-stage Multi-teacher Knowledge Distillation for Web Question Answering System.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

Mining Implicit Relevance Feedback from User Behavior for Web Question Answering.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

MaP: A Matrix-based Prediction Approach to Improve Span Extraction in Machine Reading Comprehension.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

No Answer is Better Than Wrong Answer: A Reflection Model for Document Level Machine Reading Comprehension.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

XGLUE: A New Benchmark Datasetfor Cross-lingual Pre-training, Understanding and Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

CodeBERT: A Pre-Trained Model for Programming and Natural Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

A Graph Representation of Semi-structured Data for Web Question Answering.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Cross-lingual Machine Reading Comprehension with Language Branch Knowledge Distillation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

LogicalFactChecker: Leveraging Logical Operations for Fact Checking with Graph Module Network.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Enhancing Answer Boundary Detection for Multilingual Machine Reading Comprehension.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Graph-Based Reasoning over Heterogeneous External Knowledge for Commonsense Question Answering.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Model Compression with Multi-Task Knowledge Distillation for Web-scale Question Answering System.
CoRR, 2019

NeuronBlocks - Building Your NLP DNN Models Like Playing Lego.
CoRR, 2019

A Recurrent Attention Network for Judgment Prediction.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2019: Text and Time Series, 2019

Unicoder: A Universal Language Encoder by Pre-training with Multiple Cross-lingual Tasks.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

NeuronBlocks: Building Your NLP DNN Models Like Playing Lego.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019


  Loading...