Tianxiang Sun

Orcid: 0000-0001-8291-820X

According to our database1, Tianxiang Sun authored at least 36 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance.
CoRR, 2024

In-Memory Learning: A Declarative Learning Framework for Large Language Models.
CoRR, 2024

Turn Waste into Worth: Rectifying Top-k Router of MoE.
CoRR, 2024

AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling.
CoRR, 2024

Dictionary Learning Improves Patch-Free Circuit Discovery in Mechanistic Interpretability: A Case Study on Othello-GPT.
CoRR, 2024

LLM can Achieve Self-Regulation via Hyperparameter Aware Generation.
CoRR, 2024

Can AI Assistants Know What They Don't Know?
CoRR, 2024

Agent Alignment in Evolving Social Norms.
CoRR, 2024

DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
LLatrieval: LLM-Verified Retrieval for Verifiable Generation.
CoRR, 2023

Flames: Benchmarking Value Alignment of Chinese Large Language Models.
CoRR, 2023

Evaluating Hallucinations in Chinese Large Language Models.
CoRR, 2023

Secrets of RLHF in Large Language Models Part I: PPO.
CoRR, 2023

Origin Tracing and Detecting of LLMs.
CoRR, 2023

Multitask Pre-training of Modular Prompt for Chinese Few-Shot Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Improving Contrastive Learning of Sentence Embeddings from AI Feedback.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Paradigm Shift in Natural Language Processing.
Int. J. Autom. Comput., 2022

DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models.
CoRR, 2022

Multi-Task Pre-Training of Modular Prompt for Few-Shot Learning.
CoRR, 2022

BBTv2: Pure Black-Box Optimization Can Be Comparable to Gradient Descent for Few-Shot Learning.
CoRR, 2022

Towards Efficient NLP: A Standard Evaluation and A Strong Baseline.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Black-Box Tuning for Language-Model-as-a-Service.
Proceedings of the International Conference on Machine Learning, 2022

BBTv2: Towards a Gradient-Free Future with Large Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Late Prompt Tuning: A Late Prompt Could Be Better Than Many Prompts.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

A Simple Hash-Based Early Exiting Approach For Language Understanding and Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Towards Efficient NLP: A Standard Evaluation and A Strong Baseline.
CoRR, 2021

Learning to Teach with Student Feedback.
CoRR, 2021

Early Exiting with Ensemble Internal Classifiers.
CoRR, 2021

Does syntax matter? A strong baseline for Aspect-based Sentiment Analysis with RoBERTa.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Accelerating BERT Inference for Sequence Labeling via Early-Exit.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Pre-trained Models for Natural Language Processing: A Survey.
CoRR, 2020

CoLAKE: Contextualized Language and Knowledge Embedding.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Learning Sparse Sharing Architectures for Multiple Tasks.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020


  Loading...