Pengcheng He

Orcid: 0000-0003-3305-952X

According to our database, Pengcheng He authored at least 80 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Personalized Abstractive Summarization by Tri-agent Generation Pipeline.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

2023
A novel competitive constrained dual-archive dual-stage evolutionary algorithm for constrained multiobjective optimization.
Swarm Evol. Comput., December, 2023

Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective.
CoRR, 2023

LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models.
CoRR, 2023

Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling.
CoRR, 2023

DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models.
CoRR, 2023

Deep Reinforcement Learning from Hierarchical Weak Preference Feedback.
CoRR, 2023

Do you really follow me? Adversarial Instructions for Evaluating the Robustness of Large Language Models.
CoRR, 2023

Summaries, Highlights, and Action items: Design, implementation and evaluation of an LLM-powered meeting recap system.
CoRR, 2023

LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation.
CoRR, 2023

Interactive Editing for Text Summarization.
CoRR, 2023

PROM: A Phrase-level Copying Mechanism with Pre-training for Abstractive Summarization.
CoRR, 2023

Summarization with Precise Length Control.
CoRR, 2023

ChatGPT-steered Editing Instructor for Customization of Abstractive Summarization.
CoRR, 2023

Instruction Tuning with GPT-4.
CoRR, 2023

Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback.
CoRR, 2023

Guiding Large Language Models via Directional Stimulus Prompting.
CoRR, 2023

A Prototype-Oriented Clustering for Domain Shift with Source Privacy.
CoRR, 2023

Cellular Network Optimization Using Unfolding-Based Graph Neural Networks.
Proceedings of the 24th IEEE International Workshop on Signal Processing Advances in Wireless Communications, 2023

Patch Diffusion: Faster and More Data-Efficient Training of Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

In-Context Learning Unlocked for Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Guiding Large Language Models via Directional Stimulus Prompting.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

POUF: Prompt-Oriented Unsupervised Fine-tuning for Large Pre-trained Models.
Proceedings of the International Conference on Machine Learning, 2023

HyperTuning: Toward Adapting Large Language Models without Back-propagation.
Proceedings of the International Conference on Machine Learning, 2023

Less is More: Task-aware Layer-wise Distillation for Language Model Compression.
Proceedings of the International Conference on Machine Learning, 2023

LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation.
Proceedings of the International Conference on Machine Learning, 2023

Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial Auto-Encoders.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Diffusion-GAN: Training GANs with Diffusion.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

LMGQS: A Large-scale Dataset for Query-focused Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Query Rewriting in Retrieval-Augmented Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

DIONYSUS: A Pre-trained Model for Low-Resource Dialogue Summarization.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Attend to the Right Context: A Plug-and-Play Module for Content-Controllable Summarization.
CoRR, 2022

Momentum Calibration for Text Generation.
CoRR, 2022

Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization.
CoRR, 2022

GODEL: Large-Scale Pre-Training for Goal-Directed Dialog.
CoRR, 2022

Truncated Diffusion Probabilistic Models.
CoRR, 2022

Mixing and Shifting: Exploiting Global and Local Dependencies in Vision MLPs.
CoRR, 2022

Experiences and Lessons Learned From DR Resources Participating in the US and UK Capacity Markets: Mechanisms, Status, Dilemmas and Recommendations.
IEEE Access, 2022

Reinforced Event-Driven Evolutionary Algorithm Based on Double Deep Q-network.
Proceedings of the Advances in Swarm Intelligence - 13th International Conference, 2022

An Improved Particle Swarm Optimization Algorithm for Irregular Flight Recovery Problem.
Proceedings of the Advances in Swarm Intelligence - 13th International Conference, 2022

MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

ALLSH: Active Learning Guided by Local Sensitivity and Hardness.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

OmniTab: Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

A Learned Multi-objective Bacterial Foraging Optimization Algorithm with Continuous Deep Q-Learning.
Proceedings of the Machine Learning for Cyber Security - 4th International Conference, 2022

A Zeroth-Order Block Coordinate Gradient Descent Method For Cellular Network Optimization.
Proceedings of the 18th International Symposium on Wireless Communication Systems, 2022

Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance.
Proceedings of the International Conference on Machine Learning, 2022

No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models.
Proceedings of the Tenth International Conference on Learning Representations, 2022

CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
A Novel Image Encryption Algorithm Based on the Delayed Maps and Permutation-Confusion-Diffusion Architecture.
Secur. Commun. Networks, 2021

Adversarial Training as Stackelberg Game: An Unrolled Optimization Approach.
CoRR, 2021

Greedy Multi-step Off-Policy Reinforcement Learning.
CoRR, 2021

DeBERTa: Decoding-enhanced BERT with Disentangled Attention.
Proceedings of the 9th International Conference on Learning Representations, 2021

Adversarial Regularization as Stackelberg Game: An Unrolled Optimization Approach.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

ARCH: Efficient Adversarial Regularized Training with Caching.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Token-wise Curriculum Learning for Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Irregular Flight Timetable Recovery Under COVID-19: An Approach Based on Genetic Algorithm.
Proceedings of the Data Mining and Big Data - 6th International Conference, 2021

Reader-Guided Passage Reranking for Open-Domain Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Generation-Augmented Retrieval for Open-Domain Question Answering.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

UnitedQA: A Hybrid Approach for Open Domain Question Answering.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Adversarial Training for Large Neural Language Models.
CoRR, 2020

Robust TOA-Based Source Self-Positioning With Clock Imperfection.
Proceedings of the 2020 IEEE Wireless Communications and Networking Conference, 2020

On the Variance of the Adaptive Learning Rate and Beyond.
Proceedings of the 8th International Conference on Learning Representations, 2020

Exploiting Structured Knowledge in Text via Graph-Guided Representation Learning.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
X-SQL: reinforce schema representation with context.
CoRR, 2019

A Hybrid Neural Network Model for Commonsense Reasoning.
CoRR, 2019

Improving Multi-Task Deep Neural Networks via Knowledge Distillation for Natural Language Understanding.
CoRR, 2019

A Bi-Attention Adversarial Network for Prostate Cancer Segmentation.
IEEE Access, 2019

Multi-Task Deep Neural Networks for Natural Language Understanding.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
3D Spatial Pyramid Dilated Network for Pulmonary Nodule Classification.
Symmetry, 2018

2017
Scalable Deep Document / Sequence Reasoning with Cognitive Toolkit.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

2014
Low illumination image Retinex enhancement algorithm based on guided filtering.
Proceedings of the IEEE 3rd International Conference on Cloud Computing and Intelligence Systems, 2014

2012
The design and implementation of the 1911 revolution ontology search engine.
Proceedings of the 9th International Conference on Fuzzy Systems and Knowledge Discovery, 2012

