Hui Huang

Orcid: 0000-0002-3544-9719

Affiliations:
  • Harbin Institute of Technology, Faculty of Computing, China


According to our database1, Hui Huang authored at least 24 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory.
CoRR, May, 2025

Think-J: Learning to Think for Generative LLM-as-a-Judge.
CoRR, May, 2025

DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models.
CoRR, April, 2025

AIR: Complex Instruction Generation via Automatic Iterative Refinement.
CoRR, February, 2025

DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

MuSC: Improving Complex Instruction Following with Multi-granularity Self-Contrastive Training.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

An Empirical Study of LLM-as-a-Judge for LLM Evaluation: Fine-tuned Judge Model is not a General Substitute for GPT-4.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Multi-view fusion for universal translation quality estimation.
Inf. Fusion, February, 2024

Multi-view fusion for instruction mining of large language model.
Inf. Fusion, 2024

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models.
CoRR, 2024

Legal Fact Prediction: Task Definition and Dataset Construction.
CoRR, 2024

Self-Evaluation of Large Language Model based on Glass-box Features.
CoRR, 2024

An Empirical Study of LLM-as-a-Judge for LLM Evaluation: Fine-tuned Judge Models are Task-specific Classifiers.
CoRR, 2024

Self-Evaluation of Large Language Model based on Glass-box Features.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Mitigating the Bias of Large Language Model Evaluation.
Proceedings of the Chinese Computational Linguistics - 23rd China National Conference, 2024

2023
Unleashing Infinite-Length Input Capacity for Large-scale Language Models with Self-Controlled Memory System.
CoRR, 2023

Retrieval-Augmented Classification with Decoupled Representation.
CoRR, 2023

Character, Word, or Both? Revisiting the Segmentation Granularity for Chinese Pre-trained Language Models.
CoRR, 2023

Towards Making the Most of LLM for Translation Quality Estimation.
Proceedings of the Natural Language Processing and Chinese Computing, 2023

HIT-MI&T Lab's Submission to Eval4NLP 2023 Shared Task.
Proceedings of the 4th Workshop on Evaluation and Comparison of NLP Systems, 2023

Iterative Nearest Neighbour Machine Translation for Unsupervised Domain Adaptation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Improving Translation Quality Estimation with Bias Mitigation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023


  Loading...