Doug Kang

According to our database1, Doug Kang authored at least 6 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pretraining.
CoRR, February, 2026

Accelerating Personalization Signal Learning via Synthetic Data.
Proceedings of the Advances in Information Retrieval, 2026

Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pre-training.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026

2025
LikeBench: Evaluating Subjective Likability in LLMs for Personalization.
CoRR, December, 2025

2024
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training.
CoRR, 2024



  Loading...