Ian En-Hsu Yen

Affiliations:

University of Texas at Austin, Department of Computer Science

According to our database, Ian En-Hsu Yen authored at least 42 papers between 2013 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2025

SQ-format: A Unified Sparse-Quantized Hardware-friendly Data Format for LLMs.

CoRR, December 2025

MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding.

CoRR, 2024

2023

FP8-BERT: Post-Training Quantization for Transformer.

CoRR, 2023

2022

S4: a High-sparsity, High-performance AI Accelerator.

Ian En-Hsu Yen

Zhibin Xiao

Dongkuan Xu

CoRR, 2022

Towards ℓ1 Regularization for Deep Neural Networks: Model Sparsity Versus Task Difficulty.

Proceedings of the 9th IEEE International Conference on Data Science and Advanced Analytics, 2022

Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm.

Sanguthevar Rajasekaran

Hang Liu

Caiwen Ding

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm.

CoRR, 2021

Rethinking Network Pruning - under the Pre-train and Fine-tune Paradigm.

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

2020

Minimizing FLOPs to Learn Efficient Sparse Representations.

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

Temporal popularity prediction of locations for geographical placement of retail stores.

Knowl. Inf. Syst., 2019

Scalable Global Alignment Graph Kernel Using Random Features: From Node Embedding to Graph Embedding.

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Efficient Global String Kernel with Random Features: Beyond Counting Substructures.

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Temporal Structure Mining for Weakly Supervised Action Detection.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018

Sublinear-Time Learning and Inference for High-Dimensional Models.

Ian En-Hsu Yen

PhD thesis, 2018

Learning Tensor Latent Features.

CoRR, 2018

D2KE: From Distance to Kernel and Embedding.

CoRR, 2018

MixLasso: Generalized Mixed Regression via Convex Atomic-Norm Regularization.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Representer Point Selection for Explaining Deep Neural Networks.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Scalable Spectral Clustering Using Random Binning Features.

Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Loss Decomposition for Fast Learning in Large Output Spaces.

Ian En-Hsu Yen

Satyen Kale

Felix X. Yu

Daniel Niels Holtmann-Rice

Sanjiv Kumar

Pradeep Ravikumar

Proceedings of the 35th International Conference on Machine Learning, 2018

Word Mover's Embedding: From Word2Vec to Document Embedding.

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Random Warping Series: A Random Features Method for Time-Series Embedding.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017

PPDsparse: A Parallel Primal-Dual Sparse Method for Extreme Classification.

Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

Latent Feature Lasso.

Proceedings of the 34th International Conference on Machine Learning, 2017

Doubly Greedy Primal-Dual Coordinate Descent for Sparse Empirical Risk Minimization.

Proceedings of the 34th International Conference on Machine Learning, 2017

Scalable Convex Multiple Sequence Alignment via Entropy-Regularized Dual Decomposition.

Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

Greedy Direction Method of Multiplier for MAP Inference of Large Output Domain.

Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

2016

Large-scale Submodular Greedy Exemplar Selection with Structured Similarity Matrices.

Dmitry Malioutov

Abhishek Kumar

Ian En-Hsu Yen

Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016

Dual Decomposed Learning with Factorwise Oracle for Structural SVM of Large Output Domain.

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Revisiting Random Binning Features: Fast Convergence and Strong Parallelizability.

Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

A Convex Atomic-Norm Approach to Multiple Sequence Alignment and Motif Discovery.

Proceedings of the 33rd International Conference on Machine Learning, 2016

PD-Sparse: A Primal and Dual Sparse Approach to Extreme Multiclass and Multilabel Classification.

Proceedings of the 33rd International Conference on Machine Learning, 2016

Scalable Exemplar Clustering and Facility Location via Augmented Block Coordinate Descent with Column Generation.

Ian En-Hsu Yen

Dmitry Malioutov

Abhishek Kumar

Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

2015

Tackling the Achilles Heel of Social Networks: Influence Propagation based Language Model Smoothing.

Proceedings of the 24th International Conference on World Wide Web, 2015

Sparse Linear Programming via Primal and Dual Augmented Coordinate Descent.

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

A Dual Augmented Block Minimization Framework for Learning with Limited Memory.

Ian En-Hsu Yen

Shan-Wei Lin

Shou-De Lin

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

A Convex Exemplar-based Approach to MAD-Bayes Dirichlet Process Mixture Models.

Proceedings of the 32nd International Conference on Machine Learning, 2015

2014

Proximal Quasi-Newton for Computationally Intensive L1-regularized M-estimators.

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Sparse Random Feature Algorithm as Coordinate Descent in Hilbert Space.

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Constant Nullspace Strong Convexity and Fast Convergence of Proximal Methods under High-Dimensional Settings.

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

2013

Indexed block coordinate descent for large-scale linear classification with limited memory.

Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013
