Tong Zhang

CoRR, 2020

Multi-modal AsynDGAN: Learn From Distributed Medical Image Data without Sharing Private Information.

[BibT_eX]

[DOI]

CoRR, 2020

VEGA: Towards an End-to-End Configurable AutoML Pipeline.

[BibT_eX]

[DOI]

CoRR, 2020

Propagation Model Search for Graph Neural Networks.

[BibT_eX]

[DOI]

Yuhui Ding

Quanming Yao

CoRR, 2020

Disentangled Generative Causal Representation Learning.

[BibT_eX]

[DOI]

CoRR, 2020

CorrAttack: Black-box Adversarial Attack with Structured Search.

[BibT_eX]

[DOI]

Zhichao Huang

Yaowei Huang

CoRR, 2020

Multi-consensus Decentralized Accelerated Gradient Descent.

[BibT_eX]

[DOI]

CoRR, 2020

Bidirectional Generative Modeling Using Adversarial Gradient Estimation.

[BibT_eX]

[DOI]

Xinwei Shen

Kani Chen

CoRR, 2020

Mean-Field Analysis of Two-Layer Neural Networks: Non-Asymptotic Rates and Generalization Bounds.

[BibT_eX]

[DOI]

CoRR, 2020

Stochastic Recursive Gradient Descent Ascent for Stochastic Nonconvex-Strongly-Concave Minimax Problems.

[BibT_eX]

[DOI]

Luo Luo

Haishan Ye

CoRR, 2020

Decentralized Accelerated Proximal Gradient Descent.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Bridging the Gap between Sample-based and One-shot Neural Architecture Search with BONAS.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Stochastic Recursive Gradient Descent Ascent for Stochastic Nonconvex-Strongly-Concave Minimax Problems.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Residual Distillation: Towards Portable Deep Neural Networks without Shortcuts.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Model Rubik's Cube: Twisting Resolution, Depth and Width for TinyNets.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

How to Characterize The Landscape of Overparameterized Convolutional Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

A Generalized Neural Tangent Kernel Analysis for Two-layer Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Stable Learning via Differentiated Variable Decorrelation.

[BibT_eX]

[DOI]

Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Guided Learning of Nonconvex Models through Successive Functional Gradient Optimization.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Black-Box Adversarial Attack with Transferable Model-based Embedding.

[BibT_eX]

[DOI]

Zhichao Huang

Proceedings of the 8th International Conference on Learning Representations, 2020

Improving Constituency Parsing with Span Attention.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

ZEN: Pre-training Chinese Text Encoder Enhanced by N-gram Representations.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

CATCH: Context-Based Meta Reinforcement Learning for Transferrable Architecture Search.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

MiLeNAS: Efficient Neural Architecture Search via Mixed-Level Reformulation.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Synthetic Learning: Learn From Distributed Asynchronized Discriminator GAN Without Sharing Medical Image Data.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Improving Chinese Word Segmentation with Wordhood Memory Networks.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Joint Chinese Word Segmentation and Part-of-speech Tagging via Two-way Attentions of Auto-analyzed Knowledge.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Stable Learning via Sample Reweighting.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Utilizing Second Order Information in Minibatch Stochastic Variance Reduced Proximal Iterations.

[BibT_eX]

[DOI]

Jialei Wang

J. Mach. Learn. Res., 2019

Layer-Wise Learning Strategy for Nonparametric Tensor Product Smoothing Spline Regression and Graphical Models.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2019

Robust Frequent Directions with Application in Online Learning.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2019

Picasso: A Sparse Learning Library for High Dimensional Data Analysis in R and Python.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2019

Fast Generalized Matrix Regression with Applications in Machine Learning.

[BibT_eX]

[DOI]

CoRR, 2019

Multi-objective Neural Architecture Search via Predictive Network Performance Optimization.

[BibT_eX]

[DOI]

CoRR, 2019

Over Parameterized Two-level Neural Networks Can Learn Near Optimal Feature Representations.

[BibT_eX]

[DOI]

Cong Fang

Hanze Dong

CoRR, 2019

Mirror Natural Evolution Strategies.

[BibT_eX]

[DOI]

Haishan Ye

CoRR, 2019

DeepSqueeze: Parallel Stochastic Gradient Descent with Double-Pass Error-Compensated Compression.

[BibT_eX]

[DOI]

CoRR, 2019

DoubleSqueeze: Parallel Stochastic Gradient Descent with Double-Pass Error-Compensated Compression.

[BibT_eX]

[DOI]

CoRR, 2019

MAP Inference via L2-Sphere Linear Program Reformulation.

[BibT_eX]

[DOI]

CoRR, 2019

Tencent ML-Images: A Large-Scale Multi-Label Image Database for Visual Representation Learning.

[BibT_eX]

[DOI]

CoRR, 2019

Graph-guided multi-task sparse learning model: a method for identifying antigenic variants of influenza A(H3N2) virus.

[BibT_eX]

[DOI]

Bioinform., 2019

Tencent ML-Images: A Large-Scale Multi-Label Image Database for Visual Representation Learning.

[BibT_eX]

[DOI]

IEEE Access, 2019

Divergence-Augmented Policy Optimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

DoubleSqueeze: Parallel Stochastic Gradient Descent with Double-pass Error-Compensated Compression.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

NATTACK: Learning the Distributions of Adversarial Examples for an Improved Black-Box Attack on Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

Grid-Wise Control for Multi-Agent Reinforcement Learning in Video Game AI.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

DHER: Hindsight Experience Replay for Dynamic Goals.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Efficient Decision-Based Black-Box Adversarial Attacks on Face Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Sharp Analysis for Nonconvex SGD Escaping from Saddle Points.

[BibT_eX]

[DOI]

Cong Fang

Zhouchen Lin

Proceedings of the Conference on Learning Theory, 2019

Sentiment Analysis Using Autoregressive Language Modeling and Broad Learning System.

[BibT_eX]

[DOI]

Xin-Rong Gong

Jian-Xiu Jin

Proceedings of the 2019 IEEE International Conference on Bioinformatics and Biomedicine, 2019

Reinforced Training Data Selection for Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Neural Machine Translation with Adequacy-Oriented Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Dynamic Layer Aggregation for Neural Machine Translation with Routing-by-Agreement.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Bayesian Model Averaging With Exponentiated Least Squares Loss.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2018

Learning to Remember Translation History with a Continuous Cache.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2018

Near-optimal stochastic approximation for online principal component estimation.

[BibT_eX]

[DOI]

Math. Program., 2018

Hessian-Aware Zeroth-Order Optimization for Black-Box Adversarial Attack.

[BibT_eX]

[DOI]

CoRR, 2018

Finite-Sample Analyses for Fully Decentralized Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2018

Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space.

[BibT_eX]

[DOI]

CoRR, 2018

Fully Implicit Online Learning.

[BibT_eX]

[DOI]

CoRR, 2018

TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game.

[BibT_eX]

[DOI]

CoRR, 2018

A convex formulation for high-dimensional sparse sliced inverse regression.

[BibT_eX]

[DOI]

CoRR, 2018

Diffusion Approximations for Online Principal Component Estimation and Global Convergence.

[BibT_eX]

[DOI]

CoRR, 2018

Incorporating Pseudo-Parallel Data for Quantifiable Sequence Editing.

[BibT_eX]

[DOI]

CoRR, 2018

Decentralization Meets Quantization.

[BibT_eX]

[DOI]

CoRR, 2018

Fine-grained Video Attractiveness Prediction Using Multimodal Deep Learning on a Large Real-world Dataset.

[BibT_eX]

[DOI]

Proceedings of the Companion of the The Web Conference 2018 on The Web Conference 2018, 2018

Gradient Sparsification for Communication-Efficient Distributed Optimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Exponentially Weighted Imitation Learning for Batched Historical Data.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Communication Compression for Decentralized Training.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Stochastic Primal-Dual Method for Empirical Risk Minimization with O(1) Per-Iteration Complexity.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

SPIDER: Near-Optimal Non-Convex Optimization via Stochastic Path-Integrated Differential Estimator.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Stochastic Expectation Maximization with Variance Reduction.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Sketched Follow-The-Regularized-Leader for Online Factorization Machine.

[BibT_eX]

[DOI]

Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Safe Element Screening for Submodular Function Minimization.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Error Compensated Quantized SGD and its Applications to Large-scale Distributed Optimization.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Graphical Nonconvex Optimization via an Adaptive Convex Relaxation.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

An Algorithmic Framework of Variable Metric Over-Relaxed Hybrid Proximal Extra-Gradient Method.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

End-to-end Active Object Tracking via Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Composite Functional Gradient Learning of Generative Adversarial Models.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Candidates vs. Noises Estimation for Large Multi-Class Classification Problem.

[BibT_eX]

[DOI]

Yiheng Huang

Proceedings of the 35th International Conference on Machine Learning, 2018

Modeling Localness for Self-Attention Networks.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

QuaSE: Sequence Editing under Quantifiable Guidance.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Multi-Head Attention with Disagreement Regularization.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Exploiting Deep Representations for Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Super-Identity Convolutional Neural Network for Face Hallucination.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Orthogonal Deep Features Decomposition for Age-Invariant Face Recognition.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Modeling Varying Camera-IMU Time Offset in Optimization-Based Visual-Inertial Odometry.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Recurrent Fusion Network for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Neural Stereoscopic Image Style Transfer.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Video Re-localization.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Translating Pro-Drop Languages With Reconstruction Models.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Sparseness Analysis in the Pretraining of Deep Neural Networks.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2017

Hierarchical Contextual Attention Recurrent Neural Network for Map Query Suggestion.

[BibT_eX]

[DOI]

Zhongfei (Mark) Zhang

Wenwu Zhu

IEEE Trans. Knowl. Data Eng., 2017

A General Distributed Dual Coordinate Optimization Framework for Regularized Loss Minimization.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2017

Gradient Hard Thresholding Pursuit.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2017

Candidates v.s. Noises Estimation for Large Multi-Class Classification Problem.

[BibT_eX]

[DOI]

CoRR, 2017

Improved Optimization of Finite Sums with Minibatch Stochastic Variance Reduced Proximal Iterations.

[BibT_eX]

[DOI]

Jialei Wang

CoRR, 2017

On Quadratic Convergence of DC Proximal Newton Algorithm for Nonconvex Sparse Learning in High Dimensions.

[BibT_eX]

[DOI]

CoRR, 2017

On Quadratic Convergence of DC Proximal Newton Algorithm in Nonconvex Sparse Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Diffusion Approximations for Online Principal Component Estimation and Global Convergence.

[BibT_eX]

[DOI]

Chris Junchi Li

Mengdi Wang

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Projection-free Distributed Online Learning in Networks.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Machine Learning, 2017

Efficient Distributed Learning with Sparsity.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Machine Learning, 2017

Deep Pyramid Convolutional Neural Networks for Text Categorization.

[BibT_eX]

[DOI]

Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016

Accelerated proximal stochastic dual coordinate ascent for regularized loss minimization.

[BibT_eX]

[DOI]

Math. Program., 2016

Towards More Efficient SPSD Matrix Approximation and CUR Matrix Decomposition.

[BibT_eX]

[DOI]

Shusen Wang

Zhihua Zhang

J. Mach. Learn. Res., 2016

A General Distributed Dual Coordinate Optimization Framework for Regularized Loss Minimization.

[BibT_eX]

[DOI]

CoRR, 2016

Convolutional Neural Networks for Text Categorization: Shallow Word-level vs. Deep Character-level.

[BibT_eX]

[DOI]

CoRR, 2016

Supervised and Semi-Supervised Text Categorization using One-Hot LSTM for Region Embeddings.

[BibT_eX]

[DOI]

CoRR, 2016

Local Uncertainty Sampling for Large-Scale Multi-Class Logistic Regression.

[BibT_eX]

[DOI]

Ting Yang

CoRR, 2016

Learning Additive Exponential Family Graphical Models via \ell_{2, 1}-norm Regularized M-Estimation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Exact Recovery of Hard Thresholding Pursuit.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Fast Component Pursuit for Large-Scale Inverse Covariance Estimation.

[BibT_eX]

[DOI]

Yu Zhang

Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Generalized Hierarchical Sparse Model for Arbitrary-Order Interactive Antigenic Sites Identification in Flu Virus Data.

[BibT_eX]

[DOI]

Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Sparse Nonlinear Regression: Parameter Estimation under Nonconvexity.

[BibT_eX]

[DOI]

Proceedings of the 33nd International Conference on Machine Learning, 2016

Supervised and Semi-Supervised Text Categorization using LSTM for Region Embeddings.

[BibT_eX]

[DOI]

Proceedings of the 33nd International Conference on Machine Learning, 2016

2015

Fundamentals of Predictive Text Mining, Second Edition

[BibT_eX]

[DOI]

Sholom M. Weiss

Nitin Indurkhya

Texts in Computer Science, Springer, ISBN: 978-1-4471-6750-1, 2015

Learning sparse low-threshold linear classifiers.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2015

Sparse Nonlinear Regression: Parameter Estimation and Asymptotic Inference.

[BibT_eX]

[DOI]

CoRR, 2015

Improved Analyses of the Randomized Power Method and Block Lanczos Method.

[BibT_eX]

[DOI]

Shusen Wang

Zhihua Zhang

CoRR, 2015

Towards More Efficient Nystrom Approximation and CUR Matrix Decomposition.

[BibT_eX]

[DOI]

Shusen Wang

Zhihua Zhang

CoRR, 2015

Semi-Supervised Learning with Multi-View Embedding: Theory and Application with Convolutional Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2015

Local Smoothness in Variance Reduced Optimization.

[BibT_eX]

[DOI]

Daniel Vainsencher

Han Liu

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Quartz: Randomized Dual Coordinate Ascent with Arbitrary Sampling.

[BibT_eX]

[DOI]

Zheng Qu

Peter Richtárik

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Semi-supervised Convolutional Neural Networks for Text Categorization via Region Embedding.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Effective Use of Word Order for Text Categorization with Convolutional Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Stochastic Optimization with Importance Sampling for Regularized Loss Minimization.

[BibT_eX]

[DOI]

Peilin Zhao

Proceedings of the 32nd International Conference on Machine Learning, 2015

Adaptive Stochastic Alternating Direction Method of Multipliers.

[BibT_eX]

[DOI]

Proceedings of the 32nd International Conference on Machine Learning, 2015

2014

Partial Gaussian Graphical Model Estimation.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2014

A Proximal Stochastic Gradient Method with Progressive Variance Reduction.

[BibT_eX]

[DOI]

Lin Xiao

SIAM J. Optim., 2014

Learning Nonlinear Functions Using Regularized Greedy Forest.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2014

Pathwise Coordinate Optimization for Sparse Learning: Algorithm and Theory.

[BibT_eX]

[DOI]

Tuo Zhao

Han Liu

CoRR, 2014

Randomized Dual Coordinate Ascent with Arbitrary Sampling.

[BibT_eX]

[DOI]

Zheng Qu

Peter Richtárik

CoRR, 2014

Sparse Recovery with Very Sparse Compressed Counting.

[BibT_eX]

[DOI]

Cun-Hui Zhang

CoRR, 2014

Batch-Mode Active Learning via Error Bound Minimization.

[BibT_eX]

[DOI]

Quanquan Gu

Jiawei Han

Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence, 2014

Efficient mini-batch training for stochastic optimization.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Gradient Hard Thresholding Pursuit for Sparsity-Constrained Optimization.

[BibT_eX]

[DOI]

Xiaotong Yuan

Proceedings of the 31th International Conference on Machine Learning, 2014

A Convergence Rate Analysis for LogitBoost, MART and Their Variant.

[BibT_eX]

[DOI]

Peng Sun

Jie Zhou

Proceedings of the 31th International Conference on Machine Learning, 2014

Communication-Efficient Distributed Optimization using an Approximate Newton-type Method.

[BibT_eX]

[DOI]

Ohad Shamir

Nathan Srebro

Proceedings of the 31th International Conference on Machine Learning, 2014

Compressed Counting Meets Compressed Sensing.

[BibT_eX]

[DOI]

Cun-Hui Zhang

Proceedings of The 27th Conference on Learning Theory, 2014

2013

A Proximal-Gradient Homotopy Method for the Sparse Least-Squares Problem.

[BibT_eX]

[DOI]

Lin Xiao

SIAM J. Optim., 2013

Truncated power method for sparse eigenvalue problems.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2013

Stochastic dual coordinate ascent methods for regularized loss.

[BibT_eX]

[DOI]

Krishnakumar Balasubramanian

J. Mach. Learn. Res., 2013

Aggregation of Affine Estimators.

[BibT_eX]

[DOI]

CoRR, 2013

High-dimensional Joint Sparsity Random Effects Model for Multi-task Learning.

[BibT_eX]

[DOI]

Kai Yu

Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence, 2013

Accelerated Mini-Batch Stochastic Dual Coordinate Ascent.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Accelerating Stochastic Gradient Descent using Predictive Variance Reduction.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Stochastic Gradient Descent for Non-smooth Optimization: Convergence Results and Optimal Averaging Schemes.

[BibT_eX]

[DOI]

Ohad Shamir

Proceedings of the 30th International Conference on Machine Learning, 2013

2012

Random Design Analysis of Ridge Regression.

[BibT_eX]

[DOI]

Proceedings of the COLT 2012, 2012

A spectral algorithm for learning Hidden Markov Models.

[BibT_eX]

[DOI]

J. Comput. Syst. Sci., 2012

Analysis of a randomized approximation scheme for matrix multiplication

[BibT_eX]

[DOI]

CoRR, 2012

Proximal Stochastic Dual Coordinate Ascent

[BibT_eX]

[DOI]

CoRR, 2012

Stochastic Dual Coordinate Ascent Methods for Regularized Loss Minimization

[BibT_eX]

[DOI]

CoRR, 2012

Deviation Optimal Learning using Greedy Q-aggregation

[BibT_eX]

[DOI]

Dong Dai

Philippe Rigollet

CoRR, 2012

AntigenMap 3D: an online antigenic cartography resource.

[BibT_eX]

[DOI]

Bioinform., 2012

Selective Labeling via Error Bound Minimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

A Proximal-Gradient Homotopy Method for the L1-Regularized Least-Squares Problem.

[BibT_eX]

[DOI]

Lin Xiao

Proceedings of the 29th International Conference on Machine Learning, 2012

2011

Sparse Recovery With Orthogonal Matching Pursuit Under RIP.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2011

Adaptive Forward-Backward Greedy Algorithm for Learning Sparse Representations.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2011

Robust Matrix Decomposition With Sparse Corruptions.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2011

Integrative Analysis of Many Weighted Co-Expression Networks Using Tensor Computation.

[BibT_eX]

[DOI]

Xianghong Jasmine Zhou

PLoS Comput. Biol., 2011

Learning with Structured Sparsity.

[BibT_eX]

[DOI]

Junzhou Huang

Dimitris N. Metaxas

J. Mach. Learn. Res., 2011

A tail inequality for quadratic forms of subgaussian random vectors

[BibT_eX]

[DOI]

CoRR, 2011

An Analysis of Random Design Linear Regression

[BibT_eX]

[DOI]

CoRR, 2011

Dimension-free tail inequalities for sums of random matrices.

[BibT_eX]

[DOI]

CoRR, 2011

Efficient Optimal Learning for Contextual Bandits.

[BibT_eX]

[DOI]

Proceedings of the UAI 2011, 2011

Learning to Search Efficiently in High Dimensions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Greedy Model Averaging.

[BibT_eX]

[DOI]

Dong Dai

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Spectral Methods for Learning Multivariate Latent Tree Structure.

[BibT_eX]

[DOI]

Animashree Anandkumar

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

2010

Fundamentals of Predictive Text Mining.

[BibT_eX]

[DOI]

Sholom M. Weiss

Nitin Indurkhya

Texts in Computer Science 41, Springer, ISBN: 978-1-84996-226-1, 2010

Trading Accuracy for Sparsity in Optimization Problems with Sparsity Constraints.

[BibT_eX]

[DOI]

Nathan Srebro

SIAM J. Optim., 2010

A Computational Framework for Influenza Antigenic Cartography.

[BibT_eX]

[DOI]

Zhipeng Cai

Xiu-Feng Wan

PLoS Comput. Biol., 2010

Analysis of Multi-stage Convex Relaxation for Sparse Regularization.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2010

Robust Matrix Decomposition with Outliers

[BibT_eX]

[DOI]

CoRR, 2010

Deep Coding Network.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Agnostic Active Learning Without Constraints.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Improved Local Coordinate Coding using Local Tangents.

[BibT_eX]

[DOI]

Kai Yu

Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Image Classification Using Super-Vector Coding of Local Image Descriptors.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2010, 2010

2009

Classifying search queries using the Web as a source of knowledge.

[BibT_eX]

[DOI]

ACM Trans. Web, 2009

On the Consistency of Feature Selection using Greedy Least Squares Regression.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2009

Sparse Online Learning via Truncated Gradient.

[BibT_eX]

[DOI]

John Langford

Lihong Li

J. Mach. Learn. Res., 2009

Nonlinear Learning using Local Coordinate Coding.

[BibT_eX]

[DOI]

Kai Yu

Yihong Gong

Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Multi-Label Prediction via Compressed Sensing.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Learning nonlinear dynamic models.

[BibT_eX]

[DOI]

John Langford

Ruslan Salakhutdinov

Proceedings of the 26th Annual International Conference on Machine Learning, 2009

2008

Graph-Based Semi-Supervised Learning and Spectral Kernel Design.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2008

Statistical Analysis of Bayes Optimal Subset Ranking.

[BibT_eX]

[DOI]

David Cossock

IEEE Trans. Inf. Theory, 2008

An Online Relevant Set Algorithm for Statistical Machine Translation.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2008

Multi-stage Convex Relaxation for Learning with Sparse Regularization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Adaptive Forward-Backward Greedy Algorithm for Sparse Learning with Linear Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 21, 2008

2007

A block bigram prediction model for statistical machine translation.

[BibT_eX]

[DOI]

ACM Trans. Speech Lang. Process., 2007

On the Effectiveness of Laplacian Normalization for Graph Semi-supervised Learning.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2007

Robust classification of rare queries using web knowledge.

[BibT_eX]

[DOI]

Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

A General Boosting Method and its Application to Learning Ranking Functions for Web Search.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 20, 2007

The Epoch-Greedy Algorithm for Multi-armed Bandits with Side Information.

[BibT_eX]

[DOI]

John Langford

Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Two-view feature generation model for semi-supervised learning.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning, 2007

Margin Based Active Learning.

[BibT_eX]

[DOI]

Maria-Florina Balcan

Andrei Z. Broder

Proceedings of the Learning Theory, 20th Annual Conference on Learning Theory, 2007

2006

Information-theoretic upper and lower bounds for statistical estimation.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2006

Learning on Graph with Laplacian Regularization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Linear prediction models with graph regularization for web-page categorization.

[BibT_eX]

[DOI]

Alexandrin Popescul

Byron Dom

Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Subset Ranking Using Regression.

[BibT_eX]

[DOI]

David Cossock

Proceedings of the Learning Theory, 19th Annual Conference on Learning Theory, 2006

Effectiveness of Meeting Outcomes in Virtual vs. Face-to-Face Teams: A Comparison Study in China.

[BibT_eX]

[DOI]

Proceedings of the Connecting the Americas. 12th Americas Conference on Information Systems, 2006

A Discriminative Global Training Algorithm for Statistical MT.

[BibT_eX]

[DOI]

Proceedings of the ACL 2006, 2006

2005

Learning Bounds for Kernel Regression Using Effective Data Dimensionality.

[BibT_eX]

[DOI]

Neural Comput., 2005

A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2005

TREC 2005 Genomics Track Experiments at IBM Watson.

[BibT_eX]

[DOI]

Mark Dredze

Proceedings of the Fourteenth Text REtrieval Conference, 2005

Analysis of Spectral Kernel Design based Semi-supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005

Localized Upper and Lower Bounds for Some Estimation Problems.

[BibT_eX]

[DOI]

Proceedings of the Learning Theory, 18th Annual Conference on Learning Theory, 2005

Data Dependent Concentration Bounds for Sequential Prediction Algorithms.

[BibT_eX]

[DOI]

Proceedings of the Learning Theory, 18th Annual Conference on Learning Theory, 2005

A Localized Prediction Model for Statistical Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the ACL 2005, 2005

A High-Performance Semi-Supervised Learning Method for Text Chunking.

[BibT_eX]

[DOI]

Proceedings of the ACL 2005, 2005

2004

Statistical Analysis of Some Multi-Category Large Margin Classification Methods.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2004

Text categorization for a comprehensive time-dependent benchmark.

[BibT_eX]

[DOI]

Inf. Process. Manag., 2004

Focused named entity recognition using machine learning.

[BibT_eX]

[DOI]

Li Zhang

Yue Pan

Proceedings of the SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2004

Class-size Independent Generalization Analsysis of Some Discriminative Multi-Category Classification.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004

Support Vector Classification with Input Data Uncertainty.

[BibT_eX]

[DOI]

Jinbo Bi

Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004

Column-generation boosting methods for mixture of kernels.

[BibT_eX]

[DOI]

Jinbo Bi

Kristin P. Bennett

Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

Chinese Named Entity Recognition Based on Multilevel Linguistic Features.

[BibT_eX]

[DOI]

Proceedings of the Natural Language Processing, 2004

Solving large scale linear prediction problems using stochastic gradient descent algorithms.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning, 2004

On the Convergence of MDL Density Estimation.

[BibT_eX]

[DOI]

Proceedings of the Learning Theory, 17th Annual Conference on Learning Theory, 2004

2003

Sequential greedy approximation for certain convex optimization problems.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2003

Leave-One-Out Bounds for Kernel Methods.

[BibT_eX]

[DOI]

Neural Comput., 2003

Generalization Error Bounds for Bayesian Mixture Algorithms.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2003

Greedy Algorithms for Classification -- Consistency, Convergence Rates, and Adaptivity.

[BibT_eX]

[DOI]

Shie Mannor

J. Mach. Learn. Res., 2003

Learning Bounds for a Generalized Family of Bayesian Posterior Distributions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 16 [Neural Information Processing Systems, 2003

An Infinity-sample Theory for Multi-category Large Margin Classification.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 16 [Neural Information Processing Systems, 2003

On the Convergence of Boosting Procedures.

[BibT_eX]

[DOI]

Bin Yu

Proceedings of the Machine Learning, 2003

HowtogetaChineseName(Entity): Segmentation and Combination Issues.

[BibT_eX]

[DOI]

Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2003

Named Entity Recognition through Classifier Combination.

[BibT_eX]

[DOI]

Proceedings of the Seventh Conference on Natural Language Learning, 2003

A Robust Risk Minimization based Named Entity Recognition System.

[BibT_eX]

[DOI]

Proceedings of the Seventh Conference on Natural Language Learning, 2003

Updating an NLP system to fit new domains: an empirical study on the sentence segmentation problem.

[BibT_eX]

[DOI]

Fred Damerau

Proceedings of the Seventh Conference on Natural Language Learning, 2003

2002

Two-Sided Arnoldi and Nonsymmetric Lanczos Algorithms.

[BibT_eX]

[DOI]

Jane Cullum

SIAM J. Matrix Anal. Appl., 2002

Approximation Bounds for Some Sparse Kernel Regression Algorithms.

[BibT_eX]

[DOI]

Neural Comput., 2002

On the Dual Formulation of Regularized Linear Systems with Convex Risks.

[BibT_eX]

[DOI]

Mach. Learn., 2002

Recommender Systems Using Linear Classifier.

[BibT_eX]

[DOI]

Vijay S. Iyengar

J. Mach. Learn. Res., 2002

Text Chunking based on a Generalization of Winnow.

[BibT_eX]

[DOI]

Fred Damerau

J. Mach. Learn. Res., 2002

Covering Number Bounds of Certain Regularized Linear Function Classes.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2002

On the Consistency of Instantaneous Rigid Motion Estimation.

[BibT_eX]

[DOI]

Carlo Tomasi

Int. J. Comput. Vis., 2002

A decision-tree-based symbolic rule induction system for text categorization.

[BibT_eX]

[DOI]

IBM Syst. J., 2002

Experiments in high-dimensional text categorization.

[BibT_eX]

[DOI]

Proceedings of the SIGIR 2002: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002

Effective Dimension and Generalization of Kernel Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 15 [Neural Information Processing Systems, 2002

Data-Dependent Bounds for Bayesian Mixture Methods.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 15 [Neural Information Processing Systems, 2002

Statistical Behavior and Consistency of Support Vector Machines, Boosting, and Beyond.

[BibT_eX]

Proceedings of the Machine Learning, 2002

The Consistency of Greedy Algorithms for Classification.

[BibT_eX]

[DOI]

Shie Mannor

Proceedings of the Computational Learning Theory, 2002

2001

Rank-One Approximation to High Order Tensors.

[BibT_eX]

[DOI]

Gene H. Golub

SIAM J. Matrix Anal. Appl., 2001

Text Categorization Based on Regularized Linear Classification Methods.

[BibT_eX]

[DOI]

Frank J. Oles

Inf. Retr., 2001

An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods.

[BibT_eX]

[DOI]

AI Mag., 2001

Empirical Study of Recommender Systems Using Linear Classifiers.

[BibT_eX]

[DOI]

Vijay S. Iyengar

Proceedings of the Knowledge Discovery and Data Mining, 2001

A General Greedy Approximation Algorithm with Applications.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001

Generalization Performance of Some Learning Problems in Hilbert Functional Spaces.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001

Some Sparse Approximation Bounds for Regression Problems.

[BibT_eX]

Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001

A Leave-One-out Cross Validation Bound for Kernel Methods with Applications in Learning.

[BibT_eX]

[DOI]

Proceedings of the Computational Learning Theory, 2001

A Sequential Approximation Bound for Some Sample-Dependent Convex Optimization Problems with Applications in Learning.

[BibT_eX]

[DOI]

Proceedings of the Computational Learning Theory, 2001

Text Chunking using Regularized Winnow.

[BibT_eX]

[DOI]

Fred Damerau

Proceedings of the Association for Computational Linguistic, 2001

2000

Regularized Winnow Methods.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 13, 2000

Convergence of Large Margin Separable Linear Classification.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 13, 2000

Active learning using adaptive resampling.

[BibT_eX]

[DOI]

Vijay S. Iyengar

Chidanand Apté

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, 2000

1999

Some Theoretical Results Concerning the Convergence of Compositions of Regularized Linear Functions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 12, [NIPS Conference, Denver, Colorado, USA, November 29, 1999

Fast, Robust, and Consistent Camera Motion Estimation.

[BibT_eX]

[DOI]

Carlo Tomasi

Proceedings of the 1999 Conference on Computer Vision and Pattern Recognition (CVPR '99), 1999

Theoretical Analysis of a Class of Randomized Regularization Methods.

[BibT_eX]

[DOI]

Proceedings of the Twelfth Annual Conference on Computational Learning Theory, 1999

1998

Methods for computational and statistical estimation with applications.

[BibT_eX]

[DOI]

PhD thesis, 1998

On the Homotopy Method for Perturbed Symmetric Generalized Eigenvalue Problems.

[BibT_eX]

[DOI]

Kincho H. Law

Gene H. Golub

SIAM J. Sci. Comput., 1998

A Linear Algorithm for Optimal Context Clustering with Application to Bi-level Image Coding.

[BibT_eX]

[DOI]

Daniel H. Greene

F. Frances Yao

Proceedings of the 1998 IEEE International Conference on Image Processing, 1998

Compression by Model Combination.

[BibT_eX]

[DOI]

Proceedings of the Data Compression Conference, 1998

1997

A progressive Ziv-Lempel algorithm for image compression.

[BibT_eX]

[DOI]

Proceedings of the Compression and Complexity of SEQUENCES 1997, 1997

1996

Optimal Surface Smoothing as Filter Design.

[BibT_eX]

[DOI]

Gabriel Taubin

Gene H. Golub

Proceedings of the Computer Vision, 1996

1995

Densities of Self-Similar Measures on the Line.

[BibT_eX]

[DOI]

Robert S. Strichartz

Arthur Taylor