Taiji Suzuki

CoRR, 2019

Approximation and non-parametric estimation of ResNet-type convolutional neural networks.

[BibT_eX]

[DOI]

Kenta Oono

Proceedings of the 36th International Conference on Machine Learning, 2019

Adaptivity of deep ReLU network for learning in Besov and mixed smooth Besov spaces: optimal rate and curse of dimensionality.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Sharp Characterization of Optimal Minibatch Size for Stochastic Finite Sum Convex Optimization.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

Understanding the Effects of Pre-Training for Object Detectors via Eigenspectrum.

[BibT_eX]

[DOI]

Yosuke Shinya

Edgar Simo-Serra

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Cross-Domain Recommendation via Deep Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Information Retrieval, 2019

Stochastic Gradient Descent with Exponential Convergence Rates of Expected Classification Errors.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018

Generalized ridge estimator and model selection criteria in multivariate linear regression.

[BibT_eX]

[DOI]

Yuichi Mori

J. Multivar. Anal., 2018

Spectral-Pruning: Compressing deep neural network via spectral analysis.

[BibT_eX]

[DOI]

CoRR, 2018

Sample Efficient Stochastic Gradient Iterative Hard Thresholding Method for Stochastic Sparse Linear Regression with Limited Attribute Observation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Adam Induces Implicit Weight Sparsity in Rectifier Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 17th IEEE International Conference on Machine Learning and Applications, 2018

Functional Gradient Boosting based on Residual Network Perception.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Short-term local weather forecast using dense weather station by deep neural network.

[BibT_eX]

[DOI]

Kazuo Yonekura

Hitoshi Hattori

Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

Independently Interpretable Lasso: A New Regularizer for Sparse Regression with Uncorrelated Variables.

[BibT_eX]

[DOI]

Masaaki Takada

Hironori Fujisawa

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

Fast generalization error bound of deep learning from a kernel perspective.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

Gradient Layer: Enhancing the Convergence of Adversarial Training for Generative Models.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017

Stochastic Particle Gradient Descent for Infinite Ensembles.

[BibT_eX]

[DOI]

CoRR, 2017

Fast learning rate of deep learning via a kernel perspective.

[BibT_eX]

[DOI]

CoRR, 2017

Doubly Accelerated Stochastic Variance Reduced Dual Averaging Method for Regularized Empirical Risk Minimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Trimmed Density Ratio Estimation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Stochastic Difference of Convex Algorithm and its Application to Training Deep Boltzmann Machines.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

2016

System identification and parameter estimation in mathematical medicine: examples demonstrated for prostate cancer.

[BibT_eX]

[DOI]

Quant. Biol., 2016

Stochastic dual averaging methods using variance reduction techniques for regularized empirical risk minimization problems.

[BibT_eX]

[DOI]

CoRR, 2016

Minimax Optimal Alternating Minimization for Kernel Nonparametric Tensor Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Structure Learning of Partitioned Markov Networks.

[BibT_eX]

[DOI]

Proceedings of the 33nd International Conference on Machine Learning, 2016

Gaussian process nonparametric tensor estimator and its minimax optimality.

[BibT_eX]

[DOI]

Proceedings of the 33nd International Conference on Machine Learning, 2016

2015

Convergence rate of Bayesian tensor estimator and its minimax optimality.

[BibT_eX]

[DOI]

Proceedings of the 32nd International Conference on Machine Learning, 2015

A Consistent Method for Graph Based Anomaly Localization.

[BibT_eX]

[DOI]

Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

Support Consistency of Direct Sparse-Change Learning in Markov Networks.

[BibT_eX]

[DOI]

Song Liu

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014

Direct Learning of Sparse Changes in Markov Networks by Density Ratio Estimation.

[BibT_eX]

[DOI]

Neural Comput., 2014

Convergence rate of Bayesian tensor estimator: Optimal rate without restricted strong convexity.

[BibT_eX]

[DOI]

CoRR, 2014

Stochastic Dual Coordinate Ascent with Alternating Direction Method of Multipliers.

[BibT_eX]

[DOI]

Proceedings of the 31th International Conference on Machine Learning, 2014

2013

Sufficient Dimension Reduction via Squared-Loss Mutual Information Estimation.

[BibT_eX]

[DOI]

Neural Comput., 2013

Computational complexity of kernel-based density-ratio estimation: a condition number analysis.

[BibT_eX]

[DOI]

Mach. Learn., 2013

Improvement of multiple kernel learning using adaptively weighted regularization.

[BibT_eX]

[DOI]

JSIAM Lett., 2013

Conjugate relation between loss functions and uncertainty sets in classification problems.

[BibT_eX]

[DOI]

Akiko Takeda

J. Mach. Learn. Res., 2013

Direct Divergence Approximation between Probability Distributions and Its Applications in Machine Learning.

[BibT_eX]

[DOI]

Marthinus Christoffel du Plessis

Song Liu

J. Comput. Sci. Eng., 2013

Convex Tensor Decomposition via Structured Schatten Norm Regularization.

[BibT_eX]

[DOI]

Ryota Tomioka

Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Dual Averaging and Proximal Gradient Descent for Online Alternating Direction Multiplier Method.

[BibT_eX]

[DOI]

Proceedings of the 30th International Conference on Machine Learning, 2013

2012

f-Divergence Estimation and Two-Sample Homogeneity Test Under Semiparametric Density-Ratio Models.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2012

Statistical analysis of kernel-based least-squares density-ratio estimation.

[BibT_eX]

[DOI]

Mach. Learn., 2012

Fast Learning Rate of Multiple Kernel Learning: Trade-Off between Sparsity and Smoothness.

[BibT_eX]

[DOI]

Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

A Conjugate Property between Loss Functions and Uncertainty Sets in Classification Problems.

[BibT_eX]

[DOI]

Akiko Takeda

Proceedings of the COLT 2012, 2012

Density-Difference Estimation.

[BibT_eX]

[DOI]

Marthinus Christoffel du Plessis

Song Liu

Ichiro Takeuchi

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

PAC-Bayesian Bound for Gaussian Process Regression and Multiple Kernel Additive Model.

[BibT_eX]

[DOI]

Proceedings of the COLT 2012, 2012

Density Ratio Estimation in Machine Learning.

[BibT_eX]

[DOI]

Cambridge University Press, ISBN: 978-0-521-19017-6, 2012

2011

Direct density-ratio estimation with dimensionality reduction via least-squares hetero-distributional subspace search.

[BibT_eX]

[DOI]

Neural Networks, 2011

Least-squares two-sample test.

[BibT_eX]

[DOI]

Neural Networks, 2011

Least-Squares Independent Component Analysis.

[BibT_eX]

[DOI]

Neural Comput., 2011

SpicyMKL: a fast algorithm for Multiple Kernel Learning with thousands of kernels.

[BibT_eX]

[DOI]

Ryota Tomioka

Mach. Learn., 2011

Super-Linear Convergence of Dual Augmented Lagrangian Algorithm for Sparsity Regularized Estimation.

[BibT_eX]

[DOI]

Ryota Tomioka

J. Mach. Learn. Res., 2011

Least-Squares Independence Test.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2011

Relative Density-Ratio Estimation for Robust Distribution Comparison.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Statistical Performance of Convex Tensor Decomposition.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Unifying Framework for Fast Learning Rate of Non-Sparse Multiple Kernel Learning.

[BibT_eX]

[DOI]