Masashi Sugiyama

Orcid: 0000-0001-6658-6743

According to our database1, Masashi Sugiyama authored at least 552 papers between 1999 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
On the Robustness of Average Losses for Partial-Label Learning.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Reinforcement Learning with Options and State Representation.
CoRR, 2024

Learning with Noisy Foundation Models.
CoRR, 2024

VEC-SBM: Optimal Community Detection with Vectorial Edges Covariates.
CoRR, 2024

Generating Chain-of-Thoughts with a Direct Pairwise-Comparison Approach to Searching for the Most Promising Intermediate Thought.
CoRR, 2024

Reinforcement Learning from Bagged Reward: A Transformer-based Approach for Instance-Level Reward Redistribution.
CoRR, 2024

A General Framework for Learning from Weak Supervision.
CoRR, 2024

Direct Distillation between Different Domains.
CoRR, 2024

Appearance-Based Curriculum for Semi-Supervised Learning with Multi-Angle Unlabeled Data.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Overcoming Continuous Distribution Shifts: Challenges in Online Machine Learning.
Proceedings of the 12th International Winter Conference on Brain-Computer Interface, 2024

Thompson Sampling for Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Boundary-restricted metric learning.
Mach. Learn., December, 2023

Learning Intention-Aware Policies in Deep Reinforcement Learning.
Neural Comput., October, 2023

Class-Wise Denoising for Robust Learning Under Label Noise.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Positive-unlabeled classification under class-prior shift: a prior-invariant approach based on density ratio estimation.
Mach. Learn., March, 2023

Root cause estimation of faults in production processes: a novel approach inspired by approximate Bayesian computation.
Int. J. Prod. Res., March, 2023

Representation learning for continuous action spaces is beneficial for efficient policy learning.
Neural Networks, February, 2023

Learning With Proper Partial Labels.
Neural Comput., January, 2023

Learning with Complementary Labels Revisited: A Consistent Approach via Negative-Unlabeled Learning.
CoRR, 2023

Fixed-Budget Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit.
CoRR, 2023

Diversified Outlier Exposure for Out-of-Distribution Detection via Informative Extrapolation.
CoRR, 2023

Atom-Motif Contrastive Transformer for Molecular Property Prediction.
CoRR, 2023

Thompson Exploration with Best Challenger Rule in Best Arm Identification.
CoRR, 2023

Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks.
CoRR, 2023

Unified Risk Analysis for Weakly Supervised Learning.
CoRR, 2023

Diversity-enhancing Generative Network for Few-shot Hypothesis Adaptation.
CoRR, 2023

Combinatorial Pure Exploration of Multi-Armed Bandit with a Real Number Action Class.
CoRR, 2023

Making Binary Classification from Multiple Unlabeled Datasets Almost Free of Supervision.
CoRR, 2023

BadLabel: A Robust Perspective on Evaluating and Enhancing Label-noise Learning.
CoRR, 2023

Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations.
CoRR, 2023

Enriching Disentanglement: Definitions to Metrics.
CoRR, 2023

Analysis of Pleasantness Evoked by Various Airborne Ultrasound Tactile Stimuli Using Pairwise Comparisons and the Bradley-Terry Model.
CoRR, 2023

Enhancing Label Sharing Efficiency in Complementary-Label Learning with Label Augmentation.
CoRR, 2023

Assessing Vulnerabilities of Adversarial Learning Algorithm through Poisoning Attacks.
CoRR, 2023

Fairness Improves Learning from Noisily Labeled Long-Tailed Data.
CoRR, 2023

Asymptotically Optimal Thompson Sampling Based Policy for the Uniform Bandits and the Gaussian Bandits.
CoRR, 2023

Adapting to Continuous Covariate Shift via Online Density Ratio Estimation.
CoRR, 2023

Diversified Outlier Exposure for Out-of-Distribution Detection via Informative Extrapolation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Adapting to Continuous Covariate Shift via Online Density Ratio Estimation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Online (Multinomial) Logistic Bandit: Improved Regret and Constant Computation Cost.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Efficient Adversarial Contrastive Learning via Robustness-Aware Coreset Selection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Enhancing Adversarial Contrastive Learning via Adversarial Invariant Regularization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On the Overlooked Pitfalls of Weight Decay and How to Mitigate Them: A Gradient-Norm Perspective.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Class-Distribution-Aware Pseudo-Labeling for Semi-Supervised Multi-Label Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Binary Classification with Confidence Difference.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Generalizing Importance Weighting to A Universal Solver for Distribution Shift Problems.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Imitation Learning from Vague Feedback.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Distributional Pareto-Optimal Multi-Objective Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Category-theoretical Meta-analysis of Definitions of Disentanglement.
Proceedings of the International Conference on Machine Learning, 2023

Optimality of Thompson Sampling with Noninformative Priors for Pareto Bandits.
Proceedings of the International Conference on Machine Learning, 2023

GAT: Guided Adversarial Training with Pareto-optimal Auxiliary Tasks.
Proceedings of the International Conference on Machine Learning, 2023

Diversity-enhancing Generative Network for Few-shot Hypothesis Adaptation.
Proceedings of the International Conference on Machine Learning, 2023

Seeing Differently, Acting Similarly: Heterogeneously Observable Imitation Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Is the Performance of My Deep Network Too Good to Be True? A Direct Approach to Estimating the Bayes Error in Binary Classification.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Multi-Label Knowledge Distillation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Distribution Shift Matters for Knowledge Distillation with Webly Collected Images.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Audio Signal Enhancement with Learning from Positive and Unlabeled Data.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
NoiLin: Improving adversarial training and correcting stereotype of noisy labels.
Trans. Mach. Learn. Res., 2022

LocalDrop: A Hybrid Regularization for Deep Neural Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Centroid Estimation With Guaranteed Efficiency: A General Framework for Weakly Supervised Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Discovering diverse solutions in deep reinforcement learning by maximizing state-action-based mutual information.
Neural Networks, 2022

Deep learning, reinforcement learning, and world models.
Neural Networks, 2022

Neural Networks special issue on Artificial Intelligence and Brain Science.
Neural Networks, 2022

Learning from Noisy Pairwise Similarity and Unlabeled Data.
J. Mach. Learn. Res., 2022

Fast and Robust Rank Aggregation against Model Misspecification.
J. Mach. Learn. Res., 2022

Learning from Noisy Complementary Labels with Robust Loss Functions.
IEICE Trans. Inf. Syst., 2022

Improving imbalanced classification using near-miss instances.
Expert Syst. Appl., 2022

Audio Signal Enhancement with Learning from Positive and Unlabelled Data.
CoRR, 2022

Equivariant Disentangled Transformation for Domain Generalization under Combination Shift.
CoRR, 2022

Learning from Multiple Unlabeled Datasets with Partial Risk Regularization.
CoRR, 2022

The Survival Bandit Problem.
CoRR, 2022

Excess risk analysis for epistemic uncertainty with application to variational inference.
CoRR, 2022

Universal approximation property of invertible neural networks.
CoRR, 2022

On the Effectiveness of Adversarial Training against Backdoor Attacks.
CoRR, 2022

Adversarial Attacks and Defense for Non-Parametric Two-Sample Tests.
CoRR, 2022

Synergy-of-Experts: Collaborate to Improve Adversarial Robustness.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Generalizing Consistent Multi-Class Classification with Rejection to be Compatible with Arbitrary Losses.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Adapting to Online Label Shift with Provable Guarantees.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Contrastive Embedding in Low-Dimensional Space.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Adversarial Training with Complementary Labels: On the Benefit of Gradually Informative Attacks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards Adversarially Robust Deep Image Denoising.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Adversarial Attack and Defense for Non-Parametric Two-Sample Tests.
Proceedings of the International Conference on Machine Learning, 2022

Adaptive Inertia: Disentangling the Effects of Adaptive Learning Rate and Momentum.
Proceedings of the International Conference on Machine Learning, 2022

To Smooth or Not? When Label Smoothing Meets Noisy Labels.
Proceedings of the International Conference on Machine Learning, 2022

Exploiting Class Activation Value for Partial-Label Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Rethinking Class-Prior Estimation for Positive-Unlabeled Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Sample Selection with Uncertainty of Losses for Learning with Noisy Labels.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Federated Learning from Only Unlabeled Data with Class-conditional-sharing Clients.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Meta Discovery: Learning to Discover Novel Classes given Very Limited Data.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Instance-Dependent Label-Noise Learning with Manifold-Regularized Transition Matrix Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning and Mining with Noisy Labels.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Predictive variational Bayesian inference as risk-seeking optimization.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Pairwise Supervision Can Provably Elicit a Decision Boundary.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Multi-class Classification from Multiple Unlabeled Datasets with Partial Risk Regularization.
Proceedings of the Asian Conference on Machine Learning, 2022

Robust computation of optimal transport by β-potential regularization.
Proceedings of the Asian Conference on Machine Learning, 2022

2021
A One-Step Approach to Covariate Shift Adaptation.
SN Comput. Sci., 2021

Direction Matters: On Influence-Preserving Graph Summarization and Max-Cut Principle for Directed Graphs.
Neural Comput., 2021

Artificial Neural Variability for Deep Learning: On Overfitting, Noise Memorization, and Catastrophic Forgetting.
Neural Comput., 2021

Semisupervised Ordinal Regression Based on Empirical Risk Minimization.
Neural Comput., 2021

Classification From Pairwise Similarities/Dissimilarities and Unlabeled Data via Empirical Risk Minimization.
Neural Comput., 2021

Information-Theoretic Representation Learning for Positive-Unlabeled Classification.
Neural Comput., 2021

Learning with Proper Partial Labels.
CoRR, 2021

Rethinking Importance Weighting for Transfer Learning.
CoRR, 2021

Active Refinement for Multi-Label Learning: A Pseudo-Label Approach.
CoRR, 2021

Seeing Differently, Acting Similarly: Imitation Learning with Heterogeneous Observations.
CoRR, 2021

Multi-Class Classification from Single-Class Data with Confidences.
CoRR, 2021

On the Robustness of Average Losses for Partial-Label Learning.
CoRR, 2021

Instance Correction for Learning with Open-set Noisy Labels.
CoRR, 2021

NoiLIn: Do Noisy Labels Always Hurt Adversarial Training?
CoRR, 2021

Approximating Instance-Dependent Noise via Instance-Confidence Embedding.
CoRR, 2021

Discovering Diverse Solutions in Deep Reinforcement Learning.
CoRR, 2021

Guided Interpolation for Adversarial Training.
CoRR, 2021

Understanding the Interaction of Adversarial Training with Noisy Labels.
CoRR, 2021

Source-free Domain Adaptation via Distributional Alignment by Matching Batch Normalization Statistics.
CoRR, 2021

A Symmetric Loss Perspective of Reliable Machine Learning.
CoRR, 2021

Constraint learning for control tasks with limited duration barrier functions.
Autom., 2021

Incorporating causal graphical prior knowledge into predictive modeling via simple data augmentation.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Learning from Noisy Similar and Dissimilar Data.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2021

Probabilistic Margins for Instance Reweighting in Adversarial Training.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Loss function based second-order Jensen inequality and its application to particle variational inference.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning Noise Transition Matrix from Only Noisy Labels via Total Variation Regularization.
Proceedings of the 38th International Conference on Machine Learning, 2021

Lower-Bounded Proper Losses for Weakly Supervised Classification.
Proceedings of the 38th International Conference on Machine Learning, 2021

CIFS: Improving Adversarial Robustness of CNNs via Channel-wise Importance-based Feature Selection.
Proceedings of the 38th International Conference on Machine Learning, 2021

Mediated Uncoupled Learning: Learning Functions without Direct Input-output Correspondences.
Proceedings of the 38th International Conference on Machine Learning, 2021

Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to Improve Generalization.
Proceedings of the 38th International Conference on Machine Learning, 2021

Binary Classification from Multiple Unlabeled Datasets via Surrogate Set Classification.
Proceedings of the 38th International Conference on Machine Learning, 2021

Provably End-to-end Label-noise Learning without Anchor Points.
Proceedings of the 38th International Conference on Machine Learning, 2021

Maximum Mean Discrepancy Test is Aware of Adversarial Attacks.
Proceedings of the 38th International Conference on Machine Learning, 2021

Pointwise Binary Classification with Pairwise Confidence Comparisons.
Proceedings of the 38th International Conference on Machine Learning, 2021

Learning Diverse-Structured Networks for Adversarial Robustness.
Proceedings of the 38th International Conference on Machine Learning, 2021

Classification with Rejection Based on Cost-sensitive Classification.
Proceedings of the 38th International Conference on Machine Learning, 2021

Learning from Similarity-Confidence Data.
Proceedings of the 38th International Conference on Machine Learning, 2021

Confidence Scores Make Instance-dependent Label-noise Learning Possible.
Proceedings of the 38th International Conference on Machine Learning, 2021

Large-Margin Contrastive Learning with Distance Polarization Regularizer.
Proceedings of the 38th International Conference on Machine Learning, 2021

Geometry-aware Instance-reweighted Adversarial Training.
Proceedings of the 9th International Conference on Learning Representations, 2021

A Diffusion Theory For Deep Learning Dynamics: Stochastic Gradient Descent Exponentially Favors Flat Minima.
Proceedings of the 9th International Conference on Learning Representations, 2021

Scalable Evaluation and Improvement of Document Set Expansion via Neural Positive-Unlabeled Learning.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

On Focal Loss for Class-Posterior Probability Estimation: A Theoretical Perspective.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Mixture Proportion Estimation in Weakly Supervised Learning.
Proceedings of the CIKM 2021 Workshops co-located with 30th ACM International Conference on Information and Knowledge Management (CIKM 2021), 2021

Robust Imitation Learning from Noisy Demonstrations.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

A unified view of likelihood ratio and reparameterization gradients.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

γ-ABC: Outlier-Robust Approximate Bayesian Computation Based on a Robust Divergence Estimator.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

Fenchel-Young Losses with Skewed Entropies for Class-posterior Probability Estimation.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020
Stochastic Multichannel Ranking with Brain Dynamics Preferences.
Neural Comput., 2020

Polynomial-Time Algorithms for Multiple-Arm Identification with Full-Bandit Feedback.
Neural Comput., 2020

Classification from Triplet Comparison Data.
Neural Comput., 2020

Binary classification with ambiguous training data.
Mach. Learn., 2020

Principled analytic classifier for positive-unlabeled learning via weighted integral probability metric.
Mach. Learn., 2020

Active deep Q-learning with demonstration.
Mach. Learn., 2020

Unsupervised key frame selection using information theory and colour histogram difference.
Int. J. Bus. Intell. Data Min., 2020

Combinatorial Pure Exploration with Full-bandit Feedback and Beyond: Solving Combinatorial Optimization under Uncertainty with Limited Observation.
CoRR, 2020

Stable Weight Decay Regularization.
CoRR, 2020

A Survey of Label-noise Representation Learning: Past, Present and Future.
CoRR, 2020

Maximum Mean Discrepancy is Aware of Adversarial Attacks.
CoRR, 2020

Adai: Separating the Effects of Adaptive Learning Rate and Momentum Inertia.
CoRR, 2020

Generalisation Guarantees for Continual Learning with Orthogonal Gradient Descent.
CoRR, 2020

LFD-ProtoNet: Prototypical Network Based on Local Fisher Discriminant Analysis for Few-shot Learning.
CoRR, 2020

Parts-dependent Label Noise: Towards Instance-dependent Label Noise.
CoRR, 2020

Similarity-based Classification: Connecting Similarity Learning to Binary Classification.
CoRR, 2020

Do Public Datasets Assure Unbiased Comparisons for Registration Evaluation?
CoRR, 2020

Time-varying Gaussian Process Bandit Optimization with Non-constant Evaluation Time.
CoRR, 2020

Towards Mixture Proportion Estimation without Irreducibility.
CoRR, 2020

A Diffusion Theory for Deep Learning Dynamics: Stochastic Gradient Descent Escapes From Sharp Minima Exponentially Fast.
CoRR, 2020

Partially Zero-shot Domain Adaptation from Incomplete Target Data with Missing Classes.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Learning from Aggregate Observations.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Dual T: Reducing Estimation Error for Transition Matrix in Label-noise Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Part-dependent Label Noise: Towards Instance-dependent Label Noise.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Analysis and Design of Thompson Sampling for Stochastic Partial Monitoring.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Coupling-based Invertible Neural Networks Are Universal Diffeomorphism Approximators.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Provably Consistent Partial-Label Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Rethinking Importance Weighting for Deep Learning under Distribution Shift.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Calibrated Surrogate Maximization of Dice.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Are Registration Uncertainty and Error Monotonically Associated?
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Simultaneous Planning for Item Picking and Placing by Deep Reinforcement Learning.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Binary Classification from Positive Data with Skewed Confidence.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Attacks Which Do Not Kill Training Make Adversarial Learning Stronger.
Proceedings of the 37th International Conference on Machine Learning, 2020

Normalized Flat Minima: Exploring Scale Invariant Definition of Flat Minima for Neural Networks Using PAC-Bayesian Analysis.
Proceedings of the 37th International Conference on Machine Learning, 2020

Few-shot Domain Adaptation by Causal Mechanism Transfer.
Proceedings of the 37th International Conference on Machine Learning, 2020

Variational Imitation Learning with Diverse-quality Demonstrations.
Proceedings of the 37th International Conference on Machine Learning, 2020

Progressive Identification of True Labels for Partial-Label Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

Online Dense Subgraph Discovery via Blurred-Graph Feedback.
Proceedings of the 37th International Conference on Machine Learning, 2020

Do We Need Zero Training Loss After Achieving Zero Training Error?
Proceedings of the 37th International Conference on Machine Learning, 2020

Accelerating the diffusion-based ensemble sampling by non-reversible dynamics.
Proceedings of the 37th International Conference on Machine Learning, 2020

Learning with Multiple Complementary Labels.
Proceedings of the 37th International Conference on Machine Learning, 2020

Unbiased Risk Estimators Can Mislead: A Case Study of Learning with Complementary Labels.
Proceedings of the 37th International Conference on Machine Learning, 2020

SIGUA: Forgetting May Make Learning with Noisy Labels More Robust.
Proceedings of the 37th International Conference on Machine Learning, 2020

Calibrated Surrogate Losses for Adversarially Robust Classification.
Proceedings of the Conference on Learning Theory, 2020

Mitigating Overfitting in Supervised Classification from Two Unlabeled Datasets: A Consistent Risk Correction Approach.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

Calibrated Surrogate Maximization of Linear-fractional Utility in Binary Classification.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2019
Unsupervised Discrete Representation Learning.
Proceedings of the Explainable AI: Interpreting, 2019

ECG-Based Concentration Recognition With Multi-Task Regression.
IEEE Trans. Biomed. Eng., 2019

Foreword: special issue for the journal track of the 10th Asian Conference on Machine Learning (ACML 2018).
Mach. Learn., 2019

Good arm identification via bandit feedback.
Mach. Learn., 2019

Millionaire: a hint-guided approach for crowdsourcing.
Mach. Learn., 2019

Where is the Bottleneck of Adversarial Learning with Unlabeled Data?
CoRR, 2019

A unified view of likelihood ratio and reparameterization gradients and an optimal importance sampling scheme.
CoRR, 2019

Learning from Indirect Observations.
CoRR, 2019

Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics.
CoRR, 2019

VILD: Variational Imitation Learning with Diverse-quality Demonstrations.
CoRR, 2019

Pilot Study on Verifying the Monotonic Relationship between Error and Uncertainty in Deformable Registration for Neurosurgery.
CoRR, 2019

Solving NP-Hard Problems on Graphs by Reinforcement Learning without Domain Knowledge.
CoRR, 2019

Butterfly: A Panacea for All Difficulties in Wildly Unsupervised Domain Adaptation.
CoRR, 2019

Polynomial-time Algorithms for Combinatorial Pure Exploration with Full-bandit Feedback.
CoRR, 2019

Online Multiclass Classification Based on Prediction Margin for Partial Feedback.
CoRR, 2019

Semi-Supervised Ordinal Regression Based on Empirical Risk Minimization.
CoRR, 2019

New Tricks for Estimating Gradients of Expectations.
CoRR, 2019

On Possibility and Impossibility of Multiclass Classification with Rejection.
CoRR, 2019

Domain Discrepancy Measure Using Complex Models in Unsupervised Domain Adaptation.
CoRR, 2019

Revisiting Sample Selection Approach to Positive-Unlabeled Learning: Turning Unlabeled Data into Positive rather than Negative.
CoRR, 2019

An analytic formulation for positive-unlabeled learning via weighted integral probability metric.
CoRR, 2019

Positive-Unlabeled Classification under Class Prior Shift and Asymmetric Error.
Proceedings of the 2019 SIAM International Conference on Data Mining, 2019

Uncoupled Regression from Pairwise Comparison Data.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Are Anchor Points Really Indispensable in Label-Noise Learning?
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

On the Calibration of Multiclass Classification with Rejection.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

On the Applicability of Registration Uncertainty.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

How does Disagreement Help Generalization against Label Corruption?
Proceedings of the 36th International Conference on Machine Learning, 2019

Imitation Learning from Imperfect Demonstration.
Proceedings of the 36th International Conference on Machine Learning, 2019

Complementary-Label Learning for Arbitrary Losses and Models.
Proceedings of the 36th International Conference on Machine Learning, 2019

Classification from Positive, Unlabeled and Biased Negative Data.
Proceedings of the 36th International Conference on Machine Learning, 2019

On Symmetric Losses for Learning from Corrupted Labels.
Proceedings of the 36th International Conference on Machine Learning, 2019

Hierarchical Reinforcement Learning via Advantage-Weighted Information Maximization.
Proceedings of the 7th International Conference on Learning Representations, 2019

On the Minimal Supervision for Training Any Binary Classifier from Only Unlabeled Data.
Proceedings of the 7th International Conference on Learning Representations, 2019

Learning Efficient Tensor Representations with Ring-structured Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

Binary Classification Only from Unlabeled Data by Iterative Unlabeled-unlabeled Classification.
Proceedings of the IEEE International Conference on Acoustics, 2019

Learning Only from Relevant Keywords and Unlabeled Documents.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Zero-shot Domain Adaptation Based on Attribute Information.
Proceedings of The 11th Asian Conference on Machine Learning, 2019

Dueling Bandits with Qualitative Feedback.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Clipped Matrix Completion: A Remedy for Ceiling Effects.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Unsupervised Domain Adaptation Based on Source-Guided Discrepancy.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Bézier Simplex Fitting: Describing Pareto Fronts of Simplicial Problems with Small Samples in Multi-Objective Optimization.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Bayesian Posterior Approximation via Greedy Particle Optimization.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Convex formulation of multiple instance learning from positive and unlabeled bags.
Neural Networks, 2018

Sufficient Dimension Reduction via Direct Estimation of the Gradients of Logarithmic Conditional Densities.
Neural Comput., 2018

Bias Reduction and Metric Learning for Nearest-Neighbor Estimation of Kullback-Leibler Divergence.
Neural Comput., 2018

Correction to: Semi-supervised AUC optimization based on positive-unlabeled learning.
Mach. Learn., 2018

Semi-supervised AUC optimization based on positive-unlabeled learning.
Mach. Learn., 2018

Stochastic Divergence Minimization for Biterm Topic Models.
IEICE Trans. Inf. Syst., 2018

Pumpout: A Meta Approach for Robustly Training Deep Neural Networks with Noisy Labels.
CoRR, 2018

Alternate Estimation of a Classifier and the Class-Prior from Positive and Unlabeled Data.
CoRR, 2018

Matrix Co-completion for Multi-label Classification with Missing Features and Labels.
CoRR, 2018

Frank-Wolfe Stein Sampling.
CoRR, 2018

Co-sampling: Training Robust Networks for Extremely Noisy Supervision.
CoRR, 2018

On the Ambiguity of Registration Uncertainty.
CoRR, 2018

Binary Matrix Completion Using Unobserved Entries.
CoRR, 2018

Using the variogram for vector outlier screening: application to feature-based image registration.
Int. J. Comput. Assist. Radiol. Surg., 2018

Variational Inference for Gaussian Processes with Panel Count Data.
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018

Uplift Modeling from Separate Labels.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Lipschitz-Margin Training: Scalable Certification of Perturbation Invariance for Deep Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Continuous-time Value Function Approximation in Reproducing Kernel Hilbert Spaces.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Binary Classification from Positive-Confidence Data.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Co-teaching: Robust training of deep neural networks with extremely noisy labels.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Masking: A New Perspective of Noisy Supervision.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

A Feature-Driven Active Framework for Ultrasound-Based Brain Shift Compensation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2018, 2018

Active Feature Acquisition with Supervised Matrix Completion.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Analysis of Minimax Error Rate for Crowdsourcing and Its Application to Worker Clustering Model.
Proceedings of the 35th International Conference on Machine Learning, 2018

Does Distributionally Robust Supervised Learning Give Robust Classifiers?
Proceedings of the 35th International Conference on Machine Learning, 2018

Classification from Pairwise Similarity and Unlabeled Data.
Proceedings of the 35th International Conference on Machine Learning, 2018

Learning Efficient Tensor Representations with Ring Structure Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

Guide Actor-Critic for Continuous Control.
Proceedings of the 6th International Conference on Learning Representations, 2018

Multi Task Learning with Positive and Unlabeled Data and its Application to Mental State Prediction.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A fully adaptive algorithm for pure exploration in linear bandits.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

Variational Inference based on Robust Divergences.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

Bayesian Nonparametric Poisson-Process Allocation for Time-Sequence Modeling.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

Hierarchical Policy Search via Return-Weighted Density Estimation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Direct Estimation of the Derivative of Quadratic Mutual Information with Application in Supervised Dimension Reduction.
Neural Comput., 2017

Homotopy continuation approaches for robust SV classification and regression.
Mach. Learn., 2017

Class-prior estimation for learning from positive and unlabeled data.
Mach. Learn., 2017

Geometry-aware principal component analysis for symmetric positive definite matrices.
Mach. Learn., 2017

Introduction: special issue of selected papers from ACML 2015.
Mach. Learn., 2017

Foreword: special issue for the journal track of the 8th Asian conference on machine learning (ACML 2016).
Mach. Learn., 2017

Mode-Seeking Clustering and Density Ridge Estimation via Direct Estimation of Density-Derivative-Ratios.
J. Mach. Learn. Res., 2017

Tensor Networks for Dimensionality Reduction and Large-scale Optimization: Part 2 Applications and Future Perspectives.
Found. Trends Mach. Learn., 2017

Estimation of Squared-Loss Mutual Information from Positive and Unlabeled Data.
CoRR, 2017

Tensor Networks for Dimensionality Reduction and Large-Scale Optimizations. Part 2 Applications and Future Perspectives.
CoRR, 2017

Learning Efficient Tensor Representations with Ring Structure Networks.
CoRR, 2017

Misdirected Registration Uncertainty.
CoRR, 2017

Learning from Complementary Labels.
CoRR, 2017

Risk Minimization Framework for Multiple Instance Learning from Positive and Unlabeled Bags.
CoRR, 2017

Generative Local Metric Learning for Kernel Regression.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Positive-Unlabeled Learning with Non-Negative Risk Estimator.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Learning from Complementary Labels.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Expectation Propagation for t-Exponential Family Using q-Algebra.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Lung lesion detection in FDG-PET/CT with Gaussian process regression.
Proceedings of the Medical Imaging 2017: Computer-Aided Diagnosis, 2017

Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data.
Proceedings of the 34th International Conference on Machine Learning, 2017

Learning Discrete Representations via Information Maximizing Self-Augmented Training.
Proceedings of the 34th International Conference on Machine Learning, 2017

Estimating Density Ridges by Direct Estimation of Density-Derivative-Ratios.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

Least-Squares Log-Density Gradient Clustering for Riemannian Manifolds.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

Whitening-Free Least-Squares Non-Gaussian Component Analysis.
Proceedings of The 9th Asian Conference on Machine Learning, 2017

Policy Search with High-Dimensional Context Variables.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Trial and Error: Using Previous Experiences as Simulation Models in Humanoid Motor Learning.
IEEE Robotics Autom. Mag., 2016

Model-based reinforcement learning with dimension reduction.
Neural Networks, 2016

Regularized Multitask Learning for Multidimensional Log-Density Gradient Estimation.
Neural Comput., 2016

Theoretical and Experimental Analyses of Tensor-Based Regression and Classification.
Neural Comput., 2016

Direct Density Derivative Estimation.
Neural Comput., 2016

An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions.
Neural Comput., 2016

Computationally Efficient Class-Prior Estimation under Class Balance Change Using Energy Distance.
IEICE Trans. Inf. Syst., 2016

Beyond the Low-density Separation Principle: A Novel Approach to Semi-supervised Learning.
CoRR, 2016

Theoretical Comparisons of Learning from Positive-Negative, Positive-Unlabeled, and Negative-Unlabeled Data.
CoRR, 2016

Reinterpreting the Transformation Posterior in Probabilistic Image Registration.
CoRR, 2016

Robust supervised learning under uncertainty in dataset shift.
CoRR, 2016

Dependence maximization localization: a novel approach to 2D street-map-based robot localization.
Adv. Robotics, 2016

Faster Stochastic Variational Inference using Proximal-Gradient Methods with General Divergence Functions.
Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016

Semi-supervised sufficient dimension reduction under class-prior change.
Proceedings of the Conference on Technologies and Applications of Artificial Intelligence, 2016

Theoretical Comparisons of Positive-Unlabeled Learning against Positive-Negative Learning.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Modal Regression via Direct Log-Density Derivative Estimation.
Proceedings of the Neural Information Processing - 23rd International Conference, 2016

Structure Learning of Partitioned Markov Networks.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Target-less camera-LiDAR extrinsic calibration using a bagged dependence estimator.
Proceedings of the IEEE International Conference on Automation Science and Engineering, 2016

Non-Gaussian Component Analysis with Log-Density Gradient Estimation.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

Multitask Principal Component Analysis.
Proceedings of The 8th Asian Conference on Machine Learning, 2016

Geometry-aware stationary subspace analysis.
Proceedings of The 8th Asian Conference on Machine Learning, 2016

2015
Active Learning in Recommender Systems.
Proceedings of the Recommender Systems Handbook, 2015

Importance-weighted covariance estimation for robust common spatial pattern.
Pattern Recognit. Lett., 2015

Cross-Domain Matching with Squared-Loss Mutual Information.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Bandit-Based Task Assignment for Heterogeneous Crowdsourcing.
Neural Comput., 2015

Conditional Density Estimation with Dimensionality Reduction via Squared-Loss Conditional Entropy Minimization.
Neural Comput., 2015

Online Direct Density-Ratio Estimation Applied to Inlier-Based Outlier Detection.
Neural Comput., 2015

Direct conditional probability density estimation with sparse feature selection.
Mach. Learn., 2015

Introduction: special issue of selected papers of ACML 2013.
Mach. Learn., 2015

Condition for perfect dimensionality recovery by variational Bayesian PCA.
J. Mach. Learn. Res., 2015

Direct Density Ratio Estimation with Convolutional Neural Networks with Application in Outlier Detection.
IEICE Trans. Inf. Syst., 2015

Supervised LogEuclidean Metric Learning for Symmetric Positive Definite Matrices.
CoRR, 2015

Online Markov decision processes with policy iteration.
CoRR, 2015

Convergence of Proximal-Gradient Stochastic Variational Inference under Non-Decreasing Step-Size Sequence.
CoRR, 2015

Task selection for bandit-based task assignment in heterogeneous crowdsourcing.
Proceedings of the Conference on Technologies and Applications of Artificial Intelligence, 2015

Predictive Approaches for Low-Cost Preventive Medicine Program in Developing Countries.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

A dependence maximization approach towards street map-based localization.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Stroke-Based Stylization Learning and Rendering with Inverse Reinforcement Learning.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Convex Formulation for Learning from Positive and Unlabeled Data.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Minimum dependency key frames selection via quadratic mutual information.
Proceedings of the Tenth International Conference on Digital Information Management, 2015

Averaging covariance matrices for EEG signal classification based on the CSP: An empirical study.
Proceedings of the 23rd European Signal Processing Conference, 2015

Direct Density-Derivative Estimation and Its Application in KL-Divergence Approximation.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

Regularized Policy Gradients: Direct Variance Reduction in Policy Gradient Estimation.
Proceedings of The 7th Asian Conference on Machine Learning, 2015

Sufficient Dimension Reduction via Direct Estimation of the Gradients of Logarithmic Conditional Densities.
Proceedings of The 7th Asian Conference on Machine Learning, 2015

Continuous Target Shift Adaptation in Supervised Learning.
Proceedings of The 7th Asian Conference on Machine Learning, 2015

Support Consistency of Direct Sparse-Change Learning in Markov Networks.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Statistical Reinforcement Learning - Modern Machine Learning Approaches.
Chapman and Hall / CRC machine learning and pattern recognition series, CRC Press, ISBN: 978-1-439-85689-5, 2015

2014
A least-squares approach to anomaly detection in static and sequential data.
Pattern Recognit. Lett., 2014

Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation.
Neural Networks, 2014

Semi-supervised learning of class balance under class-prior change by distribution matching.
Neural Networks, 2014

Semi-supervised information-maximization clustering.
Neural Networks, 2014

High-Dimensional Feature Selection by Feature-Wise Kernelized Lasso.
Neural Comput., 2014

Information-Maximization Clustering Based on Squared-Loss Mutual Information.
Neural Comput., 2014

Information-Theoretic Semi-Supervised Metric Learning via Entropy Regularization.
Neural Comput., 2014

Direct Learning of Sparse Changes in Markov Networks by Density Ratio Estimation.
Neural Comput., 2014

Least-squares independence regression for non-linear causal inference under non-Gaussian noise.
Mach. Learn., 2014

Tree-Based Ensemble Multi-Task Learning Method for Classification and Regression.
IEICE Trans. Inf. Syst., 2014

Computationally Efficient Estimation of Squared-Loss Mutual Information with Multiplicative Kernel Models.
IEICE Trans. Inf. Syst., 2014

Unsupervised Dimension Reduction via Least-Squares Quadratic Mutual Information.
IEICE Trans. Inf. Syst., 2014

Class Prior Estimation from Positive and Unlabeled Data.
IEICE Trans. Inf. Syst., 2014

Constrained Least-Squares Density-Difference Estimation.
IEICE Trans. Inf. Syst., 2014

Statistical Analysis of Distance Estimators with Density Differences and Density Ratios.
Entropy, 2014

Efficient Reuse of Previous Experiences to Improve Policies in Real Environment.
CoRR, 2014

Support vector comparison machines.
CoRR, 2014

Clustering via Mode Seeking by Direct Estimation of the Gradient of a Log-Density.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014

Multitask learning meets tensor factorization: task imputation via convex optimization.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Analysis of Learning from Positive and Unlabeled Data.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Analysis of Variational Bayesian Latent Dirichlet Allocation: Weaker Sparsity Than MAP.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Outlier Path: A Homotopy Algorithm for Robust SVM.
Proceedings of the 31th International Conference on Machine Learning, 2014

Transductive Learning with Multi-class Volume Approximation.
Proceedings of the 31th International Conference on Machine Learning, 2014

Efficient reuse of previous experiences in humanoid motor learning.
Proceedings of the 14th IEEE-RAS International Conference on Humanoid Robots, 2014

Analysis of Empirical MAP and Empirical Partially Bayes: Can They be Alternatives to Variational Bayes?
Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics, 2014

2013
Change-point detection in time-series data by relative density-ratio estimation.
Neural Networks, 2013

Efficient Sample Reuse in Policy Gradients with Parameter-Based Exploration.
Neural Comput., 2013

Relative Density-Ratio Estimation for Robust Distribution Comparison.
Neural Comput., 2013

Density-Difference Estimation.
Neural Comput., 2013

Variational Bayesian sparse additive matrix factorization.
Mach. Learn., 2013

Computational complexity of kernel-based density-ratio estimation: a condition number analysis.
Mach. Learn., 2013

Maximum volume clustering: a new discriminative clustering approach.
J. Mach. Learn. Res., 2013

Global analytic solution of fully-observed variational Bayesian matrix factorization.
J. Mach. Learn. Res., 2013

Direct Divergence Approximation between Probability Distributions and Its Applications in Machine Learning.
J. Comput. Sci. Eng., 2013

Artist Agent: A Reinforcement Learning Approach to Automatic Stroke Generation in Oriental Ink Painting.
IEICE Trans. Inf. Syst., 2013

Direct Approximation of Quadratic Mutual Information and Its Application to Dependence-Maximization Clustering.
IEICE Trans. Inf. Syst., 2013

Computationally Efficient Multi-Label Classification by Least-Squares Probabilistic Classifiers.
IEICE Trans. Inf. Syst., 2013

Feature Selection via l<sub>1</sub>-Penalized Squared-Loss Mutual Information.
IEICE Trans. Inf. Syst., 2013

Winning the Kaggle Algorithmic Trading Challenge with the Composition of Many Models and Feature Engineering.
IEICE Trans. Inf. Syst., 2013

Machine Learning with Squared-Loss Mutual Information.
Entropy, 2013

Noise adaptive optimization of matrix initialization for frequency-domain independent component analysis.
Digit. Signal Process., 2013

Clustering Unclustered Data: Unsupervised Binary Labeling of Two Datasets Having Different Class Balances
CoRR, 2013

Density Ratio Hidden Markov Models
CoRR, 2013

Clustering Unclustered Data: Unsupervised Binary Labeling of Two Datasets Having Different Class Balances.
Proceedings of the Conference on Technologies and Applications of Artificial Intelligence, 2013

Direct Learning of Sparse Changes in Markov Networks by Density Ratio Estimation.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2013

Parametric Task Learning.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Global Solver and Its Efficient Approximation for Variational Bayesian Low-rank Subspace Clustering.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Infinitesimal Annealing for Training Semi-Supervised Support Vector Machines.
Proceedings of the 30th International Conference on Machine Learning, 2013

Squared-loss Mutual Information Regularization: A Novel Information-theoretic Approach to Semi-supervised Learning.
Proceedings of the 30th International Conference on Machine Learning, 2013

Direct Approximation of Divergences Between Probability Distributions.
Proceedings of the Empirical Inference - Festschrift in Honor of Vladimir N. Vapnik, 2013

Divergence estimation for machine learning and signal processing.
Proceedings of the International Winter Workshop on Brain-Computer Interface, 2013

2012
f-Divergence Estimation and Two-Sample Homogeneity Test Under Semiparametric Density-Ratio Models.
IEEE Trans. Inf. Theory, 2012

Introduction to the Special Section on the 2nd Asia Conference on Machine Learning (ACML 2010).
ACM Trans. Intell. Syst. Technol., 2012

Sequential change-point detection based on direct density-ratio estimation.
Stat. Anal. Data Min., 2012

Analysis and improvement of policy gradient estimation.
Neural Networks, 2012

Improving importance estimation in pool-based batch active learning for approximate linear regression.
Neural Networks, 2012

Canonical dependency analysis based on squared-loss mutual information.
Neural Networks, 2012

Multi-parametric solution-path algorithm for instance-weighted support vector machines.
Mach. Learn., 2012

Statistical analysis of kernel-based least-squares density-ratio estimation.
Mach. Learn., 2012

Fast Learning Rate of Multiple Kernel Learning: Trade-Off between Sparsity and Smoothness.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

Sparse Additive Matrix Factorization for Robust PCA and Its Generalization.
Proceedings of the 4th Asian Conference on Machine Learning, 2012

Importance-weighted least-squares probabilistic classifier for covariate shift adaptation with application to human activity recognition.
Neurocomputing, 2012

On Kernel Parameter Selection in Hilbert-Schmidt Independence Criterion.
IEICE Trans. Inf. Syst., 2012

Multi-Task Approach to Reinforcement Learning for Factored-State Markov Decision Problems.
IEICE Trans. Inf. Syst., 2012

Early Stopping Heuristics in Pool-Based Incremental Active Learning for Least-Squares Probabilistic Classifier.
IEICE Trans. Inf. Syst., 2012

Feature Selection via L1-Penalized Squared-Loss Mutual Information
CoRR, 2012

Dependence Maximizing Temporal Alignment via Squared-Loss Mutual Information
CoRR, 2012

High-Dimensional Feature Selection by Feature-Wise Non-Linear Lasso
CoRR, 2012

Perfect Dimensionality Recovery by Variational Bayesian PCA.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Designing various component analysis at will.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Computationally efficient multi-label classification by least-squares probabilistic classifier.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Machine Learning in Non-Stationary Environments - Introduction to Covariate Shift Adaptation.
Adaptive computation and machine learning, MIT Press, ISBN: 978-0-262-01709-1, 2012

Density Ratio Estimation in Machine Learning.
Cambridge University Press, ISBN: 978-0-521-19017-6, 2012

2011
Active Learning in Recommender Systems.
Proceedings of the Recommender Systems Handbook, 2011

Direct density-ratio estimation with dimensionality reduction via least-squares hetero-distributional subspace search.
Neural Networks, 2011

Least-squares two-sample test.
Neural Networks, 2011

Least-Squares Independent Component Analysis.
Neural Comput., 2011

Reward-Weighted Regression with Sample Reuse for Direct Policy Search in Reinforcement Learning.
Neural Comput., 2011

Statistical outlier detection using direct density ratio estimation.
Knowl. Inf. Syst., 2011

Cross-Domain Object Matching with Model Selection.
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011

Suffcient Component Analysis.
Proceedings of the 3rd Asian Conference on Machine Learning, 2011

A Refined Margin Analysis for Boosting Algorithms via Equilibrium Margin.
J. Mach. Learn. Res., 2011

Super-Linear Convergence of Dual Augmented Lagrangian Algorithm for Sparsity Regularized Estimation.
J. Mach. Learn. Res., 2011

Maximum Volume Clustering.
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011

Theoretical Analysis of Bayesian Matrix Factorization.
J. Mach. Learn. Res., 2011

Dependence-Maximization Clustering with Least-Squares Mutual Information.
J. Adv. Comput. Intell. Intell. Informatics, 2011

Computationally Efficient Multi-task Learning with Least-squares Probabilistic Classifiers.
IPSJ Trans. Comput. Vis. Appl., 2011

Improving the Accuracy of Least-Squares Probabilistic Classifiers.
IEICE Trans. Inf. Syst., 2011

Lighting Condition Adaptation for Perceived Age Estimation.
IEICE Trans. Inf. Syst., 2011

Least-Squares Independence Test.
IEICE Trans. Inf. Syst., 2011

Target Neighbor Consistent Feature Weighting for Nearest Neighbor Classification.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Global Solution of Fully-Observed Variational Bayesian Matrix Factorization is Column-Wise Independent.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

On Information-Maximization Clustering: Tuning Parameter Selection and Analytic Solution.
Proceedings of the 28th International Conference on Machine Learning, 2011

On Bayesian PCA: Automatic Dimensionality Selection and Analytic Solution.
Proceedings of the 28th International Conference on Machine Learning, 2011

Detection of activities and events without explicit categorization.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Automatic audio tag classification via semi-supervised canonical density estimation.
Proceedings of the IEEE International Conference on Acoustics, 2011

Multi-race age estimation based on the combination of multiple classifiers.
Proceedings of the First Asian Conference on Pattern Recognition, 2011

Direct Density-Ratio Estimation with Dimensionality Reduction via Hetero-Distributional Subspace Analysis.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Trajectory Regression on Road Networks.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010
Conic Programming for Multitask Learning.
IEEE Trans. Knowl. Data Eng., 2010

Application of Covariate Shift Adaptation Techniques in Brain-Computer Interfaces.
IEEE Trans. Biomed. Eng., 2010

Semi-supervised speaker identification under covariate shift.
Signal Process., 2010

Dimensionality reduction for density ratio estimation in high-dimensional spaces.
Neural Networks, 2010

Efficient exploration through active learning for value function approximation in reinforcement learning.
Neural Networks, 2010

Semi-supervised local Fisher discriminant analysis for dimensionality reduction.
Mach. Learn., 2010

Single versus Multiple Sorting in All Pairs Similarity Search.
Proceedings of the 2nd Asian Conference on Machine Learning, 2010

Sufficient Dimension Reduction via Squared-loss Mutual Information Estimation.
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010

Preface.
Proceedings of the 2nd Asian Conference on Machine Learning, 2010

Conditional Density Estimation via Least-Squares Density Ratio Estimation.
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010

A Transfer Learning Approach and Selective Integration of Multiple Types of Assays for Biological Network Inference.
Int. J. Knowl. Discov. Bioinform., 2010

Direct Importance Estimation with a Mixture of Probabilistic Principal Component Analyzers.
IEICE Trans. Inf. Syst., 2010

A Semi-Supervised Approach to Perceived Age Prediction from Face Images.
IEICE Trans. Inf. Syst., 2010

Least-Squares Conditional Density Estimation.
IEICE Trans. Inf. Syst., 2010

Least Absolute Policy Iteration-A Robust Approach to Value Function Approximation.
IEICE Trans. Inf. Syst., 2010

Superfast-Trainable Multi-Class Probabilistic Classifier by Least-Squares Posterior Fitting.
IEICE Trans. Inf. Syst., 2010

Foreword.
IEICE Trans. Inf. Syst., 2010

Spectral Methods for Thesaurus Construction.
IEICE Trans. Inf. Syst., 2010

Theoretical Analysis of Density Ratio Estimation.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2010

Semi-supervised Estimation of Perceived Age from Face Images.
Proceedings of the VISAPP 2010 - Proceedings of the Fifth International Conference on Computer Vision Theory and Applications, Angers, France, May 17-21, 2010, 2010

Parametric Return Density Estimation for Reinforcement Learning.
Proceedings of the UAI 2010, 2010

Direct Density Ratio Estimation with Dimensionality Reduction.
Proceedings of the SIAM International Conference on Data Mining, 2010

Feature Selection for Reinforcement Learning: Evaluating Implicit State-Reward Dependency via Conditional Mutual Information.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2010

Global Analytic Solution for Variational Bayesian Matrix Factorization.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Perceived Age Estimation under Lighting Condition Change by Covariate Shift Adaptation.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

SemiCCA: Efficient Semi-supervised Learning of Canonical Correlations.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

A Fast Augmented Lagrangian Algorithm for Learning Low-Rank Matrices.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Implicit Regularization in Variational Bayesian Matrix Factorization.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Nonparametric Return Distribution Approximation for Reinforcement Learning.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Acceleration of sequence kernel computation for real-time speaker identification.
Proceedings of the IEEE International Conference on Acoustics, 2010

Direct importance estimation with probabilistic principal component analyzers.
Proceedings of the IEEE International Conference on Acoustics, 2010

Automatic audio tagging using covariate shift adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2010

Dependence Minimizing Regression with Model Selection for Non-Linear Causal Inference under Non-Gaussian Noise.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

2009
Robust Label Propagation on Multiple Networks.
IEEE Trans. Neural Networks, 2009

Dual-Augmented Lagrangian Method for Efficient Sparse Reconstruction.
IEEE Signal Process. Lett., 2009

Adaptive importance sampling for value function approximation in off-policy reinforcement learning.
Neural Networks, 2009

On Generalization Performance and Non-Convex Optimization of Extended <i>nu</i>-Support Vector Machine.
New Gener. Comput., 2009

Theory and Algorithm for Learning with Dissimilarity Functions.
Neural Comput., 2009

Pool-based active learning in approximate linear regression.
Mach. Learn., 2009

Lanczos Approximations for the Speedup of Kernel Partial Least Squares Regression.
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009

A Least-squares Approach to Direct Importance Estimation.
J. Mach. Learn. Res., 2009

Direct Density Ratio Estimation for Large-scale Covariate Shift Adaptation.
J. Inf. Process., 2009

A Density-ratio Framework for Statistical Data Processing.
IPSJ Trans. Comput. Vis. Appl., 2009

Direct Importance Estimation with Gaussian Mixture Models.
IEICE Trans. Inf. Syst., 2009

On Computational Issues of Semi-Supervised Local Fisher Discriminant Analysis.
IEICE Trans. Inf. Syst., 2009

Recent Advances and Trends in Large-Scale Kernel Methods.
IEICE Trans. Inf. Syst., 2009

Efficient Construction of Neighborhood Graphs by the Multiple Sorting Method
CoRR, 2009

Mutual information estimation reveals global associations between stimuli and biological processes.
BMC Bioinform., 2009

Simultaneous inference of biological networks of multiple species from genome-wide data and evolutionary information: a semi-supervised approach.
Bioinform., 2009

Change-Point Detection in Time-Series Data by Direct Density-Ratio Estimation.
Proceedings of the SIAM International Conference on Data Mining, 2009

Link Propagation: A Fast Semi-supervised Learning Algorithm for Link Prediction.
Proceedings of the SIAM International Conference on Data Mining, 2009

Efficient Sample Reuse in EM-Based Policy Search.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2009

Analysis of Variational Bayesian Matrix Factorization.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2009

Mutual information approximation via maximum likelihood estimation of density ratio.
Proceedings of the IEEE International Symposium on Information Theory, 2009

Probabilistic principal component analysis based on JoyStick Probability Selector.
Proceedings of the International Joint Conference on Neural Networks, 2009

Active Policy Iteration: Efficient Exploration through Active Learning for Value Function Approximation in Reinforcement Learning.
Proceedings of the IJCAI 2009, 2009

Estimating Squared-Loss Mutual Information for Independent Component Analysis.
Proceedings of the Independent Component Analysis and Signal Separation, 2009

Least absolute policy iteration for robust value function approximation.
Proceedings of the 2009 IEEE International Conference on Robotics and Automation, 2009

Covariate shift adaptation for semi-supervised speaker identification.
Proceedings of the IEEE International Conference on Acoustics, 2009

A Framework of Adaptive Brain Computer Interfaces.
Proceedings of the 2nd International Conference on BioMedical Engineering and Informatics, 2009

Efficient data reuse in value function approximation.
Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2009

Density Ratio Estimation: A New Versatile Tool for Machine Learning.
Proceedings of the Advances in Machine Learning, 2009

2008
A Multipurpose Linear Component Analysis Method Based on Modulated Hebb-Oja Learning Rule.
IEEE Signal Process. Lett., 2008

A batch ensemble approach to active learning with model selection.
Neural Networks, 2008

Approximating Mutual Information by Maximum Likelihood Density Ratio Estimation.
Proceedings of the Third Workshop on New Challenges for Feature Selection in Data Mining and Knowledge Discovery, 2008

Approximating the Best Linear Unbiased Estimator of Non-Gaussian Signals with Gaussian Noise.
IEICE Trans. Inf. Syst., 2008

Geodesic Gaussian kernels for value function approximation.
Auton. Robots, 2008

Active Learning with Model Selection in Linear Regression.
Proceedings of the SIAM International Conference on Data Mining, 2008

Integration of Multiple Networks for Robust Label Propagation.
Proceedings of the SIAM International Conference on Data Mining, 2008

Pool-Based Agnostic Experiment Design in Linear Regression.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2008

Efficient Direct Density Ratio Estimation for Non-stationarity Adaptation and Outlier Detection.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Order Retrieval.
Proceedings of the Large-Scale Knowledge Resources. Construction and Application, 2008

<i>nu</i>-support vector machine as conditional value-at-risk minimization.
Proceedings of the Machine Learning, 2008

Inlier-Based Outlier Detection via Direct Density Ratio Estimation.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

On the Margin Explanation of Boosting Algorithms.
Proceedings of the 21st Annual Conference on Learning Theory, 2008

Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

2007
Covariate Shift Adaptation by Importance Weighted Cross Validation.
J. Mach. Learn. Res., 2007

Dimensionality Reduction of Multimodal Labeled Data by Local Fisher Discriminant Analysis.
J. Mach. Learn. Res., 2007

Generalization Error Estimation for Non-linear Learning Methods.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2007

A New Meta-Criterion for Regularized Subspace Information Criterion.
IEICE Trans. Inf. Syst., 2007

Analytic Optimization of Adaptive Ridge Parameters Based on Regularized Subspace Information Criterion.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2007

Influence-based collaborative active learning.
Proceedings of the 2007 ACM Conference on Recommender Systems, 2007

Direct Importance Estimation with Model Selection and Its Application to Covariate Shift Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Multi-Task Learning via Conic Programming.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Value Function Approximation on Non-Linear Manifolds for Robot Motor Control.
Proceedings of the 2007 IEEE International Conference on Robotics and Automation, 2007

Asymptotic Bayesian generalization error when training and test distributions are different.
Proceedings of the Machine Learning, 2007

2006
Active Learning in Approximately Linear Regression Based on Conditional Expectation of Generalization Error.
J. Mach. Learn. Res., 2006

In Search of Non-Gaussian Components of a High-Dimensional Distribution.
J. Mach. Learn. Res., 2006

Analytic Optimization of Shrinkage Parameters Based on Regularized Subspace Information Criterion.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2006

Constructing Kernel Functions for Binary Regression.
IEICE Trans. Inf. Syst., 2006

Model Selection Using a Class of Kernels with an Invariant Metric.
Proceedings of the Structural, 2006

Mixture Regression for Covariate Shift.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Local Fisher discriminant analysis for supervised dimensionality reduction.
Proceedings of the Machine Learning, 2006

Obtaining the Best Linear Unbiased Estimator of Noisy Signals by Non-Gaussian Component Analysis.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

A Novel Dimension Reduction Procedure for Searching Non-Gaussian Subspaces.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2006

Importance-Weighted Cross-Validation for Covariate Shift.
Proceedings of the Pattern Recognition, 2006

2005
Active Learning for Misspecified Models.
Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005

Non-Gaussian Component Analysis: a Semi-parametric Framework for Linear Dimension Reduction.
Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005

Model Selection Under Covariate Shift.
Proceedings of the Artificial Neural Networks: Formal Models and Their Applications, 2005

2004
Trading Variance Reduction with Unbiasedness: The Regularized Subspace Information Criterion for Robust Model Selection in Kernel Regression.
Neural Comput., 2004

Estimating the error at given test input points for linear regression.
Proceedings of the IASTED International Conference on Neural Networks and Computational Intelligence, 2004

Regularizing generalization error estimators: a novel approach to robust model selection.
Proceedings of the 12th European Symposium on Artificial Neural Networks, 2004

2003
Improving Precision of the Subspace Information Criterion.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2003

2002
Subspace information criterion for nonquadratic regularizers-Model selection for sparse regressors.
IEEE Trans. Neural Networks, 2002

A unified method for optimizing linear image restoration filters.
Signal Process., 2002

Optimal design of regularization term and regularization parameter by subspace information criterion.
Neural Networks, 2002

Theoretical and Experimental Evaluation of the Subspace Information Criterion.
Mach. Learn., 2002

The Subspace Information Criterion for Infinite Dimensional Hypothesis Spaces.
J. Mach. Learn. Res., 2002

Selecting Ridge Parameters in Infinite Dimensional Hypothesis Spaces.
Proceedings of the Artificial Neural Networks, 2002

2001
Properties of incremental projection learning.
Neural Networks, 2001

Incremental projection learning for optimal generalization.
Neural Networks, 2001

Incremental Active Learning for Optimal Generalization.
Neural Comput., 2001

Subspace Information Criterion for Model Selection.
Neural Comput., 2001

2000
Incremental Active Learning with Bias Reduction.
Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks, 2000

A new information criterion for the selection of subspace models.
Proceedings of the 8th European Symposium on Artificial Neural Networks, 2000

1999
Training Data Selection for Optimal Generalization in Trigonometric Polynomial Networks.
Proceedings of the Advances in Neural Information Processing Systems 12, [NIPS Conference, Denver, Colorado, USA, November 29, 1999


  Loading...