Sepp Hochreiter

Orcid: 0000-0001-7449-2528

Affiliations:
  • Johannes Kepler University of Linz, Austria


According to our database1, Sepp Hochreiter authored at least 124 papers between 1994 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Geometry-Informed Neural Networks.
CoRR, 2024

Overcoming Saturation in Density Ratio Estimation by Iterated Regularization.
CoRR, 2024

MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations.
CoRR, 2024

SymbolicAI: A framework for logic-based approaches combining generative models and solvers.
CoRR, 2024

Contrastive Tuning: A Little Help to Make Masked Autoencoders Forget.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Deep Reinforcement Learning for Optimization at Early Design Stages.
IEEE Des. Test, February, 2023

Txt2Img-MHN: Remote Sensing Image Generation From Text Using Modern Hopfield Networks.
IEEE Trans. Image Process., 2023

Introducing an Improved Information-Theoretic Measure of Predictive Uncertainty.
CoRR, 2023

Functional trustworthiness of AI systems by statistically valid testing.
CoRR, 2023

SITTA: A Semantic Image-Text Alignment for Image Captioning.
CoRR, 2023

Semantic HELM: An Interpretable Memory for Reinforcement Learning.
CoRR, 2023

G-Signatures: Global Graph Propagation With Randomized Signatures.
CoRR, 2023

Quantification of Uncertainty with Adversarial Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning to Modulate pre-trained Models in RL.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Variational Annealing on Graphs for Combinatorial Optimization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Semantic HELM: A Human-Readable Memory for Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Conformal Prediction for Time Series with Modern Hopfield Networks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Enhancing Activity Prediction Models in Drug Discovery with the Ability to Understand Human Language.
Proceedings of the International Conference on Machine Learning, 2023

Context-enriched molecule representations improve few-shot drug discovery.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Addressing Parameter Choice Issues in Unsupervised Domain Adaptation by Aggregation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Boundary Graph Neural Networks for 3D Simulations.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Unconstrained generation of synthetic antibody-antigen structures to guide machine learning methodology for antibody specificity prediction.
Nat. Comput. Sci., 2022

Domain Shifts in Machine Learning Based Covid-19 Diagnosis From Blood Tests.
J. Medical Syst., 2022

Improving Few- and Zero-Shot Reaction Template Prediction Using Modern Hopfield Networks.
J. Chem. Inf. Model., 2022

Entangled Residual Mappings.
CoRR, 2022

Hopular: Modern Hopfield Networks for Tabular Data.
CoRR, 2022

Toward a broad AI.
Commun. ACM, 2022

CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution.
Proceedings of the International Conference on Machine Learning, 2022

History Compression via Language Models in Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

Reactive Exploration to Cope With Non-Stationarity in Lifelong Reinforcement Learning.
Proceedings of the Conference on Lifelong Learning Agents, 2022

A Dataset Perspective on Offline Reinforcement Learning.
Proceedings of the Conference on Lifelong Learning Agents, 2022

Few-Shot Learning by Dimensionality Reduction in Gradient Space.
Proceedings of the Conference on Lifelong Learning Agents, 2022

The Landslide4Sense Competition 2022.
Proceedings of the Second Workshop on Complex Data Challenges in Earth Observation (CDCEO 2022) co-located with 31st International Joint Conference on Artificial Intelligence and the 25th European Conference on Artificial Intelligence (IJCAI-ECAI 2022), 2022

2021
Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER.
Trans. Large Scale Data Knowl. Centered Syst., 2021

The Promise of AI for DILI Prediction.
Frontiers Artif. Intell., 2021

Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning.
CoRR, 2021

CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP.
CoRR, 2021

Learning 3D Granular Flow Simulations.
CoRR, 2021

Modern Hopfield Networks for Few- and Zero-Shot Reaction Prediction.
CoRR, 2021

Trusted Artificial Intelligence: Towards Certification of Machine Learning Applications.
CoRR, 2021

Traffic4cast at NeurIPS 2022 – Predict Dynamics along Graph Edges from Sparse Node Data: Whole City Traffic and ETA from Stationary Vehicle Detectors.
Proceedings of the NeurIPS 2022 Competition Track, 2021




MC-LSTM: Mass-Conserving LSTM.
Proceedings of the 38th International Conference on Machine Learning, 2021

Hopfield Networks is All You Need.
Proceedings of the 9th International Conference on Learning Representations, 2021

CDCEO'21 - First Workshop on Complex Data Challenges in Earth Observation.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

High-resolution multi-channel weather forecasting - First insights on transfer learning from the Weather4cast Competitions 2021.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

2020
Industry-scale application and evaluation of deep learning for drug target prediction.
J. Cheminformatics, 2020

Uncertainty Estimation with Deep Learning for Rainfall-Runoff Modelling.
CoRR, 2020

Rainfall-Runoff Prediction at Multiple Timescales with a Single Long Short-Term Memory Network.
CoRR, 2020

Cross-Domain Few-Shot Learning by Representation Fusion.
CoRR, 2020

Hopfield Networks is All You Need.
CoRR, 2020

Large-scale ligand-based virtual screening for SARS-CoV-2 inhibitors using deep neural networks.
CoRR, 2020

Modern Hopfield Networks and Attention for Immune Repertoire Classification.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Traffic4cast at NeurIPS 2020 ? yet more on theunreasonable effectiveness of gridded geo-spatial processes.
Proceedings of the NeurIPS 2020 Competition and Demonstration Track, 2020

Cost Optimization at Early Stages of Design Using Deep Reinforcement Learning.
Proceedings of the MLCAD '20: 2020 ACM/IEEE Workshop on Machine Learning for CAD, 2020

XAI and Strategy Extraction via Reward Redistribution.
Proceedings of the xxAI - Beyond Explainable AI, 2020

2019
Interpretable Deep Learning in Drug Discovery.
Proceedings of the Explainable AI: Interpreting, 2019

NeuralHydrology - Interpreting LSTMs in Hydrology.
Proceedings of the Explainable AI: Interpreting, 2019

Visual Scene Understanding for Autonomous Driving Using Semantic Segmentation.
Proceedings of the Explainable AI: Interpreting, 2019

Explaining and Interpreting LSTMs.
Proceedings of the Explainable AI: Interpreting, 2019

Machine Learning in Drug Discovery.
J. Chem. Inf. Model., 2019

Accurate Prediction of Biological Assays with High-Throughput Microscopy Images and Convolutional Networks.
J. Chem. Inf. Model., 2019

Detecting cutaneous basal cell carcinomas in ultra-high resolution and weakly labelled histopathological images.
CoRR, 2019

Using LSTMs for climate change assessment studies on droughts and floods.
CoRR, 2019

Quantum Optical Experiments Modeled by Long Short-Term Memory.
CoRR, 2019

Patch Refinement - Localized 3D Object Detection.
CoRR, 2019

Explaining and Interpreting LSTMs.
CoRR, 2019

Benchmarking a Catchment-Aware Long Short-Term Memory Network (LSTM) for Large-Scale Hydrological Modeling.
CoRR, 2019

NeuralHydrology - Interpreting LSTMs in Hydrology.
CoRR, 2019

Interpretable Deep Learning in Drug Discovery.
CoRR, 2019

The surprising efficiency of framing geo-spatial time series forecasting as a video prediction task - Insights from the IARAI Traffic4cast Competition at NeurIPS 2019.
Proceedings of the NeurIPS 2019 Competition and Demonstration Track, 2019

RUDDER: Return Decomposition for Delayed Rewards.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Human-level Protein Localization with Convolutional Neural Networks.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
Fréchet ChemNet Distance: A Metric for Generative Models for Molecules in Drug Discovery.
J. Chem. Inf. Model., 2018

RUDDER: Return Decomposition for Delayed Rewards.
CoRR, 2018

Fréchet ChemblNet Distance: A metric for generative models for molecules.
CoRR, 2018

DeepSynergy: predicting anti-cancer drug synergy with Deep Learning.
Bioinform., 2018

First Order Generative Adversarial Networks.
Proceedings of the 35th International Conference on Machine Learning, 2018

Coulomb GANs: Provably Optimal Nash Equilibria via Potential Fields.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
Coulomb GANs: Provably Optimal Nash Equilibria via Potential Fields.
CoRR, 2017

GANs Trained by a Two Time-Scale Update Rule Converge to a Nash Equilibrium.
CoRR, 2017

Rectified factor networks for biclustering of omics data.
Bioinform., 2017

Self-Normalizing Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

2016
Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs).
Proceedings of the 4th International Conference on Learning Representations, 2016

2015
Toxicity Prediction using Deep Learning.
CoRR, 2015

Rectified Factor Networks.
CoRR, 2015

KeBABS: an R package for kernel-based analysis of biological sequences.
Bioinform., 2015

Rchemcpp: a web service for structural analoging in ChEMBL, Drugbank and the Connectivity Map.
Bioinform., 2015

msa: an R package for multiple sequence alignment.
Bioinform., 2015

Rectified Factor Networks.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

2014
Furby: fuzzy force-directed bicluster visualization.
BMC Bioinform., 2014

2013
Computational Methods Aiding Early-Stage Drug Design (Dagstuhl Seminar 13212).
Dagstuhl Reports, 2013

2011
APCluster: an R package for affinity propagation clustering.
Bioinform., 2011

2010
FABIA: factor analysis for bicluster acquisition.
Bioinform., 2010

2009
Modeling Position Specificity in Sequence Kernels by Fuzzy Equivalence Relations.
Proceedings of the Joint 2009 International Fuzzy Systems Association World Congress and 2009 European Society of Fuzzy Logic and Technology Conference, 2009

09081 Summary - Similarity-based learning on structures.
Proceedings of the Similarity-based learning on structures, 15.02. - 20.02.2009, 2009

09081 Abstracts Collection - Similarity-based learning on structures.
Proceedings of the Similarity-based learning on structures, 15.02. - 20.02.2009, 2009

2008
An SMO Algorithm for the Potential Support Vector Machine.
Neural Comput., 2008

2007
I/NI-calls for the exclusion of non-informative genes: a highly effective filtering tool for microarray data.
Bioinform., 2007

Fast model-based protein homology detection without alignment.
Bioinform., 2007

Optimality of LSTD and its Relation to MC.
Proceedings of the International Joint Conference on Neural Networks, 2007

Monaural Speech Separation by Support Vector Machines: Bridging the Divide Between Supervised and Unsupervised Learning Methods.
Proceedings of the Blind Speech Separation, 2007

2006
Support Vector Machines for Dyadic Data.
Neural Comput., 2006

A new summarization method for affymetrix probe level data.
Bioinform., 2006

P-SVM Variable Selection for Discovering Dependencies Between Genetic and Brain Imaging Data.
Proceedings of the International Joint Conference on Neural Networks, 2006

Nonlinear Feature Selection with the Potential Support Vector Machine.
Proceedings of the Feature Extraction - Foundations and Applications, 2006

2002
Feature Selection and Classification on Matrix Data: From Large Margins to Small Covering Numbers.
Proceedings of the Advances in Neural Information Processing Systems 15 [Neural Information Processing Systems, 2002

Coulomb Classifiers: Generalizing Support Vector Machines via an Analogy to Electrostatic Systems.
Proceedings of the Advances in Neural Information Processing Systems 15 [Neural Information Processing Systems, 2002

2001
Learning to Learn Using Gradient Descent.
Proceedings of the Artificial Neural Networks, 2001

A Discrete Probabilistic Memory Model for Discovering Dependencies in Time.
Proceedings of the Artificial Neural Networks, 2001

2000
Beyond Maximum Likelihood and Density Estimation: A Sample-Based Criterion for Unsupervised Learning of Complex Models.
Proceedings of the Advances in Neural Information Processing Systems 13, 2000

1999
Feature Extraction Through LOCOCODE.
Neural Comput., 1999

Nonlinear ICA through low-complexity autoencoders.
Proceedings of the 1999 International Symposium on Circuits and Systems, ISCAS 1999, Orlando, Florida, USA, May 30, 1999

1998
The Vanishing Gradient Problem During Learning Recurrent Neural Nets and Problem Solutions.
Int. J. Uncertain. Fuzziness Knowl. Based Syst., 1998

Source Separation as a By-Product of Regularization.
Proceedings of the Advances in Neural Information Processing Systems 11, [NIPS Conference, Denver, Colorado, USA, November 30, 1998

1997
Flat Minima
Neural Comput., 1997

Long Short-Term Memory.
Neural Comput., 1997

Unsupervised Coding with LOCOCODE.
Proceedings of the Artificial Neural Networks, 1997

1996
LSTM can Solve Hard Long Time Lag Problems.
Proceedings of the Advances in Neural Information Processing Systems 9, 1996

1994
Simplifying Neural Nets by Discovering Flat Minima.
Proceedings of the Advances in Neural Information Processing Systems 7, 1994


  Loading...