José Hernández-Orallo

Orcid: 0000-0001-9746-7632

Affiliations:
  • Polytechnic University of Valencia, Spain


According to our database1, José Hernández-Orallo authored at least 192 papers between 1998 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
When Redundancy Matters: Machine Teaching of Representations.
CoRR, 2024

Your Prompt Is My Command: On Assessing the Human-Centred Generality of Multimodal Models (Abstract Reprint).
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Heuristic search of optimal machine teaching curricula.
Mach. Learn., October, 2023

Training Data Scientists Through Project-Based Learning.
Rev. Iberoam. de Tecnol. del Aprendiz., August, 2023

Can language models automate data wrangling?
Mach. Learn., June, 2023

Trends in AI inference energy consumption: Beyond the performance-vs-parameter laws of deep learning.
Sustain. Comput. Informatics Syst., April, 2023

Your Prompt is My Command: On Assessing the Human-Centred Generality of Multimodal Models.
J. Artif. Intell. Res., 2023

Animal-AI 3: What's New & Why You Should Care.
CoRR, 2023

Evaluating General-Purpose AI with Psychometrics.
CoRR, 2023

An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI.
CoRR, 2023

Predictable Artificial Intelligence.
CoRR, 2023

Inferring Capabilities from Task Performance with Bayesian Triangulation.
CoRR, 2023

Revealing the structure of language model capabilities.
CoRR, 2023

Dialectical language model evaluation: An initial appraisal of the commonsense spatial reasoning abilities of LLMs.
CoRR, 2023

XAI with Machine Teaching When Humans Are (Not) Informed About the Irrelevant Features.
Proceedings of the Machine Learning and Knowledge Discovery in Databases: Research Track, 2023

Adversarial Benchmark Evaluation Rectified by Controlling for Difficulty.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

2022
Automating data science.
Commun. ACM, 2022

Heterogeneity Breaks the Game: Evaluating Cooperation-Competition with Multisets of Agents.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

Reject Before You Run: Small Assessors Anticipate Big Language Models.
Proceedings of the Workshop on AI Evaluation Beyond Metrics co-located with the 31st International Joint Conference on Artificial Intelligence (IJCAI-ECAI 2022), 2022

Evaluating Object Permanence in Embodied Agents using the Animal-AI Environment.
Proceedings of the Workshop on AI Evaluation Beyond Metrics co-located with the 31st International Joint Conference on Artificial Intelligence (IJCAI-ECAI 2022), 2022

Measuring the Occupational Impact of AI: Tasks, Cognitive Abilities and AI Benchmarks (Extended Abstract).
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Non-Cheating Teaching Revisited: A New Probabilistic Machine Teaching Model.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

A Framework for Categorising AI Evaluation Instruments.
Proceedings of the Workshop on AI Evaluation Beyond Metrics co-located with the 31st International Joint Conference on Artificial Intelligence (IJCAI-ECAI 2022), 2022

Not a Number: Identifying Instance Features for Capability-Oriented Evaluation.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

When AI Difficulty Is Easy: The Explanatory Power of Predicting IRT Difficulty.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Training on the Test Set: Mapping the System-Problem Space in AI.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

How General-Purpose Is a Language Model? Usefulness and Safety with Human Prompters in the Wild.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Teaching and Explanation: Aligning Priors between Machines and Humans.
Proceedings of the Human-Like Machine Intelligence., 2022

2021
CRISP-DM Twenty Years Later: From Data Mining Processes to Data Science Trajectories.
IEEE Trans. Knowl. Data Eng., 2021

Futures of artificial intelligence through technology readiness levels.
Telematics Informatics, 2021

AUTOMAT[R]IX: learning simple matrix pipelines.
Mach. Learn., 2021

Measuring the Occupational Impact of AI: Tasks, Cognitive Abilities and AI Benchmarks.
J. Artif. Intell. Res., 2021

Missing the missing values: The ugly duckling of fairness in machine learning.
Int. J. Intell. Syst., 2021

Editor's Note.
Int. J. Interact. Multim. Artif. Intell., 2021

Compute and Energy Consumption Trends in Deep Learning Inference.
CoRR, 2021

Conditional Teaching Size.
CoRR, 2021

Automating Data Science: Prospects and Challenges.
CoRR, 2021

Making sense of sensory input.
Artif. Intell., 2021

Optimal Teaching Curricula with Compositional Simplicity Priors.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2021

Think Big, Teach Small: Do Language Models Distil Occam's Razor?
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Muppets: Multipurpose Table Segmentation.
Proceedings of the Advances in Intelligent Data Analysis XIX, 2021

Project-Based Learning for Scaffolding Data Scientists' Skills.
Proceedings of the 16th International Conference on Computer Science & Education, 2021

Negative Side Effects and AI Agent Indicators: Experiments in SafeLife.
Proceedings of the Workshop on Artificial Intelligence Safety 2021 (SafeAI 2021) co-located with the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2021), 2021

2020
Dual Indicators to Analyze AI Benchmarks: Difficulty, Discrimination, Ability, and Generality.
IEEE Trans. Games, 2020

Twenty Years Beyond the Turing Test: Moving Beyond the Human Judges Too.
Minds Mach., 2020

Learning alternative ways of performing a task.
Expert Syst. Appl., 2020

Evaluating the Apperception Engine.
CoRR, 2020

The Association for the Advancement of Artificial Intelligence 2020 Workshop Program.
AI Mag., 2020

Tracking AI: The Capability Is (Not) Near.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August-8 September 2020, Santiago de Compostela, Spain, August 29 - September 8, 2020, 2020

AI Paradigms and AI Safety: Mapping Artefacts and Techniques to Safety Issues.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August-8 September 2020, Santiago de Compostela, Spain, August 29 - September 8, 2020, 2020

Finite and Confident Teaching in Expectation: Sampling from Infinite Concept Classes.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August-8 September 2020, Santiago de Compostela, Spain, August 29 - September 8, 2020, 2020

Family and Prejudice: A Behavioural Taxonomy of Machine Learning Techniques.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August-8 September 2020, Santiago de Compostela, Spain, August 29 - September 8, 2020, 2020

AI Safety Landscape From short-term specific system engineering to long-term artificial general intelligence.
Proceedings of the 50th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops, 2020

Does AI Qualify for the Job?: A Bidirectional Model Mapping Labour and AI Intensities.
Proceedings of the AIES '20: AAAI/ACM Conference on AI, 2020

Exploring AI Safety in Degrees: Generality, Capability and Control.
Proceedings of the Workshop on Artificial Intelligence Safety, 2020

2019
Gazing into Clever Hans machines.
Nat. Mach. Intell., 2019

The teaching size: computable teachers and learners for universal languages.
Mach. Learn., 2019

AI Generality and Spearman's Law of Diminishing Returns.
J. Artif. Intell. Res., 2019

Setting decision thresholds when operating conditions are uncertain.
Data Min. Knowl. Discov., 2019

The Animal-AI Environment: Training and Testing Animal-Like Artificial Cognition.
CoRR, 2019

Fairness and Missing Values.
CoRR, 2019

Reports of the Workshops Held at the 2019 AAAI Conference on Artificial Intelligence.
AI Mag., 2019

Item response theory in AI: Analysing machine learning classifiers at the instance level.
Artif. Intell., 2019

BK-ADAPT: Dynamic Background Knowledge for Automating Data Transformation.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

Automated Data Transformation with Inductive Programming and Dynamic Background Knowledge.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

Automating Common Data Science Matrix Transformations.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

The Animal-AI Testbed and Competition.
Proceedings of the NeurIPS 2019 Competition and Demonstration Track, 2019

AI Extenders: The Ethical and Societal Implications of Humans Cognitively Extended by AI.
Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, 2019

Surveying Safety-relevant AI Characteristics.
Proceedings of the Workshop on Artificial Intelligence Safety 2019 co-located with the Thirty-Third AAAI Conference on Artificial Intelligence 2019 (AAAI-19), 2019

2018
Analysing Results from AI Benchmarks: Key Indicators and How to Obtain Them.
CoRR, 2018

General-purpose Declarative Inductive Programming with Domain-Specific Background Knowledge for Data Wrangling Automation.
CoRR, 2018

A multidisciplinary task-based perspective for evaluating the impact of AI autonomy and generality on the future of work.
CoRR, 2018

Assessing the impact of machine intelligence on human behaviour: an interdisciplinary endeavour.
CoRR, 2018

Accounting for the Neglected Dimensions of AI Progress.
CoRR, 2018

Finite Biased Teaching with Infinite Concept Classes.
CoRR, 2018

The Facets of Artificial Intelligence: A Framework to Track the Evolution of AI.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Identifying the Machine Learning Family from Black-Box Models.
Proceedings of the Advances in Artificial Intelligence, 2018

2017
CASP-DM: Context Aware Standard Process for Data Mining.
CoRR, 2017

A computational analysis of general intelligence tests for evaluating cognitive development.
Cogn. Syst. Res., 2017

Evaluation in artificial intelligence: from task-oriented to ability-oriented measurement.
Artif. Intell. Rev., 2017

A New AI Evaluation Cosmos: Ready to Play the Game?
AI Mag., 2017

Modelling Machine Learning Models.
Proceedings of the Philosophy and Theory of Artificial Intelligence 2017, 2017


Knowledge Extraction from Task Narratives.
Proceedings of the 4th international Workshop on Sensor-based Activity Recognition and Interaction, 2017

Low-level Event Detection System for Minimally-Invasive Surgery Training.
Proceedings of the 4th international Workshop on Sensor-based Activity Recognition and Interaction, 2017

Computer Models Solving Intelligence Test Problems: Progress and Implications (Extended Abstract).
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

The Measure of All Minds: Evaluating Natural and Artificial Intelligence
Cambridge University Press, ISBN: 9781316594179, 2017

2016
Binarised regression tasks: methods and evaluation metrics.
Data Min. Knowl. Discov., 2016

Reframing in context: A systematic approach for model reuse in machine learning.
AI Commun., 2016

Computer models solving intelligence test problems: Progress and implications.
Artif. Intell., 2016

Making Sense of Item Response Theory in Machine Learning.
Proceedings of the ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands, 2016

Is Spearman's Law of Diminishing Returns (SLODR) Meaningful for Artificial Agents?
Proceedings of the ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands, 2016

2015
Derek Partridge: What Makes You Clever: The Puzzle of Intelligence - World Scientific, 2013, xvi+447, $25.00, ISBN: 978-981-4513.
Minds Mach., 2015

Can Machine Intelligence be Measured in the Same Way as Human intelligence?
Künstliche Intell., 2015

Approaches and Applications of Inductive Programming (Dagstuhl Seminar 15442).
Dagstuhl Reports, 2015

Forgetting and consolidation for incremental and cumulative knowledge acquisition systems.
CoRR, 2015

Universal Psychometrics Tasks: difficulty, composition and decomposition.
CoRR, 2015

Inductive programming meets the real world.
Commun. ACM, 2015

Knowledge acquisition with forgetting: an incremental and developmental setting.
Adapt. Behav., 2015

On environment difficulty and discriminating power.
Auton. Agents Multi Agent Syst., 2015

Multidimensional Prediction Models When the Resolution Context Changes.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2015

Pentaho + R: An Integral View for Multidimensional Prediction Models.
Proceedings of the Advances in Artificial Intelligence, 2015

Instrumental Properties of Social Testbeds.
Proceedings of the Artificial General Intelligence, 2015

C-Tests Revisited: Back and Forth with Complexity.
Proceedings of the Artificial General Intelligence, 2015

Stochastic Tasks: Difficulty and Levin Search.
Proceedings of the Artificial General Intelligence, 2015

2014
Probabilistic Reframing for Cost-Sensitive Regression.
ACM Trans. Knowl. Discov. Data, 2014

Aggregative quantification for regression.
Data Min. Knowl. Discov., 2014

Definition and properties to assess multi-agent environments as social intelligence tests.
CoRR, 2014

A note about the generalisation of the $C$-tests.
CoRR, 2014

AI Evaluation: past, present and future.
CoRR, 2014

Universal psychometrics: Measuring cognitive abilities in the machine kingdom.
Cogn. Syst. Res., 2014

Bridging the Gap between Distance and Generalization.
Comput. Intell., 2014

How universal can an intelligence test be?
Adapt. Behav., 2014

A Knowledge Growth and Consolidation Framework for Lifelong Machine Learning Systems.
Proceedings of the 13th International Conference on Machine Learning and Applications, 2014

2013
ROC curves for regression.
Pattern Recognit., 2013

ROC curves in cost space.
Mach. Learn., 2013

On Potential Cognitive Abilities in the Machine Kingdom.
Minds Mach., 2013

Test cost and misclassification cost trade-off using reframing
CoRR, 2013

On the universality of cognitive tests
CoRR, 2013

A short note on estimating intelligence from user profiles in the context of universal psychometrics: prospects and caveats
CoRR, 2013

Complexity distribution of agent policies
CoRR, 2013

On the definition of a general learning system with user-defined operators.
CoRR, 2013

On the effect of calibration in classifier combination.
Appl. Intell., 2013

2012
A unified view of performance metrics: translating threshold choice into expected classification loss.
J. Mach. Learn. Res., 2012

Soft (Gaussian CDE) regression models and loss functions
CoRR, 2012

On the influence of intelligence in (social) intelligence testing environments
CoRR, 2012

Learning with Configurable Operators and RL-Based Heuristics.
Proceedings of the New Frontiers in Mining Complex Patterns - First International Workshop, 2012

Turing Tests with Turing Machines.
Proceedings of the Turing-100, 2012

On Measuring Social Intelligence: Experiments on Competition and Cooperation.
Proceedings of the Artificial General Intelligence - 5th International Conference, 2012

2011
Threshold Choice Methods: the Missing Link
CoRR, 2011

Application of distances between terms for flat and hierarchical data
CoRR, 2011

Analysis of first prototype universal intelligence tests: evaluating and comparing AI algorithms and humans
CoRR, 2011

Technical Note: Towards ROC Curves in Cost Space
CoRR, 2011

An architecture for the evaluation of intelligent systems
CoRR, 2011

Using negotiable features for prescription problems.
Computing, 2011

Brier Curves: a New Cost-Based Visualisation of Classifier Performance.
Proceedings of the 28th International Conference on Machine Learning, 2011

A Coherent Interpretation of AUC as a Measure of Aggregated Classification Performance.
Proceedings of the 28th International Conference on Machine Learning, 2011

Evaluating a Reinforcement Learning Algorithm with a General Intelligence Test.
Proceedings of the Advances in Artificial Intelligence, 2011

Comparing Humans and AI Agents.
Proceedings of the Artificial General Intelligence - 4th International Conference, 2011

On More Realistic Environment Distributions for Defining, Evaluating and Developing Intelligence.
Proceedings of the Artificial General Intelligence - 4th International Conference, 2011

Compression and Intelligence: Social Environments and Communication.
Proceedings of the Artificial General Intelligence - 4th International Conference, 2011

Applying distances between terms to both flat and hierarchical data.
Proceedings of AAIP 2011, 2011

2010
Annotated English
CoRR, 2010

Measuring universal intelligence: Towards an anytime intelligence test.
Artif. Intell., 2010

Data Mining Strategies for CRM Negotiation Prescription Problems.
Proceedings of the Trends in Applied Intelligent Systems, 2010

Quantification via Probability Estimators.
Proceedings of the ICDM 2010, 2010

An Integrated Distance for Atoms.
Proceedings of the Functional and Logic Programming, 10th International Symposium, 2010

Newton Trees.
Proceedings of the AI 2010: Advances in Artificial Intelligence, 2010

2009
An experimental comparison of performance measures for classification.
Pattern Recognit. Lett., 2009

An Instantiation of Hierarchical Distance-Based Conceptual Clustering for Propositional Learning.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2009

Similarity-Binning Averaging: A Generalisation of Binning Calibration.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2009

Negotiation with Price-dependent Probability Models.
Proceedings of the Second Workshop on Agreement Technologies, 2009

Generalisation Operators for Lists Embedded in a Metric Space.
Proceedings of the Approaches and Applications of Inductive Programming, 2009

2008
Hierarchical Distance-Based Conceptual Clustering.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2008

2007
Joint Cutoff Probabilistic Estimation Using Simulation: A Mailing Campaign Application.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2007

2006
Similarity Functions for Structured Data. An Application to Decision Trees.
Inteligencia Artif., 2006

The Mimetic Technique when the Original Data is Not Available: Model Learning and Revision.
Inteligencia Artif., 2006

Minimal Distance-Based Generalisation Operators for First-Order Objects.
Proceedings of the Inductive Logic Programming, 16th International Conference, 2006

2005
Web Categorisation Using Distance-Based Decision Trees.
Proceedings of the International Workshop on Automated Specification and Verification of Web Sites, 2005

Distance Based Generalisation.
Proceedings of the Inductive Logic Programming, 15th International Conference, 2005

Knowledge acquisition through machine learning: minimising expert's effort.
Proceedings of the Fourth International Conference on Machine Learning and Applications, 2005

Knowledge Discovery from Databases.
Proceedings of the Encyclopedia of Database Technologies and Applications, 2005

Data Warehousing and OLAP.
Proceedings of the Encyclopedia of Database Technologies and Applications, 2005

2004
The 1st workshop on ROC analysis in artificial intelligence (ROCAI-2004).
SIGKDD Explor., 2004

Cautious Classifiers.
Proceedings of the ROC Analysis in Artificial Intelligence, 1st International Workshop, 2004

Bagging Decision Multi-trees.
Proceedings of the Multiple Classifier Systems, 5th International Workshop, 2004

Delegating classifiers.
Proceedings of the Machine Learning, 2004

Analysing the Trade-Off Between Comprehensibility and Accuracy in Mimetic Models.
Proceedings of the Discovery Science, 7th International Conference, 2004

2003
Cost-sensitive diagnosis of declarative programs.
Proceedings of the 12th International Workshop on Functional and Constraint Logic Programming, 2003

Simple Mimetic Classifiers.
Proceedings of the Machine Learning and Data Mining in Pattern Recognition, 2003

Beam Search Extraction and Forgetting Strategies on Shared Ensembles.
Proceedings of the Multiple Classifier Systems, 4th International Workshop, 2003

Volume under the ROC Surface for Multi-class Problems.
Proceedings of the Machine Learning: ECML 2003, 2003

Improving the AUC of Probabilistic Estimation Trees.
Proceedings of the Machine Learning: ECML 2003, 2003

2002
SMILES: A Multi-purpose Learning System.
Proceedings of the Logics in Artificial Intelligence, European Conference, 2002

Learning Decision Trees Using the Area Under the ROC Curve.
Proceedings of the Machine Learning, 2002

Induction of Decision Multi-trees Using Levin Search.
Proceedings of the Computational Science - ICCS 2002, 2002

Shared Ensemble Learning Using Multi-trees.
Proceedings of the Advances in Artificial Intelligence, 2002

From Ensemble Methods to Comprehensible Models.
Proceedings of the Discovery Science, 5th International Conference, 2002

2001
Predictive Software.
Autom. Softw. Eng., 2001

Incremental Learning of Functional Logic Programs.
Proceedings of the Functional and Logic Programming, 5th International Symposium, 2001

AND/OR trees for the learning of functional logic programs.
Proceedings of the APPIA-GULP-PRODE 2001: Joint Conference on Declarative Programming, 2001

2000
Beyond the Turing Test.
J. Log. Lang. Inf., 2000

Constructive reinforcement learning.
Int. J. Intell. Syst., 2000

Thesis: Computational measures of information gain and reinforcement in inference processes.
AI Commun., 2000

Truth from Trash. How Learning Makes Sense by Chris Thornton.
Artif. Intell., 2000

Aprendizaje Automatico de Programas Logico-Funcionales.
Inteligencia Artif., 2000

The role of induction in (semi-)automated software life-cycles.
Proceedings of the 9th International Workshop on Functional and Logic Programming, 2000

Learning functional logic classification concepts from databases.
Proceedings of the 9th International Workshop on Functional and Logic Programming, 2000

Universal Learning of Classes from Sparse and Non-uniform Evidence.
Proceedings of the Inductive Logic Programming, 10th International Conference, 2000

Software as Learning: Quality Factors and Life-Cycle Revised.
Proceedings of the Fundamental Approaches to Software Engineering, 2000

1999
A Strong Complete Schmema for Inductive Functional Logic Programming.
Proceedings of the Inductive Logic Programming, 9th International Workshop, 1999

1998
Inverse Narrowing for the Induction of Functional Logic Programs.
Proceedings of the 1998 Joint Conference on Declarative Programming, 1998


  Loading...