The final publication is available at Springer via http://dx.doi.org/ 10.1007/s10462-016-9505-7.The evaluation of artificial intelligence systems and components is crucial for the progress of the discipline. In this paper we describe and critically assess the different ways AI systems are evaluated, and the role of components and techniques in these systems. We first focus on the traditional task-oriented evaluation approach. We identify three kinds of evaluation: human discrimination, problem benchmarks and peer confrontation. We describe some of the limitations of the many evaluation schemes and competitions in these three categories, and follow the progression of some of these tests. We then focus on a less customary (and challen...
We report on a series of new platforms and events dealing with AI evaluation that may change the way...
We report on a series of new platforms and events dealing with AI evaluation that may change the way...
The final publication is available at Springer via http://dx.doi.org/10.1007/s13218-015-0361-4In re...
The final publication is available at Springer via http://dx.doi.org/ 10.1007/s10462-016-9505-7.The...
Artificial intelligence develops techniques and systems whose performance must be evaluated on a reg...
Artificial intelligence develops techniques and systems whose performance must be evaluated on a reg...
Today, available methods that assess AI systems are focused on using empirical techniques to measure...
Artificial General Intelligence seeks to create an artificial system capable of solving many differe...
In this paper we apply the recent notion of anytime universal intelligence tests to the evaluation o...
Thesis: S.M. in Engineering and Management, Massachusetts Institute of Technology, School of Enginee...
Abstract—Artificial intelligence (AI) is having a deep impact on the way humans work, communicate an...
This is the author’s version of a work that was accepted for publication in Artificial Intelligence....
Comparing humans and machines is one important source of information about both machine and human s...
The hereby article is to present the notions of two concepts: human and artificial intelligence. The...
We present and develop the notion of ‘universal psychometrics’ as a subject of study, and eventuall...
We report on a series of new platforms and events dealing with AI evaluation that may change the way...
We report on a series of new platforms and events dealing with AI evaluation that may change the way...
The final publication is available at Springer via http://dx.doi.org/10.1007/s13218-015-0361-4In re...
The final publication is available at Springer via http://dx.doi.org/ 10.1007/s10462-016-9505-7.The...
Artificial intelligence develops techniques and systems whose performance must be evaluated on a reg...
Artificial intelligence develops techniques and systems whose performance must be evaluated on a reg...
Today, available methods that assess AI systems are focused on using empirical techniques to measure...
Artificial General Intelligence seeks to create an artificial system capable of solving many differe...
In this paper we apply the recent notion of anytime universal intelligence tests to the evaluation o...
Thesis: S.M. in Engineering and Management, Massachusetts Institute of Technology, School of Enginee...
Abstract—Artificial intelligence (AI) is having a deep impact on the way humans work, communicate an...
This is the author’s version of a work that was accepted for publication in Artificial Intelligence....
Comparing humans and machines is one important source of information about both machine and human s...
The hereby article is to present the notions of two concepts: human and artificial intelligence. The...
We present and develop the notion of ‘universal psychometrics’ as a subject of study, and eventuall...
We report on a series of new platforms and events dealing with AI evaluation that may change the way...
We report on a series of new platforms and events dealing with AI evaluation that may change the way...
The final publication is available at Springer via http://dx.doi.org/10.1007/s13218-015-0361-4In re...