This paper describes and tests a method for carrying out quantified reproducibility assessment (QRA) that is based on concepts and definitions from metrology. QRA produces a single score estimating the degree of reproducibility of a given system and evaluation measure, on the basis of the scores from, and differences between, different reproductions. We test QRA on 18 different system and evaluation measure combinations (involving diverse NLP tasks and types of evaluation), for each of which we have the original results and one to seven reproduction results. The proposed QRA method produces degree-of-reproducibility scores that are comparable across multiple reproductions not only of the same, but also of different, original studies. We fin...
One of the challenges in machine learning research is to ensure that presented...
Why are some research studies easy to reproduce while others are difficult? Casting doubt on the acc...
In this paper we report our reproduction study of the Croatian part of an annotation-based human eva...
This paper describes and tests a method for carrying out quantified reproducibility assessment (QRA)...
This paper reports results from a reproduction study in which we repeated the human evaluation of th...
Reproducibility has become an increasingly debated topic in NLP and ML over recent years, but so far...
Against a background of growing interest in reproducibility in NLP and ML, and as part of an ongoing...
Against the background of what has been termed a reproducibility crisis in science, the NLP field is...
Against the background of what has been termed a reproducibility crisis in scie...
We report our efforts in identifying a set of previous human evaluations in NLP that would be suitab...
We report our efforts in identifying a set of previous human evaluations in NLP that would be suitab...
Background: While the term reproducibility crisis mainly reflects reproducibility of experiments bet...
Reproducibility is of utmost concern in machine learning and natural language processing (NLP). In t...
One of the challenges in machine learning research is to ensure that presented and published result...
The reproducibility of scientific discoveries is a hallmark of scientific research. Although its cen...