Automatic data mining is not an easy task and its success in the biodiversity world is deeply tied to the standardization and consistency of scientific journals' layout structure. The various formatting styles found in the over 500 million pages of published biodiversity information (Kalfatovich 2010), pose a remarkable challenge towards the goal of automating the liberation of data currently trapped on the printed page. Regular expressions and other pattern-recognition strategies invariably fail to cope with this diverse landscape of academic publishing. Challenges such as incomplete data and taxonomic uncertainty add several additional layers of complexity.However, in the era of big data, the liberation of all the different facts containe...
The increasing availability of digitized biodiversity data worldwide, provided by an increasing numb...
Plazi is a Swiss non-governmental organization dedicated to the liberation of data imprisoned in fla...
The Swiss NGO Plazi (http://plazi.org) has developed an automated workflow for liberating data, incl...
One of the main challenges in biodiversity data reusability is finding ways to transform what is pro...
The growing corpus of hundreds of millions of pages of taxonomic literature reporting research resul...
Biodiversity data plays a pivotal role in understanding and conserving our natural world. As the lar...
The growing corpus of hundreds of millions of pages of taxonomic literature reporting research resul...
The quality of biodiversity data publicly accessible via aggregators such as GBIF (Global Biodiversi...
The process of choosing data for a project and then determining what subset of records are suitable ...
As part of the CETAF COVID19 task force, Plazi liberated taxonomic treatments, figures, observation ...
As part of the CETAF COVID19 task force, Plazi liberated taxonomic treatments, figures, observation ...
As part of the CETAF COVID19 task force, Plazi liberated taxonomic treatments, figures, observation ...
Scholarly knowledge about biodiversity is published in a rapidly increasing corpus of scientific pub...
Plazi's TreatmentBank is a research infrastructure and partner of the recent European Union-funded B...
To understand the loss of species, a benchmark is needed, e.g. the status of biodiversity in 1992 wh...
The increasing availability of digitized biodiversity data worldwide, provided by an increasing numb...
Plazi is a Swiss non-governmental organization dedicated to the liberation of data imprisoned in fla...
The Swiss NGO Plazi (http://plazi.org) has developed an automated workflow for liberating data, incl...
One of the main challenges in biodiversity data reusability is finding ways to transform what is pro...
The growing corpus of hundreds of millions of pages of taxonomic literature reporting research resul...
Biodiversity data plays a pivotal role in understanding and conserving our natural world. As the lar...
The growing corpus of hundreds of millions of pages of taxonomic literature reporting research resul...
The quality of biodiversity data publicly accessible via aggregators such as GBIF (Global Biodiversi...
The process of choosing data for a project and then determining what subset of records are suitable ...
As part of the CETAF COVID19 task force, Plazi liberated taxonomic treatments, figures, observation ...
As part of the CETAF COVID19 task force, Plazi liberated taxonomic treatments, figures, observation ...
As part of the CETAF COVID19 task force, Plazi liberated taxonomic treatments, figures, observation ...
Scholarly knowledge about biodiversity is published in a rapidly increasing corpus of scientific pub...
Plazi's TreatmentBank is a research infrastructure and partner of the recent European Union-funded B...
To understand the loss of species, a benchmark is needed, e.g. the status of biodiversity in 1992 wh...
The increasing availability of digitized biodiversity data worldwide, provided by an increasing numb...
Plazi is a Swiss non-governmental organization dedicated to the liberation of data imprisoned in fla...
The Swiss NGO Plazi (http://plazi.org) has developed an automated workflow for liberating data, incl...