AbstractScientific community across many disciplines is exploring new ways to extract knowledge from all available sources. Historically, written manuscripts have been the media of choice for recording experimental findings. Many disciplines such as social science, medical science are exploring ways to automate knowledge discovery from a vast repository of published scientific work. This work attempts to accelerate the process of information extraction by extending Kepler, a graphical workflow management tool. Kepler provides a simple way of designing and executing complex workflows in the form of directed graphs. This work presents a scalable approach to convert published research as PDF documents into indexable XML documents using Kepler....
International audienceDuring the last decade, the availability of scientific papers in full text and...
International audienceThe Open Access movement in scientific publishing and search engines like Goog...
Purpose: To demonstrate how the information extracted from scientific text can be directly used in s...
AbstractScientific community across many disciplines is exploring new ways to extract knowledge from...
AbstractScientific workflow systems are designed to compose and execute either a series of computati...
AbstractWe report on progress of employing the Kepler workflow engine to prototype “end-to-end” appl...
In order to perform complex scientific data analysis, multiple software and skillsets are generally ...
Over the past five years, our activities have both established Kepler as a viable scientific workflo...
The continuous growth of scientific literature brings innovations and, at the same time, raises new ...
Next-generation DNA sequencing machines are generating a very large amount of sequence data with app...
AbstractData curation is critical for scientific data digitization, sharing, integration, and use. T...
Scientific research products are the result of long-term collaborations between teams. Scientific wo...
AbstractThe growing scale and complexity of cataloguing and analyzing of astronomical data forces sc...
© The Author(s), 2012. This article is distributed under the terms of the Creative Commons Attribut...
Motivation: The availability of improved natural language processing (NLP) algorithms and models ena...
International audienceDuring the last decade, the availability of scientific papers in full text and...
International audienceThe Open Access movement in scientific publishing and search engines like Goog...
Purpose: To demonstrate how the information extracted from scientific text can be directly used in s...
AbstractScientific community across many disciplines is exploring new ways to extract knowledge from...
AbstractScientific workflow systems are designed to compose and execute either a series of computati...
AbstractWe report on progress of employing the Kepler workflow engine to prototype “end-to-end” appl...
In order to perform complex scientific data analysis, multiple software and skillsets are generally ...
Over the past five years, our activities have both established Kepler as a viable scientific workflo...
The continuous growth of scientific literature brings innovations and, at the same time, raises new ...
Next-generation DNA sequencing machines are generating a very large amount of sequence data with app...
AbstractData curation is critical for scientific data digitization, sharing, integration, and use. T...
Scientific research products are the result of long-term collaborations between teams. Scientific wo...
AbstractThe growing scale and complexity of cataloguing and analyzing of astronomical data forces sc...
© The Author(s), 2012. This article is distributed under the terms of the Creative Commons Attribut...
Motivation: The availability of improved natural language processing (NLP) algorithms and models ena...
International audienceDuring the last decade, the availability of scientific papers in full text and...
International audienceThe Open Access movement in scientific publishing and search engines like Goog...
Purpose: To demonstrate how the information extracted from scientific text can be directly used in s...