Motivation: Document similarity metrics such as PubMed’s “Find related articles ” feature, which have been primarily used to identify studies with similar topics, can now also be used to detect duplicated or potentially plagiarized papers within literature reference databases. However, the CPU-intensive nature of document comparison has limited MEDLINE text similarity studies to the comparison of abstracts, which constitute only a small fraction of a publication’s total text. Extending searches to include text archived by online search engines would drastically increase comparison ability. For large-scale studies, submitting short phrases encased in direct quotes to search engines for exact matches would be optimal for both individual queri...
Part 1: ConferenceInternational audienceNear duplicate documents and their detection are studied to ...
Our study identifies sentences in Wikipedia articles that are either identical or highly similar by ...
Document recommendation systems for locating relevant literature have mostly relied on methods devel...
Motivation: Duplicate publication impacts the quality of the scientific corpus, has been difficult t...
Computational methods have been used to find duplicate biomedical publications in MEDLINE. Full text...
Computational methods have been used to find duplicate biomedical publications in MEDLINE. Full text...
Objective We aim to identify duplicate pairs of Medline citations, particularly when the documents a...
Computational methods have been used to find duplicate biomedical publications in MEDLINE. Full text...
Background: Finding duplicates is an important phase of systematic review. However, no consensus reg...
Motivation: Duplicate publication impacts the quality of the scien-tific corpus, has been difficult ...
Finding duplicates is an important phase of systematic review. However, no consensus regarding the m...
The ever-growing amounts of textual information coming from different sources have fostered the deve...
The ever-growing amounts of textual information coming from different sources have fostered the deve...
<div><p>Background</p><p>Finding duplicates is an important phase of systematic review. However, no ...
Objective: To automatically detect duplicate citations in a bibliographical database. Background: Ci...
Part 1: ConferenceInternational audienceNear duplicate documents and their detection are studied to ...
Our study identifies sentences in Wikipedia articles that are either identical or highly similar by ...
Document recommendation systems for locating relevant literature have mostly relied on methods devel...
Motivation: Duplicate publication impacts the quality of the scientific corpus, has been difficult t...
Computational methods have been used to find duplicate biomedical publications in MEDLINE. Full text...
Computational methods have been used to find duplicate biomedical publications in MEDLINE. Full text...
Objective We aim to identify duplicate pairs of Medline citations, particularly when the documents a...
Computational methods have been used to find duplicate biomedical publications in MEDLINE. Full text...
Background: Finding duplicates is an important phase of systematic review. However, no consensus reg...
Motivation: Duplicate publication impacts the quality of the scien-tific corpus, has been difficult ...
Finding duplicates is an important phase of systematic review. However, no consensus regarding the m...
The ever-growing amounts of textual information coming from different sources have fostered the deve...
The ever-growing amounts of textual information coming from different sources have fostered the deve...
<div><p>Background</p><p>Finding duplicates is an important phase of systematic review. However, no ...
Objective: To automatically detect duplicate citations in a bibliographical database. Background: Ci...
Part 1: ConferenceInternational audienceNear duplicate documents and their detection are studied to ...
Our study identifies sentences in Wikipedia articles that are either identical or highly similar by ...
Document recommendation systems for locating relevant literature have mostly relied on methods devel...