We present a system which applies text mining using computational linguistic techniques to automatically extract, categorize, disambiguate and filter metadata for image access. Candidate subject terms are identified through standard approaches; novel semantic categorization using machine learning and disambiguation using both WordNet and a domain-specific thesaurus are applied. The resulting metadata can be manually edited by image catalogers or filtered by semi-automatic rules. We describe the implementation of this workbench created for, and evaluated by, image catalogers. We discuss the system's current functionality, developed under the Computational Linguistics for Metadata Building (CLiMB) research project. The CLiMB Toolkit has be...
The growing predominance of social semantics in the form of tagging presents the metadata community ...
This work combines semantic reasoning and machine learning to create tools tha...
INTRODUCTION: With the increase in the availability of digital text collections for humanities rese...
Digital image collections in libraries and other curatorial institutions grow too rapidly to create ...
We describe a series of studies aimed at identifying specifications for a text extraction module of ...
This paper reports on the linguistic analysis of a tag set of nearly 50,000 tags collected as part ...
Article focusing on metadata creation for photographs in language archives. It was presented at the ...
In this paper, we describe an interactive system, built within the context of CLiMB project, which p...
Taking a more quantitative approach in linguistic landscape research, we explore recent techniques o...
Marriott Library received funding to explore the feasibility of using image analysis to generate des...
Image catalogs containing several million reproductions of artworks still pose a costly or computati...
© 2016 IEEE. The goal of this work is to automatically collect a large number of highly relevant ima...
A major drawback in today's information landscape is the distribution of resources across numerous r...