Visual question answering has recently become established as a fundamental multi-modal reasoning task in artificial intelligence that allows users to get information about visual content by asking questions in natural language. In the cultural heritage domain, this task can assist visitors in museums and cultural sites, thus increasing engagement. However, the development of visual question answering models for cultural heritage is hindered by the lack of suitable large-scale datasets. To meet this demand, we built a large-scale, heterogeneous, and multilingual (Italian and English) dataset for cultural heritage that comprises approximately 500K Italian cultural assets and 6.5M question-answer pairs. We propose a novel formulation of ...
The advance of information technology has enabled in recent years new fruition scenarios for cultur...
This paper deals with the survey and communications of Cultural Heritage through the development of ...
Cultural Heritage (CH) assets may be defined as integrated spatial systems composed of interconnecte...
Sheng S., Van Gool L., Moens M.-F., ''A dataset for multimodal question answering in the cultural he...
International Conference Florence Heri-Tech is a conference about the technology applied to cultural...
With the increasing use of mobile devices, taking pictures becomes an easy and natural way for peopl...
Multimodal machine learning involving textual and visual data is a fundamental research topic in the...
The recent advancement in Artificial Intelligence (AI) has paved the way for the wide adoption of ne...
Cultural heritage makes reference to an extremely diverse set of sources. More specifically, histori...
Large datasets that were made publicly available to the research community over the last 20 years ha...
The recent breakthroughs in the field of deep learning have led to state-of-the-art results in seve...
Digitizing large collections of Cultural Heritage (CH) resources and providing tools for their manag...
New technologies, tools, and methodologies have been used in the Cultural Heritage (CH) scenarios to...
Recent technological developments are changing how people experience physical and virtual environmen...