In conjunction with the 27th International Joint Conference on Artificial IntelligenceInternational audienceIn this paper, we describe experiments with techniques for locating foods and recognizing food states in cooking videos. We describe production of a new data set that provides annotated images for food types and food states. We compare results with two techniques for detecting food types and food states, and then show that recognizing type and state with separate classifiers improves recognition results. We then use this to provide detection of composite activation maps for food types. The results provide a promising first step towards construction of narratives for cooking actions
This paper introduces a publicly available dataset of complex activities that involve manipulative g...
In this paper we investigate features and their combinations for food image analysis and a classific...
Videos are one of the most frequently used forms of multimedia resources. People want to interact wi...
In conjunction with the 27th International Joint Conference on Artificial IntelligenceInternational ...
In this dissertation, we discuss our work on analyzing cooking content for the ultimate goal ofautom...
While activity recognition is a current focus of research the challenging problem of fine-grained ac...
This paper deals with automatic systems for image recipe recognition. For this purpose, we compare a...
We present a novel method for aligning a se-quence of instructions to a video of some-one carrying o...
Automatic image-based food recognition is a particularly challenging task. Traditional image analysi...
This thesis addresses the problem of recognition, modelling and description of human activities. We ...
In order to make the best use of multimedia contents effec-tively, the crucial point is the structur...
The deep learning has demonstrated its effectiveness and powerful in computer vision, which is used ...
This paper introduces a publicly available dataset of complex activities that involve manipulative g...
In this paper, we propose a novel hybrid transformer architecture for food cuisine detection and cla...
Abstract—In this paper, we address a novel task ”cooking recognition task”. Cooking recognition task...
This paper introduces a publicly available dataset of complex activities that involve manipulative g...
In this paper we investigate features and their combinations for food image analysis and a classific...
Videos are one of the most frequently used forms of multimedia resources. People want to interact wi...
In conjunction with the 27th International Joint Conference on Artificial IntelligenceInternational ...
In this dissertation, we discuss our work on analyzing cooking content for the ultimate goal ofautom...
While activity recognition is a current focus of research the challenging problem of fine-grained ac...
This paper deals with automatic systems for image recipe recognition. For this purpose, we compare a...
We present a novel method for aligning a se-quence of instructions to a video of some-one carrying o...
Automatic image-based food recognition is a particularly challenging task. Traditional image analysi...
This thesis addresses the problem of recognition, modelling and description of human activities. We ...
In order to make the best use of multimedia contents effec-tively, the crucial point is the structur...
The deep learning has demonstrated its effectiveness and powerful in computer vision, which is used ...
This paper introduces a publicly available dataset of complex activities that involve manipulative g...
In this paper, we propose a novel hybrid transformer architecture for food cuisine detection and cla...
Abstract—In this paper, we address a novel task ”cooking recognition task”. Cooking recognition task...
This paper introduces a publicly available dataset of complex activities that involve manipulative g...
In this paper we investigate features and their combinations for food image analysis and a classific...
Videos are one of the most frequently used forms of multimedia resources. People want to interact wi...