International audienceVideo hyperlinking represents a classical example of multimodal problems. Common approaches to such problems are early fusion of the initial modalities and crossmodal translation from one modality to the other. Recently, deep neural networks, especially deep autoencoders, have proven promising both for crossmodal translation and for early fusion via multimodal embedding. A particular architecture, bidirectional symmetrical deep neural networks, have been proven to yield improved multimodal embeddings over classical autoencoders, while also being able to perform crossmodal translation. In this work, we focus firstly at evaluating good single-modal continuous representations both for textual and for visual information. W...
In this dissertation, the thesis that deep neural networks are suited for analysis of visual, textua...
International audienceContinuous multimodal representations suitable for multimodal information retr...
International audienceContinuous multimodal representations suitable for multimodal information retr...
International audienceVideo hyperlinking represents a classical example of multimodal problems. Comm...
International audienceVideo hyperlinking represents a classical example of multimodal problems. Comm...
International audienceVideo hyperlinking represents a classical example of multimodal problems. Comm...
International audienceCommon approaches to problems involving multiple modalities (classification, r...
International audienceCommon approaches to problems involving multiple modalities (classification, r...
International audienceCommon approaches to problems involving multiple modalities (classification, r...
International audienceCommon approaches to problems involving multiple modalities (classification, r...
International audienceWith the recent resurgence of neural networks and the proliferation of massive...
International audienceWith the recent resurgence of neural networks and the proliferation of massive...
International audienceWith the recent resurgence of neural networks and the proliferation of massive...
International audienceWith the recent resurgence of neural networks and the proliferation of massive...
International audienceWith the recent resurgence of neural networks and the proliferation of massive...
In this dissertation, the thesis that deep neural networks are suited for analysis of visual, textua...
International audienceContinuous multimodal representations suitable for multimodal information retr...
International audienceContinuous multimodal representations suitable for multimodal information retr...
International audienceVideo hyperlinking represents a classical example of multimodal problems. Comm...
International audienceVideo hyperlinking represents a classical example of multimodal problems. Comm...
International audienceVideo hyperlinking represents a classical example of multimodal problems. Comm...
International audienceCommon approaches to problems involving multiple modalities (classification, r...
International audienceCommon approaches to problems involving multiple modalities (classification, r...
International audienceCommon approaches to problems involving multiple modalities (classification, r...
International audienceCommon approaches to problems involving multiple modalities (classification, r...
International audienceWith the recent resurgence of neural networks and the proliferation of massive...
International audienceWith the recent resurgence of neural networks and the proliferation of massive...
International audienceWith the recent resurgence of neural networks and the proliferation of massive...
International audienceWith the recent resurgence of neural networks and the proliferation of massive...
International audienceWith the recent resurgence of neural networks and the proliferation of massive...
In this dissertation, the thesis that deep neural networks are suited for analysis of visual, textua...
International audienceContinuous multimodal representations suitable for multimodal information retr...
International audienceContinuous multimodal representations suitable for multimodal information retr...