International audienceSaliency models aim to predict where we look at within a visual scene. They are based on low-level visual features including color, intensity, and orientation. They process these visual features to generate a 2D saliency map, indicating the most salient part of the image. In contrast with saliency models, saccadic models intend to predict the sequence of eye fixations, i.e. the way observers deploy their gaze while viewing a stimulus on screen. Rather than computing a unique saliency map, saccadic models compute plausible visual scanpaths, i.e. the actual sequence of fixations and saccades unfolding across time. By the term plausible, we mean that the predicted scanpaths should be as similar as possible to human scanpa...