The dataset contains the IDs of all tweets analysed in the paper "Global misinformation spillovers in the online vaccination debate before and during COVID-19", divided by language. These are all the tweets containing vaccine-related keywords in 18 different European languages, covering the period from October 2019 to March 2021, excluding January to June 2020. Note that a large fraction of the tweets can no longer be retrieved, because accounts have been suspended and users have removed posts, so the study is only partially reproducible. The unzipped dataset is 8.0 GB in size. The authors of the paper are: Lenti J, Mejova Y, Kalimeri K, Panisson A, Paolotti D, Tizzani M, Starnini M.