We present the DTL1000 dataset, created in the “Dig That Lick” project, which covers the history of recorded jazz with a sample of 1,750 improvisations extracted from 1,060 audio tracks. The dataset combines collected data (editorial metadata), manually annotated data (structure, style), and automatically generated data (main melody transcriptions of solos) describing the recordings. It was created to study patterns in jazz improvisation, but it lends itself to many other applications. The accompanying paper describes the dataset creation process, the data structure, and the contents with descriptive statistics, and discusses the origin and process of the annotations as well as general use cases ...