Stack Overflow is the most popular question-and-answer website for software developers, providing a large amount of code snippets and free-form text on a wide variety of topics. Like other software artifacts, questions and answers on Stack Overflow evolve over time, for example when bugs in code snippets are fixed, code is updated to work with a more recent library version, or text surrounding a code snippet is edited for clarity. To be able to analyze how content on Stack Overflow evolves, we built SOTorrent, an open dataset based on the official Stack Exchange data dump. SOTorrent provides access to the version history of Stack Overflow content at the level of whole posts and individual text or code blocks. This dataset has been retrieve...