We used this dataset to evaluate different string similarity metrics for SOTorrent (http://sotorrent.org/). The dataset has been created with this tool: https://github.com/sotorrent/so-posthistory-gt The dataset has been used in this project: https://github.com/sotorrent/metrics-compariso