Thanks for sharing interesting work. I'm currently trying to download the pre-train datasets, following the [instruction](https://github.com/Wangt-CN/DisCo#data-preparation) I faced an error in the middle of the downloading process.  Can you check the integrity of the dataset? Thank you.
This issue appears to be discussing a feature request or bug report related to the repository. Based on the content, it seems to be resolved. The issue was opened by hygenie1228 and has received 1 comments.