CDL Misinfo Datasets
A repository for misinformation datasets and benchmarks for detection made by Complex Data Lab
Misinformation is a challenging societal issue, and mitigating solutions are difficult to create due to data deficiencies. To address this problem, we have curated the largest collection of (mis)information datasets in the literature, totaling 75. From these, we evaluated the quality of all of the 36 datasets that consist of statements or claims. If you would like to contribute a novel dataset or report any issues, please email us, visit our Hugging Face, or GitHub.
