Big Data & Society 6(1), https://doi.org/10.1177/2053951719843310. First published May 23, 2019.
Fake news is a problem. It is a big data problem. We are trying to solve it with small amounts of data. Those are, in a nutshell, the three main points of our paper. We review available datasets and introduce the MisInfoText repository as a contribution of our lab to the community. We make available the full text of the news articles, together with veracity labels previously assigned based on manual assessment of the articles’ truth content by fact-checkers. We also perform a topic modeling experiment to elaborate on the gaps and sources of imbalance in currently available datasets to guide future efforts. We appeal to the community to collect more data and to make it available for research purposes.
This video was taken during Innovations in Research, an event at Simon Fraser University in Vancouver, as part of the Community Summit “Confronting the Disinformation Age”.
Credit: Simon Fraser University.