Big Data & Society: Machine Learning in Tutorials - Universal Applicability, Underinformed Application, and Other Misconceptions

Monday, 7 June 2021

Hendrik Heuer introduces a new paper, "Machine learning in tutorials – Universal applicability, underinformed application, and other misconceptions", out in Big Data & Society, doi:10.1177/20539517211017593. First published May 21, 2021.

Abstract.

Machine learning has become a key component of contemporary information systems. Unlike prior information systems explicitly programmed in formal languages, ML systems infer rules from data. This paper shows what this difference means for the critical analysis of socio-technical systems based on machine learning. To provide a foundation for future critical analysis of machine learning-based systems, we engage with how the term is framed and constructed in self-education resources. For this, we analyze machine learning tutorials, an important information source for self-learners and a key tool for the formation of the practices of the machine learning community. Our analysis identifies canonical examples of machine learning as well as important misconceptions and problematic framings. Our results show that machine learning is presented as being universally applicable and that the application of machine learning without special expertise is actively encouraged. Explanations of machine learning algorithms are missing or strongly limited. Meanwhile, the importance of data is vastly understated. This has implications for the manifestation of (new) social inequalities through machine learning-based systems.

Keywords: Machine learning, artificial intelligence, algorithms, data science, critical data studies, tutorials