Kate Crawford

Research Professor of Communication and STS, Senior Principal Researcher
USC Annenberg, Microsoft Research

Excavating “Ground Truth” in AI: Epistemologies and Politics in Training Data

Kate Crawford
Kate Crawford

Lecture I: Excavating “Ground Truth” in AI
Thursday, September 23, 2021
4:10 p.m. – 6:15 p.m., Toll Room, Alumni House

Lecture II: Seminar Discussion
Friday, September 24, 2021
4:10 p.m. – 6:15 p.m., Toll Room, Alumni House
with commentary by Marion Fourcade, Angjoo Kanazawa and Trevor Paglen

The lectures and the seminar are currently planned to take place in person. Advance registration will be required.


About Kate Crawford

Professor Kate Crawford is a leading international scholar of the social and political implications of artificial intelligence. Her work focuses on understanding large-scale data systems in the wider contexts of history, politics, labor, and the environment. She is a Research Professor of Communication and STS at USC Annenberg, a Senior Principal Researcher at Microsoft Research New York, and an Honorary Professor at the University of Sydney. She is the inaugural Visiting Chair for AI and Justice at the École Normale Supérieure in Paris, where she co-leads the international working group on the Foundations of Machine Learning. Over her twenty year research career, she has also produced groundbreaking creative collaborations and visual investigations.

Her project Anatomy of an AI System with Vladan Joler won the Beazley Design of the Year Award, and is in the permanent collection of the Museum of Modern Art in New York and the V&A in London. Her collaboration with the artist Trevor Paglen produced Training Humans – the first major exhibition of the images used to train AI systems. Their investigative project, Excavating AI, won the Ayrton Prize from the British Society for the History of Science. Crawford’s latest book, Atlas of AI: Power, Politics, and the Planetary Costs of Artificial Intelligence (Yale University Press) has been described as “a fascinating history of data” by the New Yorker, a “timely and urgent contribution” by Science. and named one of the best books on technology in 2021 by the Financial Times.


About the Lectures

The last decade has seen a dramatic capture of digital material for machine learning production. This data is the basis for sense-making in AI, not as classical representations of the world with individual meaning, but as mass collections: ground truth for machine abstractions and operations. OpenAI’s GPT-3 language model is trained on a corpus of 1 billion words, ImageNet contains over 14 million images, and Tencent’s ML Images contains more than 17.5 million annotated images – predominantly scraped from the internet. Training datasets shape the epistemic boundaries governing how machine learning operates, and thus are an essential part of understanding socially significant questions about AI. But when we closely investigate the benchmark training sets widely used in NLP and computer vision systems, we find complex social, political, and epistemological challenges. What happens when data is seen as an aggregate, stripped of context, meaning, and specificity? In what ways does training data limit what and how machine learning systems interpret the world? And most importantly, what forms of power do these approaches enhance and enable? In this lecture, Kate Crawford will share new work that reflects on what’s at stake in the architecture and contents of training sets, and how they are increasingly part of our urban, legal, logistical, and commercial infrastructures.


About the Commentators

Marion Fourcade

Professor, Sociology, University of California, Berkeley

Marion Fourcade is Professor of Sociology and Director of Social Science Matrix at UC Berkeley. She received her PhD from Harvard University in 2000 and is an alumna of the Ecole Normale Supérieure in Paris, France. A comparative sociologist by training and taste, she is the author of Economists and Societies: Discipline and Profession in the United States, Britain and France, 1890s to 1990s (Princeton University Press, 2009). Other published works explore national variations in neoliberal transitions, political mores, and valuation cultures. More recently, her research focus has shifted to the rise, consolidation and social consequences of new classificatory regimes powered by digital data and algorithms. A book on this topic, The Ordinal Society (with Kieran Healy), is under contract with Harvard University Press. Fourcade is a recipient of the American Sociological Association’s Distinguished Book Award and the Society for the Social Studies of Science’s Ludwik Fleck prize for outstanding book in the area of science and technology studies, as well as of the Lewis Coser award for theoretical agenda setting


Angjoo Kanazawa

Assistant Professor, Electrical Engineering and Computer Sciences, University of California, Berkeley

Angjoo Kanazawa is an Assistant Professor in the Department of Electrical Engineering and Computer Science at the University of California, Berkeley. Previously, she was a research scientist at Google NYC, and prior to that she was a BAIR postdoc at UC Berkeley advised by Jitendra Malik, Alyosha Efros, and Trevor Darrell. She completed her PhD in Computer Science at the University of Maryland, College Park with her advisor David Jacobs. She also spent time at the Max Planck Institute for Intelligent Systems with Michael Black.

Her research is at the intersection of Computer Vision, Computer Graphics, and Machine Learning, focusing on visual perception of the dynamic 3D world behind everyday photographs and video, such as 3D perception of people and animals along with their environment. She has been named a Rising Star in EECS and is a recipient of the Google Anita Borg Memorial Scholarship, Eurographics Best Paper Award 2016 and the Google Research Scholar Program 2021. She also serves on the advisory board of Wonder Dynamics, whose goal is to utilize AI technologies to make VFX effects more accessible for indie filmmakers.


Trevor Paglen

Artist and Geographer

Trevor Paglen is an artist and geographer living and working in New York City. In addition to several books and numerous one-person exhibitions, Paglen has launched an artwork into distant orbit around Earth, contributed research and cinematography to the Academy Award-winning film Citizenfour, and created a radioactive public sculpture for the exclusion zone in Fukushima, Japan. Paglen holds a B.A. from U.C. Berkeley, an MFA from the Art Institute of Chicago, and a Ph.D. in Geography from U.C. Berkeley. In 2014, he received the Electronic Frontier Foundation Pioneer Award and in 2016, he won the Deutsche Börse Photography Prize. Paglen was named a MacArthur Fellow in 2017.