November 2021

Feature

A Relational Theory of Data Governance

30 November 2021

Privacy • Contracts

abstract. This Feature advances a theoretical account of data as social relations, constituted by both legal and technical systems. It shows how data relations result in supraindividual legal interests. Properly representing and adjudicating among those interests necessitates far more public and collective (i.e., democratic) forms of governing data production. Individualist data-subject rights cannot represent, let alone address, these population-level effects.

This account offers two insights for data-governance law. First, it better reflects how and why data collection and use produce economic value as well as social harm in the digital economy. This brings the law governing data flows into line with the economic realities of how data production operates as a key input to the information economy. Second, this account offers an alternative normative argument for what makes datafication—the transformation of information about people into a commodity—wrongful. What makes datafication wrong is not (only) that it erodes the capacity for subject self-formation, but instead that it materializes unjust social relations: data relations that enact or amplify social inequality. This account indexes many of the most pressing forms of social informational harm that animate criticism of data extraction but fall outside typical accounts of informational harm. This account also offers a positive theory for socially beneficial data production. Addressing the inegalitarian harms of datafication—and developing socially beneficial alternatives—will require democratizing data social relations: moving from individual data-subject rights to more democratic institutions of data governance.

author. Academic Fellow, Columbia Law School. Many thanks to the members of the 2020 Privacy Law Scholars Workshop, the Information Law Institute Fellows Workshop at NYU Law, and the Digital Life Initiative Fellows Group at Cornell Tech for their careful and generous comments. Additional thanks to Ashraf Ahmed, José Argueta Funes, Chinmayi Arun, Yochai Benkler, Elettra Bietti, Julie Cohen, Angelina Fisher, Jake Goldenfein, Ben Green, Lily Hu, Woodrow Hartzog, Aziz Huq, Amy Kapczynski, Duncan Kennedy, Issa Kohler-Hausmann, Michael Madison, Lee McGuigan, Lev Menand, Christopher Morten, Helen Nissenbaum, Amanda Parsons, Angie Raymond, Neil Richards, Thomas Schmidt, Katherine Strandburg, Thomas Streinz, Mark Verstraete, Ari Ezra Waldman, and Richard Wagner. An early version of this work was presented in 2018 at Indiana University’s Ostrom Workshop.

Introduction

In recent years, the technology industry has been the focus of increased public distrust, civil and worker activism, and regulatory scrutiny.1 Concerns over datafication—the transformation of information about people into a commodity—play a central role in this widespread front of curdled goodwill, popularly referred to as the “techlash.”2

As technology firms mediate more of our daily lives and grow more economically dominant, the centrality they place on data collection raises the stakes of data-governance law—the legal regime that governs how data about people is collected, processed, and used. As data becomes an essential component of informational capital, the law regulating data production becomes central to debates regarding how—and why—to regulate informational capitalism. There is broad consensus that current data-governance law has failed to protect technology users from the harms of data extraction, in part because it cannot account for this large and growing gap between data’s de jure status as the subject of consumer rights and its de facto status as quasi capital.3

Data-governance reform is the subject of much debate and lively theorizing, with many proposals emerging to address the status quo’s inadequacy.4 This Feature evaluates the legal conceptualizations behind these proposals—in other words, how proposed reforms conceive of what makes datafication worth regulating and whose interests in information ought to gain legal recognition. How datafication is conceptualized shapes and constrains how the law responds to datafication’s effects. If data-governance law is inattentive to how data production creates social benefits and harms, it will be poorly equipped to mitigate those harms and foster data production’s benefits.

This Feature’s core argument is that the data-collection practices of the most powerful technology companies are aimed primarily at deriving (and producing) population-level insights regarding how data subjects relate to others, not individual insights specific to the data subject. These insights can then be applied to all individuals (not just the data subject) who share these population features.

This population-level economic motivation matters conceptually for the legal regimes that regulate the activity of data collection and use; it requires revisiting long-held notions of why individuals have a legal interest in information about them and where such interests obtain.

The status quo of data-governance law, as well as prominent proposals for its reform, approach these population-level relational effects as incidental or a byproduct of eroded individual data rights, to the extent that they recognize these effects at all. As a result, both the status quo and reform proposals suffer from a common conceptual flaw: they attempt to reduce legal interests in information to individualist claims subject to individualist remedies, which are structurally incapable of representing the interests and effects of data production’s population-level aims. This in turn allows significant forms of social informational harm to go unrepresented and unaddressed in how the law governs data collection, processing, and use.

Properly representing the population-level interests that result from data production in the digital economy will require far more collective modes of ordering this productive activity.5 The relevant task of data governance is not to reassert individual control over the terms of one’s own datafication (even if this were possible) or to maximize personal gain, as leading legal approaches to data governance seek to do. Instead, the task is to develop the institutional responses necessary to represent (and adjudicate among) the relevant population-level interests at stake in data production. In other words, responding adequately to the economic imperatives and social effects of data production will require moving past proposals for individualist data-subject rights and toward theorizing the collective institutional forms required for responsible data governance.

This Feature builds on prior digital-privacy and data-governance scholarship that points out the importance of social causes and social effects of privacy erosion.6 It takes up these insights to offer an account of why the social effects of privacy erosion should be considered of greater relevance—indeed, central relevance—for data-governance law. By placing data relations and their population-level effects at the center of discussions regarding why data about people is (and ought to be) legally regulated, this Feature offers two contributions to the literature on data-governance law.

First, it aligns the legal debates regarding how to govern data production with the economic transformation of data into a key input of the information economy. This in turn illuminates the growing role (and heightened stakes) of data-governance law as a primary legal regime regulating informational capitalism.

The descriptive contribution of this Feature details how data production in the digital economy is fundamentally relational: a basic purpose of data production as a commercial enterprise is to relate people to one another based on relevant shared population features. This produces both considerable social value and many of the pressing forms of social risk that plague the digital economy. As this Feature explores further below, data’s relationality results in widespread population-level interests in data collection and use that are irreducible to individual legal interests within a given data exchange. Contending with the economic realities of data production thus expands the task of data-governance law: from disciplining against forms of interpersonal violation to also structuring the rules of economic production (and social reproduction) in the information economy.

Second, this Feature departs from prior work to offer an alternative normative account for what makes datafication wrongful. Privacy and data-governance law have traditionally governed forms of private interpersonal exchange in order to secure the benefits of data-subject dignity or autonomy. Yet as data collection and use become key productive activities (i.e., economic activities that define the contemporary economy as an information economy), new kinds of information-based harm arise. There is growing evidence of the role that digital technology plays in facilitating social and economic inequality.7 Digital-surveillance technologies used to enhance user experience for the rich simultaneously provide methods of discipline and punishment for the poor. Algorithmic systems may reproduce or amplify sex and race discrimination.8 Even seemingly innocuous data collection may be used in service of domination and oppression.9 The pursuit of user attention and uninterrupted access to data flows amplifies forms of identitarian polarization, aggression, and even violence.10 Such evidence suggests that social processes of datafication not only produce violations of personal dignity or autonomy, but also enact or amplify social inequality.

Prior accounts rightly identify the deep entanglement between the challenges of protecting autonomy in the digital economy and the realities of how data production operates as a social process: without securing better social conditions for data production for everyone, the personal benefits of robust privacy protection cannot be realized.11 On this view, the supraindividual nature of digital-privacy erosion matters because it raises additional complications for securing the benefits of robust digital-privacy protection for individuals.

This Feature departs from such accounts in that it places the inegalitarian effects of data extraction on equal footing with its autonomy-eroding effects. Privacy erosion’s social effects do implicate the personal (and social) value of individual autonomy. But the inequality that results from data production should be considered relevant to the task of data governance for its own sake, and not only for the effects inequality has on data subjects’ individual capacities for self-formation and self-enactment. This Feature thus argues that, alongside traditional concerns over individual autonomy, the social inequalities that result from data production are also forms of informational harm.

Both current and proposed data-governance law fail to adequately grasp the socioeconomic and normative centrality of data relations. This poses two problems. The first problem is conceptual: a central economic imperative that drives data production goes unrepresented in both existing and proposed laws governing datafication. As a practical matter, this leaves the law out of step with many of the ways that information creates social value and allows material forms of social informational harm to persist unaddressed. This presents U.S. data-governance law with a sociality problem: how can data-governance law account for data production’s social effects?

The second problem is a matter of institutional design. Individualist theories of informational interests result in legal proposals that advance a range of new rights and duties with respect to information but practically fall back on individuals to adjudicate between legitimate and illegitimate information production. This not only leaves certain social informational harms unrepresented (let alone addressed), but also risks foreclosing socially beneficial information production. This presents U.S. data-governance law with a legitimacy problem: how can the legal regimes governing data production distinguish legitimate from illegitimate data use without relying on individual notice and choice?

The sociality problem demonstrates the need in data-governance law for an expanded account of the interests at stake in information production, while the legitimacy problem points to the need for data-governance law to expand its remit by considering whose interests are relevant for deciding whether a particular instance of data production is legitimate, and on what grounds.

This Feature offers a response to these conceptual and institutional design problems. Conceptually, it offers an account of the sociality problem that recognizes the ubiquity and the relevance of the population-level interests that result from data production. From such recognition follows this Feature’s response to the legitimacy problem, which argues for governing many types of data as a collective resource that necessitates far more democratic, as opposed to personal, forms of institutional governance.

This in turn leads to a different line of inquiry regarding the legal challenges facing data-governance law. Current debates center on how to secure greater data-subject control, more robust protections for data-subject dignity, or better legal expressions of data-subject autonomy. An account of data social relations focuses future inquiry on how to balance the overlapping and at times competing interests that comprise the population-level effects of data production. This line of inquiry raises core questions of democratic governance: how to grant people a say in the social processes of their mutual formation; how to balance fair recognition with special concern for certain minority interests; what level of civic life achieves the appropriate level of pooled interest; and how to recognize that data production produces winners and losers and, in turn, develop fair institutional responses to these effects.

This Feature proceeds in four Parts. Part I describes the stakes and the status quo of data governance. It begins by documenting the significance of data processing for the digital economy. It then evaluates how the predominant legal regimes that govern data collection and use—contract and privacy law—code data as an individual medium. This conceptualization is referred to throughout the Feature as “data as individual medium” (DIM). DIM regimes apprehend data’s capacity to cause individual harm as the legally relevant feature of datafication; from this theory of harm follows the tendency of DIM regimes to subject data to private individual ordering.

Part II presents the Feature’s core argument regarding the incentives and implications of data social relations within the data political economy. Data’s capacity to transmit social and relational meaning renders data production especially capable of benefitting and harming others beyond the data subject from whom the data is collected. It also results in population-level interests in data production that are not reducible to the individual interests that generally feature in data governance. Thus, data’s relationality presents a conceptual challenge for data governance reform.

Part III evaluates two prominent sets of legal reform proposals that have emerged in response to concerns over datafication. Data has been extensively analogized, and proposals for reform locate data at different points on the continuum from “object-like” to “person-like.”12 On one end of this spectrum, propertarian proposals respond to growing wealth inequality in the data economy by formalizing individual propertarian rights over data. These reforms call for formalizing an alienable right to data as labor or property, to be bought and sold in a market for goods or labor. On the other end, dignitarian reforms conceive of data as an extension of data-subject selfhood. Dignitarian reforms respond to how excessive data extraction can erode individual autonomy by strengthening the fundamental rights data subjects enjoy over their data as an extension of their personal selfhood. While propertarian and dignitarian proposals differ on the theories of injustice underlying datafication and accordingly provide different solutions, both resolve to individualist claims and remedies that do not represent, let alone address, the relational nature of data collection and use.

Finally, Part IV proposes an alternative approach: data as a democratic medium (DDM). This alternative conceptual approach recognizes data’s capacity to cause social harm as a fundamentally relevant feature of datafication. This leads to a commitment to collective institutional forms of ordering. Conceiving of data as a collective resource subject to democratic ordering accounts for the importance of population-based relationality in the digital economy. This recognizes a greater number of relevant interests in data production. DDM responds not only to salient forms of injustice identified by other data-governance reforms, but also to significant forms of injustice missed by individualist accounts. In doing so, DDM also provides a theory of data governance from which to defend forms of socially beneficial data production that individualist accounts may foreclose. Part IV concludes by outlining some examples of what regimes that conceive of data as democratic could look like in practice.

Before continuing, three definitional and stylistic notes regarding this Feature’s use of key terms are in order:

Data. For the sake of brevity, “data” refers to data about people unless otherwise noted. Data about people is the data collected as people “invest, work, operate businesses, socialize,” and otherwise go about their lives.13 This data is of greatest interest to competing digital-technology companies and to observers of the business models built from data collection. It is also deliberately more expansive than U.S. definitions of “personal data” or the closely related term “personally identifiable information.”14 Furthermore, this Feature will refer to “data” as a singular, not a plural noun. This stylistic choice is in line with the common rather than the strictly correct usage.

Data subject and data collector. This Feature will use the term “data subject” to refer to the individual from whom data is being collected—often also referred to in technology communities as the “user.” “Data processor” is used synonymously with “data collector” to refer to the entity or set of entities that collect, analyze, process, and use data. The definitions of “data subject” and “data processor” are loosely derived from the European Union’s General Data Protection Regulation (GDPR).15 While the GDPR’s definition of personal data offers some capacity for nonindividualistic interpretation, any reference to “data subject” in this Feature will refer to the individual from whom or about whom data is being collected.

Informational Harm. Individual informational harm refers to harm that a data subject may incur from how information about them is collected, processed, or used. In contrast, social informational harm refers to harms that third-party individuals may incur when information about a data subject is collected, processed, or used.

Facebook’s Cambridge Analytica scandal marked a turning point in the press coverage and popular sentiment toward technology companies. For more on Cambridge Analytica, see, for example, Mark Zuckerberg Testimony: Senators Question Facebook’s Commitment to Privacy, N.Y. Times (Apr. 10, 2018), https:‌//‌www‌.nytimes‌.com‌/2018‌/04‌/10‌/us‌/politics‌/mark‌-zuckerberg‌-‌testimony‌.html [https:‌//‌perma‌.cc‌/6MKF‌-UEER]; and Zeynep Tufekci, Facebook’s Surveillance Machine, N.Y. Times (Mar. 19, 2018), https:‌//‌www‌.nytimes‌.com‌/2018‌/03‌/19‌/opinion‌/facebook‌-cambridge‌-analytica‌.html [https:‌//‌perma‌.cc‌/FC9A‌-EJWY]. From 2015 to 2019, the number of Americans who held a positive view of technology fell by twenty-one percentage points. See Carroll Doherty & Jocelyn Kiley, Americans Have Become Much Less Positive About Tech Companies’ Impact on the U.S., Pew Rsch. (July 29, 2019), https:‌//‌www‌.pewresearch‌.org‌/fact‌-tank‌/2019‌/07‌/29‌/americans‌-have‌-become‌-much‌-less‌-positive‌-about‌-tech-companies‌-impact‌-on‌-the‌-u‌-s [https:‌//‌perma‌.cc‌/JA9T‌-J78F]. Worker activism at tech companies has increased sharply since 2016, particularly in response to contracts between technology companies and the U.S. Department of Defense and U.S. Immigration and Customs Enforcement (ICE). See, e.g., #NoTechforICE, https:‌//‌notechforice‌.com [https:‌//‌perma‌.cc‌/TR89‌-N8U8]; Worker Power in the Tech Industry, Tech Workers Coal., https:‌//‌techworkerscoalition‌.org [https:‌//‌perma‌.cc‌/5CRC‌-7PAP]; Jimmy Wu, Optimize What?, Commune (Mar. 15, 2019), https:‌//‌communemag‌.com‌/optimize‌-what [https:‌//‌perma‌.cc‌/F5BJ‌-6HXR]; Drew Harwell, Google to Drop Pentagon AI Contract After Employee Objections to the ‘Business of War,’ Wash. Post (June 1, 2018), https:‌//‌www‌.washingtonpost‌.com‌/news‌/the‌-switch‌/wp‌/2018‌/06‌/01‌/google‌-to‌-drop‌-pentagon‌-ai‌-contract‌-after‌-employees‌-called‌-it‌-the‌-business-of‌-war [https://perma.cc/GZV5-FM3G].

The origin of the term “techlash” is commonly attributed to its use in The Economist in 2013. Adrian Wooldridge, The Coming Tech-Lash, Economist (Nov. 18, 2013), https:‌//‌www‌.economist‌.com‌/news‌/2013‌/11‌/18‌/the‌-coming‌-tech‌-lash [https:‌//‌perma‌.cc‌/8G7E‌-KDZ9]. In 2018, both the Oxford English Dictionary and the Financial Times deemed “techlash” to be a word of the year. See Word of the Year 2018: Shortlist, Oxford Languages, https:‌//‌languages‌.oup‌.com‌/word‌-of‌-the‌-year‌/2018‌-shortlist [https:‌//‌perma‌.cc‌/M49Z‌-9UER]; Rana Foroohar, Year in a Word: Techlash, Fin. Times (Dec. 16, 2018), https:‌//‌www‌.ft‌.com‌/content‌/76578fba‌-fca1-11e8‌-ac00‌-57a2a826423e [https:‌//‌perma‌.cc‌/XER8‌-FBDQ].

Julie E. Cohen, Between Truth and Power: The Legal Constructions of Informational Capitalism 44 (2019) (“One important byproduct of the access-for-data arrangement is a quiet revolution in the legal status of data and algorithms as (de facto if not de jure) proprietary information property.”); see also id. at 76 (observing that there is “a growing constellation of de jure and de facto legal immunities that predominantly bolsters private economic power, that magnifies the vulnerability of ordinary citizens to manipulation, exploitation, and political disempowerment, and that threatens profound collective harm”).

See infra Parts I and III for an extended discussion.

This Feature will refer variously to the “data political economy,” the “data economy,” and the “digital economy.” While there are distinctions between these concepts in their own right, here these all refer to sets of actors, products, business practices, and imperatives that depend on the ability to produce economic value (and political effects) through processes of data capture, transfer, and analysis. See Mark Andrejevic, Infoglut: How Too Much Information Is Changing the Way We Think and Know 1-18, 20-21 (2013); Matthew Crain, Financial Markets and Online Advertising: Reevaluating the Dotcom Investment Bubble, 17 Info., Commc’n & Soc’y 371, 374-81 (2014); Oscar H. Gandy, Jr., The Panoptic Sort: A Political Economy of Personal Information 1-13 (1993); Lee McGuigan & Vincent Manzerolle, “All the World’s a Shopping Cart:” Theorizing the Political Economy of Ubiquitous Media and Markets, 17 New Media & Soc’y 1830, 1831-39 (2015); Joseph Turow & Nick Couldry, Media as Data Extraction: Towards a New Map of a Transformed Communications Field, 68 J. Commc’n 415, 415 (2018) (arguing that the rising economic importance of data extraction and analysis for digital-media companies has ushered in a “major shift” in the object of study for media and communications scholars: from the traditional focus on media content itself to how the media industries’ “surveillance and population constructions” are “key infrastructural aspects of economic life”).

See, e.g., Helen Nissenbaum, Privacy in Context: Technology, Policy, and the Integrity of Social Life 1-4, 10-11 (2010); Priscilla M. Regan, Legislating Privacy: Technology, Social Values, and Public Policy 220-31 (1995); Julie E. Cohen, What Privacy Is For, 126 Harv. L. Rev. 1904, 1904-06 (2013). For a more complete discussion of prior accounts, see infra Part I.

See, e.g., Virginia Eubanks, Automating Inequality: How High-Tech Tools Profile, Police, and Punish the Poor (2018) (investigating the disparate impacts of sorting and monitoring technology systems on poor and working-class Americans); Ben Green, The Smart Enough City: Putting Technology in Its Place to Reclaim Our Urban Future 39-116 (2019) (describing how urban technology can result in exacerbating social and political inequality); Ben Green & Salomé Viljoen, Algorithmic Realism: Expanding the Boundaries of Algorithmic Thought, Proc. ACM Conf. Fairness, Accountability & Transparency 19, 20, 21-23 (2020) (observing how algorithmic formalism can entrench adverse social conditions, discrimination, and inequality); Frank Pasquale, Two Narratives of Platform Capitalism, 35 Yale L. & Pol’y Rev. 309, 310-17 (2016) (advancing counternarratives of platform capitalism that suggest platforms can entrench inequalities, increase discrimination, undermine economic growth, and limit user agency); Neil Irwin, To Understand Rising Inequality, Consider the Janitors at Two Top Companies, Then and Now, N.Y. Times (Sept. 3, 2017), https://www.nytimes.com/2017/09/03/upshot/to-understand-rising-inequality-consider-the-janitors-at-two-top-companies-then-and-now.html [https://perma.cc/64ZF-KTSC]; Miriam Pawel, You Call It the Gig Economy. California Calls It “Feudalism,” N.Y. Times (Sept. 12, 2019), https://www.nytimes.com/2019/09/12/opinion/california-gig-economy-bill-ab5.html [https://perma.cc/E4WR-RZH5]. Other arguments highlight how the negative effects of surveillance are apportioned along lines of privilege. See Frank Pasquale, Paradoxes of Privacy in an Era of Asymmetrical Social Control, in Big Data, Crime and Social Control 31, 31 (Aleš Završnik ed., 2018) (discussing asymmetries in surveillance and particular legal benefits afforded to the wealthy); Solon Barocas & Andrew D. Selbst, Big Data’s Disparate Impact, 104 Calif. L. Rev. 671, 677-92 (2016) (identifying mechanisms by which data mining can have discriminatory impacts on protected classes); Paul Blest, ICE Is Using Location Data from Games and Apps to Track and Arrest Immigrants, Report Says, Vice News (Feb. 7, 2020), https://www.vice.com/en/article/v7479m/ice-is-using-location-data-from-games-and-apps-to-track-and-arrest-immigrants-report-says [https://perma.cc/XB7V-3B7G].

See, e.g., Joy Buolamwini & Timnit Gebru, Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification, 81 Proc. Mach. Learning Rsch. 1, 1-2, 5-10 (2018); Safiya Umoja Noble, Google Search: Hyper-Visibility as a Means of Rendering Black Women and Girls Invisible, InVisible Culture (Oct. 13, 2013), https:‌//‌ivc‌.lib‌.rochester‌.edu‌/google‌-search‌-hyper‌-visibility‌-as‌-a‌-means‌-of‌-rendering‌-black‌-women‌-and‌-girls‌-invisible [https:‌//‌perma‌.cc‌/FWJ6‌-KXNL]; Ben Green, The False Promise of Risk Assessments: Epistemic Reform and the Limits of Fairness, Proc. ACM Conf. Fairness, Accountability & Transparency 594, 596-600 (2020).

See Blest, supra note 7; Joseph Cox, How the U.S. Military Buys Location Data from Ordinary Apps, Vice News (Nov. 16, 2020), https://www.vice.com/en/article/jgqm5x/us-military-location-data-xmode-locate-x [https://perma.cc/WNR6-A7PL] (detailing how the U.S. military buys location data from many sources, including a Muslim prayer app with over ninety-eight million downloads).

See, e.g., About Us, Media Manipulation Casebook, https://mediamanipulation.org/about-us [https://perma.cc/U7H7-88DQ]; Weaponizing the Digital Influence Machine: The Political Perils of Online Ad Tech, Data & Soc’y (Oct. 17, 2018), https:‌//‌datasociety‌.net‌/library‌/weaponizing‌-the‌-digital‌-influence‌-machine [https:‌//‌perma‌.cc‌/BT7F‌-Q59B]; Ronan Farrow, A Pennsylvania Mother’s Path to Insurrection, New Yorker (Feb. 1, 2021), https:‌//‌www‌.newyorker‌.com‌/news‌/news‌-desk‌/a‌-pennsylvania‌-mothers‌-path‌-to‌-insurrection‌-capitol‌-riot [https:‌//‌perma‌.cc‌/6MGF‌-FPCB]; Chinmayi Arun, On WhatsApp, Rumours, and Lynchings, 54 Econ. & Pol. Wkly. 30, 30-33 (Feb. 9, 2019).

For more on the extended discussion of the democratic values at issue in data production, see Amy Kapczynski, The Law of Informational Capitalism, 129 Yale L.J. 1276 (2020) (reviewing Shoshana Zuboff, Age of Surveillance Capitalism: The Fight for a Human Future at the New Frontier of Power (2019)); and Cohen, supra note 3. See Evgeny Morozov, Digital Socialism?, 116/117 New Left Rev. (2019); Ben Tarnoff & Moira Weigel, Why Silicon Valley Can’t Fix Itself, Guardian (May 3, 2018), https:‌//‌www‌.theguardian‌.com‌/news‌/2018‌/may‌/03‌/why‌-silicon‌-valley‌-cant‌-fix‌-itself‌-tech‌-humanism [https:‌//‌perma‌.cc‌/T6PD‌-QPRJ].

Data has been extensively analogized to both objects and aspects of personhood, spawning a robust literature on the purposes, limits, and effects of data metaphors. See Luke Stark & Anna Lauren Hoffmann, Data Is the New What? Popular Metaphors & Professional Ethics in Emerging Data Culture, 4 J. Cultural Analytics 1, 5-13 (2019); Rowan Wilken, An Exploratory Comparative Analysis of the Use of Metaphors in Writing on the Internet and Mobile Phones, 23 Soc. Semiotics 632, 635-41 (2013); Dawn Nafus, Stuck Data, Dead Data, and Disloyal Data: The Stops and Starts in Making Numbers into Social Practices, 15 Distinktion: J. Soc. Theory 208, 208-11 (2014); Cornelius Puschmann & Jean Burgess, Metaphors of Big Data, 8 Int’l J. Commc’n 1690, 1697-1701 (2014); Deborah Lupton, Swimming or Drowning in the Data Ocean? Thoughts on the Metaphors of Big Data, Soc. Life (Oct. 29, 2013), https:‌//‌simplysociology‌.wordpress‌.com‌/2013‌/10‌/29‌/swimming‌-or‌-drowning‌-in‌-the‌-data‌-ocean‌-thoughts‌-on‌-the‌-metaphors‌-of‌-big‌-data [https:‌//‌perma‌.cc‌/26BN‌-MJ5K]; Sara M. Watson, Data Is the New “___,” DIS Mag. (May 28, 2016), http:‌//‌dismagazine‌.com‌/discussion‌/73298‌/sara‌-m‌-watson‌-metaphors‌-of‌-big‌-data [https:‌//‌perma‌.cc‌/A44E‌-J7U5]; Kailash Awati & Simon Buckingham Shum, Big Data Metaphors We Live by, Towards Data Sci. (May 14, 2015), https:‌//‌towardsdatascience‌.com‌/big‌-data‌-metaphors‌-we‌-live‌-by‌-98d3fa44ebf8 [https:‌//‌perma‌.cc‌/6Q4K‌-KY3S]; Cory Doctorow, Personal Data Is as Hot as Nuclear Waste, Guardian (Jan. 15, 2008), https:‌//‌www‌.theguardian‌.com‌/technology‌/2008‌/jan‌/15‌/data‌.security [https:‌//‌perma‌.cc‌/D34R‌-GAFK]; Lilly Irani, Justice for “Data Janitors,” Pub. Books (Jan. 15, 2015), https:‌//‌www‌.publicbooks‌.org‌/justice‌-for‌-data‌-janitors [https:‌//‌perma‌.cc‌/7QMG‌-PVKX].

Cohen, supra note 3, at 38.

U.S. privacy law is a patchwork of state and federal laws, several of which are discussed in greater depth in Part I. Definitions of personal data vary by regulation, but a hallmark of U.S. privacy laws is that many of the obligations they place on regulated entities are tied to “personal data” or “personally identifiable information,” however defined. Some of these definitions are quite broad and encompass much, if not quite all, of the social data discussed in this Feature. For instance, the National Institute of Standards and Technology (NIST) defines personally identifiable information in the federal-agency context as “any information about an individual maintained by an agency, including (1) any information that can be used to distinguish or trace an individual’s identity, such as name, social security number, date and place of birth, mother’s maiden name, or biometric records; and (2) any other information that is linked or linkable to an individual, such as medical, educational, financial, and employment information.” Erika McCallister, Tim Grance & Karen Scarfone, Guide to Protecting the Confidentiality of Personally Identifiable Information (PII), Nat’l Inst. Standards & Tech. 2-1 (2010), https://nvlpubs.nist.gov/nistpubs/Legacy/SP/nistspecialpublication800-122.pdf [https://perma.cc/6RVU-QPG4] (quoting U.S. Gov’t Accountability Off., GAO-08-536, Privacy: Alternatives Exist for Enhancing Protection of Personally Identifiable Information 1, 29 (2008), https://www.gao.gov/assets/gao-08-536.pdf [https://perma.cc/H2VZ-Z8Y9]). State breach-notification laws and data-security laws typically define personal data more narrowly, focusing on sensitive categories of information like social-security numbers, credit-card and financial-account numbers, personal health data, financial data, creditworthiness data, and biometric data. For a list of state data-breach-notification laws, see Security Breach Notification Laws, Nat’l Conf. State Legislatures (Apr. 15, 2021), https://www.ncsl.org/research/telecommunications-and-information-technology/security-breach-notification-laws.aspx [https://perma.cc/CWU6-CMRU].

Article 4 offers the following definition: “‘personal data’ means any information relating to an identified or identifiable natural person (‘data subject’); an identifiable natural person is one who can be identified, directly or indirectly, in particular by reference to an identifier such as a name, an identification number, location data, an online identifier or to one or more factors specific to the physical, physiological, genetic, mental, economic, cultural or social identity of that natural person.” Council Regulation 2016/679, art. 4, 2016 O.J. (L 119) 1, https://eur-lex.europa.eu/legal-content/EN/TXT/PDF/?uri=CELEX:32016R0679 [https://perma.cc/2RZ3-KZKT].
