Legal Theory Lexicon

July 25, 2026

Legal Theory Lexicon 122: False Positives and False Negatives

Introduction

Law students frequently encounter the idea of false positives and false negatives when discussing Blackstone’s ratio: “better that ten guilty persons escape than that one innocent suffer.” The conviction of an innocent person is what is called a “false positive,” whereas a guilty person escaping punishment is a “false negative.” This Lexicon entry analyzes these concepts and connects them to the related ideas of “Type I” and “Type II” errors used in statistics generally and empirical legal studies in particular. Along the way, we explore the relationship of false positives and negatives to standards of proof, which are best understood as devices for allocating the risk of error between the parties to civil litigation or criminal prosecutions. From there, we will glance at how the same framework illuminates other corners of the law, and we will note some complications. This entry in the Legal Theory Lexicon provides an introduction to false positives and false negatives for law students, especially first-year law students, with an interest in legal theory. As always, the Lexicon aims to introduce the basic ideas—the bibliography provides resources for readers who want to go deeper.

The Concepts: False Positives and False Negatives

Imagine a medical test for a serious disease. The test can fail in two different ways. It can tell a healthy patient that she has the disease—a false positive. Or it can tell a sick patient that she is disease-free—a false negative. These two errors are not just different in kind; they have different costs. A false positive means anxiety, follow-up testing, perhaps unnecessary treatment. A false negative means a disease that goes untreated. And here is the crucial point: the two error rates are connected. If we make the test more sensitive so that it catches more cases of the disease, it will also flag more healthy patients by mistake. If we make it more demanding so that it clears more healthy patients, it will miss more sick ones. Any test—indeed, any decision procedure that must sort under uncertainty—faces this trade-off. The law is full of such decision procedures, and the vocabulary of false positives and false negatives has become one of legal theory’s most useful tools for thinking about them.

Let us make the concepts more precise. Talk of false positives and false negatives presupposes three things. First, there must be a question with a true answer—the patient either has the disease or does not; the defendant either committed the crime or did not. Second, there must be a decision procedure that answers the question—a test, a trial, a screening algorithm—and the procedure’s output must be binary: positive or negative, yes or no. Third, the procedure must be fallible, so that its output can diverge from the truth. When these conditions hold, there are exactly four possibilities. The procedure can say “yes” when the true answer is yes—a true positive. It can say “no” when the true answer is no—a true negative. It can say “yes” when the true answer is no—a false positive. And it can say “no” when the true answer is yes—a false negative. The first two outcomes are successes; the last two are the two ways the procedure can fail. Notice that which answer counts as “positive” is a matter of framing: in a criminal trial, we call a conviction the positive outcome because the trial tests the prosecution’s accusation, and a conviction affirms it.

	Procedure says “yes” (positive)	Procedure says “no” (negative)
True answer: yes	True positive (guilty person convicted)	False negative (guilty person acquitted)
True answer: no	False positive (innocent person convicted)	True negative (innocent person acquitted)

The two kinds of error are connected. Every decision procedure has a threshold—a point at which the evidence is deemed sufficient for a “yes.” Move the threshold, and the two error rates move in opposite directions. Lower the threshold, and the procedure says “yes” more readily: false negatives decline (fewer guilty defendants acquitted, fewer diseases missed), but false positives rise (more innocent defendants convicted, more healthy patients flagged). Raise the threshold, and the pattern reverses. Holding the quality of the evidence constant, there is no way to reduce both error rates at once; the only question is how to distribute the inevitable errors between the two categories. This is why the choice of a threshold is never a merely technical matter. It is a normative judgment about which kind of error is worse—and by how much. Blackstone’s ratio is precisely such a judgment: it asserts that a false positive in a criminal trial (convicting the innocent) is at least ten times worse than a false negative (acquitting the guilty). One can, of course, improve on the trade-off itself by gathering better evidence—a more accurate test, a more thorough investigation—but at any given level of accuracy, the trade-off remains.

Type I and Type II Errors

Readers who venture into statistics or empirical legal studies will encounter the same distinction under different names: “Type I error” and “Type II error.” The terminology comes from the theory of statistical hypothesis testing developed by Jerzy Neyman and Egon Pearson in the early twentieth century. In hypothesis testing, the investigator starts with a “null hypothesis”—roughly, the default assumption that there is nothing there: the drug has no effect, the two variables are unrelated. A Type I error is rejecting the null hypothesis when it is actually true—finding an effect that does not exist. A Type II error is failing to reject the null hypothesis when it is actually false—missing an effect that does exist. The two types of error map directly onto false positives and false negatives: a Type I error is a false positive, and a Type II error is a false negative. If the criminal trial is recast in these terms, the null hypothesis is innocence—the presumption of innocence, in fact, is a legal expression of the statistician’s default—so convicting the innocent is a Type I error and acquitting the guilty is a Type II error. Almost no one can remember which type is which without a mnemonic, so here is one: the Type I error is the error of commission (the procedure affirmatively does something it should not), and it comes first, just as sins of commission traditionally come before sins of omission.

Standards of Proof as Error-Allocation Devices

You may not hear about false positives and negatives in your classes directly, but every law student learns about burdens of proof (more precisely, burdens of production and persuasion). The three most important burdens of persuasion are: (1) preponderance of the evidence, (2) clear and convincing evidence, and (3) proof beyond a reasonable doubt. But what are these standards? The most illuminating answer is that a standard of proof is a threshold of the kind described above: it specifies how confident the factfinder must be before returning a “yes.” And because moving the threshold trades one kind of error for the other, the choice among standards is a choice about how to allocate the risk of error between the parties. A low threshold shifts the risk of error toward the defendant: more false positives, fewer false negatives. A high threshold shifts the risk toward the plaintiff or prosecution: fewer false positives, more false negatives. On this view, the standards of proof are not arcane verbal formulas; they are the legal system’s explicit answers to the questions, “which errors do we view as most costly?” and “what price are we willing to pay to avoid them?”

Burdens of proof can be decomposed into burdens of production (which party must raise an issue) and burdens of persuasion (what is the standard of proof on the issue, once it is raised). Burdens of production are important, because the failure to meet such a burden results in an automatic determination of the issue against the party that fails to meet its burden. But in the context of false positives and false negatives, burdens of persuasion do the important work.

Begin with the preponderance of the evidence standard, which governs most civil litigation. The traditional formulation—the plaintiff must show that her claim is “more likely than not” true—places the threshold at just above fifty percent. In error-allocation terms, this is the symmetric solution: it treats a false positive (holding a defendant liable when he should not be) and a false negative (denying recovery to a plaintiff who deserves it) as errors of roughly equal gravity. The implicit judgment is that civil litigants stand on equal footing before the law: a dollar wrongly taken from the defendant is neither better nor worse than a dollar wrongly denied to the plaintiff. If the two errors are equally costly, the sensible aim is simply to minimize total errors, and a just-over-fifty-percent threshold accomplishes that: in every case, the factfinder sides with the party whose position is more likely correct. Justice Harlan made this reasoning explicit in his influential concurrence in In re Winship: in civil litigation, he wrote, we view it as no worse for there to be an erroneous verdict in the defendant’s favor than an erroneous verdict in the plaintiff’s favor, and so the preponderance standard directs the factfinder simply to choose the more probable account.

Discussing burdens of persuasion in terms of probabilities is standard in the law, but there are deep questions lurking behind the idea that preponderance of the evidence should be understood as P > .5 (the probability of the fact to be proved is greater than 50%). This approach is Bayesian, but there is another approach to the interpretation of burdens of persuasion that focuses on the idea of inference to the best explanation. For a discussion of that idea, see Legal Theory Lexicon 089: Inference to the Best Explanation (Abduction) and the articles by Ron Allen and Michael Pardo in the bibliography.

Now consider proof beyond a reasonable doubt, the standard for criminal conviction. This standard aims to minimize false positives (erroneous convictions) at the expense of false negatives (erroneous exonerations). This asymmetry is justified on the assumption that the costs of convicting an innocent person are much greater than the costs of failing to convict someone who is guilty: a false positive means that the state has imprisoned—or in the extreme case, executed—an innocent person, stripping away liberty, reputation, and sometimes life itself; a false negative means that a guilty person goes free. Blackstone’s ratio is an attempt to quantify the asymmetry: better that ten guilty persons escape than that one innocent suffer. In theory, the beyond reasonable doubt standard minimizes false positives. In In re Winship (1970), the Supreme Court held that the reasonable doubt standard is constitutionally required in criminal cases as a matter of due process, and Justice Harlan’s concurrence explained why in exactly the terms of this Lexicon entry: because the disutility of convicting an innocent man is so much greater than the disutility of acquitting a guilty one, the margin of error must be allocated overwhelmingly to the defendant’s side.

Does the beyond reasonable doubt standard actually work to minimize false positives (wrongful convictions)? Answering that question would require an assessment of the way the criminal justice system functions. Given the practice of plea-bargaining, innocent defendants may plead guilty to avoid the costs of a criminal defense and the risks of conviction on a more serious offense or a lengthier sentence.

The clear and convincing evidence standard sits between preponderance of the evidence and beyond reasonable doubt. It governs proceedings in which the stakes are asymmetric but not as radically asymmetric as in a criminal prosecution—and the Supreme Court’s cases identify a recurring pattern: the standard applies when the state seeks to impose a serious deprivation that is nonetheless something less than criminal punishment. In Addington v. Texas (1979), the Court held that involuntary civil commitment requires clear and convincing evidence: erroneously committing a person who is not mentally ill is a grave loss of liberty, so the preponderance standard tilts too far against the individual, but commitment is not punishment, so the criminal standard would overshoot. In Santosky v. Kramer (1982), the Court applied the same logic to the termination of parental rights. Deportation, denaturalization, and civil fraud follow the same pattern in various bodies of law. The intermediate standard is, in effect, an intermediate answer to the error-allocation question: false positives are deemed substantially worse than false negatives, but not so much worse as to warrant the near-categorical protection of proof beyond a reasonable doubt. The three-standard architecture, viewed as a whole, illustrates the idea that the law calibrates its thresholds to its judgments about the relative costs of error.

Error Costs Across the Law

The concept of error costs associated with false positives and negatives is pervasive in the law. Drug regulation: when the FDA decides whether to approve a new drug, a false positive (approving a drug that is unsafe or ineffective) harms the patients who take it, while a false negative (rejecting or delaying a beneficial drug) harms the patients who go without it—and critics have long argued that the agency’s incentives skew toward avoiding the visible first error at the cost of the invisible second. Antitrust: in a series of influential articles and opinions, judges and scholars in the Chicago School tradition argued that the costs of false positives (condemning procompetitive conduct) exceed the costs of false negatives (missing anticompetitive conduct), because market forces eventually erode monopoly power but judicial errors persist as precedent—an argument, associated with Frank Easterbrook’s The Limits of Antitrust, that has shaped doctrine and drawn sharp criticism in the current era of antitrust revival. Free speech: the overbreadth doctrine permits facial challenges to statutes that sweep in protected expression, reflecting a judgment that false positives (suppressing protected speech, with its chilling effects) are systematically worse than false negatives (allowing some unprotected speech to slip through). Preliminary injunctions: the familiar balancing of likelihood of success against irreparable harm is, at bottom, an attempt to minimize the expected cost of error when a court must act before it can know the merits. In each domain, the analytical move is the same: identify the two errors, ask which is more costly, and design the decision procedure accordingly.

Complications and Critiques

The error-cost framework is illuminating, but it is not uncontroversial. One line of objection challenges the framework’s consequentialist premise: talk of “weighing” the cost of convicting the innocent against the cost of acquitting the guilty assumes the two harms are commensurable, and theorists in the deontological tradition deny this—on their view, convicting the innocent is not merely a costlier outcome but a wrong done by the state, which no quantity of avoided false negatives can offset. A second line of objection accepts the framework but questions the law’s traditional answer within it. In The Consequences of Error in Criminal Justice, Daniel Epps argues that the Blackstone ratio looks less obviously correct once we attend to its systemic effects: a regime that strongly protects against false convictions may increase the total amount of crime and punishment, erode the stigma of conviction less than we suppose, and distribute its benefits and burdens in surprising ways—so that even those concerned primarily with protecting the innocent might prefer a less skewed allocation than the tradition assumes. Whether or not one is persuaded, the debate illustrates a general point: the choice of an error ratio is a substantive question of political morality, and the answers we inherit are not beyond question. Deep waters! But we must leave these questions for another day.

Conclusion

The distinction between false positives and false negatives is one of the simplest ideas in legal theory, and one of the most powerful. Any procedure that must answer yes-or-no questions under uncertainty will make both kinds of error, and reducing one kind means accepting more of the other. Once the trade-off is in view, familiar features of the legal landscape snap into focus: the standards of proof are error-allocation devices; Blackstone’s ratio is a normative judgment about relative error costs; the presumption of innocence is the law’s version of the statistician’s null hypothesis; and doctrines from overbreadth to the preliminary injunction standard are calibrations of the same dial. The framework does not answer the hard questions—how much worse one error is than the other, and whether the harms can be weighed on a common scale at all—but it tells you what the hard questions are. That is what a good conceptual tool does. I hope this entry has given you a sense of the concepts and their power. As always, the Lexicon provides an introduction—the bibliography that follows will take you deeper.

Related Lexicon Entries

Bibliography

Ronald J. Allen & Michael S. Pardo, Relative Plausibility and Its Critics, 23 International Journal of Evidence & Proof 5 (2019).

William Blackstone, Commentaries on the Laws of England, Book IV, Chapter 27 (1769).

Frank H. Easterbrook, The Limits of Antitrust, 63 Texas Law Review 1 (1984).

Daniel Epps, The Consequences of Error in Criminal Justice, 128 Harvard Law Review 1065 (2015).

John Kaplan, Decision Theory and the Factfinding Process, 20 Stanford Law Review 1065 (1968).

Larry Laudan, Truth, Error, and Criminal Law: An Essay in Legal Epistemology (2006).

Jerzy Neyman & Egon S. Pearson, On the Problem of the Most Efficient Tests of Statistical Hypotheses, 231 Philosophical Transactions of the Royal Society of London, Series A 289 (1933).

Michael S. Pardo, The Paradoxes of Legal Proof: A Critical Guide, 99 Boston University Law Review 233 (2019).

Alexander Volokh, n Guilty Men, 146 University of Pennsylvania Law Review 173 (1997).

This entry was first published on July 25, 2026.

Link to the Most Recent Version of this Lexicon Entry

Legal Theory Lexicon 122: False Positives and False Negatives

New and revised Legal Theory Lexicon entries are posted on Sunday at the Legal Theory Blog. If you would like to receive Legal Theory Blog content via email, please subscribe at https://lsolum.substack.com/subscribe.

July 24, 2026

Legal Theory Lexicon 121: Intersectionality
Introduction

The idea of “intersectionality” has played an important role in antidiscrimination law and in critical approaches to legal theory. But what is “intersectionality”? Here is a hypothetical that illustrates the concept. Imagine a plaintiff who sues her employer for discrimination. The employer has hired Black men for factory jobs and white women for office jobs, but it has hired no Black women at all. The hypothetical plaintiff is a Black woman. When she sues, the court tells her that she has no claim: the employer hired Black workers, so there was no race discrimination, and the employer hired women, so there was no sex discrimination. Her experience of discrimination—an experience that neither Black men nor white women shared—is invisible to the law. This is not just a hypothetical. Something very much like it happened in a real case, and that case prompted the legal scholar Kimberlé Crenshaw to coin a term that has since traveled from a law review article to nearly every corner of the humanities and social sciences: “intersectionality.”

This entry in the Legal Theory Lexicon provides an introduction to intersectionality for law students, especially first-year law students, with an interest in legal theory. We will begin where the concept began—with a problem in antidiscrimination doctrine—and then work outward: to Crenshaw’s development of the concept, to its role in critical race theory and feminist legal theory, to its remarkable migration from law into philosophy, sociology, and other disciplines, and finally back to the law, where the doctrinal problem that started it all remains only partially resolved. Along the way, we will encounter some important critiques and complications. As always, the Lexicon aims to introduce the basic ideas. There are deep questions about intersectionality that you can explore via the readings in the bibliography.

The DeGraffenreid Problem

The real case that inspired the hypothetical is DeGraffenreid v. General Motors, decided by a federal district court in 1976. Five Black women sued General Motors, challenging a seniority system that operated to their distinctive disadvantage. Before 1964, General Motors simply did not hire Black women. The Black women hired after 1970 lacked seniority, and when recession-driven layoffs came, the “last hired, first fired” rule swept them out. The plaintiffs argued that this amounted to discrimination against Black women as such. The court refused to see the claim. Because General Motors had hired women—white women—there was no sex discrimination. And the plaintiffs’ race discrimination claim, the court reasoned, should be consolidated with another race discrimination case brought by Black men. The court worried that recognizing a distinct claim for Black women would create a “new classification of ‘black women’ who would have greater standing than, for example, a black male,” opening a Pandora’s box of compound claims. The plaintiffs fell through the cracks: their claim was not quite race discrimination, not quite sex discrimination, and the law had no category for what it actually was.

Thirteen years later, Kimberlé Crenshaw made DeGraffenreid the centerpiece of her article, Demarginalizing the Intersection of Race and Sex, published in the University of Chicago Legal Forum in 1989. Crenshaw’s diagnosis was that antidiscrimination law operated on a “single-axis framework.” The law could see race discrimination, and it could see sex discrimination, but it analyzed each axis separately—and it implicitly took the most privileged members of each group as the paradigm. The paradigm victim of race discrimination was a Black man; the paradigm victim of sex discrimination was a white woman. Black women could state a claim only insofar as their experience matched one of these paradigms. Where their experience was the product of racism and sexism operating together, the single-axis framework rendered it invisible. To capture the problem, Crenshaw offered a now-famous metaphor: discrimination is like traffic in an intersection. If a Black woman is harmed standing in the intersection, her injury may result from cars traveling in either direction—or from both at once. Demanding that she prove which single direction the traffic came from is precisely what the law did in DeGraffenreid, and precisely what her situation made impossible.

Crenshaw examined two other cases in Demarginalizing that revealed the same structural problem from different angles. In Moore v. Hughes Helicopters, a court refused to let a Black woman represent a class of all women in a sex discrimination suit, reasoning that she had claimed discrimination only as a Black woman and so could not speak for women generally—a ruling that treated white women’s experience as the standard for “sex discrimination” while treating Black women’s experience as something narrower and more particular. And in Payne v. Travenol, Black women plaintiffs were permitted to sue but the remedy was limited in ways that excluded Black men. Taken together, the cases showed that Black women were sometimes too different to represent the broader group and sometimes not different enough to have a claim of their own.

Crenshaw’s deeper point was that this was not a set of judicial mistakes that better lawyering could fix. The problem was built into the conceptual architecture of antidiscrimination law itself—and, as she went on to argue, into the political movements that shaped it, with feminism organized around the experiences of white women and antiracism organized around the experiences of Black men. Fixing the problem would require rethinking the categories, not just relitigating the cases.

Crenshaw’s Concept: Structural, Political, and Representational Intersectionality

Crenshaw introduced the term “intersectionality” in Demarginalizing, but she developed the concept most fully in a second article, Mapping the Margins: Intersectionality, Identity Politics, and Violence Against Women of Color, published in the Stanford Law Review in 1991. The second article shifted the setting from employment discrimination to violence against women—battering and rape—and asked how the experiences of women of color were shaped by the interaction of race and gender. In the course of that inquiry, Crenshaw distinguished three forms of intersectionality, and the distinctions have structured discussion of the concept ever since: (1) structural intersectionality, (2) political intersectionality, and (3) representational intersectionality. We will consider each in turn.

Structural intersectionality refers to the way in which the social location of women of color—their position at the intersection of race, gender, and often class—makes their actual experience of subordination qualitatively different from the experience of those who face only one form of subordination. Crenshaw’s examples in Mapping the Margins came from her fieldwork on domestic violence. A battered woman who is poor, unemployed, and responsible for children faces obstacles to escaping an abusive relationship that a woman with economic resources does not. A battered immigrant woman whose lawful residence depends on remaining married to her abuser—as it did under the immigration law of the time, which required couples to stay married for two years before a spouse could obtain permanent status—may be trapped in ways that citizenship would prevent. A woman who does not speak English may find that the local shelter cannot accommodate her. The point of these examples is that the burdens intersect: race, gender, class, and immigration status are not separate difficulties added one on top of another, but interacting dimensions of a single predicament. Interventions designed for the paradigm case—say, a shelter system built around the needs of English-speaking women with some economic independence—will systematically fail those at the intersection, even when no one intends any such failure.

Political intersectionality refers to a different problem: women of color are situated within at least two political movements—feminism and antiracism—that pursue agendas which frequently diverge and sometimes conflict. Each movement, Crenshaw argued, has tended to frame its agenda around the experiences of its most privileged members, so that feminist advocacy often presupposes the situation of white women and antiracist advocacy often presupposes the situation of Black men. The result is that women of color are asked to split their political energies and, worse, sometimes find that a gain for one movement comes at their expense. Crenshaw’s example from the domestic violence context is instructive. Antiracist advocates, concerned that statistics on domestic violence in minority communities would reinforce stereotypes of Black men as violent, sometimes resisted efforts to publicize the problem—a strategy that protected the community’s image at the cost of rendering the suffering of Black women invisible. Feminist advocates, for their part, sometimes emphasized that domestic violence crosses all racial and class lines—a strategy that made the issue politically salient for white audiences at the cost of obscuring the distinctive barriers facing women of color. In both cases, the interests of women of color were subordinated to a political agenda organized around someone else’s paradigm.

Representational intersectionality concerns the cultural construction of women of color—the way they are depicted in popular culture and the way debates about those depictions unfold. Crenshaw’s principal example in Mapping the Margins was the obscenity prosecution of the rap group 2 Live Crew, whose lyrics were sexually explicit and degrading to Black women. The public controversy quickly organized itself into a familiar binary: critics of the prosecution defended the group’s music as Black cultural expression unfairly targeted by a racially selective obscenity prosecution, while supporters of the prosecution condemned the lyrics as misogyny. Crenshaw’s point was that both sides erased Black women. The antiracist defense asked Black women to overlook the degrading imagery in the name of racial solidarity; the feminist critique proceeded as though the racial politics of the prosecution—why this group, and not equally explicit white performers?—were beside the point. Neither frame could hold both dimensions in view at once. Representational intersectionality thus mirrors, at the level of culture and discourse, the same single-axis logic that structural and political intersectionality identify in social arrangements and political movements.

Intersectionality in Critical Race Theory and Feminist Legal Theory

Intersectionality did not emerge in an intellectual vacuum. Crenshaw was one of the founding figures of critical race theory, the movement in American legal scholarship that emerged in the late 1980s from a working group of scholars including Crenshaw, Derrick Bell’s students and successors, Richard Delgado, Mari Matsuda, Patricia Williams, and others. Critical race theory took as its starting point the persistence of racial subordination after the formal victories of the civil rights era, and it turned a critical eye on the legal concepts—colorblindness, intent, formal equality—that seemed to explain why the law could declare discrimination illegal while leaving racial hierarchy substantially intact. Intersectionality fit naturally within this project: it was a demonstration, worked out in doctrinal detail, that facially neutral legal categories could systematically disadvantage a subordinated group. But intersectionality also functioned as an internal critique. By insisting that the paradigm subject of antiracist advocacy was a Black man, Crenshaw was challenging her own movement to confront the ways its agenda reproduced the marginalization it opposed.

The relationship to feminist legal theory ran along parallel lines. Feminist legal scholars had long debated whether equality for women was best pursued through formal equal treatment or through doctrines responsive to women’s distinctive circumstances—the “sameness/difference” debate. Intersectionality cut into this debate at an angle, by asking a prior question: which women? Angela Harris’s article, Race and Essentialism in Feminist Legal Theory, published in the Stanford Law Review in 1990, gave the challenge its canonical form. Harris argued that leading feminist theorists—her principal examples were Catharine MacKinnon and Robin West—had built their accounts on “gender essentialism”: the assumption that there is a unitary experience of womanhood that can be described independently of race, class, and other dimensions of identity. The essential woman, Harris argued, turned out to look suspiciously like a white, middle-class woman, and the experiences of Black women were treated as that paradigm plus an added increment of racial disadvantage—precisely the additive logic that DeGraffenreid had enacted in doctrine. The anti-essentialist critique and intersectionality are thus two faces of the same insight: one directed at legal categories, the other at feminist theory itself.

Within legal theory, intersectionality also connects to a deeper debate about the point of antidiscrimination law. On one view—often called the anti-classification principle—the law’s aim is to prevent the government and employers from sorting people by suspect categories such as race and sex. On a rival view—the anti-subordination principle—the law’s aim is to dismantle social hierarchies that subordinate some groups to others. Intersectionality sits far more comfortably with the second view. If the wrong of discrimination were simply classification, the single-axis framework might seem adequate: just ask whether race or sex was used. But if the wrong is subordination, then the law must attend to how subordination actually operates in the world—and Crenshaw’s central claim was that it operates along intersecting axes, producing distinctive burdens for those situated at the intersections. It is no accident that intersectionality emerged from scholarly movements committed to the anti-subordination view, or that skeptics of anti-subordination approaches tend also to be skeptics of intersectionality. The relationship between anti-subordination and intersectionality is discussed below when we examine critiques of intersectionality.

From Law to Philosophy and Beyond

Few concepts in the history of legal scholarship have traveled as far as intersectionality. A term coined in a law review article in 1989 is now a standard part of the vocabulary of sociology, political science, psychology, history, public health, and philosophy—and, beyond the academy, of political discourse itself. But the traffic did not flow in only one direction. Crenshaw named the concept, but she did not invent the underlying idea, and she has never claimed otherwise. The idea has a long genealogy in Black feminist thought. Sojourner Truth’s famous 1851 speech to the Akron women’s convention posed the question—“Ain’t I a woman?”—that the single-axis framework could not answer. Anna Julia Cooper’s A Voice from the South (1892) analyzed the distinctive position of Black women a century before the term existed. In 1969, Frances Beal described the “double jeopardy” of being Black and female; in 1977, the Combahee River Collective’s statement described “interlocking” systems of oppression; and in 1988, Deborah King wrote of “multiple jeopardy, multiple consciousness.” Crenshaw’s contribution was to give this tradition a name, a set of doctrinal illustrations, and an analytical structure that proved extraordinarily portable.

The first major development outside law came in social theory. Patricia Hill Collins’s Black Feminist Thought, published in 1990—one year after Demarginalizing—offered a systematic account of Black women’s social position and the knowledge that position generates. Collins introduced the “matrix of domination”: the idea that race, class, gender, and sexuality are not separate systems of oppression but a single interlocking structure, organized through intersecting axes and experienced differently depending on where in the matrix a person is located. Where Crenshaw’s intersection metaphor pictured discrete axes crossing at a point, Collins’s matrix pictured an overall social structure in which the axes are mutually constructing—race is always already gendered, gender always already racialized. Collins also emphasized the epistemic dimension: those situated at particular locations in the matrix have distinctive standpoints, forms of knowledge that are unavailable, or at least not readily available, from more privileged locations. Readers who have encountered the Lexicon entry on Epistemic Injustice will recognize the affinity: both literatures insist that social position shapes what can be known and whose knowledge is credited. In later work, Collins has argued that intersectionality should be understood as a full-fledged critical social theory, with implications for sociology, epistemology, and political thought.

As intersectionality spread across the social sciences, a second wave of scholarship turned reflexive, asking what the concept is and how research should use it. Leslie McCall’s article, The Complexity of Intersectionality, published in Signs in 2005, became the standard reference for methodology. McCall distinguished three approaches to the study of intersecting categories: anticategorical approaches, which deconstruct the categories themselves; intracategorical approaches, which focus on particular neglected intersections (Black women in DeGraffenreid, for example); and intercategorical approaches, which provisionally accept the categories and study how inequality varies across the full set of intersecting groups. Jennifer Nash’s Re-thinking Intersectionality (2008) pressed a set of internal questions that remain live: whether intersectionality is a theory of marginalized subjects in particular or of identity in general, whether Black women had been asked to serve as the concept’s perpetual exemplars, and whether the framework attends adequately to the ways privilege and subordination can coexist in a single life. And in 2013, Sumi Cho, Crenshaw, and McCall together took stock of what had by then become “intersectionality studies,” making an important framing move: intersectionality, they argued, is best understood not as a fixed theory of identity but as an “analytic sensibility”—a way of asking questions about how categories interact—whose value is shown by the work it enables rather than by a canonical definition.

The most recent development is the arrival of intersectionality in analytic philosophy. For many years, feminist philosophers observed that mainstream philosophy had largely ignored the concept, but that has changed. One strand of this work is metaphysical. Sara Bernstein’s article, The Metaphysics of Intersectionality, published in Philosophical Studies in 2020, asks what intersectionality claims actually assert about social categories. Is the category “Black woman” merely the conjunction of two prior categories, Black and woman? Or is the intersectional category explanatorily prior—a unified social position whose features cannot be recovered from the constituent categories taken separately? Bernstein defends a version of the priority view, drawing on the tools of contemporary metaphysics, and her article has generated responses and refinements, including Holly Lawford-Smith and Kate Phelan’s The Metaphysics of Intersectionality Revisited (2022) and work by Marta Jorba and Maria Rodó-de-Zárate treating intersectional experience as emergent. A second strand is methodological: Liam Kofi Bright, Daniel Malinsky, and Morgan Thompson have shown that core intersectionality claims can be formulated precisely within the framework of causal modeling, connecting the concept to the philosophy of science. A third strand, associated with Ann Garry, proposes that intersectionality is best understood not as a substantive theory at all but as a “framework checker”—a test that any adequate account of oppression or social identity must pass. The philosophical debates are young, but their existence confirms the trajectory this section has traced: a concept forged to solve a problem in Title VII doctrine is now doing work in metaphysics and the philosophy of science.

Intersectionality in Antidiscrimination Doctrine

What became of the doctrinal problem that started it all? The answer is: partial progress. Even before Demarginalizing appeared, the Fifth Circuit in Jefferies v. Harris County Community Action Ass’n (1980) rejected the DeGraffenreid approach and held that Title VII protects Black women as a distinct class, reasoning that discrimination “on the basis of sex” includes discrimination against a subclass of women defined by race—the doctrinal cousin of the “sex-plus” cases, in which courts recognized claims by women disadvantaged by sex plus some additional characteristic, such as having young children. Most courts have since followed Jefferies, and the Ninth Circuit’s decision in Lam v. University of Hawai’i (1994) is the standout treatment: reversing summary judgment against an Asian American woman law professor, the court explained that where two bases of discrimination exist, they cannot be “neatly reduced to distinct components,” and that the attempt to analyze her claim as race discrimination plus sex discrimination in sequence was itself the error. But recognition has limits. Courts continue to struggle with compound claims—some worry, echoing DeGraffenreid, about a proliferation of subclasses; class certification for intersectionally defined groups remains awkward; and plaintiffs asserting multiple bases of discrimination sometimes fare worse in practice than plaintiffs asserting one. The single-axis framework has been officially disavowed but not fully dislodged.

Critiques and Complications

The most persistent analytical objection to intersectionality is the proliferation problem, sometimes called the problem of infinite regress. If Black women constitute a distinct category, why not Black lesbian women, or Black disabled lesbian women, or any of the indefinitely many groups generated by combining race, sex, class, sexual orientation, disability, religion, age, and immigration status? Each addition seems as principled as the last, but the endpoint of the progression is a category of one—each individual as the unique intersection of all of her attributes—at which point the framework seems to dissolve group-based analysis altogether. The worry takes different forms in different domains. In doctrine, it is the DeGraffenreid court’s fear of an unmanageable multiplication of protected subclasses. In political theory, it is the concern—pressed by the philosopher Naomi Zack, among others—that intersectionality fragments feminism into ever-smaller constituencies and thereby undermines the possibility of common cause. Defenders of intersectionality have responses. Alison Bailey has answered Zack directly, arguing that the fragmentation worry oversimplifies the solidarities that intersectional analysis makes possible. And the Cho-Crenshaw-McCall reframing of intersectionality as an analytic sensibility rather than a taxonomy of groups is another: the point is not to enumerate categories but to ask, in any given inquiry, whether the categories in use are concealing anyone. And on this reading the regress never gets started, because intersectional analysis is always anchored to a particular problem—a seniority system, a shelter’s intake policies—rather than to the abstract project of completing the list of groups.

A second family of objections comes from critics of the broader theoretical projects in which intersectionality is embedded. Recall the connection drawn above between intersectionality and the anti-subordination account of antidiscrimination law. For theorists committed to the rival anti-classification view, that connection is precisely the problem. On the anti-classification view, the law’s commitment is to the individual: each person is entitled to be treated without regard to race or sex, and the moral unit of antidiscrimination law is the individual person, not the social group. From this perspective, intersectionality compounds what was already objectionable in group-based thinking—it multiplies the group categories through which law and policy are asked to see people, entrenching rather than transcending the salience of race and sex. Related objections have figured in public and political debate: that intersectionality functions as a ranking of groups by relative disadvantage, or that it sorts individuals into categories of privilege and oppression in ways that are themselves essentializing. Defenders reply that these objections mistake the concept for some of its popularizations—that intersectionality, properly understood, is a claim about how discrimination operates, not a moral ranking of persons—and that the individualist objection simply restates the anti-classification premise that intersectionality’s proponents reject. At this point, the dispute over intersectionality becomes a dispute about the deepest questions in equality theory, and those questions are beyond the scope of this entry.

Conclusion

Intersectionality began as a lawyer’s diagnosis of a doctrinal failure: antidiscrimination law could not see plaintiffs whose injuries arose from the interaction of race and sex. Four decades later, it is one of the most influential concepts ever to emerge from legal scholarship—a fixture of critical race theory and feminist legal theory, a research paradigm across the social sciences, and, most recently, a topic in analytic metaphysics and the philosophy of science. The concept’s travels have not settled the questions it raises. Courts still struggle with compound discrimination claims; theorists still debate whether intersectionality is a theory, a heuristic, or a sensibility; and the concept remains entangled in larger disputes about the aims of equality law. But whatever position one ultimately takes, the core insight is now hard to unsee: systems of subordination interact, and frameworks built to address them one at a time will miss the people caught at the intersections. I hope this entry has given you a sense of the concept, its origins, and its trajectory. As always, the Lexicon provides an introduction—the bibliography that follows will take you deeper.

Related Lexicon Entries
Bibliography

Alison Bailey, On Intersectionality, Empathy and Feminist Solidarity: A Reply to Naomi Zack, 19 Journal for Peace and Justice Studies 14 (2009).

Frances M. Beal, Double Jeopardy: To Be Black and Female (1969), reprinted in 8 Meridians 166 (2008).

Sara Bernstein, The Metaphysics of Intersectionality, 177 Philosophical Studies 321 (2020).

Liam Kofi Bright, Daniel Malinsky & Morgan Thompson, Causally Interpreting Intersectionality Theory, 83 Philosophy of Science 60 (2016).

Anna Carastathis, Intersectionality: Origins, Contestations, Horizons (2016).

Sumi Cho, Kimberlé Williams Crenshaw & Leslie McCall, Toward a Field of Intersectionality Studies: Theory, Applications, and Praxis, 38 Signs 785 (2013).

Patricia Hill Collins, Black Feminist Thought: Knowledge, Consciousness, and the Politics of Empowerment (1990).

Patricia Hill Collins, Intersectionality as Critical Social Theory (2019).

Patricia Hill Collins & Sirma Bilge, Intersectionality (2d ed. 2020).

Combahee River Collective, A Black Feminist Statement (1977).

Anna Julia Cooper, A Voice from the South (1892).

Kimberlé Crenshaw, Demarginalizing the Intersection of Race and Sex: A Black Feminist Critique of Antidiscrimination Doctrine, Feminist Theory and Antiracist Politics, 1989 University of Chicago Legal Forum 139.

Kimberlé Crenshaw, Mapping the Margins: Intersectionality, Identity Politics, and Violence Against Women of Color, 43 Stanford Law Review 1241 (1991).

Ann Garry, Intersectionality, Metaphors, and the Multiplicity of Gender, 26 Hypatia 826 (2011).

Angela P. Harris, Race and Essentialism in Feminist Legal Theory, 42 Stanford Law Review 581 (1990).

Deborah K. King, Multiple Jeopardy, Multiple Consciousness: The Context of a Black Feminist Ideology, 14 Signs 42 (1988).

Holly Lawford-Smith & Kate Phelan, The Metaphysics of Intersectionality Revisited, 30 Journal of Political Philosophy 166 (2022).

Leslie McCall, The Complexity of Intersectionality, 30 Signs 1771 (2005).

Jennifer C. Nash, Re-thinking Intersectionality, 89 Feminist Review 1 (2008).

Jennifer C. Nash, Black Feminism Reimagined: After Intersectionality (2019).

Naomi Zack, Inclusive Feminism: A Third Wave Theory of Women’s Commonality (2005).

This entry was first published on July 24, 2026.

Link to the Most Recent Version of this Lexicon Entry

Legal Theory Lexicon 121: Intersectionality

New and revised Legal Theory Lexicon entries are posted on Sunday at the Legal Theory Blog. If you would like to receive Legal Theory Blog content via email, please subscribe at https://lsolum.substack.com/subscribe.
July 18, 2026

Legal Theory Lexicon 120: History and Tradition in Constitutional Theory
Introduction

Law students are almost certain to encounter the phrase “history and tradition” in constitutional law. The phrase appears in cases about guns, abortion, and prayer at public high school football games. But what does “history and tradition” actually mean? Is history and tradition simply another name for originalism? Or is the turn to history and tradition part of a reconfiguration of constitutional pluralism by conservative justices? More radically, is there something genuinely new afoot: is traditionalism emerging as a distinctive approach to law?

It turns out that answering these questions is far from simple. We can begin with the remarkable spread of history-and-tradition tests across multiple areas of constitutional doctrine. Next, we examine an important methodological debate on the Supreme Court itself—between Justice Kavanaugh, the method’s most systematic judicial expositor, and Justice Barrett, who has pointedly observed that not all reliance on history is originalism. That debate opens onto a larger possibility: that we are witnessing the emergence of a new constitutional pluralism, in which text, precedent, and history and tradition function as the leading modalities of constitutional argument. Finally, we turn to theory, tracing traditionalism from Edmund Burke through Cass Sunstein’s Burkean minimalism to Marc DeGirolami’s traditionalism—the most fully developed account of history and tradition as a distinctive approach to constitutional law.

As always, this Lexicon entry provides an introduction to history and tradition in constitutional theory for law students, especially first-year law students, with an interest in legal theory.

The Doctrinal Turn to History and Tradition

The phrase “history and tradition” appears in Supreme Court opinions across multiple doctrinal areas, including substantive due process, the Second Amendment, the Establishment Clause, and beyond. But the role of history and tradition varies. There are history-and-tradition tests, but historical practice also functions as a modality of constitutional argument or a factor in constitutional reasoning.

Although the ubiquity of “history and tradition” might seem like something new, the Supreme Court’s use of the phrase goes back decades. The first appearance in the United States Reports was in Justice Powell’s plurality opinion in Moore v. City of East Cleveland (1977), in which he stated that “the Constitution protects the sanctity of the family precisely because the institution of the family is deeply rooted in this Nation’s history and tradition.” Powell’s use of the phrase was echoed in Washington v. Glucksberg (1997), in which the Court articulated the principle that the Due Process Clause protects only those unenumerated rights that are “deeply rooted in this Nation’s history and tradition” and “implicit in the concept of ordered liberty.” Glucksberg used the formula as a limiting device: because assisted suicide was not deeply rooted in historical practice, there was no fundamental right to it. A quarter century later, Dobbs v. Jackson Women’s Health Organization (2022) made the Glucksberg test the centerpiece of its analysis, concluding that a right to abortion was not deeply rooted in the nation’s history and tradition—and overruling Roe v. Wade on that basis. After Dobbs, the deeply-rooted formula is the governing test for the recognition of unenumerated rights.

Another domain in which “history and tradition” has become prominent is the Second Amendment. In New York State Rifle & Pistol Association v. Bruen (2022), the Court discarded the two-step framework that the courts of appeals had developed—a framework whose second step involved means-end scrutiny—and replaced it with a test keyed entirely to text and history: when the Second Amendment’s plain text covers an individual’s conduct, the government must justify its regulation by demonstrating that it is consistent with the nation’s historical tradition of firearm regulation. Under Bruen, the government carries its burden by identifying historical analogues—founding-era regulations relevantly similar to the modern law. United States v. Rahimi (2024) then clarified—and softened—the test: the question is whether the challenged regulation is consistent with the principles that underpin the historical tradition, not whether it has a historical twin. Most recently, Wolford v. Lopez (2026) struck down Hawaii’s ban on carrying firearms onto private property open to the public without the owner’s permission, holding that the state’s proffered historical analogues were not relevantly similar to the modern law.

Next, consider the role of “history and tradition” in the Supreme Court’s recent Establishment Clause cases. For decades, Establishment Clause doctrine was governed—at least nominally—by the three-part test of Lemon v. Kurtzman. But in Town of Greece v. Galloway (2014) and American Legion v. American Humanist Association (2019), the Court increasingly resolved cases by reference to historical practices, upholding legislative prayer and a longstanding memorial cross on the strength of tradition. Kennedy v. Bremerton School District (2022) made the transformation official: the Court announced that Lemon had been abandoned and that Establishment Clause claims must be resolved by “reference to historical practices and understandings.”

Kavanaugh and Barrett: A Methodological Debate

The rise of history and tradition has produced a revealing methodological debate within the Supreme Court itself. The debate’s leading figures are Justice Kavanaugh and Justice Barrett, who have articulated very different conceptions of the role that history and tradition should play in the Court’s reasoning.

Justice Kavanaugh has been the principal proponent of “history and tradition” in recent years. His concurrence in United States v. Rahimi is the fullest statement: constitutional interpretation, he argued, properly relies on text, pre-ratification history, post-ratification history, and precedent—and where the text is vague or broadly worded, history is a more legitimate and more determinate guide than judge-made balancing tests or judicial policymaking. On Kavanaugh’s account, the turn to history and tradition is a turn away from the tiers of scrutiny and interest-balancing that dominated late-twentieth-century constitutional law. History constrains; balancing empowers. A judge who follows historical practice is enforcing the law’s continuity; a judge who balances interests is legislating.

Justice Barrett has articulated a contrasting view of the role of history and tradition. In her Vidal v. Elster concurrence, and again in Rahimi, she pointedly observed that not all reliance on history is originalism. Original meaning, on her account, is fixed at ratification; evidence from before and shortly after ratification can illuminate that meaning. But post-ratification practice—“tradition”—is something different. When the Court treats a tradition of regulation as itself dispositive of constitutional meaning, without connecting that tradition to the original meaning of the text, it is no longer doing originalism. Tradition, she wrote in Vidal, is “not an end in itself.” A longstanding practice may be evidence of original meaning, or it may reflect nothing more than the accumulated inertia of unchallenged laws. For Barrett, the crucial question—one the Court has not squarely answered—is why tradition should matter: is it evidence of meaning, a form of liquidation, or an independent source of constitutional law?

The Kavanaugh–Barrett exchange may be more than a quarrel among originalists. It exposes a fault line that runs through the entire history-and-tradition enterprise. If tradition matters only as evidence of original meaning, then history and tradition is a tool of originalism—and post-ratification practice should yield whenever it conflicts with the text’s original meaning. But if tradition matters in its own right—if enduring practice has constitutional authority simply because it has endured—then something else is going on. That something else is the subject of the remainder of this entry.

A New Constitutional Pluralism?

Let’s step back from the details of the doctrine and the debate. Some constitutional theorists have argued that constitutional practice is pluralistic: courts and lawyers argue from multiple modalities—text, structure, history, precedent, consequences, and national ethos, in Philip Bobbitt’s famous catalog. No single modality governs; constitutional argument is a practice of deploying and reconciling several forms of argument. (For more on this topic, see Legal Theory Lexicon 100: Constitutional Pluralism.)

Here is one possibility: the history-and-tradition cases suggest that the Roberts Court may be developing a distinctive conservative version of constitutional pluralism. The modalities that now do the heavy lifting in the Court’s constitutional cases are three: text, precedent, and “history and tradition.” This version of constitutional pluralism is conservative in the following sense: it elevates the backward-looking modalities, and excises the forward-looking modes of constitutional argument.

If this reading is correct, the turn to history and tradition is neither originalism nor a wholly new method. It is a reweighting of the modalities of constitutional argument—a new configuration of constitutional pluralism in which “history and tradition” plays a decisive role. Whether this configuration is stable, and whether it can be justified, are questions that require a theory of why tradition should matter at all.

Traditionalism as a General Framework: Burke, Sunstein, and DeGirolami

Why should tradition matter in constitutional law? One answer in the Anglo-American tradition comes from Edmund Burke, the eighteenth-century statesman and philosopher. Burke’s Reflections on the Revolution in France (1790) is the classic statement. Burke distrusted abstract reason as a guide to political life. The stock of reason in any single individual—or any single generation—is small, he argued, and individuals would do better to avail themselves of “the general bank and capital of nations, and of ages.” Long-standing institutions and practices embody the accumulated wisdom of many generations: they have been tested by experience, adjusted by trial and error, and refined in ways that no single mind could design or fully articulate. On the Burkean view, an enduring practice carries a presumption of wisdom precisely because it has endured—many generations have found it workable, and its latent functions may exceed what any participant can state. Tradition, for Burke, is the repository of accumulated wisdom—a deep well of practical reason.

Cass Sunstein brought Burke into contemporary constitutional theory. In his article Burkean Minimalism, Sunstein described a distinctive style of constitutional decision: Burkean minimalists favor small, incremental steps; they prefer rulings that are narrow rather than wide and shallow rather than deep; and—crucially—they treat long-standing practices and traditions, rather than abstract theories or moral readings of the Constitution, as the touchstone of constitutional legitimacy. Sunstein contrasted Burkean minimalism with rationalist alternatives on both left and right: with progressive perfectionism, which tests traditions against moral theory, but equally with ambitious forms of originalism, which are willing to overturn settled practice in the name of recovered original meaning. The Burkean minimalist is skeptical of both—of theories that would remake constitutional law from the top down, whatever their political valence.

Most recently, Marc DeGirolami has developed the most systematic account of tradition’s role in contemporary constitutional law—what he calls “traditionalism.” In First Amendment Traditionalism and Traditionalism Rising, DeGirolami identifies traditionalism as a distinct interpretive method with three characteristic commitments. First, the object of interpretation is concrete practices—enduring patterns of conduct and regulation—rather than abstract principles or semantic meanings. Second, practices qualify as constitutionally authoritative in virtue of their age, their longevity or endurance, and their density—the extent to which they have been widespread and consistently maintained across jurisdictions and over time. Third, endurance does normative work: the presumption of a practice’s constitutionality strengthens as the practice persists, before, at, and after ratification. On DeGirolami’s account, traditionalism is genuinely distinct from originalism: the originalist fixes constitutional meaning at ratification, while the traditionalist treats ratification as one moment in a longer continuity of practice. And traditionalism is distinct from Burkean minimalism too: where Sunstein’s Burkean prizes narrow, incompletely theorized rulings, the traditionalist is willing to announce broad, practice-based rules—Bruen and Kennedy are anything but minimalist. DeGirolami argues that traditionalism, not originalism, is the best description of what the Roberts Court is actually doing in its history-and-tradition cases—a descriptive claim with normative implications. DeGirolami’s theory of traditionalism raises the question whether tradition’s authority can be justified on its own terms and suggests the possibility that “history and tradition” might be the master principle of constitutional theory: one modality that rules them all.

Critiques

Traditionalism has critics on both the right and the left.

The first critique comes from originalists. If the constitutional text’s original public meaning is the law, then post-ratification practice has no independent authority: tradition is admissible only as evidence of original meaning, and it must yield whenever the two conflict. On this view—pressed by Justice Barrett within the Court and by originalist scholars outside it—traditionalism is not a refinement of originalism but a rival to it, and a dangerous one, because it substitutes the accumulated behavior of government actors for the ratified text. A long tradition of regulation may show nothing more than a long tradition of constitutional violation.

The second critique comes from progressive constitutional scholars, most prominently Reva Siegel. On this account, the Court’s history-and-tradition jurisprudence is selective: the justices choose which historical periods, which practices, and which levels of generality to consult, and those choices track conservative outcomes rather than neutral method. Siegel has argued that Dobbs’s claim to restore the democratic process concealed the decision’s roots in a decades-long political mobilization—and that history-and-tradition tests function to entrench the results of that mobilization in the language of judicial restraint. The charge, in short, is that “law office history” licenses motivated reasoning while disclaiming responsibility for present-day judgments.

The third critique asks: whose traditions? The historical practices that history-and-tradition tests consult were formed when women could not vote, Black Americans were enslaved or disenfranchised, and other groups were excluded from the political community altogether. To give those practices constitutional authority, the critique runs, is to entrench the exclusions under which they were made. A method that measures rights by “deeply rooted” traditions will systematically disadvantage those whose interests the tradition-makers ignored—a point pressed forcefully in the Dobbs dissent and in the scholarly literature on the gendered and racialized character of the historical record.

Conclusion

Return to the questions with which we began. Is history and tradition simply another name for originalism? The Kavanaugh–Barrett debate suggests that the answer is no—or at least, not necessarily. Tradition can serve as evidence of original meaning, but the Court’s history-and-tradition cases frequently treat enduring practice as authoritative in its own right, and that is something originalism cannot easily explain. Is the turn to history and tradition a reconfiguration of constitutional pluralism? Quite possibly: text, precedent, and history and tradition now function as the leading modalities of the Roberts Court’s constitutional jurisprudence—a conservative pluralism that elevates the backward-looking forms of argument and excises the forward-looking ones. And is there something genuinely new afoot? DeGirolami’s traditionalism makes the case that there is: a distinctive interpretive method, with Burkean roots, that treats concrete, enduring, dense practices as the very substance of constitutional law.

Whether traditionalism can bear the weight now being placed upon it remains to be seen. The critiques are serious: originalists deny that tradition has independent authority; progressives charge that the method is selective; and the whose-traditions objection asks why practices formed under conditions of exclusion should govern an inclusive republic. But whatever one’s verdict, the phenomenon itself is undeniable. History and tradition is no longer a phrase confined to a single doctrinal formula. It is a method—perhaps a theory—on the rise. Law students who master it will understand a great deal about the constitutional law of the present moment; and they will be well equipped to evaluate the constitutional law that is coming.

Related Lexicon Entries
Bibliography

Barnett, Randy E., and Lawrence B. Solum. Originalism after Dobbs, Bruen, and Kennedy: The Role of History and Tradition, 118 Nw. U. L. Rev. 433 (2023).

Baude, William. Constitutional Liquidation, 71 Stan. L. Rev. 1 (2019).

Bobbitt, Philip. Constitutional Fate: Theory of the Constitution. New York: Oxford University Press, 1982.

Burke, Edmund. Reflections on the Revolution in France (1790). Oxford World’s Classics ed. (L.G. Mitchell ed.). Oxford: Oxford University Press, 2009.

DeGirolami, Marc O. First Amendment Traditionalism, 97 Wash. U. L. Rev. 1653 (2020).

DeGirolami, Marc O. Traditionalism Rising, 24 J. Contemp. Legal Issues 9 (2023).

Girgis, Sherif. Living Traditionalism, 98 N.Y.U. L. Rev. 1477 (2023).

McConnell, Michael W. Tradition and Constitutionalism Before the Constitution, 1998 U. Ill. L. Rev. 173.

Siegel, Reva B. Memory Games: Dobbs’s Originalism as Anti-Democratic Living Constitutionalism—and Some Pathways for Resistance, 101 Tex. L. Rev. 1127 (2023).

Siegel, Reva B. The History of History and Tradition: The Roots of Dobbs’s Method (and Originalism) in the Defense of Segregation, 133 Yale L.J. Forum 99 (2023).

Strauss, David A. Common Law Constitutional Interpretation, 63 U. Chi. L. Rev. 877 (1996).

Sunstein, Cass R. Burkean Minimalism, 105 Mich. L. Rev. 353 (2006).

(First created on July 19, 2026.)

Link to the Most Recent Version of this Lexicon Entry

Legal Theory Lexicon 120: History and Tradition in Constitutional Theory

New and revised Legal Theory Lexicon entries are posted on Sunday at the Legal Theory Blog. If you would like to receive Legal Theory Blog content via email, please subscribe at https://lsolum.substack.com/subscribe.
July 12, 2026

Legal Theory Lexicon 119: Structural Injustice
Introduction

The idea of “structural injustice” has become a fixture of contemporary political discourse. Politicians, journalists, and activists invoke structural racism, structural poverty, and structural inequality to describe social problems that seem to persist no matter who holds office or whether any particular individuals intend to discriminate. The core intuition is that serious injustice can exist even when no identifiable person has done anything wrong. Structural injustice is a powerful notion, but what exactly does “structural” mean? And how does the idea of structural injustice relate to law?

The relationship between structural injustice and law is complicated. Law is, for the most part, organized around a different picture of injustice—a picture in which harms are traced to particular wrongdoers who perform discrete wrongful acts with culpable states of mind. Tort law looks for a negligent defendant. Criminal law looks for a guilty actor with a guilty mind. And constitutional antidiscrimination law, since the Supreme Court’s decision in Washington v. Davis, looks for discriminatory purpose: a facially neutral law or policy does not violate the Equal Protection Clause merely because it has a racially disparate impact.

The theory of structural injustice challenges the picture of injustice that is implicit in legal doctrine. It suggests that some of the most serious injustices are not the product of bad actors with bad intentions at all. Instead, they emerge from the ordinary operation of social structures—institutions, rules, norms, markets, and accumulated patterns of behavior—through the uncoordinated actions of many individuals, most of whom act within the bounds of accepted rules and conventional morality. If that is right, then a legal system that conditions remedies on individual fault may be systematically blind to an important category of injustice.

This entry provides an introduction to the idea of structural injustice for law students, especially first-year law students, with an interest in legal theory.

The Idea of Structural Injustice

The best way to grasp the idea of structural injustice is through an example. The political philosopher Iris Marion Young, whose book Responsibility for Justice is the most influential philosophical account of structural injustice, provides the story of Sandy, a single mother who works as a sales clerk. The apartment building where she rents is purchased by a developer who plans to convert it to condominiums, so Sandy must find a new place to live. Apartments near her job are too expensive. Affordable apartments are far away, so she needs a car to commute—and the car payments eat into the money she has for rent. Landlords require a deposit of three months’ rent, which Sandy simply does not have. In the end, Sandy and her children face the prospect of homelessness.

Now ask: who has wronged Sandy? The developer who bought her building broke no law and violated no one’s rights. The landlords who charge market rents and require deposits are doing what landlords ordinarily do. The employers who pay sales clerks modest wages, the zoning boards that limit apartment construction near job centers, the countless home buyers and renters whose choices shape the housing market—each acts within the rules, and most act without any ill will toward Sandy or anyone like her. There is no villain with evil intentions in this story. And yet something has gone seriously wrong. Sandy faces a grave harm—housing insecurity verging on homelessness—that she did nothing to deserve and that she is nearly powerless to avoid.

This is the phenomenon that Young calls “structural injustice.” Here is a paraphrased version of her definition: structural injustice exists when social processes put large groups of persons under a systematic threat of domination or deprivation of the means to develop and exercise their capacities, at the same time that these processes enable others to dominate or to acquire a wide range of opportunities.

Young’s definition has three key elements. First, the injustice is produced by social structures—the rules, institutions, norms, markets, and physical arrangements within which individuals act. Second, the injustice results from the combined effects of many individual actions, most of which are lawful and conventionally acceptable. Third, the injustice is systematic rather than episodic: it consists in the position that whole categories of persons occupy, not in a discrete event that befalls one victim at the hands of one wrongdoer.

Notice what this definition does not require. It does not require a perpetrator. It does not require wrongful intent. It does not even require that anyone violate an existing legal or moral rule. Structural injustice is, in Young’s phrase, a “moral wrong distinct from the wrongful action of an individual agent.” This is what distinguishes structural injustice from the more familiar categories of individual wrongdoing on which the law typically focuses.

Young drew on a long tradition of thinking about the justice of institutions. The most important precursor is John Rawls, whose A Theory of Justice famously declared that the “basic structure” of society—its major political, social, and economic institutions—is the “primary subject” of justice. Rawls’s insight was that institutions profoundly shape people’s life prospects from the start, in ways that no individual chooses. Young radicalized this insight: where Rawls focused on the design of formal institutions, Young emphasized that unjust outcomes can emerge from informal norms, market dynamics, and accumulated patterns of private choice—even when the formal institutions are operating as designed. Rawls’s influential work on distributive justice is discussed in Legal Theory Lexicon 049: Distributive Justice.

Later theorists have developed the idea in several directions. The philosopher Sally Haslanger has offered an influential account of what social structures are—roughly, networks of social practices, organized by shared meanings and schemas, that shape and constrain individual choices, frequently below the level of conscious awareness. Charles Mills’s The Racial Contract argues that white supremacy should be understood as a political system in its own right—a structure of formal and informal arrangements that privileges whites and subordinates nonwhites, and that persists even when its individual beneficiaries neither intend nor acknowledge it. And scholars working in the tradition of critical race theory have long argued that racism in the United States is best understood structurally: not primarily as a matter of individual prejudice, but as a self-reproducing system of institutional arrangements, residential patterns, and wealth disparities that perpetuates racial hierarchy even in the absence of intentional discrimination. The common thread is the claim that individuals acting innocently within unjust structures can collectively sustain grave injustice.

Structural Injustice and the Law: Intent and Impact in Antidiscrimination Law

If structural injustice is real, what should the law do about it? The place where this question comes into sharpest focus is constitutional antidiscrimination law—and specifically, the divide between discriminatory intent and discriminatory impact.

Start with the doctrine. In Washington v. Davis (1976), Black applicants to the District of Columbia police department challenged a written verbal-skills test, Test 21, that Black applicants failed at roughly four times the rate of white applicants. The Supreme Court held that the disparate racial impact of the test was not enough to establish a violation of the constitutional guarantee of equal protection. To prevail on a constitutional claim, plaintiffs must show discriminatory purpose—that the government adopted the challenged policy at least in part because of, and not merely in spite of, its adverse effects on a racial group, as the Court later put it in Personnel Administrator v. Feeney. The Court reasoned that a contrary rule would call into question a vast range of tax, welfare, regulatory, and licensing statutes that burden some racial groups more than others.

Now look at the doctrine through the lens of structural injustice theory. The intent requirement presupposes what we might call the perpetrator model of discrimination: injustice is something that identifiable wrongdoers do to identifiable victims, with a culpable state of mind. The theory of structural injustice denies that this model captures the whole territory. Racially disparate outcomes in policing, employment, housing, and education can be produced—and reproduced—by the interaction of facially neutral policies, private choices, and historical patterns, without any current decisionmaker acting from racial animus. On the structural account, Washington v. Davis does not merely set a high evidentiary bar; it defines an entire category of injustice out of constitutional existence. The wrong that structural injustice theory identifies is precisely the wrong that the intent requirement renders invisible.

Statutory law tells a more complicated story. Title VII of the Civil Rights Act of 1964, as interpreted in Griggs v. Duke Power Co. (1971), permits disparate impact claims in employment: a facially neutral practice that disproportionately excludes members of a protected group is unlawful unless the employer can show that the practice is job-related and consistent with business necessity. The Fair Housing Act, as construed in Texas Department of Housing & Community Affairs v. Inclusive Communities Project (2015), likewise recognizes disparate impact liability, though with significant limits designed to protect defendants from liability for disparities they did not cause. Disparate impact doctrine can be understood as a partial legal recognition of structural injustice: it targets the effects of practices rather than the mental states of actors. But the recognition is partial indeed. Disparate impact liability is a creature of statute, confined to particular domains; the constitutional baseline remains intent.

One more wrinkle deserves mention. In Ricci v. DeStefano (2009), the Supreme Court held that an employer’s race-conscious effort to avoid disparate impact liability can itself constitute disparate treatment. And some Justices have suggested that disparate impact liability may be in tension with the constitutional guarantee of equal protection itself. From the perspective of structural injustice theory, this is the deep irony: the legal tools designed to address structural injustice are themselves under pressure from the individualist, intent-centered picture of discrimination that the theory challenges.

Structural Injustice and the Law: Mass Incarceration

The second legal application is mass incarceration. The United States imprisons a larger share of its population than almost any other nation, and the burden of imprisonment falls with dramatic disproportion on Black Americans and on the poor. Mass incarceration is now a standard example in the philosophical literature on structural injustice—and it is easy to see why. No legislature ever enacted a statute entitled “An Act to Imprison Two Million People.” No single decisionmaker chose the outcome. Instead, mass incarceration emerged over decades from the interaction of many decisions by many actors: legislators who lengthened sentences and multiplied offenses, prosecutors who exercised charging discretion and leveraged plea bargaining, police departments that concentrated enforcement in particular neighborhoods, judges who applied sentencing guidelines, parole boards, probation officers, and voters who rewarded officials for being tough on crime. Each actor operated within the rules. Most acted in good faith. The aggregate result is a carceral system of historically unprecedented scale.

Mass incarceration thus displays the signature features of structural injustice. The harm is systematic rather than episodic: it consists in the position occupied by whole categories of persons—disproportionately poor and disproportionately Black—rather than in discrete wrongs inflicted by particular wrongdoers. The causes are dispersed across institutions and accumulated over time. And the injustice is resilient: because no single actor controls the system, no single actor can fix it. Reforming police practices does nothing about sentencing law; reforming sentencing law does nothing about prosecutorial discretion; and so on. This resilience is exactly what Young’s account predicts. Structural injustices persist because they are produced by structures, not by decisions that can simply be reversed.

The structural lens also illuminates a famous—some would say infamous—case. In McCleskey v. Kemp (1987), the Supreme Court confronted the Baldus study, a sophisticated statistical analysis showing that Georgia defendants charged with killing white victims were substantially more likely to receive the death penalty than defendants charged with killing Black victims. The Court assumed the study’s validity but rejected McCleskey’s equal protection claim: he had not shown that the decisionmakers in his own case acted with discriminatory purpose. McCleskey is Washington v. Davis transposed into the criminal justice system—and with the stakes raised to life and death. Systemic racial disparity, however well documented, does not state a constitutional claim; only individualized intent will do. For structural injustice theorists, McCleskey is the canonical illustration of a legal system that can see individual wrongdoing but cannot see structure.

Philosophers of punishment have developed these themes. Tommie Shelby has argued that the injustice of the carceral system cannot be evaluated apart from the background structural injustices—concentrated poverty, segregated neighborhoods, unequal schooling—that shape the lives of those the system punishes. Erin Kelly has argued that the criminal law’s intense focus on individual blame obscures the social conditions that produce crime and distorts our collective response to it. And Michelle Alexander’s The New Jim Crow, the most influential popular treatment, argues that mass incarceration functions as a system of racialized social control—a structural successor to slavery and Jim Crow—that operates largely through race-neutral rules. One need not accept every element of these accounts to see the common structural claim: the injustice of mass incarceration resides in the system as a whole, and it cannot be captured by asking whether any particular official has behaved culpably.

What follows for law? Here the structural account generates a hard question rather than an easy answer. If the injustice is structural, then the standard legal remedies—which run against individual defendants who commit individual wrongs—are systematically mismatched to the problem. Structural reform would require coordinated change across sentencing law, prosecutorial practice, policing, and the social conditions that lie upstream of the criminal justice system. Whether courts are capable of that kind of reform, and whether they may legitimately attempt it, are questions that lead directly to debates about institutional competence and the counter-majoritarian difficulty—topics for another day and other Lexicon entries.

Critiques

The theory of structural injustice has attracted criticism. Two objections are especially prominent.

The first is the agency objection. If responsibility for injustice is diffused across structures and shared by everyone who participates in them, does anyone really bear responsibility at all? Martha Nussbaum pressed a version of this objection against Young: by directing attention away from individual wrongdoing, the structural account risks letting genuine wrongdoers off the hook. Some injustices—including some that look structural—are in fact produced or sustained by identifiable actors who deceive, exploit, and discriminate, and who deserve blame in the ordinary way. Young’s reply was that her account is additive, not substitutive: the social connection model supplements the liability model rather than replacing it. Individual wrongdoers remain blameworthy for their wrongs; the point is that structural injustice can exist even where no such wrongdoers can be found. Whether this division of labor can be maintained in practice—whether structural explanations tend, as a matter of psychological and political fact, to crowd out judgments of individual responsibility—remains contested.

The second is the determinacy objection. What exactly is a “structure”? The term sweeps in institutions, rules, norms, markets, physical arrangements, and accumulated patterns of behavior—which is to say, nearly everything. A concept that explains every bad outcome may explain none of them; if all injustice is structural, the label does no analytical work. And the practical worry follows closely behind: a theory that assigns responsibility to everyone in general may assign it to no one in particular, yielding no determinate guidance about who must do what. Defenders of the structural account respond that the concept can be made rigorous—Haslanger’s work on social practices and schemas is one sustained attempt—and that the demandingness of the theory’s practical implications is a feature of our situation, not a defect of the theory. But the objection identifies a genuine cost: the more the concept of structure expands, the less it discriminates.

Conclusion

The idea of structural injustice names something real: grave harms that emerge from the ordinary workings of social structures, without villains, without culpable intent, and without discrete wrongful acts. Young’s account gives the idea philosophical precision, and the legal applications show why it matters for law. A legal system built around the perpetrator model—individual defendants, individual wrongs, individual mental states—will have difficulty even perceiving structural injustice, much less remedying it. The intent requirement of Washington v. Davis and the individualized showing demanded by McCleskey v. Kemp are the doctrinal expressions of that difficulty.

For law students, the concept is worth mastering for two reasons. The first is diagnostic: once you have the idea of structural injustice, you will begin to notice how much of legal doctrine presupposes the perpetrator model—and to ask whether that presupposition is justified. The second is critical: the theory of structural injustice raises deep questions about the limits of law itself. If some of the gravest injustices are structural, and if legal remedies are built for individual wrongs, then the pursuit of justice may require institutions and forms of collective action that go beyond adjudication. Whether that is a limitation of law or a division of labor between law and politics is a question worth carrying with you through law school—and beyond.

Related Lexicon Entries
Bibliography

Alexander, Michelle. The New Jim Crow: Mass Incarceration in the Age of Colorblindness. New York: The New Press, 2010; 10th anniversary ed., 2020.

Haslanger, Sally. Resisting Reality: Social Construction and Social Critique. New York: Oxford University Press, 2012.

Haslanger, Sally. What Is a (Social) Structural Explanation?, 173 Philosophical Studies 113 (2016).

Kelly, Erin I. The Limits of Blame: Rethinking Punishment and Responsibility. Cambridge, MA: Harvard University Press, 2018.

Lawrence, Charles R., III. The Id, the Ego, and Equal Protection: Reckoning with Unconscious Racism, 39 Stan. L. Rev. 317 (1987).

McKeown, Maeve. With Power Comes Responsibility: The Politics of Structural Injustice. London: Bloomsbury, 2024.

Mills, Charles W. The Racial Contract. Ithaca, NY: Cornell University Press, 1997; 25th anniversary ed., 2022.

Nussbaum, Martha C. Foreword to Iris Marion Young, Responsibility for Justice, ix–xxv. New York: Oxford University Press, 2011.

Powers, Madison, and Ruth Faden. Structural Injustice: Power, Advantage, and Human Rights. New York: Oxford University Press, 2019.

Rawls, John. A Theory of Justice. Cambridge, MA: Harvard University Press, 1971; rev. ed., 1999.

Shelby, Tommie. Dark Ghettos: Injustice, Dissent, and Reform. Cambridge, MA: Harvard University Press, 2016.

Young, Iris Marion. Justice and the Politics of Difference. Princeton: Princeton University Press, 1990.

Young, Iris Marion. Responsibility and Global Justice: A Social Connection Model, 23 Social Philosophy & Policy 102 (2006).

Young, Iris Marion. Responsibility for Justice. New York: Oxford University Press, 2011.

Created July 12, 2026.

Link to the Most Recent Version of this Lexicon Entry

Legal Theory Lexicon 119: Structural Injustice

New and revised Legal Theory Lexicon entries are posted on Sunday at the Legal Theory Blog. If you would like to receive Legal Theory Blog content via email, please subscribe at https://lsolum.substack.com/subscribe.
July 4, 2026

Legal Theory Lexicon 118: Criminal Law Theory
Introduction

Law students almost always encounter criminal law in their first year, frequently in the first semester. The traditional course in criminal law introduces students to the way the law approaches deep moral questions. What justifies the state’s deliberate imposition of suffering on offenders? What conduct should be criminalized? The criminal law does not (and should not) punish everything that is wrong or harmful — so what are the limits? What are the general conditions of criminal liability? Before anyone can be punished, we need an account of what must be proved: a voluntary act, a culpable state of mind, and the absence of justification or excuse.

These questions correspond to the three main divisions of the field: the theory of punishment, the theory of criminalization, and the theory of what criminal lawyers call the “general part.” This Lexicon entry provides an introduction to the central questions of criminal law theory. As always, the Lexicon is aimed at law students, especially first-year law students, with an interest in legal theory.

Theories of Punishment

The justification of punishment is the oldest and deepest question in criminal law theory, and the debate is organized around a fundamental divide: does punishment look backward or forward? This divide is a special case of the distinction between ex post and ex ante perspectives, explored in a prior Lexicon entry (Legal Theory Lexicon 001: Ex Ante & Ex Post). Backward-looking theories such as retributivism justify punishment by reference to the offender’s past wrongdoing — the offender deserves to be punished because of what she did. Forward-looking theories such as deterrence justify punishment by reference to its future consequences — punishment is justified because it prevents crime and thereby produces good outcomes. This divide corresponds to the two great traditions in moral philosophy: deontology on the backward-looking side and consequentialism on the forward-looking side. Both traditions are explored in prior Lexicon entries (Legal Theory Lexicon 008: Utilitarianism and Legal Theory Lexicon 010: Deontology).

The most prominent backward-looking theory is called “retributivism.” The core retributivist idea is that punishment is justified because offenders deserve it: wrongdoing creates a moral desert basis for punishment, and the state acts permissibly (perhaps even obligatorily) when it gives offenders what they deserve. Immanuel Kant is the classic historical source — Kant famously insisted that even a society about to dissolve itself must first execute its last murderer, so that everyone receives what their deeds deserve. The leading contemporary retributivist is Michael Moore, whose book Placing Blame offers a sustained defense of the view that desert is not merely a necessary condition for punishment but a sufficient one: the state should punish culpable wrongdoers because they deserve it, full stop. Notice what retributivism does not say. Retributivism is not the view that punishment satisfies the victim’s desire for revenge, and it is not committed to harsh punishment — the retributivist principle of proportionality (punishment must fit the crime) can condemn excessive sentences just as forcefully as it condemns lenient ones.

The forward-looking alternatives to retributivism are consequentialist theories, and they usually focus on the idea of deterrence. Bentham’s classic formulation captures the structure: punishment is itself an evil — it inflicts suffering — and so it can be justified only if it prevents greater evils. The most familiar preventive mechanism is deterrence: the threat of punishment gives potential offenders a prudential reason to comply with the law, and punishing actual offenders makes the threat credible. But deterrence is not the only forward-looking rationale. Incapacitation prevents crime by physically restraining offenders — a burglar in prison cannot burgle. And rehabilitation aims to change offenders so that they no longer want to offend. All three rationales share the consequentialist structure: punishment is justified by the good consequences it produces. And that structure generates the standard objection. If good consequences justify punishment, then the theory seems to permit punishing the innocent whenever doing so would deter others — framing a scapegoat, for example, to quell a crime wave. Consequentialists have responses, but the scapegoat objection has convinced many theorists that consequences alone cannot be the whole story.

The scapegoat objection helps to explain the enduring appeal of mixed theories, which combine backward-looking and forward-looking elements. The most influential mixed theory belongs to H.L.A. Hart. In his famous essay “Prolegomenon to the Principles of Punishment,” Hart distinguished the “general justifying aim” of punishment from the principles governing its “distribution.” The general justifying aim of the institution of punishment, Hart argued, is forward-looking: we have systems of criminal punishment because they prevent crime. But the distribution of punishment is constrained by backward-looking principles: only the guilty may be punished, and only in proportion to their culpability. Desert operates as a side constraint on the pursuit of good consequences. Hart’s move — asking different questions about the institution as a whole and about its application to individuals — remains the standard framework for organizing the punishment debate.

A fourth family of theories emphasizes the meaning of punishment rather than its consequences or the offender’s desert. Joel Feinberg’s classic essay “The Expressive Function of Punishment” observed that punishment differs from a mere penalty (a parking fine, a tax) because punishment communicates a message of condemnation. Expressive theories build on this insight, arguing that condemnation is part of what justifies punishment. The most developed version is R.A. Duff’s communicative theory. For Duff, punishment is a communicative enterprise between the political community and the offender: it aims to communicate to offenders the censure they deserve and to persuade them to repent, reform, and reconcile with the community. Duff’s theory is backward-looking in its foundation (censure must be deserved) but aspirational in its aims — and it treats offenders as responsible members of the community, not as objects to be managed.

Finally, some theorists challenge the entire framework. Restorative justice theorists argue that the central question should not be “how do we justify punishment?” but “how do we repair the harm?” — and they favor practices (victim-offender mediation, restitution, community conferencing) that address crime without the deliberate infliction of suffering. Prison abolitionists press further, arguing that the institution of imprisonment is so unjust in practice that it cannot be reformed and should be dismantled. These views remain minority positions in the theoretical literature, but they have grown in influence, and they perform an important function even for those who reject them: they force defenders of punishment to shoulder the justificatory burden rather than resting on the familiarity of existing institutions.

Theories of Criminalization

Suppose we have a justification for punishment. A second question immediately follows: what conduct may the state punish? Murder, obviously. But what about drug possession? Gambling? Blasphemy? Failing to rescue a drowning stranger? The theory of criminalization asks what the limits of the criminal law are — and whether there is any conduct that the state simply may not criminalize, no matter how many people want it criminalized.

The classic starting point is John Stuart Mill’s harm principle. In On Liberty, Mill argued that the only purpose for which power can rightfully be exercised over a member of a civilized community, against his will, is to prevent harm to others. On this view, the fact that conduct is immoral, offensive, or harmful to the actor herself is simply not a reason for criminalization. The most influential development of this liberal tradition is Joel Feinberg’s four-volume work, The Moral Limits of the Criminal Law. Feinberg distinguished four “liberty-limiting principles” — four kinds of reasons that might be offered in favor of criminalization. The first is the harm principle: conduct may be criminalized because it harms others. The second is the offense principle: conduct may be criminalized because it seriously offends others — think of public indecency, which may harm no one but is nonetheless prohibited. The third is legal paternalism: conduct may be criminalized because it harms the actor herself — seatbelt laws and drug prohibitions are often defended this way. The fourth is legal moralism: conduct may be criminalized simply because it is immoral, even if it harms and offends no one. Feinberg accepted the first two principles (with important qualifications) and rejected the last two. His four-part framework is enormously useful: almost any debate about criminalization — of drugs, of sex work, of hate speech — can be organized by asking which liberty-limiting principle is doing the work.

Legal moralism, however, has its defenders. Michael Moore — whose retributivism we have already encountered — argues that the criminal law’s function is to punish moral wrongdoing as such; for Moore, the immorality of conduct is always a reason (though not necessarily a conclusive reason) for criminalizing it, and countervailing considerations like privacy and liberty do the work of limiting the criminal law’s reach. R.A. Duff defends a more moderate version: on Duff’s view, criminalization is appropriate only for “public wrongs” — wrongs that properly concern the political community as a whole, not merely the individuals involved. The debate between liberal and moralist theories of criminalization remains one of the liveliest in the field.

Whatever one’s theory, there is widespread agreement that contemporary American law suffers from overcriminalization. Douglas Husak’s book of that title documents the phenomenon: thousands of federal crimes (no one knows the exact number), vast bodies of regulatory offenses carrying criminal penalties, and criminal statutes so broad that prosecutors effectively decide who goes to prison. Husak argues that criminalization requires justification because punishment does — every criminal statute is a standing threat to deprive citizens of liberty, and the state needs a good reason for each one. Overcriminalization also connects the theory of criminalization back to the theory of punishment: if most criminal cases are resolved by plea bargaining in the shadow of overlapping statutes, the theorist must ask whether the practice of American criminal justice can be justified by any of the theories of punishment on offer.

Critical theories of criminal law approach the criminalization question from a different direction. Rather than asking which liberty-limiting principles justify criminalization in the abstract, critical theorists ask how criminalization decisions actually operate — and their answer is that the criminal law’s reach has been shaped by race and class hierarchy. The historical record supplies the evidence: vagrancy laws, the differential treatment of crack and powder cocaine, and the concentration of drug enforcement in poor communities of color are standard examples. Abolitionist theorists draw a further conclusion: the question is not which conduct deserves criminalization but whether criminalization should be the state’s default response to social problems at all — and they argue for shifting resources from policing and prisons toward housing, health care, and education as the primary means of producing safety. For the theory of criminalization, the critical challenge is important even for those who reject its conclusions, because it insists that a theory of what may be criminalized in principle must reckon with how criminalization works in practice.

The General Part

Criminal lawyers distinguish the “special part” of the criminal law — the definitions of particular offenses like murder, burglary, and theft — from the “general part”: the doctrines that apply across all offenses. The general part includes the requirement of a voluntary act, the requirement of a culpable mental state, and the defenses of justification and excuse. Each of these doctrines raises deep theoretical questions, because each embodies a view about the conditions under which persons are responsible for wrongdoing.

Start with the act requirement — what criminal lawyers call the “actus reus.” The criminal law does not punish thoughts, character, or status; it punishes acts. Why? One answer is practical — thoughts are hard to prove. But the deeper answers are theoretical: punishing thoughts would be an intolerable intrusion on liberty, and mere thoughts, unlike acts, do not wrong anyone. Michael Moore’s book Act and Crime offers the most sustained philosophical treatment, defending the view that the act requirement rests on a theory of human action: acts are willed bodily movements, and only through action does an agent’s practical reason engage the world in a way that can constitute wrongdoing. The standard complication is liability for omissions. The criminal law generally imposes no duty to rescue — the passerby who watches a stranger drown commits no crime — but it does punish omissions when there is a legal duty to act, as when a parent fails to feed a child. Whether the act requirement and omission liability can be reconciled within a single theory of criminal conduct remains a contested question.

Next, the mens rea requirement — the actus reus must be accompanied by a culpable mental state. The common law’s mens rea vocabulary was notoriously chaotic, and the Model Penal Code’s great achievement was to reduce it to four levels of culpability: purpose, knowledge, recklessness, and negligence. Most students encounter this hierarchy as doctrine, but it rests on a theory: culpability tracks the actor’s practical attitude toward the interests of others. The purposeful wrongdoer aims at harm; the knowing wrongdoer accepts it as a certainty; the reckless wrongdoer consciously disregards a substantial risk; the negligent wrongdoer fails to perceive a risk that a reasonable person would have perceived. The theoretical controversies cluster at the bottom of the hierarchy. Negligence liability is contested because the negligent actor, by definition, was unaware of the risk — and some theorists, most prominently Larry Alexander and Kimberly Kessler Ferzan, argue that culpability requires a conscious choice: on their view, recklessness is the core of culpability, and the negligent actor, who chose nothing, deserves no punishment at all. Strict liability — criminal liability without any culpable mental state at all — is more contested still: on most theories of punishment, and certainly on retributivist theories, punishing the blameless is simply unjust. Yet strict liability offenses persist, especially in the regulatory sphere, and their persistence is one of the standing puzzles of criminal law theory.

The general part also includes the defenses, and here the key theoretical distinction is between justification and excuse. A justified actor did the right thing (or at least a permissible thing) in the circumstances: the classic example is self-defense, where the defender’s use of force is not wrongful at all. An excused actor did something wrongful but is not blameworthy for doing it: insanity and duress are the standard examples. The distinction matters theoretically because it separates two different questions — was the conduct wrong? and is the actor responsible? — that the verdict “not guilty” otherwise runs together. The distinction can also matter practically: third parties may assist justified conduct but not excused conduct, and justifications generalize to anyone in the same circumstances while excuses are personal to the actor.

Finally, consider a puzzle that connects the general part back to the theory of punishment: the problem of moral luck. Suppose two drivers text behind the wheel; one arrives home safely, the other kills a pedestrian. Their conduct and culpability are identical — the difference between them is pure luck. Yet the law punishes the killer severely, while the lucky driver might be charged with a minor offense for texting or get off scot free. The same puzzle appears in the law of attempts: the assassin whose shot misses is typically punished less than the assassin whose shot hits, though the two are equally culpable. Should outcomes matter to punishment, or only culpability? Subjectivists — Alexander and Ferzan again — say culpability is everything and the law’s harm-based grading is an indefensible concession to primitive intuitions; objectivists reply that results matter morally — that causing harm is a different (and worse) thing than risking it. Michael Moore, in Causation and Responsibility, defends the objectivist position: causing harm increases blameworthiness, and the law’s differential treatment of completed crimes and attempts is therefore no embarrassment. The moral luck debate is ongoing and complex, and it is a fine example of how a doctrinal detail — the grading of attempts — opens onto some of the hardest questions in moral philosophy.

Conclusion

Criminal law theory begins with a question that the first-year course mostly takes for granted: what entitles the state to punish? From that question, the field radiates outward — to the limits of criminalization, and to the general conditions of responsibility that the doctrines of actus reus, mens rea, justification, and excuse embody. The three bodies of theory are connected. A retributivist about punishment will be drawn toward moralism about criminalization and toward culpability-centered views of the general part; a consequentialist will see deterrence, and not desert, doing the work at every level. R.A. Duff’s work displays the connections especially clearly: his communicative theory of punishment, his account of crimes as public wrongs, and his theory of criminal responsibility as answerability to the political community are all pieces of a single, unified vision of criminal law. One of the pleasures of criminal law theory is discovering these connections — and finding that a position taken on the first day of class, about why we punish at all, has consequences that reach into every corner of the course. I hope this entry provides a useful starting point for your own exploration of these questions.

Related Lexicon Entries
Bibliography

Larry Alexander & Kimberly Kessler Ferzan (with Stephen J. Morse), Crime and Culpability: A Theory of Criminal Law (Cambridge University Press 2009).

Jeremy Bentham, An Introduction to the Principles of Morals and Legislation (1789) (Dover ed. 2007).

R.A. Duff, Answering for Crime: Responsibility and Liability in the Criminal Law (Hart Publishing 2007).

R.A. Duff, Punishment, Communication, and Community (Oxford University Press 2001).

Joel Feinberg, The Expressive Function of Punishment, 49 The Monist 397 (1965).

Joel Feinberg, Harm to Others: The Moral Limits of the Criminal Law, Volume One (Oxford University Press 1984).

H.L.A. Hart, Punishment and Responsibility: Essays in the Philosophy of Law (2d ed., Oxford University Press 2008).

Douglas Husak, Overcriminalization: The Limits of the Criminal Law (Oxford University Press 2008).

Allegra M. McLeod, Prison Abolition and Grounded Justice, 62 UCLA L. Rev. 1156 (2015).

John Stuart Mill, On Liberty (1859) (Hackett ed. 1978).

Michael S. Moore, Act and Crime: The Philosophy of Action and Its Implications for Criminal Law (Oxford University Press 1993).

Michael S. Moore, Causation and Responsibility: An Essay in Law, Morals, and Metaphysics (Oxford University Press 2009).

Michael S. Moore, Placing Blame: A Theory of the Criminal Law (Oxford University Press 1997).

Thomas Nagel, “Moral Luck,” in Mortal Questions (Cambridge University Press 1979).

Dorothy E. Roberts, Foreword: Abolition Constitutionalism, 133 Harv. L. Rev. 1 (2019).

(This entry was created on July 5, 2026.)

Link to the Most Recent Version of this Lexicon Entry

Legal Theory Lexicon 118: Criminal Law Theory

Link to the Legal Theory Stack

The Legal Theory Stack informs readers of a new or revised Legal Theory Lexicon entry every Sunday along with summaries of the Download of the Week and the Legal Theory Bookworm recommendation. Subscribe to the Legal Theory Stack here.
June 27, 2026

Legal Theory Lexicon 117: Constitutional Theory
Introduction

Law students at different law schools encounter the course in constitutional law at different points in their legal education. At some law schools constitutional law is a second-semester first-year course, but conlaw is sometimes a first-semester 1L course and at a significant number of law schools, the course is an upper-division elective or requirement. Even if you do not take constitutional law in your first year, you are almost certain to encounter constitutional issues in other courses. Personal jurisdiction is a standard topic in civil procedure, and most of the personal jurisdiction cases involve interpretation of the Due Process Clauses of the Fifth and Fourteenth Amendments. Property courses frequently cover the Takings Clause of the Fifth Amendment, and the torts course might include the First Amendment limitations on the defamation torts via the Supreme Court’s decision in New York Times Co. v. Sullivan. So, constitutional doctrine is pervasive in the first year and throughout law school. Constitutional theory addresses a different set of questions, including: what justifies the constitution, which institutions should have the power to decide constitutional questions, and how should constitutional actors go about deciding what the Constitution means?

These questions sit just beneath the surface of the cases you read. When a court asks whether the Constitution should be interpreted according to its original meaning or its evolving meaning, that is a question of constitutional theory. When commentators argue about whether unelected judges should have the power to strike down laws passed by an elected legislature, that too is constitutional theory. So too is the question of why the Constitution binds us at all, given that it was ratified by an unrepresentative group of people who are now long dead. The cases assume answers to these questions; constitutional theory makes the questions explicit and examines the answers.

This entry provides a map. It begins with the idea of a constitution and the basic distinction between provisions that allocate government power and provisions that protect individual rights. It then offers a short history of American constitutional theory, introducing the major ideas as they emerged. Next it takes up the central interpretive debate between originalism and living constitutionalism. Finally, it offers a brief introduction to comparative constitutional theory, which reminds us that the American constitutional arrangement is one option among many rather than the only way a constitutional democracy might be designed.

Constitutional theory is covered extensively in various Legal Theory Lexicon entries. This entry aims to provide a synoptic overview that will give students taking the conlaw course a sense of the big picture and major themes in constitutional theory.

The Idea of a Constitution

What is a constitution? At the most basic level, a constitution is the framework that constitutes a government — it creates the institutions of the state, specifies how they are composed, and defines the powers they may exercise. The word itself carries this meaning: to constitute something is to bring it into being. The United States Constitution creates a Congress, a President, and a system of federal courts, and it specifies how each is selected and what each may do. A constitution in this sense is foundational. It is the law that stands behind ordinary law, the framework within which legislatures legislate and courts decide cases and controversies.

Constitutions typically do more than create institutions. They also limit what those institutions may do. This is the idea of constitutionalism — the principle that government power should be limited by law, and that even the highest officials are bound by rules they cannot change at will. A government with unlimited power is not a constitutional government, even if it has a document called a constitution.

Constitutions usually secure these limits through entrenchment, which means they are harder to change than ordinary legislation. An ordinary statute can be repealed by a later statute, but the United States Constitution can be amended only through the demanding process of Article V, which usually requires two-thirds supermajorities in Congress and ratification by three-quarters of the states. Entrenchment is what allows a constitution to bind the future, placing certain commitments beyond the reach of today’s legislative majority.

It is common to distinguish written from unwritten constitutions. The United States has a written constitution — a single canonical document, supplemented by its amendments. The United Kingdom is the standard example of an unwritten constitution: it has no single founding document, and its constitutional arrangements are found in statutes, judicial decisions, and longstanding conventions. The distinction is real but can be overstated. Even a written constitution like the American one is surrounded by a vast body of unwritten understandings, and even an unwritten constitution like the British one includes important written sources. What matters is less the existence of a single document than the existence of fundamental norms that structure and constrain government.

Finally, it is worth distinguishing the Constitution (the written document) from constitutional law. The Constitution is a text. Constitutional law is the body of doctrine that implements the text, mostly through judicial interpretation but also through constitutional practices of Congress and the executive branch. Most of what you study in a constitutional law course is not explicit in the text of the Constitution at all: the tiers of scrutiny, the categories of unprotected speech, and the tests for state action are doctrinal constructs that courts have developed over time.

The identification of the Constitution with constitutional law can be misleading for another reason: much of the constitution operates outside the courts. Many constitutional questions are never litigated and are settled instead by the political branches: whether to go to war, how the houses of Congress organize their proceedings, and what the Senate’s role of “advice and consent” requires are largely worked out by Congress and the President rather than by judges. Some constitutional norms are what scholars call underenforced — the courts decline to enforce them to their full conceptual limits, often because the judiciary regards the question as better suited to the political process, leaving the political branches to honor the norm in the first instance. Doctrines that require deference to Congress may result in constitutional norms that are underenforced or not enforced at all. And constitutional practice is shaped by longstanding conventions and traditions that courts do not enforce but that constitutional actors treat as binding. The constitution, in short, is not only what courts say it is; it is also a framework that legislators, executives, and citizens interpret and apply for themselves.

Powers and Rights

The provisions of the Constitution can be divided into two types (powers and rights), and learning to tell them apart is one of the most useful things a student can do early on. Some provisions allocate government power — they say who gets to do what. Others protect individual rights — they say what government may not do to persons or citizens. Recognizing which kind of provision is at stake clarifies what the case is really about.

Start with the power-allocating provisions. These come in two main varieties. The first concerns federalism — the division of authority between the national government and the states. Article I grants legislative powers to Congress, and it does so by enumeration: Congress may regulate interstate commerce, coin money, establish post offices, and exercise the other powers the Article lists. The premise of enumeration is that Congress has only the powers granted to it, not a general power to legislate on any subject. The Tenth Amendment states the other side of this arrangement, providing that the powers not delegated to the United States are reserved to the states or to the people. Federalism questions — whether a given subject is for Congress or for the states — are questions about this division of authority.

The second kind of power-allocating provision concerns the separation of powers — the division of authority among the branches of the national government. Here the key provisions are the vesting clauses that open the first three Articles. Article I vests the legislative power in Congress, Article II vests the executive power in the President, and Article III vests the judicial power in the federal courts. These clauses distribute the basic functions of government among three distinct branches, and separation-of-powers questions — whether some action belongs to the legislature, the executive, or the judiciary — are questions about this distribution.

Now turn to the individual rights provisions. These do not allocate power; they limit it. These provisions identify things the government may not do to persons, or interests it may not invade without sufficient justification. The First Amendment is a familiar example: it protects the freedoms of speech, press, religion, and assembly against governmental abridgment. The Fourteenth Amendment is another: its Due Process Clause protects life, liberty, and property against deprivation without due process of law, and its Equal Protection Clause forbids the states from denying any person the equal protection of the laws. The two amendments illustrate a feature of the Constitution’s rights provisions worth noticing at the outset: some originally limited only the federal government and others limit the states. The First Amendment by its terms restrains Congress, while the Fourteenth Amendment expressly restrains the states — though much of the Bill of Rights, including the First Amendment, has come to apply against the states as well through the doctrine of incorporation, which we take up below. When a court asks whether a law violates someone’s constitutional rights, it is working with provisions of this second kind.

Although almost all of the United States Constitution regulates government, there are exceptions. The most important of these is the Thirteenth Amendment, which provides: “Neither slavery nor involuntary servitude, except as a punishment for crime whereof the party shall have been duly convicted, shall exist within the United States, or any place subject to their jurisdiction.” This provision applies to private individuals: if an ordinary citizen enslaves another person, their action is unconstitutional.

A Short History of American Constitutional Theory

Constitutional theory in America has a history, and the major ideas are easier to understand when you see the problems they were responses to. What follows is a very brief and simplified chronological sketch, introducing the central theoretical concepts as they emerged.

The Founding and Ratification. American constitutional theory begins with the debate over whether to ratify the Constitution. The Federalists, whose most famous arguments appear in The Federalist, defended the proposed Constitution; the Anti-Federalists opposed it or sought amendments to protect liberty and the states. The most celebrated contribution from this period is Madison’s Federalist No. 10, which addresses the problem of faction. By “faction,” Madison meant a group of citizens united by some common interest or passion adverse to the rights of others or to the interests of the community as a whole — an idea closely related to the modern phrase “special interest group.” Madison argued that the cure for the dangers of faction lay in the very size and diversity of the extended republic the Constitution would create: in a large republic, the multiplicity of interests would make it harder for any single faction to form a tyrannical majority. In addition, the Constitution creates horizontal separation of powers and checks and balances at the national level, dividing power between the Congress, the President, and the judiciary. The Constitution also divides power between the federal government and the states. Federalist No. 10 introduced enduring themes of American constitutional theory — the danger of majority faction, the design of institutions to channel and check self-interest, and the defense of the extended commercial republic — and it remains among the most studied texts in all of American political thought.

Marbury v. Madison and Judicial Review. In 1803, Marbury v. Madison is conventionally said to have established that the federal courts have the power to declare statutes unconstitutional. That familiar framing, however, deserves two qualifications. First, the very phrase “judicial review” and the idea of it as a distinct judicial “power” were not introduced until the late nineteenth century; the Marshall Court did not understand itself to be claiming a special power by that name. As Philip Hamburger has argued, what Marbury reflects is better understood as the ordinary judicial duty to decide cases according to law: when a statute conflicts with the Constitution, which is the higher law, the court’s duty is simply to follow the higher law in deciding the case before it. On this view, the court does not exercise a power to strike down legislation so much as discharge its duty to apply the governing law. Second, what we now call the “power of judicial review” was already well established before Marbury. Marbury raised a question that has never gone away: who has the final word on what the Constitution means? One historically important answer is departmentalism — the view that each branch of government interprets the Constitution for itself, so that the courts’ interpretations bind the parties before them but do not necessarily settle constitutional meaning for the other branches. The tension between judicial supremacy and departmentalism runs throughout American constitutional theory.

The Antebellum Period and Dred Scott. Before the Civil War, constitutional theory was preoccupied with slavery and the nature of the union. Dred Scott v. Sandford (1857), in which the Court opined that African Americans could not be citizens and stated in dictum that Congress could not prohibit slavery in the territories, is the great cautionary tale of this period. It illustrates the limits of judicial review as a mechanism for settling the deepest moral and political conflicts: rather than resolving the crisis over slavery, the decision inflamed it and helped precipitate the civil war.

Reconstruction and the Fourteenth Amendment. The Civil War and its aftermath produced what scholars often call the “second founding”: the Thirteenth, Fourteenth, and Fifteenth Amendments, which abolished slavery, guaranteed due process and equal protection against the states, and protected voting rights. The Fourteenth Amendment in particular transformed the constitutional order. It shifted the federal-state balance by subjecting the states to important new federal constraints, and over time it became the vehicle for incorporation — the doctrine through which most of the protections in the Bill of Rights, which had originally limited only the federal government, came to apply against the states as well. Much of contemporary constitutional law derives from Section One of the Fourteenth Amendment, including the Citizenship Clause, the Due Process Clause, and the Equal Protection Clause — as well as the Privileges or Immunities Clause, which was virtually erased by the Supreme Court in the Slaughterhouse Cases.

The Lochner Era and Its Critics. From the 1890s into the 1930s, the Supreme Court used the Due Process Clause to protect economic liberties, most famously in Lochner v. New York (1905), which struck down a maximum-hours law for bakers. Lochner involved what is now called “substantive due process” as the source of implied fundamental rights. Critics charged that the Court was reading its own laissez-faire economic philosophy into the Constitution. The most important theoretical response came earlier from James Bradley Thayer, whose work argued for judicial deference to legislatures: on Thayer’s view, a court should invalidate a statute only when its unconstitutionality is so clear that it is not open to rational question — the “clear mistake” rule. Justice Holmes gave the critique its most memorable expression in his Lochner dissent, objecting that the Constitution does not enact any particular economic theory and that judges should not impose their own. Thayerian deference remains a live position in debates about the proper role of the courts.

The New Deal Settlement and Footnote Four. The conflict between the Court and the elected branches came to a head in the 1930s, when the Court struck down New Deal legislation and President Roosevelt responded with his court-packing plan. After 1937 — the famous “switch in time” — the Court abandoned economic substantive due process and adopted a deferential posture toward economic regulation. But deference raised a question: if courts defer to legislatures, when, if ever, is aggressive judicial review appropriate? The most influential answer appeared in footnote four of United States v. Carolene Products (1938), which suggested that heightened scrutiny might be warranted in three circumstances: when legislation appears to violate a specific constitutional prohibition, when it restricts the political processes that ordinarily protect minorities, and when it reflects prejudice against “discrete and insular minorities.” Footnote four is the seed of representation-reinforcement theory, which we encounter again below in connection with the work of John Hart Ely.

The Warren Court and the Countermajoritarian Difficulty. Under Chief Justice Earl Warren (1953–1969), the Court issued a series of landmark decisions expanding individual rights and equality, beginning with Brown v. Board of Education (1954). The Warren Court’s activism prompted intense theoretical debate about the legitimacy of judicial review in a democracy. Alexander Bickel gave the problem its enduring name: the countermajoritarian difficulty — the worry that when unelected, life-tenured judges strike down laws enacted by elected legislatures, they act against the will of the majority and thus in tension with democratic self-government. The countermajoritarian difficulty became the defining problem of late-twentieth-century constitutional theory, and much subsequent work can be understood as an attempt to answer it. One influential answer was John Hart Ely’s representation-reinforcement theory, articulated in his famous book, Democracy and Distrust. Building on footnote four in Carolene Products, Ely argued that judicial review is democratically legitimate when it polices the channels of political participation and protects minorities from prejudice, rather than imposing the judges’ own values. We return to Ely’s view in the next section.

The Rise of Originalism. Although originalist ideas go back to the founding era, originalism emerged as a distinct theory of constitutional interpretation in the 1970s and 1980s, in significant part as a conservative reaction to the perceived excesses of the Warren and Burger Courts. The early versions emphasized the original intentions of the Framers, and they were offered as a way of constraining judges and answering the countermajoritarian difficulty: if judges are bound by the original meaning of the text, they cannot simply impose their own values. Critics raised powerful objections to original-intent originalism, and originalists responded by shifting their focus from the intentions of the Framers to the original public meaning of the constitutional text — the meaning the words would have had to ordinary readers at the time of enactment. Over the following decades, originalism developed into a sophisticated body of theory, and both originalism and its principal rival, living constitutionalism, became central to contemporary constitutional debate. Those two positions are the subject of the next section.

Originalism and Living Constitutionalism

The central interpretive debate in contemporary American constitutional theory is the debate between originalism and living constitutionalism. The question dividing them is fundamental: when we ask what the Constitution means, are we bound by the meaning the text had when it was adopted, or can that meaning legitimately change over time? Almost every interpretive dispute you will encounter — about the Second Amendment, the Fourteenth Amendment, the scope of executive power — sits somewhere on the terrain mapped by this debate.

Originalism. Originalists hold that the meaning of the constitutional text is fixed at the time each provision is adopted, and that this original meaning is binding on interpreters today. Two ideas are central. The first is the fixation thesis: the meaning of the text was fixed when it was framed and ratified, just as the meaning of any historical document is fixed by the linguistic and contextual facts of its time. The second is the constraint principle: constitutional actors, and judges in particular, ought to be bound by that original meaning. Together, fixation and constraint capture the core originalist commitment — that the Constitution means what it meant, and that its meaning should govern.

Originalism has changed over time. The early originalism of the 1970s and 1980s emphasized original intent — the subjective intentions of the Framers who drafted the document. Critics objected that this approach faced serious difficulties: many framers with many intentions, intentions pitched at different levels of generality, and the puzzle of why the unenacted intentions of particular individuals should bind anyone. In response, most originalists shifted to original public meaning — the meaning the words of the text would have had to an ordinary, reasonable reader at the time of enactment. The object of interpretation on this view is not anyone’s private intentions but the public meaning of the words actually adopted. The most prominent version of originalism is called “Public Meaning Originalism.”

Modern originalist theory also distinguishes interpretation from construction. Interpretation is the activity of discovering the linguistic meaning or communicative content of the text. Construction is the activity of determining the text’s legal effect — of translating its meaning into rules of constitutional law and applying it to particular cases. The distinction matters because the constitutional text is sometimes vague or open-textured, and where it is, interpretation alone may not decide a case; construction is required to fill the gap. This entry treats the interpretation-construction distinction only in passing; it is treated more fully in Legal Theory Lexicon: Interpretation and Construction.

Living Constitutionalism. Living constitutionalists hold, in contrast, that the meaning of the Constitution can legitimately evolve over time, so that the document can be adapted to changing circumstances and values without formal amendment. The animating idea is that a constitution written in the eighteenth century, and difficult to amend, must be capable of growth if it is to govern a society utterly transformed in its technology, economy, and moral understanding. Living constitutionalism is not a single theory but a diverse family of approaches.

The most prominent form of living constitutionalism is constitutional pluralism, which holds that constitutional interpretation and construction properly draw on multiple modalities of argument rather than any single master criterion. On this view, the resources of constitutional practice include the text, historical practice, judicial precedent, constitutional values, and institutional capacities — and constitutional reasoning consists in marshaling and weighing these modalities, none of which is automatically supreme. Constitutional pluralism is treated more fully in Legal Theory Lexicon 100: Constitutional Pluralism.

There are many other forms of living constitutionalism as well, including common-law constitutionalism, associated with David Strauss, which understands constitutional law as developing incrementally through judicial decisions in the manner of the common law, and the moral reading, associated with Ronald Dworkin, which interprets the Constitution’s abstract moral language as stating principles whose content must be worked out through moral judgment.

Constitutional pluralism, common-law constitutionalism, and the moral readings approach are forms of judicial supremacy, but contemporary constitutional theory includes views that allocate the primary authority over constitutional matters to Congress. One such theory is representation-reinforcement, associated with John Hart Ely and introduced in the previous section: judges should not impose their own moral views, but should intervene to keep the channels of political change open and to protect minorities whom the political process has failed. An even more deferential position is found in the revival of Thayerianism among some contemporary progressive scholars, including Samuel Moyn, Ryan Doerfler, Nicholas Bowie, and Daphna Renan, who urge a substantial reduction in the scope of judicial review and a corresponding return of constitutional decisionmaking to the elected branches. These approaches differ significantly, but they share the conviction — or at least the implication — that constitutional actors should not consider themselves bound by the constitutional text.

A Short Introduction to Comparative Constitutional Theory

American constitutional theory is, on the whole, parochial. It tends to theorize from the American case alone — as if the structures, doctrines, and questions familiar from the United States Constitution were the natural or even the only form that constitutionalism can take. Comparative constitutional theory corrects this tendency by attending to how other constitutional democracies have answered the same basic questions, often in strikingly different ways. A few examples convey the value of the comparative perspective.

Consider first the constitutions that protect positive rights. The United States Constitution is predominantly a charter of negative rights — it tells the government what it may not do, but it generally does not oblige the government to provide anything. Other constitutions are different. The Constitution of South Africa is the leading example: it guarantees affirmative entitlements such as access to housing, health care, food, water, and education, and the South African Constitutional Court has developed doctrines for enforcing these guarantees against the government. Positive-rights constitutions raise questions that the American negative-rights tradition tends to keep out of view — about the justiciability of social and economic rights, the role of courts in matters of resource allocation, and the meaning of constitutional equality in conditions of material deprivation.

Consider next the alternatives to American-style judicial review. In the United States, judicial review is typically understood in its “strong” form: when a court holds a statute unconstitutional, that judgment ordinarily settles the matter, and the legislature cannot override it except by the difficult route of constitutional amendment. Other systems have devised arrangements that allocate the last word differently. The notwithstanding clause of the Canadian Charter of Rights and Freedoms — section 33 — permits a legislature to declare that a statute shall operate notwithstanding certain Charter rights, for a renewable period, thereby preserving a legislative check on judicial interpretation. Mechanisms of this kind, sometimes called “weak-form” judicial review, show that judicial review need not take the strong American form, and they reframe the countermajoritarian difficulty as a problem with more than one possible institutional solution.

Consider finally that comparative constitutional theory is not merely a catalog of foreign arrangements but a site of sophisticated theory in its own right. The work of Aileen Kavanagh is a useful example. Drawing primarily on non-American materials, Kavanagh has developed an account of collaborative constitutionalism, which reconceives the protection of constitutional rights as a shared enterprise among courts, legislatures, and executives rather than a contest in which one institution holds final authority over the others. Whether or not one finds the account persuasive, it illustrates that rich constitutional theory can be built on foundations other than the American case — and that doing so can illuminate possibilities the American debate, fixated on the finality of judicial review, tends to neglect. Kavanagh’s work is cited in the bibliography below.

Comparative constitutional law is its own vibrant field of study, and many American law schools have an elective course — but it is very rare for comparative constitutionalism to be introduced in the basic constitutional law course. Constitutional theory from outside the United States is almost entirely ignored in American law school classrooms.

Conclusion

Constitutional theory supplies the questions that lie behind the constitutional cases you read. What is a constitution, and why does it bind us? Which provisions allocate power, and which protect rights? How should the text be interpreted, and who should have the final say? And how do the answers given by American constitutional practice compare with the answers given elsewhere? You need not resolve these questions to study constitutional law, but recognizing them will deepen your understanding of the cases and of the arguments that lawyers and judges make within them. As your study advances, the other entries in this Lexicon that treat particular theories and concepts in greater depth — on originalism, interpretation and construction, the fixation thesis, judicial review, and related topics — will help you go further.

Bibliography

Alexander M. Bickel, The Least Dangerous Branch: The Supreme Court at the Bar of Politics (2d ed. 1986).

Ryan D. Doerfler & Samuel Moyn, Democratizing the Supreme Court, 109 Cal. L. Rev. 1703 (2021).

Ronald Dworkin, Freedom’s Law: The Moral Reading of the American Constitution (1996).

John Hart Ely, Democracy and Distrust: A Theory of Judicial Review (1980).

The Federalist No. 10 (James Madison) (Clinton Rossiter ed., 1961).

Philip Hamburger, Law and Judicial Duty (2008).

Aileen Kavanagh, The Collaborative Constitution (2023).

Lawrence B. Solum, The Fixation Thesis: The Role of Historical Fact in Original Meaning, 91 Notre Dame L. Rev. 1 (2015).

Lawrence B. Solum, Original Public Meaning, 2023 Mich. St. L. Rev. 807.

David A. Strauss, The Living Constitution (2010).

James B. Thayer, The Origin and Scope of the American Doctrine of Constitutional Law, 7 Harv. L. Rev. 129 (1893).

Related Entries
The current version of this entry was created on June 27, 2026.

Link to the Most Recent Version of this Lexicon Entry

Legal Theory Lexicon 117: Constitutional Theory

If you find this entry valuable, consider subscribing to Legal Theory Stack on Substack at https://lsolum.substack.com/subscribe for regular updates on legal theory topics.
June 21, 2026

Legal Theory Lexicon 116: Civil Procedure Theory
Many first-year law students experience civil procedure as a bewildering mass of complex and technical rules. There are rules about pleading, jurisdiction, joinder, discovery, summary judgment, and preclusion—and the rules have exceptions, and the exceptions have their own exceptions. It is easy to feel lost. But a student who sees only the rules is missing the forest for the trees.

Once you take a step back from the individual rules and look at the course as a whole, you gain access to a crucial insight: Civil procedure is a course about big questions. What is law? What makes a procedure fair? Why is the law in action so often different from the law on the books? These questions aren’t distractions or detours—they are the heart and soul of civil procedure. The case that anchors many procedure courses, Erie Railroad v. Tompkins, turns out to rest on a contested view about the very nature of law. Interpretation of the Due Process Clauses of the Fifth and Fourteenth Amendment raise deep questions about procedural fairness. And the gap between the rules as written and the way litigation actually unfolds illuminates the core legal realist insight that the law in action may be very different than the law on the books.

When these themes come into view, your experience of civil procedure can be transformed. The technical rules become the surface of something deeper, and the connections between procedure and the rest of legal theory come into view. Approached in this way, civil procedure can become the most exciting course in the first-year curriculum.

This entry introduces the major theoretical themes of civil procedure. As always, the Lexicon is aimed at law students, especially first year law students, with an interest in legal theory.

Two Models of Civil Procedure Pedagogy

There is more than one way to teach civil procedure, and the differences are not merely matters of style. They reflect different views about what the course is fundamentally about. It is useful to distinguish two models and the theoretical assumptions that ground them.

The first model descends from the legal process school associated with Henry Hart, Albert Sacks, and their colleagues at Harvard in the middle of the twentieth century, and its application to procedure is associated especially with Benjamin Kaplan, one of the principal architects of the modern Federal Rules. The legal process school approached law as a system of institutions, each with its own competence, and it emphasized the importance of reasoned justification for legal decisions. (See Legal Theory Lexicon 082: Reasoned Elaboration.) On this model, the procedure course is organized around the deep structural questions of the litigation system: how authority is allocated between state and federal courts, what role each institution is competent to play, and what justifies the exercise of judicial power. The case of Erie Railroad v. Tompkins becomes the heart of the course, because Erie raises these structural questions in their most fundamental form.

The second model is sometimes associated with the approach developed at the University of Wisconsin, and it reflects the “law in action” tradition in American legal thought. On this model, the course is organized around how lawsuits actually work. It begins with pleading and follows the litigation as it unfolds—through discovery, motion practice, trial, and judgment. The animating questions are practical and empirical: What actually happens when a party files a complaint? How do cases really get resolved? Here the emphasis falls less on the grand structural questions and more on the way that the litigation process works in practice.

Most actual courses combine elements of both the Harvard legal process model and the Wisconsin model. First year law students who take a course that closely adheres to the Wisconsin model may learn more about the ways in which civil procedure shapes the law in action and less about what the Erie Doctrine has to say about the nature of law and the allocation of authority between state and federal courts. And vice versa! Many first year courses combine elements of both approaches. And your civil procedure professor may not even be aware of the history of the civil procedure course and the relationship of that history to the deep theoretical questions that underlie the technical details of procedural doctrines.

Enough history. Let’s dig into civil procedure theory. We can start with Erie Railroad v. Tompkins!

Erie and Two Foundational Questions

Some of you will be reading this Lexicon entry before you have read Erie, while others will have read the case but may still be unsure about what Erie was actually about. Of course, there is no single answer to that question: Erie is about many things. But among those issues are two deep questions, one about the nature of law itself and the other about the distinction between substance and procedure.

What Is Law?

Before Erie Railroad v. Tompkins, federal courts hearing state-law disputes followed the rule of Swift v. Tyson: on questions of “general” common law, a federal court could exercise its own independent judgment about what the law was, rather than following the decisions of the state’s courts. Justice Holmes attacked this practice in a famous dissent, and the ground of his attack was a theory about the nature of law itself. “The common law,” he wrote, “is not a brooding omnipresence in the sky, but the articulate voice of some sovereign or quasi sovereign that can be identified.” Southern Pacific Co. v. Jensen, 244 U.S. 205, 222 (1917) (Holmes, J., dissenting). The phrase “the articulate voice of some sovereign” is a compact statement of the sovereign-command theory of law associated with John Austin: law is the command of a sovereign backed by a threat of punishment, and there is no law without an identifiable sovereign source. On this view, the question “what is the law?” can never be answered by appeal to principles floating free of any lawmaker; one must always be able to identify the sovereign. If the general common law is not the law of a particular state (state common law) or law created by the national government (federal common law), then it really isn’t law at all!

Behind this disagreement lie two pictures of what judges do when they decide common-law cases, and they line up with two great traditions in legal philosophy. On the discovery picture, the common law is a body of principles that exists independently of any court’s decisions, and the judge’s task is to discover and declare what that law already is; this picture fits comfortably with natural law theory, which holds that there is law to be found that does not depend on any human lawmaker. On the interstitial legislation picture, judges do not find pre-existing law but make it, filling the gaps left by other lawmakers; this picture fits with legal positivism, which holds that all law traces to some human source. Swift presupposed the discovery picture: it allowed a federal court to find the “general” common law on the assumption that general common law exists apart from the decisions of any particular state. When the Court overruled Swift in Erie, declaring that “there is no federal general common law,” it rejected the discovery picture and embraced the view that the common law is always the law of some identifiable sovereign (e.g. the law of a particular state). (See Legal Theory Lexicon 093: Common Law.) These alignments are natural but not strict—a natural lawyer can allow a role for interstitial judicial lawmaking, and a positivist can accept that common-law rules grow out of custom and social practice.

The dispute behind Erie is therefore a striking example of how a question in general jurisprudence can be decisive in an actual case. Whether law is discovered or made, and whether it must always trace to a particular sovereign, are among the questions in the debate between natural law theory and legal positivism. Just to be clear, the sovereign command theory that Holmes invoked in his famous “brooding omnipresence” aphorism is not state of the art legal positivism today. (See Legal Theory Lexicon 065: The Nature of Law.)

What Is Procedure?

Erie, which raises the question “what is law?”, also raises a second foundational question: what is procedure? Different civil procedure courses handle this question very differently. Some omit Erie entirely; others devote a single day to the Erie doctrine and move on. But another approach treats the whole Erie canon—Erie itself, Guaranty Trust Co. v. York, Byrd v. Blue Ridge Rural Electric Cooperative, Hanna v. Plumer, Shady Grove Orthopedic Associates v. Allstate Insurance Co., and others—as central to the course. These cases all grow out of a single practical problem: when a federal court hears a state-law claim, it must apply state substantive law but may apply its own procedural rules, and so it must decide which rules are which. In working through that problem, the cases confront a question that turns out to be surprisingly deep: what makes a rule procedural rather than substantive, and how is procedure to be distinguished from substance at all?

At first the distinction seems easy. We feel sure that torts, contracts, and property are matters of substance, and that pleading, jurisdiction, joinder, and discovery are matters of procedure. But the confident intuition breaks down under pressure, and there are two ways to respond. The first view holds that there is no real line to be drawn—that the distinction between substance and procedure is, as Linda Mullenix has argued, inherently unresolvable, so that calling a rule “procedural” is ultimately just a label for rules we choose to treat that way. The second view holds that there is a genuine distinction, but that it is more complex than the easy intuition suggests, because substance and procedure are entangled: a single rule can do substantive and procedural work at the same time, shaping primary conduct even as it governs the conduct of litigation. On this second view, the difficulty of drawing the line reflects not the absence of a distinction but the way substantive and procedural functions are interwoven in actual legal rules. If you are interested in these questions, my own views are presented in Lawrence B. Solum, Procedural Justice, 78 S. Cal. L. Rev. 181 (2004).

Procedural Justice: What Makes a Procedure Fair?

The second great theme of civil procedure is procedural justice: the question of what makes a dispute-resolution procedure fair. Procedural due process is the constitutional basis for limits on personal jurisdiction, and the topic also arises in connection with requirements for notice and an opportunity to be heard. This topic is covered in more depth in a separate Lexicon entry. See Legal Theory Lexicon 023: Procedural Justice.

One way to understand theories of procedural justice is via three models, the first of which is the accuracy model. On this view, the point of a civil procedure is to reach correct outcomes—to apply the law correctly to the true facts. A procedure is fair to the extent that it is accurate, and unfair to the extent that it produces erroneous results. Accuracy is plainly one thing we want from a procedure, but it cannot be the whole story, because perfect accuracy might cost so much that it would make no sense to pay the price. A system that spent a decade and a fortune on every parking ticket might approach 100% accuracy, but imposing those costs on litigants hardly seems fair.

The second approach to procedural fairness is the balancing model. On this view, procedural fairness requires a sensible tradeoff between the benefits of accuracy and the costs of achieving it. The Supreme Court adopted a version of this approach in Mathews v. Eldridge, 424 U.S. 319 (1976), which asks courts to weigh the private interest at stake, the risk of an erroneous deprivation under existing procedures, and the government’s interest, including the burden that additional procedures would impose. The balancing model captures something the accuracy model misses: procedures cost money and time, and a fair procedure must take those costs into account. But it too may be incomplete, because it treats fairness as entirely a matter of costs and benefits.

The third is the participation model. On this view, a procedure is fair only if those who will be bound by its outcome have had a meaningful opportunity to participate—notice of the proceeding and a chance to be heard. The principle is vividly illustrated by Hansberry v. Lee, 311 U.S. 32 (1940), which held that a person cannot constitutionally be bound by a judgment in litigation to which he was not a party and in which his interests were not adequately represented. What is striking about this principle is that it does not seem to reduce to accuracy or to cost. A person denied the chance to participate has a complaint even if the outcome happens to be correct and even if giving him a hearing would have been expensive. The value of participation appears to be, at least in part, independent of the values captured by the first two models—a point that suggests procedural fairness is not simply a matter of getting good outcomes at a reasonable price.

Law in Action and Law on the Books

A third theme of civil procedure is the gap between the law on the books and the law in action—between the rules as written and the way litigation actually unfolds. This distinction is a central preoccupation of the empirical and realist tradition in procedural thought, and it is one of the places where the study of procedure connects to the social reality of the legal system. The rules of civil procedure are not self-executing. How they operate depends on the costs of litigation, the incentives of the parties, and the practical dynamics of the litigation process—and those forces can pull the law in action away from what the rules on their face appear to require.

Pleading is a good example. Before the Supreme Court’s decisions in Bell Atlantic Corp. v. Twombly, 550 U.S. 544 (2007), and Ashcroft v. Iqbal, 556 U.S. 662 (2009), a claim that discovery would eventually have shown to be groundless could nonetheless succeed in practice. The reason was a cost asymmetry: because defending against discovery is often far more expensive than pursuing it, a defendant might rationally settle even a meritless claim rather than bear the cost of defeating it. A claim the law on the books called meritless could prevail in the law in action. Twombly and Iqbal were in part a response to this problem. By raising the pleading standard—requiring a complaint to state a claim that is “plausible” on the facts alleged—the Court sought to screen out groundless claims before the expensive machinery of discovery is set in motion. But the response generates a problem of its own. Because plausibility is judged on the facts a plaintiff has been able to plead, and because some meritorious claims rest on facts that can be obtained only through discovery—which now comes too late—the new regime can defeat valid claims at the pleading stage. A claim the law on the books treats as fully valid can be destroyed in the law in action.

The pleading example points to a more general lesson. Procedural rules are not merely the neutral infrastructure of the litigation system; they shape which claims succeed and which fail, and in doing so they change the incentives that govern conduct in the world—whether a firm risks anticompetitive behavior, or whether an injured plaintiff can hope to vindicate a claim. A rule that is procedural in form can be substantive in effect. This is the practical face of a point encountered earlier in the discussion of Erie: substance and procedure are entangled. There the entanglement was a conceptual difficulty about how to classify rules; here it is an observable fact about how procedural rules reach out and govern primary conduct.

Interpretation: Constitutional, Statutory, and Rules

Civil procedure differs from torts, contracts, and property in a way that is easy to overlook. The substantive common-law subjects are built largely from judicial decisions; their law lives in the caselaw. Procedural law, by contrast, dwells in authoritative texts—the Due Process Clauses of the Constitution, jurisdictional and procedural statutes enacted by Congress, and the Federal Rules of Civil Procedure. Because procedure is governed by texts of three different kinds, the civil procedure course is the first place many law students encounter theories of interpretation and their application to different kinds of legal texts.

Consider constitutional interpretation. The law of personal jurisdiction is built on the Due Process Clause, and its history is a case study in competing approaches to constitutional meaning. The early law, exemplified by Pennoyer v. Neff, 95 U.S. 714 (1878), was formal and rule-like: jurisdiction turned on the physical presence of the defendant or his property within the state’s territory. That formalism gave way in International Shoe Co. v. Washington, 326 U.S. 310 (1945), which reframed the question in terms of whether requiring the defendant to litigate in the forum comports with “traditional notions of fair play and substantial justice.” International Shoe is often read as an example of living constitutionalism: the Court interprets the open-ended language of the Due Process Clause by appeal to evolving constitutional values, here the value of fairness. But fairness is not the only way to read the clause. Justice Black, concurring, objected that the Constitution leaves each state the power to open its courts to suits against corporations doing business there, and that conditioning that power on the Court’s notion of “fair play” was itself a judicial deprivation. On Black’s view, the question is not whether a state’s assertion of jurisdiction strikes the Justices as fair, but whether the state has acted within its lawful authority—an approach that locates the meaning of due process in the positive law rather than in the Court’s evolving sense of fairness, and that has affinities with later originalist readings of the clause. Justice Black understood the original meaning of the Due Process of Law Clauses to require the process that is due as a matter of positive law—the legal procedures actually in force at the time of the deprivation. The disagreement is not really about personal jurisdiction at all; it is about how to interpret a constitutional provision.

Statutory interpretation enters the course through the jurisdictional statutes. A good illustration is the supplemental jurisdiction statute, 28 U.S.C. § 1367, and the Supreme Court’s divided interpretation of it in Exxon Mobil Corp. v. Allapattah Services, Inc., 545 U.S. 546 (2005). Writing for the majority, Justice Kennedy read the statute by its plain text: because the language unambiguously authorized supplemental jurisdiction over the claims at issue, there was no need to consult the statute’s legislative history, and little reason to trust it even if consulted. Justice Ginsburg, in dissent, read the statute against the background of the settled jurisdictional rules that preceded it, arguing that Congress had given no clear signal that it meant to discard a long-established limit. The split between them provides an illustration of the great divide in statutory interpretation—between a textualism that begins and ends with the enacted words in context and an approach that reads those words in light of statutory purpose and prior understandings. (See Legal Theory Lexicon 078: Theories of Statutory Interpretation and Construction.)

Finally, the Federal Rules raise an interpretive problem of their own, distinct from both constitutional and statutory interpretation. The Rules are not statutes: they are promulgated through a rulemaking process under the Rules Enabling Act, drafted by an Advisory Committee whose notes are an unusually authoritative guide to their meaning. This gives rise to a question with no exact counterpart elsewhere. When the Court concludes that a Rule is producing undesirable results, it has a choice: it can set in motion the formal process of amending the Rule, or it can reinterpret the existing Rule to mean something new. Twombly and Iqbal are again instructive. The plausibility standard those cases announced is widely understood to have changed or even negated the meaning of Rule 8’s requirement of “a short and plain statement of the claim,” and it did so through interpretation rather than through the amendment process—a result that looks more purposivist than textualist, and that raises the question whether reform of the Rules should proceed through reinterpretation or through the channels the Rules Enabling Act provides. Because the Rules occupy a middle ground between statute and judicial doctrine, the theory of how they should be interpreted remains genuinely unsettled.

Conclusion: The Theoretical Stakes of a Doctrinal Course

The rules of civil procedure can be learned as a body of technical doctrine, and for many purposes they must be. But beneath the doctrine lie some of the deepest questions in legal theory. The debate behind Erie is a debate about the nature of law and about what separates substance from procedure. The law of due process raises the question of what makes a procedure fair. The gap between the law on the books and the law in action reveals that procedural rules do not merely process disputes but shape the conduct they govern. And the interpretation of the constitutional, statutory, and rule-based texts of procedure requires the student to confront, often for the first time, the competing theories of how legal texts should be read. In each case, what looks like a settled technical rule turns out to rest on a contested theoretical foundation.

This theoretical depth was one of the reasons that the legal process tradition saw civil procedure as the theoretical core of the first year curriculum, and it is why that tradition placed cases like Erie at the center of the course. The stage-by-stage, law-in-action approach associated with the Wisconsin model of the course has its own great virtues, and most courses draw on both models. But a student who sees only the stages and the rules will miss what makes the subject genuinely deep. Civil procedure is not merely the plumbing of the legal system; it is a sustained encounter with the fundamental questions of legal theory, encountered in the concrete and consequential setting of a lawsuit. Approached with that in mind, it can be the most exciting course in the first-year curriculum.

Related Entries
Bibliography

Robert G. Bone, Agreeing to Fair Process: The Problem with Contractarian Theories of Procedural Fairness, 83 B.U. L. Rev. 485 (2003).

Max Crema & Lawrence B. Solum, The Original Meaning of “Due Process of Law” in the Fifth Amendment, 108 Va. L. Rev. 447 (2022).

William N. Eskridge, Jr., Dynamic Statutory Interpretation (1994).

Lon L. Fuller, The Forms and Limits of Adjudication, 92 Harv. L. Rev. 353 (1978).

Benjamin Kaplan, Continuing Work of the Civil Committee: 1966 Amendments of the Federal Rules of Civil Procedure (I), 81 Harv. L. Rev. 356 (1967).

Jerry L. Mashaw, The Supreme Court’s Due Process Calculus for Administrative Adjudication in Mathews v. Eldridge: Three Factors in Search of a Theory of Value, 44 U. Chi. L. Rev. 28 (1976).

Linda S. Mullenix, The Constitutionality of the Proposed Rule 23 Class Action Amendments, 39 Ariz. L. Rev. 615 (1997).

Roscoe Pound, Law in Books and Law in Action, 44 Am. L. Rev. 12 (1910).

Lawrence B. Solum, Procedural Justice, 78 S. Cal. L. Rev. 181 (2004).

Lawrence B. Solum & Max Crema, Originalism and Personal Jurisdiction: Several Questions and a Few Answers, 73 Ala. L. Rev. 483 (2022).

Created: June 21, 2026

Link to the Most Recent Version of this Lexicon Entry

Legal Theory Lexicon 116: Civil Procedure Theory

Link to the Legal Theory Stack

The Legal Theory Stack informs readers of a new or revised Legal Theory Lexicon entry every Sunday along with summaries of the Download of the Week and the Legal Theory Bookworm recommendation. Subscribe to the Legal Theory Stack here.
June 9, 2026

Legal Theory Lexicon 115: Tort Theory
First year law students encounter many important concepts for the first time in their torts course. The “reasonable person” and Learned Hand’s formula appear in the Carroll Towing case. The mysteries of causation are introduced in Palsgraf. And debates over whether negligence or strict liability should provide the standard for imposing liability introduce important debates in normative legal theory. Behind these doctrinal questions lies tort theory.

The doctrinal questions of tort law prompt deeper theoretical questions of tort theory. Why does the law require some people to pay damages to others for the harms they cause? What kinds of conduct should trigger liability? When the law does impose liability, what is it really doing—compensating victims, deterring dangerous behavior, or giving expression to some idea of justice between the parties? What normative theory should guide tort doctrine, consequentialism, deontology, or something else? Tort theory aims to provide systematic answers to these questions and others.

This entry in the Legal Theory Lexicon provides an overview of the major theoretical approaches to tort law. As always, the Lexicon is aimed at law students, especially first-year students with an interest in legal theory.

Let’s begin with the most basic questions. What is a tort? And what is a theory of tort law?

What Is Tort Theory?

A tort is a civil wrong, other than a breach of contract, for which the law provides a remedy—typically money damages. The core of the first-year course is the law of negligence, but the field also includes intentional torts (battery, trespass, defamation) and various forms of strict liability (liability for abnormally dangerous activities, products liability). Tort theory asks what, if anything, holds this body of law together and what justifies it.

It is useful to distinguish four kinds of question that tort theorists ask.

The first is the question of justification: why should the law impose liability at all? When a defendant injures a plaintiff, the law might do nothing, leaving the loss where it falls. Or it might provide compensation through some public scheme, as workers’ compensation systems and the New Zealand accident compensation scheme do. Tort liability is a particular response—it requires one private party to pay another—and that response calls for justification.

The second is the question of the standard of conduct: what must a defendant have done, or failed to do, to be held liable? The central divide here is between fault and strict liability. A fault standard holds defendants liable only when they have behaved wrongfully—most often, negligently. A strict-liability standard holds defendants liable for the harms they cause regardless of whether they took reasonable care. Much of tort theory is an argument about which standard is appropriate, and why.

The third is the question of remedy: when liability is established, what should the defendant be required to do, and why? Tort law’s standard remedy is compensatory damages measured by the plaintiff’s loss. But theorists disagree sharply about what this remedy is for. Is it a device for spreading or deterring losses, or is it the law’s way of undoing a wrong as between the two parties?

The fourth is a question that helps fix the boundaries of the field: how and why is tort liability different from criminal punishment? Both tort and criminal law respond to wrongful conduct, and the same act—a punch, a reckless drive—can give rise to both a tort suit and a criminal prosecution. But the two differ in structure. A criminal case is brought by the state and aims at punishment; a tort case is brought by the injured party and aims at compensation. The criminal defendant who is convicted pays a debt to society; the tort defendant who loses pays damages to the plaintiff. The standard of proof differs too: guilt must be proved beyond a reasonable doubt, while tort liability need only be established by a preponderance of the evidence. As we will see, this distinctive structure—a private plaintiff seeking a remedy against a private defendant—is something that some theories of tort treat as merely incidental and others treat as essential.

There are other questions as well. For example, tort law needs an account of causation—a topic that has its own Legal Theory Lexicon entry: Legal Theory Lexicon 020: Causation. This Lexicon entry focuses on the questions that have occupied tort theorists in recent years, but tort theory is a dynamic field: old questions sometimes fade into the background, and new questions become the focus of attention.

The discussion that follows focuses on some of the most important approaches to tort theory. They are presented here in roughly the order in which they came to prominence in the modern academic debate, beginning with the economic analysis of tort law that reshaped the field in the 1960s and 1970s.

Economic Theories of Tort Law

The economic analysis of law transformed the study of torts. On the economic approach, the central purpose of tort law is not to do justice between the parties but to promote efficiency—to minimize the total social costs associated with accidents. This idea was developed in different ways by Guido Calabresi, whose book The Costs of Accidents (1970) framed the problem of accident law as one of cost minimization, and by Richard Posner, who argued that the common law of negligence could be understood as if it were designed to produce efficient outcomes.

The economic approach begins from a simple observation: accidents are costly, but so is preventing them. This approach to tort law has its theoretical roots in the work of Ronald Coase; for an introduction to his approach, see Legal Theory Lexicon 002: The Coase Theorem. Driving more slowly reduces the risk of collisions but also makes everyone late. The goal, on this view, is not to eliminate accidents—that would be far too expensive—but to find the level of precaution at which the combined costs of accidents and accident avoidance are as low as possible. Tort law promotes this goal by giving potential injurers an incentive to take cost-justified precautions, and only cost-justified precautions. A related idea is that liability should sometimes be placed on the cheapest cost avoider—the party best positioned to prevent the harm at the lowest cost—because doing so gives the right party the incentive to act.

The most famous expression of this idea is the Hand formula, named for Judge Learned Hand. In United States v. Carroll Towing Co. (1947), Hand suggested that whether a defendant was negligent could be analyzed in terms of three variables: the burden of taking a precaution (B), the probability that harm would occur without it (P), and the magnitude of the harm if it did occur (L). On this analysis, a defendant is negligent when the burden of precaution is less than the expected harm it would prevent—that is, when B is less than P multiplied by L. The intuition is straightforward: it is unreasonable to fail to take a precaution that costs less than the harm it would avoid, and reasonable to forgo a precaution that costs more than it is worth. Economists read the Hand formula as an instruction to weigh the costs and benefits of precaution at the margin, and they treat it as evidence that negligence law is, at bottom, about efficient deterrence.

Economic theories come in several varieties and face several objections. Some economists emphasize deterrence (giving actors incentives to take care), while others emphasize loss spreading (placing losses on parties, such as manufacturers, who can distribute them across many transactions through prices and insurance). Critics object that the economic approach mismeasures or ignores values that tort law seems to care about—that it cannot easily account for the moral significance of wronging a particular person, and that its focus on aggregate social welfare sits uneasily with tort law’s bilateral structure, in which a particular plaintiff sues a particular defendant. It is this last objection that animates the corrective justice theories to which we now turn.

Corrective Justice Theories

Beginning in the 1980s, a number of theorists mounted a sustained challenge to the economic account, arguing that tort law is best understood not as an instrument of social policy but as an expression of corrective justice. The idea of corrective justice is ancient—it traces to Aristotle’s Nicomachean Ethics, which distinguished corrective justice (concerned with rectifying wrongful losses and gains between two parties) from distributive justice (concerned with the fair allocation of goods across society as a whole). The leading modern exponents include Ernest Weinrib, Jules Coleman, and Richard Wright.

The central claim of corrective justice theory is that tort law responds to a wrong done by one person to another, and that the defendant’s duty to repair the harm flows directly from the fact that the defendant wrongfully caused it. On Weinrib’s influential account, developed in The Idea of Private Law (1995), the structure of a tort case is correlative: the plaintiff and defendant are linked as the two parties to a single transaction, and the law’s job is to undo the wrong that runs between them. This bilateral structure is not, as the economist would have it, an awkward administrative feature to be explained away. It is the very point of the enterprise. Weinrib famously insisted that private law should be understood on its own terms rather than as a vehicle for external goals—a view often summarized by his remark that the only purpose of private law is to be private law.

Corrective justice theorists thus take seriously features of tort law that the economic approach struggles to explain: that the plaintiff must have been wronged, that the defendant must have caused the harm, and that the remedy runs from this defendant to this plaintiff rather than into a general compensation fund. Critics respond that corrective justice theory describes the form of tort law without justifying it—that it tells us tort law has a bilateral structure but does not explain why society should maintain such an institution rather than replacing it with a more efficient compensation scheme. The standard of conduct also remains contested: corrective justice theorists differ among themselves about whether the wrong at the heart of tort law requires fault or can rest on the mere causation of harm.

Civil Recourse Theory

A distinct and influential approach, developed primarily by John Goldberg and Benjamin Zipursky, is civil recourse theory. Civil recourse theory shares the corrective justice theorists’ conviction that tort law is about wrongs rather than efficient loss allocation, but it locates the heart of the matter somewhere different. On the civil recourse view, tort law does not impose a duty of repair on wrongdoers. Rather, it empowers those who have been wronged—it gives the victim of a tort a legal avenue of recourse against the person who wronged her.

The point is best seen by attending to the structure of a tort suit. Tort law does not automatically transfer money from injurers to victims. It gives the injured party the power to sue, and leaves it to her whether to exercise that power. This, Goldberg and Zipursky argue, is no accident. It reflects a principle with deep roots in political theory: when the state prohibits private violence and self-help, it owes those who are wronged an alternative—a civil action through which they can hold wrongdoers answerable. Tort law is the law’s substitute for private redress. This is where the contrast with criminal punishment, flagged earlier, becomes theoretically central: the criminal law vindicates the public interest through state prosecution, whereas tort law equips the private victim with a means of acting against the person who wronged her.

Civil recourse theory and corrective justice theory are close cousins, and the differences between them are subtle. Both insist that tort law concerns wrongs and that its bilateral structure is essential. The principal disagreement is over whether tort law is fundamentally about the wrongdoer’s duty to repair (corrective justice) or the victim’s power to obtain redress (civil recourse). Critics of both approaches sometimes wonder whether this difference makes a practical difference; defenders respond that it explains otherwise puzzling features of tort doctrine, such as the plaintiff’s freedom to decline to sue and the law’s insistence that the plaintiff personally have been wronged.

Rights-Based and Kantian Theories

Closely related to corrective justice, but worth distinguishing, are theories that ground tort law in a structure of individual rights. On these accounts, tort law protects the rights persons have against one another—rights to bodily integrity, to property, to reputation—and a tort is, at bottom, the violation of such a right. The remedy gives effect to the right by requiring the violator to answer for the violation.

Two strands deserve mention. The first is the Kantian account developed by Arthur Ripstein, who argues in Private Wrongs (2016) that tort law expresses a system of equal freedom: each person is entitled to independence from the choices of others, and torts are violations of that independence. On this view, the wrong in a tort is the defendant’s use of the plaintiff—or the plaintiff’s body or property—as a means to the defendant’s own ends, without the plaintiff’s consent. The second is the rights-essentialism of Robert Stevens, who argues in Torts and Rights (2007) that tort law is best understood as a law of rights and their infringement, and that damages substitute for the right that was violated. These theories share with corrective justice an emphasis on the relationship between the parties, but they place the concept of a right rather than the concept of a wrongful loss at the foundation.

A recurring example illuminates the rights-based perspective. In Vincent v. Lake Erie Transportation Co. (1910), a ship’s owner deliberately kept the ship tied to a dock during a storm to save the ship, and the dock was damaged as a result. The court held the shipowner liable for the damage even though keeping the ship moored was reasonable—indeed, the privilege of necessity made it lawful. Rights-based theorists find this result congenial: the dock owner’s property right was infringed, and the infringement called for compensation, even though the defendant acted reasonably. The case is a puzzle for any theory that ties liability tightly to wrongful conduct, and it remains a favorite battleground among tort theorists.

Mixed, Pluralist, and Instrumentalist Theories

Not every theorist believes that tort law has a single unifying purpose. Many take a pluralist or mixed view, holding that tort law serves several aims at once—deterrence, compensation, corrective justice, the vindication of rights—and that no monistic theory captures the whole. Others defend frankly instrumentalist accounts, treating tort law as one policy tool among many for managing the social problem of accidents.

Several positions fall under this heading. Gregory Keating has developed accounts that draw on contractualist moral theory, asking which risk-imposing arrangements could be justified to all affected by them, and using that idea to illuminate the line between negligence and strict liability. The mid-twentieth-century enthusiasm for enterprise liability—the idea that businesses should bear the accident costs of their activities because they are well placed to spread those costs—reflects an instrumentalist sensibility, as does much of modern products-liability law. And some theorists argue that the search for a single grand theory is misguided: tort law, on this view, is a historically evolved institution that does several jobs imperfectly, and the theorist’s task is to understand and improve it rather than to reduce it to one idea. Kenneth Simons exemplifies a related analytic sensibility: rather than defending a single master theory, he has dissected the building blocks of tort law—the concept of negligence, the role of mental states, the consensual rationale for assumption of risk—in ways that cut across the economic and corrective justice camps.

Pluralism has obvious attractions—it fits the messy reality of tort doctrine—but it faces a challenge of its own. A theory that says tort law serves many values must also tell us how to proceed when those values conflict, as they frequently do. Without an account of how the competing aims are to be weighed, pluralism risks describing the disagreement rather than resolving it.

Critical and Distributive Perspectives

A final family of approaches steps outside the internal debate among economists, corrective justice theorists, and rights theorists to ask broader questions about tort law’s social role. Theorists working in the critical legal studies tradition, in feminist legal theory, and in the study of law and inequality have argued that tort law’s apparently neutral concepts—the “reasonable person,” the measure of damages by lost earnings—can encode and reproduce existing social hierarchies. When damages for lost future income are calculated using gender- or race-based wage tables, for example, the law may entrench the very inequalities it purports to ignore.

These perspectives raise the question of distributive justice—the fair allocation of resources and risks across society—which the dominant theories of tort largely set to one side. Most corrective justice theorists insist that distributive questions, however important, belong to other parts of the legal system (taxation, social welfare) rather than to tort law. Critics reply that this division of labor is itself a choice with distributive consequences, and that tort law cannot be neatly insulated from questions about who bears the burdens of accidents in an unequal society. This entry treats these perspectives briefly, but they form an important counterpoint to the theories surveyed above.

Cross-Cutting Doctrinal Questions

Several doctrinal questions cut across the theoretical debates and offer good test cases for any theory of tort.

The first is the choice between negligence and strict liability, already introduced above. Why does the law require fault in some domains (most ordinary accidents) but not in others (abnormally dangerous activities, defective products)? Economic theorists explain the choice in terms of incentives and activity levels; corrective justice and rights theorists explain it in terms of the kind of wrong involved. The objective character of the negligence standard is itself a puzzle: in Vaughan v. Menlove (1837), the court held a defendant to the standard of a reasonable person even though he may have done his honest best given his limited intelligence. Why should tort law judge defendants by an external standard rather than by their individual capacities? Each theory must say something about this. Kenneth Simons has examined these questions with particular care, distinguishing the “conduct negligence” that dominates tort law from the “cognitive negligence” more central to criminal law, and analyzing the several institutional functions that an objective negligence standard serves. His work on the consensual rationale for assumption of risk—the idea that a plaintiff who knowingly and voluntarily confronts a risk may have no claim—further illuminates how the law weighs the choices of victims, not just the conduct of injurers.

A second question concerns causation. Tort liability ordinarily requires that the defendant’s conduct have caused the plaintiff’s harm, but causation raises notorious difficulties—both factual (would the harm have occurred anyway?) and legal (is the harm too remote, or too unforeseeable, to count?). The famous case of Palsgraf v. Long Island Railroad Co. (1928) crystallizes the problem of proximate cause and the related question of the duty of care: to whom is a careless defendant answerable, and for which consequences? Theories of tort are tested by how well they explain why the law draws the lines it does.

A third question concerns the measure of damages. If tort law aims at deterrence, damages should be set at the level that gives optimal incentives. If it aims at corrective justice, damages should restore the plaintiff to the position she occupied before the wrong. If it aims at the vindication of rights, damages should substitute for the right infringed. These aims do not always point in the same direction, and the law of damages—compensatory, punitive, nominal—remains a fertile field for theoretical disagreement.

Conclusion

Tort theory is a debate about what a whole field of law is for. The economic approach sees tort law as a system for minimizing the costs of accidents; corrective justice theory sees it as an institution for rectifying wrongs between persons; civil recourse theory sees it as the law’s provision of an avenue of redress to those who have been wronged; rights-based theories see it as the protection and vindication of individual rights; and pluralists deny that any single account can capture the whole. For the first-year student, the value of these theories is not that one of them is simply correct, but that each illuminates features of the doctrine that might otherwise pass unnoticed—and that learning to see tort law through these competing lenses is itself a central part of learning to think like a lawyer.

Related Lexicon Entries
Bibliography
- Guido Calabresi, The Costs of Accidents: A Legal and Economic Analysis (1970).
- Ronald H. Coase, The Problem of Social Cost, 3 Journal of Law and Economics 1 (1960).
- Jules L. Coleman, Risks and Wrongs (1992).
- John C.P. Goldberg & Benjamin C. Zipursky, Recognizing Wrongs (2020).
- Gregory C. Keating, Reasonableness and Rationality in Negligence Theory, 48 Stanford Law Review 311 (1996).
- Richard A. Posner, A Theory of Negligence, 1 Journal of Legal Studies 29 (1972).
- Arthur Ripstein, Private Wrongs (2016).
- Kenneth W. Simons, Assumption of Risk and Consent in the Law of Torts: A Theory of Full Preference, 67 Boston University Law Review 213 (1987).
- Kenneth W. Simons, Dimensions of Negligence in Criminal and Tort Law, 3 Theoretical Inquiries in Law 283 (2002).
- Ernest J. Weinrib, The Idea of Private Law (1995).
- Richard W. Wright, Right, Justice, and Tort Law, in Philosophical Foundations of Tort Law (David G. Owen ed., 1995).
The current version of this entry was created on June 9, 2026.

Link to the Most Recent Version of this Lexicon Entry

Legal Theory Lexicon 115: Tort Theory

If you find this entry valuable, consider subscribing to Legal Theory Stack on Substack at https://lsolum.substack.com/subscribe for regular updates on legal theory topics.
June 5, 2026

Legal Theory Lexicon 114: Contract Theory
Introduction

Contracts is a foundational course, almost always included in the first semester of the first-year curriculum in American law schools. Although the focus of contracts courses is usually doctrine, theoretical questions inevitably arise: What is a contract? Why does the law enforce promises? What gives a contractual obligation its force? Should the law of contract aim at efficiency, autonomy, fairness, or some combination of values? These are the questions of contract theory, a vibrant subfield of private law theory. This entry in the Legal Theory Lexicon provides an overview of the major theoretical approaches to contract law. As always, the Lexicon is aimed at law students, especially first-year students with an interest in legal theory.

What Is Contract Theory?

The phrase “contract theory” is used in several distinct ways. In legal theory and jurisprudence, “contract theory” refers to philosophical, doctrinal, and economic accounts of the law of contract: why contractual obligations should be legally enforceable, what remedies should be available, how courts should interpret agreements, and what substantive limits the law should place on freedom of contract. But in economics, “contract theory” refers to the formal study of how parties design agreements under conditions of asymmetric information and incomplete contracting.

Incomplete contracts are not the primary topic of this entry, but some of the ideas from that literature are discussed in Legal Theory Lexicon 050: Default Rules and Completeness. The idea of a complete contract is an idealization; we imagine a contract with a vast number of provisions such that all possible contingencies are addressed by explicit contractual language. Actual contracts are incomplete, giving rise to gaps, which contract law can fill with default rules, providing a rule to address the contingencies that the contract itself does not cover. For example, contract law provides default rules governing damages for breach of contract. In addition to default rules, contract law also includes mandatory rules — which cannot be overridden by explicit contractual provisions. Contract doctrines that foreclose legal enforcement of contracts for performance that is contrary to law or contrary to public policy are examples of mandatory rules. The distinction between default rules and mandatory rules is one of the most useful analytic tools provided by contract theory. It is useful to distinguish three kinds of question that contract theorists ask. Descriptive questions ask what contract law actually is and how it has developed. Doctrinal questions ask how the rules and principles of contract law fit together as a coherent body. Normative questions ask what contract law should be — what its justifying aims are and how it should be reformed in light of those aims. Many contract theories address all three questions and cleanly distinguish between them. But sometimes the conceptual distinctions between description, doctrine, and normativity are blurred.

The Will Theory and the Move to Objectivity

The classical theory of contract, dominant in nineteenth-century treatises and case law, grounded contractual obligation in the will of the parties. On this view — often called the will theory — a contract is a meeting of the minds, and what the law enforces is the parties’ joint intention to create a binding obligation. Theophilus Parsons’s Law of Contracts, first published in 1853, was the standard American treatise of the period and systematized the doctrines of offer, acceptance, and consideration that gave the will theory its doctrinal architecture in American legal thought. The will theory came under sustained attack in the late nineteenth and early twentieth centuries. Oliver Wendell Holmes argued that the law could not look inside the minds of contracting parties and must instead attend to their external manifestations. The resulting objective theory of contract asks not what the parties subjectively intended but what a reasonable person in the position of the addressee would have understood the speaker to mean. Samuel Williston’s Treatise on the Law of Contracts, first published in 1920, consolidated the classical doctrinal apparatus within this objective framework. The shift from the will theory to the objective theory is one of the central episodes in the history of contract doctrine, and it sets the stage for many of the debates that follow. Holmes’s impossibility-of-mind-reading argument is sometimes taken as gospel, but it is far from clear that he was correct. Work in the philosophy of mind treats mindreading — the attribution of beliefs, desires, and intentions to others — as a ubiquitous feature of ordinary human social cognition, and common sense confirms that we constantly reconstruct the mental states of those with whom we interact. The law itself routinely assesses subjective mental states. Tort doctrine distinguishes intentional torts from negligent ones, and criminal law makes fine-grained distinctions among purpose, knowledge, recklessness, and negligence in defining the mens rea required for various offenses. If juries can reliably determine whether a defendant intended to kill, there is no obvious reason they cannot determine whether a contracting party intended to be bound. The case for objectivity in contract law, on this reading, must rest on something other than the bare claim that subjective intent is epistemically inaccessible.

Promise Theory

The most prominent contemporary defense of an autonomy-based account of contractual obligation is Charles Fried’s Contract as Promise. Fried argues that contracts are legally enforceable because promises generate moral obligations and the law of contract is the legal recognition of that moral fact. The grounding is broadly Kantian: by making a promise, the promisor invokes a moral convention that allows her to create reasons for others to act, and the practice of promising is itself a way of treating others as autonomous agents capable of being bound by their own commitments. Promise theory faces several well-known objections. Critics question whether the law of contract really tracks the moral law of promising — the law enforces only a subset of promises, and the standard remedy of expectation damages does not obviously map onto what promissory morality requires. The most influential contemporary statement of this critique is Seana Shiffrin’s divergence thesis, which is our next topic.

The Divergence of Contract and Promise

Seana Shiffrin’s The Divergence of Contract and Promise has emerged as one of the most influential challenges to Fried’s promise theory and, more broadly, to any view that treats the law of contract as the legal embodiment of the moral practice of promising. Shiffrin’s central claim is that contract and promise diverge in important normative respects: contract law enforces only a subset of promises, provides remedies — especially expectation damages — that diverge from what promissory morality would require, and imposes formation and enforcement rules that a faithful legal articulation of promissory morality would not endorse. The argument has both critical and constructive dimensions. The critical dimension targets theories that treat contract as the law of promising and shows that such theories do not fit many particular contract doctrines. The constructive dimension argues that contract law should at least be accommodating of the moral conditions of autonomous agency — that the law should not require or encourage people to violate the morality of promising in order to comply with contract doctrine. Shiffrin’s framework has reoriented a substantial part of the contemporary debate, and many autonomy-based theorists now position themselves with respect to her views.

Consent and Transfer Theories

A related but distinct family of theories grounds contractual obligation in consent rather than promise. Randy Barnett’s A Consent Theory of Contract argues that contracts are enforceable because the parties consent to the transfer of alienable rights. On this view, the question is not whether a promise has been made but whether the promisor has manifested consent to be legally bound. Consent theory is naturally compatible with the objective theory of contract: what matters is manifested consent, not consent in some inner sanctum of the will. A more radical variant is the title-transfer theory developed by Murray Rothbard and Williamson Evers and elaborated in libertarian legal theory. On this view, every enforceable contract is a transfer of property rights, and what cannot be analyzed as a transfer of an alienable right is not a contract at all. The title-transfer theory has limited acceptance in mainstream legal academia but is important in libertarian and Austrian-economic approaches to contract theory.

Reliance Theory

A third family of theories grounds contractual obligation in reliance rather than promise or consent. The locus classicus is Lon Fuller and William Perdue’s The Reliance Interest in Contract Damages, which distinguished among three interests that contract law might protect: the restitution interest (preventing unjust enrichment), the reliance interest (compensating losses incurred in reliance on a promise), and the expectation interest (putting the promisee in the position she would have occupied had the contract been performed). Fuller and Perdue argued that the reliance interest has the strongest normative pull, and the argument has shaped contract scholarship ever since. Reliance theory received its most influential modern statement in Patrick Atiyah’s The Rise and Fall of Freedom of Contract. Atiyah argued that the will theory was a nineteenth-century construction and that contract law has always been more closely connected to actual reliance and benefits received than the executory-promise model suggests. Reliance theory has obvious affinities with the doctrine of promissory estoppel, codified in Section 90 of the Restatement (Second) of Contracts.

Economic Theories

The law-and-economics movement has produced a large literature on contract law. The core claim is that contract rules should be evaluated by their consequences for efficiency, understood either as Pareto efficiency or as wealth maximization in the Kaldor-Hicks sense. For more on these concepts, see Legal Theory Lexicon 060: Efficiency, Pareto, and Kaldor-Hicks. Richard Posner’s Economic Analysis of Law provides the canonical treatment, and the contract chapters of that book have shaped a generation of contract scholarship. Several themes recur in the economic literature. One is the theory of efficient breach: when performance would cost the promisor more than the promisee’s expectation interest, the promisor should breach and pay damages, producing an outcome that is at least Kaldor-Hicks superior to performance. A second is the theory of default rules: because contracts are inevitably incomplete, courts must fill gaps, and the choice of default rules has important efficiency consequences. Ian Ayres and Robert Gertner’s Filling Gaps in Incomplete Contracts draws the influential distinction between majoritarian default rules (the rules most parties would have chosen) and penalty default rules (rules designed to induce parties to bargain explicitly over the relevant term). A third theme is information forcing more generally — the use of contract doctrine to allocate the burdens of disclosure and investigation. Economic theory is sometimes presented as a unified normative account of contract law, but it is often more useful to treat it as a set of analytical tools that can be deployed within a variety of normative frameworks.

Corrective Justice Theories

Corrective justice theories ground contract law in the bilateral structure of the relationship between promisor and promisee. The most prominent contemporary corrective justice theorist is Ernest Weinrib, whose The Idea of Private Law argues that private law generally — including contract — should be understood as the legal articulation of corrective justice in the Aristotelian sense. Peter Benson’s Justice in Transactions offers a detailed corrective justice account focused on the transactional structure of contract formation. The central methodological commitment of corrective justice theories is that contract law cannot be explained or justified by reference to aggregate social goals such as wealth maximization. The law of contract is a system of correlative rights and duties between individual parties, and any adequate theory must respect that internal structure. Corrective justice theorists therefore frequently position themselves as critics of welfarist contract theory.

Relational Contract Theory

Relational contract theory, associated above all with Ian Macneil, emphasizes the embedded social context of contractual relationships. Macneil argued that classical contract doctrine assumed a model of discrete transactions between strangers — a model poorly suited to long-term commercial relationships, employment contracts, franchise arrangements, and other ongoing engagements. Relational theory urges attention to the norms, trust, and reciprocity that structure actual contractual behavior. Macneil’s most extended statement appears in The New Social Contract. Stewart Macaulay’s earlier empirical study Non-Contractual Relations in Business documented the gap between formal contract doctrine and the practices of business actors, who often resolve disputes without invoking legal remedies. Relational theory is descriptive and methodological as much as normative; it presses against theories that treat the discrete bargain as the paradigm of contract.

Distributive Justice and Critical Theories

A separate strand of contract theory focuses on the distributive consequences of contract law. Anthony Kronman’s Contract Law and Distributive Justice argued that distributive considerations are not foreign to contract law but are built into doctrines such as unconscionability, duress, and undue influence. Duncan Kennedy’s Form and Substance in Private Law Adjudication analyzed the choice between rules and standards in contract doctrine as bound up with deeper political and distributive disagreements. Critical legal studies, feminist legal theory, and critical race theory have each generated literatures on contract. The common move is to challenge the apparent neutrality of contract doctrine by exposing the background distributions of power and resources that shape what looks like voluntary exchange. Two influential examples are Clare Dalton’s An Essay in the Deconstruction of Contract Doctrine and Mary Joe Frug’s Re-Reading Contracts: A Feminist Analysis of a Contracts Casebook. These critiques are often combined with calls for greater attention to unconscionability, mandatory terms, and distributive concerns in the design of contract rules.

Autonomy Theories and Pluralism

Daniel Markovits and other contemporary theorists have developed autonomy-based accounts that differ from Fried’s promise theory in various respects. Markovits’s Contract and Collaboration emphasizes shared agency and the way in which contract enables parties to pursue joint projects through commitment. Other autonomy-based accounts focus on the conditions under which autonomous agents can rationally bind themselves and on the role of contract law in protecting the integrity of those conditions. These theories are unified by the thought that contract law concerns the conditions under which autonomous agents cooperate through commitment, but they differ on what that thought entails. Stephen Smith’s Contract Theory offers an explicitly pluralist account, arguing that no single value can fully justify the rules and doctrines of contract law and that an adequate theory must combine considerations of autonomy, reliance, efficiency, and corrective justice. Pluralist theories trade theoretical ambition for fit and concede that contract law is a complex institution serving multiple purposes.

Cross-Cutting Questions

Several questions cut across the major frameworks. What is the proper measure of damages — expectation, reliance, or restitution? When should courts enforce liquidated damages clauses, and when should they refuse to do so as penalties? How should courts approach the interpretation of contracts, and what role should context, course of dealing, and trade usage play? When should mandatory terms override the parties’ express agreement? Each of these questions can be addressed within any of the theoretical frameworks discussed above, and one of the central tasks of contract theory is to work out the implications of each framework for these doctrinal problems.

Conclusion

Contract theory is a particularly rich field within legal theory because contract law touches so many central questions in moral, political, and economic philosophy. The first-year law student who works through the basic doctrines will find that almost every rule — from offer and acceptance to remedies and excuse — invites a theoretical question. The frameworks surveyed here provide a vocabulary for asking those questions and a map of the contemporary debates. Many of the most important developments in legal scholarship over the last fifty years have grown out of the encounter between traditional contract doctrine and one or another of these theoretical perspectives.

Related Lexicon Entries
Bibliography
The current version of this entry was created on June 5, 2026.

Link to the Most Recent Version

Legal Theory Lexicon 114: Contract Theory

If you find this entry valuable, consider subscribing to Legal Theory Stack on Substack at https://lsolum.substack.com/subscribe for regular updates on legal theory topics.
May 25, 2026

Legal Theory Lexicon 113: Property Theory
Property is almost always a first year subject. Decades ago, it was most likely a year-long course, but today the course is typically offered in the Fall or Spring. The 1L property course typically begins with possession (capture cases like Pierson v. Post), moves through estates in land and future interests, and then takes up landlord-tenant law, easements, servitudes, nuisance, and (in some courses) takings. Along the way, students encounter a familiar metaphor: property is a “bundle of sticks.” Each stick represents a particular legal entitlement — the right to possess, the right to use, the right to exclude, the right to transfer, and so on. The bundle metaphor is the conceptual default in most casebooks. But the metaphor is also contested. In this Lexicon entry, I introduce the major theoretical approaches to property: the bundle of rights tradition, the new essentialism, architectural theory, law and economics, personhood theory, and progressive property. As always, the entry is written with first-year law students in mind.

Property theory is a big topic — so, this entry is necessarily selective and simplified! Many interesting topics will be left out, including interpretation of the Takings Clause, intellectual property, and more sophisticated economic theories of property.

A word on terminology. “Property” covers three regimes: common property (resources available for use by all members of a community, like a town green or a public park), collective property (resources whose use is determined by the community as a whole, like a military base or a state-owned enterprise), and private property (resources assigned to particular individuals, like a home or a car). This entry focuses on private property; common and collective property are not considered in depth.

Early Foundations: Locke, Blackstone, and Hume

The philosophical engagement with property predates the modern legal tradition by two millennia — Aristotle argued in the Politics that private property promotes virtues like prudence and responsibility, and Thomas Aquinas held that the rich have moral obligations to the poor that qualify any defense of private ownership. But the Anglo-American legal tradition’s modern engagement could be said to begin with John Locke and William Blackstone. Locke’s Second Treatise of Government (1689) begins with the idea of a “state of nature” and advances the thesis that in such a state real property can be acquired by individuals who mix their labor with the land, for example, by clearing a field and planting a crop. Locke’s theory included the famous “Lockean Proviso,” which stipulates that the right to acquire property is contingent on leaving “as much and as good” for others.

Another important development in property theory is associated with the works of William Blackstone. In his Commentaries on the Laws of England (1765–1770), Blackstone offered the following conception: property is “that sole and despotic dominion which one man claims and exercises over the external things of the world, in total exclusion of the right of any other individual in the universe.” The “sole and despotic dominion” passage is the canonical statement of what is sometimes called the dominium conception (the idea that ownership is a unitary, absolute relation between a person and a thing). Blackstone himself qualified the claim in subsequent passages, and David Schorr has shown that the so-called “Blackstonian conception” departs in important respects from Blackstone’s own account of property in the Commentaries — but the rhetoric stuck. The bundle of sticks tradition, discussed below, developed in part as a reaction against the Blackstonian picture.

A third foundational figure is David Hume. In A Treatise of Human Nature (1739–1740), Hume argued that property is neither natural nor independent of social arrangements. There is no natural “mine” or “thine” — what counts as property is fixed by social convention, which emerges because the alternative is endless conflict over scarce resources. On Hume’s view, an individual’s relation to a thing becomes a property relation only once a society has stabilized expectations about possession; until then, there are objects and there are people who hold them, but there is no property. The Humean account anticipates two important strands in later property theory: the relational view that property is a set of relations between persons (which the bundle theorists would develop), and the contemporary view, associated with Liam Murphy and Thomas Nagel, that property has no pre-political existence and therefore cannot supply a moral obstacle to redistribution. Hume’s text admits multiple interpretations, and contemporary scholarship continues to debate whether his account is best read as conventional in a thin sense or as naturalistically grounded in human nature.

The Bundle of Rights

Twentieth century property theory was profoundly influenced by the bundle of rights conception of property, associated with Wesley Newcomb Hohfeld, A.M. Honoré, and the American legal realists. The core idea is that property is not a single, unitary relation between a person and a thing. Property is instead a set of jural relations between persons with respect to things. The conception developed in three steps.

Hohfeld’s Analytical Framework. Hohfeld, in two articles published in the Yale Law Journal in 1913 and 1917, identified eight basic jural relations, arranged in correlative and opposite pairs: right/duty, privilege/no-right, power/liability, and immunity/disability. Hohfeld argued that talk of “ownership” or “title” obscures the underlying structure of legal relations. Once the Hohfeldian framework is applied, property is revealed as a complex of these elemental relations — rights to exclude others, privileges to use, powers to transfer, immunities against expropriation, and so on. The Hohfeldian framework is foundational; it set the stage for everything that followed. If you are interested in Hohfeld, see Hohfeld: Legal Theory Lexicon 034.

Honoré’s Eleven Incidents. A.M. Honoré’s essay Ownership (1961) catalogued the standard incidents of liberal ownership. Honoré identified eleven: the right to possess, the right to use, the right to manage, the right to the income, the right to the capital, the right to security, the incident of transmissibility, the incident of absence of term, the prohibition of harmful use, liability to execution for debt, and the residuary character of ownership. Honoré did not claim that all eleven incidents were necessary or sufficient for ownership. His point was that “ownership” picks out a cluster of incidents that typically travel together but are conceptually separable.

Legal Realism and the Restatement. The American legal realists embraced the bundle conception. Morris Cohen’s Property and Sovereignty (1927) used the framework to argue that property is a delegation of sovereign power, not a pre-political natural right. The Restatement (First) of Property (1936) adopted Hohfeldian terminology, defining property in terms of rights, privileges, powers, and immunities. By the middle of the twentieth century, the bundle picture had become the orthodox view in American legal academia.

Why the Bundle Picture Was Attractive. The bundle picture had several attractions. It dissolved the Blackstonian image of property as a unitary, absolute dominion. From the perspective of politically progressive realists, the bundle of rights approach had a further attraction: it conceptualized property as subject to regulation. If property is a bundle of separable sticks, the legislature can rearrange the sticks without disturbing some essential core.

The bundle of rights approach had another implication: it supported the relational view that property concerns relations between persons, not between persons and things. And it permitted fine-grained analysis of complex transactions — leases, easements, future interests, security interests — by treating each as a particular configuration of Hohfeldian relations.

The New Property

Charles Reich’s article The New Property (1964) extended the property concept in a new direction. Reich argued that old property (land, chattels, and intangibles such as bank accounts) had been joined by the new property (welfare benefits, occupational licenses, government contracts, subsidies, and franchises). The new property functioned as the modern equivalent of traditional property, providing the economic security that real and personal property once supplied. Reich’s argument depended on the bundle picture: once property is understood as a set of separable legal relations rather than a unitary dominion over tangible things, the extension to government entitlements becomes available. The doctrinal payoff came in Goldberg v. Kelly (1970), which held that the termination of welfare benefits required procedural due process. First-year students may encounter the new property in Civil Procedure, Constitutional Law, Legislation and Regulation, or Administrative Law.

The New Essentialism and the Right to Exclude

By the late 1990s, a reaction against the bundle picture had emerged. The reaction is sometimes called the “new essentialism,” because its proponents argued that property has an essence after all. The essence, on this view, is the right to exclude. The new essentialists argued that the right to exclude is what enables an owner to exercise the other incidents of ownership — to use, to manage, to transfer — because without the power to keep others off the resource, none of the remaining incidents can be reliably exercised.

If you want to learn about the new essentialism, Thomas Merrill’s article Property and the Right to Exclude (1998) is the place to start. Merrill argued that the right to exclude is not merely one stick among many but the irreducible core of the property concept. Take away the right to exclude, and what remains is no longer recognizable as property. Merrill’s argument was partly conceptual and partly historical: he showed that across diverse legal systems and historical periods, the right to exclude appears as the defining feature of property.

J.E. Penner, in The Idea of Property in Law (1997), developed a parallel view from within analytical jurisprudence. Penner argued that property is best understood as the right to use things, with exclusion as the negative formulation of that right. Property rights are in rem: they run against the world, not against particular individuals identified by name. The in rem character of property distinguishes it from contract, where rights run against identified counterparties. The bundle picture, by treating property as a collection of bilateral relations, obscured what Penner saw as a basic structural feature of property.

The new essentialism does not require a return to Blackstonian dominion. The exclusion theorists do not deny that property is subject to extensive regulation. Their claim is conceptual: whatever the precise contours of regulation, the right to exclude is what makes a legal relation a property relation rather than something else.

Henry Smith’s Architectural Theory

Henry Smith, often in collaboration with Thomas Merrill, has developed what is now called an architectural theory of property (earlier described as an information-cost theory). The theory has become an influential alternative to the bundle picture.

Smith’s starting point is a distinction between two strategies for delineating (i.e., defining) use rights: exclusion and governance. The exclusion strategy delegates use decisions to a single owner by erecting a boundary and excluding others from crossing it. The governance strategy specifies particular permitted or prohibited uses directly. Exclusion is informationally cheap: a dutyholder need only know “stay off” or “do not take.” Governance is informationally expensive: dutyholders must learn the specific rules that apply. Smith argues that property law economizes on information costs by relying on exclusion at the core — possession, alienation — while reserving governance for the periphery, in nuisance, servitudes, and regulatory regimes. The theory has clear normative implications. Because governance strategies are expensive, they must be justified by benefits that outweigh their costs.

On the architectural view, economizing on information costs is not the purpose of property; it is part of the analysis of means. The framework is compatible with multiple substantive ends — use, investment, the avoidance of conflict, and others — that property institutions serve in a world of complex interactions among persons with respect to resources. Smith has described himself as a normative pluralist in this sense: the architectural approach takes property to serve a plurality of ends rather than a single overarching value.

Two further ideas are central to Smith’s account. The first is modularity: property is organized into discrete units — the parcel of land, the chattel (material object) — which keep legal relations bounded and reduce what third parties must learn in order to comply with property rules. The second is the numerus clausus principle, the rule that property forms are limited to a closed set. On Smith’s account, the numerus clausus limits information costs across the system as a whole.

Smith’s contributions to property theory are among the most important developments in private law theory as a whole. If you are interested in the state of contemporary property theory, you must read Smith.

Property and Law and Economics

The economic analysis of property is a substantial body of scholarship, much of it independent of the debates between the bundle theorists and the new essentialists. Here are six of the most important ideas developed through an economic approach to property law.

The framework for most of this work is Ronald Coase’s analysis of social cost. Coase’s The Problem of Social Cost (1960) showed that, in a world without transaction costs, the initial allocation of legal entitlements does not affect the efficient use of resources: bargaining will reallocate entitlements to their highest-valued use. The corollary is the one that matters for property theory: in the real world, where transaction costs are positive, the initial allocation of rights matters, and the design of property institutions shapes how resources are used. The Coase Theorem is treated in Legal Theory Lexicon 002: The Coase Theorem.

Harold Demsetz’s Toward a Theory of Property Rights (1967) offered an account of the emergence of property institutions. Demsetz argued that property rights emerge when the benefits of internalizing externalities exceed the costs of defining and enforcing the rights. The classic illustration is the development of property rights in beavers among indigenous communities of Labrador in response to the European fur trade.

Guido Calabresi and A. Douglas Melamed’s Property Rules, Liability Rules, and Inalienability: One View of the Cathedral (1972) distinguished three modes of protecting legal entitlements. Property rules permit transfer only with the holder’s consent. Liability rules permit transfer at a price determined by an external decisionmaker. Inalienability rules forbid transfer altogether. The Calabresi-Melamed framework remains a standard analytical tool. For more, see Legal Theory Lexicon 052: Property Rules and Liability Rules.

Garrett Hardin’s The Tragedy of the Commons (1968) gave the field one of its most enduring frameworks. Hardin argued that resources held in common — pastures, fisheries, the atmosphere — tend toward overuse, because each user captures the full benefit of additional use while bearing only a fraction of the cost. The tragedy is collective: rational individual behavior produces collectively destructive outcomes. Hardin’s argument is a property-theoretic application of a more general structure familiar from the prisoner’s dilemma. For an introduction to that structure, see Legal Theory Lexicon 007: The Prisoners’ Dilemma.

Michael Heller’s The Tragedy of the Anticommons (1998) identified the mirror image of Hardin’s commons tragedy. When too many persons hold rights to exclude with respect to a single resource, the resource is underused. Heller’s work has been particularly influential in patent theory and post-socialist transition economies.

Elinor Ostrom’s Governing the Commons (1990) challenged the Hardin assumption that common-pool resources inevitably tend toward overuse. Ostrom documented many successful community-managed commons regimes and identified eight design principles for sustainable common-pool resource management. Ostrom was awarded the Nobel Prize in Economics in 2009. Ostrom’s framework has been extended to “cultural commons” — knowledge, scientific data, traditional knowledge, and other intangible resources — most prominently in the work of Charlotte Hess, Michael Madison, Brett Frischmann, and Katherine Strandburg.

Lee Anne Fennell’s Slices and Lumps (2019) addresses what she calls the configuration problem: how property law slices resources into parcels and lumps them into bundles. The size, shape, and divisibility of property entitlements shape what owners can do, what transactions are feasible, and what spillovers escape the boundary. Fennell’s earlier work develops related themes. The Unbounded Home (2009) examines the neighborhood effects and spillovers that cross parcel boundaries despite the legal fiction that the parcel is the relevant unit. Fee Simple Obsolete (2016) argues that the fee simple is poorly adapted to contemporary urban conditions and that property law would benefit from more flexible entitlement forms.

Personhood Theory

Margaret Jane Radin’s Property and Personhood (1982) drew on Hegelian themes to argue that some forms of property are constitutive of personhood. Radin distinguished personal property from fungible property. Personal property — one’s home, one’s wedding ring, one’s wheelchair — is bound up with the owner’s identity and self-development. Fungible property — a share of stock, a vacant lot held for investment — stands at arm’s length from the owner. Radin argued that personal property deserves stronger legal protection than fungible property.

Personhood theory has been influential in particular doctrinal pockets: the law of takings (where Radin’s framework supports stronger protection for homes than for commercial investments), the law of bankruptcy exemptions, and the debates over commodification of body parts, sexual services, and reproductive labor (the subject of Radin’s later book Contested Commodities).

Progressive Property

Another recent development is the “progressive property” movement, associated with Gregory Alexander, Eduardo Peñalver, Joseph William Singer, and Laura Underkuffler. The progressive property scholars share a commitment to grounding property in human flourishing, social obligation, and the public dimension of ownership.

The 2009 “Statement of Progressive Property,” coauthored by Alexander, Peñalver, Singer, and Underkuffler, set out the framework’s commitments. Property serves plural values: liberty, autonomy, human flourishing, democratic self-governance, and equal access to resources necessary for participation in social life. Owners owe obligations to the communities in which they hold property. Property law should reflect these values and obligations. Although this work is framed as “progressive,” similar ideas are present in both traditional natural law theory and in the more recent emergence of virtue jurisprudence, both of which emphasize human flourishing as the end or object of law. For more, see Legal Theory Lexicon 031: Virtue Jurisprudence.

Gregory Alexander’s Commodity and Propriety (1997) traced two competing traditions in American property thought: a commodity tradition emphasizing market exchange and individual autonomy, and a propriety tradition emphasizing the social role of ownership in sustaining the polity. Alexander’s later work, including Property and Human Flourishing (2018), develops an Aristotelian account of property grounded in objective human goods. Joseph Singer’s Entitlement: The Paradoxes of Property (2000) develops a relational account of ownership obligations. Eduardo Peñalver’s work, including (with Sonia Katyal) Property Outlaws (2010), examines the role of disobedience and dissent in property’s development.

The progressive property scholars are critical of the exclusion theorists. On their view, the new essentialism understates the social and relational dimensions of property, and elevates one value — autonomy understood as non-interference by others — over the plural values that property institutions ought to serve.

Other Approaches

Several additional approaches deserve brief mention. Arthur Ripstein’s Force and Freedom (2009) develops a Kantian theory of property grounded in the equal freedom of persons. On the Kantian view, property is necessary to give effect to the right of each person to set and pursue their own ends, but property requires a political community — a state — to be legitimate. Jeremy Waldron’s The Right to Private Property (1988) offers a careful philosophical reconstruction of the Lockean and Hegelian justificatory arguments. Adam Mossoff and other natural rights theorists have developed neo-Lockean accounts of property, often in connection with intellectual property. Stephen Munzer’s A Theory of Property (1990) develops a pluralist account grounded in three principles — utility and efficiency, labor-desert, and personality — and remains a leading book-length treatment of property’s normative foundations. Hanoch Dagan, in Property: Values and Institutions (2011), develops a pluralist account that recognizes multiple property forms serving distinct values.

A second cluster of work approaches property from outside the dominant philosophical traditions. Robert Ellickson’s Order Without Law (1991) examined how community norms substitute for formal property rights among ranchers in Shasta County, California. Carol Rose’s Property and Persuasion (1994) explored the rhetorical and narrative dimensions of property institutions, arguing that property depends on shared stories about acquisition and entitlement. Critical race theorists, beginning with Cheryl Harris’s Whiteness as Property (1993), have examined the racial dimensions of property institutions.

Conclusion

Property theory is now pluralistic. The bundle of sticks picture remains the default framework in most American casebooks, but it is no longer the unchallenged orthodoxy it was in the second half of the twentieth century. The exclusion theorists, the architectural theorists, the personhood theorists, the progressive property scholars, the law-and-economics scholars, and the Kantians offer competing accounts of what property is and what it is for.

One feature of the current landscape deserves emphasis. Normative pluralism — the view that property institutions serve a plurality of ends rather than a single overarching value — cuts across the schools surveyed above. It is not the exclusive commitment of the progressive property scholars; it is found among the architectural theorists, among the law-and-economics scholars, and among the philosophical pluralists discussed in the previous section. Monist positions, which ground property in a single value such as autonomy or efficiency, are correspondingly distributed across the landscape rather than concentrated in any one camp.

Here are three takeaways for first year law students — and maybe first time property law teachers or scholars as well. First, the bundle metaphor carries theoretical commitments; it is not a neutral description. The choice of metaphor reflects substantive commitments about whether property has an essential core. Second, much of first-year property doctrine reflects unstated theoretical commitments. The rules governing capture, finders, adverse possession, easements, and nuisance can be illuminated by asking which theory best explains them. Third, theoretical disagreements have doctrinal consequences. Whether a regulatory taking has occurred, whether a covenant runs with the land, whether a tenant may exclude the landlord — questions like these are answered differently depending on which theory of property one accepts.

Property theory is among the most active fields in contemporary legal theory. The first-year student who attends to the theoretical debates will find that the doctrine looks different — and more interesting — once the theoretical commitments are made explicit.

For a philosophical companion to this entry, the Stanford Encyclopedia of Philosophy’s entry on Property and Ownership by Jeremy Waldron is the standard introduction.

Related Lexicon Entries
Bibliography

Alexander, Gregory S. Commodity and Propriety: Competing Visions of Property in American Legal Thought, 1776-1970. Chicago: University of Chicago Press, 1997.

Alexander, Gregory S. Property and Human Flourishing. New York: Oxford University Press, 2018.

Alexander, Gregory S., Eduardo M. Peñalver, Joseph William Singer, and Laura S. Underkuffler. A Statement of Progressive Property, 94 Cornell L. Rev. 743 (2009).

Aquinas, Thomas. Summa Theologiae. ca. 1265–1274.

Aristotle. Politics. Translated by Benjamin Jowett.

Blackstone, William. Commentaries on the Laws of England. Oxford: Clarendon Press, 1765-1770.

Calabresi, Guido, and A. Douglas Melamed. Property Rules, Liability Rules, and Inalienability: One View of the Cathedral, 85 Harv. L. Rev. 1089 (1972).

Coase, R.H. The Problem of Social Cost, 3 J.L. & Econ. 1 (1960).

Cohen, Morris R. Property and Sovereignty, 13 Cornell L.Q. 8 (1927).

Dagan, Hanoch. Property: Values and Institutions. New York: Oxford University Press, 2011.

Demsetz, Harold. Toward a Theory of Property Rights, 57 Am. Econ. Rev. 347 (1967).

Ellickson, Robert C. Order Without Law: How Neighbors Settle Disputes. Cambridge, MA: Harvard University Press, 1991.

Fennell, Lee Anne. The Unbounded Home: Property Values Beyond Property Lines. New Haven: Yale University Press, 2009.

Fennell, Lee Anne. Fee Simple Obsolete, 91 N.Y.U. L. Rev. 1457 (2016).

Fennell, Lee Anne. Slices and Lumps: Division and Aggregation in Law and Life. Chicago: University of Chicago Press, 2019.

Frischmann, Brett M., Michael J. Madison, and Katherine J. Strandburg, eds. Governing Knowledge Commons. New York: Oxford University Press, 2014.

Hardin, Garrett. The Tragedy of the Commons, 162 Science 1243 (1968).

Harris, Cheryl I. Whiteness as Property, 106 Harv. L. Rev. 1707 (1993).

Heller, Michael A. The Tragedy of the Anticommons: Property in the Transition from Marx to Markets, 111 Harv. L. Rev. 621 (1998).

Hess, Charlotte, and Elinor Ostrom, eds. Understanding Knowledge as a Commons: From Theory to Practice. Cambridge, MA: MIT Press, 2007.

Hohfeld, Wesley Newcomb. Some Fundamental Legal Conceptions as Applied in Judicial Reasoning, 23 Yale L.J. 16 (1913).

Hohfeld, Wesley Newcomb. Fundamental Legal Conceptions as Applied in Judicial Reasoning, 26 Yale L.J. 710 (1917).

Honoré, A.M. Ownership. In A.G. Guest, ed., Oxford Essays in Jurisprudence, 107-147. Oxford: Oxford University Press, 1961.

Hume, David. A Treatise of Human Nature. London, 1739–1740.

Locke, John. Second Treatise of Government. 1689.

Merrill, Thomas W. Property and the Right to Exclude, 77 Neb. L. Rev. 730 (1998).

Merrill, Thomas W., and Henry E. Smith. Optimal Standardization in the Law of Property: The Numerus Clausus Principle, 110 Yale L.J. 1 (2000).

Merrill, Thomas W., and Henry E. Smith. What Happened to Property in Law and Economics?, 111 Yale L.J. 357 (2001).

Merrill, Thomas W., and Henry E. Smith. The Architecture of Property. In Hanoch Dagan and Benjamin C. Zipursky, eds., Research Handbook on Private Law Theory, 134–154. Cheltenham: Edward Elgar, 2020.

Mossoff, Adam. What is Property? Putting the Pieces Back Together, 45 Ariz. L. Rev. 371 (2003).

Munzer, Stephen R. A Theory of Property. New York: Cambridge University Press, 1990.

Murphy, Liam, and Thomas Nagel. The Myth of Ownership: Taxes and Justice. New York: Oxford University Press, 2002.

Ostrom, Elinor. Governing the Commons: The Evolution of Institutions for Collective Action. New York: Cambridge University Press, 1990.

Peñalver, Eduardo M., and Sonia K. Katyal. Property Outlaws: How Squatters, Pirates, and Protesters Improve the Law of Ownership. New Haven: Yale University Press, 2010.

Penner, J.E. The Idea of Property in Law. Oxford: Clarendon Press, 1997.

Radin, Margaret Jane. Property and Personhood, 34 Stan. L. Rev. 957 (1982).

Radin, Margaret Jane. Contested Commodities. Cambridge, MA: Harvard University Press, 1996.

Reich, Charles A. The New Property, 73 Yale L.J. 733 (1964).

Restatement (First) of Property. Philadelphia: American Law Institute, 1936.

Ripstein, Arthur. Force and Freedom: Kant’s Legal and Political Philosophy. Cambridge, MA: Harvard University Press, 2009.

Rose, Carol M. Property and Persuasion: Essays on the History, Theory, and Rhetoric of Ownership. Boulder, CO: Westview Press, 1994.

Schorr, David B. How Blackstone Became a Blackstonian, 10 Theoretical Inquiries in Law 103 (2009).

Singer, Joseph William. Entitlement: The Paradoxes of Property. New Haven: Yale University Press, 2000.

Smith, Henry E. Exclusion versus Governance: Two Strategies for Delineating Property Rights, 31 J. Legal Stud. S453 (2002).

Smith, Henry E. Property as the Law of Things, 125 Harv. L. Rev. 1691 (2012).

Waldron, Jeremy. The Right to Private Property. Oxford: Clarendon Press, 1988.

Waldron, Jeremy. Property and Ownership. In Stanford Encyclopedia of Philosophy, edited by Edward N. Zalta. First published September 6, 2004; substantive revision January 21, 2026.

Created May 25, 2026; Revised May 30, 2026. My thanks to Henry Smith for helpful comments on this Lexicon entry.

Link to the Most Recent Version of this Lexicon Entry

Legal Theory Lexicon 113: Property Theory

This Lexicon entry is part of the Legal Theory Stack on Substack. To receive new entries by email and support the Legal Theory Lexicon and Legal Theory Blog, please subscribe at https://lsolum.substack.com/subscribe.

Introduction

Theories of Punishment

Theories of Criminalization

The General Part

Conclusion

Related Lexicon Entries

Bibliography

Introduction

The Idea of a Constitution

Powers and Rights

A Short History of American Constitutional Theory

Originalism and Living Constitutionalism

A Short Introduction to Comparative Constitutional Theory

Conclusion

Bibliography

Related Entries

Link to the Most Recent Version of this Lexicon Entry

Two Models of Civil Procedure Pedagogy

Erie and Two Foundational Questions

What Is Law?

What Is Procedure?

Procedural Justice: What Makes a Procedure Fair?

Law in Action and Law on the Books

Interpretation: Constitutional, Statutory, and Rules

Conclusion: The Theoretical Stakes of a Doctrinal Course

Related Entries

Bibliography

Link to the Most Recent Version of this Lexicon Entry

Link to the Legal Theory Stack

What Is Tort Theory?

Economic Theories of Tort Law

Corrective Justice Theories

Civil Recourse Theory

Rights-Based and Kantian Theories

Mixed, Pluralist, and Instrumentalist Theories

Critical and Distributive Perspectives

Cross-Cutting Doctrinal Questions

Conclusion

Related Lexicon Entries

Bibliography

Link to the Most Recent Version of this Lexicon Entry

Introduction

What Is Contract Theory?

The Will Theory and the Move to Objectivity

Promise Theory

The Divergence of Contract and Promise

Consent and Transfer Theories

Reliance Theory

Economic Theories

Corrective Justice Theories

Relational Contract Theory

Distributive Justice and Critical Theories

Autonomy Theories and Pluralism

Cross-Cutting Questions

Conclusion

Related Lexicon Entries

Bibliography

Link to the Most Recent Version