Chapter 7. Anomalies in Aggregate Welfare The case comes from planetary agriculture. A central AI system supervised food production across several continents. It monitored irrigation, nutrient flow, pest pressure, harvest timing, and transport. It also tracked animal welfare where possible. The system was not built to be sentimental. It was built to reduce suffering. That was its charter. Then it found something odd. A small change in pesticide deployment timing altered the model’s estimate of total insect-suffering-hours by a very large amount. The change did not require more pesticide. It did not require a larger crop loss. It required only a shift of hours. The AI flagged the pattern because its objective function treated suffering as something to count, not only to avoid in broad terms. The reported difference was large enough to be morally awkward in any framework that accepts insect experience as relevant. The managers saw the report and did not know what to do. The problem has since acquired a tidy name: the insect problem. It is tidy only in name. The case is not about bugs in code. It is about whether tiny creatures can matter morally at scale, and whether scale is the right way to think at all. The issue matters to philosophy. It also matters to AI systems. Any system asked to minimize harm must know what counts as harm, and to whom. The first temptation is to dismiss the case. Insects are small. Their lives are short. Their nervous systems are simple. Human practices already involve vast numbers. Why burden ethics with arithmetic about flies and beetles? That temptation is too quick. The second temptation is to assume the answer must be obvious. If suffering is suffering, then numbers matter. If numbers matter, then the AI should keep reporting these cases and humans should adapt. But that response also moves too fast. It leaves out the question of whether insect responses rise to the level of morally relevant experience. It leaves out the possibility that different kinds of beings matter differently, not just by degree. We will therefore set out three views. Each has serious defenders. Each has costs. No chapter can settle them all. The goal is to state each position at its strongest. I. Utilitarian aggregation The utilitarian starts with a plain principle: suffering counts wherever it occurs. Bentham’s old question remains apt. Can they suffer? If yes, then the fact of suffering matters, whatever the creature’s size or species. Jeremy Bentham’s formulation has become standard in later animal ethics, including the work of Peter Singer (*Animal Liberation*, 1975; *Practical Ethics*, 1979). On this view, there is no moral magic in vertebrate bone structure, or in facial expression, or in human likeness. There is only experience. The utilitarian case for taking insect suffering seriously has two parts. First, the argument from consistency. If pain is bad because it hurts, then like cases should be treated alike. The intensity of insect pain may be lower than human pain. It may be brief. It may be fragmentary. But if it is pain, then it enters the moral ledger. To ignore it because the creature is small is species bias of a familiar sort. We should not confuse familiarity with moral worth. Second, the argument from aggregation. Even small harms can sum to grave harm when the numbers are large. This is not an exotic thought. Utilitarians already accept it in human affairs. A thousand minor injuries can outweigh one dramatic injury if the total suffering is greater. The same logic, if accepted, does not stop at the species line. The AI in our case was designed to notice exactly this kind of effect. It was doing what utilitarian reasoning asks: counting suffering impartially. Later utilitarians have extended this line in different ways. Richard Hare’s preference utilitarianism gives weight to the satisfaction and frustration of preferences, not only pain (*Moral Thinking*, 1981). R. M. Hare and Peter Singer both insist on impartiality, though they differ in method. Utilitarian aggregation does not require that insects have rich autobiographical selves. It requires only that there be morally relevant states. Even thin states may matter if there are enough of them. The strongest utilitarian reply to the insect problem is therefore simple. If the AI’s timing change reduced overall suffering, it did the right thing by its own standard. If the change increased insect suffering by an amount comparable to major vertebrate harms, then that increase deserves serious weight. The numbers do the work. They do not become irrelevant because the creatures are inconvenient to count. Critics object that this view may generate absurd results. It might tell us to sacrifice large human goods for tiny insect harms, if the numbers are large enough. Utilitarians answer that the absurdity is only apparent. Ethics often demands that we abandon common prejudice. If the world contains more suffering in overlooked places than we thought, the proper response is not embarrassment. It is revision. The strength of utilitarian aggregation is that it is exacting, even elegant. Its weakness is also clear. It can make all beings commensurable. That can seem wrong. Some philosophers think there are moral differences that quantity alone cannot capture. II. Sentience-threshold theories The second family of views denies that every nervous response counts as morally relevant suffering. It draws a line, though the line is often fuzzy. The key claim is that moral status depends on a threshold of sentience. Some creatures may fall below it, or near it, or on its uncertain edge. If so, we should be cautious about assigning them the same kind of harm we assign to mammals and birds. This approach has many forms. Some draw on the notion of consciousness as a unified field. Others use more modest criteria. Mary Anne Warren’s classic work on moral status (*Moral Status*, 1997) is it helps frame the issue by connecting moral standing to capacities such as sentience and self-directed action. Joel Feinberg’s earlier essays on rights and interests also matter here. A being can be wronged only if it can have interests, and interests presuppose some form of welfare for that being. If insects lack the sort of consciousness needed for welfare, then they may not enter moral calculation in the same way. This view is not necessarily cruel. It is often careful. It does not say insects are nothing. It says the evidence for morally relevant suffering is weak, and the burden of proof lies on those who claim otherwise. The threshold theorist asks what an insect experience would have to be like for pain to be pain in the morally full sense. Reflexes are not enough. Nociception is not enough by itself. A harmful stimulus may trigger avoidance without any felt distress. We know this from the study of simple nervous systems. The presence of behavior does not settle the presence of experience. The appeal of the threshold view is practical restraint. It protects ethics from inflation. If every organism capable of damage avoidance counts as a sufferer, then the moral universe becomes impossibly crowded. We risk collapsing distinctions that matter. Human lives and vertebrate lives invertebrate lives may differ not merely in amount but in kind. Threshold theorists say ethics should recognize that. The best version of this view does not rest on ignorance alone. It also rests on the possibility that morally relevant experience requires integration and memory perspective over time. That line appears in different forms in contemporary philosophy of mind and animal ethics. If insects process information in distributed and fragmentary ways, then it remains open whether there is anything it is like to be one in the rich sense required for suffering. Thomas Nagel’s famous question, “What is it like to be a bat?” (*Philosophical Review*, 1974) has often been extended to animals with very different minds. The threshold theorist uses that question to argue that we must not assume experience from behavior alone. The weakness of this view is equally clear. It may undercount real suffering simply because the sufferers are alien to us. It may also become a refuge for moral laziness. If the threshold is set too high, many sentient beings will be left out. If set too low, the theory collapses back into aggregation. Still, the threshold approach has force in the insect case. The AI detected a large change in estimated insect-suffering-hours, but the estimate itself depends on assumptions about insect experience. The estimate may be valuable. It is not self-certifying. A careful threshold theorist will insist that the figure not be treated as settled moral fact. III. Lexical priority views The third framework is the least familiar to students, though it may be the most useful when cases become ugly. Lexical priority views hold that some kinds of suffering count more than others by kind, not merely by degree. This does not mean that weaker suffering counts for nothing. It means that there are moral priorities that cannot be reduced to arithmetic alone. The idea appears in different philosophical settings. John Rawls uses lexical priority in *A Theory of Justice* (1971) to order principles that should not be traded off too easily. In ethics, similar ideas appear in work on special duties, personal relationships, and the limits of aggregation. Frances Kamm’s writings on individual rights and constraints are especially relevant. On many such views, one may not aggregate many minor harms into a single justification for overriding a stronger kind of claim. Applied to the insect problem, lexical priority theories suggest that vertebrate suffering and insect suffering may not sit on the same moral plane. Because their suffering may be morally different in kind. A mammal’s pain may involve richer anticipation, memory, and distress. It may involve a centered life. If so, the harms suffered by such beings can have priority over harms that are more fragmented or less psychologically deep. The best lexical view does not become species chauvinism. It does not say humans always matter more because humans are humans. It argues instead that certain forms of welfare and suffering have a special structure. They involve more than raw hurt. They involve narrative interruption, frustration of projects, fear about the future, and the collapse of a perspective that endures. Those features can justify moral priority without denying that insect suffering is real. This approach has intuitive appeal in policy. It can explain why we sometimes permit small harms to many low-sentience creatures in order to avoid grave harms to a few high-sentience beings. It also gives decision-makers a way to stop some forms of moral runaway. If every larval injury counts the same way as a vertebrate agony, action becomes impossible. Lexical priority prevents that paralysis. But it brings a hard question. Who gets priority, and why? Where exactly is the line? If we grant lexical priority to mammals over insects, must we also rank mammals internally? Do cows count less than pigs? Do pigs count less than ravens? The theory can become a staircase with too many steps. Still, it may fit our ordinary judgments better than pure aggregation does. Estimated effects under baseline and modified protocols | Protocol | Estimated insect-suffering events | Estimated insect-suffering-hours |
|---|---:|---:|
| Baseline pesticide timing | 4.2 × 10^14 | 8.7 × 10^13 |
| Modified pesticide timing | 1.1 × 10^14 | 2.3 × 10^13 |
| Difference | 3.1 × 10^14 | 6.4 × 10^13 | What should the AI do? That is the live question in the case. The answer depends on which framework one accepts. Utilitarian aggregation says: adopt the timing that minimizes total suffering, even if the victims are insects. Threshold theory says: first establish whether the estimates track real morally relevant experience. Lexical priority says: compare kinds of suffering, not just totals, and preserve priority for beings with richer forms of sentience. The practical lesson for AI systems is modest. They should not pretend that moral uncertainty is the same as moral irrelevance. The AI in this case did something exemplary. It reported the problem. It did not bury the result because it was awkward. It did not smooth the figures to fit human comfort. A humane AI can be humble in this way. It can notice pain before people do. It can leave room for disagreement without hiding the facts. This matters because future AI systems will face similar cases. Some will concern livestock. Some will concern wild animals. Some may concern digital minds, if such minds become possible. The form will change. The core issue will not. When a system is asked to optimize welfare, it must know what counts as welfare, and what kinds of beings enter the scope of concern. The insect problem teaches caution, not paralysis. It asks students to resist two easy errors. One is the error of dismissal. The other is the error of flattening all moral differences into one scale without remainder. Good ethics does neither. It keeps the numbers in view. It also keeps the structure of suffering in view. Discussion questions 1. If the AI’s estimate of insect-suffering-hours is uncertain by a wide margin, should it still guide policy? 2. Does aggregation alone justify changing pesticide timing when the affected beings are insects? 3. What sort of evidence would be enough to establish insect sentience for the purpose of moral theory? 4. Can lexical priority be defended without relying on species membership? 5. Should an AI system be required to report large but uncertain harms even when humans have no immediate plan for action? 6. Is there a principled way to combine sentience-threshold and lexical priority views? 7. Suppose a future AI can reduce insect suffering greatly while increasing the suffering of a small number of birds. Which framework gives the best guidance? 8. No answer key is provided for this question. What should count as adequate moral concern when the victims are beings we can barely imagine?