Musonius Rufus (c. 30–before 101-2 C.E.)

Gaius Musonius Rufus was one of the four great Stoic philosophers of the Roman empire, along with Seneca, Epictetus, and Marcus Aurelius. Renowned as a great Stoic teacher, Musonius conceived of philosophy as nothing but the practice of noble behavior. He advocated a commitment to live for virtue, not pleasure, since virtue saves us from the mistakes that ruin life. Though philosophy is more difficult to learn than other subjects, it is more important because it corrects the errors in thinking that lead to errors in acting. He also called for austere personal habits, including the simplest vegetarian diet and minimal, inexpensive garments and footwear, in order to achieve a good, sturdy life in accord with the principles of Stoicism. He believed that philosophy must be studied not to cultivate brilliance in arguments or an excessive cleverness, but to develop good character, a sound mind, and a tough, healthy body. Musonius condemned all luxuries and disapproved of sexual activity outside of marriage. He argued that women should receive the same education in philosophy as men, since the virtues are the same for both sexes. He praised married life and the raising of many children. He affirmed Stoic orthodoxy in teaching that neither death, injury, insult, pain, poverty, nor exile is to be feared, since none of them is an evil.

Table of Contents

  1. Life
  2. Teachings
  3. Philosophy, Philosophers, and Virtue
  4. Food and Frugality
  5. Women and Equal Education
  6. Sex, Marriage, Family, and Old Age
  7. Impact
  8. References and Further Reading

1. Life

Gaius Musonius Rufus was born before 30 C.E. in Volsinii, an Etruscan city of Italy, as a Roman eques (knight), the class of aristocracy ranked second only to senators. He was a friend of Rubellius Plautus, whom the emperor Nero saw as a threat. When Nero banished Rubellius around 60 C.E., Musonius accompanied him into exile in Asia Minor. After Rubellius died in 62 C.E., Musonius returned to Rome, where he taught and practiced Stoicism, which roused the suspicion of Nero. On the discovery of the great conspiracy against Nero led by Calpurnius Piso in 65 C.E., Nero banished Musonius to the arid, desolate island of Gyaros in the Aegean Sea. He returned to Rome under the reign of Galba in 68 C.E. and attempted to persuade the Flavian army approaching Rome to pursue peace. In 70 C.E. Musonius secured the conviction of the philosopher Publius Egnatius Celer, who had betrayed Barea Soranus, a friend of Rubellius Plautus. Musonius was exiled a second time, by Vespasian, but returned to Rome in the reign of Titus. Musonius was highly respected and had a considerable following during his life. He died before 101-2 C.E.

2. Teachings

None of Musonius’ own writings survives: either he wrote nothing himself, or whatever he wrote has been lost. His philosophical teachings survive as thirty-two apophthegms (pithy sayings) and twenty-one longer discourses, all apparently preserved by others and all in Greek, except for Aulus Gellius’ testimonia in Latin. This suggests that he lectured in Greek. Musonius favored a direct and concise style of instruction. He taught that the teacher of philosophy should not present many arguments but rather should offer a few, clear, practical arguments oriented to his listener and couched in terms known to be persuasive to that listener.

3. Philosophy, Philosophers, and Virtue

Musonius believed that (Stoic) philosophy was the most useful of all pursuits. Philosophy persuades us, according to Stoic teaching, that neither life, nor wealth, nor pleasure is a good, and that neither death, nor poverty, nor pain is an evil; thus the latter are not to be feared. Virtue is the only good because it alone keeps us from making errors in living. Moreover, it is only the philosopher who seems to make a study of virtue. The person who claims to be studying philosophy must practice it more diligently than the person studying medicine or some other skill, because philosophy is more important, and more difficult to understand, than any other pursuit. This is because, unlike students of other skills, those who come to philosophy have already been corrupted in their souls by vices and thoughtless habits, having learned things contrary to what they will learn in philosophy. But the philosopher does not study virtue merely as theoretical knowledge. Rather, Musonius insists that practice is more important than theory, since practice leads us to action more effectively than theory does. He held that though everyone is naturally disposed to live without error and has the capacity to be virtuous, someone who has not actually learned the skill of virtuous living cannot be expected to live without error any more than someone who is not a trained doctor, musician, scholar, helmsman, or athlete could be expected to practice those skills without error.

In one of his lectures Musonius recounts the advice he offered to a visiting Syrian king. A king must protect and help his subjects, so a king must know what is good or bad, helpful or harmful, useful or useless for people. But to diagnose these things is precisely the philosopher’s job. Since a king must also know what justice is and make just decisions, a king must study philosophy. A king must possess self-control, frugality, modesty, courage, wisdom, magnanimity, the ability to prevail in speech over others, and the ability to endure pain, and he must be free of error. Philosophy, Musonius argued, is the only art that provides all such virtues. To show his gratitude, the king offered Musonius anything he wanted; Musonius asked only that the king adhere to the principles he had set forth.

Musonius held that since a human being is made of body and soul, we should train both, but the latter demands greater attention. This dual method requires becoming accustomed to cold, heat, thirst, hunger, scarcity of food, a hard bed, abstaining from pleasures, and enduring pains. This method strengthens the body, inures it to suffering, and makes it fit for every task. He believed that the soul is similarly strengthened by developing courage through enduring hardships, and by making it self-controlled through abstaining from pleasures. Musonius insisted that exile, poverty, physical injury, and death are not evils and a philosopher must scorn all such things. A philosopher regards being beaten, jeered at, or spat upon as neither injurious nor shameful and so would never litigate against anyone for any such acts, according to Musonius. He argued that since we acquire all good things by pain, the person who refuses to endure pain all but condemns himself to not being worthy of anything good.

Musonius disparaged the occupations of cooks and chefs while defending farming as a suitable occupation for a philosopher, one posing no obstacle to learning or teaching essential lessons.

4. Food and Frugality

Musonius’ extant teachings emphasize the importance of daily practices. For example, he stressed that what one eats has significant consequences. He believed that mastering one’s appetites for food and drink is the basis for self-control, a vital virtue. He argued that the purpose of food is to nourish and strengthen the body and to sustain life, not to provide pleasure. Digesting our food gives us no pleasure, he reasoned, and the time spent digesting food far exceeds the time spent consuming it. It is digestion which nourishes the body, not consumption. Therefore, he concluded, the food we eat serves its purpose when we are digesting it, not when we are tasting it.

The proper diet, according to Musonius, was lacto-vegetarian. These foods are the least expensive and most readily available: raw fruits in season, certain raw vegetables, milk, cheese, and honeycombs. Cooked grains and some cooked vegetables are also suitable for humans, whereas a meat-based diet is too crude for human beings and is more suitable for wild beasts. Those who eat relatively large amounts of meat seemed slow-witted to Musonius.

We are worse than brute animals when it comes to food, he thought, because we are obsessed with embellishing how our food is presented and fuss over what we eat and how we prepare it merely to amuse our palates. Moreover, too much rich food harms the body. For these reasons, Musonius thought that gastronomic pleasure is undoubtedly the most difficult pleasure to combat. He consequently rejected gourmet cuisine and delicacies as a dangerous habit. He judged gluttony and the craving for gourmet food to be most shameful and to show a lack of moderation. Indeed, Musonius was of the opinion that those who eat the least expensive food can work harder, are the least fatigued by working, become sick less often, tolerate cold, heat, and lack of sleep better, and are stronger than those who eat expensive food. He concluded that responsible people favor what is easy to obtain over what is difficult, what involves no trouble over what does, and what is available over what is not. These preferences promote self-control and goodness.

Musonius advocated a similarly austere philosophy of clothing. The purpose of clothes and footwear is strictly protection from the elements. So clothes and shoes should be modest and inexpensive, not chosen to attract the attention of the foolish. One should dress to strengthen and toughen the body, not bundle up in many layers so as never to experience cold and heat, which makes the body soft and sensitive. Musonius recommended dressing so as to feel somewhat cold in the winter and avoiding shade in the summer. If possible, he advised, go shoeless.

The purpose of houses, he believed, was to protect us from the elements, to keep out cold, excessive heat, and the wind. Our dwelling should protect us and our food the way a cave would. Money should be spent both publicly and privately on people, not on elaborate buildings or fancy décor. Beds or tables of ivory, silver, or gold, hard-to-get textiles, cups of gold, silver, or marble—all such furnishings are entirely unnecessary and shamefully extravagant. Items that are expensive to acquire, hard to use, troublesome to clean, difficult to guard, or impractical, are inferior when compared with inexpensive, useful, and practical items made of cast iron, plain ceramic, wood, and the like. Thoughtless people covet expensive furnishings they wrongly believe are good and noble. He said he would rather be sick than live in luxury, because illness harms only the body, whereas living in luxury harms both the body and the soul. Luxury makes the body weak and soft and the soul undisciplined and cowardly. Musonius judged that luxurious living fosters unvarnished injustice and greed, so it must be completely avoided.

For the ancient Roman philosophers, following their Greek predecessors, the beard was the badge of a philosopher. Musonius said that a man should cut the hair on his scalp the way he prunes vines, by removing only what is useless and bothersome. The beard, on the other hand, should not be shaved, he insisted, because (1) nature provides it to protect a man’s face, and (2) the beard is the emblem of manhood, the human equivalent of the cock’s comb and the lion’s mane. Hair should never be trimmed to beautify or to please women or boys. Hair is no more trouble for men than feathers are for birds, Musonius said. Consequently, he regarded shaving or fastidiously trimming one’s beard as an act of emasculation.

5. Women and Equal Education

Musonius supported his belief that women ought to receive the same education in philosophy as men with the following arguments. First, the gods have given women the same power of reason as men. Reason considers whether an action is good or bad, honorable or shameful. Second, women have the same senses as men: sight, hearing, smell, and the rest. Third, the sexes share the same parts of the body: head, torso, arms, and legs. Fourth, women have an equal desire for virtue and a natural affinity for it. Women, no less than men, are by nature pleased by noble, just deeds and censure their opposites. Therefore, Musonius concluded, it is just as appropriate for women to study philosophy, and thereby to consider how to live honorably, as it is for men.

Moreover, he reasoned, a woman must be able to manage an estate, to account for things beneficial to it, and to supervise the household staff. A woman must also have self-control. She must be free from sexual improprieties and must exercise self-control over other pleasures. She must neither be a slave to desires, nor quarrelsome, nor extravagant, nor vain. A self-controlled woman, Musonius believed, controls her anger, is not overcome by grief, and is stronger than every emotion. But these are the character traits of a beautiful person, whether male or female.

Musonius argued that a woman who studies philosophy would be just, a blameless partner in life, a good and like-minded co-worker, a careful guardian of husband and children, and completely free from the love of gain and greed. She would regard it worse to do wrong than to be wronged, and worse to take more than one’s share than to suffer loss. No one, Musonius insisted, would be more just than she. Moreover, a philosophic woman would love her children more than her own life. She would no more hesitate to fight to protect her children than a hen hesitates to fight predators much larger than herself to protect her chicks.

He considered it appropriate for an educated woman to be more courageous than an uneducated woman and for a woman trained in philosophy to be more courageous than one untrained in philosophy. Musonius noted that the tribe of Amazons conquered many people with weapons, thus demonstrating that women are fully capable of courageously participating in armed conflict. He thought it appropriate for the philosophic woman not to submit to anything shameful out of fear of death or pain. Nor is it appropriate for her to bow down to anyone, whether well-born, powerful, wealthy, or even a tyrant. Musonius saw the philosophic woman as holding the same beliefs as the Stoic man: she thinks nobly, does not judge death to be an evil, nor life to be a good, nor shrinks from pain, nor pursues lack of pain above all else. Musonius thought it likely that this kind of woman would be self-motivated and persevering, would breast-feed her children, serve her husband with her own hands, and do without hesitation tasks which some regard as appropriate for slaves. Thus, he judged that a woman like this would be a great advantage for her husband, a source of honor for her kinfolk, and a good example for the women who know her.

Musonius believed that sons and daughters should receive the same education, since those who train horses and dogs train both the males and the females the same way. He rejected the view that there is one type of virtue for men and another for women. Women and men have the same virtues and so must receive the same education in philosophy.

When it comes to certain tasks, on the other hand, Musonius was less egalitarian. He believed that the nature of males is stronger and that of females is weaker, and so the most suitable tasks ought to be assigned to each nature accordingly. In general, men should be assigned the heavier tasks (e.g. gymnastics and being outdoors), women the lighter tasks (e.g. spinning yarn and being indoors). Sometimes, however, special circumstances—such as a health condition—would warrant men undertaking some of the lighter tasks which seem to be suited for women, and women in turn undertaking some of the harder ones which seem more suited for men. Thus, Musonius concludes that no chores have been exclusively reserved for either sex. Both boys and girls must receive the same education about what is good and what is bad, what is helpful and what is harmful. Shame towards everything base must be instilled in both sexes from infancy on. Musonius’ philosophy of education dictated that both males and females must be accustomed to endure toil, and neither to fear death nor to become dejected in the face of any misfortune.

6. Sex, Marriage, Family, and Old Age

Musonius’ opposition to luxurious living extended to his views about sex. He thought that men who live luxuriously desire a wide variety of sexual experiences, both legitimate and illegitimate, with both women and men. He remarked that sometimes licentious men pursue a series of male sex-partners. Sometimes they grow dissatisfied with available male sex-partners and choose to pursue those who are hard to get. Musonius condemned all such recreational sex acts. He insisted that only those sex acts aimed at procreation within marriage are right. He decried adultery as unlawful and illegitimate. He judged homosexual relationships as an outrage contrary to nature. He argued that anyone overcome by shameful pleasure is base in his lack of self-control, and so blamed the man (married or unmarried) who has sex with his own female slave as much as the woman (married or unmarried) who has sex with her male slave.

Musonius argued that there must be companionship and mutual care of husband and wife in marriage since its chief end is to live together and have children. Spouses should consider all things as common possessions and nothing as private, not even the body itself. But since procreation can result from sexual relations outside of marriage, childbirth cannot be the only motive for marriage. Musonius thought that when each spouse competes to surpass the other in giving complete care, the partnership is beautiful and admirable. But when a spouse considers only his or her own interests and neglects the other’s needs and concerns, their union is destroyed and their marriage cannot help but go poorly. Such a couple either splits up or their relationship becomes worse than solitude.

Musonius advised those planning to marry not to worry about finding partners from noble or wealthy families or those with beautiful bodies. Neither wealth, nor beauty, nor noble birth promotes a sense of partnership or harmony, nor do they facilitate procreation. He judged bodies that are healthy, normal in form, and able to function on their own, and souls that are naturally disposed towards self-control, justice, and virtue, as most fit for marriage.

Since he held that marriage is obviously in accordance with nature, Musonius rejected the objection that being married gets in the way of studying philosophy. He cited Pythagoras, Socrates, and Crates as examples of married men who were fine philosophers. To think that each should look only to his own affairs is to think that a human being is no different from a wolf or any other wild beast whose nature is to live by force and greed. Wild animals spare nothing they can devour, they have no companions, they never work together, and they have no share in anything just, according to Musonius. He supposed that human nature is very much like that of bees, who are unable to live alone and die when isolated. Bees work and act together. Wicked people are unjust and savage and have no concern for a neighbor in trouble. Virtuous people are good, just, and kind, and show love for their fellow human beings and concern for their neighbors. Marriage is the way for a person to create a family, which in turn provides for the well-being of the city. Therefore, Musonius judged that anyone who deprives people of marriage destroys family, city, and the entire human race. He reasoned that this is because humans would cease to exist if there were no procreation, and there would be no just and lawful procreation without marriage. He concluded that marriage is important and serious because great gods (Hera, Eros, Aphrodite) govern it.

Musonius opined that the larger the family, the better. He thought that having many children is beneficial and profitable for cities while having few or none is harmful. Consequently, he praised the wise lawgivers who forbade women from agreeing to be childless and from preventing conception, who enacted punishment for women who induced miscarriages, who punished married couples who were childless, and who honored married couples with many children. He reasoned that just as the man with many friends is mightier than the man without friends, so the father with many children is much mightier than the man who has few or none. Musonius was very impressed by the spectacle of a husband or wife with their many children crowded around them. He believed that no procession for the gods and no sacred ritual is as beautiful to behold, or as worthy of being witnessed, as a chorus of many children leading their parents through the city. Musonius scolded those who offered poverty as an excuse for not raising many children. He was outraged that the well-off refuse to raise later-born children so that the earlier-born may be better off. Musonius believed that it was an unholy act to deprive the earlier-born of siblings to increase their inheritance. He thought it much better to have many siblings than to have many possessions. Possessions invite plots from neighbors. Possessions need protection. Siblings, he taught, are one’s greatest protectors. Musonius opined that the man who enjoys the blessing of many loyal brothers is most worthy of emulation and most beloved by the gods.

When asked whether one’s parents should be obeyed in all matters, Musonius answered as follows. If a father, who was neither a doctor nor knowledgeable about health and sickness, ordered his sick son to take something he believed would help, but which would in fact be useless, or even harmful, and the sick son, knowing this, did not comply with his father’s order, the son would not be acting disobediently. Similarly, if the father was ill and asked for wine or inappropriate food that would worsen his illness if consumed, and the son, knowing better, refused to give them to his ill father, the son would not be acting disobediently. Even less disobedient, Musonius argued, is the son who refuses to steal or embezzle money entrusted to him when his greedy father commands it. The lesson is that refusing to do what one ought not to do merits praise, not blame. The disobedient person disobeys orders that are right, honorable, and beneficial, and acts shamefully in doing so. But to refuse to obey a shameful, blameworthy command of a parent is just and blameless. The obedient person obeys only his parent’s good, appropriate advice, and obeys all such advice. The law of Zeus orders us to be good, and being good is the same thing as being a philosopher, Musonius taught.

He argued that the best provision for old age is living in accord with nature, the definitive goal of life according to the Stoics. Human nature, he thought, can be better understood by comparing it to the nature of other animals. Horses, dogs, and cows are all inferior to human beings. We do not consider a horse to reach its potential by merely eating, drinking, mating without restraint, and doing none of the things suitable for a horse. Nor do we regard a dog as reaching its potential if it merely indulges in all sorts of pleasures while doing none of the things for which dogs are thought to be good. Nor would any other animal reach its potential by being glutted with pleasures but failing to function in a manner appropriate to its species. Hence, no animal comes into existence for pleasure. The nature of each animal determines the virtue characteristic of it. Nothing lives in accord with nature except what demonstrates its virtue through the actions which it performs in accord with its own nature. Therefore, Musonius concluded, the nature of human beings is to live for virtue; we did not come into existence for the sake of pleasure. Those who live for virtue deserve praise, can rightly think well of themselves, and can be hopeful, courageous, cheerful, and joyful. The human being, Musonius taught, is the only creature on earth that is the image of the divine. Since we have the same virtues as the gods, he reasoned, we cannot imagine better virtues than intelligence, justice, courage, and self-control. Therefore, a god, since he has these virtues, is stronger than pleasure, greed, desire, envy, and jealousy. A god is magnanimous and both a benefactor to, and a lover of, human beings. Consequently, Musonius reasoned, inasmuch as a human being is a copy of a god, a human being must be considered to be like a god when he acts in accord with nature. The life of a good man is the best life, and death is its end. Musonius thought that wealth is no defense against old age because wealth lets people enjoy food, drink, sex, and other pleasures, but it never supplies contentment to a wealthy person nor banishes his grief. Therefore, the good man lives without regret and according to nature, accepting death fearlessly and boldly in his old age and living happily and honorably until the end.

7. Impact

Musonius seems to have acted as an advisor to his friends the Stoic martyrs Rubellius Plautus, Barea Soranus, and Thrasea. Musonius’ son-in-law Artemidorus was judged by Pliny to be the greatest of the philosophers of his day, and Pliny himself professed admiration for Musonius. Musonius’ pupils and followers included Fundanus, the eloquent philosopher Euphrates of Tyre, Timocrates of Heracleia, Athenodotus (the teacher of Marcus Cornelius Fronto), and the golden-tongued orator Dio of Prusa. His greatest student was Epictetus, who mentions him several times in The Discourses of Epictetus, written down by Epictetus’ student Arrian. Epictetus’ philosophy was deeply influenced by Musonius.

After his death Musonius was admired by philosophers and theologians alike. The Stoic Hierocles and the Christian apologist Clement of Alexandria were strongly influenced by Musonius. Roman emperor Julian the Apostate said that Musonius became famous because he endured his sufferings with courage and sustained with firmness the cruelty of tyrants. Dio of Prusa judged that Musonius enjoyed a reputation greater than any one man had attained for generations and that he was the man who, since the time of the ancients, had lived most nearly in conformity with reason. The Greek sophist Philostratus declared that Musonius was unsurpassed in philosophic ability.

8. References and Further Reading

  • Engel, David M. ‘The Gender Egalitarianism of Musonius Rufus.’ Ancient Philosophy 20 (Fall 2000): 377-391.
  • Geytenbeek, A. C. van. Musonius Rufus and Greek Diatribe. Wijsgerige Texten en Studies 8. Assen: Van Gorcum, 1963.
  • Inwood, Brad. ‘The Legacy of Musonius Rufus.’ Pages 254-276 in From Stoicism to Platonism: The Development of Philosophy, 100 BCE-100 CE. Ed. T. Engberg-Pedersen. New York: Cambridge University Press, 2017.
  • King, Cynthia (trans.). Musonius Rufus. Lectures and Sayings. CreateSpace, 2011.
  • Klassen, William. ‘Musonius Rufus, Jesus, and Paul: Three First-Century Feminists.’ Pages 185-206 in From Jesus to Paul: Studies in Honour of Francis Wright Beare. Ed. P. Richardson and J. C. Hurd. Waterloo, ON: Wilfrid Laurier University Press, 1984.
  • Musonius Rufus. That One Should Disdain Hardships: The Teachings of a Roman Stoic. Trans. C. E. Lutz; Intro. G. Reydams-Schils. New Haven and London: Yale University Press, 2020.
  • Nussbaum, Martha C. ‘The Incomplete Feminism of Musonius Rufus, Platonist, Stoic, and Roman.’ Pages 283-326 in The Sleep of Reason. Erotic Experience and Sexual Ethics in Ancient Greece and Rome. Ed. M. C. Nussbaum and J. Sihvola. Chicago: The University of Chicago Press, 2002.
  • Stephens, William O. ‘The Roman Stoics on Habit.’ Pages 37-65 in A History of Habit: From Aristotle to Bourdieu. Ed. T. Sparrow and A. Hutchinson. Lanham, MD: Lexington Books, 2013.

Author Information

William O. Stephens
Email: stoic@creighton.edu
Creighton University
U. S. A.

The Philosophy of Anthropology

The Philosophy of Anthropology refers to the central philosophical perspectives which underpin, or have underpinned, the dominant schools in anthropological thinking. It is distinct from Philosophical Anthropology, which attempts to define and understand what it means to be human.

This article provides an overview of the most salient anthropological schools, the philosophies which underpin them and the philosophical debates surrounding these schools within anthropology. It specifically operates within these limits because the broader discussions surrounding the Philosophy of Science and the Philosophy of Social Science have been dealt with at length elsewhere in this encyclopedia. Moreover, the specific philosophical perspectives have also been discussed in great depth in other contributions, so they will be elucidated only to the extent that this is useful for comprehending their relationship with anthropology. In examining the Philosophy of Anthropology, it is necessary to draw some, even if cautious, borders between anthropology and other disciplines. Accordingly, in drawing upon anthropological discussions, we will define as anthropologists scholars who identify as such and who publish in anthropological journals and the like. In addition, early anthropologists will be identified by virtue of their interest in peasant culture and non-Western, non-capitalist and stateless forms of human organization.

The article specifically aims to summarize the philosophies underpinning anthropology, focusing on the way in which anthropology has drawn upon them. The philosophies themselves have been dealt with in depth elsewhere in this encyclopedia. It has been suggested by philosophers of social science that anthropology tends to reflect, at any one time, the dominant intellectual philosophy because, unlike the physical sciences, it relies on qualitative methods and so can more easily be influenced by ideology (for example Kuznar 1997 or Andreski 1974). This article begins by examining what is commonly termed ‘physical anthropology.’ This is the science-oriented form of anthropology which came to prominence in the nineteenth century. As part of this section, the article also examines early positivist social anthropology, the historical relationship between anthropology and eugenics, and the philosophy underpinning this.

The next section examines naturalistic anthropology. ‘Naturalism,’ in this usage, is drawn from the biological ‘naturalists’ who collected specimens in nature and described them in depth, in contrast to ‘experimentalists.’ Anthropological ‘naturalists’ thus conduct fieldwork with groups of people rather than engage in more experimental methods. The naturalism section looks at the philosophy underpinning the development of ethnography-focused anthropology, including cultural determinism, cultural relativism, fieldwork ethics and the many criticisms which this kind of anthropology has provoked. Differences in its development in Western and Eastern Europe are also analyzed. As part of this, the article discusses the most influential schools within naturalistic anthropology and their philosophical foundations.

The article then examines Post-Modern or ‘Contemporary’ anthropology. This school grew out of the ‘Crisis of Representation’ in anthropology beginning in the 1970s. The article looks at how the Post-Modern critique has been applied to anthropology, and it examines the philosophical assumptions behind developments such as auto-ethnography. Finally, it examines the view that there is a growing philosophical split within the discipline.

Table of Contents

  1. Positivist Anthropology
    a. Physical Anthropology
    b. Race and Eugenics in Nineteenth Century Anthropology
    c. Early Evolutionary Social Anthropology
  2. Naturalist Anthropology
    a. The Eastern European School
    b. The Ethnographic School
    c. Ethics and Participant Observation Fieldwork
  3. Anthropology since World War I
    a. Cultural Determinism and Cultural Relativism
    b. Functionalism and Structuralism
    c. Post-Modern or Contemporary Anthropology
  4. Philosophical Dividing Lines
    a. Contemporary Evolutionary Anthropology
    b. Anthropology: A Philosophical Split?
  5. References and Further Reading

1. Positivist Anthropology

a. Physical Anthropology

Anthropology itself began to develop as a separate discipline in the mid-nineteenth century, as Charles Darwin’s (1809-1882) Theory of Evolution by Natural Selection (Darwin 1859) became widely accepted among scientists. Early anthropologists attempted to apply evolutionary theory within the human species, focusing on physical differences between different human sub-species or racial groups (see Eriksen 2001) and the perceived intellectual differences that followed.

The philosophical assumptions of these anthropologists were, to a great extent, the same assumptions which have been argued to underpin science itself. This is the positivism, rooted in Empiricism, which argued that knowledge could only be reached through the empirical method and that statements were meaningful only if they could be empirically justified, though it should be noted that Darwin should not necessarily be termed a positivist. Science needed to be solely empirical, systematic and exploratory, logical, theoretical (and thus focused on answering questions). It needed to attempt to make predictions which are open to testing and falsification, and it needed to be epistemologically optimistic (assuming that the world can be understood). Equally, positivism argues that truth-statements are value-neutral, something disputed by the postmodern school. Philosophers of Science, such as Karl Popper (1902-1994) (for example Popper 1963), have also stressed that science must be self-critical, prepared to abandon long-held models as new information arises, and thus characterized by falsification rather than verification, though this point was also suggested earlier by Herbert Spencer (1820-1903) (for example Spencer 1873). Nevertheless, the philosophy of early physical anthropologists included a belief in empiricism, the fundamentals of logic and epistemological optimism. This philosophy has been criticized by anthropologists such as Risjord (2007), who has argued that it is not self-aware – because values, he claims, are always involved in science – and that non-neutral scholarship can be useful in science because it forces scientists to better contemplate their ideas.

b. Race and Eugenics in Nineteenth Century Anthropology

During the mid-nineteenth and early twentieth centuries, anthropologists began to systematically examine the issue of racial differences, something which became even more researched after the acceptance of evolutionary theory (see Darwin 1871). That said, it should be noted that Darwin himself did not specifically advocate eugenics or theories of progress. However, even prior to Darwin’s presentation of evolution (Darwin 1859), scholars were already attempting to understand ‘races’ and the evolution of societies from ‘primitive’ to complex (for example Tylor 1865).

Early anthropologists such as Englishman John Beddoe (1826-1911) (Beddoe 1862) or Frenchman Arthur de Gobineau (1816-1882) (Gobineau 1915) developed and systematized racial taxonomies which divided humanity, for example, into ‘black,’ ‘yellow’ and ‘white.’ For these anthropologists, societies were reflections of their racial inheritance, a viewpoint termed biological determinism. The concept of ‘race’ has been criticized within anthropology, variously, as being simplistic and as not being a predictive (and thus not a scientific) category (for example Montagu 1945), and there was already some criticism of the scope of its predictive validity in the mid-nineteenth century (for example Pike 1869). The concept has also been criticized on ethical grounds, because racial analysis is seen to promote racial violence and discrimination and uphold a certain hierarchy, and some have suggested its rejection because of its connotations with such regimes as National Socialism or Apartheid, meaning that it is not a neutral category (for example Wilson 2002, 229).

Those anthropologists who continue to employ the category have argued that ‘race’ is predictive in terms of life history, that it involves only the same inherent problems as any cautiously essentialist taxonomy, and that moral arguments are irrelevant to the scientific usefulness of a category of apprehension (for example Pearson 1991). But, to a great extent, current anthropologists reject racial categorization. The American Anthropological Association’s (1998) ‘Statement on Race’ began by asserting that: ‘“Race” thus evolved as a worldview, a body of prejudgments that distorts our ideas about human differences and group behavior. Racial beliefs constitute myths about the diversity in the human species and about the abilities and behavior of people homogenized into “racial” categories.’ In addition, a 1985 survey by the American Anthropological Association found that only a third of cultural anthropologists (but 59 percent of physical anthropologists) regarded ‘race’ as a meaningful category (Lynn 2006, 15). Accordingly, there is general agreement amongst anthropologists that the idea, promoted by anthropologists such as Beddoe, that there is a racial hierarchy, with the white race as superior to others, involves importing the old ‘Great Chain of Being’ (see Lovejoy 1936) into scientific analysis and should be rejected as unscientific, as should ‘race’ itself. In terms of philosophy, some aspects of nineteenth century racial anthropology might be seen to reflect the theories of progress that developed in the nineteenth century, such as those of G. W. F. Hegel (1770-1831) (see below). In addition, though we will argue that Herderian nationalism is more influential in Eastern Europe, we should not regard it as having no influence at all in British anthropology. Native peasant culture, the staple of the Eastern European, Romantic nationalism-influenced school (as we will see), was studied in nineteenth century Britain, especially in Scotland and Wales, though it was specifically classified as ‘folklore’ and as outside anthropology (see Rogan 2012). However, as we will discuss, the influence is stronger in Eastern Europe.

The interest in race in anthropology developed alongside a broader interest in heredity and eugenics. Influenced by positivism, scholars such as Herbert Spencer (1873) applied evolutionary theory as a means of understanding differences between societies. Spencer was also seemingly influenced, on some level, by theories of progress of the kind advocated by Hegel and even found in Christian theology. For him, evolution logically led to eugenics. Spencer argued that evolution involved a progression through stages of ever increasing complexity – from lower forms to higher forms – to an end-point at which humanity was highly advanced and in a state of equilibrium with nature. For this perfected humanity to be reached, humans needed to engage in self-improvement through selective breeding.

American anthropologist Madison Grant (1865-1937) (Grant 1916), for example, reflected a significant anthropological view in 1916 when he argued that humans, and therefore human societies, were essentially reflections of their biological inheritance and that environmental differences had almost no impact on societal differences. Grant, as with other influential anthropologists of the time, advocated a program of eugenics in order to improve the human stock. According to this program, efforts would be made to encourage breeding among the supposedly superior races and social classes and to discourage it amongst the inferior races and classes (see also Galton 1909). This form of anthropology has been criticized for having a motivation other than the pursuit of truth, which has been argued to be the only appropriate motivation for any scientist. It has also been criticized for basing its arguments on a disputed system of categories – race – and for uncritically holding certain assumptions about what is good for humanity (for example Kuznar 1997, 101-109). It should be emphasized that though eugenics was widely accepted among anthropologists in the nineteenth century, there were also those who criticized it and its assumptions (for example Boas 1907; see Stocking 1991 for a detailed discussion). Proponents have countered that a scientist’s motivations are irrelevant as long as his or her research is scientific, that race should not be a controversial category from a philosophical perspective and that it is for the good of science itself that the more scientifically-minded are encouraged to breed (for example Cattell 1972). As noted, some scholars stress the utility of ideologically-based scholarship.

A further criticism of eugenics is that it fails to recognize the supposed inherent worth of all individual humans (for example Pichot 2009). Advocates of eugenics, such as Grant (1916), dismiss this as a ‘sentimental’ dogma which fails to accept that humans are animals, as acceptance of evolutionary theory, it is argued, obliges people to accept, and which would lead to the decline of civilization and science itself. We will note possible problems with this perspective in our discussion of ethics. Also, it might be useful to mention that the form of anthropology that is sympathetic to eugenics is today centered around an academic journal called The Mankind Quarterly, which critics regard as ‘racist’ (for example Tucker 2002, 2) and even academically biased (for example Ehrenfels 1962). Although ostensibly an anthropology journal, it also publishes psychological research. A prominent example of such an anthropologist is Roger Pearson (b. 1927), the journal’s current editor. But such a perspective is highly marginal in current anthropology.

c. Early Evolutionary Social Anthropology

Also from the middle of the nineteenth century, there developed a school in Western European and North American anthropology which focused less on race and eugenics and more on answering questions relating to human institutions and how they evolved, such as ‘How did religion develop?’ or ‘How did marriage develop?’ This school was known as ‘cultural evolutionism.’ Members of this school, such as Sir James Frazer (1854-1941) (Frazer 1922), were influenced by the positivist view that science was the best model for answering questions about social life. They also shared with other evolutionists an acceptance of a modal human nature which reflected evolution to a specific environment. However, some, such as E. B. Tylor (1832-1917) (Tylor 1871), argued that human nature was the same everywhere, moving away from the focus on human intellectual differences according to race. The early evolutionists believed that, as surviving ‘primitive’ social organizations within European empires were examples of ‘primitive Man,’ the nature of humanity, and the origins of its institutions, could best be understood through analysis of these various social groups and their relationship with more ‘civilized’ societies (see Gellner 1995, Ch. 2).

As with the biological naturalists, scholars such as Frazer and Tylor collected specimens on these groups – in the form of missionary descriptions of ‘tribal life’ or descriptions of ‘tribal life’ by Westernized tribal members – and compared them to accounts of more advanced cultures. Using this method of accruing sources, now termed ‘armchair anthropology’ by its critics, the early evolutionists attempted to answer discrete questions about the origins and evolution of societal institutions. As early sociologist Emile Durkheim (1858-1917) (Durkheim 1965) summarized it, such scholars aimed to discover ‘social facts.’ For example, Frazer concluded, based on his sources, that societies evolved from being dominated by a belief in Magic, to a belief in Spirits, and then to a belief in gods and ultimately one God. For Tylor, religion began with ‘animism’ and evolved into more complex forms, but tribal animism was the essence of religion and it had developed in order to aid human survival.

This school of anthropology has been criticized because of its perceived inclination towards reductionism (such as defining ‘religion’ purely as ‘survival’), its speculative nature and its failure to appreciate the problems inherent in relying on sources, such as ‘gate keepers’ who will present their group in the light in which they want it to be seen. Defenders have countered that without attempting to understand the evolution of societies, social anthropology has no scientific aim and can turn into a political project or simply a description of perceived oddities (for example Hallpike 1986, 13). Moreover, the kind of stage theories advocated by Tylor have been criticized for conflating evolution with historicist theories of progress, by arguing that societies always pass through certain phases of belief and that Western civilization is the pinnacle of development, a belief known as unilinealism. This latter point has been criticized as ethnocentric (for example Eriksen 2001) and reflects some of the thinking of Herbert Spencer, who was influential in early British anthropology.

2. Naturalist Anthropology

a. The Eastern European School

Whereas Western European and North American anthropology was oriented towards studying the peoples within the empires run by the Western powers and was influenced by Darwinian science, Eastern European anthropology developed among nascent Eastern European nations. This form of anthropology was strongly influenced by Herderian nationalism and ultimately by Hegelian political philosophy and the Romantic Movement of eighteenth century philosopher Jean-Jacques Rousseau (1712-1778). Eastern European anthropologists believed, following the Romantic Movement, that industrial or bourgeois society was corrupt and sterile. The truly noble life was found in the simplicity and naturalness of communities close to nature. The most natural form of community was a nation of people, bonded together by shared history, blood and customs, and the most authentic form of such a nation’s lifestyle was to be found amongst its peasants. Accordingly, Eastern European anthropology elevated peasant life as the most natural form of life, a form of life that should, on some level, be striven towards in developing the new ‘nation’ (see Gellner 1995).

Eastern European anthropologists, many of them motivated by Romantic nationalism, focused on studying their own nations’ peasant culture and folklore in order to preserve it and because the nation was regarded as unique and studying its most authentic manifestation was therefore seen as a good in itself. As such, Eastern European anthropologists engaged in fieldwork amongst the peasants, observing and documenting their lives. At the time of writing, there is a degree to which this kind of anthropology – or ‘ethnology’ – remains more popular in Eastern than in Western Europe (see, for example, Ciubrinskas 2007 or Sarkany ND).

Siikala (2006) observes that Finnish anthropology is now moving towards the Western model of fieldwork abroad but as recently as the 1970s was still predominantly the study of folklore and peasant culture. Baranski (2009) notes that in Poland, Polish anthropologists who wish to study international topics still tend to go to the international centers while those who remain in Poland tend to focus on Polish folk culture, though the situation is slowly changing. Lithuanian anthropologist Vytis Ciubrinskas (2007) notes that throughout Eastern Europe there is very little separate ‘anthropology,’ the focus being ‘national ethnology’ and ‘folklore studies,’ almost always published in the vernacular. But, again, he observes that the kind of anthropology popular in Western Europe is making inroads into Eastern Europe. In Russia, national ethnology and peasant culture also tend to be predominant (for example Baiburin 2005). Indeed, even beyond Eastern Europe, it was noted in the year 2000 that ‘the emphasis of Indian social anthropologists remains largely on Indian tribes and peasants. But the irony is that barring the detailed tribal monographs prepared by the British colonial officers and others (. . .) before Independence, we do not have any recent good ethnographies of a comparable type’ (Srivastava 2000). By contrast, Japanese social anthropology has traditionally been in the Western model, studying cultures more ‘primitive’ than its own (such as Chinese communities), at least in the nineteenth century. Only later did it start to focus more on Japanese folk culture, and it is now moving back towards a Western model (see Sedgwick 2006, 67).

The Eastern school has been criticized for uncritically placing a set of dogmas – specifically nationalism – above the pursuit of truth, accepting a form of historicism with regard to the unfolding of the nation’s history and drawing a sharp, essentialist line around the nationalist period of history (for example Popper 1957). Its anthropological method has been criticized because, it is suggested, Eastern European anthropologists suffer from home blindness. By virtue of having been raised in the culture which they are studying, they cannot see it objectively and penetrate to its ontological presuppositions (for example Kapferer 2001).

b. The Ethnographic School

The Ethnographic school, which has since come to characterize social and cultural anthropology, was developed by Polish anthropologist Bronislaw Malinowski (1884-1942) (for example Malinowski 1922). Malinowski, originally trained in Poland, developed an anthropological philosophy that brought together key aspects of the Eastern and Western schools. He argued that, as with the Western European school, anthropologists should study foreign societies. This avoided home blindness and allowed them to better perceive these societies objectively. However, as with the Eastern European school, he argued that anthropologists should observe these societies in person, a method termed ‘participant observation’ or ‘ethnography.’ This method, he argued, solved many of the problems inherent in armchair anthropology.

It is this method which anthropologists generally summarize as ‘naturalism’ in contrast to the ‘positivism,’ usually followed alongside a quantitative method, of evolutionary anthropologists. Naturalist anthropologists argue that their method is ‘scientific’ in the sense that it is based on empirical observation, but they argue that some kinds of information cannot be obtained in laboratory conditions or through questionnaires, both of which lend themselves to quantitative, strictly scientific analysis. Human culturally-influenced actions differ from the subjects of physical science because they involve meaning within a system, and meaning can only be discerned after long-term immersion in the culture in question. Naturalists therefore argue that a useful way to find out information about and understand a people – such as a tribe – is to live with them, observe their lives, gain their trust and eventually live, and even think, as they do. This latter aim, specifically highlighted by Malinowski, has been termed the empathetic perspective and is considered, by many naturalist anthropologists, to be a crucial sign of research that is anthropological. In addition to these ideas, the naturalist perspective draws upon aspects of the Romantic Movement in that it stresses, and elevates, the importance of ‘gaining empathy’ and respecting the group being studied; some naturalists argue that there are ‘ways of knowing’ other than science (for example Rees 2010) and that respect for the group can be more important than gaining new knowledge. They also argue that human societies are so complex that they cannot simply be reduced to biological explanations.

In many ways, the successor to Malinowski as the most influential cultural anthropologist was the American Clifford Geertz (1926-2006). Where Malinowski emphasized ‘participant observation’ – and thus, to a greater degree, an outsider perspective – it was Geertz who argued that the successful anthropologist reaches a point where he sees things from the perspective of the native. The anthropologist should bring alive the native point of view, which Roth (1989) notes ‘privileges’ the native, thus challenging a hierarchical relationship between the observed and the observer. He thus strongly rejected a distinction of which Malinowski was merely critical: the distinction between a ‘primitive’ and a ‘civilized’ culture. In many respects, this distinction was also criticized by the Structuralists – whose central figure, Claude Levi-Strauss (1908-2009), was of an earlier generation than Geertz – as they argued that all human minds involve similar binary structures (see below).

However, there was a degree to which both Malinowski and Geertz did not divorce ‘culture’ from ‘biology.’ Malinowski (1922) argued that anthropological interpretations should ultimately be reducible to human instincts while Geertz (1973, 46-48) argued that culture can be reduced to biology and that culture also influences biology, though he felt that the main aim of the ethnographer was to interpret. Accordingly, it is not for the anthropologist to comment on the culture in terms of its success or the validity of its beliefs. The anthropologist’s purpose is merely to record and interpret.

The majority of those who practice this form of anthropology are interpretivists. They argue that the aim of anthropology is to understand the norms, values, symbols and processes of a society and, in particular, their ‘meaning’ – how they fit together. This lends itself to the more subjective methods of participant observation. Applying a positivist methodology to studying social groups is regarded as dangerous because scientific understanding is argued to lead to better controlling the world and, in this case, controlling people. Interpretivist anthropology has been criticized, variously, as being indebted to imperialism (see below) and as too subjective and unscientific, because, unless there is a common set of analytical standards (such as an acceptance of the scientific method, at least to some extent), there is no reason to accept one subjective interpretation over another. This criticism has, in particular, been leveled against naturalists who accept cultural relativism (see below).

Also, many naturalist anthropologists emphasize the separateness of ‘culture’ from ‘biology,’ arguing that culture cannot simply be traced back to biology but rather is, to a great extent, independent of it; a separate category. For example, Risjord (2000) argues that anthropology ‘will never reach the social reality at which it aims’ precisely because ‘culture’ cannot simply be reduced to a series of scientific explanations. But it has been argued that if the findings of naturalist anthropology are not ultimately consilient with science then they are not useful to people outside of naturalist anthropology and that naturalist anthropology draws too stark a line between apes and humans when it claims that human societies are too complex to be reduced to biology or that culture is not closely reflective of biology (Wilson 1998, Ch. 1). In this regard, Bidney (1953, 65) argues that, ‘Theories of culture must explain the origins of culture and its intrinsic relations to the psychobiological nature of man’ as to fail to do so simply leaves the origin of culture as a ‘mystery or an accident of time.’

c. Ethics and Participant Observation Fieldwork

From the 1970s, the various leading anthropological associations began to develop codes of ethics. This was, at least in part, inspired by the perceived collaboration of anthropologists with US-led counterinsurgency groups in South American states. For example, in the 1960s, Project Camelot commissioned anthropologists to look into the causes of insurgency and revolution in South American states, with a view to confronting these perceived problems. It was also inspired by the way that increasing numbers of anthropologists were employed outside of universities, in the private sector (see Sluka 2007).

The leading anthropological bodies – such as the Royal Anthropological Institute – hold to a system of research ethics which anthropologists conducting fieldwork are expected, though not obliged, to adhere to. For example, the most recent American Anthropological Association Code of Ethics (1998) emphasizes that certain ethical obligations can supersede the goal of seeking new knowledge. Anthropologists, for example, may not publish research which may harm the ‘safety,’ ‘privacy’ or ‘dignity’ of those whom they study; they must explain their fieldwork to their subjects and emphasize that attempts at anonymity may sometimes fail; they should find ways of reciprocating to those whom they study; and they should preserve opportunities for future fieldworkers.

Though the American Anthropological Association does not make its philosophy explicit, much of it appears to be underpinned by the golden rule: one should treat others as one would wish to be treated oneself. In this regard, one would not wish to be exploited, misled, or have one’s safety or privacy compromised. For some scientists, the problem with such a philosophy is that, from their perspective, humans should be an object of study like any other. The assertion that the ‘dignity’ of the individual should be preserved may be seen to reflect a humanist belief in the inherent worth of each human being. Humanism has been accused of being sentimental and of failing to appreciate the substantial intellectual differences between human beings, with some anthropologists even questioning the usefulness of the broad category ‘human’ (for example Grant 1916). It has also been accused of failing to appreciate that, from a scientific perspective, humans are a highly evolved form of ape and scholars who study them should attempt to think, as Wilson (1975, 575) argues, as if they were alien zoologists. Equally, it has been asked why the primary ethical responsibility should be to those studied rather than to the public or the funding body (see Sluka 2007). In this regard, it might be suggested that the code reflects a lauding of members of (often non-Western) cultures which might ultimately be traced back to the Romantic Movement: their rights are more important than those of the funders, the public, or other anthropologists.

Equally, the code has been criticized in terms of power dynamics, with critics arguing that the anthropologist is usually in a dominant position over those being studied, which renders questionable the whole idea of ‘informed consent’ (Bourgois 2007). Indeed, it has been argued that the most recent American Anthropological Association Code of Ethics (1998) is a movement to the right, in political terms, because it explicitly accepts that responsibility should also be to the public and to funding bodies, and because it is less censorious than previous codes with regard to covert research (Pels 1999). This seems to be a movement towards a situation in which commitment to the group being studied is less important than the pursuit of truth, though the commitment to the subject of study is still clear.

Likewise, the most recent set of ethical guidelines from the Association of Social Anthropologists of the UK and the Commonwealth implicitly accepts that there is a difference of opinion among anthropologists regarding to whom they are obligated. It asserts, ‘Most anthropologists would maintain that their paramount obligation is to their research participants . . .’ This document specifically warns against giving subjects ‘self-knowledge which they did not seek or want.’ This may be seen to reflect a belief in a form of cultural relativism: permitting people to preserve their way of thinking is more important than their knowing what a scientist would regard as the truth. Their way of thinking – a part of their culture – should be respected, because it is theirs, even if it is inaccurate. This could conceivably prevent anthropologists from publishing dissections of particular cultures if they might be read by members of that culture (see Dutton 2009, Ch. 2). Thus, philosophically, the debate in fieldwork ethics ranges from a form of consequentialism to, in the form of humanism, a deontological form of ethics. However, it should be emphasized that the standard fieldwork ethics noted are very widely accepted amongst anthropologists, particularly with regard to informed consent. Thus, the idea of experimenting on unwilling or unknowing humans is strongly rejected, which might be interpreted to imply some belief in human separateness.

3. Anthropology since World War I

a. Cultural Determinism and Cultural Relativism

As already discussed, Western European anthropology around the time of World War I was influenced by eugenics and biological determinism. But as early as the 1880s this was being questioned by the German-American anthropologist Franz Boas (1858-1942) (for example Boas 1907), based at Columbia University in New York. He was critical of biological determinism and argued for the importance of environmental influence on individual personality, and thus on modal national personality, in a way of thinking called ‘historical particularism.’

Boas emphasized the importance of environment and history in shaping different cultures, arguing that all humans were biologically relatively similar and rejecting distinctions of ‘primitive’ and ‘civilized.’ Boas also presented critiques of the work of early evolutionists, such as Tylor, demonstrating that societies did not all pass through the phases Tylor suggested, or did not pass through them in the order he suggested. Boas used these findings to stress the importance of understanding societies individually in terms of their history and culture (for example Freeman 1983).

Boas sent his student Margaret Mead (1901-1978) to American Samoa to study the people there with the aim of proving that they were a ‘negative instance’ in terms of violence and teenage angst. If this could be proven, it would undermine biological determinism and demonstrate that people were in fact culturally determined and that biology had very little influence on personality, echoing John Locke’s (1632-1704) concept of the tabula rasa. This would in turn mean that Western people’s supposed teenage angst could be changed by changing the culture. After six months in American Samoa, Mead returned to the USA and published, in 1928, her influential book Coming of Age in Samoa: A Psychological Study of Primitive Youth for Western Civilization (Mead 1928). It portrayed Samoa as a society of sexual liberty in which there were none of the problems associated with puberty in Western civilization. Accordingly, Mead argued that she had found a negative instance and that humans were overwhelmingly culturally determined. At around the same time Ruth Benedict (1887-1948), also a student of Boas’s, published her research in which she argued that individuals simply reflected the ‘culture’ in which they were raised (Benedict 1934).

The cultural determinism advocated by Boas, Benedict and especially Mead became very popular and developed into a school which has been termed ‘Multiculturalism’ (Gottfried 2004). This school can be compared to Romantic nationalism in the sense that it regards all cultures as unique developments which should be preserved, and it thus advocates a form of ‘cultural relativism’ in which cultures cannot be judged by the standards of other cultures and can only be comprehended in their own terms. However, it should be noted that ‘cultural relativism’ is sometimes used to refer to the way in which the parts of a whole form a kind of separate organism, though this is usually referred to as ‘Functionalism.’ In addition, Harris (see Headland, Pike, and Harris 1990) distinguishes between ‘emic’ (insider) and ‘etic’ (outsider) understandings of a social group, arguing that both perspectives make sense from their different viewpoints. This might also be understood as cultural relativism, and it perhaps raises the question of whether the two worlds can so easily be separated. Cultural relativism also argues, as with Romantic nationalism, that so-called developed cultures can learn a great deal from what they might regard as ‘primitive’ cultures. Moreover, humans are regarded as, in essence, products of culture and as extremely similar in terms of biology.

Cultural relativism led so-called ‘cultural anthropologists’ to focus on the symbols within a culture rather than comparing the structures and functions of different social groups, as occurred in ‘social anthropology’ (see below). Because each culture was regarded as unique, comparison was frowned upon, and anthropology in the tradition of Mead tended to focus on descriptions of a group’s way of life. Thick description is a trait of ethnography more broadly, but it is especially salient amongst anthropologists who believe that cultures can only be understood in their own terms. Such a philosophy has been criticized for turning anthropology into little more than academic-sounding travel writing, because it renders it highly personal and lacking in comparative analysis (see Sandall 2001, Ch. 1).

Cultural relativism has also been criticized as philosophically impractical and, ultimately, epistemologically pessimistic (Scruton 2000), because it means that nothing can be compared to anything else or even assessed through the medium of a foreign language’s categories. In implicitly defending cultural relativism, anthropologists have cautioned against assuming that some cultures are more ‘rational’ than others. Hollis (1967), for example, argues that anthropology demonstrates that superficially irrational actions may become ‘rational’ once the ethnographer understands the ‘culture.’ Risjord (2000) makes a similar point. This implies that the cultures are separate worlds, ‘rational’ in themselves. Others have suggested that entering the field assuming that the Western, ‘rational’ way of thinking is correct can lead to biased fieldwork interpretation (for example Rees 2010).

Critics have argued that certain forms of behavior can be regarded as undesirable in all cultures, yet are only prevalent in some. It has also been argued that Multiculturalism is a form of Neo-Marxism, on the grounds that it assumes imperialism and Western civilization to be inherently problematic and because it lauds the materially unsuccessful. Whereas Marxism extols the values and lifestyle of the worker and critiques those of the wealthy, Multiculturalism promotes ‘materially unsuccessful’ cultures and critiques more materially successful, Western cultures (for example Ellis 2004 or Gottfried 2004).

Cultural determinism has been criticized both from within and from outside anthropology. From within anthropology, the New Zealand anthropologist Derek Freeman (1916-2001), having been heavily influenced by Margaret Mead, conducted his own fieldwork in Samoa around twenty years after she did, and again during subsequent fieldwork visits. As he stayed there far longer than Mead, Freeman was accepted to a greater extent and given an honorary chiefly title, which allowed him considerable access to Samoan life. Eventually, in 1983 (after Mead’s death), he published his refutation: Margaret Mead and Samoa: The Making and Unmaking of an Anthropological Myth (Freeman 1983). In it, he argued that Mead was completely mistaken: Samoa was sexually puritanical and violent, and its teenagers experienced just as much angst as teenagers everywhere else. In addition, he highlighted serious faults in her fieldwork: her sample was very small; she chose to live at the American naval base rather than with a Samoan family; she did not speak Samoan well; and she focused mainly on teenage girls. Freeman even tracked one of them down who, as an elderly lady, admitted that she and her friends had deliberately lied to Mead about their sex lives for their own amusement (Freeman 1999). It should be emphasized that Freeman’s critique of Mead related to her failure to conduct participant observation fieldwork properly (in line with Malinowski’s recommendations). In that Freeman rejects distinctions of primitive and advanced, and stresses the importance of culture in understanding human differences, his critique is also in the tradition of Boas. However, it should be noted that Freeman’s (1983) critique of Mead has itself been criticized as unnecessarily cutting, prosecuting a case against Mead to the point of bias and ignoring points which Mead got right (Shankman 2009, 17).

There remains an ongoing debate about the extent to which culture reflects biology or is on a ‘biological leash.’ However, a growing body of research in genetics indicates that human personality is heavily influenced by genetic factors (for example Alarcon, Foulks, and Vakkur 1998 or Wilson 1998), though some research also indicates that environment, especially the prenatal environment, can alter the expression of genes (see Nettle 2007). This has become part of the critique of cultural determinism advanced by evolutionary anthropologists.

b. Functionalism and Structuralism

Between the 1930s and 1970s, various forms of functionalism were influential in British social anthropology. These schools accepted, to varying degrees, the cultural determinist belief that ‘culture’ was a separate sphere from biology and operated according to its own rules, but they also argued that social institutions could be compared in order to better discern the rules by which such institutions operate. They attempted to discern and describe how cultures operated and how the different parts of a culture functioned within the whole. Perceiving societies as organisms has been traced back to Herbert Spencer, and there is a degree to which Durkheim (1995), for example, attempted to understand the function of religion in society. But functionalism seemingly reflected aspects of positivism: the search for, in this case, social facts (ones that are cross-culturally true), based on empirical evidence.

E. E. Evans-Pritchard (1902-1973) was a leading British functionalist from the 1930s onwards. Rejecting grand theories of religion, he argued that a tribe’s religion could only make sense in terms of its function within society, and that a detailed understanding of the tribe’s history and context was therefore necessary; he also engaged in lengthy fieldwork. British functionalism, in this respect, was influenced by the linguistic theories of the Swiss thinker Ferdinand de Saussure (1857-1913), who suggested that signs only make sense within a system of signs. This school developed into ‘structural functionalism.’ A. R. Radcliffe-Brown (1881-1955) is often argued to be a structural functionalist, though he denied this. Radcliffe-Brown rejected Malinowski’s functionalism, which argued that social practices were grounded in human instincts. Instead, he was influenced by the process philosophy of Alfred North Whitehead (1861-1947). Radcliffe-Brown claimed that the units of anthropology were processes of human life and interaction; because these are in constant flux, anthropology must explain social stability. He argued that practices, in order to survive, must adapt to other practices, something he called ‘co-adaptation’ (Radcliffe-Brown 1957). It might be argued that this leaves us asking where any of the practices came from in the first place.

A leading member of the structural functionalist school was the Scottish anthropologist Victor Turner (1920-1983). Structural functionalists attempted to understand society as a structure with inter-related parts. In attempting to understand rites of passage, Turner argued that everyday structured society could be contrasted with the rite of passage itself (Turner 1969): a liminal (transitional) phase which involved communitas (a relative breakdown of structure). Another prominent anthropologist in this field was Mary Douglas (1921-2007). She examined the contrast between the ‘sacred’ and the ‘profane’ in terms of categories of ‘purity’ and ‘impurity’ (Douglas 1966). She also suggested a model – the Grid/Group Model – through which the structures of different cultures could be categorized (Douglas 1970). Philosophically, this school accepted many of the assumptions of naturalism, but it held to aspects of positivism in that it aimed to answer discrete questions using the ethnographic method. It has been criticized, as we will see below, by postmodern anthropologists, and also for its failure to attempt consilience with science.

Turner, Douglas and other anthropologists in this school followed Malinowski in using categories drawn from the study of ‘tribal’ cultures – such as rite of passage, shaman and totem – to better comprehend advanced societies such as that of Britain. For example, Turner was highly influential in the Anthropology of Religion, in which he used tribal categories as a means of comprehending aspects of the Catholic Church, such as modern-day pilgrimage (Turner and Turner 1978). This research also involved the participant observation method. Critics, such as the Romanian historian of religion Mircea Eliade (1907-1986) (for example Eliade 2004), have insisted that categories such as ‘shaman’ only make sense within their specific cultural context. Other critics have argued that such scholarship attempts to reduce all societies to the level of the local community, despite there being many important differences, and fails to take into account considerable differences in societal complexity (for example Sandall 2001, Ch. 1). Nevertheless, there is a growing movement within anthropology towards examining various aspects of human life through this so-called tribal prism and, more broadly, through the cultural one. Mary Douglas, for example, has looked at business life anthropologically, while others have focused on politics, medicine or education. This has been termed ‘traditional empiricism’ by critics in contemporary anthropology (for example Davies 2010).

In France in particular, the most prominent school during this period was known as structuralism. Unlike British functionalism, structuralism was influenced by Hegelian idealism. Most associated with Claude Levi-Strauss, structuralism argued that all cultures follow the Hegelian dialectic. The human mind has a universal structure and a kind of a priori category system of opposites, which Hollis argues can serve as a starting point for any comparative cultural analysis. Cultures can be broken up into components – such as ‘mythology’ or ‘ritual’ – which evolve according to the dialectical process, leading to cultural differences. As such, the deep structures, or grammar, of each culture can be traced back to a shared starting point (and, in a sense, the shared human mind), just as one can do with a language. But each culture has its own grammar, and this allows cultures to be compared and permits insights to be made about them (see, for example, Levi-Strauss 1978). It might be suggested that the same criticisms that have been leveled against the Hegelian dialectic, such as that it is based on dogma, might be leveled against structuralism. It has also been argued that category systems vary considerably between cultures (see Diamond 1974). Even supporters of Levi-Strauss have conceded that his works are opaque and verbose (for example Leach 1974).

c. Post-Modern or Contemporary Anthropology

The ‘postmodern’ thinking of scholars such as Jacques Derrida (1930-2004) and Michel Foucault (1926-1984) began to become influential in anthropology in the 1970s, ushering in what has been termed anthropology’s ‘Crisis of Representation.’ During this crisis, which many anthropologists regard as ongoing, every aspect of ‘traditional empirical anthropology’ came to be questioned.

Hymes (1974) criticized anthropologists for imposing ‘Western categories’ – such as Western measurement – on those they study, arguing that this is a form of domination and is immoral, and insisting that truth statements are always subjective and carry cultural values. Talal Asad (1971) criticized fieldwork-based anthropology as ultimately indebted to colonialism, suggesting that anthropology has essentially been a project to enforce colonialism. Geertzian anthropology was criticized because it involved representing a culture, something which inherently involved imposing Western categories upon it through producing texts. Marcus argued that anthropology was ultimately composed of ‘texts’ – ethnographies – which can be deconstructed to reveal power dynamics, normally the dominant-culture anthropologist making sense of the oppressed object of study through his or her subjective cultural categories and presenting it to his or her own culture (for example Marcus and Cushman 1982). By extension, since all texts – including scientific texts – could be deconstructed, they argued that no text can make objective assertions. Roth (1989) specifically criticizes seeing anthropology as ‘texts,’ arguing that doing so neither undermines the empirical validity of the observations involved nor helps to uncover the power structures.

Various anthropologists, such as Roy Wagner (b. 1938) (Wagner 1981), argued that anthropologists were simply products of Western culture and could only ever hope to understand another culture through their own. There was no objective truth beyond culture, simply different cultures, with some – the scientific ones – happening to be dominant for various historical reasons. Thus, this school strongly advocated cultural relativism. Critics have countered that, after Malinowski, anthropologists, with their participant observation breaking down the color bar, were in fact an irritation to colonial authorities (for example Kuper 1973), and have criticized cultural relativism, as discussed.

This situation led to what has been called the ‘reflexive turn’ in cultural anthropology. As Western anthropologists were products of their culture, just as those whom they studied were, and as the anthropologist was himself fallible, there developed an increasing movement towards ‘auto-ethnography,’ in which anthropologists analyze their own emotions and feelings towards their fieldwork. The essential argument for this detailed analysis of one’s own emotions is Charlotte Davies’ (1999, 6) point that the ‘purpose of research is to mediate between different constructions of reality, and doing research means increasing understanding of these varying constructs, among which is included the anthropologist’s own constructions’ (see Curran 2010, 109). Implicit in Davies’ argument is that there is no such thing as objective reality and objective truth; there are simply different constructions of reality, as Wagner (1981) also argues. It has also been argued that auto-ethnography is ‘emancipatory’ because it turns anthropology into a dialogue rather than a traditional hierarchical analysis (Heaton-Shreshta 2010, 49). Auto-ethnography has been criticized as self-indulgent and as based on problematic assumptions, such as cultural relativism and the belief that morality is the most important dimension of scholarship (for example Gellner 1992). In addition, the same criticisms that have been leveled against postmodernism more broadly have been leveled against postmodern anthropology, including criticism of a sometimes verbose and emotive style and the charge that it is epistemologically pessimistic and therefore leads to a void (for example Scruton 2000). However, cautious defenders insist on the importance of being at least ‘psychologically aware’ (for example Emmet 1976) before conducting fieldwork, a point also argued by Popper (1963) with regard to conducting any scientific research. And Berger (2010) argues that auto-ethnography can be useful to the extent that it elucidates how a ‘social fact’ was uncovered by the anthropologist.

One of the significant results of the ‘Crisis of Representation’ has been a cooling towards the concept of ‘culture’ (and indeed ‘culture shock’), which was previously central to ‘cultural anthropology’ (see Oberg 1960 or Dutton 2012). ‘Culture’ has been criticized, variously, as old-fashioned and boring; as problematic because it possesses a history (Rees 2010); as associated with racism, because it has come to replace ‘race’ in far right politics (Wilson 2002, 229); as imperialistically imposing a Western category on other cultures; as vague and difficult to define (Rees 2010); as helping to maintain a hierarchy of cultures (Abu-Lughod 1991); and as increasingly questioned by globalization and the breakdown of discrete cultures (for example Eriksen 2002 or Rees 2010). Defenders of culture have countered that many of these criticisms can be leveled against any category of apprehension, and that the term is not synonymous with ‘nation,’ so it can be employed even if nations become less relevant (for example Fox and King 2002). Equally, ‘culture shock,’ formerly used to describe a rite of passage amongst anthropologists engaging in fieldwork, has been criticized because of its association with culture and also as old-fashioned (Crapanzano 2010).

In addition, a number of further movements have been provoked by the postmodern movement in anthropology. One of these is ‘sensory ethnography’ (for example Pink 2009). It has been argued that anthropology traditionally privileges the Western emphasis on sight and the word, and that ethnographies, in order to avoid this kind of cultural imposition, need to attend to other senses such as smell, taste and touch. Another movement, specifically in the Anthropology of Religion, has argued that anthropologists should not go into the field as agnostics but should accept the possibility that the religious perspective of the group they are studying may actually be correct, and should even work on the assumption that it is and conduct their analysis accordingly (a point discussed in Engelke 2002).

During the same period, schools within anthropology developed around a number of other fashionable philosophical ideologies. Feminist anthropology, like postmodern anthropology, began to come to prominence in the early 1970s. Philosophers such as Sandra Harding (1991) argued that anthropology had been dominated by men, and that this had led to androcentric anthropological interpretations, to a failure to appreciate the importance of women in social organizations, to androcentric metaphors in anthropological writing, and to a focus on research questions that mainly concern men. Strathern (1988) uses what she calls a Marxist-feminist approach, employing the categories of Melanesia itself in order to understand Melanesian gender relations and so produce an ‘endogenous’ analysis of the situation. In doing so, she argues that actions in Melanesia are gender-neutral and that the asymmetry between males and females is ‘action-specific.’ Thus, Melanesian women are not in any permanent state of social inferiority to men: if there is a sexual hierarchy, it is de facto rather than de jure.

Critics have countered that prominent feminist interpretations have simply turned out to be empirically inaccurate. For example, feminist anthropologists such as Weiner (1992), as well as Frances Dahlberg (1981), argued that foraging societies prized females and were peaceful and sexually egalitarian. It has been countered that this is a projection of feminist ideals which does not match the facts (Kuznar 1997, Ch. 3), and that it does not follow that just because anthropology is male-dominated it is therefore biased (Kuznar 1997, Ch. 3). However, the feminist philosopher of science Alison Wylie (see Risjord 1997) has argued that ‘politically motivated critiques,’ including feminist ones, can improve science. Feminist critique, she argues, demonstrates the influence of ‘androcentric values’ on theory, which forces scientists to hone their theories.

Another school, composed of some anthropologists from less developed countries or their descendants, has proffered a similar critique, shifting the feminist charge that anthropology is androcentric to the charge that it is Euro-centric. It has been argued that anthropology is dominated by Europeans, and specifically by Western Europeans and those of Western European descent, and therefore reflects European thinking and bias. For example, anthropologists from developing countries, such as the Greenlandic scholar Karla Jessen Williamson, have argued that anthropology would benefit from the more holistic, intuitive thinking of non-Western cultures and that this should be integrated into anthropology (for example Jessen Williamson 2006). The American anthropologist Lee Baker (1991) describes himself as ‘Afro-Centric’ and argues that anthropology must be critiqued as based on a ‘Western’ and ‘positivistic’ tradition which is thus biased in favor of Europe. Afrocentric anthropology aims to shift this to an African (or African American) perspective. Baker argues that metaphors in anthropology, for example, are Euro-centric and justify the suppression of Africans. Thus, Afrocentric anthropologists wish to construct an ‘epistemology’ whose foundations are African. The criticisms leveled against cultural relativism have also been leveled against such perspectives (see Levin 2005).

4. Philosophical Dividing Lines

a. Contemporary Evolutionary Anthropology

The positivist, empirical philosophy already discussed broadly underpins current evolutionary anthropology, and there is an extent to which it therefore crosses over with biology. This is in line with the consilience model advocated by the Harvard biologist Edward Wilson (b. 1929) (Wilson 1998), who has argued that the social sciences must attempt to be scientific, in order to share in the success of science, and therefore must be reducible to the science which underpins them. Contemporary evolutionary anthropologists therefore follow the scientific method, and often a quantitative methodology, to answer discrete questions, and they attempt to orient anthropological research within biology and the latest discoveries in that field. Some scholars, such as Derek Freeman (1983), have defended a more qualitative methodology while nevertheless arguing that its findings need ultimately to be underpinned by scientific research.

For example, anthropologist Pascal Boyer (2001) has attempted to understand the origins of ‘religion’ by drawing upon the latest research in genetics and in particular research into the functioning of the human mind. He has examined this alongside evidence from participant observation in an attempt to ‘explain’ religion. This subsection of evolutionary anthropology has been termed ‘Neuro-anthropology’ and attempts to better understand ‘culture’ through the latest discoveries in brain science. There are many other schools which apply different aspects of evolutionary theory – such as behavioral ecology, evolutionary genetics, paleontology and evolutionary psychology – to understanding cultural differences and different aspects of culture or subsections of culture such as ‘religion.’ Some scholars, such as Richard Dawkins (b. 1941) (Dawkins 1976), have attempted to render the study of culture more systematic by introducing the concept of cultural units – memes – and attempting to chart how and why certain memes are more successful than others, in light of research into the nature of the human brain.

Critics, in naturalist anthropology, have suggested that evolutionary anthropologists are insufficiently critical and go into the field thinking they already know the answers (for example Davies 2010). They have also argued that evolutionary anthropologists fail to appreciate that there are ways of knowing other than science. Some critics have also argued that evolutionary anthropology, with its acceptance of personality differences based on genetics, may lead to the maintenance of class and race hierarchies and to racism and discrimination (see Segerstråle 2000).

b. Anthropology: A Philosophical Split?

It has been argued both by scholars and journalists that anthropology, more so than other social scientific disciplines, is rent by a fundamental philosophical divide, though some anthropologists have disputed this and suggested that qualitative research can help to answer scientific research questions as long as naturalistic anthropologists accept the significance of biology.

The divide is trenchantly summarized by Lawson and McCauley (1993), who divide anthropologists into ‘interpretivists’ and ‘scientists’ or, as noted above, ‘positivists’ and ‘naturalists.’ For the scientists, the views of the ‘cultural anthropologists’ (as they call themselves) are too speculative, especially because pure ethnographic research is subjective, and are meaningless where they cannot be reduced to science. For the interpretivists, the ‘evolutionary anthropologists’ are too ‘reductionistic’ and ‘mechanistic’; they do not appreciate the benefits of a subjective approach (such as garnering information that could not otherwise be garnered); and they ignore questions of ‘meaning,’ since they suffer from ‘physics envy.’

Some anthropologists, such as Risjord (2000, 8), have criticized this divide, arguing that the two perspectives can be united and that only through ‘explanatory coherence’ (combining objective analysis of a group with the face-value beliefs of the group members) can a fully coherent explanation be reached. Otherwise, anthropology will ‘never reach the social reality at which it aims.’ But this seems to raise the question of what it means to ‘reach the social reality.’

In practical terms, the split has already been happening, as discussed in Segal and Yanagisako (2005, Ch. 1). They note that some American anthropological departments demand that their lecturers be committed to holist ‘four field anthropology’ (archaeology, cultural, biological and linguistic) precisely because of this ongoing split, and in particular because of the divergence between biological and cultural anthropology. They observe that by the end of the 1980s most biological anthropologists had already left the American Anthropological Association. Though they argue that ‘holism’ was less necessary in Europe – because of the way that US anthropology, in focusing on Native Americans, ‘bundled’ the four – Fearn (2008) notes that there is a growing divide in British anthropology departments as well, along the same lines of positivism and naturalism.

Evolutionary anthropologists and, in particular, postmodern anthropologists do seem to follow philosophies with essentially different presuppositions. In November 2010, this divide became particularly contentious when the American Anthropological Association voted to remove the word ‘science’ from its Mission Statement (Berrett 2010).

5. References and Further Reading

  • Abu-Lughod, Lila. 1991. “Writing Against Culture.” In Richard Fox (ed.), Recapturing Anthropology: Working in the Present (pp. 466-479). Santa Fe: School of American Research Press.
  • Alarcon, Renato, Foulks, Edward and Vakkur, Mark. 1998. Personality Disorders and Culture: Clinical and Conceptual Interactions. New York: John Wiley and Sons.
  • American Anthropological Association. 1998. “American Anthropological Association Statement on ‘Race.’” 17 May.
  • Andreski, Stanislav. 1974. Social Sciences as Sorcery. London: Penguin.
  • Asad, Talal. 1971. “Introduction.” In Talal Asad (ed.), Anthropology and the Colonial Encounter. Atlantic Highlands: Humanities Press.
  • Baiburin, Albert. 2005. “The Current State of Ethnography and Anthropology in Russia.” Forum for Anthropology and Culture 2, 448-489.
  • Baker, Lee. 1991. “Afro-Centric Racism.” University of Pennsylvania: African Studies Center.
  • Barenski, Janusz. 2008. “The New Polish Anthropology.” Studia Ethnologica Croatica 20, 211-222.
  • Beddoe, John. 1862. The Races of Britain: A Contribution to the Anthropology of Western Europe. London.
  • Benedict, Ruth. 1934. Patterns of Culture. New York: Mifflin.
  • Berger, Peter. 2010. “Assessing the Relevance and Effects of “Key Emotional Episodes” for the Fieldwork Process.” In Dimitrina Spencer and James Davies (eds.), Anthropological Fieldwork: A Relational Process (pp. 119-143). Newcastle: Cambridge Scholars Press.
  • Berrett, Daniel. 2010. “Anthropology Without Science.” Inside Higher Ed, 30 November.
  • Bidney, David. 1953. Theoretical Anthropology. New York: Columbia University Press.
  • Boas, Franz. 1907. The Mind of Primitive Man. New York: MacMillan.
  • Bourgois, Philippe. 2007. “Confronting the Ethics of Ethnography: Lessons from Fieldwork in Central America.” In Antonius Robben and Jeffrey Slukka (eds.), Ethnographic Fieldwork: An Anthropological Reader (pp. 288-297). Oxford: Blackwell.
  • Boyer, Pascal. 2001. Religion Explained: The Human Instincts That Fashion Gods, Spirits and Ancestors. London: William Heinemann.
  • Cattell, Raymond. 1972. Beyondism: A New Morality from Science. New York: Pergamon.
  • Ciubrinskas, Vytis. 2007. “Interview: ‘Anthropology is Badly Needed in Eastern Europe.’” Antropologi.info.
  • Crapanzano, Vincent. 2010. “‘At the Heart of the Discipline’: Critical Reflections on Fieldwork.” In James Davies and Dimitrina Spencer (eds.), Emotions in the Field: The Psychology and Anthropology of Fieldwork Experience (pp. 55-78). Stanford: Stanford University Press.
  • Curran, John. 2010. “Emotional Interaction and the Acting Ethnographer: An Ethical Dilemma?” In Dimitrina Spencer and James Davies (eds.), Anthropological Fieldwork: A Relational Process (pp. 100-118). Newcastle: Cambridge Scholars Press.
  • Dahlberg, Frances. 1981. “Introduction.” In Frances Dahlberg (ed.), Woman the Gatherer (pp. 1-33). New Haven: Yale University Press.
  • Darwin, Charles. 1871. The Descent of Man. London: John Murray.
  • Darwin, Charles. 1859. The Origin of Species. London: John Murray.
  • Davies, Charlotte. 1999. Reflexive Ethnography: A Guide to Researching Selves and Others. London: Routledge.
  • Davies, James. 2010. “Conclusion: Subjectivity in the Field: A History of Neglect.” In Dimitrina Spencer and James Davies (eds.), Anthropological Fieldwork: A Relational Process (pp. 229-243). Newcastle: Cambridge Scholars Publishing.
  • Dawkins, Richard. 1976. The Selfish Gene. Oxford: Oxford University Press.
  • Diamond, Stanley. 1974. In Search of the Primitive. New Brunswick: Transaction Books.
  • Douglas, Mary. 1970. Natural Symbols: Explorations in Cosmology. London: Routledge.
  • Douglas, Mary. 1966. Purity and Danger: An Analysis of the Concepts of Pollution and Taboo. London: Routledge.
  • Durkheim, Emile. 1995. The Elementary Forms of Religious Life. New York: Free Press.
  • Dutton, Edward. 2012. Culture Shock and Multiculturalism. Newcastle: Cambridge Scholars Publishing.
  • Ehrenfels, Umar Rolf, Madan, Triloki Nath, and Comas, Juan. 1962. “Mankind Quarterly Under Heavy Criticism: 3 Comments on Editorial Practices.” Current Anthropology 3, 154-158.
  • Eliade, Mircea. 2004. Shamanism: Archaic Techniques of Ecstasy. Princeton: Princeton University Press.
  • Ellis, Frank. 2004. Political Correctness and the Theoretical Struggle: From Lenin and Mao to Marcus and Foucault. Auckland: Maxim Institute.
  • Emmet, Dorothy. 1976. “Motivation in Sociology and Social Anthropology.” Journal for the Theory of Social Behaviour 6, 85-104.
  • Engelke, Matthew. 2002. “The Problem of Belief: Evans-Pritchard and Victor Turner on the Inner Life.” Anthropology Today 18, 3-8.
  • Eriksen, Thomas Hylland. 2003. “Introduction.” In Thomas Hylland Eriksen (ed.), Globalisation: Studies in Anthropology (pp. 1-17). London: Pluto Press.
  • Eriksen, Thomas Hylland. 2001. A History of Anthropology. London: Pluto Press.
  • Fearn, Hannah. 2008. “The Great Divide.” Times Higher Education, 28 November.
  • Fox, Richard, and King, Barbara. 2002. “Introduction: Beyond Culture Worry.” In Richard Fox and Barbara King (eds.), Anthropology Beyond Culture (pp. 1-19). Oxford: Berg.
  • Frazer, James. 1922. The Golden Bough: A Study in Magic and Religion. London: MacMillan.
  • Freeman, Derek. 1999. The Fateful Hoaxing of Margaret Mead: A Historical Analysis of Her Samoan Research. London: Basic Books.
  • Freeman, Derek. 1983. Margaret Mead and Samoa: The Making and Unmaking of an Anthropological Myth. Cambridge: Harvard University Press.
  • Galton, Francis. 1909. Essays in Eugenics. London: Eugenics Education Society.
  • Geertz, Clifford. 1999. “‘From the Native’s Point of View’: On the Nature of Anthropological Understanding.” In Russell T. McCutcheon (ed.), The Insider/Outsider Problem in the Study of Religion: A Reader (pp. 50-63). New York: Cassell.
  • Geertz, Clifford. 1973. The Interpretation of Cultures. New York: Basic Books.
  • Gellner, Ernest. 1995. Anthropology and Politics: Revolutions in the Sacred Grove. Oxford: Blackwell.
  • Gellner, Ernest. 1992. Post-Modernism, Reason and Religion. London: Routledge.
  • Gobineau, Arthur de. 1915. The Inequality of Human Races. New York: G. P. Putnam and Sons.
  • Gorton, William. 2010. “The Philosophy of Social Science.” Internet Encyclopedia of Philosophy. https://iep.utm.edu/soc-sci/
  • Gottfried, Paul. 2004. Multiculturalism and the Politics of Guilt: Towards a Secular Theocracy. Columbia: University of Missouri Press.
  • Grant, Madison. 1916. The Passing of the Great Race: Or the Racial Basis of European History. New York: Charles Scribner’s Sons.
  • Hallpike, Christopher Robert. 1986. The Principles of Social Evolution. Oxford: Clarendon Press.
  • Harding, Sandra. 1991. Whose Science? Whose Knowledge? Thinking from Women’s Lives. Ithaca: Cornell University Press.
  • Headland, Thomas, Pike, Kenneth, and Harris, Marvin. 1990. Emics and Etics: The Insider/ Outsider Debate. New York: Sage Publications.
  • Heaton-Shreshta, Celayne. 2010. “Emotional Apprenticeships: Reflections on the Role of Academic Practice in the Construction of ‘the Field.’” In Dimitrina Spencer and James Davies (eds.), Anthropological Fieldwork: A Relational Process (pp. 48-74). Newcastle: Cambridge Scholars Publishing.
  • Hollis, Martin. 1967. “The Limits of Irrationality.” European Journal of Sociology 8, 265-271.
  • Hymes, Dell. 1974. “The Use of Anthropology: Critical, Political, Personal.” In Dell Hymes (ed.), Reinventing Anthropology (pp. 3-82). New York: Vintage Books.
  • Jessen Williamson, Karla. 2006. Inuit Post-Colonial Gender Relations in Greenland. Aberdeen University: PhD Thesis.
  • Kapferer, Bruce. 2001. “Star Wars: About Anthropology, Culture and Globalization.” Suomen Antropologi: Journal of the Finnish Anthropological Society 26, 2-29.
  • Kuper, Adam. 1973. Anthropologists and Anthropology: The British School 1922-1972. New York: Pica Press.
  • Kuznar, Lawrence. 1997. Reclaiming a Scientific Anthropology. Walnut Creek: AltaMira Press.
  • Lawson, Thomas, and McCauley, Robert. 1993. Rethinking Religion: Connecting Cognition and Culture. Cambridge: Cambridge University Press.
  • Leach, Edmund. 1974. Claude Levi-Strauss. New York: Viking Press.
  • Levi-Strauss, Claude. 1978. Myth and Meaning. London: Routledge.
  • Levin, Michael. 2005. Why Race Matters. Oakton: New Century Foundation.
  • Lovejoy, Arthur. 1936. The Great Chain of Being: A Study of the History of an Idea. Cambridge: Harvard University Press.
  • Lynn, Richard. 2006. Race Differences in Intelligence: An Evolutionary Analysis. Augusta: Washington Summit Publishers.
  • Malinowski, Bronislaw. 1922. Argonauts of the Western Pacific. London: Routledge.
  • Marcus, George, and Cushman, Dick. 1982. “Ethnographies as Texts.” Annual Review of Anthropology 11, 25-69.
  • Mead, Margaret. 1928. Coming of Age in Samoa: A Psychological Study of Primitive Youth for Western Civilization. London: Penguin.
  • Montagu, Ashley. 1945. Man’s Most Dangerous Myth: The Fallacy of Race. New York: Columbia University Press.
  • Nettle, Daniel. 2007. Personality: What Makes Us the Way We Are. Oxford: Oxford University Press.
  • Oberg, Kalervo. 1960. “Culture Shock: Adjustment to New Cultural Environments.” Practical Anthropology 7, 177-182.
  • Pearson, Roger. 1991. Race, Intelligence and Bias in Academe. Washington DC: Scott-Townsend Publishers.
  • Pels, Peter. 1999. “Professions of Duplexity: A Prehistory of Ethical Codes.” Current Anthropology 40, 101-136.
  • Pichot, Andre. 2009. The Pure Society: From Darwin to Hitler. London: Verso.
  • Pike, Luke. 1869. “On the Alleged Influence of Race Upon Religion.” Journal of the Anthropological Society of London 7, CXXXV-CLIII.
  • Pink, Sarah. 2009. Doing Sensory Ethnography. London: Sage Publications.
  • Popper, Karl. 1963. Conjectures and Refutations: The Growth of Scientific Knowledge. London: Routledge.
  • Popper, Karl. 1957. The Poverty of Historicism. London: Routledge.
  • Radcliffe-Brown, Alfred Reginald. 1957. A Natural Science of Society. Chicago: University of Chicago Press.
  • Rees, Tobias. 2010. “On the Challenge – and the Beauty – of (Contemporary) Anthropological Inquiry: A Response to Edward Dutton.” Journal of the Royal Anthropological Institute 16, 895-900.
  • Risjord, Mark. 2007. “Scientific Change as Political Action: Franz Boas and the Anthropology of Race.” Philosophy of Social Science 37, 24-45.
  • Risjord, Mark. 2000. Woodcutters and Witchcraft: Rationality and the Interpretation of Change in the Social Sciences. Albany: State University of New York Press.
  • Rogan, Bjarne. 2012. “The Institutionalization of Folklore.” In Regina Bendix and Galit Hasan-Rokem (eds.), A Companion to Folklore (pp. 598-630). Oxford: Blackwell.
  • Roth, Paul. 1989. “Ethnography Without Tears.” Current Anthropology 30, 555-569.
  • Sandall, Roger. 2001. The Culture Cult: On Designer Tribalism and Other Essays. Oxford: Westview Press.
  • Sarkany, Mihaly. N.d. “Cultural and Social Anthropology in Central and Eastern Europe.” Leibniz Institute for the Social Sciences.
  • Scruton, Roger. 2000. Modern Culture. London: Continuum.
  • Sedgwick, Mitchell. 2006. “The Discipline of Context: On Ethnography Amongst the Japanese.” In Joy Hendry and Heung Wah Wong (eds.), Dismantling the East-West Dichotomy: Essays in Honour of Jan van Bremen (pp. 64-68). New York: Routledge.
  • Segal, Daniel, and Yanagisako, Sylvia. 2005. “Introduction.” In Daniel Segal and Sylvia Yanagisako (eds.), Unwrapping the Sacred Bundle: Reflections on the Disciplining of Anthropology (pp. 1-23). Durham: Duke University Press.
  • Segerstråle, Ullica. 2000. Defenders of the Truth: The Sociobiology Debate. Oxford: Oxford University Press.
  • Shankman, Paul. 2009. The Trashing of Margaret Mead: Anatomy of an Anthropological Controversy. Madison: University of Wisconsin Press.
  • Siikala, Jukka. 2006. “The Ethnography of Finland.” Annual Review of Anthropology 35, 153-170.
  • Sluka, Jeffrey. 2007. “Fieldwork Ethics: Introduction.” In Antonius Robben and Jeffrey Slukka (eds.), Ethnographic Fieldwork: An Anthropological Reader (pp. 271-276). Oxford: Blackwell.
  • Spencer, Herbert. 1873. The Study of Sociology. New York: D. Appleton and Co.
  • Srivastava, Vinay Kumar. 2000. “Teaching Anthropology.” Seminar 495.
  • Stocking, George. 1991. Victorian Anthropology. New York: Free Press.
  • Strathern, Marilyn. 1988. The Gender of the Gift: Problems with Women and Problems with Society in Melanesia. Berkeley: University of California Press.
  • Tucker, William. 2002. The Funding of Scientific Racism: Wycliffe Draper and the Pioneer Fund. Illinois: University of Illinois Press.
  • Turner, Victor. 1969. The Ritual Process: Structure and Anti-Structure. New York: Aldine Publishers.
  • Turner, Victor, and Turner, Edith. 1978. Image and Pilgrimage in Christian Culture: Anthropological Perspectives. New York: Columbia University Press.
  • Tylor, Edward Burnett. 1871. Primitive Culture: Researches into the Development of Mythology, Religion, Art and Custom. London: John Murray.
  • Tylor, Edward Burnett. 1865. Researches into the Early History of Mankind. London: John Murray.
  • Wagner, Roy. 1981. The Invention of Culture. Chicago: University of Chicago Press.
  • Weiner, Annette. 1992. Inalienable Possessions. Los Angeles: University of California Press.
  • Wilson, Edward Osborne. 1998. Consilience: The Unity of Knowledge. New York: Alfred A. Knopf.
  • Wilson, Edward Osborne. 1975. Sociobiology: The New Synthesis. Cambridge: Harvard University Press.
  • Wilson, Richard. 2002. “The Politics of Culture in Post-apartheid South Africa.” In Richard Fox and Barbara King (eds.), Anthropology Beyond Culture (pp. 209-234). Oxford: Berg.


Author Information

Edward Dutton
Email: ecdutton@hotmail.com
University of Oulu
Finland

Modern Morality and Ancient Ethics

It is commonly supposed that there is a vital difference between ancient ethics and modern morality. For example, there appears to be a vital difference between virtue ethics and the modern moralities of deontological ethics (Kantianism) and consequentialism (utilitarianism). At second glance, however, one acknowledges that the two ethical approaches have more in common than their stereotypes may suggest. Oversimplification, fallacious interpretations, and the broad variation within any particular ethical theory make it generally harder to determine the real differences and similarities between ancient ethics and modern morality. But why should we bother about ancient ethics at all? What is the utility of comparing the strengths and weaknesses of the particular approaches? The general answer is that a proper understanding of the strengths and weaknesses of virtue ethics and modern moral theories can be used to overcome current ethical problems and to initiate fruitful developments in ethical reasoning and decision-making.

This article examines the differences and similarities between ancient ethics and modern morality by analysing and comparing their main defining features, in order to show that the two ethical approaches are less distinct than one might suppose. The first part of the article outlines the main ethical approaches in Ancient Greek ethics by focusing on the Cynics, the Cyrenaics, Aristotle’s virtue ethics, the Epicureans, and the Stoics. This part also briefly outlines the two leading modern ethical approaches, that is, Kantianism and utilitarianism, in more general terms in order to provide a sufficient background. The second part provides a detailed table with the main defining features of the conflicting stereotypes of ancient ethics and modern morality. Three main issues – the good life versus the good action, the use of the term “moral ought,” and whether a virtuous person can act in a non-virtuous way – are described in more detail in the third part of the article in order to show that the two approaches have more in common than the stereotypes may initially suggest. The fourth part deals with the idea of moral duty in ancient ethics.

Table of Contents

  1. Ancient Ethics and Modern Morality
    1. Ethics and Morality
    2. Ancient Ethics
      1. The Cynics and the Cyrenaics – The Extremes
      2. The Peripatetic School – Aristotle’s Virtue Ethics
      3. Epicureanism and Stoicism
    3. Modern Morality
      1. Kantianism
      2. Utilitarianism
    4. The Up-shot
  2. The Table of Ancient Ethics and Modern Morality – A Comparison
  3. Ancient Ethics and Modern Morality – The Main Differences
    1. The Good Life versus the Good Action
    2. The Moral Ought
    3. Can a Virtuous Person Act in a Non-Virtuous Way?
  4. Special Problem: Kant and Aristotle – Moral Duty and For the Sake of the Noble
  5. Conclusion
  6. References and Further Reading

1. Ancient Ethics and Modern Morality

There are at least two main criteria that each moral theory must fulfil: first, the criterion of justification (that is, the particular moral theory should not contain any contradictions) and, second, the criterion of applicability (that is, the particular moral theory should solve concrete problems and offer ethical orientation). However, many (traditional) moral theories are unable to meet the second criterion and simply fall short of the high demands of applied ethics to solve the complex moral problems of our times. Why is this the case? The main point is that the traditional moral theories are not sufficiently well equipped to deal with completely new problems such as those concerning nuclear power, gene technology, cloning, and so forth. Therefore, there is constant interest in updating and enhancing a particular moral theory in order to make it compatible with the latest demands. Examples are neo-Aristotelians such as Hursthouse on abortion (1991) and on nature (2007), as well as neo-Kantians such as Regan on animals (1985), Korsgaard in general and in particular on animals and nature (1996), and Altman’s edited volume on the use and limits of Kant’s practical philosophy in applied ethics (2011). This is a difficult and often very complex process.

a. Ethics and Morality

When people talk about ethical approaches in Antiquity, they refer to these approaches using the words “ancient ethics” rather than “ancient morality”. They talk about “virtue ethics” and not about “virtue morality”. But why is this the case? The challenging question, according to Annas (1992: 119-120), is whether ancient scholars such as Plato and Aristotle, as well as the Stoics and Epicureans, are really talking about morality at all, since their main focus is limited to the agent’s happiness, which obviously “doesn’t sound much like morality” (119). Even if one acknowledges that happiness here means a satisfactory and well-lived life according to the ethical virtues, and not merely a happy moment, it still does not sound like morality. Furthermore, the general idea in virtue ethics that the good of other people enters the scene by being a part of one’s own good, and that, for example, the notion of justice is introduced as a character trait and not as the idea of the rights of others (see Dworkin’s phrase, “rights as trumps”), makes it obvious that there is a systematic difference between the notions of ethics and morality. Ancient ethics is about living a good and virtuous life according to the ethical virtues, that is, about becoming a virtuous person, while the modern notion of morality is primarily focused on the interests of other people and the idea of deontological constraints. That is, one acts morally because one has to meet certain standards, not because doing so supports one’s own good life. But even this simple picture might be premature, depending on how one conceives the idea of “moral motivation” in ancient ethics (see below).

Historically speaking, there is no clear evidence as to which term is more legitimate. In Ancient Greek, the term for ethics is êthos and means something like character. When Aristotle analyses the good life in the Nicomachean Ethics and the Eudemian Ethics, he therefore focuses on the central topic of good and bad character traits, that is, virtues and vices. In this original sense, ethics means an analysis of character or character traits. In Ancient Roman thought, which was essentially influenced by Cicero, the Greek term ethikos (the adjective of êthos) was translated with the Latin term moralis (the adjective of mores), although the Latin term mores in fact means habits and customs. It is possible to translate the Greek term êthos as habits and customs, but it is more likely that the translation of ethikos with moralis was a mistranslation: moralis rather corresponds to the Greek ethos, whose primary meaning is habits and customs. If the term morality refers to mores, then the term morality means the totality of all habits and customs of a given community. The term moralis became a terminus technicus in Latin-shaped philosophy, which covers the present meaning of the term. In modern times, the habits and customs of a given community are termed ‘conventions’, which are authoritative for social life in society. Morality, however, is not simply a matter of mere convention, since the two often conflict (for example, in the case of an immoral convention); hence, it seems inappropriate to narrow the term in this way (Steinfath 2000). At present, there are at least four different ways to distinguish between ethics and morality:

  1. Ethics and morality as distinct spheres: Ethics has to do with the pursuit of one’s own happiness or well-being and private lifestyle, that is, how we should live to make good lives for ourselves. Morality has to do with other people’s interests and deontological constraints (for example Jürgen Habermas).
  2. The equation of ethics and morality (for example Peter Singer).
  3. Morality as a special field in the ethical realm: Ethics is the generic term for ethical and moral issues in the above-mentioned sense. Morality is a special part of ethics (for example, Bernard Williams).
  4. Morality as the object of ethics: Ethics is the philosophical theory of morality which is the systematic analysis of moral norms and values (standard reading).

The upshot is that it is always important to ask how the terms ethics and morality are used, and how one uses them oneself. What is certain is that, in claiming that the terms differ, one makes a terminological and not only a conceptual differentiation.

b. Ancient Ethics

It is impossible to give a complete depiction of the rich history of ethical reasoning and decision-making in Antiquity here; the focus of this section is therefore on the main lines of ethical reasoning of the most important philosophical schools in the classic and Hellenistic periods. This rather simplified overview is nonetheless sufficient for our purposes. One can roughly divide the classic and Hellenistic periods into four different but closely connected parts. The first part concerns Socrates and his arguments with the Sophists (second half of the fifth century BC); the second part covers the post-Socratic formation of important philosophical schools deeply influenced by Socratic thought, for example Antisthenes’ school of the Cynics, Aristippus’ school of the Cyrenaics, and Plato’s Academy, the most influential ancient school (second half of the fifth and fourth centuries BC). The third part is characterized, on the one hand, by the formation of one new major philosophical school, namely Aristotle’s peripatetic school, which developed from Plato’s Academy, and, on the other hand, by the exchange of arguments among the existing schools on various issues (fourth century BC). The fourth part concerns the formation of two new important philosophical schools, which became highly influential in Antiquity: first, Epicurus’ school of Epicureanism, standing in the tradition of the Cyrenaics, and, secondly, Zeno’s school of the Stoics, which partly developed from the Cynics (second half of the fourth and third century BC). All the philosophical schools – though at odds with each other – are united by the fact that they are deeply concerned with the most important ethical questions of how to live a good life and how to achieve happiness. Their responses to these vital questions are, of course, diverse.

Figure 1. The Most Prominent Philosophical Schools in Ancient Greece

The following brief depiction focuses on the basic ethical assumptions of the philosophical schools of the Cynics and the Cyrenaics, the peripatetic school, the Epicureans, and the Stoics. Socrates and Plato’s Academy are left out because Socrates did not provide any (written) systematic ethics. His unsystematic ethical position is mainly depicted in Plato’s early dialogues, for example Laches, Charmides, and Protagoras, and in some of Xenophon’s works, such as Apology, Symposium, and Memorabilia. Plato himself did not provide any systematic ethics comparable to that of the other main ancient schools either, even though one can certainly reconstruct – at least to some extent – his ethical viewpoint from the dialogue Politeia. In addition, most (ethical) works of the classic and Hellenistic periods are lost; what remains is a collection of fragments, phrases, and (parts of) letters of various important philosophers (and commentators) standing in the traditions of particular schools of the time. Many rival views on ethics are mediated through the works of Plato and Aristotle, in which they criticize their opponents. Some of these fragments and testimonia were also mediated by famous writers and politicians such as Xenophon (fifth and fourth centuries BC) and by the important historian of philosophy Diogenes Laertios (third century AD). Aristotle, however, is the only ancient philosopher whose two substantial and complete ethical contributions – the Nicomachean Ethics and the Eudemian Ethics, leaving aside the Magna Moralia, whose authorship is unclear – have survived, even though all of his dialogues, including those concerned with ethics and ethical issues, are lost.

i. The Cynics and the Cyrenaics – The Extremes

The founder of the school of the Cynics, Antisthenes of Athens, taught that virtue, understood as practical wisdom, is a good and is also sufficient for eudaimonia, that is, happiness. Badness is an evil, and everything else is indifferent. In accord with Socrates, Antisthenes claimed that virtue is teachable, and he also accepted the doctrine of the unity of the virtues, that is, the general idea that if a person possesses one ethical virtue, then he or she thereby possesses all the other ethical virtues as well (for a recent contribution to this controversial doctrine, see Russell 2009). The only good of human beings is what is peculiar to them, namely their ability to reason. Against the Cyrenaics, Antisthenes argued that pleasure is never a good. Things such as death, illness, servitude, poverty, disgrace, and hard labour are commonly supposed to be bad but are not real evils. One should be indifferent towards one’s honour, property, liberty, health, and life (committing suicide was allowed). The Cynics, in general, lived a beggar’s life and were probably the first real cosmopolitans in human history – a feature that the Stoics later wholeheartedly adopted. They also opposed the common cultural and religious rites and practices, a main feature they shared with the Sophists. They took Socratic frugality to extremes and tried to be as independent of material goods as possible, like Diogenes of Sinope, who lived in a barrel. Furthermore, one should abstain from bad things and seek apathy and tranquillity, important features that the Stoics adopted from the Cynics as well. According to the Cynics, there are two groups of people: first, the wise, who live a perfect and happy life and cannot lose their virtues once they have achieved this condition (similar to Aristotle), and, secondly, the fools, who are unhappy and make mistakes (Diogenes Laertios VI, 1 and 2; Zeller 1883: 116-121; Long 2007: 623-629).

Aristippus of Cyrene was well known and highly regarded among philosophers in Antiquity and was the first Socratic disciple to take money in exchange for lessons. He was the founder of the Cyrenaics – a famous philosophical school whose members were devoted to (sensualistic) hedonism (which certainly influenced Jeremy Bentham’s version of hedonistic utilitarianism). The school of the Cyrenaics thereby stands in striking contrast to the Cynics. Aristippus claims that knowledge is valuable only insofar as it is useful in practical matters (a feature the Cyrenaics share with the Cynics); all actions should strive for the utmost pleasure, since pleasure is the highest good. Among the goods there are only gradual differences. Unlike Aristotle, the hedonists believed that the overall purpose in life is not happiness understood as a long-term state but the bodily pleasure of the very moment. The past has gone by and the future is uncertain; therefore only the here and now is decisive, since the immediate feelings are the only guide to what is genuinely valuable. Practical wisdom is the precondition of happiness in being instrumentally useful for achieving pleasure. Aristippus and the Cyrenaics sought maximum pleasure in each moment without being swamped by it. Aristippus – known for his cheerful nature and praiseworthy character as well as his distinguished restraint – famously claimed that one should be the master of each moment: “I possess, but I am not possessed”. A. A. Long rightly claims: “Aristippus Senior had served as the paradigm of a life that was both autonomous and effortlessly successful in turning circumstances into sources of bodily enjoyment” (2007: 636). Aristippus was a true master of making the best out of each situation; he also taught that one should be able to limit one’s wishes if they are likely to cause severe problems for oneself, to preserve self-control (a general feature he shares with Socrates), to secure one’s happiness, to seek inner freedom, and to be cheerful. Obviously, his teaching of a life devoted solely to bodily pleasure – that is, his pursuit of lust and his view concerning the unimportance of knowledge – stands in striking contrast to the teachings of Socrates (as well as those of Plato and Aristotle). His disciples – most notably Aristippus the Younger, Theodoros, Anniceris (who bought the release of Plato), and Hegesias – established new Cyrenaic schools offering sophisticated versions of hedonism developed through fruitful disputes with Epicurus and the Cynics (for a brief overview of Aristippus’ disciples, see A. A. Long 2007: 632-639; for the teachings, see, for example, Diogenes Laertios II, 8; Zeller 1883: 121-125; Döring 1988. For the view that Aristippus’ hedonism is not limited to “bodily pleasures”, see Urstad 2009).

ii. The Peripatetic School – Aristotle’s Virtue Ethics      

Aristotle proposed the most prominent and sophisticated version of virtue ethics in Antiquity; his teachings became authoritative for many scholars and remain alive in the vital contributions of neo-Aristotelians in contemporary philosophy. His main ethical work is the Nicomachean Ethics; less prominent but still valuable and authentic is the Eudemian Ethics, while Aristotle’s authorship of the Magna Moralia is highly questionable. Aristotle claims that happiness (eudaimonia) is the highest good – the final, perfect, and self-contained goal – at which all people aim. In particular, happiness is the goal of life, that is, of a life devoted to “doing” philosophy (EN X, 6–9). Whether a person can be called “happy” can only be determined retrospectively, at the very end of that person’s life. For a good general overview of Aristotle’s ethics, see Broadie (1991) and Wolf (2002).

The idea that life should be devoted to reasoning follows from Aristotle’s important human function argument (EN I, 5, 6), in which he attempts to show – by analogy – that human beings as such must have a proper function, just as other things do: the proper function of a pair of scissors is cutting, the proper function of a flute player is flute playing, and so forth. If the proper function is performed well, then, Aristotle claims, the particular thing has goodness (aretê). For example, if the proper function of a pair of scissors is cutting, then the proper function of a good pair of scissors is cutting well (and likewise in all other cases). Since the proper function of human beings – according to Aristotle – is to reason, the goodness of human beings depends on the good performance of the proper human function, that is, on reasoning well. In fact, Aristotle claims that the goodness of human beings consists not in the mere performance of the proper function but rather in the corresponding disposition. This claim is substantiated by his example of the good person and the bad person, who cannot be distinguished from each other while they are asleep if one refers only to their (active) performance. The only way to distinguish them is to refer to their different dispositions. It is a matter of debate whether there is a particular human function of the kind Aristotle proposes.

All in all, one can distinguish four different lines of reasoning in Aristotle’s ethics: the virtue of the good person (standard interpretation), the idea of an action-oriented virtue ethics, the application of practical wisdom, and the idea of the intrinsic value of virtues. The different approaches are dealt with in order.

The virtue of the good person (EN II, 3, 4): according to Aristotle, an action is good (or right) if a virtuous person would perform that action in a similar situation; an action is bad or wrong (and hence prohibited) if the virtuous person would never perform such an action. Three criteria must be met, according to Aristotle, to ensure that an action is virtuous, given the condition of the agent when performing it: (i.) the agent must have knowledge of the circumstances of the action (the action must not happen by accident); (ii.) the action must be undertaken out of deliberative choice and done for its own sake; and (iii.) the action must be performed without hesitation, that is, by a person with a firm and stable virtuous character.

The action-oriented virtue ethics (EN II, 6, 1107a10–15): Aristotle’s virtue ethics contains some hints that he not only adheres to the standard interpretation but also claims that some actions are morally blameworthy under any circumstances, that is, intrinsically bad. The fine or the noble and the just require the virtuous person to do or refrain from doing certain things, for example, not to murder (in particular, not to kill one’s parents), not to commit adultery, and not to commit theft. This line of reasoning contains deontological limitations insofar as the virtuous person is no longer the overall standard of evaluation; rather, the virtuous person herself must meet certain ethical criteria – the external demands of, for example, “the noble” and “the just” – in order to act virtuously.

Practical wisdom (EN VI): in some passages in book VI of the Nicomachean Ethics, Aristotle argues that it is our practical wisdom that makes our practical considerations good, both with regard to the good or virtuous life and with regard to our particular goals. He claims that a practically wise person has a special sensitivity or special perceptual skill with which to evaluate a situation in a morally correct or appropriate way. Here, the emphasis lies on the practical wisdom – as the capacity of ethical reasoning and decision-making – rather than on adhering to single ethical virtues, even though Aristotle claims that it is impossible to be practically wise without having ethical virtues and vice versa.

The intrinsic value of the virtues: following the standard interpretation of the role of the ethical virtues in living a good life, Aristotle argues in the Nicomachean Ethics (EN X, 6–9) that these virtues are somewhat less important when it comes to the overall goal, that is, the happiness of a good life. The primary goal is to live a life devoted to “doing” philosophy and thereby to live a good life; the secondary goal is to live a life among other people, which makes it necessary to adopt the ethical virtues as well.

iii. Epicureanism and Stoicism

Epicurus – educated by the Platonist Pamphilus and highly influenced by the important teachings of Democritus – developed his philosophical school of the Epicureans in controversy with the Cyrenaics and the Stoics, meeting their objections and challenges. The lively exchange of arguments concerning the vital issue of how to live a good life enabled Epicurus to articulate a refined and sophisticated version of hedonism, which was regarded as superior to that of the rival philosophical school of the Cyrenaics. He claims that sensation is the only standard for measuring good and evil. Epicurus shares with the Cyrenaics the view that all living beings strive for pleasure and try to avoid pain. But, unlike the Cyrenaic school, he argues that happiness consists not only in the bodily pleasure of the very moment but lasts a whole life and also contains mental pleasure, which is – according to him – preferable to bodily pleasure. In his Letter to Menoeceus, Epicurus comments on flawed views of his ethical position and claims: “For what produces the pleasant life is not continuous drinking and parties or pederasty or womanizing or the enjoyment of fish and the other dishes of an expensive table, but sober reasoning […]” (Epic. Ep. Men. 132, in Long and Sedley 2011: 114). The ultimate goal in life is not to strive for positive pleasure but to seek the absence of pain. Unlike Aristippus, Epicurus claims – in support of the importance of mental states – that bodily pleasure and pain are limited to the here and now, while the soul is also concerned with the pleasurable and painful states of the past and with prospective pleasure and pain. Thus, sensations based on recollection, hope, and fear with regard to the past and future are much stronger than the bodily pleasure of the moment. Being virtuous is a precondition of tranquillity, that is, peace and freedom from fear, which is closely connected to happiness. In addition, Epicurus taught that one should free oneself from prejudices, master and restrict one’s desires, live a modest life (for example, a life not devoted to achieving glory and honour), which does not exclude bodily pleasure, and cultivate close friendships, for which the Epicureans were well known (see Diogenes Laertios X, 1; Zeller 1883: 263-267; Erler and Schofield 2007: 642-674; Long and Sedley 2000: §20-§25).

Shortly after the rise of epicureanism, Zeno of Citium – the founder of stoicism – established a new school in Athens. Its members were well known for their cosmopolitanism, that is, the idea that all human beings belong to a single community that should be cultivated (quite similar to Aristippus’ view); for their self-contained lifestyle and deep concern for friendship; and for their strong adherence to ataraxia, that is, freedom from passions such as pleasure, desire, sorrow, and fear, which jeopardize one’s inner independence. The Stoics were influenced by the teachings of the Cynics. Human beings, according to stoicism, are able to perceive the laws of nature through reason and to act accordingly. The best life is a life according to nature (Zeller 1883: 243). Zeno believed that the most general instinct is the instinct of self-preservation; for each living being, the only thing that is valuable is what conduces to the being’s self-preservation and thereby contributes to the being’s happiness. In the case of rational beings, only what is in accord with reason is valuable; only virtue, which is necessary and sufficient for happiness, is a good. Following the Cynics, the Stoics argue that honour, property, health, and life are not goods and that poverty, disgrace, illness, and death are not evils. Against the Cyrenaics and the Epicureans, they hold that pleasure is not a good and certainly not the highest good; they agree with Aristotle that pleasure is the consequence of our actions – if they are of the right kind – but not the goal itself. Two main doctrines are of utmost importance in the teachings of stoicism: first, the significance of ataraxia and, secondly, the idea of doing what nature demands. First, happiness is ataraxia – freedom from passions – and a self-contained lifestyle. Secondly, the idea that one must act in accordance with one’s own nature, in terms of acting virtuously, stands in striking contrast to the other philosophical schools of the time. In addition, the right motive transforms the performance of one’s duty into a virtuous action, completely independently of the outcome of the particular action (an important feature that we find again in Kant’s ethics). Following Socrates and Plato, the Stoics believed that virtue is ethical knowledge and that non-virtuous people simply lack ethical knowledge, since virtue consists in the reasonable condition of the soul, which leads to correct views. The Cynic idea of a sharp distinction between the very few wise people and the many fools, that is, all non-wise people, became less sharp in the course of time. The Roman philosopher and politician Cicero (106–43 BC) is the first author whose work on the notion of duty, De Officiis (44 BC), survives; in it he examines the notion in great detail. It should be noted, however, that the stoic philosopher Panaitios of Rhodes (180–110 BC) had already published an important book on the notion of duty prior to Cicero. Panaitios’ work is lost, but we know some of its essential ideas as mediated through Cicero, who often refers to Panaitios in his De Officiis. With regard to its ethics, stoicism outlived the other philosophical schools by remaining an attractive position for many people, including leading philosophers and politicians such as Seneca (first century AD) and Marcus Aurelius (second century AD) in Ancient Rome (see Diogenes Laertios VII, 1; Zeller 1883: 243-253; Inwood and Donini 2007: 675-738; Long and Sedley 2000: §56-§67).

c. Modern Morality

The two main moral theories of modern morality are Kant’s deontological ethics and utilitarianism. Both theories have been adopted and modified by many scholars in recent history in order to make them (more) compatible with the latest demands of ethical reasoning and decision-making, in particular by meeting the objections raised by modern virtue ethics (or neo-Aristotelianism). The following briefly depicts Kantianism in its original form and the main features of utilitarianism.

i. Kantianism

The German philosopher Immanuel Kant is the founder of deontological ethics. His ethics, which he mainly put forth in the Groundwork of the Metaphysics of Morals (1785), the Critique of Practical Reason (1788), and the Metaphysics of Morals (1797), is one of the most prominent and highly respected theories of modernity. Kant’s ethics is deontological in the sense that one has to obey the duties and obligations that derive from his supreme principle of morality, the Categorical Imperative: “Act only according to that maxim whereby you can at the same time will that it should become a universal law” (Kant 1785). The Categorical Imperative is a test for maxims which, in turn, determine whether certain acts have moral worth or not. A maxim is an individual’s subjective principle or rule of the will (in German, das subjektive Prinzip des Wollens), which tells the individual what to do in a given situation. If the maxim can be universalized, then it is valid and one must act upon it. A maxim fails to be universalizable in two ways: (i.) it may be logically inconsistent when universalized (as in the example of suicide, which violates a “perfect duty”); or (ii.) it may be impossible to will the maxim as a universal law (as in failing to cultivate one’s talents, which violates an “imperfect duty”). Violations of perfect duties are blameworthy; imperfect duties allow for human desires, and hence they are not as strong as perfect duties, but they are still morally binding, and people do not attract blame if they fail to fulfil them (as in the case of failing to cultivate one’s talents). Kant’s ethics is universal in the sense that the system of moral duties and obligations applies to all rational beings (not only human beings). Morality is not based on interests (as in social contract theories), emotions and intuitions, or conscience, but on reason alone. This is why Kant’s ethics is not heteronomous – unlike a divine ethical theory, in which God commands what human beings should do (for example, the Bible, the Ten Commandments), or a natural law conception, in which nature itself commands what human beings should do by providing them with the faculty of reason, which, in turn, detects what should be done in moral matters – but truly autonomous with regard to rational beings, who make their moral decisions in the light of pure practical reason. Pure practical reason, in determining the moral law or Categorical Imperative, determines what ought to be done without reference to empirical, contingent factors (that is, anthropology in the broad sense of the term, including the empirical sciences; see the preface to the Groundwork) such as one’s own desires or personal inclinations (in German, Neigungen). Pure practical reason is not limited to the particular nature of human reasoning but is the source and the field of universal norms, which stem from a general notion of a rational being as such (see Eisler 2008: 577; Paton 1967; Timmermann 2010; Altman 2011).
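The universalization test just described has the structure of a two-stage decision procedure, which the following toy sketch (in Python, with invented function and variable names) tries to make vivid. This is a schematization for illustration only, not Kant’s own formulation: whether a maxim can be conceived or willed as a universal law is a substantive philosophical judgment, which the code simply takes as input.

```python
# A toy schematization of the two-stage universalizability test described
# above. The two boolean judgments are philosophical inputs; no program
# could actually compute them.

def evaluate_maxim(conceivable_as_universal_law: bool,
                   willable_as_universal_law: bool) -> str:
    if not conceivable_as_universal_law:
        # Case (i): the maxim is logically inconsistent when universalized.
        return "violates a perfect duty"
    if not willable_as_universal_law:
        # Case (ii): the maxim cannot be willed as a universal law.
        return "violates an imperfect duty"
    return "universalizable: one may act on this maxim"

# The two examples from the text, with the judgments Kant argues for:
print(evaluate_maxim(False, False))  # the suicide maxim -> perfect duty
print(evaluate_maxim(True, False))   # neglecting one's talents -> imperfect duty
```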

ii. Utilitarianism

Historically speaking, Jeremy Bentham, in his Introduction to the Principles of Morals and Legislation (1789), and John Stuart Mill, in Utilitarianism (1863), are the founders of utilitarianism, while Francis Hutcheson (1755) and William Paley (1785) can be seen as their legitimate predecessors in pointing out that utility should be an important standard of evaluation in ethical reasoning and decision-making. Bentham claims that the duration and intensity of pleasure and pain are of utmost importance and that it is even possible to measure the right action by applying a hedonistic calculus that determines the exact utility of the available actions; the action with the best hedonistic outcome should be put into practice. His position is called radical quantitative hedonism. Mill, instead, questions the very idea of a hedonistic calculus and argues that one must distinguish between mental and bodily pleasures, giving more weight to mental pleasures. His position is called qualitative hedonism. Mill’s basic formula of utilitarianism is as follows:

The creed which accepts as the foundation of morals, Utility, or the Greatest Happiness Principle, holds that actions are right in proportion as they tend to promote happiness, wrong as they tend to produce the reverse of happiness. By happiness is intended pleasure, and the absence of pain; by unhappiness, pain and the privation of pleasure. (Mill’s Utilitarianism, chapter 2)

There is widespread agreement that numerous different utilitarian theories exist in modern ethics; hence it would be impossible to provide an adequate depiction of all the major strands in this brief subsection. However, the following four aspects are typical of every utilitarian theory:

  1. The consequence principle: Utilitarianism is not about actions but about the consequences of actions. Such a theory is a form of consequentialism, which means that the moral worth of a particular action is determined by its outcome.
  2. Happiness: Utilitarianism is a teleological theory insofar as happiness (though not in the ancient sense of the term) is the main goal to be achieved. This goal can be identified with (i.) the promotion of pleasure, (ii.) the avoidance of pain or harm, (iii.) the fulfilment of desires or considered preferences, or (iv.) meeting some objective criteria of well-being.
  3. The greatest happiness principle: Utilitarianism is not about mere happiness but about “the greatest happiness” attainable. Utilitarianism is a one-principle theory that judges the consequences of a given action with regard to its utility, which is the general aim of actions. The moral rightness or wrongness of actions depends on achieving the greatest happiness for the greatest number of sentient beings – in short, “the greatest happiness for the greatest number”.
  4. Maximising: The collective amount of utility with regard to the sentient beings affected by the action should be maximised. This line of reasoning contains strong altruistic claims because, roughly speaking, one should choose only those actions that improve other sentient beings’ happiness.

Furthermore, one major methodological distinction should be mentioned briefly, since it divides all utilitarian theories into two groups according to whether the principle of utility is applied to actions or to rules. In act utilitarianism (or direct utilitarianism), the principle of utility is applied to the particular action; one asks whether the action in question is morally right or wrong in this particular situation. In rule utilitarianism (or indirect utilitarianism), the principle of utility is instead applied to rules, which, in turn, are applied to particular actions and serve as guidelines for human behaviour in order to guarantee the greatest happiness for the greatest number. Here the vital question is whether a specific rule maximises general utility or not. It can happen that rule utilitarianism maximises general utility to a lesser degree than act utilitarianism would. For example, following the general rule that one should keep one’s promises maximises general utility in the long run (rule utilitarianism); yet in some particular cases, depending on the situation and circumstances, breaking the promise would maximise general utility to a higher degree (act utilitarianism).
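To make the structural difference concrete, here is a deliberately simple sketch in Python of the promise-keeping example above. All utility numbers are invented for illustration; no utilitarian author prescribes these values:

```python
# Toy contrast between act and rule utilitarianism, using the promise-keeping
# example from the text. All utility numbers are invented.

# Utilities of the two possible acts in this one particular situation.
act_utilities = {"keep promise": 5, "break promise": 8}

# Act utilitarianism: choose the act with the highest utility here and now.
act_choice = max(act_utilities, key=act_utilities.get)

# Rule utilitarianism: choose the rule whose general adoption yields the
# highest long-run utility, then do whatever that rule prescribes here.
rule_utilities = {"always keep promises": 100,
                  "keep promises only when convenient": 60}
rule_prescriptions = {"always keep promises": "keep promise",
                      "keep promises only when convenient": "break promise"}
best_rule = max(rule_utilities, key=rule_utilities.get)
rule_choice = rule_prescriptions[best_rule]

print(act_choice)   # "break promise" - best outcome in this single case
print(rule_choice)  # "keep promise"  - best rule over many cases
```

The two procedures can thus recommend different actions in the same situation, which is exactly the divergence described above.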

d. The Upshot

The depiction of the ethical views of some important philosophical schools in Antiquity, as well as their interrelatedness, and the outline of the two leading moral theories of modern morality show that there is – despite the systematic difference concerning the importance of the question of the good life – a significant overlap of important lines of reasoning. In addition, the supposed distinction between ancient ethics and modern morality contains many misleading claims. Socrates can be seen as the initial spark of a broad variety of virtue-ethical approaches such as cynicism, the teachings of the Cyrenaics, Aristotelianism, epicureanism, and stoicism. All the philosophical schools were concerned with the vital questions of how to live a good life and how to achieve happiness by pointing out what the appropriate actions are. The brief outline of the different philosophical schools in Antiquity supports this view. Modern morality is different in that its focus is on the basic question of how one should act; the ancient question of how one should live is secondary. However, modern morality, in particular Kantianism and utilitarianism, did not start from scratch but had important and highly influential ancient predecessors. For example, the Kantian idea of doing the right thing because reason dictates it has its roots in stoicism (see Cooper 1998; Schneewind 1998), and the utilitarian idea of living a happy life according to pleasure has its roots in the teachings of the Cyrenaics (for example Bentham 1789) and the Epicureans (for example Mill 1863). The history of ideas thus conveyed important ethical insights from Antiquity to modernity. The idea that there is a clear and easy distinction between ancient (virtue) ethics and modern moral theories is premature and misleading. There are, indeed, some important differences, but one must acknowledge the simple fact that there is no unity or broad consensus among ancient virtue ethicists concerning the question of how to live a good life and which actions should count as virtuous. Hence there is no “ancient ethics” as such but many important and diverse virtue-ethical approaches, which have more or less in common with “modern morality”.

In addition, modern morality – in particular contemporary morality – is characterized by the fact that quite a few important scholars elaborated modern versions of Aristotle’s classical virtue ethics in the twentieth century. These scholars argue that virtue ethics was quite successful in solving ethical problems in Antiquity, and they believe that adhering to a refined version of virtue ethics is not only useful but superior in solving our modern moral problems. Among the most important neo-Aristotelian scholars are Anscombe (1958), Foot (1978, 2001), Hursthouse (1999), MacIntyre (1981), Nussbaum (1992, 1993, 1995), Slote (2001), Swanton (2003), and Williams (1985), who claim that the traditional ethical theories – deontological ethics (Kantianism) and consequentialism (utilitarianism) – are doomed to failure. In general, they adhere to at least two main hypotheses: (i.) people in Antiquity already employed a very efficient way of ethical reasoning and decision-making; and (ii.) this particular way was lost in modernity without having been properly replaced. Hence, one should overcome the deficient modern ethical theories and return to virtue ethics as a viable alternative, without, of course, abandoning the existing ethical developments (see Bayertz 2005: 115).

The following section depicts the old but still persistent stereotypical differences between ancient ethics and modern morality in order to deepen our understanding of the supposed and real differences and similarities between the two approaches.

2. The Table of Ancient Ethics and Modern Morality – A Comparison

This self-explanatory table presents a simple but instructive comparison of the defining features of the stereotypes of ancient ethics and modern morality (for a similar table see Bayertz 2005: 117).

  1. Basic Question
     Ancient Ethics: What is the good life? What is happiness and human flourishing?
     Modern Morality: What should one/I do? The question of the good life plays, at best, a subordinate role.
  2. Object of Concern
     Ancient Ethics: Self-centred – the person’s own interests dominate.
     Modern Morality: Other-related – the interests of other people are most central.
  3. What is Most Important?
     Ancient Ethics: Pursuit of goals – personal perfection, personal projects, and personal relationships.
     Modern Morality: Universal moral obligations and rules – individuals should seek impartiality (and hence alienate themselves from their own personal projects).
  4. What is Examined?
     Ancient Ethics: The agent – most important are the acting person and his or her character (agent-centred ethics).
     Modern Morality: Actions and consequences – most important is the correctness of the action and its consequences (action- and consequence-centred ethics).
  5. Central Notions
     Ancient Ethics: Virtues – aretaic notions such as good, excellence, and virtue (aretaic language).
     Modern Morality: Norms – prescriptive notions concerning rules, duties, and obligations, such as must and should (deontic language).
  6. How is Rationality Seen?
     Ancient Ethics: As a capacity for context-sensitive insight and decision-making.
     Modern Morality: “Mainly” as the capacity to (rationally) deduce inferences from abstract propositions.
  7. The Goals of Human Actions
     Ancient Ethics: The goals of human actions are objective (notions of happiness: for example, thinking, pleasure).
     Modern Morality: The goals of human actions are individually defined by people (subjectivism); no God, no nature.
  8. Scope of Morality
     Ancient Ethics: Adult male citizens with full citizenship.
     Modern Morality: Men, women, children, animals, the environment.
  9. Individual and Community
     Ancient Ethics: The individual is in unity with the community (harmony).
     Modern Morality: The individual and the community are rather disconnected from each other.

Table 1: Ancient Ethics and Modern Morality

3. Ancient Ethics and Modern Morality – The Main Differences

a. The Good Life versus the Good Action

The most common stereotype regarding ancient ethics and modern morality is that ancient ethics is only about the question “What is the good life?” and that modern moral theories only deal with the question “What should one do?” or “How should one act?”. Many stereotypes certainly contain some truth, but there is almost always room for a better understanding of the differences and similarities at issue. To be more precise: it is true that ancient ethics concerns the vital question of how to live a good life and become a virtuous person by acting in accordance with the ethical virtues. However, the idea that virtue ethics does not deal with actions, and hence is unable to provide concrete answers to ethical problems, is premature; it is not only modern moral theories that deal with actions (see Hursthouse 1999, chapters 1-3; Slote 2001, chapter 1; Swanton 2003, chapter 11). An ethical virtue, according to Aristotle, must be completely internalized by the agent through many actions of the same type, so that the person acquires a firm disposition. In other words, a person who is to have the virtue of courage must perform many brave actions in the area of fear and confidence in order to acquire a brave disposition. Performing the appropriate actions is the only way to do this. Indeed, modern moral theories are rather focused on the question of what one should do in a particular situation, and their proponents usually do not pay much attention to the question of living a good life. Ancient ethicists, by contrast, believe that the two issues cannot be separated.

A related issue that seems to support the initial idea strongly concerns the claim that ancient ethics is self-centred, because it focuses only on the agent’s interest in living a good life and becoming a virtuous person, while modern morality is other-regarding, focusing only on the interests of other people. Broadly speaking, the stereotype holds, ancient ethics is egoistic and modern morality is altruistic. In virtue ethics, the interests of other people enter the stage by being incorporated into the person’s own interest in becoming virtuous and living a good life. In her article “Ancient Ethics and Modern Morality”, Annas examines this point in more detail and claims that “the confusion comes from the thought that if the good of others is introduced into the agent’s own final good, it cannot really be the good of others, but must in some way be reduced to what matters to the agent”. She points out, against this confusion, that “the good of others must matter to me because it is the good of others, not because it is part of my own good” (Annas 1992: 131). Annas thinks that this is compatible with the overall final good of the virtuous person, since the good of others matters to the virtuous person not because it is part of the agent’s own good but because it is the good of others.

Others, however, might claim that the difference is that between “morality” and “legality”, to use a Kantian distinction. In this context, legality means simply fulfilling the moral claims that other people have; morality means fulfilling those claims and, in addition, having the right motive in doing so, that is, acting out of “the good will” – out of a sense of moral obligation or duty. Translated into “ancient” language, the virtuous person should consider other people’s interests not out of indifference, or because their interests are merely instrumentally useful to her as agent, but because she wholeheartedly believes, feels, and acknowledges that other people’s interests are important in their own right. Another example is Aristotle, who believes that the good person lives a good life if and only if she devotes her life to “philosophy” and, secondarily, lives a social life among other people. The latter requires the use of the ethical virtues, which are by nature other-regarding; the former does not (see Aristotle, EN X, 6–9), even though, according to Aristotle, one cannot be a practically wise person without being virtuous, and vice versa. Both concepts are mutually dependent (EN VI).

One might claim that in ancient ethics self-interest and the interests of other people do not stand in contrast to each other but converge through adherence to an objective idea of the good (see Bayertz 2005). The line between moral questions concerning the interests of other people and ethical questions concerning the well-being of the particular agent is thereby blurred beyond recognition. In modern morality, however, there is a clear difference, because the question of the good life is secondary and is systematically unimportant for the question of how one should act in a particular situation. Modern moral theories are rather subjective in character and hence lack the strong commitment of virtue-ethical theories to an objective basis, as well as their claims regarding elitism and the devaluation of moral common sense. The upshot is that there is a systematic difference between ancient ethics and modern morality concerning the way in which moral problems are solved, but the idea that ancient ethics is egoistic and does not appeal to actions is premature and simply wrong.

b. The Moral Ought

Anscombe points out in her classic paper “Modern Moral Philosophy” (1958) that modern moral philosophy is doomed to failure because it focuses only on the analysis of language and notions and, in particular, because it adheres to the fallacious idea of a moral duty. She argues that the ideas of moral duty and the moral ought used in deontological ethics originally come from religious reasoning and theological ethics, where God was the ultimate source of morality and where people had to obey God’s commands; there, the ideas of a moral duty and a moral ought were appropriate. In secular ethics, however, there is no general consent to the idea of a moral duty that is universally binding on all rational people. The idea of a moral duty, according to Anscombe, should be replaced by the notion of virtue. Furthermore, Schopenhauer convincingly claims in his book On the Basis of Morality that even in the case of religious ethics there is no categorical moral duty, since people obey God’s moral rules simply because they do not want to be punished should they decide not to act accordingly; but this means that the moral duty is hypothetical rather than categorical. It is commonly said that in ancient ethics there is no moral duty and no moral ought simply because the Greeks and Romans lacked those particular notions. However, from the bare fact that they lacked the notions of moral duty and moral ought, one cannot conclude that they also lacked the corresponding phenomena (Bayertz 2005: 122). One might reply that this point still misses the significance of using such notions as main ethical key terms, since this reflects a particular way of ethical reasoning and decision-making. Whether there is something like a “moral ought” in ancient virtue ethics that is comparable to that of deontological ethics is briefly examined below with a focus on Aristotle’s ethics.

c. Can a Virtuous Person Act in a Non-Virtuous Way?

According to ancient ethics, a completely virtuous person, who is the bearer of all the ethical virtues, is unable to act in a non-virtuous way. If a person bears one virtue, he or she thereby bears all the other virtues as well (the thesis of the unity of the virtues). The practically wise person – according to ancient ethicists – will always act in accordance with the ethical virtues. In other words, the virtuous person is always master of her emotions and will never be swamped by them, which might otherwise lead her to act in a non-virtuous way. Generally speaking, this is a quite demanding line of argumentation, since it can be the case, at least according to our modern way of thinking, that a brave person who has the virtue of courage might not be able to show the virtue of liberality. Moreover, even if one acknowledges that a person is virtuous, one might not be convinced that this person will never act in a non-virtuous way. This problem has to do with the famous thesis of the unity of the virtues (for a recent contribution to this problem, see Russell 2009). In modern morality, utilitarianism, for example, convincingly distinguishes between the evaluation of a person’s character and the evaluation of his or her actions. It can easily be the case, according to utilitarianism, that a morally bad person performs a morally right action or that a morally good person performs a morally wrong action. This distinction is impossible to draw for proponents of (classic) virtue ethics, because an ethically right action always presupposes that the person has an ethically good character.

4. Special Problem: Kant and Aristotle – Moral Duty and For the Sake of the Noble

There is a widely shared agreement among philosophers that Kant’s deontological ethics and Aristotle’s virtue ethics can easily be distinguished by the simple fact that Kant is concerned with acting from duty, on the moral principle, or because one thinks it is morally right, while Aristotle’s approach completely lacks this particular idea of moral motivation – hence, it would be unsound to claim that the virtuous person is morally obligated to act in a way similar to the Kantian agent. In other words, on this view there is no such thing as acting from a sense of duty in virtue ethics. This common view has been challenged by neo-Aristotelians (for example, Hursthouse 2010) who claim not only that there is a strong notion of moral motivation in Aristotle’s approach, but also that the virtuous person is better equipped to meet the demands of acting from a sense of duty than the Kantian moral agent. The following sketches the main line of reasoning (see also Engstrom and Whiting 1998; Jost and Wuerth 2011).

Hursthouse claims in her book On Virtue Ethics that “there is a growing enthusiasm for the idea that the ideal Kantian agent, the person with a good will, who acts “from a sense of duty”, and the ideal neo-Aristotelian agent, who acts from virtue – from a settled state of character – are not as different as they were once supposed to be” (2010: 140). Her view is supported by important works of Hudson (1990), Audi (1995), and Baron (1995), and the convergence has also been acknowledged by neo-Kantian philosophers such as Korsgaard (1998) and Herman (1998). In this respect, it reflects a lack of awareness of current developments in virtue ethics and neo-Kantianism if one still upholds the claim – proposed for hundreds of years – of a clear distinction between ancient ethics and modern morality, in particular between Aristotle and Kant. A related issue, the question of whether there is a fundamental distinction between aretaic and deontic terms, has been critically discussed by Gryz (2011), who argues against Stocker’s (1973) claim that “good” and “right” mean the same thing. Gryz is convinced that even if the two groups of terms converge as closely as possible, an unbridgeable gap remains: if one attempts to define one group of terms by means of the other, something is always left over that the second group cannot capture. This contemporary debate shows that there is still no common view on the relationship between ancient ethics and modern morality.

Kant claims in the Groundwork that the morally motivated agent acts from good will. In more detail, to act from duty, or to act because one thinks an action morally right, is to perform it because one thinks its maxim has the form of a law (Korsgaard 1998: 218). For example, if a person is in need, the Kantian agent performs the right action not because, as Korsgaard puts it, it is her purpose simply to do her duty, but because she chooses the action for its own sake – that is, her purpose is to help (Korsgaard 1998: 207).

Even if the ancient Greeks lacked notions that can be translated as moral ought, duty, right, and principle (for example, Gryz 2011; Hursthouse 2010), it nonetheless seems correct to claim that the idea of doing the right thing because it is right, or because one is required to do it, is also a well-known phenomenon in classic virtue ethics in general, and in Aristotle and stoicism in particular. There are quite a few passages in the Nicomachean Ethics in which Aristotle clearly claims that morally good actions are done for their own sake or because they are the morally right thing to do:

Now excellent actions are noble and done for the sake of the noble. (EN IV, 2, 1120a23–24)

Now the brave man is as dauntless as man may be. Therefore, while he will fear even the things that are not beyond human strength, he will fear them as he ought and as reason directs, and he will face them for the sake of what is noble; for this is the end of excellence. (EN III, 10, 1115b10–13)

The standard of all things is the good and the good man; he is striving for the good with all his soul and does the good for the sake of the intellectual element in him. (EN IX, 4, 1166a10–20)

The good man acts for the sake of the noble. (EN IX, 8, 1168a33–35)

For the wicked man, what he does clashes with what he ought to do, but what the good man ought to do he does; for the intellect always chooses what is best for itself, and the good man obeys his intellect. (EN IX, 8, 1169a15–18)

If the virtuous person acts because she thinks it is the right thing to do – because she acts for the sake of the noble, without any inclination other than to do good for the sake of the noble – then she is comparable to the Kantian moral agent. According to Aristotle, the noble is “that which is both desirable for its own sake and also worthy of praise” (Rhetoric I, 9, 1366a33); and at 1366b38–1367a5 he holds that nobility is exhibited in actions “that benefit others rather than the agent, and actions whose advantages will only appear after the agent’s death, since in these cases we can be sure the agent himself gets nothing out of it” (Korsgaard 1998: 217). Hence it follows that the virtuous person will not act in a non-virtuous way, because she acts from a strong inner moral obligation to do the morally right thing: it is the very nature of the virtuous person to act virtuously. The Kantian agent, by contrast, sometimes acts according to the universal law and hence performs a morally right action, and on other occasions fails to do so, because he or she has no stable and firm disposition always to act in accordance with the universal law. That is the very reason why the Aristotelian virtuous person can be seen as an agent who not only acts from duty, in the sense of doing the right thing because it is right, but who also constantly perceives and adheres to the moral duty, that is, to act virtuously.

5. Conclusion

The upshot is that the vital question of how to live a good life cannot be separated from the essential question of how one should act. Conceptually and phenomenologically, both questions are intimately interwoven, and a complete ethical theory will always be concerned with both issues, whether the theory is of ancient or modern origin.

6. References and Further Reading

  • Altman, M. 2011. Kant and Applied Ethics: The Uses and Limits of Kant’s Practical Philosophy. Oxford: Wiley-Blackwell.
  • Annas, J. 1992. “Ancient Ethics and Modern Morality.” Philosophical Perspectives 6: 119-136.
  • Anscombe, G.E.M. 1958. “Modern Moral Philosophy.” Philosophy 33 (124): 1-19.
  • Apelt, O, trans. 1998. Diogenes Laertius: Leben und Meinungen berühmter Philosophen. Hamburg: Meiner.
  • Aristotle. 1985. “Nicomachean Ethics.” Translated by W.D. Ross. In Complete Works of Aristotle, edited by J. Barnes, 1729-1867. Princeton: Princeton University Press.
  • Aristotle. 2007. Rhetoric. Translated by George A. Kennedy. New York: Oxford University Press.
  • Audi, R. 1995. “Acting from Virtue.” Mind 104 (415): 449-471.
  • Baron, M. 1995. Kantian Ethics Almost Without Apology. New York: Cornell University Press.
  • Bayertz, K. 2005. “Antike und moderne Ethik.” Zeitschrift für philosophische Forschung 59 (1): 114-132.
  • Bentham, J. 1962. “An Introduction to the Principles of Morals and Legislation” [1789]. In The Works of Jeremy Bentham, edited by J. Bowring, 1-154. New York: Russell and Russell.
  • Broadie, S. 1991. Ethics with Aristotle. Oxford: Oxford University Press.
  • Cicero, M.T. 2001. On Obligations. Translated by P.G. Walsh. Oxford, New York: Oxford University Press.
  • Cooper, J.M. 1998. “Eudaimonism, the Appeal to Nature, and ‘Moral Duty’ in Stoicism.” In Aristotle, Kant, and the Stoics: Rethinking Happiness and Duty, edited by S. Engstrom and J. Whiting, 261-284. Cambridge, New York: Cambridge University Press.
  • Döring, K. 1988. Der Sokratesschüler Aristipp und die Kyrenaiker. Stuttgart, Wiesbaden: Franz Steiner.
  • Dworkin, R. 1984. “Rights as Trumps.” In Theories of Rights, edited by J. Waldron, 152-167. Oxford: Oxford University Press.
  • Eisler, R. 2008. Kant Lexikon: Nachschlagewerk Zu Kants Sämtlichen Schriften, Briefen Und Handschriftlichem Nachlaß. Hildesheim: Weidmannsche Verlagsbuchhandlung.
  • Engstrom, S. and J. Whiting, eds. 1998. Aristotle, Kant, and the Stoics: Rethinking Happiness and Duty. Cambridge: Cambridge University Press.
  • Erler, M. and M. Schofield. 2007. “Epicurean Ethics.” In The Cambridge History of Hellenistic Philosophy, edited by K. Algra, J. Barnes, J. Mansfield and M. Schofield, 642-669. Cambridge: Cambridge University Press.
  • Foot, P. 1978. Virtues and Vices and other Essays in Moral Philosophy. Berkeley: University of California Press.
  • Foot, P. 2001. Natural Goodness. Oxford: Clarendon Press.
  • Gryz, J. 2011. “On the Relationship Between the Aretaic and the Deontic.” Ethical Theory and Moral Practice 14 (5): 493–501.
  • Herman, B. 1998. “Making Room for Character.” In Aristotle, Kant, and the Stoics: Rethinking Happiness and Duty, edited by S. Engstrom and J. Whiting, 36-60. Cambridge: Cambridge University Press.
  • Hudson, S. 1990. “What is Morality All About?” Philosophia: Philosophical Quarterly of Israel 20 (1-2): 3-13.
  • Hursthouse, R. 1991. “Virtue Theory and Abortion.” Philosophy and Public Affairs 20 (3): 223-246.
  • Hursthouse, R. 2010. On Virtue Ethics. Oxford: Oxford University Press.
  • Hursthouse, R. 2007. “Environmental Virtue Ethics.” In Working Virtue: Virtue Ethics and Contemporary Moral Problems, edited by R.L. Walker and P.J. Ivanhoe, 155-171. New York: Oxford University Press.
  • Hutcheson, F. 2005. A System of Moral Philosophy, in Two Books [1755]. London: Continuum International Publishing Group.
  • Inwood, B. and P. Donini. 2007. “Stoic Ethics.” In The Cambridge History of Hellenistic Philosophy, edited by K. Algra, J. Barnes, J. Mansfield and M. Schofield, 675-736. Cambridge:  Cambridge University Press.
  • Jost, L. and J. Wuerth. 2011. Perfecting Virtue: New Essays on Kantian Ethics and Virtue Ethics. Cambridge: Cambridge University Press.
  • Kant, I. 1991. Metaphysics of Morals [1797]. Translated by M. Gregor. Cambridge: Cambridge University Press.
  • Kant, I. 1997. Critique of Practical Reason [1788]. Translated by M. Gregor. Cambridge: Cambridge University Press.
  • Kant, I. 2005. Groundwork for the Metaphysics of Morals [1785]. Translated by A.W. Wood. New Haven CT:  Yale University Press.
  • Korsgaard, C.M. 1996. Sources of Normativity. Cambridge: Cambridge University Press.
  • Korsgaard, C.M. 1998. “From Duty and for the Sake of the Noble: Kant and Aristotle on Morally Good Action.” In Aristotle, Kant, and the Stoics: Rethinking Happiness and Duty, edited by S. Engstrom and J. Whiting, 203-236. Cambridge: Cambridge University Press.
  • Long, A.A. 2007. “The Socratic Legacy.” In The Cambridge History of Hellenistic Philosophy, edited by K. Algra, J. Barnes, J. Mansfield and M. Schofield, 617-639. Cambridge: Cambridge University Press.
  • Long, A.A. and D.N. Sedley. 2000. The Hellenistic Philosophers, Volume 2: Greek and Latin Texts with Notes and Bibliography. Cambridge: Cambridge University Press.
  • Long, A.A. and D.N. Sedley. 2011. The Hellenistic Philosophers, Volume 1: Translations of the Principal Sources, with Philosophical Commentary. Cambridge: Cambridge University Press.
  • MacIntyre, A. 1981.  After Virtue. London: Duckworth.
  • Mill, J.S. 1998. Utilitarianism [1863]. Edited by R. Crisp. Oxford: Oxford University Press.
  • Nussbaum, M.C. 1992. “Human Functioning and Social Justice. In Defense of Aristotelian Essentialism.” Political Theory 20 (2): 202-246.
  • Nussbaum, M. 1993. “Non-Relative Virtues: An Aristotelian Approach.” In The Quality of Life, edited by M.C. Nussbaum and A. Sen, 1-6. New York: Oxford University Press.
  • Nussbaum, M. 1995. “Aristotle on Human Nature and the Foundations of Ethics.” In World, Mind, and Ethics: Essays on the Ethical Philosophy of Bernard Williams, edited by J.E.J. Altham and R. Harrison, 86-131. Cambridge, New York: Cambridge University Press.
  • Paley, W. 2002. The Principles of Moral and Political Philosophy [1785]. Indianapolis: Liberty Fund.
  • Paton, H. J. 1967. The Categorical Imperative: A Study in Kant’s Moral Philosophy. London: Hutchinson & co.
  • Regan, T. 1985.  The Case for Animal Rights. Berkeley: University of California Press.
  • Russell, D.C. 2009. Practical Intelligence and the Virtues. Oxford: Clarendon Press.
  • Schneewind, J.B. 1998. “Kant and Stoic Ethics.” In Aristotle, Kant, and the Stoics: Rethinking Happiness and Duty, edited by S. Engstrom and J. Whiting, 285-302. Cambridge, New York: Cambridge University Press.
  • Schopenhauer, A. 1995. On the Basis of Morality [1841]. Translated by E.F.J. Payne. Providence: Berghahn Books.
  • Slote, M. 2001. Morals from Motives. Oxford: Oxford University Press.
  • Steinfath, H. 2000. “Ethik und Moral.” In Vorlesung: Typen ethischer Theorien, 1-21 (unpublished).
  • Stocker, M. 1973. “Rightness and Goodness: Is There a Difference?” American Philosophical Quarterly 10 (2): 87–98.
  • Swanton, C. 2003. Virtue Ethics: A Pluralistic View. Oxford: Oxford University Press.
  • Timmermann, J. 2010. Kant’s Groundwork of the Metaphysics of Morals. Cambridge: Cambridge University Press.
  • Urstad, K. 2009. “Pathos, Pleasure and the Ethical Life in Aristippus.” Journal of Ancient Philosophy III (1): 1-22.
  • Williams, B. 1985. Ethics and the Limits of Philosophy. Cambridge MA: Harvard University Press.
  • Wolf, U. 2002. Aristoteles’ “Nikomachische Ethik”. Darmstadt: Wissenschaftliche Buchgesellschaft.
  • Zeller, E. 1883. Geschichte der griechischen Philosophie. Stuttgart: Magnus.


Author Information

John-Stewart Gordon
Email: jgordon@uni-koeln.de
University of Cologne, Germany
Vytautas Magnus University Kaunas, Lithuania

Simplicity in the Philosophy of Science

The view that simplicity is a virtue in scientific theories and that, other things being equal, simpler theories should be preferred to more complex ones has been widely advocated in the history of science and philosophy, and it remains widely held by modern scientists and philosophers of science. It often goes by the name of “Ockham’s Razor.” The claim is that simplicity ought to be one of the key criteria for evaluating and choosing between rival theories, alongside criteria such as consistency with the data and coherence with accepted background theories. Simplicity, in this sense, is often understood ontologically, in terms of how simple a theory represents nature as being—for example, a theory might be said to be simpler than another if it posits the existence of fewer entities, causes, or processes in nature in order to account for the empirical data. However, simplicity can also be understood in terms of various features of how theories go about explaining nature—for example, a theory might be said to be simpler than another if it contains fewer adjustable parameters, if it invokes fewer extraneous assumptions, or if it provides a more unified explanation of the data.

Preferences for simpler theories are widely thought to have played a central role in many important episodes in the history of science. Simplicity considerations are also regarded as integral to many of the standard methods that scientists use for inferring hypotheses from empirical data, the most common illustration of this being the practice of curve-fitting. Indeed, some philosophers have argued that a systematic bias towards simpler theories and hypotheses is a fundamental component of inductive reasoning quite generally.

However, though the legitimacy of choosing between rival scientific theories on grounds of simplicity is frequently taken for granted, or viewed as self-evident, this practice raises a number of very difficult philosophical problems. A common concern is that notions of simplicity appear vague, and judgments about the relative simplicity of particular theories appear irredeemably subjective. Thus, one problem is to explain more precisely what it is for theories to be simpler than others and how, if at all, the relative simplicity of theories can be objectively measured. In addition, even if we can get clearer about what simplicity is and how it is to be measured, there remains the problem of explaining what justification, if any, can be provided for choosing between rival scientific theories on grounds of simplicity. For instance, do we have any reason for thinking that simpler theories are more likely to be true?

This article provides an overview of the debate over simplicity in the philosophy of science. Section 1 illustrates the putative role of simplicity considerations in scientific methodology, outlining some common views of scientists on this issue, different formulations of Ockham’s Razor, and some commonly cited examples of simplicity at work in the history and current practice of science. Section 2 highlights the wider significance of the philosophical issues surrounding simplicity for central controversies in the philosophy of science and epistemology. Section 3 outlines the challenges facing the project of trying to precisely define and measure theoretical simplicity, and it surveys the leading measures of simplicity and complexity currently on the market. Finally, Section 4 surveys the wide variety of attempts that have been made to justify the practice of choosing between rival theories on grounds of simplicity.

Table of Contents

  1. The Role of Simplicity in Science
    1. Ockham’s Razor
    2. Examples of Simplicity Preferences at Work in the History of Science
      1. Newton’s Argument for Universal Gravitation
      2. Other Examples
    3. Simplicity and Inductive Inference
    4. Simplicity in Statistics and Data Analysis
  2. Wider Philosophical Significance of Issues Surrounding Simplicity
  3. Defining and Measuring Simplicity
    1. Syntactic Measures
    2. Goodman’s Measure
    3. Simplicity as Testability
    4. Sober’s Measure
    5. Thagard’s Measure
    6. Information-Theoretic Measures
    7. Is Simplicity a Unified Concept?
  4. Justifying Preferences for Simpler Theories
    1. Simplicity as an Indicator of Truth
      1. Nature is Simple
      2. Meta-Inductive Proposals
      3. Bayesian Proposals
      4. Simplicity as a Fundamental A Priori Principle
    2. Alternative Justifications
      1. Falsifiability
      2. Simplicity as an Explanatory Virtue
      3. Predictive Accuracy
      4. Truth-Finding Efficiency
    3. Deflationary Approaches
  5. Conclusion
  6. References and Further Reading

1. The Role of Simplicity in Science

There are many ways in which simplicity might be regarded as a desirable feature of scientific theories. Simpler theories are frequently said to be more “beautiful” or more “elegant” than their rivals; they might also be easier to understand and to work with. However, according to many scientists and philosophers, simplicity is not something that is merely to be hoped for in theories; nor is it something that we should only strive for after we have already selected a theory that we believe to be on the right track (for example, by trying to find a simpler formulation of an accepted theory). Rather, the claim is that simplicity should actually be one of the key criteria that we use to evaluate which of a set of rival theories is, in fact, the best theory, given the available evidence: other things being equal, the simplest theory consistent with the data is the best one.

This view has a long and illustrious history. It is now most commonly associated with the 14th-century philosopher William of Ockham (also spelt “Occam”), whose name is attached to the famous methodological maxim known as “Ockham’s razor”, often interpreted as enjoining us to prefer the simplest theory consistent with the available evidence. However, the view can be traced at least as far back as Aristotle. In his Posterior Analytics, Aristotle argued that nothing in nature was done in vain and nothing was superfluous, so our theories of nature should be as simple as possible. Several centuries later, at the beginning of the modern scientific revolution, Galileo espoused a similar view, holding that, “[n]ature does not multiply things unnecessarily; that she makes use of the easiest and simplest means for producing her effects” (Galilei, 1962, p396). Similarly, at the beginning of the third book of the Principia, Isaac Newton included the following principle among his “rules for the study of natural philosophy”:

  • No more causes of natural things should be admitted than are both true and sufficient to explain their phenomena.
    As the philosophers say: Nature does nothing in vain, and more causes are in vain when fewer will suffice. For Nature is simple and does not indulge in the luxury of superfluous causes. (Newton, 1999, p794 [emphasis in original]).

In the 20th century, Albert Einstein asserted that “our experience hitherto justifies us in believing that nature is the realisation of the simplest conceivable mathematical ideas” (Einstein, 1954, p274). More recently, the eminent physicist Steven Weinberg has claimed that he and his fellow physicists “demand simplicity and rigidity in our principles before we are willing to take them seriously” (Weinberg, 1993, p148-9), while the Nobel prize winning economist John Harsanyi has stated that “[o]ther things being equal, a simpler theory will be preferable to a less simple theory” (quoted in McAlleer, 2001, p296).

It should be noted, however, that not all scientists agree that simplicity should be regarded as a legitimate criterion for theory choice. The eminent biologist Francis Crick once complained, “[w]hile Occam’s razor is a useful tool in physics, it can be a very dangerous implement in biology. It is thus very rash to use simplicity and elegance as a guide in biological research” (Crick, 1988, p138). Similarly, here are a group of earth scientists writing in Science:

  • Many scientists accept and apply [Ockham’s Razor] in their work, even though it is an entirely metaphysical assumption. There is scant empirical evidence that the world is actually simple or that simple accounts are more likely than complex ones to be true. Our commitment to simplicity is largely an inheritance of 17th-century theology. (Oreskes et al, 1994, endnote 25)

Hence, while very many scientists assert that rival theories should be evaluated on grounds of simplicity, others are much more skeptical about this idea. Much of this skepticism stems from the suspicion that the cogency of a simplicity criterion depends on assuming that nature is simple (hardly surprising given the way that many scientists have defended such a criterion) and that we have no good reason to make such an assumption. Crick, for instance, seemed to think that such an assumption could make no sense in biology, given the patent complexity of the biological world. In contrast, some advocates of simplicity have argued that a preference for simple theories need not necessarily assume a simple world—for instance, even if nature is demonstrably complex in an ontological sense, we should still prefer comparatively simple explanations for nature’s complexity. Oreskes and others also emphasize that the simplicity principles of scientists such as Galileo and Newton were explicitly rooted in a particular kind of natural theology, which held that a simple and elegant universe was a necessary consequence of God’s benevolence. Today, there is much less enthusiasm for grounding scientific methods in theology (the putative connection between God’s benevolence and the simplicity of creation is theologically controversial in any case). Another common source of skepticism is the apparent vagueness of the notion of simplicity and the suspicion that scientists’ judgments about the relative simplicity of theories lack a principled and objective basis.

Even so, there is no doubting the popularity of the idea that simplicity should be used as a criterion for theory choice and evaluation. It seems to be explicitly ingrained into many scientific methods—for instance, standard statistical methods of data analysis (Section 1d). It has also spread far beyond philosophy and the natural sciences. A recent issue of the FBI Law Enforcement Bulletin, for instance, contained the advice that “[u]nfortunately, many people perceive criminal acts as more complex than they really are… the least complicated explanation of an event is usually the correct one” (Rothwell, 2006, p24).

a. Ockham’s Razor

Many scientists and philosophers endorse a methodological principle known as “Ockham’s Razor”. This principle has been formulated in a variety of different ways. In the early 21st century, it is typically just equated with the general maxim that simpler theories are “better” than more complex ones, other things being equal. Historically, however, it has been more common to formulate Ockham’s Razor as a more specific type of simplicity principle, often referred to as “the principle of parsimony”. Whether William of Ockham himself would have endorsed any of the wide variety of methodological maxims that have been attributed to him is a matter of some controversy (see Thorburn, 1918; entry on William of Ockham), since Ockham never explicitly referred to a methodological principle that he called his “razor”. However, a standard formulation of the principle of parsimony—one that seems to be reasonably close to the sort of principle that Ockham himself probably would have endorsed—is as the maxim “entities are not to be multiplied beyond necessity”. So stated, the principle is ontological, since it is concerned with parsimony with respect to the entities that theories posit the existence of in attempting to account for the empirical data. “Entity”, in this context, is typically understood broadly, referring not just to objects (for example, atoms and particles), but also to other kinds of natural phenomena that a theory may include in its ontology, such as causes, processes, properties, and so forth. Other, more general formulations of Ockham’s Razor are not exclusively ontological, and may also make reference to various structural features of how theories go about explaining nature, such as the unity of their explanations. The remainder of this section will focus on the more traditional ontological interpretation.

It is important to recognize that the principle, “entities are not to be multiplied beyond necessity” can be read in at least two different ways. One way of reading it is as what we can call an anti-superfluity principle (Barnes, 2000). This principle calls for the elimination of ontological posits from theories that are explanatorily redundant. Suppose, for instance, that there are two theories, T1 and T2, which both seek to explain the same set of empirical data, D. Suppose also that T1 and T2 are identical in terms of the entities that are posited, except for the fact that T2 entails an additional posit, b, that is not part of T1. So let us say that T1 posits a, while T2 posits a + b. Intuitively, T2 is a more complex theory than T1 because it posits more things. Now let us assume that both theories provide an equally complete explanation of D, in the sense that there are no features of D that the two theories cannot account for. In this situation, the anti-superfluity principle would instruct us to prefer the simpler theory, T1, to the more complex theory, T2. The reason for this is because T2 contains an explanatorily redundant posit, b, which does no explanatory work in the theory with respect to D. We know this because T1, which posits a alone, provides an equally adequate account of D as T2 does. Hence, we can infer that positing a alone is sufficient to acquire all the explanatory ability offered by T2, with respect to D; adding b does nothing to improve the ability of T2 to account for the data.

This sort of anti-superfluity principle underlies one important interpretation of “entities are not to be multiplied beyond necessity”: as a principle that invites us to get rid of superfluous components of theories. Here, an ontological posit is superfluous with respect to a given theory, T, in so far as it does nothing to improve T’s ability to account for the phenomena to be explained. This is how John Stuart Mill understood Ockham’s razor (Mill, 1867, p526). Mill also pointed to a plausible justification for the anti-superfluity principle: explanatorily redundant posits—those that have no effect on the ability of the theory to explain the data—are also posits that do not obtain evidential support from the data. This is because it is plausible that theoretical entities are evidentially supported by empirical data only to the extent that they can help us to account for why the data take the form that they do. If a theoretical entity fails to contribute to this end, then the data fails to confirm the existence of this entity. If we have no other independent reason to postulate the existence of this entity, then we have no justification for including this entity in our theoretical ontology.

Another justification that has been offered for the anti-superfluity principle is a probabilistic one. Note that T2 is a logically stronger theory than T1: T2 says that both a and b exist, while T1 says only that a exists. It is a consequence of the axioms of probability that a logically stronger theory can never be more probable than a logically weaker theory. Thus, so long as the probability of a existing and the probability of b existing are independent of each other, the probability of a existing is greater than zero, and the probability of b existing is less than 1, we can assert that Pr (a exists) > Pr (a exists & b exists), where Pr (a exists & b exists) = Pr (a exists) * Pr (b exists). According to this reasoning, we should therefore regard the claims of T1 as more probable a priori than the claims of T2, and this is a reason to prefer T1. However, one objection to this probabilistic justification for the anti-superfluity principle is that it does not fully explain why we dislike theories that posit explanatorily redundant entities: it cannot really be because they are logically stronger theories; rather, it is because they postulate entities that are unsupported by evidence.
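
Put formally (a minimal restatement of the reasoning above, writing A for “a exists” and B for “b exists”; the abbreviations are introduced here for brevity):

    \Pr(A \wedge B) = \Pr(A)\,\Pr(B) < \Pr(A),
    \quad \text{given that } A \text{ and } B \text{ are independent, } \Pr(A) > 0, \text{ and } \Pr(B) < 1.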

When the principle of parsimony is read as an anti-superfluity principle, it seems relatively uncontroversial. However, it is important to recognize that the vast majority of instances where the principle of parsimony is applied (or has been seen as applying) in science cannot be given an interpretation merely in terms of the anti-superfluity principle. This is because the phrase “entities are not to be multiplied beyond necessity” is normally read as what we can call an anti-quantity principle: theories that posit fewer things are (other things being equal) to be preferred to theories that posit more things, whether or not the relevant posits play any genuine explanatory role in the theories concerned (Barnes, 2000). This is a much stronger claim than the claim that we should razor off explanatorily redundant entities. The evidential justification for the anti-superfluity principle just described cannot be used to motivate the anti-quantity principle, since the reasoning behind this justification allows that we can posit as many things as we like, so long as all of the individual posits do some explanatory work within the theory. It merely tells us to get rid of theoretical ontology that, from the perspective of a given theory, is explanatorily redundant. It does not tell us that theories that posit fewer things when accounting for the data are better than theories that posit more things—that is, that sparser ontologies are better than richer ones.

Another important point about the anti-superfluity principle is that it does not give us a reason to assert the non-existence of the superfluous posit. Absence of evidence is not (by itself) evidence of absence. Hence, this version of Ockham’s razor is sometimes also referred to as an “agnostic” razor rather than an “atheistic” razor, since it only motivates us to be agnostic about the razored-off ontology (Sober, 1981). It seems that in most cases where Ockham’s razor is appealed to in science it is intended to support atheistic conclusions—the entities concerned are not merely cut out of our theoretical ontology, their existence is also denied. Hence, if we are to explain why such a preference is justified, we will need to look for a different justification. With respect to the probabilistic justification for the anti-superfluity principle described above, it is important to note that it is not an axiom of probability that Pr (a exists & b doesn’t exist) > Pr (a exists & b exists).

b. Examples of Simplicity Preferences at Work in the History of Science

It is widely believed that there have been numerous episodes in the history of science where particular scientific theories were defended by particular scientists and/or came to be preferred by the wider scientific community less for directly empirical reasons (for example, some telling experimental finding) than as a result of their relative simplicity compared to rival theories. Hence, the history of science is taken to demonstrate the importance of simplicity considerations in how scientists defend, evaluate, and choose between theories. One striking example is Isaac Newton’s argument for universal gravitation.

i. Newton’s Argument for Universal Gravitation

At the beginning of the third book of the Principia, subtitled “The system of the world”, Isaac Newton described four “rules for the study of natural philosophy”:

  • Rule 1 No more causes of natural things should be admitted than are both true and sufficient to explain their phenomena.
  • As the philosophers say: Nature does nothing in vain, and more causes are in vain when fewer will suffice. For Nature is simple and does not indulge in the luxury of superfluous causes.
  • Rule 2 Therefore, the causes assigned to natural effects of the same kind must be, so far as possible, the same.
  • Rule 3 Those qualities of bodies that cannot be intended and remitted [i.e., qualities that cannot be increased and diminished] and that belong to all bodies on which experiments can be made should be taken as qualities of all bodies universally.
  • For the qualities of bodies can be known only through experiments; and therefore qualities that square with experiments universally are to be regarded as universal qualities… Certainly idle fancies ought not to be fabricated recklessly against the evidence of experiments, nor should we depart from the analogy of nature, since nature is always simple and ever consonant with itself…
  • Rule 4 In experimental philosophy, propositions gathered from phenomena by induction should be considered either exactly or very nearly true notwithstanding any contrary hypotheses, until yet other phenomena make such propositions either more exact or liable to exceptions.
  • This rule should be followed so that arguments based on induction may not be nullified by hypotheses. (Newton, 1999, p794-796).

Here we see Newton explicitly placing simplicity at the heart of his conception of the scientific method. Rule 1 is a version of Ockham’s Razor which, despite the use of the word “superfluous”, has typically been read as an anti-quantity principle rather than an anti-superfluity principle (see Section 1a). It is taken to follow directly from the assumption that nature is simple, an assumption that is in turn taken to give rise to rules 2 and 3, both principles of inductive generalization (infer similar causes for similar effects, and assume to be universal in all bodies those properties found in all observed bodies). These rules play a crucial role in what follows, the centerpiece being the argument for universal gravitation.

After laying out these rules of method, Newton described several “phenomena”—what are in fact empirical generalizations, derived from astronomical observations, about the motions of the planets and their satellites, including the moon. From these phenomena and the rules of method, he then “deduced” several general theoretical propositions. Propositions 1, 2, and 3 state that the satellites of Jupiter, the primary planets, and the moon are attracted towards the centers of Jupiter, the sun, and the earth respectively by forces that keep them in their orbits (stopping them from following a linear path in the direction of their motion at any one time). These forces are also claimed to vary inversely with the square of the distance of the orbiting body (for example, Mars) from the center of the body about which it orbits (for example, the sun). These propositions are taken to follow from the phenomena, including the fact that the respective orbits can be shown to (approximately) obey Kepler’s law of areas and the harmonic law, and the laws of motion developed in book 1 of the Principia. Newton then asserted proposition 4: “The moon gravitates toward the earth and by the force of gravity is always drawn back from rectilinear motion and kept in its orbit” (p802). In other words, it is the force of gravity that keeps the moon in its orbit around the earth. Newton explicitly invoked rules 1 and 2 in the argument for this proposition (what has become known as the “moon-test”). First, astronomical observations told us how fast the moon accelerates towards the earth. Newton was then able to calculate what the acceleration of the moon would be at the earth’s surface, if it were to fall down to the earth. This turned out to be equal to the acceleration of bodies observed to fall in experiments conducted on earth. Since it is the force of gravity that causes bodies on earth to fall (Newton assumed his readers’ familiarity with “gravity” in this sense), and since both gravity and the force acting on the moon “are directed towards the center of the earth and are similar to each other and equal”, Newton asserted that “they will (by rules 1 and 2) have the same cause” (p805). Therefore, the forces that act on falling bodies on earth and that keep the moon in its orbit are one and the same: gravity. Given this, the force of gravity acting on terrestrial bodies could now be claimed to obey an inverse-square law. Through similar deployment of rules 1, 2, and 4, Newton was led to the claim that it is also gravity that keeps the planets in their orbits around the sun and the satellites of Jupiter and Saturn in their orbits, since these forces are also directed toward the centers of the sun, Jupiter, and Saturn, and display similar properties to the force of gravity on earth, such as the fact that they obey an inverse-square law. Therefore, the force of gravity was held to act on all planets universally. Through several more steps, Newton was eventually able to get to the principle of universal gravitation: that gravity is a mutually attractive force that acts on any two bodies whatsoever and is described by an inverse-square law, which says that each body attracts the other with a force of equal magnitude that is proportional to the product of the masses of the two bodies and inversely proportional to the squared distance between them.
From there, Newton was able to determine the masses and densities of the sun, Jupiter, Saturn, and the earth, and offer a new explanation for the tides of the seas, thus showing the remarkable explanatory power of this new physics.
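
In modern notation (not Newton’s own; the gravitational constant G was introduced and measured much later), the principle of universal gravitation is standardly written as:

    F = \frac{G \, m_1 m_2}{r^2}

where m_1 and m_2 are the masses of the two bodies and r is the distance between them.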

Newton’s argument has been the subject of much debate amongst historians and philosophers of science (for further discussion of the various controversies surrounding its structure and the accuracy of its premises, see Glymour, 1980; Cohen, 1999; Harper, 2002). However, one thing that seems to be clear is that his conclusions are by no means forced on us through simple deductions from the phenomena, even when combined with the mathematical theorems and general theory of motion outlined in book 1 of the Principia. No experiment or mathematical derivation from the phenomena demonstrated that it must be gravity that is the common cause of the falling of bodies on earth, the orbits of the moon, the planets and their satellites, much less that gravity is a mutually attractive force acting on all bodies whatsoever. Rather, Newton’s argument appears to boil down to the claim that if gravity did have the properties accorded to it by the principle of universal gravitation, it could provide a common causal explanation for all the phenomena, and his rules of method tell us to infer common causes wherever we can. Hence, the rules, which are in turn grounded in a preference for simplicity, play a crucial role in taking us from the phenomena to universal gravitation (for further discussion of the apparent link between simplicity and common cause reasoning, see Sober, 1988). Newton’s argument for universal gravitation can thus be seen as an argument to the putatively simplest explanation for the empirical observations.

ii. Other Examples

Numerous other putative examples of simplicity considerations at work in the history of science have been cited in the literature:

  • One of the most commonly cited concerns Copernicus’ arguments for the heliocentric theory of planetary motion. Copernicus placed particular emphasis on the comparative “simplicity” and “harmony” of the account that his theory gave of the motions of the planets compared with the rival geocentric theory derived from the work of Ptolemy. This argument appears to have carried significant weight for Copernicus’ successors, including Rheticus, Galileo, and Kepler, who all emphasized simplicity as a major motivation for heliocentrism. Philosophers have suggested various reconstructions of the Copernican argument (see for example, Glymour, 1980; Rosenkrantz, 1983; Forster and Sober, 1994; Myrvold, 2003; Martens, 2009). However, historians of science have questioned the extent to which simplicity could have played a genuine rather than purely rhetorical role in this episode. For example, it has been argued that there is no clear sense in which the Copernican system was in fact simpler than Ptolemy’s, and that geocentric systems such as the Tychonic system could be constructed that were at least as simple as the Copernican one (for discussion, see Kuhn, 1957; Palter, 1970; Cohen, 1985; Gingerich, 1993; Martens, 2009).
  • It has been widely claimed that simplicity played a key role in the development of Einstein’s theories of special and general relativity, and in the early acceptance of Einstein’s theories by the scientific community (see for example, Hesse, 1974; Holton, 1974; Schaffner, 1974; Sober, 1981; Pais, 1982; Norton, 2000).
  • Thagard (1988) argues that simplicity considerations played an important role in Lavoisier’s case against the existence of phlogiston and in favour of the oxygen theory of combustion.
  • Carlson (1966) describes several episodes in the history of genetics in which simplicity considerations seemed to have held sway.
  • Nolan (1997) argues that a preference for ontological parsimony played an important role in the discovery of the neutrino and in the proposal of Avogadro’s hypothesis.
  • Baker (2007) argues that ontological parsimony was a key issue in discussions over rival dispersalist and extensionist bio-geographical theories in the late 19th and early 20th century.

Though it is commonplace for scientists and philosophers to claim that simplicity considerations have played a significant role in the history of science, it is important to note that some skeptics have argued that the actual historical importance of simplicity considerations has been over-sold (for example, Bunge, 1961; Lakatos and Zahar, 1978). Such skeptics dispute the claim that we can only explain the basis for these and other episodes of theory change by according a role to simplicity, claiming other considerations actually carried more weight. In addition, it has been argued that, in many cases, what appear on the surface to have been appeals to the relative simplicity of theories were in fact covert appeals to some other theoretical virtue (for example, Boyd, 1990; Sober, 1994; Norton, 2003; Fitzpatrick, 2009). Hence, for any putative example of simplicity at work in the history of science, it is important to consider whether the relevant arguments are not best reconstructed in other terms (such a “deflationary” view of simplicity will be discussed further in Section 4c).

c. Simplicity and Inductive Inference

Many philosophers have come to see simplicity considerations figuring not only in how scientists go about evaluating and choosing between developed scientific theories, but also in the mechanics of making much more basic inductive inferences from empirical data. The standard illustration of this in the modern literature is the practice of curve-fitting. Suppose that we have a series of observations of the values of a variable, y, given values of another variable, x. This gives us a series of data points, as represented in Figure 1.

Figure 1. [The figure shows a scatter of xy data points, together with candidate curves fitted to them, such as the straight line H1 and a more complex curve H2, each passing close to the points.]

Given this data, what underlying relationship should we posit between x and y so that we can predict future pairs of xy values? Standard practice is not to select a bumpy curve that neatly passes through all the data points, but rather to select a smooth curve—preferably a straight line, such as H1—that passes close to the data. But why do we do this? Part of an answer comes from the fact that if the data is to some degree contaminated with measurement error (for example, through mistakes in data collection) or “noise” produced by the effects of uncontrolled factors, then any curve that fits the data perfectly will most likely be false. However, this does not explain our preference for a curve like H1 over an infinite number of other curves—H2, for instance—that also pass close to the data. It is here that simplicity has been seen as playing a vital, though often implicit role in how we go about inferring hypotheses from empirical data: H1 posits a “simpler” relationship between x and y than H2—hence, it is for reasons of simplicity that we tend to infer hypotheses like H1.
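
This preference can be made vivid with a small numerical sketch (the data, the noise level, and the polynomial degrees below are all invented for illustration; H1 and H2 here simply play the roles of a straight line and a bumpy curve threaded through the points):

    import numpy as np

    rng = np.random.default_rng(0)

    # Hypothetical data: a linear signal, y = 2x + 1, observed with noise.
    x = np.linspace(0, 1, 10)
    y = 2 * x + 1 + rng.normal(scale=0.1, size=x.size)

    # H1: a straight line fitted to the data (passes close to the points).
    h1 = np.polynomial.Polynomial.fit(x, y, deg=1)

    # H2: a degree-9 polynomial, which passes through all ten data points,
    # like a bumpy curve threaded neatly through the data.
    h2 = np.polynomial.Polynomial.fit(x, y, deg=9)

    # On new x values, the bumpy curve tracks the noise, not the signal.
    x_new = np.linspace(0, 1, 100)
    y_true = 2 * x_new + 1
    print("H1 mean squared error on new data:", np.mean((h1(x_new) - y_true) ** 2))
    print("H2 mean squared error on new data:", np.mean((h2(x_new) - y_true) ** 2))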

The practice of curve-fitting has been taken to show that—whether we are aware of it or not—human beings have a fundamental cognitive bias towards simple hypotheses. Whether we are deciding between rival scientific theories, or performing more basic generalizations from our experience, we ubiquitously tend to infer the simplest hypothesis consistent with our observations. Moreover, this bias is held to be necessary in order for us to be able to select a unique hypothesis from the potentially limitless number of hypotheses consistent with any finite amount of experience.

The view that simplicity may often play an implicit role in empirical reasoning can arguably be traced back to David Hume’s description of enumerative induction in the context of his formulation of the famous problem of induction. Hume suggested that a tacit assumption of the uniformity of nature is ingrained into our psychology. Thus, we are naturally drawn to the conclusion that all ravens have black feathers from the fact that all previously observed ravens have black feathers because we tacitly assume that the world is broadly uniform in its properties. This has been seen as a kind of simplicity assumption: it is simpler to assume more of the same.

A fundamental link between simplicity and inductive reasoning has been retained in many more recent descriptive accounts of inductive inference. For instance, Hans Reichenbach (1949) described induction as an application of what he called the “Straight Rule”, modelling all inductive inference on curve-fitting. In addition, proponents of the model of “Inference to Best Explanation”, who hold that many inductive inferences are best understood as inferences to the hypothesis that would, if true, provide the best explanation for our observations, normally claim that simplicity is one of the criteria that we use to determine which hypothesis constitutes the “best” explanation.

In recent years, the putative role of simplicity in our inferential psychology has been attracting increasing attention from cognitive scientists. For instance, Lombrozo (2007) describes experiments that she claims show that participants use the relative simplicity of rival explanations (for instance, whether a particular medical diagnosis for a set of symptoms involves assuming the presence of one or multiple independent conditions) as a guide to assessing their probability, such that a disproportionate amount of contrary probabilistic evidence is required for participants to choose a more complex explanation over a simpler one. Simplicity considerations have also been seen as central to learning processes in many different cognitive domains, including language acquisition and category learning (for example, Chater, 1999; Lu and others, 2006).

d. Simplicity in Statistics and Data Analysis

Philosophers have long used the example of curve-fitting to illustrate the (often implicit) role played by considerations of simplicity in inductive reasoning from empirical data. However, partly due to the advent of low-cost computing power and the fact that scientists in many disciplines find themselves having to deal with ever larger and more intricate bodies of data, recent decades have seen a remarkable revolution in the methods available to scientists for analyzing and interpreting empirical data (Gauch, 2006). Importantly, there are now numerous formalized procedures for data analysis that can be implemented in computer software—and which are widely used in disciplines from engineering to crop science to sociology—that contain an explicit role for some notion of simplicity. The literature on such methods abounds with talk of “Ockham’s Razor”, “Occam factors”, “Ockham’s hill” (MacKay, 1992; Gauch, 2006), “Occam’s window” (Raftery and others, 1997), and so forth. This literature not only provides important illustrations of the role that simplicity plays in scientific practice, but may also offer insights for philosophers seeking to understand the basis for this role.

As an illustration, consider standard procedures for model selection, such as the Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), Minimum Message Length (MML) and Minimum Description Length (MDL) procedures, and numerous others (for discussion, see Forster and Sober, 1994; Forster, 2001; Gauch, 2003; Dowe and others, 2007). Model selection is a matter of selecting the kind of relationship that is to be posited between a set of variables, given a sample of data, in an effort to generate hypotheses about the true underlying relationship holding in the population of inference and/or to make predictions about future data. This question arises in the simple curve-fitting example discussed above—for instance, whether the true underlying relationship between x and y is linear, parabolic, quadratic, and so on. It also arises in lots of other contexts, such as the problem of inferring the causal relationship that exists between an empirical effect and a set of variables. “Models” in this sense are families of functions, such as the family of linear functions, LIN: y = a + bx, or the family of parabolic functions, PAR: y = a + bx + cx². The simplicity of a model is normally explicated in terms of the number of adjustable parameters it contains (MML and MDL measure the simplicity of models in terms of the extent to which they provide compact descriptions of the data, but produce similar results to the counting of adjustable parameters). On this measure, the model LIN is simpler than PAR, since LIN contains two adjustable parameters, whereas PAR has three. A consequence of this is that a more complex model will always be able to fit a given sample of data better than a simpler model (“fitting” a model to the data involves using the data to determine what the values of the parameters in the model should be, given that data—that is, identifying the best-fitting member of the family). For instance, returning to the curve-fitting scenario represented in Figure 1, the best-fitting curve in PAR is guaranteed to fit this data set at least as well as the best-fitting member of the simpler model, LIN, and this is true no matter what the data are, since linear functions are special cases of parabolas, where c = 0, so any curve that is a member of LIN is also a member of PAR.

Model selection procedures produce a ranking of all the models under consideration in light of the data, thus allowing scientists to choose between them. Though they do it in different ways, AIC, BIC, MML, and MDL all implement procedures for model selection that impose a penalty on the complexity of a model, so that a more complex model will have to fit the data sample at hand significantly better than a simpler one for it to be rated higher than the simpler model. Often, this penalty is greater the smaller the sample of data. Interestingly—and contrary to the assumptions of some philosophers—this seems to suggest that simplicity considerations do not only come into play as a tiebreaker between theories that fit the data equally well: according to the model selection literature, simplicity sometimes trumps better fit to the data. Hence, simplicity need not only come into play when all other things are equal.
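
As a concrete illustration of how such a penalty works, here is a sketch of an AIC comparison between LIN and PAR (a sketch only: it assumes Gaussian noise, uses the common least-squares form of AIC, and runs on an invented data sample in which the true relationship is linear):

    import numpy as np

    rng = np.random.default_rng(1)

    # Invented sample: the true underlying relationship is linear.
    x = np.linspace(0, 10, 20)
    y = 3.0 + 0.5 * x + rng.normal(scale=1.0, size=x.size)

    def aic(deg):
        # Least-squares AIC: n * ln(RSS / n) + 2k, where k is the number
        # of adjustable parameters (degree + 1 for a polynomial model).
        coeffs = np.polyfit(x, y, deg)
        rss = np.sum((np.polyval(coeffs, x) - y) ** 2)
        return x.size * np.log(rss / x.size) + 2 * (deg + 1)

    # PAR always fits the sample at least as well as LIN, but its extra
    # parameter is penalized; the lower AIC score marks the preferred model.
    print("AIC for LIN (y = a + bx):      ", aic(1))
    print("AIC for PAR (y = a + bx + cx²):", aic(2))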

Both statisticians and philosophers of statistics have vigorously debated the underlying justification for these sorts of model selection procedures (see, for example, the papers in Zellner and others, 2001). However, one motivation for taking into account the simplicity of models derives from a piece of practical wisdom: when there is error or “noise” in the data sample, a relatively simple model that fits the sample less well will often be more accurate when it comes to predicting extra-sample (for example, future) data than a more complex model that fits the sample more closely. The logic here is that since more complex models are more flexible in their ability to fit the data (since they have more adjustable parameters), they also have a greater propensity to be misled by errors and noise, in which case they may recover less of the true underlying “signal” in the sample. Thus, constraining model complexity may facilitate greater predictive accuracy. This idea is captured in what Gauch (2003, 2006) (following MacKay, 1992) calls “Ockham’s hill”. To the left of the peak of the hill, increasing the complexity of a model improves its accuracy with respect to extra-sample data because this recovers more of the signal in the sample. However, after the peak, increasing complexity actually diminishes predictive accuracy because this leads to over-fitting to spurious noise in the sample. There is therefore an optimal trade-off (at the peak of Ockham’s hill) between simplicity and fit to the sample data when it comes to facilitating accurate prediction of extra-sample data. Indeed, this trade-off is essentially the core idea behind AIC, the development of which initiated the now enormous literature on model selection, and the philosophers Malcolm Forster and Elliott Sober have sought to use such reasoning to make sense of the role of simplicity in many areas of science (see Section 4biii).
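
The shape of Ockham’s hill can be reproduced in simulation (again a sketch with invented data: a quadratic signal plus noise, with polynomial degree standing in for model complexity):

    import numpy as np

    rng = np.random.default_rng(2)

    def sample(n):
        # Invented example: a quadratic signal observed with noise.
        x = rng.uniform(-1, 1, n)
        return x, x ** 2 - 0.5 * x + rng.normal(scale=0.3, size=n)

    x_train, y_train = sample(15)   # small, noisy data sample
    x_test, y_test = sample(500)    # stand-in for extra-sample data

    # Predictive accuracy first improves with complexity (more of the
    # signal is recovered), then degrades past the peak of the hill as
    # the model starts over-fitting spurious noise in the sample.
    for deg in range(10):
        coeffs = np.polyfit(x_train, y_train, deg)
        mse = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
        print(f"degree {deg}: extra-sample error {mse:.3f}")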

One important implication of this apparent link between model simplicity and predictive accuracy is that interpreting sample data using relatively simple models may improve the efficiency of experiments by allowing scientists to do more with less data—for example, scientists may be able to run a costly experiment fewer times before they can be in a position to make relatively accurate predictions about the future. Gauch (2003, 2006) describes several real world cases from crop science and elsewhere where this gain in accuracy and efficiency from the use of relatively simple models has been documented.

2. Wider Philosophical Significance of Issues Surrounding Simplicity

The putative role of simplicity, both in the evaluation of rival scientific theories and in the mechanics of how we go about inferring hypotheses from empirical data, clearly raises a number of difficult philosophical issues. These include, but are by no means limited to: (1) the question of what precisely it means to say that one theory or hypothesis is simpler than another and how the relative simplicity of theories is to be measured; (2) the question of what rational justification (if any) can be provided for choosing between rival theories on grounds of simplicity; and (3) the closely related question of what weight simplicity considerations ought to carry in theory choice relative to other theoretical virtues, particularly if these sometimes have to be traded-off against each other. (For general surveys of the philosophical literature on these issues, see Hesse, 1967; Sober, 2001a, 2001b). Before we delve more deeply into how philosophers have sought to answer these questions, it is worth noting the close connections between philosophical issues surrounding simplicity and many of the most important controversies in the philosophy of science and epistemology.

First, the problem of simplicity has close connections with long-standing issues surrounding the nature and justification of inductive inference. Some philosophers have actually offered up the idea that simpler theories are preferable to less simple ones as a purported solution to the problem of induction: it is the relative simplicity of the hypotheses that we tend to infer from empirical observations that supposedly provides the justification for these inferences—thus, it is simplicity that provides the warrant for our inductive practices. This approach is not as popular as it once was, since it is taken to merely substitute the problem of induction for the equally substantive problem of justifying preferences for simpler theories. A more common view in the recent literature is that the problem of induction and the problem of justifying preferences for simpler theories are closely connected, or may even amount to the same problem. Hence, a solution to the latter problem will provide substantial help towards solving the former.

More generally, the ability to make sense of the putative role of simplicity in scientific reasoning has been seen by many to be a central desideratum for any adequate philosophical theory of the scientific method. For example, Thomas Kuhn’s (1962) influential discussion of the importance of scientists’ aesthetic preferences—including but not limited to judgments of simplicity—in scientific revolutions was a central part of his case for adopting a richer conception of the scientific method and of theory change in science than he found in the dominant logical empiricist views of the time. More recently, critics of the Bayesian approach to scientific reasoning and theory confirmation, which holds that sound inductive reasoning is reasoning according to the formal principles of probability, have claimed that simplicity is an important feature of scientific reasoning that escapes a Bayesian analysis. For instance, Forster and Sober (1994) argue that Bayesian approaches to curve-fitting and model selection (such as the Bayesian Information Criterion) cannot themselves be given a Bayesian rationale, nor can any other approach that builds in a bias towards simpler models. The ability of the Bayesian approach to make sense of simplicity in model selection and other aspects of scientific practice has thus been seen as central to evaluating its promise (see for example, Glymour, 1980; Forster and Sober, 1994; Forster, 1995; Kelly and Glymour, 2004; Howson and Urbach, 2006; Dowe and others, 2007).

Discussions over the legitimacy of simplicity as a criterion for theory choice have also been closely bound up with debates over scientific realism. Scientific realists assert that scientific theories aim to offer a literally true description of the world and that we have good reason to believe that the claims of our current best scientific theories are at least approximately true, including those claims that purport to be about “unobservable” natural phenomena that are beyond our direct perceptual access. Some anti-realists object that it is possible to formulate incompatible alternatives to our current best theories that are just as consistent with any current data that we have, perhaps even any future data that we could ever collect. They claim that we can therefore never be justified in asserting that the claims of our current best theories, especially those concerning unobservables, are true, or approximately true. A standard realist response is to emphasize the role of the so-called “theoretical virtues” in theory choice, among which simplicity is normally listed. The claim is thus that we rule out these alternative theories because they are unnecessarily complex. Importantly, for this defense to work, realists have to defend the idea that not only are we justified in choosing between rival theories on grounds of simplicity, but also that simplicity can be used as a guide to the truth. Naturally, anti-realists, particularly those of an empiricist persuasion (for example, van Fraassen, 1989), have expressed deep skepticism about the alleged truth-conduciveness of a simplicity criterion.

3. Defining and Measuring Simplicity

The first major philosophical problem that seems to arise from the notion that simplicity plays a role in theory choice and evaluation concerns specifying in more detail what it means to say that one theory is simpler than another and how the relative simplicity of theories is to be precisely and objectively measured. Numerous attempts have been made to formulate definitions and measures of theoretical simplicity, all of which face very significant challenges. Philosophers have not been the only ones to contribute to this endeavour. For instance, over the last few decades, a number of formal measures of simplicity and complexity have been developed in mathematical information theory. This section provides an overview of some of the main simplicity measures that have been proposed and the problems that they face. The proposals described here have also normally been tied to particular proposals about what justifies preferences for simpler theories. However, discussion of these justifications will be left until Section 4.

To begin with, it is worth considering why providing a precise definition and measure of theoretical simplicity ought to be regarded as a substantial philosophical problem. After all, it often seems that when one is confronted with a set of rival theories designed to explain a particular empirical phenomenon, it is just obvious which is the simplest. One does not always need a precise definition or measure of a particular property to be able to tell whether or not something exhibits it to a greater degree than something else. Hence, it could be suggested that if there is a philosophical problem here, it is only of very minor interest and certainly of little relevance to scientific practice. There are, however, some reasons to regard this as a substantial philosophical problem, which also has some practical relevance.

First, it is not always easy to tell whether one theory really ought to be regarded as simpler than another, and it is not uncommon for practicing scientists to disagree about the relative simplicity of rival theories. A well-known historical example is the disagreement between Galileo and Kepler concerning the relative simplicity of Copernicus’ theory of planetary motion, according to which the planets move only in perfect circular orbits with epicycles, and Kepler’s theory, according to which the planets move in elliptical orbits (see Holton, 1974; McAllister, 1996). Galileo held to the idea that perfect circular motion is simpler than elliptical motion. In contrast, Kepler emphasized that an elliptical model of planetary motion required many fewer orbits than a circular model and enabled a reduction of all the planetary motions to three fundamental laws of planetary motion. The problem here is that scientists seem to evaluate the simplicity of theories along a number of different dimensions that may conflict with each other. Hence, we have to deal with the fact that a theory may be regarded as simpler than a rival in one respect and more complex in another. To illustrate this further, consider the following list of commonly cited ways in which theories may be held to be simpler than others:

  • Quantitative ontological parsimony (or economy): postulating a smaller number of independent entities, processes, causes, or events.
  • Qualitative ontological parsimony (or economy): postulating a smaller number of independent kinds or classes of entities, processes, causes, or events.
  • Common cause explanation: accounting for phenomena in terms of common rather than separate causal processes.
  • Symmetry: postulating that equalities hold between interacting systems and that the laws describing the phenomena look the same from different perspectives.
  • Uniformity (or homogeneity): postulating a smaller number of changes in a given phenomenon and holding that the relations between phenomena are invariant.
  • Unification: explaining a wider and more diverse range of phenomena that might otherwise be thought to require separate explanations in a single theory (theoretical reduction is generally held to be a species of unification).
  • Lower level processes: when the kinds of processes that can be posited to explain a phenomenon come in a hierarchy, positing processes that come lower rather than higher in this hierarchy.
  • Familiarity (or conservativeness): explaining new phenomena with minimal new theoretical machinery, reusing existing patterns of explanation.
  • Paucity of auxiliary assumptions: invoking fewer extraneous assumptions about the world.
  • Paucity of adjustable parameters: containing fewer independent parameters that the theory leaves to be determined by the data.

As can be seen from this list, there is considerable diversity here. We can see that theoretical simplicity is frequently thought of in ontological terms (for example, quantitative and qualitative parsimony), but also sometimes as a structural feature of theories (for example, unification, paucity of adjustable parameters), and while some of these intuitive types of simplicity may often cluster together in theories—for instance, qualitative parsimony would seem to often go together with invoking common cause explanations, which would in turn often seem to go together with explanatory unification—there is also considerable scope for them pointing in different directions in particular cases. For example, a theory that is qualitatively parsimonious as a result of positing fewer different kinds of entities might be quantitatively unparsimonious as a result of positing more of a particular kind of entity; while the demand to explain in terms of lower-level processes rather than higher-level processes may conflict with the demand to explain in terms of common causes behind similar phenomena, and so on. There are also different possible ways of evaluating the simplicity of a theory with regard to any one of these intuitive types of simplicity. A theory may, for instance, come out as more quantitatively parsimonious than another if one focuses on the number of independent entities that it posits, but less parsimonious if one focuses on the number of independent causes it invokes. Consequently, it seems that if a simplicity criterion is actually to be applicable in practice, we need some way of resolving the disagreements that may arise between scientists about the relative simplicity of rival theories, and this requires a more precise measure of simplicity.

Second, as has already been mentioned, a considerable amount of the skepticism expressed both by philosophers and by scientists about the practice of choosing one theory over another on grounds of relative simplicity has stemmed from the suspicion that our simplicity judgments lack a principled basis (for example, Ackerman, 1961; Bunge, 1961; Priest, 1976). Disagreements between scientists, along with the multiplicity and scope for conflict between intuitive types of simplicity have been important contributors to this suspicion, leading to the view that for any two theories, T1 and T2, there is some way of evaluating their simplicity such that T1 comes out as simpler than T2, and vice versa. It seems, then, that an adequate defense of the legitimacy of a simplicity criterion needs to show that there are in fact principled ways of determining when one theory is indeed simpler than another. Moreover, in so far as there is also a justificatory issue to be dealt with, we also need to be clear about exactly what it is that we need to justify a preference for.

a. Syntactic Measures

One proposal is that the simplicity of theories can be precisely and objectively measured in terms of how briefly they can be expressed. For example, a natural way of measuring the simplicity of an equation is just to count the number of terms, or parameters that it contains. Similarly, we could measure the simplicity of a theory in terms of the size of the vocabulary—for example, the number of extra-logical terms—required to write down its claims. Such measures of simplicity are often referred to as syntactic measures, since they involve counting the linguistic elements required to state, or to describe the theory.

A major problem facing any such syntactic measure of simplicity is the problem of language variance. A measure of simplicity is language variant if it delivers different results depending on the language that is used to represent the theories being compared. Suppose, for example, that we measure the simplicity of an equation by counting the number of non-logical terms that it contains. This will produce the result that r = a will come out as simpler than x² + y² = a². However, this second equation is simply a transformation of the first into Cartesian co-ordinates, where r² = x² + y², and is hence logically equivalent. The intuitive proposal for measuring simplicity in curve-fitting contexts, according to which hypotheses are said to be simpler if they contain fewer parameters, is also language variant in this sense. How many parameters a hypothesis contains depends on the co-ordinate scales that one uses. For any two non-identical functions, F and G, there is some way of transforming the co-ordinate scales such that we can turn F into a linear curve and G into a non-linear curve, and vice versa.
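
To make the equivalence explicit (the standard polar-to-Cartesian substitutions, with a ≥ 0 assumed so that the radius is well defined):

    x = r\cos\theta, \qquad y = r\sin\theta, \qquad r^2 = x^2 + y^2,
    \quad \text{so} \quad r = a \iff x^2 + y^2 = a^2.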

Nelson Goodman’s (1983) famous “new riddle of induction” allows us to formulate another example of the problem of language variance. Suppose all previously observed emeralds have been green. Now consider the following hypotheses about the color properties of the entire population of emeralds:

  • H1: all emeralds are green
  • H2: all emeralds first observed prior to time t are green and all emeralds first observed after time t are blue (where t is some future time)

Intuitively, H1 seems to be a simpler hypothesis than H2. To begin with, it can be stated with a smaller vocabulary. H1 also seems to postulate uniformity in the properties of emeralds, while H2 posits non-uniformity. For instance, H2 seems to assume that there is some link between the time at which an emerald is first observed and its properties. Thus it can be viewed as including an additional time parameter. But now consider Goodman’s invented predicates, “grue” and “bleen”. These have been defined in a variety of different ways, but let us define them here as follows: an object is grue if it is first observed before time t and the object is green, or first observed after t and the object is blue; an object is bleen if it is first observed before time t and the object is blue, or first observed after time t and the object is green. With these predicates, we can define a further property, “grolor”. Grue and bleen are grolors just as green and blue are colors. Now, because of the way that grolors are defined, color predicates like “green” and “blue” can also be defined in terms of grolor predicates: an object is green if first observed before time t and the object is grue, or first observed after time t and the object is bleen; an object is blue if first observed before time t and the object is bleen, or first observed after t and the object is grue. This means that statements that are expressed in terms of green and blue can also be expressed in terms of grue and bleen. So, we can rewrite H1 and H2 as follows:

  • H1: all emeralds first observed prior to time t are grue and all emeralds first observed after time t are bleen (where t is some future time)
  • H2: all emeralds are grue

Recall that earlier we judged H1 to be simpler than H2. However, if we are to retain that simplicity judgment, we cannot say that H1 is simpler than H2 because it can be stated with a smaller vocabulary; nor can we say that H1 posits greater uniformity, and is hence simpler, because it does not contain a time parameter. This is because simplicity judgments based on such syntactic features can be reversed merely by switching the language used to represent the hypotheses from a color language to a grolor language.
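
The translation is mechanical enough to be written down explicitly. The following toy sketch (Python; the critical time t and the sample observations are invented for illustration) implements the definitions above and checks that “green” is recoverable from the grolor predicates:

    T = 100  # the hypothetical critical time t

    def grue(first_observed, color):
        return (first_observed < T and color == 'green') or \
               (first_observed >= T and color == 'blue')

    def bleen(first_observed, color):
        return (first_observed < T and color == 'blue') or \
               (first_observed >= T and color == 'green')

    def green_via_grolors(first_observed, color):
        # an object is green if first observed before t and grue,
        # or first observed after t and bleen
        if first_observed < T:
            return grue(first_observed, color)
        return bleen(first_observed, color)

    # the grolor definition of "green" agrees with the color predicate
    for when, col in [(10, 'green'), (10, 'blue'), (150, 'green'), (150, 'blue')]:
        assert green_via_grolors(when, col) == (col == 'green')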

Examples such as these have been taken to show two things. First, no syntactic measure of simplicity can suffice to produce a principled simplicity ordering, since all such measures will produce different results depending on the language of representation that is used. It is not enough just to stipulate that we should evaluate simplicity in one language rather than another, since that would not explain why simplicity should be measured in that way. In particular, we want to know that our chosen language is accurately tracking the objective language-independent simplicity of the theories being compared. Hence, if a syntactic measure of simplicity is to be used, say for practical purposes, it must be underwritten by a more fundamental theory of simplicity. Second, a plausible measure of simplicity cannot be entirely neutral with respect to all of the different claims about the world that the theory makes or can be interpreted as making. Because of the respective definitions of colors and grolors, any hypothesis that posits uniformity in color properties must posit non-uniformity in grolor properties. As Goodman emphasized, one can find uniformity anywhere if no restriction is placed on what kinds of properties should be taken into account. Similarly, it will not do to say that theories are simpler because they posit the existence of fewer entities, causes and processes, since, using Goodman-like manipulations, it is trivial to show that a theory can be regarded as positing any number of different entities, causes and processes. Hence, some principled restriction needs to be placed on which aspects of the content of theories are to be taken into account and which are to be disregarded when measuring their relative simplicity.

b. Goodman’s Measure

According to Nelson Goodman, an important component of the problem of measuring the simplicity of scientific theories is the problem of measuring the degree of systematization that a theory imposes on the world, since, for Goodman, to seek simplicity is to seek a system. In a series of papers in the 1940s and 50s, Goodman (1943, 1955, 1958, 1959) attempted to explicate a precise measure of theoretical systematization in terms of the logical properties of the set of concepts, or extra-logical terms, that make up the statements of the theory.

According to Goodman, scientific theories can be regarded as sets of statements. These statements contain various extra-logical terms, including property terms, relation terms, and so on. These terms can all be assigned predicate symbols. Hence, all the statements of a theory can be expressed in a first order language, using standard symbolic notation. For instance, “… is acid” may become “A(x)”, “… is smaller than ____” may become “S(x, y)”, and so on. Goodman then claims that we can measure the simplicity of the system of predicates employed by the theory in terms of their logical properties, such as their arity, reflexivity, transitivity, symmetry, and so on. The details are highly technical but, very roughly, Goodman’s proposal is that a system of predicates that can be used to express more is more complex than a system of predicates that can be used to express less. For instance, one of the axioms of Goodman’s proposal is that if every set of predicates of a relevant kind, K, is always replaceable by a set of predicates of another kind, L, then K is not more complex than L.

Part of Goodman’s project was to avoid the problem of language variance. Goodman’s measure is a linguistic measure, since it concerns measuring the simplicity of a theory’s predicate basis in a first order language. However, it is not a purely syntactic measure, since it does not involve merely counting linguistic elements, such as the number of extra-logical predicates. Rather, it can be regarded as an attempt to measure the richness of a conceptual scheme: conceptual schemes that can be used to say more are more complex than conceptual schemes that can be used to say less. Hence, a theory can be regarded as simpler if it requires a less expressive system of concepts.

Goodman developed his axiomatic measure of simplicity in considerable detail. However, Goodman himself only ever regarded it as a measure of one particular type of simplicity, since it only concerns the logical properties of the predicates employed by the theory. It does not, for example, take account of the number of entities that a theory postulates. Moreover, Goodman never showed how the measure could be applied to real scientific theories. It has been objected that even if Goodman’s measure could be applied, it would not discriminate between many theories that intuitively differ in simplicity—indeed, in the very kind of simplicity-as-systematization that Goodman wants to measure. For instance, it is plausible that the system of concepts used to express the Copernican theory of planetary motion is just as expressively rich as the system of concepts used to express the Ptolemaic theory, yet the former is widely regarded as considerably simpler than the latter, partly in virtue of its providing an intuitively more systematic account of the data (for discussion of the details of Goodman’s proposal and the objections it faces, see Kemeny, 1955; Suppes, 1956; Kyburg, 1961; Hesse, 1967).

c. Simplicity as Testability

It has often been argued that simpler theories say more about the world and hence are easier to test than more complex ones. C. S. Peirce (1931), for example, claimed that the simplest theories are those whose empirical consequences are most readily deduced and compared with observation, so that they can be eliminated more easily if they are wrong. Complex theories, on the other hand, tend to be less precise and allow for more wriggle room in accommodating the data. This apparent connection between simplicity and testability has led some philosophers to attempt to formulate measures of simplicity in terms of the relative testability of theories.

Karl Popper (1959) famously proposed one such testability measure of simplicity. Popper associated simplicity with empirical content: simpler theories say more about the world than more complex theories and, in so doing, place more restriction on the ways that the world can be. According to Popper, the empirical content of theories, and hence their simplicity, can be measured in terms of their falsifiability. The falsifiability of a theory concerns the ease with which the theory can be proven false, if the theory is indeed false. Popper argued that this could be measured in terms of the amount of data that one would need to falsify the theory. For example, on Popper’s measure, the hypothesis that x and y are linearly related, according to an equation of the form, y = a + bx, comes out as having greater empirical content and hence greater simplicity than the hypothesis that they are related according to a parabola of the form, y = a + bx + cx². This is because one only needs three data points to falsify the linear hypothesis, but one needs at least four data points to falsify the parabolic hypothesis. Thus Popper argued that empirical content, falsifiability, and hence simplicity, could be seen as equivalent to the paucity of adjustable parameters. John Kemeny (1955) proposed a similar testability measure, according to which theories are more complex if they can come out as true in more ways in an n-member universe, where n is the number of individuals that the universe contains.
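
A small numerical sketch (Python, using numpy; the data points are invented) illustrates Popper's counting argument:

    import numpy as np

    pts = [(0.0, 1.0), (1.0, 3.0), (2.0, 9.0)]  # three invented data points
    x = np.array([p[0] for p in pts])
    y = np.array([p[1] for p in pts])

    # any two points can be fitted exactly by some line, so a third point
    # is the minimum needed to falsify the linear form
    line = np.polyfit(x[:2], y[:2], deg=1)
    print(np.polyval(line, 2.0))  # 5.0, but 9.0 was observed: falsified

    # any three points can be fitted exactly by some parabola, so at least
    # four points are needed to falsify the parabolic form
    parabola = np.polyfit(x, y, deg=2)
    print(np.polyval(parabola, x))  # reproduces (1.0, 3.0, 9.0) exactly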

Popper’s equation of simplicity with falsifiability suffers from some serious objections. First, it cannot be applied to comparisons between theories that make equally precise claims, such as a comparison between a specific parabolic hypothesis and a specific linear hypothesis, both of which specify precise values for their parameters and can be falsified by only one data point. It also cannot be applied when we compare theories that make probabilistic claims about the world, since probabilistic statements are not strictly falsifiable. This is particularly troublesome when it comes to accounting for the role of simplicity in the practice of curve-fitting, since one normally has to deal with the possibility of error in the data. As a result, an error distribution is normally added to the hypotheses under consideration, so that they are understood as conferring certain probabilities on the data, rather than as having deductive observational consequences. In addition, most philosophers of science now tend to think that falsifiability is not really an intrinsic property of theories themselves, but rather a feature of how scientists are disposed to behave towards their theories. Even deterministic theories normally do not entail particular observational consequences unless they are conjoined with particular auxiliary assumptions, usually leaving the scientist the option of saving the theory from refutation by tinkering with their auxiliary assumptions—a point famously emphasized by Pierre Duhem (1954). This makes it extremely difficult to maintain that simpler theories are intrinsically more falsifiable than less simple ones. Goodman (1961, pp. 150-151) also argued that equating simplicity with falsifiability leads to counter-intuitive consequences. The hypothesis, “All maple trees are deciduous”, is intuitively simpler than the hypothesis, “All maple trees whatsoever, and all sassafras trees in Eagleville, are deciduous”, yet, according to Goodman, the latter hypothesis is clearly the easier of the two to falsify. Kemeny’s measure inherits many of the same objections.

Both Popper and Kemeny essentially tried to link the simplicity of a theory with the degree to which it can accommodate potential future data: simpler theories are less accommodating than more complex ones. One interesting recent attempt to make sense of this notion of accommodation is due to Harman and Kulkarni (2007). Harman and Kulkarni analyze accommodation in terms of a concept drawn from statistical learning theory known as the Vapnik-Chervonenkis (VC) dimension. The VC dimension of a hypothesis can be roughly understood as a measure of the “richness” of the class of hypotheses from which it is drawn, where a class is richer if it is harder to find data that is inconsistent with some member of the class. Thus, a hypothesis drawn from a class that can fit any possible set of data will have infinite VC dimension. Though VC dimension shares some important similarities with Popper’s measure, there are important differences. Unlike Popper’s measure, it implies that accommodation is not always equivalent to the number of adjustable parameters. If we count adjustable parameters, sine curves of the form y = a sin bx come out as relatively unaccommodating; however, such curves have an infinite VC dimension. While Harman and Kulkarni do not propose that VC dimension be taken as a general measure of simplicity (in fact, they regard it as an alternative to simplicity in some scientific contexts), ideas along these lines might perhaps hold some future promise for testability/accommodation measures of simplicity. Similar notions of accommodation in terms of “dimension” have been used to explicate the notion of the simplicity of a statistical model in the face of the fact that the number of adjustable parameters a model contains is language variant (for discussion, see Forster, 1999; Sober, 2007).

d. Sober’s Measure

In his early work on simplicity, Elliott Sober (1975) proposed that the simplicity of theories be measured in terms of their question-relative informativeness. According to Sober, a theory is more informative if it requires less supplementary information from us in order for us to be able to use it to determine the answer to the particular questions that we are interested in. For instance, the hypothesis, y = 4x, is more informative and hence simpler than y = 2z + 2x with respect to the question, “what is the value of y?” This is because in order to find out the value of y one only needs to determine a value for x on the first hypothesis, whereas on the second hypothesis one also needs to determine a value for z. Similarly, Sober’s proposal can be used to capture the intuition that theories that say that a given class of things is uniform in its properties are simpler than theories that say that the class is non-uniform, because the former are more informative relative to particular questions about the properties of the class. For instance, the hypothesis that “all ravens are black” is more informative and hence simpler than “70% of ravens are black” with respect to the question, “what will be the color of the next observed raven?” This is because on the former hypothesis one needs no additional information in order to answer this question, whereas one will have to supplement the latter hypothesis with considerable extra information in order to generate a determinate answer.
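
As a toy rendering of the proposal (Python, using sympy; the helper inputs_needed is our own device, not Sober's), one can count the variables whose values must be supplied before each hypothesis yields an answer to “what is the value of y?”:

    import sympy as sp

    x, y, z = sp.symbols('x y z')

    h1 = sp.Eq(y, 4 * x)          # y = 4x
    h2 = sp.Eq(y, 2 * z + 2 * x)  # y = 2z + 2x

    def inputs_needed(eq, target):
        # the variables, other than the target, whose values must be supplied
        return eq.free_symbols - {target}

    print(inputs_needed(h1, y))  # {x}: one further piece of information
    print(inputs_needed(h2, y))  # {x, z}: two further pieces of information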

By relativizing the notion of the content-fullness of theories to the question that one is interested in, Sober’s measure avoids the problem, faced by Popper’s and Kemeny’s proposals, of the most arbitrarily specific theories, or theories made up of strings of irrelevant conjunctions of claims, turning out to be the simplest. Moreover, according to Sober’s proposal, the content of the theory must be relevant to answering the question for it to count towards the theory’s simplicity. This gives rise to the most distinctive element of Sober’s proposal: different simplicity orderings of theories will be produced depending on the question one asks. For instance, if we want to know what the relationship is between values of z and given values of y and x, then y = 2z + 2x will be more informative, and hence simpler, than y = 4x. Thus, a theory can be simple relative to some questions and complex relative to others.

Critics have argued that Sober’s measure produces a number of counter-intuitive results. Firstly, the measure cannot explain why people tend to judge an equation such as y = 3x + 4x² – 50 as more complex than an equation like y = 2x, relative to the question, “what is the value of y?” In both cases, one only needs a value of x to work out a value for y. Similarly, Sober’s measure fails to deal with Goodman’s above-cited counter-example to the idea that simplicity equates to testability, since it produces the counter-intuitive outcome that there is no difference in simplicity between “all maple trees whatsoever, and all sassafras trees in Eagleville, are deciduous” and “all maple trees are deciduous” relative to questions about whether maple trees are deciduous. The interest-relativity of Sober’s measure has also generated criticism from those who prefer to see simplicity as a property that varies only with what a given theory is being compared with, not with the question that one happens to be asking.

e. Thagard’s Measure

Paul Thagard (1988) proposed that simplicity ought to be understood as a ratio of the number of facts explained by a theory to the number of auxiliary assumptions that the theory requires. Thagard defines an auxiliary assumption as a statement, not part of the original theory, which is assumed in order for the theory to be able to explain one or more of the facts to be explained. Simplicity is then measured as follows:

  • Simplicity of T = (Facts explained by T – Auxiliary assumptions of T) / Facts explained by T

A value of 0 is given to a maximally complex theory that requires as many auxiliary assumptions as facts that it explains, and a value of 1 to a maximally simple theory that requires no auxiliary assumptions at all. Thus, the higher the ratio of facts explained to auxiliary assumptions, the simpler the theory. The essence of Thagard’s proposal is that we want to explain as much as we can, while making the fewest assumptions about the way the world is. By balancing the paucity of auxiliary assumptions against explanatory power, it prevents the unfortunate consequence of the simplest theories turning out to be those that are most anaemic.
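
The measure is simple enough to transcribe directly into code (Python; the fact and assumption counts in the example are invented):

    def thagard_simplicity(facts_explained, auxiliary_assumptions):
        # ratio form of Thagard's measure, running from 0 (maximally
        # complex) to 1 (maximally simple)
        return (facts_explained - auxiliary_assumptions) / facts_explained

    print(thagard_simplicity(10, 0))   # 1.0: no auxiliary assumptions needed
    print(thagard_simplicity(10, 10))  # 0.0: one assumption per fact explained
    print(thagard_simplicity(10, 3))   # 0.7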

A significant difficulty facing Thagard’s proposal lies in determining what the auxiliary assumptions of theories actually are and how to count them. It could be argued that the problem of counting auxiliary assumptions threatens to become as difficult as the original problem of measuring simplicity. What a theory must assume about the world for it to explain the evidence is frequently extremely unclear and even harder to quantify. In addition, some auxiliary assumptions are bigger and more onerous than others and it is not clear that they should be given equal weighting, as they are in Thagard’s measure. Another objection is that Thagard’s proposal struggles to make sense of things like ontological parsimony—the idea that theories are simpler because they posit fewer things—since it is not clear that parsimony per se would make any particular difference to the number of auxiliary assumptions required. In response to this objection, Thagard has argued that ontological parsimony is actually less important to practicing scientists than has often been thought.

f. Information-Theoretic Measures

Over the last few decades, a number of formal measures of simplicity and complexity have been developed in mathematical information theory. Though many of these measures have been designed for addressing specific practical problems, the central ideas behind them have been claimed to have significance for addressing the philosophical problem of measuring the simplicity of scientific theories.

One of the most prominent information-theoretic measures of simplicity in the current literature is Kolmogorov complexity, which is a formal measure of quantitative information content (see Li and Vitányi, 1997). The Kolmogorov complexity K(x) of an object x is the length in bits of the shortest binary program that can output a completely faithful description of x in some universal programming language, such as Java, C++, or LISP. This measure was originally formulated to measure randomness in data strings (such as sequences of numbers), and is based on the insight that non-random data strings can be “compressed” by finding the patterns that exist in them. If there are patterns in a data string, it is possible to provide a completely accurate description of it that is shorter than the string itself, in terms of the number of “bits” of information used in the description, by using the pattern as a mnemonic that eliminates redundant information that need not be encoded in the description. For instance, if the data string is an ordered sequence of 1s and 0s, where every 1 is followed by a 0, and every 0 by a 1, then it can be given a very short description that specifies the pattern, the value of the first data point and the number of data points. Any further information is redundant. Completely random data sets, however, contain no patterns, no redundancy, and hence are not compressible.
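
The compressibility idea can be illustrated with an off-the-shelf compressor standing in for the (uncomputable) shortest program. In the sketch below (Python), the zlib byte counts are only a rough proxy for K(x):

    import random
    import zlib

    patterned = ('10' * 5000).encode()  # every 1 followed by a 0: highly redundant
    random.seed(0)
    noisy = ''.join(random.choice('01') for _ in range(10000)).encode()

    print(len(zlib.compress(patterned)))  # a few dozen bytes for 10,000 characters
    print(len(zlib.compress(noisy)))      # far larger: little pattern to exploit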

It has been argued that Kolmogorov complexity can be applied as a general measure of the simplicity of scientific theories. Theories can be thought of as specifying the patterns that exist in the data sets they are meant to explain. As a result, we can also think of theories as compressing the data. Accordingly, the more a theory T compresses the data, the lower the value of K for the data using T, and the greater is its simplicity. An important feature of Kolmogorov complexity is that it is measured in a universal programming language and it can be shown that the difference in code length between the shortest code length for x in one universal programming language and the shortest code length for x in another programming language is no more than a constant c, which depends on the languages chosen, rather than x. This has been thought to provide some handle on the problem of language variance: Kolmogorov complexity can be seen as a measure of “objective” or “inherent” information content up to an additive constant. Due to this, some enthusiasts have gone so far as to claim that Kolmogorov complexity solves the problem of defining and measuring simplicity.

A number of objections have been raised against this application of Kolmogorov complexity. First, finding K(x) is a non-computable problem: no algorithm exists to compute it. This is claimed to be a serious practical limitation of the measure. Another objection is that Kolmogorov complexity produces some counter-intuitive results. For instance, theories that make probabilistic rather than deterministic predictions about the data must have maximum Kolmogorov complexity. A theory that says that a sequence of coin flips conforms to the probabilistic law, Pr(Heads) = ½, cannot be said to compress the data, since one cannot use this law to reconstruct the exact sequence of heads and tails, even though it offers an intuitively simple explanation of what we observe.

Other information-theoretic measures of simplicity, such as the Minimum Message Length (MML) and Minimum Description Length (MDL) measures, avoid some of the practical problems facing Kolmogorov complexity. Though there are important differences in the details of these measures (see Wallace and Dowe, 1999), they all adopt the same basic idea that the simplicity of an empirical hypothesis can be measured in terms of the extent to which it provides a compact encoding of the data.

A general objection to all such measures of simplicity is that scientific theories generally aim to do more than specify patterns in the data. They also aim to explain why these patterns are there and it is in relation to how theories go about explaining the patterns in our observations that theories have often been thought to be simple or complex. Hence, it can be argued that mere data compression cannot, by itself, suffice as an explication of simplicity in relation to scientific theories. A further objection to the data compression approach is that theories can be viewed as compressing data sets in a very large number of different ways, many of which we do not consider appropriate contributions to simplicity. The problem raised by Goodman’s new riddle of induction can be seen as the problem of deciding which regularities to measure: for example, color regularities or grolor regularities? Formal information-theoretic measures do not discriminate between different kinds of pattern finding. Hence, any such measure can only be applied once we specify the sorts of patterns and regularities that should be taken into account.

g. Is Simplicity a Unified Concept?

There is a general consensus in the philosophical literature that the project of articulating a precise general measure of theoretical simplicity faces very significant challenges. Of course, this has not stopped practicing scientists from utilizing notions of simplicity in their work, and particular concepts of simplicity—such as the simplicity of a statistical model, understood in terms of paucity of adjustable parameters or model dimension—are firmly entrenched in several areas of science. Given this, one potential way of responding to the difficulties that philosophers and others have encountered in this area—particularly in light of the apparent multiplicity and scope for conflict between intuitive explications of simplicity—is to raise the question of whether theoretical simplicity is in fact a unified concept at all. Perhaps there is no single notion of simplicity that is (or should be) employed by scientists, but rather a cluster of different, sometimes related, but also sometimes conflicting notions of simplicity that scientists find useful to varying degrees in particular contexts. This might be evidenced by the observation that scientists’ simplicity judgments often involve making trade-offs between different notions of simplicity. Kepler’s preference for an astronomical theory that abandoned perfectly circular motions for the planets, but which could offer a unified explanation of the astronomical observations in terms of three basic laws, over a theory that retained perfect circular motion, but could not offer a similarly unified explanation, seems to be a clear example of this.

As a result of thoughts in this sort of direction, some philosophers have argued that there is actually no single theoretical value here at all, but rather a cluster of them (for example, Bunge, 1961). It is also worth considering the possibility that which of the cluster is accorded greater weight than the others, and how each of them is understood in practice, may vary greatly across different disciplines and fields of inquiry. Thus, what really matters when it comes to evaluating the comparative “simplicity” of theories might be quite different for biologists than for physicists, for instance, and perhaps what matters to a particle physicist is different to what matters to an astrophysicist. If there is in fact no unified concept of simplicity at work in science, that might also indicate that there is no unitary justification for choosing between rival theories on grounds of simplicity. One important suggestion that this possibility has led to is that the role of simplicity in science cannot be understood from a global perspective, but can only be understood locally. How simplicity ought to be measured and why it matters may have a peculiarly domain-specific explanation.

4. Justifying Preferences for Simpler Theories

Due to the apparent centrality of simplicity considerations to scientific methods and the link between simplicity and numerous other important philosophical issues, the problem of justifying preferences for simpler theories is regarded as a major problem in the philosophy of science. It is also regarded as one of the most intractable. Though an extremely wide variety of justifications have been proposed—as with the debate over how to correctly define and measure simplicity, some important recent contributions have their origins in scientific literature in statistics, information theory, and other cognate fields—all of them have met with significant objections. There is currently no agreement amongst philosophers on what is the most promising path to take. There is also skepticism in some circles about whether an adequate justification is even possible.

Broadly speaking, justificatory proposals can be categorized into three types: 1) accounts that seek to show that simplicity is an indicator of truth (that is, that simpler theories are, in general, more likely to be true, or are somehow better confirmed by the empirical data than their more complex rivals); 2) accounts that do not regard simplicity as a direct indicator of truth, but which seek to highlight some alternative methodological justification for preferring simpler theories; 3) deflationary approaches, which actually reject the idea that there is a general justification for preferring simpler theories per se, but which seek to analyze particular appeals to simplicity in science in terms of other, less problematic, theoretical virtues.

a. Simplicity as an Indicator of Truth

i. Nature is Simple

Historically, the dominant view about why we should prefer simpler theories to more complex ones has been based on a general metaphysical thesis of the simplicity of nature. Since nature itself is simple, the relative simplicity of theories can thus be regarded as direct evidence for their truth. Such a view was explicitly endorsed by many of the great scientists of the past, including Aristotle, Copernicus, Galileo, Kepler, Newton, Maxwell, and Einstein. Naturally, however, the question arises as to what justifies the thesis that nature is simple. Broadly speaking, there have been two different sorts of argument given for this thesis: i) that a benevolent God must have created a simple and elegant universe; ii) that the past record of success of relatively simple theories entitles us to infer that nature is simple. The theological justification was most common amongst scientists and philosophers during the early modern period. Einstein, on the other hand, invoked a meta-inductive justification, claiming that the history of physics justifies us in believing that nature is the realization of the simplest conceivable mathematical ideas.

Despite the historical popularity and influence of this view, more recent philosophers and scientists have been extremely resistant to the idea that we are justified in believing that nature is simple. For a start, it seems difficult to formulate the thesis that nature is simple so that it is not either obviously false, or too vague to be of any use. There would seem to be many counter-examples to the claim that we live in a simple universe. Consider, for instance, the picture of the atomic nucleus that physicists were working with in the early part of the twentieth century: it was assumed that matter was made only of protons and electrons; there were no such things as neutrons or neutrinos and no weak or strong nuclear forces to be explained, only electromagnetism. Subsequent discoveries have arguably led to a much more complex picture of nature and much more complex theories have had to be developed to account for this. In response, it could be claimed that though nature seems to be complex in some superficial respects, there is in fact a deep underlying simplicity in the fundamental structure of nature. It might also be claimed that the respects in which nature appears to be complex are necessary consequences of its underlying simplicity. But this just serves to highlight the vagueness of the claim that nature is simple—what exactly does this thesis amount to, and what kind of evidence could we have for it?

However the thesis is formulated, it would seem to be an extremely difficult one to adequately defend, whether this be on theological or meta-inductive grounds. An attempt to give a theological justification for the claim that nature is simple suffers from an inherent unattractiveness to modern philosophers and scientists who do not want to ground the legitimacy of scientific methods in theology. In any case, many theologians reject the supposed link between God’s benevolence and the simplicity of creation. With respect to a meta-inductive justification, even if it were the case that the history of science demonstrates the better than average success of simpler theories, we may still raise significant worries about the extent to which this could give sufficient credence to the claim that nature is simple. First, it assumes that empirical success can be taken to be a reliable indicator of truth (or at least approximate truth), and hence of what nature is really like. Though this is a standard assumption for many scientific realists—the claim being that success would be “miraculous” if the theory concerned was radically false—it is a highly contentious one, since many anti-realists hold that the history of science shows that all theories, even eminently successful theories, typically turn out to be radically false. Even if one does accept a link between success and truth, our successes to date may still not provide a representative sample of nature: maybe we have only looked at the problems that are most amenable to simple solutions and the real underlying complexity of nature has escaped our notice. We can also question the degree to which we can extrapolate any putative connection between simplicity and truth in one area of nature to nature as a whole. Moreover, in so far as simplicity considerations are held to be fundamental to inductive inference quite generally, such an attempted justification risks a charge of circularity.

ii. Meta-Inductive Proposals

There is another way of appealing to past success in order to try to justify a link between simplicity and truth. Instead of trying to justify a completely general claim about the simplicity of nature, this proposal merely suggests that we can infer a correlation between success and very particular simplicity characteristics in particular fields of inquiry—for instance, a particular kind of symmetry in certain areas of theoretical physics. If success can be regarded as an indicator of at least approximate truth, we can then infer that theories that are simpler in the relevant sense are more likely to be true in fields where the correlation with success holds.

Recent examples of this sort of proposal include McAllister (1996) and Kuipers (2002). In an effort to account for the truth-conduciveness of aesthetic considerations in science, including simplicity, Theo Kuipers (2002) claims that scientists tend to become attracted to theories that share particular aesthetic features with successful theories that they have been previously exposed to. In other words, we can explain the particular aesthetic preferences that scientists have in terms similar to a well-documented psychological effect known as the “mere-exposure effect”, which occurs when individuals take a liking to something after repeated exposure to it. If, in a given field of inquiry, theories that have been especially successful exhibit a particular type of simplicity (however this is understood), and thus such theories have been repeatedly presented to scientists working in the field during their training, the mere-exposure effect will then lead these scientists to be attracted to other theories that also exhibit that same type of simplicity. This process can then be used to support an aesthetic induction to a correlation between simplicity in the relevant sense and success. One can then make a case that this type of simplicity can legitimately be taken as an indicator of at least approximate truth.

Even though this sort of meta-inductive proposal does not attempt to show that nature in general is simple, many of the same objections can be raised against it as are raised against the attempt to justify that metaphysical thesis by appeal to the past success of simple theories. Once again, there is the problem of justifying the claim that empirical success is a reliable guide to (approximate) truth. Kuipers’ own arguments for this claim rest on a somewhat idiosyncratic account of truth approximation. In addition, in order to legitimately infer that there is a genuine correlation between simplicity and success, one cannot just look at successful theories; one must look at unsuccessful theories too. Even if all the successful theories in a domain have the relevant simplicity characteristic, it might still be the case that the majority of theories with the characteristic have been (or would have been) highly unsuccessful. Indeed, if one can potentially modify a successful theory in an infinite number of ways while keeping the relevant simplicity characteristic, one might actually be able to guarantee that the majority of possible theories with the characteristic would be unsuccessful theories, thus breaking the correlation between simplicity and success. This could be taken as suggesting that in order to carry any weight, arguments from success also need to offer an explanation for why simplicity contributes to success. Moreover, though the mere-exposure effect is well documented, Kuipers provides no direct empirical evidence that scientists actually acquire their aesthetic preferences via the kind of process that he proposes.

iii. Bayesian Proposals

According to standard varieties of Bayesianism, we should evaluate scientific theories according to their probability conditional upon the evidence (posterior probability). This probability, Pr(T | E), is a function of three quantities:

  • Pr(T | E) = Pr(E | T) Pr(T) / Pr(E)

Pr(E | T) is the probability that the theory, T, confers on the evidence, E; it is referred to as the likelihood of T. Pr(T) is the prior probability of T, and Pr(E) is the probability of E. T is then held to have higher posterior probability than a rival theory, T*, if and only if:

  • Pr(E | T) Pr(T) > Pr(E | T*) Pr(T*)

A standard Bayesian proposal for understanding the role of simplicity in theory choice is that simplicity is one of the key determinants of Pr(T): other things being equal, simpler theories and hypotheses are held to have higher prior probability of being true than more complex ones. Thus, if two rival theories confer equal or near equal probability on the data, but differ in relative simplicity, other things being equal, the simpler theory will tend to have a higher posterior probability. This idea, which Harold Jeffreys called “the simplicity postulate”, has been elaborated in a number of different ways by philosophers, statisticians, and information theorists, utilizing various measures of simplicity (for example, Carnap, 1950; Jeffreys, 1957, 1961; Solomonoff, 1964; Li and Vitányi, 1997).
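
A toy numerical illustration (Python; all the probability values are invented) shows how such a prior bias can let a simpler theory overtake a complex rival that fits the data slightly better:

    def posterior(likelihood, prior, evidence_prob):
        # Bayes' theorem: Pr(T | E) = Pr(E | T) Pr(T) / Pr(E)
        return likelihood * prior / evidence_prob

    pr_E = 0.5                                 # Pr(E), common to both theories
    simple = posterior(0.6, 0.3, pr_E)         # lower likelihood, higher prior
    complex_rival = posterior(0.7, 0.2, pr_E)  # higher likelihood, lower prior

    print(simple, complex_rival)  # 0.36 0.28: the simpler theory ends up ahead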

In response to this proposal, Karl Popper (1959) argued that, in some cases, assigning a simpler theory a higher prior probability actually violates the axioms of probability. For instance, Jeffreys proposed that simplicity be measured by counting adjustable parameters. On this measure, the claim that the planets move in circular orbits is simpler than the claim that the planets move in elliptical orbits, since the equation for an ellipse contains an additional adjustable parameter. However, circles can also be viewed as special cases of ellipses, where the additional parameter is set to zero. Hence, the claim that planets move in circular orbits can also be seen as a special case of the claim that the planets move in elliptical orbits. If that is right, then the former claim cannot be more probable than the latter claim because the truth of the former entails the truth of the latter and probability respects entailment. In reply to Popper, it has been argued that this prior probabilistic bias towards simpler theories should only be seen as applying to comparisons between inconsistent theories where no relation of entailment holds between them—for instance, between the claim that the planets move in circular orbits and the claim that they move in elliptical but non-circular orbits.

The main objection to the Bayesian proposal that simplicity is a determinant of prior probability is that the theory of probability seems to offer no resources for explaining why simpler theories should be accorded higher prior probability. Rudolf Carnap (1950) thought that prior probabilities could be assigned a priori to any hypothesis stated in a formal language, on the basis of a logical analysis of the structure of the language and assumptions about the equi-probability of all possible states of affairs. However, Carnap’s approach has generally been recognized to be unworkable. If higher prior probabilities cannot be assigned to simpler theories on the basis of purely logical or mathematical considerations, then it seems that Bayesians must look outside of the Bayesian framework itself to justify the simplicity postulate.

Some Bayesians have taken an alternative route, claiming that a direct mathematical connection can be established between the simplicity of theories and their likelihood—that is, the value of Pr(E | T) (see Rosencrantz, 1983; Myrvold, 2003; White, 2005). This proposal depends on the assumption that simpler theories have fewer adjustable parameters, and hence are consistent with a narrower range of potential data. Suppose that we collect a set of empirical data, E, that can be explained by two theories that differ with respect to this kind of simplicity: a simple theory, S, and a complex theory, C. S has no adjustable parameters and only ever entails E, while C has an adjustable parameter, θ, which can take a range of values, n. When θ is set to some specific value, i, it entails E, but on other values of θ, C entails different and incompatible observations. It is then argued that S confers a higher probability on E. This is because C allows that lots of other possible observations could have been made instead of E (on different possible settings for θ). Hence, the truth of C would make our recording those particular observations less probable than would the truth of S. Here, the likelihood of C is calculated as the average of the likelihoods of each of the n versions of C, defined by a unique setting of θ. Thus, as the complexity of a theory increases—measured in terms of the number of adjustable parameters it contains—the number of versions of the theory that will give a low probability to E will increase and the overall value of Pr(E | T) will go down.
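
A minimal sketch (Python; the ten parameter settings and the single matching setting are invented) shows how averaging over the settings of θ dilutes the likelihood of the complex theory:

    n = 10               # possible settings of C's adjustable parameter theta
    pr_E_given_S = 1.0   # S has no adjustable parameters and entails E

    # C entails E on exactly one setting of theta (here, i = 3) and entails
    # incompatible observations on all other settings
    pr_E_given_C = sum(1.0 if i == 3 else 0.0 for i in range(n)) / n

    print(pr_E_given_S)  # 1.0
    print(pr_E_given_C)  # 0.1: the likelihood averaged over settings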

An objection to this proposal (Kelly, 2004, 2010) is that for us to be able to show that S has a higher posterior probability than C as a result of its having a higher likelihood, it must be assumed that the prior probability of C is not significantly greater than the prior probability of S. This is a substantive assumption to make because of the way that simplicity is defined in this argument. We can view C as coming in a variety of different versions, each of which is picked out by a different value given to θ. If we then assume that S and C have roughly equal prior probability we must, by implication, assume that each version of C has a very low prior probability compared to S, since the prior probability of each version of C would be Pr(C) / n (assuming that the theory does not say that any particular parameter setting is more probable than any of the others). This would effectively build in a very strong prior bias in favour of S over each version of C. Given that each version of C could be considered independently—that is, the complex theory could be given a simpler, more restricted formulation—this would require an additional supporting argument. The objection is thus that the proposal simply begs the question by resting on a prior probabilistic bias towards simpler theories. Another objection is that the proposal suffers from the limitation that it can only be applied to comparisons between theories where the simpler theory can be derived from the more complex one by fixing certain of its parameters. At best, this represents a small fraction of cases in which simplicity has been thought to play a role.

iv. Simplicity as a Fundamental A Priori Principle

In the light of the perceived failure of philosophers to justify the claim that simpler theories are more likely to be true, Richard Swinburne (2001) has argued that this claim has to be regarded as a fundamental a priori principle. Swinburne argues that it is just obvious that the criteria for theory evaluation that scientists use reliably lead them to make correct judgments about which theories are more likely to be true. Since, Swinburne argues, one of these criteria is that simpler theories are, other things being equal, more likely to be true, we just have to accept that simplicity is indeed an indicator of probable truth. However, Swinburne does not think that this connection between simplicity and truth can be established empirically, nor does he think that it can be shown to follow from some more obvious a priori principle. Hence, we have no choice but to regard it as a fundamental a priori principle—a principle that cannot be justified by anything more fundamental.

In response to Swinburne, it can be argued that this is hardly going to convince those scientists and philosophers for whom it is not at all obvious that simpler theories are more likely to be true.

b. Alternative Justifications

i. Falsifiability

Famously, Karl Popper (1959) rejected the idea that theories are ever confirmed by evidence and that we are ever entitled to regard a theory as true, or probably true. Hence, Popper did not think simplicity could be legitimately regarded as an indicator of truth. Rather, he argued that simpler theories are to be valued because they are more falsifiable. Indeed, Popper thought that the simplicity of theories could be measured in terms of their falsifiability, since intuitively simpler theories have greater empirical content, placing more restriction on the ways the world can be, thus leading to a reduced ability to accommodate any future data that we might discover. According to Popper, scientific progress consists not in the attainment of true theories, but in the elimination of false ones. Thus, the reason we should prefer more falsifiable theories is because such theories will be more quickly eliminated if they are in fact false. Hence, the practice of first considering the simplest theory consistent with the data provides a faster route to scientific progress. Importantly, for Popper, this meant that we should prefer simpler theories because they have a lower probability of being true, since, for any set of data, it is more likely that some complex theory (in Popper’s sense) will be able to accommodate it than a simpler theory.

Popper’s equation of simplicity with falsifiability suffers from some well-known objections and counter-examples, and these pose significant problems for his justificatory proposal (Section 3c). Another significant problem is that taking degree of falsifiability as a criterion for theory choice seems to lead to absurd consequences, since it encourages us to prefer absurdly specific scientific theories to those that have more general content. For instance, the hypothesis, “all emeralds are green until 11pm today when they will turn blue” should be judged as preferable to “all emeralds are green” because it is easier to falsify. It thus seems deeply implausible to say that selecting and testing such hypotheses first provides the fastest route to scientific progress.

ii. Simplicity as an Explanatory Virtue

A number of philosophers have sought to elucidate the rationale for preferring simpler theories to more complex ones in explanatory terms (for example, Friedman, 1974; Sober, 1975; Walsh, 1979; Thagard, 1988; Kitcher, 1989; Baker, 2003). These proposals have typically been made on the back of accounts of scientific explanation that explicate notions of explanatoriness and explanatory power in terms of unification, which is taken to be intimately bound up with notions of simplicity. According to unification accounts of explanation, a theory is explanatory if it shows how different phenomena are related to each other under certain systematizing theoretical principles, and a theory is held to have greater explanatory power than its rivals if it systematizes more phenomena. For Michael Friedman (1974), for instance, explanatory power is a function of the number of independent phenomena that we need to accept as ultimate: the smaller the number of independent phenomena that are regarded as ultimate by the theory, the more explanatory is the theory. Similarly, for Philip Kitcher (1989), explanatory power is increased the smaller the number of patterns of argument, or “problem-solving schemas”, that are needed to deliver the facts about the world that we accept. Thus, on such accounts, explanatory power is seen as a structural relationship between the sparseness of an explanation—the fewness of hypotheses or argument patterns—and the plenitude of facts that are explained. There have been various attempts to explicate notions of simplicity in terms of these sorts of features. A standard type of argument that is then used is that we want our theories not only to be true, but also explanatory. If truth were our only goal, there would be no reason to prefer a genuine scientific theory to a collection of random factual statements that all happen to be true. Hence, explanation is an ultimate, rather than a purely instrumental goal of scientific inquiry. Thus, we can justify our preferences for simpler theories once we recognize that there is a fundamental link between simplicity and explanatoriness and that explanation is a key goal of scientific inquiry, alongside truth.

There are some well-known objections to unification theories of explanation, though most of them concern the claim that unification is all there is to explanation—a claim on which the current proposal does not depend. However, even if we accept a unification theory of explanation and accept that explanation is an ultimate goal of scientific inquiry, it can be objected that the choice between a simple theory and a more complex rival is not normally a choice between a theory that is genuinely explanatory, in this sense, and a mere factual report. The complex theory can normally be seen as unifying different phenomena under systematizing principles, at least to some degree. Hence, the justificatory question here is not about why we should prefer theories that explain the data to theories that do not, but why we should prefer theories that have greater explanatory power in the senses just described to theories that are comparatively less explanatory. It is certainly a coherent possibility that the truth may turn out to be relatively disunified and unsystematic. Given this, it seems appropriate to ask why we are justified in choosing theories because they are more unifying. Just saying that explanation is an ultimate goal of scientific inquiry does not seem to be enough.

iii. Predictive Accuracy

In the last few decades, the treatment of simplicity as an explicit part of statistical methodology has become increasingly sophisticated. A consequence of this is that some philosophers of science have started looking to the statistics literature for illumination on how to think about the philosophical problems surrounding simplicity. According to Malcolm Forster and Elliott Sober (Forster and Sober, 1994; Forster, 2001; Sober, 2007), the work of the statistician, Hirotugu Akaike (1973), provides a precise theoretical framework for understanding the justification for the role of simplicity in curve-fitting and model selection.

Standard approaches to curve-fitting effect a trade-off between fit to a sample of data and the simplicity of the kind of mathematical relationship that is posited to hold between the variables—that is, the simplicity of the postulated model for the underlying relationship, typically measured in terms of the number of adjustable parameters it contains. This often means, for instance, that a linear hypothesis that fits a sample of data less well may be chosen over a parabolic hypothesis that fits the data better. According to Forster and Sober, Akaike developed an explanation for why it is rational to favor simpler models, under specific circumstances. The proposal builds on the practical wisdom that when there is a particular amount of error or noise in the data sample, more complex models have a greater propensity to “over-fit” to this spurious data in the sample and thus lead to less accurate predictions of extra-sample (for instance, future) data, particularly when dealing with small sample sizes. (Gauch [2003, 2006] calls this “Ockham’s hill”: to the left of the peak of the hill, increasing the complexity of a model improves its accuracy with respect to extra-sample data; after the peak, increasing complexity actually diminishes predictive accuracy. There is therefore an optimal trade-off at the peak of Ockham’s hill between simplicity and fit to the data sample when it comes to facilitating accurate prediction). According to Forster and Sober, what Akaike did was prove a theorem, which shows that, given standard statistical assumptions, we can estimate the degree to which constraining model complexity when fitting a curve to a sample of data will lead to more accurate predictions of extra-sample data. Following Forster and Sober’s presentation (1994, pp. 9-10), Akaike’s theorem can be stated as follows:

  • Estimated[A(M)] = (1/N)[log-likelihood(L(M)) – k],

where A(M) is the predictive accuracy of the model, M, with respect to extra-sample data, N is the number of data points in the sample, log-likelihood is a measure of goodness of fit to the sample (the higher the log-likelihood score the closer the fit to the data), L(M) is the best fitting member of M, and k is the number of adjustable parameters that M contains. Akaike’s theorem is claimed to specify an unbiased estimator of predictive accuracy, which means that the distribution of estimates of A is centered around the true value of A (for proofs and further details on the assumptions behind Akaike’s theorem, see Sakamoto and others, 1986). This gives rise to a model selection procedure, Akaike’s Information Criterion (AIC), which says that we should choose the model that has the highest estimated predictive accuracy, given the data at hand. In practice, AIC implies that when the best-fitting parabola fits the data sample better than the best-fitting straight line, but not so much better that this outweighs its greater complexity (k), the straight line should be used for making predictions. Importantly, the penalty imposed on complexity has less influence on model selection the larger the sample of data, meaning that simplicity matters more for predictive accuracy when dealing with smaller samples.
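
The following rough sketch (Python, using numpy) shows the estimator at work on a comparison between a linear and a parabolic model; the data, the noise level, and the assumption of Gaussian error with known unit variance are all ours, made so that the log-likelihood reduces to a residual-sum-of-squares term:

    import numpy as np

    rng = np.random.default_rng(1)
    x = np.linspace(0.0, 10.0, 12)
    y = 2.0 * x + 1.0 + rng.normal(0.0, 1.0, x.size)  # a linear process plus noise

    def estimated_accuracy(deg):
        coeffs = np.polyfit(x, y, deg)        # best-fitting member of the model
        resid = y - np.polyval(coeffs, x)
        log_like = -0.5 * np.sum(resid ** 2)  # log-likelihood up to constants
        k = deg + 1                           # number of adjustable parameters
        return (log_like - k) / x.size        # (1/N)[log-likelihood(L(M)) - k]

    # the parabola fits the sample a little better, but its extra parameter
    # must buy enough extra fit to outweigh the complexity penalty
    print(estimated_accuracy(1), estimated_accuracy(2))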

Forster and Sober argue that Akaike’s theorem explains why simplicity has a quantifiable positive effect on predictive accuracy by combating the risk of over-fitting to noisy data. Hence, if one is interested in generating accurate predictions—for instance, of future data—one has a clear rationale for preferring simpler models. Forster and Sober are explicit that this proposal is only meant to apply to scientific contexts that can be understood from within a model selection framework, where predictive accuracy is the central goal of inquiry and there is a certain amount of error or noise in the data. Hence, they do not view Akaike’s work as offering a complete solution to the problem of justifying preferences for simpler theories. However, they have argued that a very significant number of scientific inference problems can be understood from an Akaikian perspective.

Several objections have been raised against Forster and Sober’s philosophical use of Akaike’s work. One objection is that the measure of simplicity employed by AIC is not language invariant, since the number of adjustable parameters a model contains depends on how the model is described. However, Forster and Sober argue that though, for practical purposes, the quantity, k, is normally spelt out in terms of number of adjustable parameters, it is in fact more accurately explicated in terms of the notion of the dimension of a family of functions, which is language invariant. Another objection is that AIC is not statistically consistent. Forster and Sober reply that this charge rests on a confusion over what AIC is meant to estimate: for example, erroneously assuming that AIC is meant to be an estimator of the true value of k (the size of the simplest model that contains the true hypothesis), rather than an estimator of the predictive accuracy of a particular model at hand. Another worry is that over-fitting considerations imply that an idealized false model will often make more accurate predictions than a more realistic model, so the justification is merely instrumentalist and cannot warrant the use of simplicity as a criterion for hypothesis acceptance where hypotheses are construed realistically, rather than just as predictive tools. For their part, Forster and Sober are quite happy to accept this instrumentalist construal of the role of simplicity in curve-fitting and model selection: in this context, simplicity is not a guide to the truth, but to predictive accuracy. Finally, there are a variety of objections concerning the nature and validity of the assumptions behind Akaike’s theorem and whether AIC is applicable to some important classes of model selection problems (for discussion, see Kieseppä, 1997; Forster, 1999, 2001; Howson and Urbach, 2006; Dowe and others, 2007; Sober, 2007; Kelly, 2010).

iv. Truth-Finding Efficiency

An important recent proposal about how to justify preferences for simpler theories has come from work in the interdisciplinary field known as formal learning theory (Schulte, 1999; Kelly, 2004, 2007, 2010). It has been proposed that even if we do not know whether the world is simple or complex, inferential rules that are biased towards simple hypotheses can be shown to converge to the truth more efficiently than alternative inferential rules. According to this proposal, an inferential rule is said to converge to the truth efficiently, if, relative to other possible convergent inferential rules, it minimizes the maximum number of U-turns or “retractions” of opinion that might be required of the inquirer while using the rule to guide her decisions on what to believe given the data. Such procedures are said to converge to the truth more directly and in a more stable fashion, since they require fewer changes of mind along the way. The proposal is that even if we do not know whether the truth is simple or complex, scientific inference procedures that are biased towards simplicity can be shown a priori to be optimally efficient in this sense, converging to the truth in the most direct and stable way possible.

To illustrate the basic logic behind this proposal, consider the following example from Oliver Schulte (1999). Suppose that we are investigating the existence of a hypothetical particle, Ω. If Ω does exist, we will be able to detect it with an appropriate measurement device. However, as yet, it has not been detected. What attitude should we take towards the existence of Ω? Let us say that Ockham’s Razor suggests that we deny that Ω exists until it is detected (if ever). Alternatively, we could assert that Ω does exist until some finite number of attempts to detect it, say ten thousand, have proved unsuccessful, in which case we assert that Ω does not exist; or, we could withhold judgment until Ω is either detected or there have been ten thousand unsuccessful attempts to detect it. Since we are assuming that existent particles do not go undetected forever, abiding by any of these three inferential rules will enable us to converge to the truth in the limit, whether Ω exists or not. However, Schulte argues that Ockham’s Razor provides the most efficient route to the truth. This is because following Ockham’s Razor incurs a maximum of only one retraction of opinion: retracting an assertion of non-existence in favor of an assertion of existence, if Ω is detected. In contrast, the alternative inferential rules both incur a maximum of two retractions, since Ω could go undetected ten thousand times, but then be detected on the ten-thousand-and-first attempt. Hence, truth-finding efficiency requires that one adopt Ockham’s Razor and presume that Ω does not exist until it is detected.
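This retraction bookkeeping can be simulated directly. The sketch below is ours, not Schulte’s formalism: the names ockham, credulous, and cautious are invented labels for the three rules just described, the threshold is shrunk from ten thousand to three, and every change of announced opinion, including the move from suspension to assertion, counts as one U-turn.

```python
# Counting worst-case U-turns for three detection rules (illustrative
# sketch; rule names and the counting convention are ours).
N = 3  # stand-in for the "ten thousand" unsuccessful attempts

def ockham(failures, detected):
    # Deny existence until a detection occurs.
    return "exists" if detected else "does not exist"

def credulous(failures, detected):
    # Assert existence until N attempts have failed.
    if detected:
        return "exists"
    return "does not exist" if failures >= N else "exists"

def cautious(failures, detected):
    # Withhold judgment until a detection or N failures.
    if detected:
        return "exists"
    return "does not exist" if failures >= N else "suspend judgment"

def u_turns(rule, stream):
    """Feed a stream of attempt outcomes (True = detected) to a rule
    and count how often its announced opinion changes."""
    changes, previous, failures, detected = 0, None, 0, False
    for outcome in stream:
        detected = detected or outcome
        if not outcome:
            failures += 1
        conjecture = rule(failures, detected)
        if previous is not None and conjecture != previous:
            changes += 1
        previous = conjecture
    return changes

# Worst case for the alternatives: N failures, then a detection.
worst = [False] * N + [True]
for rule in (ockham, credulous, cautious):
    print(rule.__name__, u_turns(rule, worst))
# -> ockham 1, credulous 2, cautious 2
```

On this convention the worst case for the simplicity-biased rule (one U-turn) beats the worst case of both rivals (two each), which is precisely the minimax efficiency claim at issue.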

Kevin Kelly has further developed this U-turn argument in considerable detail. Kelly argues that, with suitable refinements, it can be extended to an extremely wide variety of real-world scientific inference problems. Importantly, Kelly has argued that, on this proposal, simplicity should not be seen as a purely pragmatic consideration in theory choice. While simplicity cannot be regarded as a direct indicator of truth, we do nonetheless have a reason to think that the practice of favoring simpler theories is a truth-conducive strategy, since it promotes speedy and stable attainment of true beliefs. Hence, simplicity should be regarded as a genuinely epistemic consideration in theory choice.

One worry about the truth-finding efficiency proposal concerns the general applicability of these results to scientific contexts in which simplicity may play a role. The U-turn argument for Ockham’s Razor described above seems to depend on the evidential asymmetry between establishing that Ω exists and establishing that Ω does not exist: a detection of Ω is sufficient to establish its existence, whereas repeated failures of detection are not sufficient to establish its non-existence. The argument may work where detection procedures are relatively clear-cut (for instance, where there are relatively unambiguous instrument readings that count as “detections”), but what about entities that are very difficult to detect directly, where mistakes can easily be made about existence as well as non-existence? Similarly, a current stumbling block is that the U-turn argument cannot be used as a justification for the employment of simplicity biases in statistical inference, where the hypotheses under consideration do not have deductive observational consequences. Kelly is, however, optimistic about extending the U-turn argument to statistical inference. Another objection concerns the nature of the justification that is being provided here. What the U-turn argument seems to show is that the strategy of favoring the simplest theory consistent with the data may help one to find the truth with fewer reversals along the way. It does not establish that simpler theories themselves should be regarded as in any way “better” than their more complex rivals. Hence, there are doubts about the extent to which this proposal can actually make sense of standard examples of simplicity preferences at work in the history and current practice of science, where the guiding assumption seems to be that simpler theories are not to be preferred merely for strategic reasons, but because they are better theories.

c. Deflationary Approaches

Various philosophers have sought to defend broadly deflationary accounts of simplicity. Such accounts depart from all of the justificatory accounts discussed so far by rejecting the idea that simplicity should in fact be regarded as a theoretical virtue and criterion for theory choice in its own right. Rather, according to deflationary accounts, when simplicity appears to be a driving factor in theory evaluation, something else is doing the real work.

Richard Boyd (1990), for instance, has argued that scientists’ simplicity judgments are typically best understood as just covert judgments of theoretical plausibility. When a scientist claims that one theory is “simpler” than another, this is often just another way of saying that the theory provides a more plausible account of the data. For Boyd, such covert judgments of theoretical plausibility are driven by the scientist’s background theories. Hence, it is the relevant background theories that do the real work in motivating the preference for the “simpler” theory, not the simplicity of the theory per se. John Norton (2003) has advocated a similar view in the context of his “material theory” of induction, according to which inductive inferences are licensed not by universal inductive rules or inference schemas, but rather by local factual assumptions about the domain of inquiry. Norton argues that the apparent use of simplicity in induction merely reflects material assumptions about the nature of the domain being investigated. For instance, when we try to fit curves to data we choose the variables and functions that we believe to be appropriate to the physical reality we are trying to get at. Hence, it is because of the facts that we believe to prevail in this domain that we prefer a “simple” linear function to a quadratic one, if such a curve fits the data sufficiently well. In a different domain, where we believe that different facts prevail, our decisions about which hypotheses are “simple” or “complex” are likely to be very different.

Elliott Sober (1988, 1994) has defended this sort of deflationary analysis of various appeals to simplicity and parsimony in evolutionary biology. For example, Sober argues that the common claim that group selection hypotheses are “less parsimonious” than individual selection hypotheses, and hence to be taken less seriously as explanations for biological adaptations, rests on substantive assumptions about the comparative rarity of the conditions required for group selection to occur. Hence, the appeal to Ockham’s Razor in this context is just a covert appeal to local background knowledge. Other attempts to offer deflationary analyses of particular appeals to simplicity in science include Plutynski (2005), who focuses on the Fisher-Wright debate in evolutionary biology, and Fitzpatrick (2009), who focuses on appeals to simplicity in debates over the cognitive capacities of non-human primates.

If such deflationary analyses of the putative role of simplicity in particular scientific contexts turn out to be plausible, then problems concerning how to measure simplicity and how to offer a general justification for preferring simpler theories can be avoided, since simplicity per se can be shown to do no substantive work in the relevant inferences. However, many philosophers are skeptical that such deflationary analyses are possible for many of the contexts where simplicity considerations have been thought to play an important role. Kelly (2010), for example, has argued that simplicity typically comes into play when our background knowledge underdetermines theory choice. Sober himself seems to advocate a mixed view: some appeals to simplicity in science are best understood in deflationary terms, others are better understood in terms of Akaikian model selection theory.

5. Conclusion

The putative role of considerations of simplicity in the history and current practice of science gives rise to a number of philosophical problems, including the problem of precisely defining and measuring theoretical simplicity, and the problem of justifying preferences for simpler theories. As this survey of the literature on simplicity in the philosophy of science demonstrates, these problems have turned out to be surprisingly resistant to resolution, and there remains a live debate amongst philosophers of science about how to deal with them. On the other hand, there is no disputing the fact that practicing scientists continue to find it useful to appeal to various notions of simplicity in their work. Thus, in many ways, the debate over simplicity resembles other long-running debates in the philosophy of science, such as that over the justification for induction (which, it turns out, is closely related to the problem of justifying preferences for simpler theories). There is arguably more skepticism within the scientific community about the legitimacy of choosing between rival theories on grounds of simplicity than there is about the legitimacy of inductive inference, the latter being a complete non-issue for practicing scientists. Even so, as with induction, very many scientists continue to employ practices and methods that utilize notions of simplicity to great scientific effect, assuming that appropriate solutions to the philosophical problems these practices give rise to do in fact exist, even though philosophers have so far failed to articulate them. However, as this survey has also shown, statisticians, information and learning theorists, and other scientists have been making increasingly important contributions to the debate over the philosophical underpinnings of these practices.

6. References and Further Reading

  • Ackerman, R. 1961. Inductive simplicity. Philosophy of Science, 28, 162-171.
    • Argues against the claim that simplicity considerations play a significant role in inductive inference. Critiques measures of simplicity proposed by Jeffreys, Kemeny, and Popper.
  • Akaike, H. 1973. Information theory and the extension of the maximum likelihood principle. In B. Petrov and F. Csaki (eds.), Second International Symposium on Information Theory. Budapest: Akademiai Kiado.
    • Laid the foundations for model selection theory. Proves a theorem suggesting that the simplicity of a model is relevant to estimating its future predictive accuracy. Highly technical.
  • Baker, A. 2003. Quantitative parsimony and explanatory power. British Journal for the Philosophy of Science, 54, 245-259.
    • Builds on Nolan (1997), argues that quantitative parsimony is linked with explanatory power.
  • Baker, A. 2007. Occam’s Razor in science: a case study from biogeography. Biology and Philosophy, 22, 193-215.
    • Argues for a “naturalistic” justification of Ockham’s Razor and that preferences for ontological parsimony played a significant role in the late 19th century debate in biogeography between dispersalist and extensionist theories.
  • Barnes, E.C. 2000. Ockham’s razor and the anti-superfluity principle. Erkenntnis, 53, 353-374.
    • Draws a useful distinction between two different interpretations of Ockham’s Razor: the anti-superfluity principle and the anti-quantity principle. Explicates an evidential justification for anti-superfluity principle.
  • Boyd, R. 1990. Observations, explanatory power, and simplicity: towards a non-Humean account. In R. Boyd, P. Gasper and J.D. Trout (eds.), The Philosophy of Science. Cambridge, MA: MIT Press.
    • Argues that appeals to simplicity in theory evaluation are typically best understood as covert judgments of theoretical plausibility.
  • Bunge, M. 1961. The weight of simplicity in the construction and assaying of scientific theories. Philosophy of Science, 28, 162-171.
    • Takes a skeptical view about the importance and justifiability of a simplicity criterion in theory evaluation.
  • Carlson, E. 1966. The Gene: A Critical History. Philadelphia: Saunders.
    • Argues that simplicity considerations played a significant role in several important debates in the history of genetics.
  • Carnap, R. 1950. Logical Foundations of Probability. Chicago: University of Chicago Press.
  • Chater, N. 1999. The search for simplicity: a fundamental cognitive principle. The Quarterly Journal of Experimental Psychology, 52A, 273-302.
    • Argues that simplicity plays a fundamental role in human reasoning, with simplicity to be defined in terms of Kolmogorov complexity.
  • Cohen, I.B. 1985. Revolutions in Science. Cambridge, MA: Harvard University Press.
  • Cohen, I.B. 1999. A guide to Newton’s Principia. In I. Newton, The Principia: Mathematical Principles of Natural Philosophy; A New Translation by I. Bernard Cohen and Anne Whitman. Berkeley: University of California Press.
  • Crick, F. 1988. What Mad Pursuit: a Personal View of Scientific Discovery. New York: Basic Books.
    • Argues that the application of Ockham’s Razor to biology is inadvisable.
  • Dowe, D., Gardner, S., and Oppy, G. 2007. Bayes not bust! Why simplicity is no problem for Bayesians. British Journal for the Philosophy of Science, 58, 709-754.
    • Contra Forster and Sober (1994), argues that Bayesians can make sense of the role of simplicity in curve-fitting.
  • Duhem, P. 1954. The Aim and Structure of Physical Theory. Princeton: Princeton University Press.
  • Einstein, A. 1954. Ideas and Opinions. New York: Crown.
    • Einstein’s views about the role of simplicity in physics.
  • Fitzpatrick, S. 2009. The primate mindreading controversy: a case study in simplicity and methodology in animal psychology. In R. Lurz (ed.), The Philosophy of Animal Minds. New York: Cambridge University Press.
    • Advocates a deflationary analysis of appeals to simplicity in debates over the cognitive capacities of non-human primates.
  • Forster, M. 1995. Bayes and bust: simplicity as a problem for a probabilist’s approach to confirmation. British Journal for the Philosophy of Science, 46, 399-424.
    • Argues that the Bayesian approach to scientific reasoning is inadequate because it cannot make sense of the role of simplicity in theory evaluation.
  • Forster, M. 1999. Model selection in science: the problem of language variance. British Journal for the Philosophy of Science, 50, 83-102.
    • Responds to criticisms of Forster and Sober (1994). Argues that AIC relies on a language invariant measure of simplicity.
  • Forster, M. 2001. The new science of simplicity. In A. Zellner, H. Keuzenkamp and M. McAleer (eds.), Simplicity, Inference and Modelling. Cambridge: Cambridge University Press.
    • Accessible introduction to model selection theory. Describes how different procedures, including AIC, BIC, and MDL, trade-off simplicity and fit to the data.
  • Forster, M. and Sober, E. 1994. How to tell when simpler, more unified, or less ad hoc theories will provide more accurate predictions. British Journal for the Philosophy of Science, 45, 1-35.
    • Explication of AIC statistics and its relevance to the philosophical problem of justifying preferences for simpler theories. Argues against Bayesian approaches to simplicity. Technical in places.
  • Foster, M. and Martin, M. 1966. Probability, Confirmation, and Simplicity: Readings in the Philosophy of Inductive Logic. New York: The Odyssey Press.
    • Anthology of papers discussing the role of simplicity in induction. Contains important papers by Ackermann, Barker, Bunge, Goodman, Kemeny, and Quine.
  • Friedman, M. 1974. Explanation and scientific understanding. Journal of Philosophy, LXXI, 1-19.
    • Defends a unification account of explanation, connects simplicity with explanatoriness.
  • Galilei, G. 1962. Dialogues concerning the Two Chief World Systems. Berkeley: University of California Press.
    • Classic defense of Copernicanism with significant emphasis placed on the greater simplicity and harmony of the Copernican system. Asserts that nature does nothing in vain.
  • Gauch, H. 2003. Scientific Method in Practice. Cambridge: Cambridge University Press.
    • Wide-ranging discussion of the scientific method written by a scientist for scientists. Contains a chapter on the importance of parsimony in science.
  • Gauch, H. 2006. Winning the accuracy game. American Scientist, 94, March-April 2006, 134-141.
    • Useful informal presentation of the concept of Ockham’s hill and its importance to scientific research in a number of fields.
  • Gingerich, O. 1993. The Eye of Heaven: Ptolemy, Copernicus, Kepler. New York: American Institute of Physics.
  • Glymour, C. 1980. Theory and Evidence. Princeton: Princeton University Press.
    • An important critique of Bayesian attempts to make sense of the role of simplicity in science. Defends a “boot-strapping” analysis of the simplicity arguments for Copernicanism and Newton’s argument for universal gravitation.
  • Goodman, N. 1943. On the simplicity of ideas. Journal of Symbolic Logic, 8, 107-121.
  • Goodman, N. 1955. Axiomatic measurement of simplicity. Journal of Philosophy, 52, 709-722.
  • Goodman, N. 1958. The test of simplicity. Science, 128, October 31st 1958, 1064-1069.
    • Reasonably accessible introduction to Goodman’s attempts to formulate a measure of logical simplicity.
  • Goodman, N. 1959. Recent developments in the theory of simplicity. Philosophy and Phenomenological Research, 19, 429-446.
    • Response to criticisms of Goodman (1955).
  • Goodman, N. 1961. Safety, strength, simplicity. Philosophy of Science, 28, 150-151.
    • Argues that simplicity cannot be equated with testability, empirical content, or paucity of assumption.
  • Goodman, N. 1983. Fact, Fiction and Forecast (4th edition). Cambridge, MA: Harvard University Press.
  • Harman, G. 1999. Simplicity as a pragmatic criterion for deciding what hypotheses to take seriously. In G. Harman, Reasoning, Meaning and Mind. Oxford: Oxford University Press.
    • Defends the claim that simplicity is a fundamental component of inductive inference and that this role has a pragmatic justification.
  • Harman, G. and Kulkarni, S. 2007. Reliable Reasoning: Induction and Statistical Learning Theory. Cambridge, MA: MIT Press.
    • Accessible introduction to statistical learning theory and VC dimension.
  • Harper, W. 2002. Newton’s argument for universal gravitation. In I.B. Cohen and G.E. Smith (eds.), The Cambridge Companion to Newton. Cambridge: Cambridge University Press.
  • Hesse, M. 1967. Simplicity. In P. Edwards (ed.), The Encyclopaedia of Philosophy, vol. 7. New York: Macmillan.
    • Focuses on attempts by Jeffreys, Popper, Kemeny, and Goodman to formulate measures of simplicity.
  • Hesse, M. 1974. The Structure of Scientific Inference. London: Macmillan.
    • Defends the view that simplicity is a determinant of prior probability. Useful discussion of the role of simplicity in Einstein’s work.
  • Hoffman, R., Minkin, V., and Carpenter, B. 1997. Ockham’s Razor and chemistry. Hyle, 3, 3-28.
    • Discussion by three chemists of the benefits and pitfalls of applying Ockham’s Razor in chemical research.
  • Holton, G. 1974. Thematic Origins of Modern Science: Kepler to Einstein. Cambridge, MA: Harvard University Press.
    • Discusses the role of aesthetic considerations, including simplicity, in the history of science.
  • Howson, C. and Urbach, P. 2006. Scientific Reasoning: The Bayesian Approach (Third Edition). Chicago: Open Court.
    • Contains a useful survey of Bayesian attempts to make sense of the role of simplicity in theory evaluation. Technical in places.
  • Jeffreys, H. 1957. Scientific Inference (2nd edition). Cambridge: Cambridge University Press.
    • Defends the “simplicity postulate” that simpler theories have higher prior probability.
  • Jeffreys, H. 1961. Theory of Probability. Oxford: Clarendon Press.
    • Outline and defense of the Bayesian approach to scientific inference. Discusses the role of simplicity in the determination of priors and likelihoods.
  • Kelly, K. 2004. Justification as truth-finding efficiency: how Ockham’s Razor works. Minds and Machines, 14, 485-505.
    • Argues that Ockham’s Razor is justified by considerations of truth-finding efficiency. Critiques Bayesian, Akaikian, and other traditional attempts to justify simplicity preferences. Technical in places.
  • Kelly, K. 2007. How simplicity helps you find the truth without pointing at it. In M. Friend, N. Goethe, and V.Harizanov (eds.), Induction, Algorithmic Learning Theory, and Philosophy. Dordrecht: Springer.
    • Refinement and development of the argument found in Kelly (2004) and Schulte (1999). Technical.
  • Kelly, K. 2010. Simplicity, truth and probability. In P. Bandyopadhyay and M. Forster (eds.), Handbook of the Philosophy of Statistics. Dordrecht: Elsevier.
    • Expands and develops the argument found in Kelly (2007). Detailed critique of Bayesian accounts of simplicity. Technical.
  • Kelly, K. and Glymour, C. 2004. Why probability does not capture the logic of scientific justification. In C. Hitchcock (ed.), Contemporary Debates in the Philosophy of Science. Oxford: Blackwell.
    • Argues that Bayesians can’t make sense of Ockham’s Razor.
  • Kemeny, J. 1955. Two measures of complexity. Journal of Philosophy, 52, 722-733.
    • Develops some of Goodman’s ideas about how to measure the logical simplicity of predicates and systems of predicates. Proposes a measure of simplicity similar to Popper’s (1959) falsifiability measure.
  • Kieseppä, I. A. 1997. Akaike Information Criterion, curve-fitting, and the philosophical problem of simplicity. British Journal for the Philosophy of Science, 48, 21-48.
    • Critique of Forster and Sober (1994). Argues that Akaike’s theorem has little relevance to traditional philosophical problems surrounding simplicity. Highly technical.
  • Kitcher, P. 1989. Explanatory unification and the causal structure of the world. In P. Kitcher and W. Salmon, Minnesota Studies in the Philosophy of Science, vol 13: Scientific Explanation, Minneapolis: University of Minnesota Press.
    • Defends a unification theory of explanation. Argues that simplicity contributes to explanatory power.
  • Kuhn, T. 1957. The Copernican Revolution. Cambridge, MA: Harvard University Press.
    • Influential discussion of the role of simplicity in the arguments for Copernicanism.
  • Kuhn, T. 1962. The Structure of Scientific Revolutions. Chicago: University of Chicago Press.
  • Kuipers, T. 2002. Beauty: a road to truth. Synthese, 131, 291-328.
    • Attempts to show how aesthetic considerations might be indicative of truth.
  • Kyburg, H. 1961. A modest proposal concerning simplicity. Philosophical Review, 70, 390-395.
    • Important critique of Goodman (1955). Argues that simplicity should be identified with the number of quantifiers in a theory.
  • Lakatos, I. and Zahar, E. 1978. Why did Copernicus’s research programme supersede Ptolemy’s? In J. Worrall and G. Currie (eds.), The Methodology of Scientific Research Programmes: Philosophical Papers of Imre Lakatos, Volume 1. Cambridge: Cambridge University Press.
    • Argues that simplicity did not really play a significant role in the Copernican Revolution.
  • Lewis, D. 1973. Counterfactuals. Oxford: Basil Blackwell.
    • Argues that quantitative parsimony is less important than qualitative parsimony in scientific and philosophical theorizing.
  • Li, M. and Vitányi, P. 1997. An Introduction to Kolmogorov Complexity and its Applications (2nd edition). New York: Springer.
    • Detailed elaboration of Kolmogorov complexity as a measure of simplicity. Highly technical.
  • Lipton, P. 2004. Inference to the Best Explanation (2nd edition). Oxford: Basil Blackwell.
    • Account of inference to the best explanation as inference to the “loveliest” explanation. Defends the claim that simplicity contributes to explanatory loveliness.
  • Lombrozo, T. 2007. Simplicity and probability in causal explanation. Cognitive Psychology, 55, 232–257.
    • Argues that simplicity is used as a guide to assessing the probability of causal explanations.
  • Lu, H., Yuille, A., Liljeholm, M., Cheng, P. W., and Holyoak, K. J. 2006. Modeling causal learning using Bayesian generic priors on generative and preventive powers. In R. Sun and N. Miyake (eds.), Proceedings of the 28th annual conference of the cognitive science society, 519–524. Mahwah, NJ: Erlbaum.
    • Argues that simplicity plays a significant role in causal learning.
  • MacKay, D. 1992. Bayesian interpolation. Neural Computation, 4, 415-447.
    • First presentation of the concept of Ockham’s Hill.
  • Martens, R. 2009. Harmony and simplicity: aesthetic virtues and the rise of testability. Studies in History and Philosophy of Science, 40, 258-266.
    • Discussion of the Copernican simplicity arguments and recent attempts to reconstruct the justification for them.
  • McAleer, M. 2001. Simplicity: views of some Nobel laureates in economic science. In A. Zellner, H. Keuzenkamp and M. McAleer (eds.), Simplicity, Inference and Modelling. Cambridge: Cambridge University Press.
    • Interesting survey of the views of famous economists on the place of simplicity considerations in their work.
  • McAllister, J. W. 1996. Beauty and Revolution in Science. Ithaca: Cornell University Press.
    • Proposes that scientists’ simplicity preferences are the product of an aesthetic induction.
  • Mill, J.S. 1867. An Examination of Sir William Hamilton’s Philosophy. London: Walter Scott.
  • Myrvold, W. 2003. A Bayesian account of the virtue of unification. Philosophy of Science, 70, 399-423.
  • Newton, I. 1999. The Principia: Mathematical Principles of Natural Philosophy; A New Translation by I. Bernard Cohen and Anne Whitman. Berkeley: University of California Press.
    • Contains Newton’s “rules for the study of natural philosophy”, which includes a version of Ockham’s Razor, defended in terms of the simplicity of nature. These rules play an explicit role in Newton’s argument for universal gravitation.
  • Nolan, D. 1997. Quantitative Parsimony. British Journal for the Philosophy of Science, 48, 329-343.
    • Contra Lewis (1973), argues that quantitative parsimony has been important in the history of science.
  • Norton, J. 2000. ‘Nature is the realization of the simplest conceivable mathematical ideas’: Einstein and the canon of mathematical simplicity. Studies in History and Philosophy of Modern Physics, 31, 135-170.
    • Discusses the evolution of Einstein’s thinking about the role of mathematical simplicity in physical theorizing.
  • Norton, J. 2003. A material theory of induction. Philosophy of Science, 70, 647-670.
    • Defends a “material” theory of induction. Argues that appeals to simplicity in induction reflect factual assumptions about the domain of inquiry.
  • Oreskes, N., Shrader-Frechette, K., Belitz, K. 1994. Verification, validation, and confirmation of numerical models in the earth sciences. Science, 263, 641-646.
  • Palter, R. 1970. An approach to the history of early astronomy. Studies in History and Philosophy of Science, 1, 93-133.
  • Pais, A. 1982. Subtle Is the Lord: The science and life of Albert Einstein. Oxford: Oxford University Press.
  • Peirce, C.S. 1931. Collected Papers of Charles Sanders Peirce, vol 6. C. Hartshorne, P. Weiss, and A. Burks (eds.). Cambridge, MA: Harvard University Press.
  • Plutynski, A. 2005. Parsimony and the Fisher-Wright debate. Biology and Philosophy, 20, 697-713.
    • Advocates a deflationary analysis of appeals to parsimony in debates between Wrightian and neo-Fisherian models of natural selection.
  • Popper, K. 1959. The Logic of Scientific Discovery. London: Hutchinson.
    • Argues that simplicity = empirical content = falsifiability.
  • Priest, G. 1976. Gruesome simplicity. Philosophy of Science, 43, 432-437.
    • Shows that standard measures of simplicity in curve-fitting are language variant.
  • Raftery, A., Madigan, D., and Hoeting, J. 1997. Bayesian model averaging for linear regression models. Journal of the American Statistical Association, 92, 179-191.
  • Reichenbach, H. 1949. On the justification of induction. In H. Feigl and W. Sellars (eds.), Readings in Philosophical Analysis. New York: Appleton-Century-Crofts.
  • Rosencrantz, R. 1983. Why Glymour is a Bayesian. In J. Earman (ed.), Testing Scientific Theories. Minneapolis: University of Minnesota Press.
    • Responds to Glymour (1980). Argues that simpler theories have higher likelihoods, using Copernican vs. Ptolemaic astronomy as an example.
  • Rothwell, G. 2006. Notes for the occasional major case manager. FBI Law Enforcement Bulletin, 75, 20-24.
    • Emphasizes the importance of Ockham’s Razor in criminal investigation.
  • Sakamoto, Y., Ishiguro, M., and Kitagawa, G. 1986. Akaike Information Criterion Statistics. New York: Springer.
  • Schaffner, K. 1974. Einstein versus Lorentz: research programmes and the logic of comparative theory evaluation. British Journal for the Philosophy of Science, 25, 45-78.
    • Argues that simplicity played a significant role in the development and early acceptance of special relativity.
  • Schulte, O. 1999. Means-end epistemology. British Journal for the Philosophy of Science, 50, 1-31.
    • First statement of the claim that Ockham’s Razor can be justified in terms of truth-finding efficiency.
  • Simon, H. 1962. The architecture of complexity. Proceedings of the American Philosophical Society, 106, 467-482.
    • Important discussion by a Nobel laureate of features common to complex systems in nature.
  • Sober, E. 1975. Simplicity. Oxford: Oxford University Press.
    • Argues that simplicity can be defined in terms of question-relative informativeness. Technical in places.
  • Sober, E. 1981. The principle of parsimony. British Journal for the Philosophy of Science, 32, 145-156.
    • Distinguishes between “agnostic” and “atheistic” versions of Ockham’s Razor. Argues that the atheistic razor has an inductive justification.
  • Sober, E. 1988. Reconstructing the Past: Parsimony, Evolution and Inference. Cambridge, MA: MIT Press.
    • Defends a deflationary account of simplicity in the context of the use of parsimony methods in evolutionary biology.
  • Sober, E. 1994. Let’s razor Ockham’s Razor. In E. Sober, From a Biological Point of View, Cambridge: Cambridge University Press.
    • Argues that the use of Ockham’s Razor is grounded in local background assumptions.
  • Sober, E. 2001a. What is the problem of simplicity? In H. Keuzenkamp, M. McAleer, and A. Zellner (eds.), Simplicity, Inference and Modelling. Cambridge: Cambridge University Press.
  • Sober, E. 2001b. Simplicity. In W.H. Newton-Smith (ed.), A Companion to the Philosophy of Science, Oxford: Blackwell.
  • Sober, E. 2007. Evidence and Evolution. New York: Cambridge University Press.
  • Solomonoff, R.J. 1964. A formal theory of inductive inference, part 1 and part 2. Information and Control, 7, 1-22, 224-254.
  • Suppes, P. 1956. Nelson Goodman on the concept of logical simplicity. Philosophy of Science, 23, 153-159.
  • Swinburne, R. 2001. Epistemic Justification. Oxford: Oxford University Press.
    • Argues that the principle that simpler theories are more probably true is a fundamental a priori principle.
  • Thagard, P. 1988. Computational Philosophy of Science. Cambridge, MA: MIT Press.
    • Simplicity is a determinant of the goodness of an explanation and can be measured in terms of the paucity of auxiliary assumptions relative to the number of facts explained.
  • Thorburn, W. 1918. The myth of Occam’s Razor. Mind, 23, 345-353.
    • Argues that William of Ockham would not have advocated many of the principles that have been attributed to him.
  • van Fraassen, B. 1989. Laws and Symmetry. Oxford: Oxford University Press.
  • Wallace, C. S. and Dowe, D. L. 1999. Minimum Message Length and Kolmogorov Complexity. Computer Journal, 42(4), 270–83.
  • Walsh, D. 1979. Occam’s Razor: A Principle of Intellectual Elegance. American Philosophical Quarterly, 16, 241-244.
  • Weinberg, S. 1993. Dreams of a Final Theory. New York: Vintage.
    • Argues that physicists demand simplicity in physical principles before they can be taken seriously.
  • White, R. 2005. Why favour simplicity? Analysis, 65, 205-210.
    • Attempts to justify preferences for simpler theories in virtue of such theories having higher likelihoods.
  • Zellner, A., Keuzenkamp, H., and McAleer, M. (eds.) 2001. Simplicity, Inference and Modelling. Cambridge: Cambridge University Press.
    • Collection of papers by statisticians, philosophers, and economists on the role of simplicity in scientific inference and modelling.

Author Information

Simon Fitzpatrick
Email: sfitzpatrick@jcu.edu
John Carroll University
U. S. A.