Theories of Explanation
Within the philosophy of science there have been competing ideas about what an explanation is. Historically, explanation has been associated with causation: to explain an event or phenomenon is to identify its cause. But with the growth and development of philosophy of science in the 20th century, the concept of explanation began to receive more rigorous and specific analysis. Of particular concern were theories that posited the existence of unobservable entities and processes (atoms, fields, genes, and so forth). These posed a dilemma: on the one hand, the staunch empiricist had to reject unobservable entities as a matter of principle; on the other, theories that appealed to unobservable entities were clearly producing revolutionary results. Thus philosophers of science sought some way to characterize the obvious value of these theories without abandoning the empiricist principles deemed central to scientific rationality.
A theory of explanation might treat explanations in either a realist or an epistemic (that is, anti-realist) sense. A realist interpretation of explanation holds that the entities or processes an explanation posits actually exist–the explanation is a literal description of external reality. An epistemic interpretation, on the contrary, holds that such entities or processes do not necessarily exist in any literal sense but are simply useful for organizing human experience and the results of scientific experiments–the point of an explanation is only to facilitate the construction of a consistent empirical model, not to furnish a literal description of reality. Thus Hempel‘s epistemic theory of explanation deals only in logical form, making no mention of any actual physical connection between the phenomenon to be explained and the facts purported to explain it, whereas Salmon’s realist account emphasizes that real processes and entities are conceptually necessary for understanding exactly why an explanation works.
In contrast to these theoretical and primarily scientific approaches, some philosophers have favored a theory of explanation grounded in the way people actually perform explanation. Ordinary Language Philosophy stresses the communicative or linguistic aspect of an explanation, its utility in answering questions and furthering understanding between two individuals, while an approach based in cognitive science maintains that explaining is a purely cognitive activity and that an explanation is a certain kind of mental representation that results from or aids in this activity. It is a matter of contention within cognitive science whether explanation is properly conceived as the process and results of belief revision or as the activation of patterns within a neural network.
This article focuses on the way thinking about explanation within the philosophy of science has changed since 1950. It begins by discussing the philosophical concerns that gave rise to the first theory of explanation, the deductive-nomological model. Discussions of this theory and standard criticisms of it are followed by an examination of attempts to amend, extend or replace this first model. There is particular emphasis on the most general aspects of explanation and on the extent to which later developments reflect the priorities and presuppositions of different philosophical traditions. There are many important aspects of explanation not covered, most notably the relation between the different types of explanation such as teleological, functional, reductive, psychological, and historical explanation — that are employed in various branches of human inquiry.
Table of Contents
- Hempel’s Theory of Explanation
- Standard Criticisms of Hempel’s Theory of Explanation
- Contemporary Developments in the Theory of Explanation
- The Current State of the Theory of Explanation
- References and Further Reading
Most people, philosophers included, think of explanation in terms of causation. Very roughly, to explain an event or phenomenon is to identify its cause. The nature of causation is one of the perennial problems of philosophy, so on the basis of this connection one might reasonably attempt to trace thinking about the nature of explanation to antiquity. (Among the ancients, for example, Aristotle’s theory of causation is plausibly regarded as a theory of explanation.) But the idea that the concept of explanation warrants independent analysis really did not begin to take hold until the 20th century. Generally, this change occurred as the result of the linguistic turn in philosophy. More specifically, it was the result of philosophers of science attempting to understand the nature of modern theoretical science.
Of particular concern were theories that posited the existence of unobservable entities and processes (for example, atoms, fields, genes, etc.). These posed a dilemma. On the one hand, the staunch empiricist had to reject unobservable entities as a matter of principle; on the other hand, theories that appealed to unobservables were clearly producing revolutionary results. A way was needed to characterize the obvious value of these theories without abandoning the empiricist principles deemed central to scientific rationality.
In this context it became common to distinguish between the literal truth of a theory and its power to explain observable phenomena. Although the distinction between truth and explanatory power is important, it is susceptible to multiple interpretations, and this remains a source of confusion even today. The problem is this: In philosophy the terms “truth” and “explanation” have both realist and epistemic interpretations. On a realist interpretation the truth and explanatory power of a theory are matters of the correspondence of language with an external reality. A theory that is both true and explanatory gives us insight into the causal structure of the world. On an epistemic interpretation, however, these terms express only the power of a theory to order our experience. A true and explanatory theory orders our experience to a greater degree than a false non-explanatory one. Hence, someone who denies that scientific theories are explanatory in the realist sense of the term may or may not be denying that they are explanatory in the epistemic sense. Conversely, someone who asserts that scientific theories are explanatory in the epistemic sense may or may not be claiming that they are explanatory in the realist sense. The failure to distinguish these senses of “explanation” can and does foster disagreements that are purely semantic in nature.
One common way of employing the distinction between truth and explanation is to say that theories that refer to unobservable entities may explain the phenomena, but they are not literally true. A second way is to say that these theories are true, but they do not really explain the phenomena. Although these statements are superficially contradictory, they can both be made in support of the same basic view of the nature of scientific theories. This, it is now easy to see, is because the terms ‘truth’ and ‘explanation’ are being used differently in each statement. In the first, ‘explanation’ is being used epistemically and ‘truth’ realistically; in the second, ‘explanation’ is being used realistically and ‘truth’ epistemically. But both statements are saying roughly the same thing, namely, that a scientific theory may be accepted as having a certain epistemic value without necessarily accepting that the unobservable entities it refers to actually exist. (This view is known as anti-realism.) One early 20th century philosopher scientist, Pierre Duhem, expressed himself according to the latter interpretation when he claimed:
A physical theory is not an explanation. It is a system of mathematical propositions, deduced from a small number of principles, which aim to represent as simply, as completely, and as exactly as possible a set of experimental laws. ( 1962: p7)
Duhem claimed that:
To explain is to strip the reality of the appearances covering it like a veil, in order to see the bare reality itself. (op.cit.: p19)
Explanation was the task of metaphysics, not science. Science, according to Duhem, does not comprehend reality, but only gives order to appearance. However, the subsequent rise of analytic philosophy and, in particular, logical positivism made Duhem’s acceptance of classical metaphysics unpopular. The conviction grew that, far from being explanatory, metaphysics was meaningless insofar as it issued claims that had no implications for experience. By the time Carl Hempel (who, as a logical positivist, was still fundamentally an anti-realist about unobservable entities) articulated the first real theory of explanation (1948) the explanatory power of science could be stipulated.
To explain the phenomena in the world of our experience, to answer the question “Why?” rather than only the question “What?”, is one of the foremost objectives of all rational inquiry; and especially scientific research, in its various branches strives to go beyond a mere description of its subject matter by providing an explanation of the phenomena it investigates. (Hempel and Oppenheim 1948: p8)
For Hempel, answering the question “Why?” did not, as for Duhem, involve an appeal to a reality beyond all experience. Hempel employs the epistemic sense of explanation. For him the question “Why?” was an expression of the need to gain predictive control over our future experiences, and the value of a scientific theory was to be measured in terms of its capacity to produce this result.
According to Hempel, an explanation is:
…an argument to the effect that the phenomenon to be explained …was to be expected in virtue of certain explanatory facts. (1965 p. 336)
Hempel claimed that there are two types of explanation, what he called ‘deductive-nomological’ (DN) and ‘inductive-statistical’ (IS) respectively.” Both IS and DN arguments have the same structure. Their premises each contain statements of two types: (1) initial conditions C, and (2) law-like generalizations L. In each, the conclusion is the event E to be explained:
C1, C2, C3,…Cn
L1, L2, L3,…Ln
The only difference between the two is that the laws in a DN explanation are universal generalizations, whereas the laws in IS explanations have the form of statistical generalizations. An example of a DN explanation containing one initial condition and one law-like generalization is:
C. The infant’s cells have three copies of chromosome 21.
L. Any infant whose cells have three copies of chromosome 21 has Down’s Syndrome.
E. The infant has Down’s Syndrome.
An example of an IS explanation is:
C. The man’s brain was deprived of oxygen for five continuous minutes.
L. Almost anyone whose brain is deprived of oxygen for five continuous minutes will sustain brain damage.
E. The man has brain damage.
For Hempel, DN explanations were always to be preferred to IS explanations. There were two reasons for this.
First, the deductive relationship between premises and conclusion maximized the predictive value of the explanation. Hempel accepted IS arguments as explanatory just to the extent that they approximated DN explanations by conferring a high probability on the event to be explained.
Second, Hempel understood the concept of explanation as something that should be understood fundamentally in terms of logical form. True premises are, of course, essential to something being a good DN explanation, but to qualify as a DN explanation (what he sometimes called a potential DN explanation) an argument need only exhibit the deductive-nomological structure. (This requirement placed Hempel squarely within the logical positivist tradition, which was committed to analyzing all of the epistemically significant concepts of science in logical terms.) There is, however, no corresponding concept of a potential IS explanation. Unlike DN explanations, the inductive character of IS explanations means that the relation between premises and conclusion can always be undermined by the addition of new information. (For example, the probability of brain damage, given that a man is deprived of oxygen for 7 minutes, is lowered somewhat by the information that the man spent this time at the bottom of a very cold lake.) Consequently, it is always possible that a proposed IS explanation, even if the premises are true, would fail to predict the fact in question, and thus have no explanatory significance for the case at hand.
Hempel’s dissatisfaction with statistical explanation was at odds with modern science, for which the explanatory use of statistics had become indispensable. Moreover, Hempel’s requirement that IS explanations approximate the predictive power of DN explanations has the counterintuitive implication that for inherently low probability events no explanations are possible. For example, since smoking two packs of cigarettes a day for 40 years does not actually make it probable that a person will contract lung cancer, it follows from Hempel’s theory that a statistical law about smoking will not be involved in an IS explanation of the occurrence of lung cancer. Hempel’s view might be defended here by claiming that when our theories do not allow us to predict a phenomenon with a high degree of accuracy, it is because we have incomplete knowledge of the initial conditions. However, this seems to require us to base a theory of explanation on the now dubious metaphysical position that all events have determinate causes.
Another important criticism of Hempel’s theory is that many DN arguments with true premises do not appear to be explanatory. Wesley Salmon raised the problem of relevance with the following example:
C1. Butch takes birth control pills.
C2: Butch is a man.
L: No man who takes birth control pills becomes pregnant.
E: Butch has not become pregnant.
Unfortunately, this reasoning qualifies as explanatory on Hempel’s theory despite the fact that the premises seem to be explanatorily irrelevant to the conclusion.
Sylvain Bromberger raised the problem of asymmetry by pointing out that, while on Hempel’s model one can explain the period of a pendulum in terms of the length of the pendulum together with the law of simple periodic motion, one can just as easily explain the length of a pendulum in terms of its period in accord with the same law. Our intuitions tell us that the first is explanatory, but the second is not. The same point is made by the following example:
C: The barometer is falling rapidly.
L: Whenever the barometer falls rapidly, a storm is approaching.
E: A storm is approaching.
While the falling barometer is a trustworthy indicator of an approaching storm, it is counterintuitive to say that the barometer explains the occurrence of the storm. Rather, it is the approaching storm that explains the falling barometer.
These two problems, relevance and asymmetry, expose the difficulty of developing a theory of explanation that makes no reference to causal relations. Reference to causal relations is not an option for Hempel, however, since causation heads the anti-realist’s list of metaphysically suspect concepts. It would also undermine his view that explanation should be understood as an epistemic rather than a metaphysical relationship. Hempel’s response to these problems was that they raise purely pragmatic issues. His model countenances many explanations that prove to be useless, but whether an explanation has any practical value is not, in Hempel’s view, something that can be determined by philosophical analysis. This is a perfectly cogent reply, but it has not generally been regarded as an adequate one. Virtually all subsequent attempts to improve upon Hempel’s theory accept the above criticisms as legitimate.
As noted above, Hempel’s model requires that an explanation make use of at least one law-like generalization. This presents another sort of problem for the DN model. Hempel was careful to distinguish law-like generalizations from accidental generalizations. The latter are generalizations that may be true, but not in virtue of any law of nature. (for example, “All of my shirts are stained with coffee” may be true, but it is- I hope- just an accidental fact, not a law of nature.) Although the idea that explanation consists in subsuming events under natural laws has wide appeal in the philosophy of science, it is doubtful whether this requirement can be made consistent with Hempel’s epistemic view of explanation. The reason is simply that no one has ever articulated an epistemically sound criterion for distinguishing between law-like generalizations and accidental generalizations. This is essentially just Hume’s problem of induction, namely, that no finite number of observations can justify the claim that a regularity in nature is due to an natural necessity. In the absence of such a criterion, Hempel’s model seems to violate the spirit of the epistemic view of explanation, as well as the idea that explanation can be understood in purely logical terms.
Contemporary developments in the theory of explanation in many ways reflect the fragmented state of analytic philosophy since the decline of logical positivism. In this article we will look briefly at examples of how explanation has been conceived within the following five traditions: (1) Causal Realism, (2) Constructive Empiricism, (3) Ordinary Language Philosophy, (4) Cognitive Science and (5) Naturalism and Scientific Realism.
With the decline of logical positivism and the gathering success of modern theoretical science, philosophers began to regard continued skepticism about the reality of unobservable entities and processes as pointless. Different varieties of realism were articulated and against this background several different causal theories of explanation were developed. The idea behind them is the ordinary intuition noted at the beginning of this essay: to explain is to attribute a cause. Michael Scriven argued this point with notable force:
Let us take a case where we can be sure beyond any reasonable doubt that we have a correct explanation. As you reach for the dictionary, your knee catches the edge of the table and thus turns over the ink bottle, the contents of which proceed to run over the table’s edge and ruin the carpet. If you are subsequently asked to explain how the carpet was damaged you have a complete explanation. You did it by knocking over the ink. The certainty of this explanation is primeval…This capacity for identifying causes is learnt, is better developed in some people than in others, can be tested, and is the basis for what we call judgments. (1959: p. 456)
Wesley Salmon’s causal theory of explanation is perhaps the most influential developed within the realist tradition. Salmon had earlier developed a fundamentally epistemic view according to which an explanation is a list of statistically relevant factors. However he later rejected this, and any epistemic theory, as inadequate. His reason was that all epistemic theories are incapable of showing how explanations produce scientific understanding. This is because scientific understanding is not only a matter of having justified beliefs about the future. Salmon now insists that even a Laplacean Demon whose knowledge of the laws and initial conditions of the universe were so precise and complete as to issue in perfect predictive knowledge would lack scientific understanding. Specifically, he would lack the concepts of causal relevance and causal asymmetry and he could not distinguish between true causal processes and pseudo-processes. (As an example of the latter, consider the beam of a search light as it describes an arc through the sky. The movement of the beam is a pseudo-process since earlier stages of the beam do not cause later stages. By contrast, the electrical generation of the light itself, and the movement of the lamp housing are true causal processes.)
Salmon defends his causal realism by rejecting the Humean conception of causation as linked chains of events, and by attempting to articulate an epistemologically sound theory of continuous causal processes and causal interactions to replace it. The theory itself is detailed and does not lend itself to compression. It reads not so much as an analysis of the term ‘explanation’ as a set of instructions for producing an explanation of a particular phenomenon or event. One begins by compiling a list of statistically relevant factors and analyzing the list by a variety of methods. The procedure terminates in the creation of causal models of these statistical relationships and empirical testing to determine which of these models is best supported by the evidence.
Insofar as Salmon’s theory insists that an adequate explanation has not been achieved until the fundamental causal mechanisms of a phenomenon have been articulated, it is deeply reductionistic. It is not clear, for example, how Salmon’s model of explanation could ever generate meaningful explanations of mental events, which supervene on, but do not seem to be reducible to a unique set of causal relationships. Salmon’s theory is also similar to Hempel’s in at least one sense, and that is that both champion ideal forms of explanation, rather than anything that scientists or ordinary people are likely to achieve in the workaday world. This type of theorizing clearly has its place, but it has also been criticized by those who see explanation primarily as a form of communication between individuals. On this view, simplicity and ease of communication are not merely pragmatic, but essential to the creation of human understanding.
In his book The Scientific Image (1980) Bas van Fraassen produced an influential defense of anti-realism. Terming his view “constructive empiricism” van Fraassen claimed that theoretical science was properly construed as a creative process of model construction rather than one of discovering truths about the unobservable world. While avoiding the fatal excesses of logical positivism he argued strongly against the realistic interpretation of theoretical terms, claiming that contemporary scientific realism is predicated on a dire misunderstanding of the nature of explanation. (See “Naturalism and Scientific Realism” below). In support of his constructive empiricism van Fraassen produced an epistemic theory of explanation that draws on the logic of why-questions and draws on a Bayesian interpretation of probability.
Like Hempel, van Fraassen seeks to explicate explanation as a purely logical concept. However, the logical relation is not that of premises to conclusion, but one of question to answer. Following Bromberger, van Fraassen characterizes explanation as an answer to a why-question. Why-questions, for him, are essentially contrastive. That is, they always, implicitly or explicitly, ask: Why Pk, rather than some set of alternatives X= ? Why-questions also implicitly stipulate a relevance relation R, which is the explanatory relation (for example, causation) any answer must bear to the ordered pair .
Van Fraassen follows Hempel in addressing explanatory asymmetry and explanatory relevance as pragmatic issues. However, van Fraassen’s question-answering model makes this view a bit more intuitive. The relevance relation is defined by the interests of the person posing the question. For example, an individual who asks for an explanation of an airline accident in terms of the human decisions that led to it can not be forced to accept an explanation solely in terms of the weather. van Fraassen deals with the problem of explanatory asymmetry by showing that this, too, is a function of context. For example, most people would say that bad weather explains plane crashes, but plane crashes don’t explain bad weather. However, there are conditions (for example, unstable atmospheric conditions, an airplane carrying highly explosive cargo) that could combine to supply the latter explanation with an appropriate context.
Van Fraassen’s model also avoids Hempel’s problematic requirement of high probability for IS explanation. For van Fraassen, an answer will be potentially explanatory if it “favors” Pk over all the other members of the contrast class. This means roughly that the answer must confer greater probability on Pk than on any other Pi. It does not require that Pk actually be probable, or even that the probability of Pk be raised as a result of the answer, since favoring can actually result from an answer that lowers the probability of all other Pi relative to Pk. For van Fraassen, the essential tool for calculating the explanatory value of a theory is Bayes’ Rule, which allows one to calculate the probability of a particular event relative to a set of background assumptions and some new information. From a Bayesian point of view, the rationality of a belief is relative to a set of background assumptions which are not themselves the subject of evaluation. Van Fraassen’s theory of explanation is therefore deeply subjectivist: what counts as a good explanation for one person may not count as a good explanation for another, since their background assumptions may differ.
Van Fraassen’s pragmatic account of explanation buttresses his anti-realist position, by showing that when properly analyzed there is nothing about the concept of explanation that demands a realistic interpretation of causal processes or unobservables. Van Fraassen does not make the positivist mistake of claiming that talk of such things is metaphysical nonsense. He claims only that a full appreciation of science does not depend on a realistic interpretation. His pragmatism also offers an alternative account of Salmon’s Laplacean Demon. van Fraassen agrees with Salmon that an individual with perfect knowledge of the laws and initial conditions of the universe lacks something, but what he lacks is not objective knowledge of the difference between causal processes and pseudo processes. Rather, he simply lacks the human interests that make causation a useful concept.
Although van Fraassen’s theory of explanation is based on the view that explanation is a process of communication, he still chooses to explicate the concept of explanation as a logical relationship between question and answer, rather than as a communicative relationship between two individuals. Ordinary Language Philosophy tends to emphasize this latter quality, rejecting traditional epistemology and metaphysics and focusing on the requirements of effective communication. For this school, philosophical problems do not arise because ordinary language is defective, but because we are in some way ignoring the communicative function of language. Consequently, the point of ordinary language analysis is not to improve upon ordinary usage by clarifying the meanings of terms for use in some ideal vocabulary, but rather to bring the full ordinary meanings of the terms to light.
Within this tradition Peter Achinstein (1983) developed an illocutionary theory of explanation. Like Salmon, Achinstein characterizes explanation as the pursuit of understanding. He defines the act of explanation as the attempt by one person to produce understanding in another by answering a certain kind of question in a certain kind of way. Achinstein rejects Salmon’s narrow association of understanding with causation, as well as van Fraassen’s analysis in terms of why-questions. For Achinstein there are many different kinds of questions that we ordinarily regard as attempts to gain understanding (for example, who-, what-, when-, and where-questions) and it follows that the act of answering any of these is properly regarded as an act of explanation.
According to Achinstein’s theory S (a person) explains q (an interrogative expressing some question Q) by uttering u only if:
S utters u with the intention that his utterance of u render q understandable by producing the knowledge of the proposition expressed by u that it is a correct answer to Q. (1983: p.13)
Achinstein’s approach is an interesting departure from the types of theory discussed above in that it draws freely both on the concept of intention as well as the irreducibly causal notion of “producing knowledge.” This move clearly can not be countenanced by someone who sees explanation as a fundamentally logical concept. Even the causal realist who believes that explanations make essential reference to causes does not construe explanation itself in causal terms. Indeed, Achinstein’s approach is so different from theories that we have discussed so far that it might be best construed as addressing a very different question. Whereas traditional theories have attempted to explicate the logic of explanation, Achinstein’s theory may be best understood as an attempt to describe the process of explanation itself.
Like van Fraassen’s theory, Achinstein’s theory is deeply pragmatic. He stipulates that all explanations are given relative to a set of instructions (cf. van Fraassen’s relevance relations) and indicates that these instructions are ultimately determined by the individual asking the question. So, for example, a person who ask for an explanation why the electrical power in the house has gone out implicitly instructs that the question be answered in a way that would be relevant to the goal of turning the electricity back on. An answer that explained the absence of an electrical current in scientific terms, say by reference to Maxwell’s equations, would be inappropriate in this case.
Achinstein attempts to avoid van Fraassen’s subjectivism, by identifying understanding with knowledge that a certain kind of proposition is true. These, he calls “content giving propositions” which are to be contrasted with propositions that have no real cognitive significance. For example, Achinstein would want to rule out as non-explanatory, answers to questions that are purely tautological, such as: Mr. Pheeper died because Mr. Pheeper ceased to live. Achinstein also counts as non explanatory the scientifically correct answer to a question like: What is the speed of light in a vacuum? For him 186,000 miles/ second is not explanatory because, as it stands, it is just an incomprehensibly large number offering no basis of comparison with velocities that are cognitively significant. This does not mean that speed of light in a vacuum can not be explained. For example, a more cognitively significant answer to the above question might be that light can travel 7 1/2 times around the earth in one second. (Thanks to Professor Norman Swartz for this example)
One of the main difficulties with Achinstein’s theory is that the idea of a content-giving proposition remains too vague. His refusal to narrow the list of questions that qualify as requests for explanation makes it very difficult to identify any interesting property that an act of explanation must have in order to produce understanding. Moreover, Achinstein’s theory suffers from epistemological problems of its own. His theory of explanation makes essential reference to the intention to produce a certain kind of knowledge-state, but it is unclear from what Achinstein says how a knowledge state can be the result of an illocutionary act simpliciter. Certainly, such acts can produce beliefs, but not all beliefs so produced will count as knowledge, and Achinstein’s theory does not distinguish between the kinds of explanatory acts that are likely to result in such knowledge, and the kinds that will not.
While explanation may be fruitfully regarded as an act of communication, still another departure from the standard relational analysis is to think of explaining as a purely cognitive activity, and an explanation as a certain kind of mental representation that results from or aids in this activity. Considered in this way, explaining (sometimes called ‘abduction’) is a universal phenomenon. It may be conscious, deliberative, and explicitly propositional in nature, but it may also be unconscious, instinctive, and involve no explicit propositional knowledge whatsoever. For example: a father, hearing a high-pitched wail coming from the next room, rushes to his daughter’s aid. Whether he reacted instinctively, or on the basis of an explicit inference, we can say that the father’s behavior was the result of his having explained the wailing sound as the cry of his daughter.
From this perspective the term ‘explanation’ is neither a meta-logical nor a metaphysical relation. Rather, the term has been given a theoretical status and an explanatory function of its own; that is, we explain a person’s behavior by reference to the fact that he is in possession of an explanation. Put differently, ‘explanation’ has been subsumed into the theoretical vocabulary of science (with explanation itself being one of the problematic unobservables) an understanding of which was the very purpose of the theory of explanation in the first place.
Cognitive science is a diverse discipline and there are many different ways of approaching the concept of explanation within it. One major rift within the discipline concerns the question whether “folk psychology” with its reference to mental entities like intentions, beliefs and desires is fundamentally sound. Cognitive scientists in the artificial intelligence (AI) tradition argue that it is sound, and that the task of cognitive science is to develop a theory that preserves the basic integrity of belief-desire explanation. On this view, explaining is a process of belief revision, and explanatory understanding is understood by reference to the set of beliefs that result from that process. Cognitive scientists in the neuroscience tradition, in contrast, argue that folk psychology is not explanatory at all: in its completed state all reference to beliefs and desires will be eliminated from the vocabulary of cognitive science in favor of a vocabulary that allows us to explain behavior by reference to models of neural activity. On this view explaining is a fundamentally neurological process, and explanatory understanding is understood by reference to activation patterns within a neural network.
One popular approach that incorporates aspects of both traditional AI and neuroscience makes use of the idea of a mental model (cf. Holland et al. ) Mental models are internal representations that occur as a result of the activation of some part of a network of condition-action (or if-then) type rules. These rules are clustered in such a way that when a certain number of conditions becomes active, some action results. For example, here is a small cluster of rules that a simple cognitive system might use to distinguish different types of small furry mammals in a backyard environment.
(i) If [large, scurries, meows] then [cat].
(ii) If [small, scurries, squeaks] then [rat].
(iii) If [small, hops, chirps] then [squirrel].
(iv) If [squirrel or rat] then [flees].
(v) If [cat] then [approaches].
A mental model of a squirrel, then, can be described as an activation of rule (iii).
A key concept within the mental models framework is that of a default hierarchy. A set of rules such as those above, state a standard set of default conditions. When these are met, a set of expectations is generated. For example, the activation of rule (iii) generates expectations of type (iv). However, a viable representational system must be able to revise prior rule activations when expectations are contradicted by future experience. In the mental models framework, this is achieved by incorporating a hierarchy of rules below the default condition with more specific conditions at lower levels of the model whose actions will defeat default expectations. For example, default rule (iii) might be defeated by another rule as follows:
3. Level 1: If [small, hops, chirps] then [squirrel].
Level 2: If [flies] then [bird].
In other words, a system that identifies a small, hopping chirping animal as a squirrel generates a set of expectations about its future behavior. If these expectations are contradicted by, for example, the putative squirrel flying, then the system will descend to a lower level of the hierarchy thereby allowing the system to reclassify the object as a bird.
Although this is just a cursory characterization of the mental models framework it is enough to show how explanation can be handled within it. In this context it is natural to think of explanation as a process that is triggered by a predictive failure. Essentially, when the expectations activated at Level 1 of the default hierarchy fail, the system searches lower levels of the hierarchy to find out why. If the above example were formulated in explicitly propositional terms, we would say that the failure of Level 1 expectations generated the question: Why did the animal, which I previously identified as a squirrel, fly? The answer supplied at level 2 is: Because the animal is not a squirrel, but a bird. Of course, Level 2 rules produce their own set of expectations, which must themselves be corroborated with future experience or defeated by future explanations. Clearly, the above example is a rudimentary form of explanation. Any viable system must incorporate learning algorithms which allow it to modify both the content and structure of the default hierarchy when its expectations are repeatedly undermined by experience. This will necessarily involve the ability to generalize over past experiences and activate entirely new rules at every level of the default hierarchy.
One can reasonably doubt whether philosophical questions about the nature of explanation are addressed by defining and ultimately engineering systems capable of explanatory cognition. To the extent that these questions are understood in purely normative terms, they obviously arise in regard to systems built by humans with at least as much force as they arise for humans themselves. In defense of the cognitive science approach, however, one might assert that the simple philosophical question “What is explanation?” is not well-formed. If we accept some form of epistemic relativity, the proper form of such a question is always “What is explanation in cognitive system S?” Hence, doubts about the significance of explanatory cognition in some system S are best expressed as doubts about whether system S-type explanation models human cognition accurately enough to have any real significance for human beings.
Historically, naturalism is associated with the inclination to reject any kind of explanation of natural phenomena that makes essential reference to unnatural phenomena. Insofar as this view is understood simply as the rejection of supernatural phenomena (for example the actions of gods, irreducibly spiritual substances, etc.) it is uncontroversial within the philosophy of science. However, when it is understood to entail the rejection of irreducibly non-natural properties, (that is, the normative properties of ‘rightness’ and ‘wrongness’ that we appeal to in making evaluative judgments about human thought and behavior), it is deeply problematic. The problem is just that the aim of the philosophy of science has always been to establish an a priori basis for making precisely these evaluative judgments about scientific inquiry itself. If they can not be made, then it follows that the goals of philosophical inquiry have been badly misconceived.
Most contemporary naturalists do not regard this as an insurmountable problem. Rather, they just reject the idea that philosophical inquiry can occur from a vantage point outside of science, and they deny that evaluative judgments we make about scientific reasoning and scientific concepts have any a priori status. Put differently, they think philosophical inquiry should be seen as a very abstract form of scientific inquiry, and they see the normative aspirations of philosophers as something that must be achieved by using the very tools and methods that philosophers have traditionally sought to justify.
The relevance of naturalism to the theory of explanation can be understood briefly as follows. Naturalism undermines the idea that knowledge is prior to understanding. If it is true that there will never be an inductive logic that can provide an a priori basis for calling an observed regularity a natural law, then there is, in fact, no independent way of establishing what is the case prior to understanding why it is the case. Because of this, some naturalists (for example, Sellars) have suggested a different way of thinking about the epistemic significance of explanation. The idea, basically, is that explanation is not something that occurs on the basis of pre-confirmed truths. Rather, successful explanation is actually part of the process of confirmation itself:
Our aim [is] to manipulate the three basic components of a world picture: (a) observed objects and events, (b) unobserved objects and events, (c) nomological connections, so as to achieve a maximum of “explanatory coherence.” In this reshuffle no item is sacred. (Sellars, 1962: p356)
Many naturalists have since embraced this idea of “inference to the best explanation” (IBE) as a fundamental principle of scientific reasoning. Moreover, they have put this principle to work as an argument for realism. Briefly, the idea is that if we treat the claim that unobservable entities exist as a scientific hypothesis, then it can be seen as providing an explanation of the success of theories that employ them: namely, the theories are successful because they are (approximately) true. Anti-realism, by contrast, can provide no such explanation; on this view theories that make reference to unobservables are not literally true and so the success of scientific theories remains mysterious. It should be noted here that scientific realism has a very different flavor from the more foundational form of realism discussed above. Traditional realists do not think of realism as a scientific hypothesis, but as an independent metaphysical thesis.
Although IBE has won many converts in recent years it is deeply problematic precisely because of the way it employs the concept of explanation. While most people find IBE to be intuitively plausible, the fact remains that no theory of explanation discussed above can make sense of the idea that we accept a claim on the basis of its explanatory power. Rather, every such view stipulates as a condition of having explanatory power at all that a statement must be true or well-confirmed. Moreover, van Fraassen has argued that even if we can make sense of IBE, it remains a highly dubious principle of inductive inference. The reason is that “inference to the best explanation” really can only mean “inference to the best explanation given to date.” We are unable to compare proposed explanations to others that no one has yet thought of, and for this reason the property of being the best explanation can not be an objective measure of the likelihood that it is true.
One way of responding to these criticisms is to observe that Sellars’ concept of explanatory coherence is based on a view about the nature of understanding that simply eludes the standard models of explanation. According to this view an explanation increases our understanding, not simply by being the correct answer to a particular question, but by increasing the coherence of our entire belief system. This view has been developed in the context of traditional epistemology (Harman, Lehrer) as well as the philosophy of science (Thagard, Kitcher). In the latter context, the terms “explanatory unification” and “consilience” have been introduced to promote the idea that good explanations necessarily tend to produce a more unified body of knowledge. Although traditionalists will insist that there is no a priori basis for thinking that a unified or coherent set of beliefs is more likely to be true, (counterexamples are, in fact, easy to produce) this misses the point that most naturalists reject the possibility of establishing IBE, or any other inductive principle, on purely a priori grounds.
For critiques of naturalism, see the Social Science article.
This brief summary may leave the reader with the impression that philosophers are hopelessly divided on the nature of explanation, but this is not really the case. Most philosophers of science would agree that our understanding of explanation is far better now than it was in 1948 when Hempel and Oppenheim published “Studies in the Logic of Explanation.” While it serves expository purposes to represent the DN model and each of its successors as fatally flawed, this should not obscure the fact that these theories have brought real advances in understanding which succeeding models are required to preserve. At this point, fundamental disagreements on the nature of explanation fall into one of two categories. First, there are metaphysical disagreements. Realists and anti-realists continue to differ over what sort of ontological commitments one makes in accepting an explanation. Second, there are meta-philosophical disagreements. Naturalists and non-naturalists remain at odds concerning the relevance of scientific inquiry ( namely, inquiry into the way scientists, ordinary people and computers actually think) to a philosophical theory of explanation. These disputes are unlikely to be resolved anytime soon. Fortunately, however, the significance of further research into the logical and cognitive structure of explanation does not depend on their outcome.
- Achinstein, Peter (1983) The Nature of Explanation. New York: Oxford University Press.
- Belnap and Steele (1976) The Logic of Questions and Answers. New Haven: Yale University
- Bromberger, Sylvain (1966) “Why-Questions,” In Baruch A. Brody, ed., Readings in the Philosophy of Science, 66-84. Englewood Cliffs: Prentice Hall, Inc..
- Brody, Baruch A. (1970) Readings in the Philosophy of Science. Englewood Cliffs, N.J.: Prentice Hall
- Duhem, Pierre (1962) The Aim and Structure of Physical Theory. New York:
- Friedman, Michael (1974 ) “Explanation and Scientific Understanding.” Journal of Philosophy 71: 5-19.
- Harman, Gilbert (1965) “The Inference to the Best Explanation.” Philosophical Review, 74: 88-95.
- Hempel, Carl G. and Oppenheim, Paul (1948) “Studies in the Logic of Explanation.” In Brody p. 8-38.
- Hempel, Carl G. (1965) Aspects of Scientific Explanation and other Essays in the Philosophy of Science. New York: Free Press.
- Holland, John; Holyoak, Keith; Nisbett, Richard; Thagard, Paul (1986) Induction: Processes of Inference, Learning, and Discovery. Cambridge: MIT Press
- Hume, David (1977) An Enquiry Concerning Human Understanding. Indianapolis: Hackett
- Kitcher, Philip (1981) “Explanatory Unification.” Philosophy of Science 48:507-531.
- Lehrer, Keith (1990) Theory of Knowledge. Boulder: West View Press.
- Pitt, Joseph C. (1988) Theories of Explanation. Oxford: Oxford University Press.
- Quine, W. V. (1969) “Epistemology Naturalized.” In Ontological Relativity and Other Essays. New York: Columbia University Press: 69-90.
- Salmon, Wesley (1984) Scientific Explanation and the Causal Structure of the World. Princeton: Princeton University Press.
- Salmon, Wesley (1990) Four Decades of Scientific Explanation. Minneapolis: University of Minnesota Press.
- Scriven, M (1959) “Truisms as the Grounds for Historical Explanations.” In P. Gardiner (Ed.), Theories of History: Readings from Classical and Contemporary Sources, New York: Free Press, pp. 443-475.
- Sellars, Wilfred (1962) Science, Perception, and Reality. New York: Humanities Press.
- Stich, Stephen (1983) From Folk Psychology to Cognitive Science. Cambridge: The MIT Press.
- Thagard, Paul (1988) Computational Philosophy of Science. Cambridge: MIT Press.
- van Fraassen, Bas C. (1980) The Scientific Image. Oxford: Clarendon Press.
- van Fraassen, Bas C. (1989) Laws and Symmetry. Oxford: Clarendon Press.
G. Randolph Mayes
California State University Sacramento