Hostname: page-component-54dcc4c588-42vt5 Total loading time: 0 Render date: 2025-10-05T10:48:02.725Z Has data issue: false hasContentIssue false

On the Use of Parsimony in Archaeological Reasoning

Published online by Cambridge University Press:  29 September 2025

Sam C. Lin*
Affiliation:
Environmental Futures, School of Science, University of Wollongong, Wollongong, NSW, Australia
Ross Pain
Affiliation:
Department of Philosophy, University of Bristol, Bristol, UK
Alex Mackay
Affiliation:
Environmental Futures, School of Science, University of Wollongong, Wollongong, NSW, Australia
*
Corresponding author: Sam C. Lin; Email: samlin@uow.edu.au
Rights & Permissions [Opens in a new window]

Abstract

When evaluating competing hypotheses in archaeology, researchers frequently invoke the principle of parsimony, which states that simpler hypotheses should be preferred. However, the criteria for measuring simplicity and the rationale for labeling a hypothesis as most parsimonious often remain unclear. More broadly, the epistemic merit of parsimony and its relevance to archaeological reasoning are generally assumed but rarely clarified. This article explores how archaeologists use parsimony in ethnographic analogy and formal model selection. In the first context, it is usually uncertain how simplicity should be measured or why increased simplicity raises the plausibility of an ethnographic analogy. We contend that discussions of ethnographic analogy are better characterized under inference to the best explanation, where parsimony is only one heuristic among others. In the second context, simplicity is assessed by the relative complexity, rather than the quantity, of postulates in a model. This approach to parsimony, which does track plausibility under certain conditions, helps prevent false positives in archaeological interpretation. However, it also heightens the risk of rejecting alternative, complex causes. We argue that parsimony can aid in evaluating the relative likelihood of competing models and, more importantly, serve as a guide to clarify the complex histories of archaeological phenomena.

Resumen

Resumen

Al evaluar hipótesis en competencia en arqueología, los investigadores suelen invocar el principio de parsimonia, que establece que se deben preferir las hipótesis más simples. Sin embargo, los criterios para medir la simplicidad y la justificación para calificar una hipótesis como la más parsimoniosa a menudo no son claros. Más ampliamente, el mérito epistémico de la parsimonia y su relevancia para el razonamiento arqueológico suelen darse por sentados, pero rara vez se explicitan. Este artículo explora cómo los arqueólogos utilizan la parsimonia en la analogía etnográfica y la selección de modelos formales. En el primer contexto, suele ser incierto cómo debe medirse la simplicidad o por qué una mayor simplicidad aumenta la plausibilidad de una analogía etnográfica. Sostenemos que las discusiones sobre analogía etnográfica se caracterizan mejor bajo la inferencia a la mejor explicación, donde la parsimonia es solo una heurística entre otras. En el segundo contexto, la simplicidad se evalúa en función de la complejidad relativa, más que de la cantidad, de postulados en un modelo. Este enfoque, que refleja la plausibilidad en ciertas condiciones, ayuda a evitar falsos positivos en la interpretación arqueológica, pero también aumenta el riesgo de rechazar causas alternativas más complejas. Argumentamos que la parsimonia puede ayudar a evaluar la probabilidad relativa de modelos en competencia y, más importante aún, servir como una guía para clarificar las complejas historias de los fenómenos arqueológicos.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Society for American Archaeology.

Archaeologists reconstruct the behaviors of past human populations from their material remains. This task is inferential. The nature of archaeological inferences and how they should be formulated has been the subject of much debate. In the 1970s, some theorists argued that archaeological inference building should be formalized according to the hypothetico-deductive method (Fritz and Plog Reference Fritz and Plog1970; Watson et al. Reference Watson, LeBlanc and Redman1971). Others argued that the realities of archaeological reasoning show that it is not a deductive process (Morgan Reference Morgan1973; Salmon Reference Salmon1975; Tuggle et al. Reference Tuggle, Townsend and Riley1972). One important consideration here is equifinality—there are typically many different processes that might produce any given pattern in the record (Lucas Reference Lucas2005; Morgan Reference Morgan1973; Perreault Reference Perreault2019; Read and LeBlanc Reference Read and LeBlanc1978). For instance, we might stipulate that forager settlement type S tends to produce artifact spatial distribution type A, but we cannot be certain that settlement type S always results in artifact distribution type A, nor can we be sure that it is the only process that can generate artifact distribution type A. Consequently, it has been suggested that archaeological inference is more probabilistic in nature (Morgan Reference Morgan1973; Salmon Reference Salmon1975; Wylie Reference Wylie1982); that is, although finding artifact distribution type A does not confirm the occurrence of settlement type S, it does strengthen the plausibility of the hypothesis, given that we accept the generalization between settlement type and artifact distribution. In other words, the reasoning is inductive, and, more specifically, an inference to the best explanation.

In this sense, archaeological explanations need to be evaluated in terms of their relative inferential strength or weakness. A good archaeological explanation is one that we are highly confident is plausible. However, where multiple explanations are plausible—as is often the case in archaeology and other historical sciences (Chamberlin Reference Chamberlin1890)—we are confronted with the question of how we might rank them as more and less plausible. As we will elaborate below, there is a broad array of metrics we might use to rank competing hypotheses. One metric that archaeologists have used is the parsimony principle (Fotiadis Reference Fotiadis and López Varela2018). According to this principle, when we are faced with competing hypotheses that can equally account for the data, the “simplest” hypothesis should be preferred. “Simplicity” here refers to the number and complexity of the entities, causes, and/or processes postulated in explaining the empirical data. An explanation is “simpler” than another explanation if it posits fewer and/or less complex entities, causes, and processes.

If parsimony is used to support a hypothesis against rival explanations, then we need to be very clear about the number and complexity of postulates in a hypothesis. Unfortunately, however, few studies outline the reasoning behind their appeal to parsimony (e.g., Bettinger Reference Bettinger, Prentiss, Kuijt and Chatters2009; Brantingham Reference Brantingham2003). We see two main issues here. First, we are rarely offered explicit metrics for quantifying the number and complexity of postulates in a theory. Second, we are rarely offered comparisons that explicitly rank theories according to simplicity. It is therefore left unclear both why and whether some favored hypothesis is in fact the most parsimonious.

More broadly speaking, the epistemic merit of parsimony and its relevance to archaeological inference is typically treated as self-evident. Indeed, as Fotiadis (Reference Fotiadis and López Varela2018) observed, the adjective “parsimonious” is often used in lieu of “plausible,” “probable,” “likely,” or “satisfactory” in the archaeology literature. However, given that the formation of patterns in the archaeological record is a complex causal system with many causal inputs (Bentley and Maschner Reference Bentley, Maschner, Alexander Bentley, Maschner and Chippindale2009; Kohler Reference Kohler and Hodder2012), it is by no means clear that the link between parsimony and plausibility holds at all. In the case of archaeological reasoning, the simplest explanation may not be the most plausible.

In this article, we provide a review of the role parsimony plays in archaeological inference building. We are particularly interested in the ways archaeologists use the principle to prioritize competing archaeological explanations.

We focus on parsimony reasoning in two general contexts. The first is the use of ethnographic analogies to construct archaeological hypotheses. In this context, we show that researchers rarely offer explicit metrics for understanding why one theory is simpler than another, and we are not offered clear comparisons between theories regarding the number or complexity of postulates. Moreover, when we start to think through how this might be done, it becomes unclear whether theories that are claimed by proponents to be parsimonious do in fact contain fewer and less complex postulates than their rivals. We think the core issues in these debates have little to do with parsimony, and in fact, they are better characterized in terms of the broader framework of inference to the best explanation. On this view, parsimony is just one of many heuristics we should take into account when ranking competing hypotheses. Indeed, we argue that the epistemic strength of ethnographic analogies stems less from simplicity and more from their empirical breath and explanatory power.

The second context we examine concerns the goodness of fit between model and data. In this case, we do think that parsimony reasoning can play a productive role in theory choice. However, simplicity here is more aligned with the complexity—rather than quantity—of postulates in a model. The application of parsimony is more akin to Morgan’s (Reference Morgan1894) canon in comparative psychology, insofar as it involves differentiating and prioritizing “lower level” causes from “higher level” causes. We outline a metric for understanding this process. We argue that this approach to parsimony does track plausibility under certain conditions, meaning that explanations with less complex postulates do represent more likely explanations. However, our approach brings certain constraints. We outline these and show that some debates appealing to parsimony reasoning meet them. We also show that the conservative approach is biased against alternative explanations and carries a heightened risk of incorrectly excluding higher-level causes in archaeological explanations (i.e., false negatives).

Parsimony and Its Epistemic Value

The principle of parsimony, which is commonly attributed to the writing of the fourteenth-century English theologian William of Ockham, states, Entia non sunt multiplicanda praeter necessitatem, which translates to “entities are not to be multiplied without necessity.” Other commonly quoted statements about the principle include frustra fit per plura quod potest fieri per pauciora, or “it is futile to do with more what can be done with fewer.” The notion of simplicity can be traced back to Aristotle, who said “nature does nothing in vain” in relation to his teleological view. This belief that nature prefers fewer causes—and that explanations of nature should therefore do likewise—played a prominent role in the development of Western science. For example, Newton’s (Reference Newton1999 [1687]:398) Rule I of Reasoning in Philosophy is “we are to admit no more causes of natural things than such as are both true and sufficient to explain their appearance.” Similarly, Galileo (Reference Galileo1967 [1632]:397) stated, “Nature does not multiply things unnecessarily; that she makes use of the easiest and simplest means for producing her effects.”

Prior to the twentieth century, justification for the belief that nature does not contain causes beyond those necessary to produce some phenomenon typically involved a theistic argument (Sober Reference Sober2015): the universe is made by God, and the perfection of God is reflected in the simplicity of nature. Newton (Untitled Treatise on Revelation, Yahuda Ms. 1.1, National Library of Israel, Jerusalem, Israel; https://www.newtonproject.ox.ac.uk/view/texts/normalized/THEM00135) famously remarked, “It is the perfection of God’s works that they are all done with the greatest simplicity. He is the God of order and not of confusion.” Since the twentieth century, theistic arguments for simplicity have largely disappeared, and discussions regarding the value of parsimony now generally revolve around different methodological and epistemological viewpoints. For some, there is no a priori justification or empirical evidence for the belief that simpler explanations are more likely to be true than complex alternatives. Instead, the merit of parsimony is purely in its aesthetic and pragmatic effects (Huemer Reference Huemer2009; Quine Reference Quine1963; Walsh Reference Walsh1979). Simpler explanations are more pleasing aesthetically, easier to verify empirically, and more useful for providing a unified treatment of the phenomena in question. The aesthetic appeal of simplicity has been attributed to biases in our cognitive perception (Lombrozo Reference Lombrozo2007; Pothos and Chater Reference Pothos and Chater2002), whereas the pragmatic appeal relates to the sociological realities of scientific practice. For instance, simple hypotheses are more accessible to experimental testing and confirmation than complex hypotheses (Quine Reference Quine1963). This pragmatic view of parsimony is shared by some archaeological researchers. In his discussion of archaeological knowledge and inference, Fotiadis (Reference Fotiadis and López Varela2018:4) remarked that “the main merit of a parsimonious and elegantly articulated hypothesis is their perspicuousness. But perspicuousness is a pragmatic virtue, not necessarily an epistemic one; it does not by itself render a hypothesis more likely than rival hypotheses.” Similarly, in his study of lithic raw material procurement patterns, Brantingham (Reference Brantingham2003:505) noted that “there is no guarantee that the simplest explanation is the correct one.” However, Brantingham (Reference Brantingham2003:505) explicitly acknowledged the methodological value of parsimony: “the probability of producing a Type II error [sic] (i.e., false positive) is minimized by favouring the simplest model available and also that the opportunity for identifying and rectifying such an error is greater.” This view aligns with Karl Popper’s (Reference Popper1959) argument that simpler theories should be preferred because they are more falsifiable and therefore can be more quickly eliminated.

In contrast, there are several accounts that defend the epistemic merit of parsimony. For instance, given that parsimony is a prominent and long-standing heuristic of the scientific method, it can be argued that the success of science is itself an empirical testament that the principle does tend to lead one to the truth. From a Bayesian perspective, all else being equal, simpler hypotheses are held to have higher prior probabilities of being true than more complex ones (Jeffreys Reference Jeffreys1957). Others have contended that simpler hypotheses have greater explanatory power (e.g., Friedman Reference Friedman1974), or that they can better accommodate fewer possible sets of observations and make more specific predictions with greater accuracy (e.g., Forster Reference Forster, Zellner, Keuzenkamp and McAleer2002; Forster and Sober Reference Forster and Sober1994; Sober Reference Sober and Newton-Smith2000). With regard to predictive accuracy, scholars often refer to Akaike’s theorem with regard to curve fitting and the trade-off between fit to data points and the simplicity of the mathematical function, which is typically measured in terms of the number of adjustable parameters it contains (Forster Reference Forster, Zellner, Keuzenkamp and McAleer2002; Forster and Sober Reference Forster and Sober1994; Sober Reference Sober and Newton-Smith2000, Reference Sober2015). If our goal is to make accurate predictions over a general class of systems, more complex functions are more likely to “overfit” the data points and are therefore less able to fit to additional data or make accurate predictions of future observations. In comparison, simpler models with fewer variables have a lower risk of overfitting and a positive effect of leading to more accurate predictions of out-of-sample data. This is particularly acute in the case of archaeological inference, where we only have access to fragments of the target system (Perreault Reference Perreault2019).

It appears that archaeologists tend to agree with the latter view—that more parsimonious explanations are not to be preferred solely for pragmatic reasons; rather, they are preferable because they are more likely to be true. As mentioned above, archaeologists commonly use the term “parsimonious” in lieu of “probable” and “likely.” Yet, very few studies actually explain the reasoning behind this link between simplicity and truth. One example is Bettinger (Reference Bettinger, Prentiss, Kuijt and Chatters2009) and his discussion of macroevolution theory in archaeology. When deciding which hypotheses to prefer, he reasoned,

When individual and group selection hypotheses provide equally compelling accounts of a given cultural behaviour, shouldn’t the more parsimonious individual selection hypothesis be accepted? The answer naturally lies in what one takes to be the prior probabilities of individual and group selection, i.e., the likelihood that one is more common therefore more likely beforehand [Bettinger Reference Bettinger, Prentiss, Kuijt and Chatters2009:293].

However, even in this case, it is not entirely clear why Bettinger thinks that individual selection is simpler than group selection. Does individual selection involve fewer causes, postulates, and entities? Or is it a less complex process? Prima facie, it is not obvious that the answer to either of these questions is “yes.”

Two things are required to justify this kind of reasoning. First, we need to be told how the metric of simplicity is defined and measured. Second, we need to be offered an explicit comparison with rival theories, showing precisely why one theory performs better than others according to some metric for simplicity. In the following sections, we consider these two tasks by reviewing how archaeologists apply parsimony in two broad contexts: ethnographic analogy and model-based analysis.

Ethnographic Analogy, Parsimony, and Inference to the Best Explanation

A context in which parsimony is commonly mentioned in the archaeological literature concerns the use and evaluation of ethnographic analogy. This involves applying analogical reasoning to draw inferences about past processes on the basis of observations made in the present (Wylie Reference Wylie1982). If a stone chip shares key morphological features with a flake made by a modern knapper, we infer that the stone chip was made in a way that is similar to that of the modern flake (Lin et al. Reference Lin, Rezek and Dibble2018). Underlying this reasoning is a uniformitarian assumption that the observed similarities in material outcome warrant the existence of further similarities in processes leading to their formation in the past (Wylie Reference Wylie1982, Reference Wylie and Schiffer1985). An explicit formulation of analogic inference in archaeology is “ethnographic analogy,” where observations of living societies are used as the analog source from which to draw inferences about past behaviors.

One of the main ways archaeologists apply ethnographic analogy is through the so-called direct historical approach developed in the United States during the 1920s and 1930s. With roots in the historical particularism of Boasian anthropology of North America in the early twentieth century, the direct historical approach was based on the view that the archaeological record of the Americas is an extension into the past of contemporary Indigenous societies living in the same geographic region (Trigger Reference Trigger1981). Assuming that cultural expression is largely conditioned by historical inheritance and cultural continuity in a geographically limited area, then knowledge regarding the historical periods of a region can logically be extended back into earlier times (Strong Reference Strong1935).

According to Lyman and O’Brien (Reference Lyman and Michael2001), the direct historical approach works in two general steps. By working backward from historical records, archaeologists first assign ethnic identities to archaeological phenomena of a region and identify material culture traits to measure the ancestor-descendant relationships over time. This step links the analog source (historic/ethnographic account) and subject (archaeological phenomena) in a chronological order of assumed cultural inheritance. Once the link is established between an ethnographic group and an archaeological manifestation, archaeologists can extract a whole suite of cultural information from the ethnographic source and “use it to reconstitute the archaeological-subject phenomena into a dynamic cultural system” (Lyman and O’Brien Reference Lyman and Michael2001:316).

What happens when direct historical connections cannot be made between an archaeological phenomenon and a contemporary society? In these cases, it was suggested that a different analogy can be established on the basis of “the same general level of technological development, perhaps existing under similar environmental situations” (Willey Reference Willey, Tax, Eiseley, Rouse and Voegelin1953:229). The development of this alternative form of ethnographic analogy, termed “general comparative analogy” (Willey Reference Willey, Tax, Eiseley, Rouse and Voegelin1953) or “new analogy” (Ascher Reference Ascher1961), occurred at the time of interest in North American anthropology shifting away from culture history and toward cultural reconstruction (Lyman and O’Brien Reference Lyman and Michael1997). In particular, influenced by the works of people such as Steward (Reference Steward1955) and White (Reference White1959), anthropologists commonly regarded cultures as functionally adaptive systems operating at varying stages of evolutionary development (e.g., Parsons Reference Parsons1966; Sahlins and Service Reference Sahlins and Service1960). From this adaptationist perspective, societies manipulating similar environments through similar technologies ought to converge with respect to other sociocultural behaviors, such as settlement pattern, economy, and social organization. For this reason, to identify functional drivers of cultural change, it is necessary to compare cultural expressions from different geographic regions that share similar levels of technological development and that exist under similar environmental circumstances. This cross-cultural approach to ethnographic analogy became associated with the processual archaeology movement in the 1960s and 1970s, during which ethnographic correlations between behavior and material outcome under different environmental conditions were used to establish general regularities—or “middle range theories” (Binford Reference Binford and Binford1977, Reference Binford1981)—for connecting the static material record with dynamic cultural processes in the past.

Parsimony and Probability

So, what justifies the epistemic credentials of ethnographic analogy? When interested in assessing the accuracy of an analogy, many researchers point to parsimony. (Note, however, that there are reasons one might seek to pursue simpler explanations regardless of their accuracy; e.g., Currie [Reference Currie2019] and Nyrup [Reference Nyrup, Killin and Allen-Hermanson2021]. Thanks to an anonymous reviewer for raising this point.) For instance, when describing general comparative analogy, Stanish (Reference Stanish and Deborah2008) explained that there are invariant laws in the natural world that enable and constrain how people tend to engage and interact with the physical environment for a particular purpose: “Given this, and given the precept of parsimony, archaeologists can make high-probability interpretations of the function and use of . . . features in the archaeological record” (Stanish Reference Stanish and Deborah2008:1363). Stanish (Reference Stanish and Deborah2008:1364) also stated that “the adoption of the principle of parsimony is necessary” for the direct historical approach, such that “if one excavates a round structure with similar features to a historic hogan in Arizona, and if there is no evidence of cultural disruption, then one can make a high-probability statement that the excavated structure functioned in the same way as the historic hogan.” In another example, Hantman et alia (Reference Hantman, Gold and Dunham2004:583) invoked parsimony to justify their inference of cultural continuity between burial mounds in central Virginia circa AD 900–1700 and the colonial-era Monacan Indian people by stating that “the use of ‘parsimony’ and concordance with ‘available historical data’ make clear that we think the Monacan connection is the best and most logical one, pending other data.” Likewise, d’Errico and colleagues (Reference d’Errico, Backwell, Villa, Degano, Lucejko, Bamford, Higham, Colombini and Beaumont2012a:E3291) justified their interpretation of poison use at Border Cave in South Africa roughly 24,000 years ago as the “the most parsimonious,” because the observed presence of caster-bean extract on archaeological notched wooden sticks is “virtually identical to those used by [modern] San for applying poison.”

It is important to highlight the repeated mentioning of “high probability” in the quotes by Stanish (Reference Stanish and Deborah2008) outlined above. In fact, this sense of inferential probability is commonly brought up in discussions of direct historical analogy (Brumfiel Reference Brumfiel1976; Gould Reference Gould, Donnan and Clewlow1974, Reference Gould1980; see Lyman and O’Brien [Reference Lyman and Michael2001] for more discussion). Stanish (Reference Stanish and Deborah2008:1364) stated that “objects found in archaeological contexts of those same peoples’ ancestors carry a high probability of having the same function.” Similarly, in his review of Mesoamerican archaeology, Brumfiel (Reference Brumfiel1976:398) reasoned that “because the degree of cultural continuity between prehistoric and early historic Mesoamerican cultures should be greatest, the likelihood of drawing a correct analogy should also be greatest when inferences have been based upon the ethnohistorical literature of the area.” Likewise, Gould (Reference Gould1980:35) argued that direct historical analogies have “an inherently greater probability of being accurate approximations of behavioural realities than [general comparative analogies].”

If ethnographic analogies are claimed to offer high-probability explanations, and the parsimony principle is frequently invoked to justify these analogies, does it mean that this sense of inferential probability is derived from simplicity? What would be needed to demonstrate this is (1) a definition and metric for quantifying simplicity and (2) some explicit comparison between theories that shows one is simpler than another according to this metric. One way of approaching this is to formulate ethnographic analogies within an explicit phylogenetic framework, in which cultural traits are treated as heritable units subjected to evolutionary processes similar to biological traits (O’Brien et al. Reference O’Brien, Lyman, Mesoudi and VanPool2010). With this formal approach, historical continuity can be justified as simpler and more parsimonious than convergent innovation because it involves fewer evolutionary changes to explain the observed similarities. We will discuss this phylogenetic framework in more detail later with respect to model-based inference. For now, however, we note that explicit formulations of ethnographic analogy in terms of evolutionary phylogeny is relatively rare in the archaeological literature (Lipo et al. Reference Lipo, O’Brien, Collard and Shennan2017; Mace and Holden Reference Mace and Holden2005; O’Brien et al. Reference O’Brien, Collard, Buchanan and Boulanger2013). Instead, applications of ethnographic analogies tend to focus on other “boundary conditions” (Ascher Reference Ascher1961) to warrant the analogic inference, such as the temporal, spatial, or environmental similarities between the ethnographic source and the archaeological subject. If we go by these conditions and work through the task of defining and evaluating simplicity, it is by no means clear that those theories claimed to be parsimonious rank lower than their rivals on plausible metrics for simplicity.

Let us consider the perceived higher probability of direct historical analogy over the general comparative analogy. If parsimony is the reason for believing that cultural inheritance has a higher prior probability of being true than convergent development, then the cultural continuity explanation ought to postulate fewer or less complex causes, entities, and processes than convergence. To infer cultural continuity, we need to assume that (1) the amount of time separating the ethnographic and the archaeological is short enough for cultural continuity to be reasonable, (2) the groups of people have largely remained in one geographic region over time, (3) the material traits we are measuring represent homologous traits that are culturally inherited, and (4) no major cultural change has occurred over time to disrupt the transmission of the homologous traits (see Lyman and O’Brien [Reference Lyman and Michael2001] for more discussion about these assumptions). In contrast, to infer convergent development, the main assumptions would be that (1) the cultural traits that are being compared need to represent functional or technological responses to environmental factors, (2) the success/failure of these technologies have an impact on the fitness or survival of people such that there is pressure to enhance the performance of the technology (hence causing convergence; Stanish Reference Stanish and Deborah2008), and (3) societies have the necessary capacity and resources to independently develop these cultural traits. Are there fewer causes, entities, and processes outlined in the former process (1–4) than in the latter process (1–3)? Are the postulates stipulated in the former process less numerous and less complex than those outlined in the latter process? It is by no means clear to us that this is the case.

Properly working through the task of quantifying the number and complexity of postulates in these theories is beyond the scope of the present article. The main point we want to emphasize above is that, if researchers want to invoke the principle of parsimony in the service of ranking hypotheses, then it is a project with which they must engage. Unfortunately, it is also a very difficult project. To illustrate some of the complexities involved, let us look briefly at the temporal aspect of the direct historical approach. Consider the amount of time that separates the ethnographic source and the archaeological subject. Although studies generally agree that direct historical analogy is most applicable when the source and the subject are close in time and space, how close is “close enough” remains open to question. We may intuitively believe that a time range of up to a few hundred years or even one or two thousand years is plausible to observe continuity in cultural expressions, because we observe historical continuity in technology and material culture over similar time scales in our societies today. However, it is also evident in our own society that drastic cultural changes can take place over relatively short time frames, sometimes leaving little to no notable material traces. What about a geological time scale of tens of thousands of years, covering environmental transformations from the last glacial period to the late Holocene, such as that stipulated for the continuity of San material culture into the Late Pleistocene of Southern Africa by d’Errico et alia (Reference d’Errico, Backwell, Villa, Degano, Lucejko, Bamford, Higham, Colombini and Beaumont2012b)? What additional factors and processes about human behavior and societies do we need to assume in order to warrant an interpretation of cultural continuity over geological time?

The key point here is that if we measure simplicity by the number and complexity of postulates, it is by no means clear that direct historical analogy is simpler than general comparative analogy. Indeed, the inverse might well be the case. What is needed from researchers, then, is a clear articulation of the metric for simplicity they have in mind—and a clear comparison of theories based on this metric. Without this information, appeals to parsimony look epistemically unwarranted.

Inference to the Best Explanation

From this perspective, there appear to be significant issues associated with casting the inferential warrant of ethnographic analogy in terms of parsimony. It may well be the case that ethnographic analogy is best placed to explain similarities between ethnographic observations and the archaeological record, but it is by no means clear that appeals to simplicity account for why this is so. However, recall that parsimony, as we have mentioned, is used in the service of a broader inferential strategy: inference to the best explanation. This form of reasoning occurs when “one infers, from the premise that a given hypothesis would provide a ‘better’ explanation for the evidence than would any other hypothesis, to the conclusion that the given hypothesis is true” (Harman Reference Harman1965:89). If we follow this process, the plausibility of a hypothesis is increased according to its ability to explain observational data. Inference to the best explanation has been shown to be pervasive in archaeology, and it forms the underlying standard of archaeological reasoning that transcends processual and post-processual approaches to archaeological interpretation (Campanaro Reference Campanaro2021; Fogelin Reference Fogelin2007; Hanen and Kelly Reference Hanen, Kelley, Pinsky and Wylie1989). Indeed, regardless of theoretical perspectives, a core activity of archaeology is to gather and examine a range of empirical evidence and then evaluate competing hypotheses to determine the best explanation for that evidence. In this respect, the task facing archaeologists—and the reasoning process involved—is no different from the one we find in most other sciences.

But how do we judge if one hypothesis is “better” than another? What exactly is this “better” metric? There are various answers to this in the literature. Lipton (Reference Lipton2004) argued that the best explanation is the one that provides the most understanding of the phenomenon in question. As a result, it can be judged the most plausible. Other factors that make one explanation better than another may include explanatory breadth (i.e., the ability to explain multiple observed facts), depth (i.e., the ability to not raise more questions than it answers), power (i.e., the ability to apply to similar contexts), falsifiability (i.e., the ability to be refuted by empirical observations), modesty (i.e., the ability to not claim any more than is needed to explain the observed facts), and simplicity (i.e., the ability to postulate fewer and less complex entities, causes, or processes) (Fogelin Reference Fogelin2007). An inference is better than another if it scores more highly on these various metrics than a competing inference. Here, simplicity is just one of several virtues, and a good inference to the best explanation should meet as many of these conditions as possible. Consequently, although parsimony can contribute to finding the best explanations, it is certainly not the only—or even the most important—criterion. It is merely one among a set of (potentially competing) theoretical desiderata. We think this is an important and underappreciated point. When researchers invoke parsimony reasoning alone, we need to be told why parsimony—and not any of the other metrics guiding inference to the best explanation—is the relevant heuristic. Indeed, in many cases, we suspect this is a mistake. Instead, archaeological reasoning should be more pluralistic in its understanding of what makes one explanation “better” than another. In the remainder of this section, we expand on this idea.

Returning to the historical analogy versus comparative analogy debate, we have seen that there is good reason to doubt whether parsimony reasoning offers us the resources for resolving this debate. Rather, given the way ethnographic analogy is commonly formulated in the literature, we suggest that issues at stake here are best framed in terms of inference to the best explanation. In particular, the key question is whether or not processes of cultural inheritance or processes of convergence based on socioecology best explain similarities between contemporary artifacts and archaeological artifacts. Answering this question is a difficult task, which in part relates to the lack of established baseline expectations about how human cultural processes manifest in the archaeological record. Consider the difficulty of distinguishing homologous structures from analogous structures in material culture. Although certain material features, such as decorative symbols and motifs on ceramic vessels, arguably represent arbitrary cultural expressions, it is less straightforward to make the same argument for technologies whose tool configuration and production process are heavily constrained by the physical properties of the material and the desired functional purposes (Moore Reference Moore2020; Sackett Reference Sackett1982). As an example, for a wooden stick to be useful as a digging implement, it should be made of hardwood with a bevelled head. There are only so many ways to produce a beveled head on a wooden stick using a cutting edge. Consequently, if we are to find an archaeological digging stick made of the same wood and with the same beveled head design as an ethnographic example, can we be confident that these similarities are sufficient to warrant an interpretation of cultural continuity, even if these items occur in the same geographic location? The issue here is that due to constraints imposed by the physical properties of the material, there is a finite number of technological options available to solve particular problems. Of course, the chance for simple tools (e.g., a digging stick) to be reinvented independently is likely to be higher than that of more complex tools (Byrne Reference Byrne2007; Lucas et al. Reference Lucas, Kings, Whittle, Davey, Happé, Caldwell and Thornton2020). However, the challenge remains that morphological similarities in object classes can represent “spandrels” (Moore Reference Moore, Andrea and Khlentzos2007) that may manifest independently without cultural relatedness. In order to strengthen our interpretative confidence, we need to have a better understanding of the baseline expectation of the range of material expressions that can emerge from the finite range of technological options available to solve particular problems. This question is increasingly being discussed and empirically investigated, especially among studies of cumulative culture in relation to Paleolithic stone tool variability (e.g., Moore and Perston Reference Moore and Perston2016; Tennie et al. Reference Tennie, Braun, Premo, McPherron, Haidle, Conard and Bolus2016).

An important feature of the above discussion is that parsimony plays no role. The question is not “Which theory is simpler?”; it is “Which theory can best account for the data at hand?” Consequently, we suggest that the debate is better understood in terms of inference to the best explanation rather than via parsimony.

At a more fundamental level, analogies are probabilistic and therefore neither valid nor invalid (Wylie Reference Wylie1982). As mentioned earlier, ethnographic analogy is applied based on “boundary conditions” (Ascher Reference Ascher1961). With direct historical analogy, the boundary condition is the proximity in time and space; with general comparative analogy, the boundary condition is the proximity in technology and environmental conditions. When these conditions are met, the analogy is applied to explain the archaeological data; when the conditions are not met, the analogy is not applied. In the latter scenario, the analogy is not falsified; instead, it is deemed inapplicable to the specific context in question. Instead of falsification, our confidence in analogic inference depends on its ability to account for multiple strands of independent evidence (Wylie Reference Wylie1989, Reference Wylie2002). In this sense, the epistemic appeal of ethnographic analogy relates more to its explanatory breath, given that the analogy can simultaneously account for multiple co-occurring similarities that exist between the source and the subject. Moreover, ethnographic analogy provides explanatory “power” to connect and make sense of various available evidence. This latter point about power fulfills Lipton’s criterion that a good explanation should provide information that increases our potential understanding about the phenomenon we are trying to investigate. For instance, with direct historical analogy, similarities between ethnographic observations and archaeological phenomena are viewed to be best explained as homologous structures sharing common ancestry. This explanation offers potential understanding not only of the ethnic identity and lifeways of past people but also of cultural inheritance in the form of material culture and broader cultural transmission processes. With general comparative analogy, likeness in material culture is viewed to be best explained as analogous structures that converged in the form of functional responses to similar external drivers, especially when a prehistoric-historic cultural continuity cannot be demonstrated between the subject and the source. This explanation of cultural adaptation provides potential understanding of the generalities of how cultures interact and operate in relation to external environments.

Currie (Reference Currie2016) argued that, in its most basic form, ethnographic analogy is an application of the comparative method of human culture and material remains involving archaeological data. Although the parsimony principle is often invoked to justify the “likeliness” of ethnographic analogy in archaeology, we contend that the epistemic appeal of ethnographic analogy has little to do with simplicity and more to do with its perceived explanatory breath and power as inference to the best explanation. In fact, the view that ethnographic analogy provides compelling, “high-probability” explanations of archaeological data may, in part, stem from an intuitive perception that its applicability to archaeological data validates our preexisting beliefs regarding the role of cultural inheritance and ecological plasticity in shaping human behavioral regularities. Yet, because analogies are neither valid nor invalid, there is no license for all-encompassing acceptance or rejection of ethnographic analogies (Currie Reference Currie2016). Instead, ethnographic data should be treated either as one line of evidence that archaeologists can employ to strengthen hypotheses (Currie Reference Currie2016) or as inspirations for researchers to “imagine” new hypotheses that can account for the observed archaeological record (Lin Reference Lin2014; Lin et al. Reference Lin, Rezek and Dibble2018). In other words, although the reasoning process of ethnographic analogy follows that of inference to the best explanation, the analogies should not be treated as epistemically justified conclusions in and of themselves but rather as supportive means for generating and strengthening hypotheses that are testable with further empirical observations.

Parsimony and the Principle of Conservatism

The second context in which parsimony is often invoked in archaeology concerns the fit between model and data. In this context, “model” typically refers to formal explanations of archaeological patterning based on explicit processes. Here, parsimony is employed as a tool to favor the simplest model capable of accounting for the observed empirical evidence. However, as we will show below, notions of simplicity here are focused more on the complexity of postulates than their number. This approach to parsimony is often explicit among studies that apply evolutionary biology methods to explain archaeological variations. In this setting, cultural traits are treated as analogous to biological features, such that cultural variation over time can be explained by evolutionary processes of inheritance, selection, and drift. In the following section, we will first look at the use of cladistics in archaeology as an example of this approach to parsimony.

Archaeologists have used cladistics and parsimony to infer ancestor-descendant relationships among a range of artifact classes, including stone bifaces and projectile points, ceramic vessels, basketry, textile, and earth lodges (Buchanan and Collard Reference Buchanan and Collard2007; Jordan and Shennan Reference Jordan and Shennan2003, Reference Jordan and Shennan2009; Lycett Reference Lycett2009, Reference Lycett2011; O’Brien and Lyman Reference O’Brien and Lyman2002; O’Brien et al. Reference O’Brien, Darwent and Lyman2001, Reference O’Brien, Buchanan, Collard, Boulanger and Pontarotti2012; Tehrani and Collard Reference Tehrani and Collard2002; Tehrani et al. Reference Tehrani, Collard and Shennan2010). Considering that material culture is a product of cultural information transmitted among toolmakers, artifact variation over time can be analyzed to reconstruct artifact lineages that reflect heritable continuity and shared ancestry. However, there are usually multiple phylogenetic trees that can account for the observed data. Consequently, there need to be ways for deciding which tree is the “best” phylogenetic reconstruction to select. In this setting, parsimony is formulated as an explicit model selection method that prefers phylogenetic trees enlisting the fewest evolutionary changes to explain the observed variation. In other words, shared ancestry and homologic branching are given explanatory priority over convergent evolution and homoplastic changes.

With cladistics parsimony, the preference for homologic over homoplastic explanations echoes the sentiment outlined earlier regarding ethnographic analogies—that is, direct historical analogy is seen as more probable than general comparative analogy. However, in the case of cladistics, simplicity can be more straightforwardly quantified and compared by the length of the trees and the number of postulated evolutionary events. The shortest tree with the fewest homoplasties is the most parsimonious and therefore considered the “best” explanation. As Collard and Shennan (Reference Collard, Shennan, Stark, Bowser and Horne2008:28) explained, “Where the fit between a cultural dataset and the tree model is close, we can invoke the principle of parsimony and legitimately conclude that the similarities and differences among the cultural units are primarily results of branching.” Only when the fit between the tree model and the cultural dataset is poor can we “justifiably infer that borrowing or convergent evolution played a more important role in generating the similarities and differences among the cultural units” (Collard and Shennan Reference Collard, Shennan, Stark, Bowser and Horne2008:28).

But evolution is not a parsimonious process. Because evolution is nondirectional, there is no guarantee that evolution always “takes the shortest path.” If we select phylogenetic trees based on the parsimony criterion, the “best” tree may underestimate the actual extent of evolutionary change that could have occurred. What, then, justifies choosing the most parsimonious tree over others in cladistics? One can make a methodological argument. For example, in their archaeological cladistic study, O’Brien and colleagues (Reference O’Brien, Buchanan and Eren2016:73) stated that “the term ‘parsimony’ has nothing to do with whether evolution itself is parsimonious. Rather, it has to do with logical argumentation: it is more parsimonious to make as few ad hoc phylogenetic hypotheses as possible . . . [and] the simplest hypothesis is the one that is most easily defensible.” As a pragmatic tool, parsimony “provides a logical basis for choosing among alternative working hypotheses, even when the most parsimonious one seems counterintuitive” (O’Brien et al. Reference O’Brien, Buchanan and Eren2016:73). However, this seems like a worrying move. Presumably, the goal of developing cultural phylogenies is to provide an accurate description of processes of cultural change. If this is the case, and if we have no reason to think evolutionary systems are parsimonious, then our models should reflect this fact.

On the other hand, as Sober (Reference Sober1988, Reference Sober2015) reasoned, it is possible for cladistics parsimony to mirror likelihood without presupposing that evolution is parsimonious. This is because the application of parsimony here involves an epistemological asymmetry: certain processes (e.g., heritable continuity) are considered more common and likely than others (e.g., evolutionary change through natural selection). Indeed, if we accept that biological (and cultural) entities are linked by heritable continuity, it would be improbable for similarities to derive from separate sources, especially among closely related lineages. Instead, it is more probable that similar features emerged only once from a single common ancestor (Sober Reference Sober2015). Underlying this view is the idea that evolutionary events leading to character-state changes are not easy for evolution to produce. As O’Brien and colleagues (Reference O’Brien, Darwent and Lyman2001:1120) remarked, “Given that evolution is improbable to begin with . . . when faced with the problem of character polarity, we choose the option that requires the fewest evolutionary events.” However, it is important to remember that cultural evolution is not directly analogous to biological evolution. The former involves vertical, horizontal, and oblique lines of inheritance, and it is influenced by unstable mechanisms such as social norms, habits, and conventions. This is thought to mean that cultural evolution is messier and faster than biological evolution (e.g., Ram et al. Reference Ram, Liberman and Feldman2018). Under such circumstances, it is less clear that change over time must be attributed to standard evolutionary processes, such as selection or drift.

Moving on from cladistics, a similar approach to parsimony that involves explicitly prioritizing “simpler” causes over others can be seen in the application of population genetics–based methods. In general, these studies are interested in determining whether cultural patterns observed in the archaeological record are the result of selection. To assess this, researchers investigate the expected variation of cultural patterning under selectively “neutral” conditions, affected only by inheritance and drift. The development of neutral models has shown that material culture can vary considerably through cultural transmission and demographic mechanisms alone, such as population density and migration (Henrich Reference Henrich2004; Powell et al. Reference Powell, Shennan and Thomas2009; Shennan and Wilkinson Reference Shennan and Wilkinson2001). It is worth pointing out that there are questions concerning the theoretical justification and empirical applicability of these neutral models (Collard et al. Reference Collard, Vaesen, Cosgrove and Roebroeks2016; Premo Reference Premo2016, Reference Premo2021; Vaesen et al. Reference Vaesen, Collard, Cosgrove and Roebroeks2016), although these questions are beyond the scope of this article. What is important is that with this approach, the selectively neutral processes are viewed as simpler and are therefore prioritized over selection and evolutionary change for explaining patterns in the archaeological record (Powell et al. Reference Powell, Shennan and Thomas2009). Collard and Shennan (Reference Collard, Shennan, Stark, Bowser and Horne2008:22) state the following:

Where cultural patterns agree with the patterns predicted by a [neutral] model, we can invoke the principle of parsimony and discount the processes that are more complicated than the modelled process. In contrast, where the cultural and modelled patterns disagree, we can legitimately disregard the modelled process and seek a more complex explanation.

Neutral models are sometimes explicitly presented as the null hypothesis (Premo Reference Premo2006; Raup Reference Raup, Matthew and Hoffman1987). For example, in showing that Neanderthal extinction can be explained by a simple process of repeated migration and neutral drift, Kolodny and Feldman (Reference Kolodny and Feldman2017:1) posited that their model “offers a parsimonious alternative to those that invoke external factors or selective advantages, and represents a null hypothesis for assessing such alternatives.” Also dealing with Neanderthal extinction, Vaesen et alia (Reference Vaesen, Scherjon, Hemerik and Verpoorte2019) developed a demographic model to show that, even without competition with modern humans, Neanderthal extinction can be explained by the smaller population sizes of the extinct hominin: “An explanation solely in terms of the internal dynamics of the Neanderthal population, as the one presented here, serves as a null hypothesis against which competing, and less parsimonious, hypotheses are to be assessed” (Vaesen et al. Reference Vaesen, Scherjon, Hemerik and Verpoorte2019:10). The logic here, as summarized by Currie and Meneganzin (Reference Currie and Meneganzin2022), is that the “null” consists of the simpler and more parsimonious explanation available. If the null cannot be rejected, then it is assumed to represent the best hypothesis for explaining the phenomenon at hand. Alternative, more complex scenarios should only be entertained when the null is rejected by empirical data. Note that this “null-based” approach differs from the way null hypothesis is typically used in science. According to Currie and Meneganzin (Reference Currie and Meneganzin2022), in an experimental science setting, the null hypothesis serves to demonstrate that the observed effect of the independent variable is due to chance; consequently, accepting the null is a negative finding about the alternative hypothesis. From this view, neutral models actually represent competing hypotheses rather than the null. We will come back to this issue later when we discuss methodological biases with the “null-based” approach.

Morgan’s Canon

At the beginning of this section, we suggested that the approach to parsimony we have seen in this context is not merely about tallying up and comparing the number of postulates among competing hypotheses. An example is the demographic explanations of Neanderthal extinction. If we judge simplicity solely by the number of postulates, “a purely selective, or a purely environmental, explanation is presumably just as simple as a purely neutral explanation” (Currie and Meneganzin Reference Currie and Meneganzin2022:20). Instead, parsimony is applied here in a way akin to Morgan’s canon in comparative psychology, which states, “In no case is an animal activity to be interpreted in terms of higher psychological processes if it can be fairly interpreted in terms of processes which stand lower in the scale of psychological evolution and development” (Morgan Reference Morgan1894:53). According to Morgan, we should, by default, always adhere to a “principle of conservatism” (Sober Reference Sober2015) by prioritizing lower-level causes or processes unless the evidence suggests otherwise.

The difference between a higher and a lower cause is usually framed in terms of complexity, with the lower cause being “simpler” and therefore more parsimonious than the higher one. For example, when inferring human cognitive evolution from archaeological remains, Coolidge and colleagues (Reference Coolidge, Noël Haidle, Lombard and Wynn2016:221) stated, “Strict parsimony must apply. The simplest cognitive system that can account for archaeological features must be given priority.” In a general sense, this conservative approach of preferring simpler processes over more complex ones is relatively common in science. Scientists are generally more averse to committing type I errors, or false positives (Doan Reference Doan and Kempf-Leonard2005)—that is, we tend to think that concluding that something exists when it does not is worse than missing something when it does exist. If we prefer the simpler explanation by default, we can guard against falsely attributing higher-level causes to the phenomena at hand. Such a position is further warrantable in cases where false positives can have substantial impact on the understanding and treatment of the phenomenon in question.

But what is the basis of the differentiation between lower/simpler versus higher/complex? This is a complicated question, and one that has motivated some to advocate rejecting Morgan’s canon altogether (Fitzpatrick Reference Fitzpatrick2008). In what follows, we offer what we take to be a plausible way of applying the canon to archaeological inference, though there will undoubtably be others. In his original writing, Morgan (Reference Morgan1894:59) reasoned that “any animal may be at a stage where certain higher faculties have not yet been evolved from their lower precursors; and hence we are logically bound not to assume the existence of these higher faculties until good reasons shall have been shown for such existence.” We can conceptualize this evolutionary relationship in terms of “causal priority” (Starzak Reference Starzak2017)—that certain features and processes must be in place before others can develop. If faculty A needs to evolve first (i.e., ancestral) before faculty B can emerge (i.e., derived), then A is causally prior and hence lower than B (Starzak Reference Starzak2017). According to Morgan, the lower A should be prioritized over the higher B.

However, there can be issues with causal priority (Starzak Reference Starzak2017). Because evolution is nondirectional, a more ancestral feature may not always be simpler than one that evolved later (i.e., the lower A being more complex than the higher B). Consequently, we cannot assume a correlation between simplicity and causal priority. Moreover, it is possible for an organism to lose a simpler ancestral trait but retain a more complex one (i.e., having B but not A). In this case, preferring the lower cause would be incorrect.

Alternatively, we may justify Morgan’s canon with the concept of “entailment” as outlined by Sober (Reference Sober, Denise and Allen1998). Put simply, process B is at a higher level than A if and only if the occurrence of B entails the occurrence of A, but not conversely (Sober Reference Sober, Denise and Allen1998, Reference Sober2015). At first glance, this notion is similar to causal priority, given that the lower A must take place before B can occur. However, unlike causal priority, the entailment concept means that having the higher B must require having the lower A, which then leaves no room for the possibility for the lower A to be lost (Starzak Reference Starzak2017). This entailment relationship is crucial because it allows the different levels of causes to scale with simplicity. This is because a proposition of the higher-level B is actually a proposition of A and B (because the occurrence of the higher B must entail the occurrence of the lower A), which is a more complex proposition than one that postulates A alone (without B). In this way, we can demonstrate that a lower cause is indeed simpler and more parsimonious than a higher cause. For these reasons, Sober’s (Reference Sober, Denise and Allen1998) notion of entailment offers us a productive way of conceptualizing complexity.

In fact, we can further extend this idea of entailment to think about the relative likelihood of lower and higher causes. Let us assume, for the moment, that the probability for a lower process A and a higher process B is equally 0.2 (i.e., Pr(A) = Pr(B) = 0.2). The chance of having the lower process A alone without B (Pr(A & ¬B) = 0.2 × (1 − 0.2) = 0.16) should be higher than having both lower and higher processes A and B (Pr(A & B) = 0.2 × 0.2 = 0.04). This relative difference in likelihood between lower and higher causes will always hold if we have reasons to believe that the lower processes are equally common, if not more so, than the higher process (i.e., Pr(A) ≥ Pr(B)). For example, we may assume that heritable continuity is common, whereas evolutionary change is rare, or that individual selection is more prevalent than group selection, as Bettinger argued earlier. Moreover, if the occurrence of the higher process entails having multiple lower processes, then it is possible for a hypothesis of multiple lower causes to still have a greater probability than a hypothesis involving a single higher cause. We can show this by assuming that a higher B entails lower A1 and A2, and that, again, all causes have an equal probability of 0.2. The chance of having B (entailing A1 and A2) (Pr(A1 & A2 & B) = 0.2 × 0.2 × 0.2 = 0.008) would be smaller than having A1 and A2 without B (Pr(A1 & A2 & ¬B) = 0.2 × 0.2 × (1 − 0.2) = 0.032]. The point here is that under the condition of entailment, lower-level causes and processes can be demonstrated to be not only simpler but also more likely.

With the entailment concept, we can begin to clarify how parsimony is used to rank competing hypotheses in this context. For example, going back to cladistics parsimony, homoplastic change represents a higher-level process because evolutionary selection entails having heritable continuity of traits, but having heritable continuity does not entail evolutionary selection. For this reason, homological explanations are simpler and therefore preferred over homoplastic explanations. In the same vein, in the applications of population genetics-based methods, selection as an evolutionary process entails having demographic variation and inheritance, but the reverse is not true. Consequently, demographic hypotheses are more parsimonious and should be prioritized as explanations.

As an anonymous reviewer has suggested to us, the proposal above links with various contemporary projects that apply Bayesian analytical methods to archaeological inference (e.g., Crema et al. Reference Crema, Kerig and Shennan2014; Gjesfjeld and Jordan Reference Gjesfjeld, Jordan and Prentiss2019; Kandler and Crema Reference Kandler, Crema and Prentiss2019; Marwick et al. Reference Marwick, Matzig and Riede2023). This reviewer has also suggested that this opens up an intriguing possibility. If we follow our above proposal, one way in which to make parsimony assumptions explicit would be to make the probability estimates used in competing inferential models explicit. More broadly, researchers could provide both prior and posterior probability estimates on hypotheses. We believe the rigor and clarity of archaeological inference would be greatly increased were such practices to become commonplace. Unfortunately, the scope of this article prevents exploring these possibilities in more detail, but we think they represent an important line of future inquiry.

The Value of Conservatism

As mentioned earlier, an advantage of preferring simpler explanations is that we reduce the chance of mistakenly committing false positives in attributing higher-level causes to the phenomena in question. Many researchers in archaeology and human evolution studies find this conservative approach useful (Brantingham Reference Brantingham2003; Tennie et al. Reference Tennie, Premo, Braun and Shannon2017). However, on the flip side, conservatism heightens the risk of incorrectly omitting or denying the occurrence of higher causes, entities, or processes (i.e., false negatives), leading to potential oversight of interesting findings (Doan Reference Doan and Kempf-Leonard2005). In fact, it can be argued that, with the conservative approach, alternative explanations are disadvantaged in the research design, especially if the simpler explanation is set up as the null hypothesis rather than as a competing hypothesis (Currie and Meneganzin Reference Currie and Meneganzin2022). As mentioned earlier, this is because accepting the more complex alternative requires us to first reject the null, but this is not the case the other way around. Using Currie and Meneganzin’s (Reference Currie and Meneganzin2022) words, the null hypothesis is “on the table” by default, but the alternative explanations must “jump” over an extra bar that is set by the null in order to be considered. Therefore, operationally, it is easier to uphold the simpler explanation than to entertain the possibility of alternative hypotheses. An implication of this bias is that it can create an impression that the phenomena in question are simple and explainable by singular causes, mutually exclusive of other alternatives. This issue can be problematic in archaeology, given that such impressions reduce the complex histories and contextual aspects of human cultural practice and archaeological formation into simple narratives.

A possible way to address these issues is to adopt the view that accepting a more parsimonious hypothesis does not justify denying the existence of alternative explanations. This position is grounded in two considerations. First, the formation of the archaeological record is a complex system involving multiple causal inputs. As such, we must entertain the possibility that more than one hypothesis may be simultaneously true (Chamberlin Reference Chamberlin1890; see also Elliott and Brook Reference Elliott and Brook2007). From this standpoint, denying the existence of alternative explanations beyond the most parsimonious one appears restrictive and counterproductive, particularly if our objective is to unravel the intricacies of formational complexity as a means to infer past behavioral dynamics (Dibble et al. Reference Dibble, Holdaway, Lin, Braun, Douglass, Iovita, McPherron, Olszewski and Sandgathe2017; Rezek et al. Reference Rezek, Holdaway, Olszewski, Lin, Douglass, McPherron, Iovita, Braun and Sandgathe2020). Second, with respect to the issue of equifinality mentioned earlier in the article, if similar properties of the archaeological record can result from different causal inputs, then we must be open to the possibility that any given archaeological property can have multiple causes. For example, modeling studies have shown that the “distance-decay” distribution of lithic raw material abundance commonly observed among Paleolithic assemblages can be explained equally by either varying forager movement tendencies or varying stone-tool discard rate, or a combination of both (Brantingham Reference Brantingham2003, Reference Brantingham2006; Lin and Premo Reference Premo2021). Here, due to equifinality, accepting one causal hypothesis for explaining a phenomenon in question does not deny the possibility of other causal hypotheses. In this setting, although parsimony is used to identify superfluous causes that are not necessary to explain the empirical data, we should remain agnostic and open to the possibility that the superfluous causes exist.

Sober (Reference Sober2015) described this agnostic approach as wielding Ockham’s razor as a “razor of silence”—that is, we remain silent and uncommitted to, rather than outright denying, causes that are unnecessary to explain the phenomena at hand. An example of the application of this approach to parsimony can be seen in Brantingham’s (Reference Brantingham2003) neutral model of forager stone procurement. He states,

If [the model output and the archaeological pattern] are consistent, then we must confront the difficult possibility that empirically observable features of the archaeological record such as raw material richness and transport distances may not be telling us much of anything about optimization of procurement behaviours, depth of planning, risk management strategies, and, ultimately, adaptive variability [Brantingham Reference Brantingham2003:491].

In other words, accepting the more parsimonious neutral model does not deny past foragers of these behavioral complexities. Rather, with the razor of silence, it is not necessary to postulate these aspects of past human lifeways to explain the specific archaeological pattern in question (which, in this case, is the distribution of stone artifact abundance by distance to raw material source). If our goal is to infer these behavioral aspects, then we should be motivated to continue the line of inquiry by looking into other properties of the archaeological record.

Conclusion

Archaeologists often invoke the parsimony principle to support inferences about the past based on observations of the archaeological record. Proportionally few do so having considered the challenges of operationalizing that principle. Without clearly articulating the reasoning and treatment of parsimony, however, we risk turning parsimony into a mere rhetorical device rather than an epistemological tool. As we have shown in this article, parsimony is used by archaeologists in many different ways. In the context of ethnographic analogy, despite the regular reference to parsimony, it is not obvious how simplicity is measured or why simplicity justifies the plausibility of the analogy. Instead, we think that it is more useful to frame ethnographic analogy as an inference to the best explanation; its appeal stems from its perceived ability to best explain the observed data. However, whether ethnographic analogy offers the right answer or not for explaining archaeological observations is a different question that needs to be investigated empirically. In the context of model fitting, parsimony is applied more in the form of Morgan’s canon—that is, lower causes are simpler and hence prioritized over higher-level ones. We found this approach to parsimony more workable. In particular, by utilizing the notion of entailment (Sober Reference Sober, Denise and Allen1998, Reference Sober2015), higher-level causes are demonstrably more complex and less probable than lower causes under certain conditions. This general conservative approach is common in archaeological interpretation and arguably justified to prevent false-positive conclusions about the past. However, we caution against the counterproductive use of this approach to deny the possible existence of alternative superfluous causes. Parsimony is a method for assessing the relative likelihood of competing explanations; it does not resolve which of those explanations is true. Instead, researchers should be motivated and encouraged to continue investigating the archaeological phenomenon in order to clarify the record’s complex formational history in relation to past cultural and natural processes.

Acknowledgments

An early version of this article was presented at the “Inference in Paleoarchaeology” online session at the 86th Annual Meeting of the Society for American Archaeology in 2021. We thank Colin Murray-Wallace and Robin Torrence for their feedback on earlier drafts of this article, and Colin Murray-Wallace for directing us to T. C. Chamberlin’s article on multiple working hypotheses.

Funding Statement

During the preparation of the article, Sam Lin was in part supported by the Australian Research Council under a DECRA Fellowship (DE200100502). Ross Pain received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (101018533).

Data Availability Statement

No original data were used.

Competing Interests

The authors declare none.

References

References Cited

Ascher, Robert. 1961. Analogy in Archaeological Interpretation. Southwestern Journal of Anthropology 17(4):317325.10.1086/soutjanth.17.4.3628943CrossRefGoogle Scholar
Bentley, R. Alexander, and Maschner, Herbert D. G.. 2009. Complex Theory. In Handbook of Archaeological Theories, edited by Alexander Bentley, R., Maschner, Herbert D. G. and Chippindale, Christopher, pp. 245270. AltaMira Press, Lanham, Maryland.Google Scholar
Bettinger, Robert L. 2009. Macroevolutionary Theory and Archaeology: Is There a Big Picture? In Macroevolution in Human Prehistory: Evolutionary Theory and Processual Archaeology, edited by Prentiss, Anna M., Kuijt, Ian and Chatters, James C., pp. 275295. Springer, New York.10.1007/978-1-4419-0682-3_11CrossRefGoogle Scholar
Binford, Lewis R. 1977. General Introduction. In For Theory Building in Archaeology: Essays on Faunal Remains, Aquatic Resources, Spatial Analysis, and Systematic Modeling, edited by Binford, Lewis R., pp. 110. Academic Press, New York.Google Scholar
Binford, Lewis R. 1981. Bones: Ancient Men and Modem Myths. Academic Press, San Diego, California.Google Scholar
Brantingham, P. Jeffrey. 2003. A Neutral Model of Stone Raw Material Procurement. American Antiquity 68(3):487509.10.2307/3557105CrossRefGoogle Scholar
Brantingham, P. Jeffrey. 2006. Measuring Forager Mobility. Current Anthropology 47(3):435459.10.1086/503062CrossRefGoogle Scholar
Brumfiel, Elizabeth M. 1976. Review of Guide to Ethnohistorical Sources, edited by Howard F. Cline. American Antiquity 41 (3):398403.10.2307/279533CrossRefGoogle Scholar
Buchanan, Briggs, and Collard, Mark. 2007. Investigating the Peopling of North America through Cladistic Analyses of Early Paleoindian Projectile Points. Journal of Anthropological Archaeology 26(3):366393.10.1016/j.jaa.2007.02.005CrossRefGoogle Scholar
Byrne, Richard W. 2007. Culture in Great Apes: Using Intricate Complexity in Feeding Skills to Trace the Evolutionary Origin of Human Technical Prowess. Philosophical Transactions of the Royal Society B 362(1480):577585.10.1098/rstb.2006.1996CrossRefGoogle ScholarPubMed
Campanaro, Danilo Marco. 2021. Inference to the Best Explanation (IBE) and Archaeology: Old Tool, New Model. European Journal of Archaeology 24(3):412432.10.1017/eaa.2021.6CrossRefGoogle Scholar
Chamberlin, Thomas C. 1890. The Method of Multiple Working Hypotheses. Science 15 (366):9296.10.1126/science.ns-15.366.92CrossRefGoogle Scholar
Collard, Mark, and Shennan, Stephen J.. 2008. Patterns, Processes, and Parsimony: Studying Cultural Evolution with Analytical Techniques from Evolutionary Biology. In Cultural Transmission and Material Culture: Breaking Down Boundaries, edited by Stark, Miriam T., Bowser, Brenda J. and Horne, Lee, pp. 1733. University of Arizona Press, Tucson.Google Scholar
Collard, Mark, Vaesen, Krist, Cosgrove, Richard, and Roebroeks, Wil. 2016. The Empirical Case against the “Demographic Turn” in Palaeolithic Archaeology. Philosophical Transactions of the Royal Society B 371(1698):20150242. https://doi.org/10.1098/rstb.2015.0242.CrossRefGoogle Scholar
Coolidge, Frederick L., Noël Haidle, Miriam, Lombard, Marlize, and Wynn, Thomas. 2016. Bridging Theory and Bow Hunting: Human Cognitive Evolution and Archaeology. Antiquity 90(349):219228.10.15184/aqy.2015.139CrossRefGoogle Scholar
Crema, Enrico R., Kerig, Tim, and Shennan, Stephen. 2014. Culture, Space, and Metapopulation: A Simulation-Based Study for Evaluating Signals of Blending and Branching. Journal of Archaeological Science 43:289298.10.1016/j.jas.2014.01.002CrossRefGoogle Scholar
Currie, Adrian. 2016. Ethnographic Analogy, the Comparative Method, and Archaeological Special Pleading. Studies in History and Philosophy of Science Part A 55:8494.10.1016/j.shpsa.2015.08.010CrossRefGoogle ScholarPubMed
Currie, Adrian. 2019. Simplicity, One-Shot Hypotheses and Paleobiological Explanation. History and Philosophy of the Life Sciences 41(1):10.10.1007/s40656-019-0247-0CrossRefGoogle ScholarPubMed
Currie, Adrian, and Meneganzin, Andra. 2022. Not by Demography Alone: Neanderthal Extinction and Null Hypotheses in Paleoanthropological Explanation. Biology and Philosophy 37(6):50.10.1007/s10539-022-09881-yCrossRefGoogle Scholar
d’Errico, Francesco, Backwell, Lucinda, Villa, Paola, Degano, Ilaria, Lucejko, Jeannette J., Bamford, Marion K., Higham, Thomas F. G., Colombini, Maria Perla, and Beaumont, Peter B.. 2012a. Reply to Evans: Use of Poison Remains the Most Parsimonious Explanation for Border Cave Castor Bean Extract. PNAS 109(48):E3291E3292.10.1073/pnas.1214711109CrossRefGoogle Scholar
d’Errico, Francesco, Backwell, Lucinda, Villa, Paola, Degano, Ilaria, Lucejko, Jeannette J., Bamford, Marion K., Higham, Thomas F. G., Colombini, Maria Perla, and Beaumont, Peter B.. 2012b. Early Evidence of San Material Culture Represented by Organic Artifacts from Border Cave, South Africa. PNAS 109(33):1321413219.10.1073/pnas.1204213109CrossRefGoogle Scholar
Dibble, Harold L., Holdaway, Simon J., Lin, Sam C., Braun, David R., Douglass, Matthew J., Iovita, Radu, McPherron, Shannon P., Olszewski, Deborah I., and Sandgathe, Dennis. 2017. Major Fallacies Surrounding Stone Artifacts and Assemblages. Journal of Archaeological Method and Theory 24(3):813851.10.1007/s10816-016-9297-8CrossRefGoogle Scholar
Doan, Alesha E. 2005. Type I and Type II Error. In Encyclopedia of Social Measurement, edited by Kempf-Leonard, Kimberly, pp. 883888. Elsevier, Amsterdam.10.1016/B0-12-369398-5/00110-9CrossRefGoogle Scholar
Elliott, Louis P., and Brook, Barry W.. 2007. Revisiting Chamberlin: Multiple Working Hypotheses for the 21st Century. BioScience 57(7):608614.10.1641/B570708CrossRefGoogle Scholar
Fitzpatrick, Simon. 2008. Doing Away with Morgan’s Canon. Mind & Language 23(2):224246.10.1111/j.1468-0017.2007.00338.xCrossRefGoogle Scholar
Fogelin, Lars. 2007. Inference to the Best Explanation: A Common and Effective Form of Archaeological Reasoning. American Antiquity 72(4):603626.10.2307/25470436CrossRefGoogle Scholar
Forster, Malcolm R. 2002. The New Science of Simplicity. In Simplicity, Inference and Modeling: Keeping it Sophisticatedly Simple, edited by Zellner, Arnold, Keuzenkamp, Hugo A., and McAleer, Michael, pp. 83119. Cambridge University Press, Cambridge.10.1017/CBO9780511493164.005CrossRefGoogle Scholar
Forster, Malcolm, and Sober, Elliott. 1994. How to Tell When Simpler, More Unified, or Less Ad Hoc Theories Will Provide More Accurate Predictions. British Journal for the Philosophy of Science 45(1):135.10.1093/bjps/45.1.1CrossRefGoogle Scholar
Fotiadis, Michael. 2018. Knowledge and Inference. In The Encyclopedia of Archaeological Sciences, edited by López Varela, Sandra L., pp. 15. Wiley-Blackwell, Hoboken, New Jersey.Google Scholar
Friedman, Michael. 1974. Explanation and Scientific Understanding. Journal of Philosophy 71(1):519.10.2307/2024924CrossRefGoogle Scholar
Fritz, John M., and Plog, Fred T.. 1970. The Nature of Archaeological Explanation. American Antiquity 35(4):405412.10.2307/278113CrossRefGoogle Scholar
Galileo, . 1967 [1632]. Dialogue Concerning the Two Chief World Systems, Ptolemaic and Copernican. Translated by Stillman Drake. 2nd rev. ed. University of California Press, Berkeley.10.1525/9780520342941CrossRefGoogle Scholar
Gjesfjeld, Erik, and Jordan, Peter. 2019. Contributions of Bayesian Phylogenetics to Exploring Patterns of Macroevolution in Archaeological Data. In Handbook of Evolutionary Research in Archaeology, edited by Prentiss, Anna Marie, pp. 161182. Springer Nature, Cham, Switzerland.10.1007/978-3-030-11117-5_9CrossRefGoogle Scholar
Gould, Richard A. 1974. Some Current Problems in Ethnoarchaeology. In Ethnoarchaeology, edited by Donnan, Christopher B. and Clewlow, C. William Jr., pp. 2748. Institute of Archaeology, University of California, Los Angeles.Google Scholar
Gould, Richard A. 1980. Living Archaeology. Cambridge University Press, Cambridge.Google Scholar
Hanen, Marsha, and Kelley, Jane. 1989. Inference to the Best Explanation in Archaeology. In Critical Traditions in Contemporary Archaeology: Essays in the Philosophy, History, and Socio-Politics of Archaeology, edited by Pinsky, Valerie and Wylie, Alison, pp. 1417. Cambridge University Press, Cambridge.Google Scholar
Hantman, Jeffrey L., Gold, Debra L., and Dunham, Gary H.. 2004. Of Parsimony and Archaeological Histories: A Response to Comment by Boyd. American Antiquity 69(3):583585.10.2307/4128408CrossRefGoogle Scholar
Harman, Gilbert H. 1965. The Inference to the Best Explanation. Philosophical Review 74(1):8895.10.2307/2183532CrossRefGoogle Scholar
Henrich, Joseph. 2004. Demography and Cultural Evolution: How Adaptive Cultural Processes Can Produce Maladaptive Losses: The Tasmanian Case. American Antiquity 69(2):197214.10.2307/4128416CrossRefGoogle Scholar
Huemer, Michael. 2009. When Is Parsimony a Virtue? Philosophical Quarterly 59(235):216236.10.1111/j.1467-9213.2008.569.xCrossRefGoogle Scholar
Jeffreys, Harold. 1957. Scientific Inference. 2nd ed. Cambridge University Press, Cambridge.Google Scholar
Jordan, Peter, and Shennan, Stephen. 2003. Cultural Transmission, Language, and Basketry Traditions amongst the California Indians. Journal of Anthropological Archaeology 22(1):4274.10.1016/S0278-4165(03)00004-7CrossRefGoogle Scholar
Jordan, Peter, and Shennan, Stephen. 2009. Diversity in Hunter-Gatherer Technological Traditions: Mapping Trajectories of Cultural “Descent with Modification” in Northeast California. Journal of Anthropological Archaeology 28(3):342365.10.1016/j.jaa.2009.05.004CrossRefGoogle Scholar
Kandler, Anne, and Crema, Enrico R.. 2019. Analysing Cultural Frequency Data: Neutral Theory and Beyond. In Handbook of Evolutionary Research in Archaeology, edited by Prentiss, Anna Marie, pp. 83108. Springer Nature, Cham, Switzerland.10.1007/978-3-030-11117-5_5CrossRefGoogle Scholar
Kohler, Timothy A. 2012. Complex Systems and Archaeology. In Archaeological Theory Today, edited by Hodder, Ian, pp. 93123. Polity Press, Cambridge.Google Scholar
Kolodny, Oren, and Feldman, Marcus W.. 2017. A Parsimonious Neutral Model Suggests Neanderthal Replacement Was Determined by Migration and Random Species Drift. Nature Communications 8:1040. https://doi.org/10.1038/s41467-017-01043-z.CrossRefGoogle ScholarPubMed
Lin, Sam C. 2014. Experimentation and Scientific Inference Building in the Study of Hominin Behavior through Stone Artifact Archaeology. PhD dissertation, Department of Anthropology, University of Pennsylvania, Philadelphia.Google Scholar
Lin, Sam C., and Premo, L. S.. 2021. Forager Mobility and Lithic Discard Probability Similarly Affect the Distance of Raw Material Discard from Source. American Antiquity 86(4):845863.10.1017/aaq.2021.66CrossRefGoogle Scholar
Lin, Sam C., Rezek, Zeljko, and Dibble, Harold L.. 2018. Experimental Design and Experimental Inference in Stone Artifact Archaeology. Journal of Archaeological Method and Theory 25(3):663688.10.1007/s10816-017-9351-1CrossRefGoogle Scholar
Lipo, Carl P., O’Brien, Michael J., Collard, Mark, and Shennan, Stephen J. (editors). 2017. Mapping Our Ancestors: Phylogenetic Approaches in Anthropology and Prehistory. Routledge, London.Google Scholar
Lipton, Peter. 2004. Inference to the Best Explanation. 2nd ed. Routledge, London.Google Scholar
Lombrozo, Tania. 2007. Simplicity and Probability in Causal Explanation. Cognitive Psychology 55(3):232257.10.1016/j.cogpsych.2006.09.006CrossRefGoogle ScholarPubMed
Lucas, Amanda J., Kings, Michael, Whittle, Devi, Davey, Emma, Happé, Francesca, Caldwell, Christine A., and Thornton, Alex. 2020. The Value of Teaching Increases with Tool Complexity in Cumulative Cultural Evolution. Proceedings of the Royal Society B 287(1939):20201885. https://doi.org/10.1098/rspb.2020.1885.CrossRefGoogle ScholarPubMed
Lucas, Gavin. 2005. The Archaeology of Time. Routledge, London.Google Scholar
Lycett, Stephen J. 2009. Understanding Ancient Hominin Dispersals Using Artefactual Data: A Phylogeographic Analysis of Acheulean Handaxes. PLoS ONE 4(10):e7404. https://doi.org/10.1371/journal.pone.0007404.CrossRefGoogle Scholar
Lycett, Stephen J. 2011. “Most Beautiful and Most Wonderful”: Those Endless Stone Tool Forms. Journal of Evolutionary Psychology 9(2):143171.10.1556/JEP.9.2011.23.1CrossRefGoogle Scholar
Lyman, R. Lee, and Michael, J. O’Brien. 1997. The Concept of Evolution in Early Twentieth‐Century Americanist Archaeology. Archaeological Papers of the American Anthropological Association 7(1):2148.10.1525/ap3a.1997.7.1.21CrossRefGoogle Scholar
Lyman, R. Lee, and Michael, J. O’Brien. 2001. The Direct Historical Approach, Analogical Reasoning, and Theory in Americanist Archaeology. Journal of Archaeological Method and Theory 8(4):303342.10.1023/A:1013736416067CrossRefGoogle Scholar
Mace, Ruth, and Holden, Clare J.. 2005. A Phylogenetic Approach to Cultural Evolution. Trends in Ecology & Evolution 20(3):116121.10.1016/j.tree.2004.12.002CrossRefGoogle ScholarPubMed
Marwick, Ben, Matzig, David N., and Riede, Felix. 2023. Bayesian Inference of Material Culture Phylogenies Using Continuous Traits: A Birth–Death Model for Late Neolithic/Early Bronze Age Arrowheads from Northwestern Europe. SocArXiv. https://doi.org/10.31235/osf.io/j2kva.CrossRefGoogle Scholar
Moore, Mark W. 2007. Lithic Design Space Modelling and Cognition in Homo floresiensis. In Mental Stages, Volume 1: Evolution, Function, Nature, edited by Andrea, C. Schalley and Khlentzos, Drew, pp. 1134. John Benjamins, Amsterdam.10.1075/slcs.92.04mooCrossRefGoogle Scholar
Moore, Mark W. 2020. Hominin Stone Flaking and the Emergence of “Top-Down” Design in Human Evolution. Cambridge Archaeological Journal 30(4):647664.10.1017/S0959774320000190CrossRefGoogle Scholar
Moore, Mark W., and Perston, Yinika. 2016. Experimental Insights into the Cognitive Significance of Early Stone Tools. PLoS ONE 11(7):e0158803. https://doi.org/10.1371/journal.pone.0158803.CrossRefGoogle ScholarPubMed
Morgan, C. Lloyd. 1894. An Introduction to Comparative Psychology. Walter Scott, London.10.1037/11344-000CrossRefGoogle Scholar
Morgan, Charles G. 1973. Archaeology and Explanation. World Archaeology 4(3):259276.10.1080/00438243.1973.9979538CrossRefGoogle Scholar
Newton, Isaac. 1999 [1687]. The Principia: Mathematical Principles of Natural Philosophy. Translated by I. Bernard Cohen and Anne Whitman. University of California Press, Berkeley.Google Scholar
Nyrup, Rune. 2021. Mortar and Pestle or Cooking Vessel? When Archaeology Makes Progress through Failed Analogies. In Explorations in Archaeology and Philosophy, edited by Killin, Anton and Allen-Hermanson, Sean, pp. 2546. Springer Nature, Cham, Switzerland.10.1007/978-3-030-61052-4_3CrossRefGoogle Scholar
O’Brien, Michael J., Buchanan, Briggs, Collard, Mark, and Boulanger, Matthew T.. 2012. Cultural Cladistics and the Early Prehistory of North America. In Evolutionary Biology: Mechanisms and Trends, edited by Pontarotti, Pierre, pp. 2342. Springer, Berlin, Germany.10.1007/978-3-642-30425-5_2CrossRefGoogle Scholar
O’Brien, Michael J., Buchanan, Briggs, and Eren, Metin I.. 2016. Clovis Colonization of Eastern North America: A Phylogenetic Approach. Science and Technology of Archaeological Research 2(1):6789.10.1080/20548923.2016.1183920CrossRefGoogle Scholar
O’Brien, Michael J., Collard, Mark, Buchanan, Briggs, and Boulanger, Matthew T.. 2013. Trees, Thickets, or Something in Between? Recent Theoretical and Empirical Work in Cultural Phylogeny. Israel Journal of Ecology and Evolution 59(2):4561.10.1080/15659801.2013.825431CrossRefGoogle Scholar
O’Brien, Michael J., Darwent, John, and Lyman, R. Lee. 2001. Cladistics Is Useful for Reconstructing Archaeological Phylogenies: Palaeoindian Points from the Southeastern United States. Journal of Archaeological Science 28(10):11151136.10.1006/jasc.2001.0681CrossRefGoogle Scholar
O’Brien, Michael J., and Lyman, R. Lee. 2002. Evolutionary Archeology: Current Status and Future Prospects. Evolutionary Anthropology 11(1):2636.10.1002/evan.10007CrossRefGoogle Scholar
O’Brien, Michael J., Lyman, R. Lee, Mesoudi, Alex, and VanPool, Todd L.. 2010. Cultural Traits as Units of Analysis. Philosophical Transactions of the Royal Society B 365(1559):37973806.10.1098/rstb.2010.0012CrossRefGoogle ScholarPubMed
Parsons, Talcott. 1966. Societies: Evolutionary and Comparative Perspectives. Prentice-Hall, Englewood Cliffs, New Jersey.Google Scholar
Perreault, Charles. 2019. The Quality of the Archaeological Record. University of Chicago Press, Chicago.10.7208/chicago/9780226631011.001.0001CrossRefGoogle Scholar
Popper, Karl R. 1959. The Logic of Scientific Discovery. Basic Books, New York.Google Scholar
Pothos, Emmanuel M., and Chater, Nick. 2002. A Simplicity Principle in Unsupervised Human Categorization. Cognitive Science 26(3):303343.10.1207/s15516709cog2603_6CrossRefGoogle Scholar
Powell, Adam, Shennan, Stephen J., and Thomas, Mark G.. 2009. Late Pleistocene Demography and the Appearance of Modern Human Behavior. Science 324(5932):12981301.10.1126/science.1170165CrossRefGoogle ScholarPubMed
Premo, L. S. 2006. Patchiness and Prosociality: Modeling the Evolution and Archaeology of Plio-Pleistocene Hominin Food Sharing. PhD dissertation, Department of Anthropology, University of Arizona, Tucson.Google Scholar
Premo, L. S. 2016. Effective Population Size and the Effects of Demography on Cultural Diversity and Technological Complexity. American Antiquity 81(4):605622.10.7183/0002-7316.81.4.605CrossRefGoogle Scholar
Premo, L. S. 2021. Population Size Limits the Coefficient of Variation in Continuous Traits Affected by Proportional Copying Error (and Why This Matters for Studying Cultural Transmission). Journal of Archaeological Method and Theory 28(2):512534.10.1007/s10816-020-09464-9CrossRefGoogle Scholar
Quine, Willard van Orman. 1963. On Simple Theories of a Complex World. Synthese 15(1):103106.10.1007/BF00484843CrossRefGoogle Scholar
Ram, Yoav, Liberman, Uri, and Feldman, Marcus W.. 2018. Evolution of Vertical and Oblique Transmission under Fluctuating Selection. PNAS 115(6):E1174E1183.10.1073/pnas.1719171115CrossRefGoogle ScholarPubMed
Raup, David M. 1987. Neutral Models in Paleobiology. In Neutral Models in Biology, edited by Matthew, H. Nitecki and Hoffman, Antoni, pp. 121132. Oxford University Press, New York.Google Scholar
Read, Dwight W., and LeBlanc, Steven A.. 1978. Descriptive Statements, Covering Laws, and Theories in Archaeology [and Comments and Reply]. Current Anthropology 19(2):307335.10.1086/202077CrossRefGoogle Scholar
Rezek, Zeljko, Holdaway, Simon J., Olszewski, Deborah I., Lin, Sam C., Douglass, Matthew, McPherron, Shannon P., Iovita, Radu, Braun, David R., and Sandgathe, Dennis. 2020. Aggregates, Formational Emergence, and the Focus on Practice in Stone Artifact Archaeology. Journal of Archaeological Method and Theory 27(4):887929.10.1007/s10816-020-09445-yCrossRefGoogle Scholar
Sackett, James R. 1982. Approaches to Style in Lithic Archaeology. Journal of Anthropological Archaeology 1(1):59112.10.1016/0278-4165(82)90008-3CrossRefGoogle Scholar
Sahlins, Marshall D., and Service, Elman R. (editors). 1960. Evolution and Culture. University of Michigan Press, Ann Arbor.10.3998/mpub.8980CrossRefGoogle Scholar
Salmon, Merrilee H. 1975. Confirmation and Explanation in Archaeology. American Antiquity 40(4):459464.10.2307/279333CrossRefGoogle Scholar
Shennan, Stephen J., and Wilkinson, J. R.. 2001. Ceramic Style Change and Neutral Evolution: A Case Study from Neolithic Europe. American Antiquity 66(4):577593.10.2307/2694174CrossRefGoogle Scholar
Sober, Elliott. 1988. Reconstructing the Past: Parsimony, Evolution, and Inference. Massachusetts Institute of Technology, Cambridge, Massachusetts.Google Scholar
Sober, Elliott. 1998. Morgan’s Canon. In The Evolution of Mind, edited by Denise, D. Cummins and Allen, Colin, pp. 224242. Oxford University Press, New York.Google Scholar
Sober, Elliott. 2000. Simplicity. In A Companion to the Philosophy of Science, edited by Newton-Smith, W. H., pp. 433441. Blackwell, Oxford.Google Scholar
Sober, Elliott. 2015. Ockham’s Razors: A User’s Manual. Cambridge University Press, Cambridge.10.1017/CBO9781107705937CrossRefGoogle Scholar
Stanish, Charles. 2008. Explanation in Archaeology, Overview. In Encyclopedia of Archaeology, edited by Deborah, M. Pearsall, pp. 13581364. Academic Press, New York.10.1016/B978-012373962-9.00108-4CrossRefGoogle Scholar
Starzak, Tobias. 2017. Interpretations without Justification: A General Argument against Morgan’s Canon. Synthese 194(5):16811701.10.1007/s11229-016-1013-4CrossRefGoogle Scholar
Steward, Julian H. 1955. Theory of Culture Change: The Methodology of Multilinear Evolution. University of Illinois Press, Urbana.Google Scholar
Strong, William D. 1935. An Introduction to Nebraska Archeology. Smithsonian Miscellaneous Collections Vol. 93, No. 10. Smithsonian Institution, Washington, DC.Google Scholar
Tehrani, Jamshid, and Collard, Mark. 2002. Investigating Cultural Evolution through Biological Phylogenetic Analyses of Turkmen Textiles. Journal of Anthropological Archaeology 21(4):443463.10.1016/S0278-4165(02)00002-8CrossRefGoogle Scholar
Tehrani, Jamshid J., Collard, Mark, and Shennan, Stephen J.. 2010. The Cophylogeny of Populations and Cultures: Reconstructing the Evolution of Iranian Tribal Craft Traditions Using Trees and Jungles. Philosophical Transactions of the Royal Society B 365(1559):38653874.10.1098/rstb.2010.0020CrossRefGoogle ScholarPubMed
Tennie, Claudio, Braun, David R., Premo, L. S., and McPherron, Shannon P.. 2016. The Island Test for Cumulative Culture in the Paleolithic. In The Nature of Culture: Based on an Interdisciplinary Symposium ‘The Nature of Culture’, Tübingen, Germany, edited by Haidle, Miriam N., Conard, Nicholas J. and Bolus, Michael, pp. 121133. Springer, Dordrecht, Germany.10.1007/978-94-017-7426-0_11CrossRefGoogle Scholar
Tennie, Claudio, Premo, L. S., Braun, David R., and Shannon, P. McPherron. 2017. Early Stone Tools and Cultural Transmission: Resetting the Null Hypothesis. Current Anthropology 58(5):652654.10.1086/693846CrossRefGoogle Scholar
Trigger, Bruce G. 1981. Archaeology and the Ethnographic Present. Anthropologica 23(1):317.10.2307/25605060CrossRefGoogle Scholar
Tuggle, H. David, Townsend, Alex H., and Riley, Thomas J.. 1972. Laws, Systems, and Research Designs: A Discussion of Explanation in Archaeology. American Antiquity 37(1):312.10.2307/278882CrossRefGoogle Scholar
Vaesen, Krist, Collard, Mark, Cosgrove, Richard, and Roebroeks, Wil. 2016. Population Size Does Not Explain Past Changes in Cultural Complexity. PNAS 113(16):E2241E2247.10.1073/pnas.1520288113CrossRefGoogle ScholarPubMed
Vaesen, Krist, Scherjon, Fulco, Hemerik, Lia, and Verpoorte, Alexander. 2019. Inbreeding, Allee Effects and Stochasticity Might Be Sufficient to Account for Neanderthal Extinction. PLoS ONE 14(11):e0225117. https://doi.org/10.1371/journal.pone.0225117.CrossRefGoogle ScholarPubMed
Walsh, Dorothy. 1979. Occam’s Razor: A Principle of Intellectual Elegance. American Philosophical Quarterly 16(3):241244.Google Scholar
Watson, Patty Jo, LeBlanc, Steven A., and Redman, Charles L.. 1971. Explanation in Archaeology: An Explicitly Scientific Approach. Columbia University Press, New York.Google Scholar
White, Leslie A. 1959. The Concept of Culture. American Anthropologist 61(2):227251.10.1525/aa.1959.61.2.02a00040CrossRefGoogle Scholar
Willey, Gordon R. 1953. What Archaeologists Want. In An Appraisal of Anthropology Today, edited by Tax, Sol, Eiseley, Loren C., Rouse, Irving and Voegelin, Carl F., pp. 229232. University of Chicago Press, Chicago.Google Scholar
Wylie, Alison. 1982. An Analogy by Any Other Name Is Just as Analogical: A Commentary on the Gould-Watson Dialogue. Journal of Anthropological Archaeology 1(4):382401.10.1016/0278-4165(82)90003-4CrossRefGoogle Scholar
Wylie, Alison. 1985. The Reaction against Analogy. In Advances in Archaeological Method and Theory, Vol. 8, edited by Schiffer, Michael B., pp. 63111. Academic Press, Orlando, Florida.10.1016/B978-0-12-003108-5.50008-7CrossRefGoogle Scholar
Wylie, Alison. 1989. Archaeological Cables and Tacking: The Implications of Practice for Bernstein’s “Options Beyond Objectivism and Relativism.” Philosophy of the Social Sciences 19(1):118.10.1177/004839318901900101CrossRefGoogle Scholar
Wylie, Alison. 2002. Thinking from Things: Essays in the Philosophy of Archaeology. University of California Press, Berkeley.10.1525/9780520935402CrossRefGoogle Scholar