Mind the gaps, mend the gaps

A very long time ago I asked whether Neandertals’ brains grew like ours do today, a question raised by conflicting results coming from two research teams. Both teams reconstructed the brain endocasts of modern humans and fossil Neandertals, and compared how endocast shapes changed during growth and development. As I mused in that post, the different results seem to result largely from differences in how a critical fossil specimen (the Neandertal newborn from Mezmaiskaya, Russia) was reconstructed. This is a perennial problem for paleoanthropology. Our knowledge of the human past hinges on a few thousands of individuals whose bones and teeth managed to survive and be discovered after several thousands or millions of years. Most of these precious remains are fragmentary and cannot speak for themselves. So, researchers must rely on their own anatomical expertise and a bit of artistic license to reconstruct what many key fossils would have looked like in their original condition.

Physical reconstruction of a Homo erectus cranium (A and turquoise in C) compared to its “virtual” reconstruction (B and gray in C), by Karen Baab (2025).

This is a perennial problem for paleoanthropology. Our knowledge of the human past hinges on a few thousands of individuals whose bones and teeth managed to survive and be discovered after several thousands or millions of years. Most of these precious remains are fragmentary and cannot speak for themselves. So, researchers must rely on their own anatomical expertise and a bit of artistic license to reconstruct what many key fossils would have looked like in their original condition.

Over thirty years ago Christophe Zollikofer and colleagues (1995: 283) reported that, “Fossil specimens can be restored, measured and replicated without physical contact using … computer assisted reconstruction.” The development of these “virtual anthropology” methods has made fossil reconstruction much more accessible. Most importantly, virtual methods allow researchers to generate multiple, reasonably realistic reconstructions of the same fossil. As Philipp Gunz and colleagues (2009: 61) noted, “While there typically will be shape differences among equally plausible reconstructions, these different estimates might still support a single conclusion. But they need not do so, and all assumptions must be strenuously challenged if one or more reconstructions, or a statistical analysis based on them, are to be treated as arguments for a scientific claim.”

As these paleo pioneers have also acknowledged, making data publicly available will also help assess the extent to which specific reconstructions might affect subsequent interpretations. Both of these research groups have published 3D landmark datasets with some overlapping specimens, allowing us to address this central question. Simon Neubauer and colleagues (2018) published the landmark data used in their reconstruction and analysis of a juvenile Homo erectus cranium (here). A team led by Marcia Ponce de León (2021) and Christophe Zollikofer (2022) have posted comparable data from their endocast reconstructions of Homo erectus from Dmanisi, Georgia (here) and early Homo sapiens from Herto, Ethiopia (here). These great datasets bear on the evolution of brain size and shape—let’s dig in.

Both groups—Neubauer et al. and Ponce de León et al. + Zollikofer et al. (hereafter “PZ”)—include recent modern humans from different skeletal collections and the same nine fossil Homo specimens: KNM-ER 1813 (H. habilis), KNM-ER 1470 (H. rudolfensis), and seven other fossils from Kenya and Indonesia typically attributed to Homo erectus. Most of the fossils required varying extents of reconstruction, from the alignment of separate cranial fragments to the mathematical estimation of endocranial surfaces that aren’t preserved. The two teams measured endocast shape using comparable but slightly different sets of 3D landmark coordinates, so we can’t combine the datasets but we can run the same set of analyses on each sample separately and then compare the results.

Overall size and shape variation in the two datasets. Left: Centroid size of each specimen with the dashed line indicating parity between samples. Center and right: endocast shape variability within the Neubauer (center) and PZ (right) samples; color-coded 3D models beneath each graphs show how endocast shape varies along PC1.

The graphs above show how the nine fossils vary within and between datasets. The 3D landmarks used to measure endocast size and shape return similar overall sizes for each specimen (left graphs). There are differences in the relative positions of a few specimens (ER 3883 vs. WT 15000 and ER 3733 vs. Sambungmacan 3), but these discrepancies are small probably mostly within the range of uncertainty for individual fossil reconstructions.

The effects of different reconstructions on endocranial shape, on the other hand, are a bit more profound. In each dataset, the main dimension of variation (PC1, the horizontal axis in the center and right graphs) captures similar patterns of shape variability. In both samples, fossils with a longer and lower endocast fall on the left side of the graph, while rounder endocasts fall on the right side of the graph. But where individual specimens plot in the graphs (i.e., their overall endocast shape) differs notably between datasets. For example, the “Mojokerto” infant Homo erectus has the roundest shape while WT 15000 has one of the ‘flatter’ shapes in the Neubauer sample, whereas WT 15000 is the ‘roundest’ in the PZ sample.

So, different decisions in the reconstruction process can lead to different overall patterns of shape variation within a sample. This can have important impacts on subsequent analyses. For instance, we often want to assess how similar or different fossil specimens are to one another, looking for clusters of similar shapes that might tell us something meaningful about the biology we’re hoping to capture. The two datasets, however, produce slightly different clusters:

Cluster dendrograms based on shape variation within the two endocast datasets. Fossil specimens are color-coded to highlight difference between the two trees.

Both datasets produce clusters with early H. erectus specimens ER 3733 and ER 3883, and later Indonesian H. erectus fossils Sambungmacan 3 and Solo XI. But the similarities among other fossils differ between the two samples, in ways that could lead to different biological interpretations. One might interpret the Neubauer clustering to mean that the Mojokerto infant differs from the rest since it hadn’t completed brain growth, while the other clusters could potentially reflect evolutionary changes both from early Homo (ER 1813 and 1470) to H. erectus and over time within H. erectus. In contrast, the PZ tree could be interpreted to mean that the adolescent WT 15000 had an ‘underdeveloped’ brain like Mojokerto, while the different clusters of ER 1813 and ER 1470 could reflect a more convoluted pattern of brain evolution from early Homo to H. erectus.

Of course, principal components and cluster analyses are statistical approaches for exploring variation within a sample, and they don’t necessarily map onto meaningful phenomena. Biological patterns could ‘override’ variation due to differences in reconstruction. For instance, endocast shape variation due to growth and development could produce marked, characteristic differences between infants and adults. Indeed, if we compare endocast shape of the infant Mojokerto to the average adult H. erectus, both datasets yield fairly similar results:

Endocast shape differences between the Mojokerto infant and adult H. erectus. In both rows, the left side shows Mojokerto (blue/red) aligned to the adult (gray); note that they are scaled to the same size. The center shows where Mojokerto (blue/red) or the adult (yellow) projects more than the other. On the right, lines between points show how corresponding landmarks differ between Mojokerto and the average adult in each sample.

In addition, if groups/species have distinct endocast shapes, such differences could still be captured by studies using different fossil reconstructions. For instance, both studies produce similar results when comparing early Homo specimens ER 1813 and ER 1470, and comparing adult H. erectus and modern humans:

So, getting back to our original question: do different virtual reconstructions produce different results? Yes and no. Yes, there will be observable differences between studies, and these could be subtle (e.g., brain sizes estimates) or more severe (e.g., clustering patterns within a fossil sample). But as Melvin Moss reminded us, we must keep in mind the underlying biological questions when interpreting statistical patterns. Ultimately, fossil preservation is probably the greatest source of variability between different studies. Many researchers will bring similar levels of expertise and similar analytical toolkits to study fossils, but more fragmentary specimens will have greater uncertainty in how to to reconstruct them. In contrast to the different growth patterns identified in the Neandertal studies mentioned at the beginning of this post, the consistent ‘growth’ signal in H. erectus fossils may be due to the fact that the Mojokerto infant is better preserved and required less reconstruction than Neandertal neonates.

As Gunz and colleagues (2009) stressed when they laid out “principles for the virtual reconstruction of hominin crania,” these powerful virtual methods can never produce “the” single correct reconstruction of a fossil. Rather, researchers must acknowledge and remain cognizant of all the decisions and assumptions that go into their reconstructions, and attempt to produce multiple reconstructions reflecting these varied uncertainties. Making data openly available further allows other researchers to assess how conclusions were reached, and to add new fossils to existing datasets.

REFERENCES

Baab, K. L. (2025). A fresh look at an iconic human fossil: Virtual reconstruction of the KNM-WT 15000 cranium. Journal of Human Evolution, 202, 103664. https://doi.org/10.1016/j.jhevol.2025.103664

Gunz, P., Mitteroecker, P., Neubauer, S., Weber, G. W., & Bookstein, F. L. (2009). Principles for the virtual reconstruction of hominin crania. Journal of Human Evolution, 57(1), 48–62. https://doi.org/10.1016/j.jhevol.2009.04.004

Neubauer, S., Gunz, P., Leakey, L., Leakey, M., Hublin, J.-J., & Spoor, F. (2018). Reconstruction, endocranial form and taxonomic affinity of the early Homo calvaria KNM-ER 42700. Journal of Human Evolution, 121, 25–39. https://doi.org/10.1016/j.jhevol.2018.04.005

Ponce De León, M. S., Bienvenu, T., Marom, A., Engel, S., Tafforeau, P., Alatorre Warren, J. L., Lordkipanidze, D., Kurniawan, I., Murti, D. B., Suriyanto, R. A., Koesbardiati, T., & Zollikofer, C. P. E. (2021). The primitive brain of early Homo. Science, 372(6538), 165–171. https://doi.org/10.1126/science.aaz0032

Zollikofer, C. P. E., Bienvenu, T., Beyene, Y., Suwa, G., Asfaw, B., White, T. D., & Ponce De León, M. S. (2022). Endocranial ontogeny and evolution in early Homo sapiens: The evidence from Herto, Ethiopia. Proceedings of the National Academy of Sciences, 119(32), e2123553119. https://doi.org/10.1073/pnas.2123553119

Zollikofer, C. P. E., Ponce de León, M. S., Martin, R. D., & Stucki, P. (1995). Neanderthal computer skulls. Nature, 375(6529), 283–285. https://doi.org/10.1038/375283b0

Picking the brain of Homo naledi

A question that has come up a lot on the blog is what endocast fossils can tell us about the brains and behaviors of long-extinct animals. This question is especially salient for Homo naledi, an unexpected human cousin that lived in South Africa at the same time as the earliest modern-like humans around 300,000 years ago. Brain size in H. naledi ranged from 460–610 ml, similar to Australopithecus and the earliest Homo over 1.5 million years ago and less than half the size of other, contemporaneous fossil humans. Despite its small brain size, the frontal lobe of H. naledi seems to have been more similar to humans than australopiths, including in the area associated with speech.

To learn more about the brain of Homo naledi, I teamed up with Shawn Hurst and John Hawks to virtually reconstruct the endocast from the most complete skull and skeleton of the species, nicknamed Neo (which means “gift” in the SeSotho language). We have a paper about it coming out soon in the journal Brain Structure and Function, where we show that Neo’s endocast shape is fairly distinct among Pleistocene hominins. We go on to discuss some implications and limitations of our results for understanding the brain and behavior of H. naledi and other hominins. I’ll write more about it when the paper is actually in press, but in the meantime you can get a sneak preview by checking the data and running the analyses for yourself.

The original Neo cranium (left), its preserved endocranial surface (center), and virtually reconstructed endocast shape (right)

I published the landmark data and R code from our study in the open access repository Zenodo (here). I built off the endocast landmark data that Simon Neubauer and colleagues (2018) made available from their study of the KNM-ER 42700 fossil. I created a landmark template from that dataset and applied it to Neo and Australopithecus africanus cranium Sts 5, and then used geometric morphometric methods to reconstruct the missing regions of Neo and compare it to the other hominins. The accompanying R code walks you through inputting these data, estimating missing landmark positions, and comparing endocast shapes of humans and fossil hominins.

Endocast shapes of Neo and a modern human, as viewed from the left, front, and bottom. Endocasts are scaled by size and aligned by the cranial base.

Hopefully these data and code will let others build off our results, add more fossils to the mix, and generate more insights about how the human brain has changed over the past several millions of years.

Krapina endocast update (open data & code)

In the Summer of 2019 I worked with some great Vassar undergrads to make virtual endocasts and generate new brain size estimates for the Neandertals from the site of Krapina, which we then published in 2021 (discussed in this blog post). The virtual approach to endocast reconstruction uses 3D landmark-based geometric morphometrics methods, and so in the spirit of open science we also published all the landmark data used for the study (as well as a bunch of other fossil human brain size estimates) in the Zenodo repository (here).

Neandertal fossil specimens Krapina 3 (purple/green) and Krapina 6 (yellow/red) with preserved landmarks and virtually reconstructed endocasts.

Something major and global happened around that time — who can even remember what? — and so I never got around to posting R code to accompany the study. So, I’ve finally gotten around to adding some very basic code to the Zenodo entry (better late than never). The code simply reads in the landmarks, estimates missing data for fossils, and does some very basic shape analysis and visualization. It’s doesn’t get into all the nuts and bolts of our study, but it should be enough to help folks check our data or get started with shape analysis in R.

R code includes ways to visualize the landmark data. Left: Principal components analysis graph of endocast shape for humans (red) and Neandertals (blue). Right: Triangle meshes of the average human and Neandertal endocast shapes, viewed from the right, bottom, and back.

Original article
Cofran Z, Boone M, Petticord M. 2021. Virtually estimated endocranial volumes of the Krapina Neandertals. American Journal of Physical Anthropology 174: 117–128. (link)