Early Pleistocene enamel proteome from Dmanisi resolves Stephanorhinus phylogeny

Cappellini, Enrico; Welker, Frido; Pandolfi, Luca; Ramos-Madrigal, Jazmín; Samodova, Diana; Rüther, Patrick L.; Fotakis, Anna K.; Lyon, David; Moreno-Mayar, J. Víctor; Bukhsianidze, Maia; Rakownikow Jersie-Christensen, Rosa; Mackie, Meaghan; Ginolhac, Aurélien; Ferring, Reid; Tappen, Martha; Palkopoulou, Eleftheria; Dickinson, Marc R.; Stafford, Thomas W.; Chan, Yvonne L.; Götherström, Anders; Nathan, Senthilvel K. S. S.; Heintzman, Peter D.; Kapp, Joshua D.; Kirillova, Irina; Moodley, Yoshan; Agusti, Jordi; Kahlke, Ralf-Dietrich; Kiladze, Gocha; Martínez-Navarro, Bienvenido; Liu, Shanlin; Sandoval Velasco, Marcela; Sinding, Mikkel-Holger S.; Kelstrup, Christian D.; Allentoft, Morten E.; Orlando, Ludovic; Penkman, Kirsty; Shapiro, Beth; Rook, Lorenzo; Dalén, Love; Gilbert, M. Thomas P.; Olsen, Jesper V.; Lordkipanidze, David; Willerslev, Eske

doi:10.1038/s41586-019-1555-y

Letter
Published: 11 September 2019

Early Pleistocene enamel proteome from Dmanisi resolves Stephanorhinus phylogeny

Nature volume 574, pages 103–107 (2019)Cite this article

14k Accesses
123 Citations
551 Altmetric
Metrics details

Subjects

Abstract

The sequencing of ancient DNA has enabled the reconstruction of speciation, migration and admixture events for extinct taxa¹. However, the irreversible post-mortem degradation² of ancient DNA has so far limited its recovery—outside permafrost areas—to specimens that are not older than approximately 0.5 million years (Myr)³. By contrast, tandem mass spectrometry has enabled the sequencing of approximately 1.5-Myr-old collagen type I⁴, and suggested the presence of protein residues in fossils of the Cretaceous period⁵—although with limited phylogenetic use⁶. In the absence of molecular evidence, the speciation of several extinct species of the Early and Middle Pleistocene epoch remains contentious. Here we address the phylogenetic relationships of the Eurasian Rhinocerotidae of the Pleistocene epoch^7,8,9, using the proteome of dental enamel from a Stephanorhinus tooth that is approximately 1.77-Myr old, recovered from the archaeological site of Dmanisi (South Caucasus, Georgia)¹⁰. Molecular phylogenetic analyses place this Stephanorhinus as a sister group to the clade formed by the woolly rhinoceros (Coelodonta antiquitatis) and Merck’s rhinoceros (Stephanorhinus kirchbergensis). We show that Coelodonta evolved from an early Stephanorhinus lineage, and that this latter genus includes at least two distinct evolutionary lines. The genus Stephanorhinus is therefore currently paraphyletic, and its systematic revision is needed. We demonstrate that sequencing the proteome of Early Pleistocene dental enamel overcomes the limitations of phylogenetic inference based on ancient collagen or DNA. Our approach also provides additional information about the sex and taxonomic assignment of other specimens from Dmanisi. Our findings reveal that proteomic investigation of ancient dental enamel—which is the hardest tissue in vertebrates¹¹, and is highly abundant in the fossil record—can push the reconstruction of molecular evolution further back into the Early Pleistocene epoch, beyond the currently known limits of ancient DNA preservation.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on SpringerLink
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Fig. 1: Location of Dmanisi, stratigraphy, and specimen Dm.5/157–16635.**

**Fig. 2: Degradation of the enamel proteome.**

**Fig. 3: Sequence motif analysis of phosphorylation sites in the proteome of ancient enamel.**

**Fig. 4: Phylogenetic relationships between the comparative dataset of enamel proteomes and specimen Dm.5/157–16635.**

The dental proteome of Homo antecessor

Article 01 April 2020

Deep-time phylogenetic inference by paleoproteomic analysis of dental enamel

Article 26 April 2024

First sequencing of ancient coral skeletal proteins

Article Open access 10 November 2020

Data availability

All of the mass spectrometry proteomics data have been deposited in the ProteomeXchange Consortium (http://proteomecentral.proteomexchange.org) via the PRIDE partner repository with the dataset identifier PXD011008. Genomic BAM files used for Rhinocerotidae protein sequence translation and protein sequence alignments used for phylogenetic reconstruction are available on Figshare (https://doi.org/10.6084/m9.figshare.7212746).

Code availability

The in-house R script used to align the peptide sequences confidently identified by the PEAKS searches is available to everyone upon request to the corresponding authors.

References

Cappellini, E. et al. Ancient biomolecules and evolutionary inference. Annu. Rev. Biochem. 87, 1029–1060 (2018).
Article CAS PubMed Google Scholar
Dabney, J., Meyer, M. & Pääbo, S. Ancient DNA damage. Cold Spring Harb. Perspect. Biol. 5, a012567 (2013).
Article CAS PubMed PubMed Central Google Scholar
Meyer, M. et al. Nuclear DNA sequences from the Middle Pleistocene Sima de los Huesos hominins. Nature 531, 504–507 (2016).
Article CAS ADS PubMed Google Scholar
Wadsworth, C. & Buckley, M. Proteome degradation in fossils: investigating the longevity of protein survival in ancient bone. Rapid Commun. Mass Spectrom. 28, 605–615 (2014).
Article CAS ADS PubMed PubMed Central Google Scholar
Schweitzer, M. H. et al. Analyses of soft tissue from Tyrannosaurus rex suggest the presence of protein. Science 316, 277–280 (2007).
Article CAS ADS PubMed Google Scholar
Schroeter, E. R. et al. Expansion for the Brachylophosaurus canadensis collagen I sequence and additional evidence of the preservation of Cretaceous protein. J. Proteome Res. 16, 920–932 (2017).
Article CAS PubMed PubMed Central Google Scholar
Willerslev, E. et al. Analysis of complete mitochondrial genomes from extinct and extant rhinoceroses reveals lack of phylogenetic resolution. BMC Evol. Biol. 9, 95 (2009).
Article CAS PubMed PubMed Central Google Scholar
Welker, F. et al. Middle Pleistocene protein sequences from the rhinoceros genus Stephanorhinus and the phylogeny of extant and extinct Middle/Late Pleistocene Rhinocerotidae. PeerJ 5, e3033 (2017).
Article CAS PubMed PubMed Central Google Scholar
Kirillova, I. et al. Discovery of the skull of Stephanorhinus kirchbergensis (Jäger, 1839) above the Arctic Circle. Quat. Res. 88, 537–550 (2017).
Article CAS Google Scholar
Lordkipanidze, D. et al. A complete skull from Dmanisi, Georgia, and the evolutionary biology of early Homo. Science 342, 326–331 (2013).
Article CAS ADS PubMed Google Scholar
Eastoe, J. E. Organic matrix of tooth enamel. Nature 187, 411–412 (1960).
Article CAS ADS PubMed Google Scholar
Orlando, L. et al. Recalibrating Equus evolution using the genome sequence of an early Middle Pleistocene horse. Nature 499, 74–78 (2013).
Article CAS ADS PubMed Google Scholar
Demarchi, B. et al. Protein sequences bound to mineral surfaces persist into deep time. eLife 5, e17092 (2016).
Article PubMed PubMed Central Google Scholar
Welker, F. et al. Ancient proteins resolve the evolutionary history of Darwin’s South American ungulates. Nature 522, 81–84 (2015).
Article CAS ADS PubMed Google Scholar
Chen, F. et al. A late Middle Pleistocene Denisovan mandible from the Tibetan Plateau. Nature 569, 409–412 (2019).
Article CAS ADS PubMed Google Scholar
Nei, M. Molecular Evolutionary Genetics Vol. 75, 39–63 (Columbia Univ. Press, 1987).
Buckley, M., Warwood, S., van Dongen, B., Kitchener, A. C. & Manning, P. L. A fossil protein chimera; difficulties in discriminating dinosaur peptide sequences from modern cross-contamination. Proc. R. Soc. Lond. B 284, 20170544 (2017).
Article CAS Google Scholar
Gabunia, L. et al. Earliest Pleistocene hominid cranial remains from Dmanisi, Republic of Georgia: taxonomy, geological setting, and age. Science 288, 1019–1025 (2000).
Google Scholar
Ferring, R. et al. Earliest human occupations at Dmanisi (Georgian Caucasus) dated to 1.85–1.78 Ma. Proc. Natl Acad. Sci. USA 108, 10432–10436 (2011).
Article CAS ADS PubMed PubMed Central Google Scholar
Castiblanco, G. A. et al. Identification of proteins from human permanent erupted enamel. Eur. J. Oral Sci. 123, 390–395 (2015).
Article CAS PubMed Google Scholar
Stewart, N. A. et al. The identification of peptides by nanoLC-MS/MS from human surface tooth enamel following a simple acid etch extraction. RSC Advances 6, 61673–61679 (2016).
Article CAS Google Scholar
van Doorn, N. L., Wilson, J., Hollund, H., Soressi, M. & Collins, M. J. Site-specific deamidation of glutamine: a new marker of bone collagen deterioration. Rapid Commun. Mass Spectrom. 26, 2319–2327 (2012).
Article ADS CAS PubMed Google Scholar
Catak, S., Monard, G., Aviyente, V. & Ruiz-López, M. F. Computational study on nonenzymatic peptide bond cleavage at asparagine and aspartic acid. J. Phys. Chem. A 112, 8752–8761 (2008).
Article CAS PubMed Google Scholar
Hunter, T. Why nature chose phosphate to modify proteins. Phil. Trans. R. Soc. Lond. B 367, 2513–2516 (2012).
Article CAS Google Scholar
Hu, J. C. C., Yamakoshi, Y., Yamakoshi, F., Krebsbach, P. H. & Simmer, J. P. Proteomics and genetics of dental enamel. Cells Tissues Organs 181, 219–231 (2005).
Article CAS PubMed Google Scholar
Tagliabracci, V. S. et al. Secreted kinase phosphorylates extracellular proteins that regulate biomineralization. Science 336, 1150–1153 (2012).
Article CAS ADS PubMed PubMed Central Google Scholar
Cleland, T. P. Solid digestion of demineralized bone as a method to access potentially insoluble proteins and post-translational modifications. J. Proteome Res. 17, 536–542 (2018).
Article CAS PubMed Google Scholar
Antoine, P.-O. et al. A revision of Aceratherium blanfordi Lydekker, 1884 (Mammalia: Rhinocerotidae) from the Early Miocene of Pakistan: postcranials as a key. Zool. J. Linn. Soc. 160, 139–194 (2010).
Article Google Scholar
Steiner, C. C. & Ryder, O. A. Molecular phylogeny and evolution of the Perissodactyla. Zool. J. Linn. Soc. 163, 1289–1303 (2011).
Article Google Scholar
Hobolth, A., Dutheil, J. Y., Hawks, J., Schierup, M. H. & Mailund, T. Incomplete lineage sorting patterns among human, chimpanzee, and orangutan suggest recent orangutan speciation and widespread selection. Genome Res. 21, 349–356 (2011).
Article CAS PubMed PubMed Central Google Scholar
Rieseberg, L. H. Evolution: replacing genes and traits through hybridization. Curr. Biol. 19, R119–R122 (2009).
Article CAS PubMed Google Scholar
Guérin, C. Les Rhinocéros (Mammalia, Perissodactyla) du Miocène Terminal au Pleistocène Supérieur en Europe occidentale, Comparaison avec les Espèces Actuelles (Documents du Laboratoire de Geologie de la Faculte des Sciences de Lyon, volume 79) (Univ. Claude-Bernard, 1980).
Deng, T. et al. Out of Tibet: Pliocene woolly rhino suggests high-plateau origin of Ice Age megaherbivores. Science 333, 1285–1288 (2011).
Article CAS ADS PubMed Google Scholar
Orlando, L. et al. Ancient DNA analysis reveals woolly rhino evolutionary relationships. Mol. Phylogenet. Evol. 28, 485–499 (2003).
Article CAS PubMed Google Scholar
Yuan, J. et al. Ancient DNA sequences from Coelodonta antiquitatis in China reveal its divergence and phylogeny. Sci. China Earth Sci. 57, 388–396 (2014).
Article CAS Google Scholar
Penkman, K. E. H., Kaufman, D. S., Maddy, D. & Collins, M. J. Closed-system behaviour of the intra-crystalline fraction of amino acids in mollusc shells. Quat. Geochronol. 3, 2–25 (2008).
Article CAS PubMed PubMed Central Google Scholar
Hendy, J. et al. A guide to ancient protein studies. Nat. Ecol. Evol. 2, 791–799 (2018).
Article PubMed Google Scholar
Wiśniewski, J. R., Zougman, A., Nagaraj, N. & Mann, M. Universal sample preparation method for proteome analysis. Nat. Methods 6, 359–362 (2009).
Article CAS PubMed Google Scholar
Cappellini, E. et al. Resolution of the type material of the Asian elephant, Elephas maximus Linnaeus, 1758 (Proboscidea, Elephantidae). Zool. J. Linn. Soc. 170, 222–232 (2014).
Google Scholar
Kulak, N. A., Pichler, G., Paron, I., Nagaraj, N. & Mann, M. Minimal, encapsulated proteomic-sample processing applied to copy-number estimation in eukaryotic cells. Nat. Methods 11, 319–324 (2014).
Article CAS PubMed Google Scholar
Mackie, M. et al. Palaeoproteomic profiling of conservation layers on a 14th century Italian wall painting. Angew. Chem. Int. Edn 57, 7369–7374 (2018).
Article CAS PubMed PubMed Central Google Scholar
Cappellini, E. et al. Proteomic analysis of a Pleistocene mammoth femur reveals more than one hundred ancient bone proteins. J. Proteome Res. 11, 917–926 (2012).
Article CAS PubMed Google Scholar
Cox, J. & Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 26, 1367–1372 (2008).
Article CAS PubMed Google Scholar
Zhang, J. et al. PEAKS DB: de novo sequencing assisted database search for sensitive and accurate peptide identification. Mol. Cell. Proteomics 11, M111.010587 (2012).
Article CAS PubMed Google Scholar
The UniProt Consortium. UniProt: the universal protein knowledgebase. Nucleic Acids Res. 45, D158–D169 (2017).
Article CAS Google Scholar
O’Leary, N. A. et al. Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 44, D733–D745 (2016).
Article CAS PubMed Google Scholar
Welker, F. et al. Palaeoproteomic evidence identifies archaic hominins associated with the Châtelperronian at the Grotte du Renne. Proc. Natl Acad. Sci. USA 113, 11162–11167 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kearse, M. et al. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28, 1647–1649 (2012).
Article PubMed PubMed Central Google Scholar
Gabriels, R., Martens, L. & Degroeve, S. Updated MS²PIP web server delivers fast and accurate MS² peak intensity prediction for multiple fragmentation methods, instruments and labeling techniques. Nucleic Acids Res. 47, W295–W299 (2019).
Article CAS PubMed PubMed Central Google Scholar
Tyanova, S., Temu, T. & Cox, J. The MaxQuant computational platform for mass spectrometry-based shotgun proteomics. Nat. Protocols 11, 2301–2319 (2016).
Article CAS PubMed Google Scholar
Colaert, N., Helsens, K., Martens, L., Vandekerckhove, J. & Gevaert, K. Improved visualization of protein consensus sequences by iceLogo. Nat. Methods 6, 786–787 (2009).
Article CAS PubMed Google Scholar
Korneliussen, T. S., Albrechtsen, A. & Nielsen, R. ANGSD: analysis of next generation sequencing data. BMC Bioinformatics 15, 356 (2014).
Article PubMed PubMed Central Google Scholar
Briggs, A. W. et al. Removal of deaminated cytosines and detection of in vivo methylation in ancient DNA. Nucleic Acids Res. 38, e87 (2010).
Article CAS PubMed Google Scholar
Altschul, S. F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (1997).
Article CAS PubMed PubMed Central Google Scholar
Sea Urchin Genome Sequencing Consortium. The genome of the sea urchin Strongylocentrotus purpuratus. Science 314, 941–952 (2006).
Article ADS PubMed Central Google Scholar
Katoh, K. & Frith, M. C. Adding unaligned sequences into an existing alignment using MAFFT and LAST. Bioinformatics 28, 3144–3146 (2012).
Article CAS PubMed PubMed Central Google Scholar
Schliep, K. P. phangorn: phylogenetic analysis in R. Bioinformatics 27, 592–593 (2011).
Article CAS PubMed Google Scholar
Guindon, S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321 (2010).
Article CAS PubMed Google Scholar
Ronquist, F. et al. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst. Biol. 61, 539–542 (2012).
Article PubMed PubMed Central Google Scholar
Rohland, N. & Hofreiter, M. Comparison and optimization of ancient DNA extraction. Biotechniques 42, 343–352 (2007).
Article CAS PubMed Google Scholar
Meyer, M. & Kircher, M. Illumina sequencing library preparation for highly multiplexed target capture and sequencing. Cold Spring Harb. Protoc. 2010, pdb.prot5448 (2010).
Article PubMed Google Scholar
Schubert, M. et al. Characterization of ancient and modern genomes by SNP detection and phylogenomic and metagenomic analysis using PALEOMIX. Nat. Protocols 9, 1056–1082 (2014).
Article CAS PubMed Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
Dickinson, M. R., Lister, A. M. & Penkman, K. E. H. A new method for enamel amino acid racemization dating: a closed system approach. Quat. Geochronol. 50, 29–46 (2019).
Article Google Scholar

Download references

Acknowledgements

E.C. and F.W. are supported by the VILLUM FONDEN (grant number 17649) and by the European Commission through a Marie Skłodowska Curie (MSC) Individual Fellowship (grant number 795569). E.W. is supported by the Lundbeck Foundation, the Danish National Research Foundation, the Novo Nordisk Foundation, the Carlsberg Foundation, KU2016 and the Wellcome Trust. E.C., C.K., J.V.O., P.R. and D.S. are supported by the European Commission through the MSC European Training Network ‘TEMPERA’ (grant number 722606). M.M. and R.R.J.-C. are supported by the University of Copenhagen KU2016 (UCPH Excellence Programme) grant. M.M. is also supported by the Danish National Research Foundation award PROTEIOS (DNRF128). Work at the Novo Nordisk Foundation Center for Protein Research is funded in part by a donation from the Novo Nordisk Foundation (grant number NNF14CC0001). M.R.D. is supported by a PhD DTA studentship from NERC and the Natural History Museum (NE/K500987/1 & NE/L501761/1). K.P. is supported by the Leverhulme Trust (PLP -2012-116). L.R. and L.P. are supported by the Italian Ministry for Foreign Affairs (MAECI, DGSP-VI). L.P. was also supported by the EU-SYNTHESYS project (AT-TAF-2550, DE-TAF-3049, GB-TAF-2825, HU-TAF-3593 and ES-TAF-2997) funded by the European Commission. L.D. is supported by the Swedish Research Council (grant number 2017-04647) and FORMAS (grant number 2015-676). M.T.P.G. is supported by ERC Consolidator Grant ‘Extinction genomics’ (grant number 681396). L.O. is supported by the ERC Consolidator Grant ‘PEGASUS’ (grant agreement number 681605). B.S., J.K. and P.D.H. are supported by the Gordon and Betty Moore foundation. B.M.-N. is supported by the Spanish Ministry of Sciences (grant number CGL2016-80975-P) and the Generalitat de Catalunya, Spain (grant number 2017SGR 859). J.A. is supported by the Spanish Ministry of Sciences (grant number CGL2016-80000-P). R.F. is supported by National Science Foundation (grant number 1025245). The ancient DNA analysis was carried out using the facilities of the University of Luxembourg, the Swedish Museum of Natural History and UC Santa Cruz. We acknowledge support from the Science for Life Laboratory, the National Genomics Infrastructure (Sweden) and UPPMAX for providing assistance with massive parallel sequencing and computational infrastructure. Research at Dmanisi is supported by the John Templeton Foundation (grant number 52935), and the Shota Rustaveli Science Foundation (grant number 18-27262). We thank B. Triozzi and K. Murphy Gregersen for technical support.

Author information

Authors and Affiliations

Lundbeck Foundation GeoGenetics Centre, Globe Institute, University of Copenhagen, Copenhagen, Denmark
Enrico Cappellini, J. Víctor Moreno-Mayar, Morten E. Allentoft, Ludovic Orlando & Eske Willerslev
Evolutionary Genomics Section, Globe Institute, University of Copenhagen, Copenhagen, Denmark
Enrico Cappellini, Frido Welker, Jazmín Ramos-Madrigal, Anna K. Fotakis, Meaghan Mackie, Shanlin Liu, Marcela Sandoval Velasco, Mikkel-Holger S. Sinding & M. Thomas P. Gilbert
Department of Human Evolution, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
Frido Welker
Dipartimento di Scienze della Terra, Università degli Studi di Firenze, Florence, Italy
Luca Pandolfi & Lorenzo Rook
Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
Diana Samodova, Patrick L. Rüther, David Lyon, Rosa Rakownikow Jersie-Christensen, Meaghan Mackie, Christian D. Kelstrup & Jesper V. Olsen
Georgian National Museum, Tbilisi, Georgia
Maia Bukhsianidze & David Lordkipanidze
Life Sciences Research Unit, University of Luxembourg, Belvaux, Luxembourg
Aurélien Ginolhac
Department of Geography and Environment, University of North Texas, Denton, TX, USA
Reid Ferring
Department of Anthropology, University of Minnesota, Minneapolis, MN, USA
Martha Tappen
Department of Genetics, Harvard Medical School, Cambridge, MA, USA
Eleftheria Palkopoulou
Department of Chemistry, University of York, York, UK
Marc R. Dickinson & Kirsty Penkman
Stafford Research, Lafayette, CO, USA
Thomas W. Stafford Jr
Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden
Yvonne L. Chan & Love Dalén
Department of Archaeology and Classical Studies, Stockholm University, Stockholm, Sweden
Anders Götherström
Sabah Wildlife Department, Kota Kinabalu, Malaysia
Senthilvel K. S. S. Nathan
Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, CA, USA
Peter D. Heintzman, Joshua D. Kapp & Beth Shapiro
Tromsø University Museum, The Arctic University of Norway (UiT), Tromsø, Norway
Peter D. Heintzman
Ice Age Museum, National Alliance of Shidlovskiy ‘Ice Age’, Moscow, Russia
Irina Kirillova
Department of Zoology, University of Venda, Thohoyandou, South Africa
Yoshan Moodley
Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
Jordi Agusti & Bienvenido Martínez-Navarro
Institut Català de Paleoecologia Humana i Evolució Social, Universitat Rovira i Virgili, Tarragona, Spain
Jordi Agusti & Bienvenido Martínez-Navarro
Senckenberg Research Station of Quaternary Palaeontology, Weimar, Germany
Ralf-Dietrich Kahlke
Geology Department, Tbilisi State University, Tbilisi, Georgia
Gocha Kiladze
Departament d’Història i Geografia, Universitat Rovira i Virgili, Tarragona, Spain
Bienvenido Martínez-Navarro
BGI Shenzhen, Shenzen, China
Shanlin Liu
Greenland Institute of Natural Resources, Nuuk, Greenland
Mikkel-Holger S. Sinding
Laboratoire d’Anthropobiologie Moléculaire et d’Imagerie de Synthèse, Université Paul Sabatier, Toulouse, France
Ludovic Orlando
Howard Hughes Medical Institute, University of California Santa Cruz, Santa Cruz, CA, USA
Beth Shapiro
University Museum, Norwegian University of Science and Technology, Trondheim, Norway
M. Thomas P. Gilbert
Department of Zoology, University of Cambridge, Cambridge, UK
Eske Willerslev
Wellcome Trust Sanger Institute, Hinxton, UK
Eske Willerslev
Danish Institute for Advanced Study, University of Southern Denmark, Odense, Denmark
Eske Willerslev

Authors

Enrico Cappellini
View author publications
You can also search for this author in PubMed Google Scholar
Frido Welker
View author publications
You can also search for this author in PubMed Google Scholar
Luca Pandolfi
View author publications
You can also search for this author in PubMed Google Scholar
Jazmín Ramos-Madrigal
View author publications
You can also search for this author in PubMed Google Scholar
Diana Samodova
View author publications
You can also search for this author in PubMed Google Scholar
Patrick L. Rüther
View author publications
You can also search for this author in PubMed Google Scholar
Anna K. Fotakis
View author publications
You can also search for this author in PubMed Google Scholar
David Lyon
View author publications
You can also search for this author in PubMed Google Scholar
J. Víctor Moreno-Mayar
View author publications
You can also search for this author in PubMed Google Scholar
Maia Bukhsianidze
View author publications
You can also search for this author in PubMed Google Scholar
Rosa Rakownikow Jersie-Christensen
View author publications
You can also search for this author in PubMed Google Scholar
Meaghan Mackie
View author publications
You can also search for this author in PubMed Google Scholar
Aurélien Ginolhac
View author publications
You can also search for this author in PubMed Google Scholar
Reid Ferring
View author publications
You can also search for this author in PubMed Google Scholar
Martha Tappen
View author publications
You can also search for this author in PubMed Google Scholar
Eleftheria Palkopoulou
View author publications
You can also search for this author in PubMed Google Scholar
Marc R. Dickinson
View author publications
You can also search for this author in PubMed Google Scholar
Thomas W. Stafford Jr
View author publications
You can also search for this author in PubMed Google Scholar
Yvonne L. Chan
View author publications
You can also search for this author in PubMed Google Scholar
Anders Götherström
View author publications
You can also search for this author in PubMed Google Scholar
Senthilvel K. S. S. Nathan
View author publications
You can also search for this author in PubMed Google Scholar
Peter D. Heintzman
View author publications
You can also search for this author in PubMed Google Scholar
Joshua D. Kapp
View author publications
You can also search for this author in PubMed Google Scholar
Irina Kirillova
View author publications
You can also search for this author in PubMed Google Scholar
Yoshan Moodley
View author publications
You can also search for this author in PubMed Google Scholar
Jordi Agusti
View author publications
You can also search for this author in PubMed Google Scholar
Ralf-Dietrich Kahlke
View author publications
You can also search for this author in PubMed Google Scholar
Gocha Kiladze
View author publications
You can also search for this author in PubMed Google Scholar
Bienvenido Martínez-Navarro
View author publications
You can also search for this author in PubMed Google Scholar
Shanlin Liu
View author publications
You can also search for this author in PubMed Google Scholar
Marcela Sandoval Velasco
View author publications
You can also search for this author in PubMed Google Scholar
Mikkel-Holger S. Sinding
View author publications
You can also search for this author in PubMed Google Scholar
Christian D. Kelstrup
View author publications
You can also search for this author in PubMed Google Scholar
Morten E. Allentoft
View author publications
You can also search for this author in PubMed Google Scholar
Ludovic Orlando
View author publications
You can also search for this author in PubMed Google Scholar
Kirsty Penkman
View author publications
You can also search for this author in PubMed Google Scholar
Beth Shapiro
View author publications
You can also search for this author in PubMed Google Scholar
Lorenzo Rook
View author publications
You can also search for this author in PubMed Google Scholar
Love Dalén
View author publications
You can also search for this author in PubMed Google Scholar
M. Thomas P. Gilbert
View author publications
You can also search for this author in PubMed Google Scholar
Jesper V. Olsen
View author publications
You can also search for this author in PubMed Google Scholar
David Lordkipanidze
View author publications
You can also search for this author in PubMed Google Scholar
Eske Willerslev
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

E.C., D. Lordkipanidze and E.W. designed the study. A.K.F., M.M., R.R.J.-C., M.E.A., M.R.D., K.P. and E.C. performed laboratory experiments. M.B., M.T., R.F., E.P., T.W.S. Jr, Y.L.C., A. Götherström, S.K.S.S.N., P.D.H., J.D.K., I.K., Y.M., J.A., R.-D.K., G.K., B.M.-N., M.-H.S.S., S.L., M.S.V., B.S., L.D., M.T.P.G. and D. Lordkipanidze provided ancient samples or modern reference material. E.C., F.W., L.P., J.R.-M., D. Lyon, J.V.M.-M., D.S., C.D.K., A. Ginolhac, L.O., L.R., J.V.O., P.L.R., M.R.D. and K.P. performed analyses and data interpretation. E.C., F.W., J.R.-M., L.P. and E.W. wrote the manuscript with contributions from all authors.

Corresponding authors

Correspondence to Enrico Cappellini, Jesper V. Olsen or Eske Willerslev.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Peer review information Nature thanks Benedikt Kessler, Tina Warinner and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Extended data figures and tables

Extended Data Fig. 1 Generalized stratigraphic profiles for Dmanisi, indicating origins of the specimens.

a, Type section of the Dmanisi M5 excavation block. b, Stratigraphic profile of excavation area M6. M6 preserves a larger gully associated with the pipe-gully phase of stratigraphic–geomorphic development in stratum B1. The thickness of the stratum B1 gully fill extends to the basalt surface but includes ‘rip-ups’ of strata A1 and A2, showing that the deposits in stratum B1 post-date those of stratum A. c, Stratigraphic section of excavation area M17. Here, Stratum B1 was deposited after the erosion of stratum A deposits. The stratigraphic position of specimen Dm.5/157–16635 is highlighted with a red diamond. The Masavara basalt is about 50 cm below the base of the profile shown. d, Northern section of block 2. Following the collapse of a pipe and erosion to the basalt, the deeper part of this area was filled with local gully fill of strata B1x, B1y and B1z. Note the uniform burial of all stratum B1 deposits by strata B2, B3 and B4. The sampled specimens are indicated by the five-digit CGG numbers. Extended Data Table 1 provides both the CGG and GNM specimen numbers.

Extended Data Fig. 2 Proteome-sequence coverage for specimen Dm.5/157-16635.

a, c, e, g, i, j, Peptide–spectrum match (PSM) sequence coverage of the proteins AMBN (a), ENAM (c), AMELX (e), AMTN (g), MMP20 (i) and ALB (j). Annotations include ‘amino acid position, amino acid called in that position (number of PSMs and peptides covering that position)’ for the phylogenetically informative single-amino-acid polymorphisms within Rhinocerotidae. b, d, f, h, Frequency (per cent) of phosphorylated (green) and unphosphorylated (red) PSMs per amino acid position for AMBN (b), ENAM (d), AMELX (f) and AMTN (h). Numbers within the bars provide the PSM counts. k, Violin plot of distribution of PSM coverage for all covered sites (n = 693), and for sites of phylogenetic relevance (single-amino-acid polymorphisms, n = 30). The box plots define the range of the data, with whiskers extending to 1.5× interquartile range, boxes denoting the 25th and 75th percentiles and dots indicating the median. All panels are based only on MaxQuant search results. The Supplementary Data contains examples of MS/MS spectra, and fragment-ion series alignments for each of the marked single-amino-acid polymorphisms.

Source data

Extended Data Fig. 3 Peptide and fragment-ion coverage of AMELX isoform 1 and isoform 2 from specimen Dm.M6/7.II.296–16856.

Peptides specific to AMELX isoform 1 and isoform 2 appear in the top and bottom parts of the figure, respectively. No AMELX isoform 2 is currently reported in public databases for the Cervidae group. Accordingly, the AMELX-isoform-2-specific peptides were identified by MaxQuant spectral matching against bovine (Bos taurus) AMELX isoform 2 (UniProt accession number P02817-2). AMELX isoform 2 (also known as leucine-rich amelogenin peptide (LRAP)) is a naturally occurring isoform of AMELX from the translation product of an alternatively spliced transcript.

Extended Data Fig. 4 Amino acid racemization.

Extent of intra-crystalline racemization in enamel for the free amino acid (FAA, x axis) fraction and the total hydrolysable amino acids (THAA, y axis) fraction for four amino acids (Asp plus Asn (here denoted Asx), Glu plus Gln (here denoted Glx), Ala and Phe). Note the differences in axis scale. Intra-crystalline data from Proboscidea enamel from a range of sites in the UK⁶⁴ have been shown for comparison (grey crosses). Taxa from both Dmanisi and the UK exhibit a similar relationship between FAA and THAA racemization, and R² values have been calculated on the basis of a polynomial relationship (order = 2, all > 0.93).

Source data

Extended Data Fig. 5 Phosphorylation in the proteome of ancient enamel.

Annotated spectra including phosphorylated (here denoted ph) serine (S). a, Phosphorylation in the S-X-E motif of AMELX. b, Phosphorylation in the S-X-phosphorylated S motif of AMBN. Phosphorylation was independently observed in all three separate analyses of Dm.5/157–16635, including multiple spectra and peptides (Extended Data Fig. 2).

Extended Data Fig. 6 Phylogenetic relationships between the comparative reference dataset and specimen Dm.bXI–16857.

Consensus tree from Bayesian inference. The posterior probability of each bipartition is shown as a percentage to the left of each node.

Extended Data Fig. 7 AMELY-specific matches.

a, Specimen Dm.6/151.4.A4.12–16630. b, Specimen Dm.69/64.3.B1.53–16631. c, Specimen Dm.8/154.4.A4.22–16639. d, Specimen Dm.M6/7.II.296–16856. Note the presence of deamidated glutamine (deQ) and asparagine (deN), oxidated methionine (oxM) and phosphorylated serine (phS).

Extended Data Fig. 8 Effect of the missingness in the tree topology.

a, Maximum-likelihood phylogeny obtained using PhyML and the protein alignment that excludes Dm.5/157–16635. b, Topologies obtained from 100 random replicates of the woolly rhinoceros (C. antiquitatis). In each replicate, the number of missing sites was similar to that observed for the Dm.5/157–16635 specimen (72.4% missingness). The percentage shown for each topology indicates the number of replicates in which that particular topology was recovered. c, As in b, but for the Javan rhinoceros (R. sondaicus). d, As in b, but for the black rhinoceros (D. bicornis).

Extended Data Table 1 Genome and proteome survival in 23 specimens of fossil fauna from Dmanisi

Full size table

Extended Data Table 2 Proteome composition and coverage

Full size table

Supplementary information

Supplementary Information

Supplementary Materials, Methods and Results, with Figures and Tables. Detailed description, enriched with figures and tables, of: (i) the studied specimens, (ii) the experimental procedures used to generate the data, and (iii) the results, both positive and negative, supporting the conclusions reported in the main text.

Reporting Summary

Supplementary Data

Selection of automatically and manually annotated tandem MS/MS spectra, retrieved from Dmanisi ~1.77 Myr old specimens as well as synthetic peptides, supporting the identification of: (i) phylogenetically informative amino acid positions observed in specimen Dm.5/157-16635, and (ii) phosphorylated sites.

Source data

Source Data Fig. 2

Source Data Fig. 3

Source Data Extended Data Fig. 2

Source Data Extended Data Fig. 4

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cappellini, E., Welker, F., Pandolfi, L. et al. Early Pleistocene enamel proteome from Dmanisi resolves Stephanorhinus phylogeny. Nature 574, 103–107 (2019). https://doi.org/10.1038/s41586-019-1555-y

Download citation

Received: 16 October 2018
Accepted: 12 August 2019
Published: 11 September 2019
Issue Date: 03 October 2019
DOI: https://doi.org/10.1038/s41586-019-1555-y

This article is cited by

A label-free quantification method for assessing sex from modern and ancient bovine tooth enamel
- Paula Kotli
- David Morgenstern
- Elisabetta Boaretto
Scientific Reports (2024)
Paleoproteomics sheds light on million-year-old fossils
- Ryan Sinclair Paterson
- Palesa Petunia Madupe
- Enrico Cappellini
Nature Reviews Molecular Cell Biology (2024)
Deep-time phylogenetic inference by paleoproteomic analysis of dental enamel
- Alberto J. Taurozzi
- Patrick L. Rüther
- Enrico Cappellini
Nature Protocols (2024)
Palaeoproteomic identification of the original binder and modern contaminants in distemper paints from Uvdal stave church, Norway
- Zahra Haghighi
- Meaghan Mackie
- Enrico Cappellini
Scientific Reports (2024)
Bison sex matters: the potential of proteomic tooth enamel analysis for determination of ancient human subsistence strategies
- Natalia Berezina
- Rustam Ziganshin
- Alexandra Buzhilova
Archaeological and Anthropological Sciences (2024)

Subjects

Abstract

Access options

Similar content being viewed by others

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Extended data figures and tables

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links