Key Points
-
Technical advances in mass spectrometry and NMR spectroscopy enable genome-wide screens to be carried out for the association between genetic variants and hundreds of metabolic traits in a single experiment.
-
Metabolite concentrations are direct readouts of biological processes and can play the part of intermediate phenotypes, providing functional links between genetic variance and disease end points in genome-wide association studies (GWASs).
-
The first GWASs with metabolomics have already discovered many genetic variants in enzyme-, transporter- and other metabolism-related genes that induce major differences in the individual metabolic capabilities of the organism.
-
Knowledge of the genetic basis of human metabolic individuality holds the key to understanding the interactions of genetic, environmental and lifestyle factors in the aetiology of complex disorders.
-
We review emerging insights from recent GWASs with metabolomics and present design considerations for high-throughput metabolomics experiments with metabolic traits in epidemiological and clinical studies.
-
Using ratios between metabolite concentrations can drastically increase the power of a metabolomics study and can provide functional information on the perturbed underlying biochemical pathways.
-
Integration with other biochemical information, including data from other GWASs, can largely improve the value of the study.
-
Current challenges and future directions include the addition of new sample types (other than urine and blood), extension of the metabolite panels, standardization between platforms and the development of adapted statistical and data analysis tools.
Abstract
Many complex disorders are linked to metabolic phenotypes. Revealing genetic influences on metabolic phenotypes is key to a systems-wide understanding of their interactions with environmental and lifestyle factors in their aetiology, and we can now explore the genetics of large panels of metabolic traits by coupling genome-wide association studies and metabolomics. These genome-wide association studies are beginning to unravel the genetic contribution to human metabolic individuality and to demonstrate its relevance for biomedical and pharmaceutical research. Adopting the most appropriate study designs and analytical tools is paramount to further refining the genotype–phenotype map and eventually identifying the part played by genetic influences on metabolic phenotypes. We discuss such design considerations and applications in this Review.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
206,07 € per year
only 17,17 € per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout


Similar content being viewed by others
References
Garrod, A. E. The incidence of alkaptonuria a study in chemical individuality. Lancet 2, 1616–1620 (1902).
Mootha, V. K. & Hirschhorn, J. N. Inborn variation in metabolism. Nature Genet. 42, 97–98 (2010). This comment provides an independant view on the potential of genetic studies with metabolomics.
Garrod, A. E. Inborn Factors in Disease (Oxford Univ. Press, 1931). Archibald Garrod noted more than 80 years ago that “diathesis is nothing else but chemical individuality”.
Psychogios, N. et al. The human serum metabolome. PLoS ONE 6, e16957 (2011).
Link, E. et al. SLCO1B1 variants and statin-induced myopathy—a genomewide study. N. Engl. J. Med. 359, 789–799 (2008).
Dumont, J. et al. FADS1 genetic variability interacts with dietary α-linolenic acid intake to affect serum non-HDL-cholesterol concentrations in European adolescents. J. Nutr. 141, 1247–1253 (2011).
Lu, Y. et al. Dietary n-3 and n-6 polyunsaturated fatty acid intake interacts with FADS1 genetic variation to affect total and HDL-cholesterol concentrations in the Doetinchem Cohort Study. Am. J. Clin. Nutr. 92, 258–265 (2010).
Krug, S. et al. The dynamic range of the human metabolome revealed by challenges. FASEB J. 26, 2607–2619 (2012). This paper reports a series of controlled physiological challenges that may be used in future GWASs with metabolic traits.
Nicholson, G. et al. Human metabolic profiles are stably controlled by genetic and environmental variation. Mol. Syst. Biol. 7, 525 (2011). This paper addresses essential questions about the heritability of metabolic traits.
Kettunen, J. et al. Genome-wide association study identifies multiple loci influencing human serum metabolite levels. Nature Genet. 44, 269–276 (2012). This paper reports a large GWAS with NMR-derived metabolic traits.
Suhre, K. et al. Metabolic footprint of diabetes: a multiplatform metabolomics study in an epidemiological setting. PLoS ONE 5, e13953 (2010). This paper reports a pilot study on three different metabolomics platforms and provides practical insights into the possibilities and pitfalls of high-throughput metabolomics experiments.
Mittelstrass, K. et al. Discovery of sexual dimorphisms in metabolic and genetic biomarkers. PLoS Genet. 7, e1002215 (2011).
Nicholson, G. et al. A genome-wide metabolic QTL analysis in Europeans implicates two loci shaped by recent positive selection. PLoS Genet. 7, e1002270 (2011).
Suhre, K. et al. A genome-wide association study of metabolic traits in human urine. Nature Genet. 43, 565–569 (2011).
Suhre, K. et al. Human metabolic individuality in biomedical and pharmaceutical research. Nature 477, 54–60 (2011). This paper reports 37 loci of human metabolic individuality and provides examples for a wide range of biomedical and pharmaceutical applications.
Illig, T. et al. A genome-wide perspective of genetic variation in human metabolism. Nature Genet. 42, 137–141 (2010).
Kastenmuller, G. Romisch-Margl, W., Wagele, B., Altmaier, E. & Suhre, K. metaP-server: a web-based metabolomics data analysis tool. J. Biomed. Biotechnol. 2011, 839862 (2011).
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Abecasis, G. R., Cherny, S. S., Cookson, W. O. & Cardon, L. R. Merlin—rapid analysis of dense genetic maps using sparse gene flow trees. Nature Genet. 30, 97–101 (2002).
Marchini, J. & Howie, B. Genotype imputation for genome-wide association studies. Nature Rev. Genet. 11, 499–511 (2010).
Gieger, C. et al. Genetics meets metabolomics: a genome-wide association study of metabolite profiles in human serum. PLoS Genet. 4, e1000282 (2008). This paper reports the first GWAS with metabolic traits and with ratios between metabolite concentrations.
Tanaka, T. et al. Genome-wide association study of plasma polyunsaturated fatty acids in the InCHIANTI study. PLoS Genet. 5, e1000338 (2009).
Hicks, A. A. et al. Genetic determinants of circulating sphingolipid concentrations in European populations. PLoS Genet. 5, e1000672 (2009).
Demirkan, A. et al. Genome-wide association study identifies novel loci associated with circulating phospho- and sphingolipid concentrations. PLoS Genet. 8, e1002490 (2012).
Wishart, D. S. et al. HMDB: a knowledgebase for the human metabolome. Nucleic Acids Res. 37, D603–D610 (2009).
Kanehisa, M., Goto, S., Sato, Y., Furumichi, M. & Tanabe, M. KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 40, D109–D114 (2012).
Altmaier, E. et al. Bioinformatics analysis of targeted metabolomics--uncovering old and new tales of diabetic mice under medication. Endocrinology 149, 3478–3489 (2008).
Petersen, A. K. et al. On the hypothesis-free testing of metabolite ratios in genome-wide and metabolome-wide association studies. BMC Bioinformatics 13, 120 (2012). This paper provides a statistical underpinning to using ratios between metabolite concentrations in association studies.
Borenstein, M., Hedges, L. V., Higgins, J. P. T. & Rothstein, H. R. Introduction to Meta-Analysis (Wiley, 2009).
Krumiesk, J. et al. Mining the unknown: A systems approach to metabolite identification combining genetic and metabolic information. PLoS Genet. (in the press).
Cornelis, M. C. et al. Genome-wide meta-analysis identifies regions on 7p21 (AHR) and 15q24 (CYP1A2) as determinants of habitual caffeine consumption. PLoS Genet. 7, e1002033 (2011).
Zhang, A., Sun, H., Wang, P., Han, Y. & Wang, X. Recent and potential developments of biofluid analyses in metabolomics. J. Proteomics 75, 1079–1088 (2011).
Yu, Z. et al. Differences between human plasma and serum metabolite profiles. PLoS ONE 6, e21230 (2011).
Tukiainen, T. et al. Detailed metabolic and genetic characterization reveals new associations for 30 known lipid loci. Hum. Mol. Genet. 21, 1444–1455 (2012).
Inouye, M. et al. Metabonomic, transcriptomic, and genomic variation of a population cohort. Mol. Syst. Biol. 6, 441 (2010).
Kottgen, A. et al. New loci associated with kidney function and chronic kidney disease. Nature Genet. 42, 376–384 (2010).
Suhre, K. et al. Identification of a potential biomarker for FABP4 inhibition: the power of lipidomics in preclinical drug testing. J. Biomol. Screen 16, 467–475 (2011).
Dupuis, J. et al. New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk. Nature Genet. 42, 105–116 (2010).
Sanna, S. et al. Common variants in the SLCO1B3 locus are associated with bilirubin levels and unconjugated hyperbilirubinemia. Hum. Mol. Genet. 18, 2711–2718 (2009).
Johnson, A. D. et al. Genome-wide association meta-analysis for total serum bilirubin levels. Hum. Mol. Genet. 18, 2700–2710 (2009).
Kolz, M. et al. Meta-analysis of 28,141 individuals identifies common variants within five new loci that influence uric acid concentrations. PLoS Genet. 5, e1000504 (2009).
Zhai, G. et al. Eight common genetic variants associated with serum DHEAS levels suggest a key role in ageing mechanisms. PLoS Genet. 7, e1002025 (2011).
Teslovich, T. M. et al. Biological, clinical and population relevance of 95 loci for blood lipids. Nature 466, 707–713 (2010).
Kathiresan, S. et al. Six new loci associated with blood low-density lipoprotein cholesterol, high-density lipoprotein cholesterol or triglycerides in humans. Nature Genet. 40, 189–197 (2008).
Willer, C. J. et al. Newly identified loci that influence lipid concentrations and risk of coronary artery disease. Nature Genet. 40, 161–169 (2008).
Kathiresan, S. et al. Common variants at 30 loci contribute to polygenic dyslipidemia. Nature Genet. 41, 56–65 (2009).
Kronenberg, F. in Genetics Meets Metabolomics: from Experiment to Systems Biology (ed. Suhre, K.) 255–264 (Springer, 2012).
The Wellcome Trust Case Control Conortium. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447, 661–678 (2007).
Franke, A. et al. Genome-wide meta-analysis increases to 71 the number of confirmed Crohn's disease susceptibility loci. Nature Genet. 42, 1118–1125 (2010).
Kato, N. et al. Meta-analysis of genome-wide association studies identifies common variants associated with blood pressure variation in east Asians. Nature Genet. 43, 531–538 (2011).
Rothman, N. et al. A multi-stage genome-wide association study of bladder cancer identifies multiple susceptibility loci. Nature Genet. 42, 978–984 (2010).
Chambers, J. C. et al. Genetic loci influencing kidney function and chronic kidney disease. Nature Genet. 42, 373–375 (2010).
Wallace, C. et al. Genome-wide association study identifies genes for biomarkers of cardiovascular disease: serum urate and dyslipidemia. Am. J. Hum. Genet. 82, 139–149 (2008).
Li, S. et al. The GLUT9 gene is associated with serum uric acid levels in Sardinia and Chianti cohorts. PLoS Genet. 3, e194 (2007).
Doring, A. et al. SLC2A9 influences uric acid concentrations with pronounced sex-specific effects. Nature Genet. 40, 430–436 (2008).
Ferreira, M. A. & Purcell, S. M. A multivariate test of association. Bioinformatics 25, 132–133 (2009).
Ried, J. S. et al. PSEA: phenotype set enrichment analysis—a new method for analysis of multiple phenotypes. Genet. Epidemiol. 36, 244–252 (2012).
Raychaudhuri, S. et al. Identifying relationships among genomic disease regions: predicting genes at pathogenic SNP associations and rare deletions. PLoS Genet. 5, e1000534 (2009).
Krumsiek, J., Suhre, K., Illig, T., Adamski, J. & Theis, F. J. Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data. BMC Syst. Biol. 5, 21 (2011). This paper introduces partial correlation networks to high-throughput metabolomics studies that may be used in a systems biology approach to GWAS with metabolic traits.
Acknowledgements
K.S. is supported by 'Biomedical Research Program' funds at Weill Cornell Medical College in Qatar, a program funded by the Qatar Foundation. The statements made herein are solely the responsibility of the authors. The authors thank G. Kastenmüller, A.-K. Petersen and J. Adamski for critical reading of the manuscript. We thank our reviewers for suggestions that led to the improvement of the manuscript.
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Related links
Related links
FURTHER INFORMATION
Helmholtz Zentrum München (German Research Center for Environmental Health)
Ingenuity Systems Pathway Analysis
Nature Reviews Genetics Series on Study designs
NHGRI Catalog of Published Genome-Wide Association Studies
Online Mendelian Inheritance in Man (OMIM)
The Pharmacogenomics Knowledge Base (PharmGKB)
Software — Department of Epidemiology and Biostatistics — Karolinska Institutet
Glossary
- NMR spectroscopy
-
An experimental technique that identifies molecules by the specific pattern in the chemical shift of specific atoms.
- High-performance liquid-phase chromatography
-
(HPLC). A chromatographic technique used to separate a complex mixture of metabolites. Often used in combination with mass spectrometry.
- Metabolomics
-
The field of identifying metabolites in a biological sample using techniques such as NMR spectrometry and liquid- or gas-phase chromatography coupled with mass spectroscopy. 'Metabonomics' is often synonymously used in connection with NMR-based experiments.
- Metabolic traits
-
Quantitative measures of the concentrations of a specific metabolite.
- Genetically influenced metabotypes
-
(GIMs). Associations between a genetic variant and a metabolic phenotype.
- Metabolic individuality
-
The metabolic capacities of an individual, as defined by the ensemble of all functional genetic variants (genetically influenced metabotypes) in their metabolism-related genes. Historically, Garrod introduced the term 'chemical individuality' to represent this concept.
- Metabolome
-
The ensemble of all small molecules (metabolites) that are processed by the body's enzyme and transporter proteins.
- Glycerophosphatidylethanolamines
-
Glycerophospholipids with ethanolamine head groups.
- Q–Q plots
-
A graphical method for comparing probability distributions. In genome-wide association studies, it is used to verify whether the P values are normally distributed; an over-representation of low P values indicates possible true positive associations.
- Additive linear model
-
A mathematical model used in statistical association analysis; here, it assumes a linear additive effect of the minor alleles on the metabolite concentrations.
- Linkage disequilibrium
-
(LD). A nonrandom association between neighbouring gene variants; it is used to describe a region of high correlation between SNPs.
- Glycerophospholipids
-
Glycerol-based phospholipids are major constituents of the membrane bi-layers and are found in association with low-density lipoprotein (LDL) and high-density lipoprotein (HDL) particles.
- Sphingolipids
-
A class of lipids that contain a backbone of sphingoid bases
Rights and permissions
About this article
Cite this article
Suhre, K., Gieger, C. Genetic variation in metabolic phenotypes: study designs and applications. Nat Rev Genet 13, 759–769 (2012). https://doi.org/10.1038/nrg3314
Published:
Issue Date:
DOI: https://doi.org/10.1038/nrg3314
This article is cited by
-
Ancestry-driven metabolite variation provides insights into disease states in admixed populations
Genome Medicine (2023)
-
Metabolomics shows the Australian dingo has a unique plasma profile
Scientific Reports (2021)
-
Plasma amino acids indicate glioblastoma with ATRX loss
Amino Acids (2021)
-
Genetic basis for plasma amino acid concentrations based on absolute quantification: a genome-wide association study in the Japanese population
European Journal of Human Genetics (2019)
-
Epidemiology in Germany—general development and personal experience
European Journal of Epidemiology (2017)