Technical Report
Published: 02 February 2015

LD Score regression distinguishes confounding from polygenicity in genome-wide association studies

Brendan K Bulik-Sullivan^1,2,3,
Po-Ru Loh^1,4,
Hilary K Finucane^4,5,
Stephan Ripke^2,3,
Jian Yang ORCID: orcid.org/0000-0003-2001-2474⁶,
Schizophrenia Working Group of the Psychiatric Genomics Consortium,
Nick Patterson¹,
Mark J Daly^1,2,3,
Alkes L Price^1,4,7 &
…
Benjamin M Neale^1,2,3

Nature Genetics volume 47, pages 291–295 (2015)Cite this article

110k Accesses
2914 Citations
89 Altmetric
Metrics details

Subjects

Abstract

Both polygenicity (many small genetic effects) and confounding biases, such as cryptic relatedness and population stratification, can yield an inflated distribution of test statistics in genome-wide association studies (GWAS). However, current methods cannot distinguish between inflation from a true polygenic signal and bias. We have developed an approach, LD Score regression, that quantifies the contribution of each by examining the relationship between test statistics and linkage disequilibrium (LD). The LD Score regression intercept can be used to estimate a more powerful and accurate correction factor than genomic control. We find strong evidence that polygenicity accounts for the majority of the inflation in test statistics in many GWAS of large sample size.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on SpringerLink
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Figure 1: Results from selected simulations.**

**Figure 2: LD Score regression plot for the most recent schizophrenia meta-analysis.**

Controlling for background genetic effects using polygenic scores improves the power of genome-wide association studies

Article Open access 01 October 2021

Boosting the power of genome-wide association studies within and across ancestries by using polygenic scores

Article 18 September 2023

Genome-wide association studies

Article 26 August 2021

References

Pritchard, J.K. & Przeworski, M. Linkage disequilibrium in humans: models and data. Am. J. Hum. Genet. 69, 1–14 (2001).
Article CAS Google Scholar
Sham, P.C., Cherny, S.S., Purcell, S. & Hewitt, J.K. Power of linkage versus association analysis of quantitative traits, by use of variance-components models, for sibship data. Am. J. Hum. Genet. 66, 1616–1630 (2000).
Article CAS Google Scholar
Yang, J. et al. Genomic inflation factors under polygenic inheritance. Eur. J. Hum. Genet. 19, 807–812 (2011).
Article Google Scholar
Voight, B.F. & Pritchard, J.K. Confounding from cryptic relatedness in case-control association studies. PLoS Genet. 1, e32 (2005).
Article Google Scholar
Devlin, B. & Roeder, K. Genomic control for association studies. Biometrics 55, 997–1004 (1999).
Article CAS Google Scholar
Lin, D.Y. & Sullivan, P.F. Meta-analysis of genome-wide association studies with overlapping subjects. Am. J. Hum. Genet. 85, 862–872 (2009).
Article CAS Google Scholar
1000 Genomes Project Consortium. An integrated map of genetic variation from 1,092 human genomes. Nature 491, 56–65 (2012).
Yin, P. & Fan, X. Estimating R² shrinkage in multiple regression: a comparison of different analytical methods. J. Exp. Educ. 69, 203–224 (2001).
Article Google Scholar
Ralph, P. & Coop, G. The geography of recent genetic ancestry across Europe. PLoS Biol. 11, e1001555 (2013).
Article CAS Google Scholar
Bersaglieri, T. et al. Genetic signatures of strong recent positive selection at the lactase gene. Am. J. Hum. Genet. 74, 1111–1120 (2004).
Article CAS Google Scholar
McVicker, G., Gordon, D., Davis, C. & Green, P. Widespread genomic signatures of natural selection in hominid evolution. PLoS Genet. 5, e1000471 (2009).
Article Google Scholar
Price, A.L. et al. The impact of divergence time on the nature of population structure: an example from Iceland. PLoS Genet. 5, e1000505 (2009).
Article Google Scholar
International Multiple Sclerosis Genetics Consortium & Wellcome Trust Case Control Consortium 2. Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis. Nature 476, 214–219 (2011).
Splansky, G.L. et al. The Third Generation Cohort of the National Heart, Lung, and Blood Institute's Framingham Heart Study: design, recruitment, and initial examination. Am. J. Epidemiol. 165, 1328–1335 (2007).
Article Google Scholar
Sullivan, P.F. et al. Genome-wide association for major depressive disorder: a possible role for the presynaptic protein piccolo. Mol. Psychiatry 14, 359–375 (2009).
Article CAS Google Scholar
Heid, I.M. et al. Meta-analysis identifies 13 new loci associated with waist-hip ratio and reveals sexual dimorphism in the genetic basis of fat distribution. Nat. Genet. 42, 949–960 (2010).
Article CAS Google Scholar
Lango Allen, H. et al. Hundreds of variants clustered in genomic loci and biological pathways affect human height. Nature 467, 832–838 (2010).
Article CAS Google Scholar
Neale, B.M. et al. Meta-analysis of genome-wide association studies of attention-deficit/hyperactivity disorder. J. Am. Acad. Child Adolesc. Psychiatry 49, 884–897 (2010).
Article Google Scholar
Speliotes, E.K. et al. Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index. Nat. Genet. 42, 937–948 (2010).
Article CAS Google Scholar
Stahl, E.A. et al. Genome-wide association study meta-analysis identifies seven new rheumatoid arthritis risk loci. Nat. Genet. 42, 508–514 (2010).
Article CAS Google Scholar
Tobacco & Genetics Consortium. Genome-wide meta-analyses identify multiple loci associated with smoking behavior. Nat. Genet. 42, 441–447 (2010).
International Consortium for Blood Pressure Genome-Wide Association Studies. Genetic variants in novel pathways influence blood pressure and cardiovascular disease risk. Nature 478, 103–109 (2011).
Psychiatric GWAS Consortium Bipolar Disorder Working Group. Large-scale genome-wide association analysis of bipolar disorder identifies a new susceptibility locus near ODZ4. Nat. Genet. 43, 977–983 (2011).
Schunkert, H. et al. Large-scale association analysis identifies 13 new susceptibility loci for coronary artery disease. Nat. Genet. 43, 333–338 (2011).
Article CAS Google Scholar
Estrada, K. et al. Genome-wide meta-analysis identifies 56 bone mineral density loci and reveals 14 loci associated with risk of fracture. Nat. Genet. 44, 491–501 (2012).
Article CAS Google Scholar
Jostins, L. et al. Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease. Nature 491, 119–124 (2012).
Article CAS Google Scholar
Manning, A.K. et al. A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance. Nat. Genet. 44, 659–669 (2012).
Article CAS Google Scholar
Morris, A.P. et al. Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nat. Genet. 44, 981–990 (2012).
Article CAS Google Scholar
Cross-Disorder Group of the Psychiatric Genomics Consortium. Identification of risk loci with shared effects on five major psychiatric disorders: a genome-wide analysis. Lancet 381, 1371–1379 (2013).
Major Depressive Disorder Working Group of the Psychiatric GWAS Consortium. A mega-analysis of genome-wide association studies for major depressive disorder. Mol. Psychiatry 18, 497–511 (2013).
Rietveld, C.A. et al. GWAS of 126,559 individuals identifies genetic variants associated with educational attainment. Science 340, 1467–1471 (2013).
Article CAS Google Scholar
Schizophrenia Working Group of the Psychiatric Genomics Consortium. Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421–427 (2014).
Patterson, N., Price, A.L. & Reich, D. Population structure and eigenanalysis. PLoS Genet. 2, e190 (2006).
Article Google Scholar
Price, A.L. et al. Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38, 904–909 (2006).
Article CAS Google Scholar
Kang, H.M. et al. Variance component model to account for sample structure in genome-wide association studies. Nat. Genet. 42, 348–354 (2010).
Article CAS Google Scholar
Lippert, C. et al. FaST linear mixed models for genome-wide association studies. Nat. Methods 8, 833–835 (2011).
Article CAS Google Scholar
Korte, A. et al. A mixed-model approach for genome-wide association studies of correlated traits in structured populations. Nat. Genet. 44, 1066–1071 (2012).
Article CAS Google Scholar
Yang, J., Lee, S.H., Goddard, M.E. & Visscher, P.M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82 (2011).
Article CAS Google Scholar
Jakkula, E. et al. The genome-wide patterns of variation expose significant substructure in a founder population. Am. J. Hum. Genet. 83, 787–794 (2008).
Article CAS Google Scholar
International HapMap 3 Consortium. Integrating common and rare genetic variation in diverse human populations. Nature 467, 52–58 (2010).
Price, A.L. et al. Long-range LD can confound genome scans in admixed populations. Am. J. Hum. Genet. 83, 132–135, author reply 135–139 (2008).
Article CAS Google Scholar
Smith, A.V., Thomas, D.J., Munro, H.M. & Abecasis, G.R. Sequence features in regions of weak and strong linkage disequilibrium. Genome Res. 15, 1519–1534 (2005).
Article CAS Google Scholar
She, X. et al. The structure and evolution of centromeric transition regions within the human genome. Nature 430, 857–864 (2004).
Article CAS Google Scholar

Download references

Acknowledgements

We would like to thank P. Sullivan for helpful discussion. This work was supported by US National Institutes of Health grants F32 HG007805 (P.-R.L.), R01 HG006399 (A.L.P.), R03 CA173785 (H.K.F.) and R01 MH094421 (PGC) and by the Fannie and John Hertz Foundation (H.K.F.). Data on coronary artery disease and myocardial infarction were contributed by CARDIoGRAMplusC4D investigators and were downloaded from Psychiatric Genomics Consortium.

Author information

Authors and Affiliations

Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
Brendan K Bulik-Sullivan, Po-Ru Loh, Nick Patterson, Mark J Daly, Alkes L Price & Benjamin M Neale
Department of Medicine, Analytical and Translational Genetics Unit, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, USA
Brendan K Bulik-Sullivan, Stephan Ripke, Mark J Daly & Benjamin M Neale
Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
Brendan K Bulik-Sullivan, Stephan Ripke, Mark J Daly & Benjamin M Neale
Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA
Po-Ru Loh, Hilary K Finucane & Alkes L Price
Department of Mathematics, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
Hilary K Finucane
Queensland Brain Institute, University of Queensland, Brisbane, Queensland, Australia
Jian Yang
Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA
Alkes L Price

Authors

Brendan K Bulik-Sullivan
View author publications
You can also search for this author in PubMed Google Scholar
Po-Ru Loh
View author publications
You can also search for this author in PubMed Google Scholar
Hilary K Finucane
View author publications
You can also search for this author in PubMed Google Scholar
Stephan Ripke
View author publications
You can also search for this author in PubMed Google Scholar
Jian Yang
View author publications
You can also search for this author in PubMed Google Scholar
Nick Patterson
View author publications
You can also search for this author in PubMed Google Scholar
Mark J Daly
View author publications
You can also search for this author in PubMed Google Scholar
Alkes L Price
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin M Neale
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

Schizophrenia Working Group of the Psychiatric Genomics Consortium

Contributions

B.K.B.-S. conceived the idea, analyzed the data, performed the analyses and drafted the manuscript. B.M.N. conceived the idea and drafted the manuscript. M.J.D. conceived the idea and supplied reagents. N.P. conceived the idea and supplied reagents. A.L.P. conceived the idea and supplied reagents. P.-R.L. analyzed the data and performed the analyses. H.K.F. analyzed the data and performed the analyses. S.R. analyzed the data and performed the analyses. J.Y. provided software. All authors provided input and revisions for the final manuscript.

Corresponding author

Correspondence to Benjamin M Neale.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Additional information

A full list of members and affiliations appears in the Supplementary Note.

Supplementary information

Supplementary Text and Figures

Supplementary Note, Supplementary Figures 1–9 and Supplementary Tables 1–10. (PDF 1264 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bulik-Sullivan, B., Loh, PR., Finucane, H. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet 47, 291–295 (2015). https://doi.org/10.1038/ng.3211

Download citation

Received: 07 March 2014
Accepted: 07 January 2015
Published: 02 February 2015
Issue Date: March 2015
DOI: https://doi.org/10.1038/ng.3211

This article is cited by

Defining type 2 diabetes polygenic risk scores through colocalization and network-based clustering of metabolic trait genetic associations
- Samuel Ghatan
- Jeroen van Rooij
- Ling Oei
Genome Medicine (2024)
The genetic architecture of youth anxiety: a study protocol
- Laina McAusland
- Christie L. Burton
- Sandra Meier
BMC Psychiatry (2024)
Genetic association and causal relationship between multiple modifiable risk factors and autoimmune liver disease: a two-sample mendelian randomization study
- Weize Gao
- Chong Peng
- Mingjun Liu
Journal of Translational Medicine (2024)
Characterizing the polygenic overlap and shared loci between rheumatoid arthritis and cardiovascular diseases
- Xiaohui Sun
- Yu Qian
- Yingying Mao
BMC Medicine (2024)
Genetic architecture distinguishes tinnitus from hearing loss
- Royce E. Clifford
- Adam X. Maihofer
- Caroline M. Nievergelt
Nature Communications (2024)

LD Score regression distinguishes confounding from polygenicity in genome-wide association studies

Subjects

Abstract

Access options

Similar content being viewed by others

Controlling for background genetic effects using polygenic scores improves the power of genome-wide association studies

Boosting the power of genome-wide association studies within and across ancestries by using polygenic scores

Genome-wide association studies

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

Schizophrenia Working Group of the Psychiatric Genomics Consortium

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Text and Figures

Rights and permissions

About this article

Cite this article

This article is cited by

Defining type 2 diabetes polygenic risk scores through colocalization and network-based clustering of metabolic trait genetic associations

The genetic architecture of youth anxiety: a study protocol

Genetic association and causal relationship between multiple modifiable risk factors and autoimmune liver disease: a two-sample mendelian randomization study

Characterizing the polygenic overlap and shared loci between rheumatoid arthritis and cardiovascular diseases

Genetic architecture distinguishes tinnitus from hearing loss

Search

Quick links

Subjects

Abstract

Access options

Similar content being viewed by others

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

Schizophrenia Working Group of the Psychiatric Genomics Consortium

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links