Abstract
For many important complex traits, Genome Wide Association Studies (GWAS) have only recovered a small proportion of the variance in disease prevalence known to be caused by genetics. The most common explanation for this is the presence of multiple rare mutations that cannot be identified in GWAS due to a lack of statistical power. Such rare mutations may be concentrated in relatively few genes, as is the case for many known Mendelian diseases, where the mutations are often compound heterozygous (CH), defined below. Due to the multiple mutations, each of which contributes little by itself to the prevalence of the disease, GWAS also lacks power to identify genes contributing to a CH-trait. In this paper, we address the problem of finding genes that are causal for CH-traits, by introducing a discrete optimization problem, called the Phenotypic Distance Problem. We show that it can be efficiently solved on realistic-size simulated CH-data by using integer linear programming (ILP). The empirical results strongly validate this approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bang, S.Y., Na, Y.J., Bae, S.C., et al.: Targeted exon sequencing fails to identify rare coding variants with large effect in rheumatoid arthritis. Arthritis Res. Ther. 16(5), 447 (2014)
Bobadilla, J.L., Macek, M., Fine, J.P., Farrell, P.M.: Cystic fibrosis: a worldwide analysis of CFTR mutations-correlation with incidence data and application to screening. Hum. Mutat. 19(6), 575–606 (2002)
Hudson, R.: Generating samples under the Wright-Fisher neutral model of genetic variation. Bioinformatics 18(2), 337–338 (2002)
Hudson, R., Kaplan, N.: Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics 111, 147–164 (1985)
Hunt, K.A., Mistry, V., van Heel, D.A., et al.: Negligible impact of rare autoimmune-locus coding-region variants on missing heritability. Nature 498(7453), 232–235 (2013)
Liu, D.J., Leal, S.M.: A novel adaptive method for the analysis of next-generation sequencing data to detect complex trait associations with rare variants due to gene main effects and interactions. PLoS Genet. 6(10), e1001156 (2010)
Lohmuellers, K.E., Sparso, T., Pedersen, O., et al.: Whole-exome sequencing of 2,000 Danish individuals and the role of rare coding variants in type 2 diabetes. Am. J. Hum. Genet. 93(6), 1072–1086 (2013)
Manolio, T.A., Collins, F.S.: The HapMap and genome-wide association studies in diagnosis and therapy. Ann. Rev. Med. 60, 443–456 (2009)
Myerowitz, R.: Tay-Sachs disease-causing mutations and neutral polymorphisms in the Hex A gene. Hum. Mutat. 9(3), 195–208 (1997)
Neal, B.M., Rivas, M.A., Daly, M.J., et al.: Testing for an unusual distribution of rare variants. PLoS Genet. 7(3), e1001322 (2011)
Saenko, E.L., Ananyeva, N.N., Pipe, S., et al.: Molecular defects in coagulation Factor VIII and their impact on Factor VIII function. Vox Sang. 83(2), 89–96 (2002)
Visscher, P.M., Brown, M.A., McCarthy, M.I., Yang, J.: Five years of GWAS discovery. Am. J. Hum. Genet. 90, 7–24 (2012)
Wu, M.C., Lee, S., Cai, T., Li, Y., Boehnke, M., Lin, X.: Rare-variant association testing for sequencing data with the sequence kernel association test. Am. J. Hum. Genet. 89(1), 82–93 (2011)
Yufeng, Wu.: Personal communication (2014)
Zuk, O., Hechter, E., Sunyaev, S.R., Lander, E.S.: The mystery of missing heritability: Genetic interactions create phantom heritability. Proc. Nat. Acad. Sci. (USA) 109, 1193–1198 (2012)
Zuk, O., Schaffner, S.F., Lander, E.S., et al.: Searching for missing heritability: designing rare variant association studies. Proc. Nat. Acad. Sci. (USA) 111(4), E455–E464 (2014)
Acknowledgements
We thank Yufeng Wu and Charles Langley for helpful conversations and suggestions. Research partially supported by grants IIS-0803564, CCF-1017580, IIS-1219278 from the National Science Foundation.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gusfield, D., Nielsen, R. (2015). Association Mapping for Compound Heterozygous Traits Using Phenotypic Distance and Integer Programming. In: Pop, M., Touzet, H. (eds) Algorithms in Bioinformatics. WABI 2015. Lecture Notes in Computer Science(), vol 9289. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-48221-6_10
Download citation
DOI: https://doi.org/10.1007/978-3-662-48221-6_10
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-48220-9
Online ISBN: 978-3-662-48221-6
eBook Packages: Computer ScienceComputer Science (R0)