[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2975167.2985692acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
research-article

G-quadruplex Structure Prediction and integration in the GenData2020 data model

Published: 02 October 2016 Publication History

Abstract

DNA is a long polymer encoding information of all living organisms. It usually folds in a double-helix shape and packed in chromosomes. Nonetheless DNA exists in many other conformations, including A-DNA, B-DNA and Z-DNA. B-DNA is the most common form found in cells. Non-B DNAs comprise of tetraplex (G-quadruplex), left-handed Z-DNA, and others. Several recent publications have provided significant evidence that non-B DNA structures may play a role in DNA instability and mutagenesis, leading to both DNA rearrangements and increased mutation rates, which are often related with a number of diseases, among which cancer. Studying the structure conformation and probability to find non-B DNA structures on a DNA or RNA sequence may help in studying diseases and selectively inhibit genes as well as designing new drugs. Nevertheless, even if there are some examples of prediction tools, the topic of designing efficient prediction algorithms and tools for G-quadruplex prediction is still in its infancy.
As a contribution in this area, we present a set of novel software tools, both pattern-based and machine learning-based which are able to predict G-quadruplex conformations starting from a DNA/RNA input sequence. We show they outperform the state-of-the-art software tools available in literature by comparing their predictive performances.
Results of this work have been used to implement a web-based prediction tool available online. We also report about the integration of such a tool with the GenData2020 cloud-based system.

References

[1]
Veltri, P., Cannataro, M., Tradigo, G., Sharing mass spectrometry data in a grid-based distributed proteomics laboratory, Information processing & management, 43(3), 577--591, 2007
[2]
Gullo, F., Ponti, G., Tagarelli, A., Tradigo, G., Veltri, P., A time series approach for clustering mass spectrometry data, Journal of Computational Science, 3(5), 344--355, 2012
[3]
Guzzi, P. H., Di Martino, M. T., Tradigo, G., Veltri, P., Tassone, P., Tagliaferri, P., Cannataro, M., Automatic summarisation and annotation of microarray data, Soft Computing, 15(8), 1505--1512, 2011
[4]
Metzker, M. L., Sequencing technologies-the next generation, Nature reviews genetics, 11(1), 31--46, 2010
[5]
Frazer, K. A., Ballinger, D. G., Cox, D. R., Hinds, D. A., Stuve, L. L., Gibbs, R. A., ... Pasternak, S., A second generation human haplotype map of over 3.1 million SNPs, Nature, 449(7164), 851--861, 2007
[6]
Lam, E. Y. N., Beraldi, D., Tannahill, D., Balasubramanian, S., G-quadruplex structures are stable and detectable in human genomic DNA, Nature communications, 4, 1796, 2013
[7]
Tradigo, G., Mannella, L., Veltri, P., Assessment of G-quadruplex prediction tools, In Computer-Based Medical Systems (CBMS), 243--246, 2014
[8]
Xu, C.X., Zheng, Y.X., Zheng, X.H., Hu, Q., Zhao, Y., Ji, L.N., Mao, Z.W., V-Shaped Dinuclear Pt(II) Complexes: Selective Interaction with Human Telomeric G-quadruplex and Significant Inhibition towards Telomerase, Scientific Reports, 3, 2060, 2013
[9]
Li, Q., Xiang, J.F., Yang, Q.F., Sun, H.X., Guan, A.J., Tang, Y.L., G4LDB a database for discovering and studying G-quadruplex ligands, Nucleic Acids Research, 41, 1115--1123, 2012
[10]
Harley, C. B., Villeponteau, B., Telomeres and telomerase in aging and cancer, Current opinion in genetics & development, 5(2), 249--255, 1995
[11]
Aubert, G., Lansdorp, P. M., Telomeres and aging, Physiological reviews, 88(2), 557--579, 2008
[12]
Huppert, J.L, Structure, location and interactions of G-quadruplexes, FEBS Journal, 277, 3452--3458, 2010
[13]
Neidle, S., Balasubramanian, S., Quadruplex Nucleic Acids, RSC Publishing, 2006
[14]
Nucleic Acids Research, 41, 94--100, 2013 Cer, R. Z., Donohue, D. E., Mudunuri, U. S., Temiz, N. A., Loss, M. A., Starner, N. J., Halusa, G.N., Volfovsky, N., Yi, M., Luke, B.T., Bacolla, A., Collins, J.R., Stephens, R.M., Non-B DB v2. 0: a database of predicted non-B DNA-forming motifs and its associated tools. Nucleic acids research, 41, 94--100, 2012
[15]
Yadav, V.K., Abraham, J.K., Mani, P., Kulshrestha, R., Chowdhury, S., QuadBase: genome-wide database of G4 DNA-occurrence and conservation in human, chimpanzee, mouse and rat promoters and 146 microbes, Nucleic Acids Research, 36, 381--385, 2008
[16]
Zhang, R., Lin, Y., Zhang, C.T., Greglist: a database listing potential G-quadruplex regulated genes, Nucleic Acids Research, 36, 372--376, 2008
[17]
Kikin, O., D'Antonio, L., Bagga, P., QGRS Mapper: a web-based server for predicting G-quadruplexes in nucleotide sequences, Nucleic Acids Research, 2006, 34, 676--682, 2006
[18]
D'Antonio, L., Bagga, P., Computational Methods for Predicting Intramolecular G-quadruplexes in Nucleotide Sequences, Computational Systems Bioinformatics Conference, 2004
[19]
Cer, R.Z., Bruce, K.H., Donohue, D.E., Temiz, N.A., Mudunuri, U.S., Yi, M., Volfovsky, N., Bacolla, A., Luke, B.T., Collins, J.R., Stephens, R.M., Searching for non-B DNA-forming motifs using nBMST (non-B DNA Motif Search Tool), Current Protocols in Human Genetics, 18, 2012
[20]
Berman, H. M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T. N., Weissig, H., Shindyalov, I. N., Bourne, P. E., The protein data bank, Nucleic acids research, 28(1), 235--242, 2000
[21]
Rodriguez-TomÃl', P., Stoehr, P. J., Cameron, G. N., Flores, T. P., The european bioinformatics institute (EBI) databases, Nucleic acids research, 24(1), 6--12, 1996
[22]
Huppert, J.L., Hunting G-quadruplexes, Biochimie, 90:8, 1140--1148, 2008
[23]
Mirabello, C., Tradigo, G., Pollastri, G., Distill: protein structure prediction by Machine Learning, Poster in IX Ed. of CASP, Critical Assessment of Techniques for Protein Structure Prediction, 2012
[24]
Tradigo, G., Greco, S., Veltri, P., G4Predictor website, http://g4predictor.appspot.com/, 2015
[25]
Masseroli, M., Pinoli, P., Venco, F., Kaitoua, A., Jalili, V., Palluzzi, F., Muller, H., Ceri, S., GenoMetric Query Language: a novel approach to large-scale genomic data management, Bioinformatics, 31(12), 1881--1888, 2015
[26]
Bertoni, M., Ceri, S., Kaitoua, A., Pinoli, P., Evaluating cloud frameworks on genomic applications, Big Data 2015 IEEE International Conference, 193--202, 2015
[27]
Fernandez, J. D., Lenzerini, M., Masseroli, M., Venco, F., Ceri, S., Ontology-based search of genomic metadata, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 13(2), 233--247, 2016
[28]
di Vimercati, S. D. C., Foresti, S., Samarati, P., Data Protection in Cloud Scenarios, International Workshop on Data Privacy Management, 3--10, 2015
[29]
Baldacci, L., Golfarelli, M., Graziani, S., Rizzi, S., GOLAM: a framework for analyzing genomic data, 17th International Workshop on Data Warehousing and OLAP, 3--12, 2014

Cited By

View all
  • (2019)G4-Interacting DNA Helicases and Polymerases: Potential Therapeutic TargetsCurrent Medicinal Chemistry10.2174/092986732466617111612334526:16(2881-2897)Online publication date: 26-Aug-2019
  1. G-quadruplex Structure Prediction and integration in the GenData2020 data model

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    BCB '16: Proceedings of the 7th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics
    October 2016
    675 pages
    ISBN:9781450342254
    DOI:10.1145/2975167
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 02 October 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Computational genomics
    2. DNA
    3. G-quadruplex
    4. RNA
    5. prediction

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    BCB '16
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 254 of 885 submissions, 29%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)8
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 16 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2019)G4-Interacting DNA Helicases and Polymerases: Potential Therapeutic TargetsCurrent Medicinal Chemistry10.2174/092986732466617111612334526:16(2881-2897)Online publication date: 26-Aug-2019

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media