[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

HEMEsPred: Structure-Based Ligand-Specific Heme Binding Residues Prediction by Using Fast-Adaptive Ensemble Learning Scheme

Published: 01 January 2018 Publication History

Abstract

Heme is an essential biomolecule that widely exists in numerous extant organisms. Accurately identifying heme binding residues HEMEs is of great importance in disease progression and drug development. In this study, a novel predictor named HEMEsPred was proposed for predicting HEMEs. First, several sequence- and structure-based features, including amino acid composition, motifs, surface preferences, and secondary structure, were collected to construct feature matrices. Second, a novel fast-adaptive ensemble learning scheme was designed to overcome the serious class-imbalance problem as well as to enhance the prediction performance. Third, we further developed ligand-specific models considering that different heme ligands varied significantly in their roles, sizes, and distributions. Statistical test proved the effectiveness of ligand-specific models. Experimental results on benchmark datasets demonstrated good robustness of our proposed method. Furthermore, our method also showed good generalization capability and outperformed many state-of-art predictors on two independent testing datasets. HEMEsPred web server was available at http://www.inforstation.com/HEMEsPred/ for free academic use.

References

[1]
C. Fufezan, J. Zhang, and M. R. Gunner, "Ligand preference and orientation in b-and c-type heme-binding proteins," Proteins Struct. Function Bioinf., vol. 73, no. 3, pp. 690-704, May 2008.
[2]
M. Sugishima, et al., "Structural basis for the electron transfer from an open form of NADPH-cytochrome P450 oxidoreductase to heme oxygenase," in Proc. Nat. Academy Sci., USA, vol. 111, no. 7, pp. 2524-2529, Jan. 2014.
[3]
C. J. Reedy and B. R. Gibney, "Heme protein assemblies," Chemical Rev., vol. 104, no. 2, pp. 617-650, Jan. 2004.
[4]
L. Ko¿eny, M. Oborn_ik, and J. Luke¿, "Make it, take it, or leave it: Heme metabolism of parasites," PLoS Pathogens, vol. 9, no. 1, Jan. 2013, Art. no. e1003088.
[5]
A. Tappel, "Heme of consumed red meat can act as a catalyst of oxidative damage and could initiate colon, breast and prostate cancers, heart disease and other diseases," Med. Hypotheses, vol. 68, no. 3, pp. 562-564, 2007.
[6]
X. M. Liu, K. J. Peyton, A. R. Shebib, H. Wang, R. J. Korthuis, and W. Durante, "Activation of AMPK stimulates heme oxygenase-1 gene expression and human endothelial cell survival," Amer. J. Physiology-Heart Circulatory Physiology, vol. 300, no. 1, pp. H84-H93, Jan. 2011.
[7]
K. Nienhaus and G.U. Nienhaus, "Probing heme protein-ligand interactions by UV/visible absorption spectroscopy," in Protein-Ligand Interactions: Methods Applications. Totowa, NJ, USA: Humana Press, 2005, pp. 215-241.
[8]
T. G. Spiro, "Resonance Raman spectroscopy as a probe of heme protein structure and dynamics," Adv. Protein Chemistry, vol. 37, pp. 111-159, 1985.
[9]
G. Simonneaux and A. Bondon, "Mechanism of electron transfer in heme proteins and models: The NMR approach," Chemical Rev., vol. 105, no. 6, pp. 2627-2646, May 2005.
[10]
C. Caillet-Saguy, et al., "Deciphering the structural role of histidine 83 for heme binding in hemophore HasA," J. Biol. Chemistry, vol. 283, no. 9, pp. 5960-5970, Feb. 2008.
[11]
D. G. Levitt and L. J. Banaszak, "POCKET: A computer graphics method for identifying and displaying protein cavities and their surrounding amino acids," J. Mol. Graph., vol. 10, no. 4, pp. 229- 234, Dec. 1992.
[12]
M. Hendlich, F. Rippmann, and G. Barnickel, "LIGSITE: Automatic and efficient detection of potential small molecule-binding sites in proteins," J. Mol. Graph., vol. 15, no. 6, pp. 359-363, Dec. 1997.
[13]
R. A. Laskowski, "SURFNET: A program for visualizing molecular surfaces, cavities, and intermolecular interactions," J. Mol. Graph., vol. 13, no. 5, pp. 323-330, Oct. 1995.
[14]
J. Liang, H. Edelsbrunner, and C. Woodward, "Anatomy of protein pockets and cavities: Measurement of binding site geometry and implications for ligand design," Protein Sci., vol. 7, no. 9, pp. 1884-1897, Sep. 1998.
[15]
M. Weisel, E. Proschak, and G. Schneider, "PocketPicker: Analysis of ligand binding-sites with shape descriptors," Chemistry Central J., vol. 1, no. 7, pp. 1-17, Mar. 2007.
[16]
P. Bertolazzi, C. Guerra, and G. Liuzzi, "Predicting protein-ligand and protein-peptide interfaces," Eur. Phys. J. Plus, vol. 129, no. 6, pp. 1-10, Jun. 2014.
[17]
L. Cirio, S. Lucidi, F. Parasiliti, and M. Villani, "A global optimization approach for the synchronous motors design by finite element analysis," Int. J. Appl. Electromagn. Mech., vol. 16, no. 1, pp. 13-27, Jan. 2002.
[18]
L. Ellingson and J. Zhang, "Protein surface matching by combining local and global geometric information," PloS One, vol. 7, no. 7, Jul. 2012, Art. no. e40540.
[19]
W. Mehio, G. J. Kemp, P. Taylor, and M. D. Walkinshaw, "Identification of protein binding surfaces using surface triplet propensities," Bioinf., vol. 26, no. 20, pp. 2549-2555, Oct. 2010.
[20]
Z. R. Xie, C. K. Liu, F. C. Hsiao, A. Yao, and M. J. Hwang, "LISE: A server using ligand-interacting and site-enriched protein triangles for prediction of ligand-binding sites," Nucleic Acids Res., vol. 41, Jul. 2013, Art. no. gkt300.
[21]
A. Gutteridge, G. J. Bartlett, and J. M. Thornton, "Using a neural network and spatial clustering to predict the location of active sites in enzymes," J. Mol. Biol., vol. 330, no. 4, pp. 719-734, Jul. 2003.
[22]
A. J. Bordner, "Predicting small ligand binding sites in proteins using backbone structure," Bioinf., vol. 24, no. 24, pp. 2865-2871, Oct. 2008.
[23]
K. Chen, M. J. Mizianty, and L. Kurgan, "ATPsite: Sequence-based prediction of ATP-binding residues," Proteome Sci., vol. 9, no. Suppl 1, Oct. 2011, Art. no. S4.
[24]
T. Li, H. L. Bonkovsky, and J. T. Guo, "Structural analysis of heme proteins: implications for design and prediction," BMC Struct. Biol., vol. 11, no. 1, Mar. 2011, Art. no. 13.
[25]
R. Liu and J. Hu, "HemeBIND: A novel method for heme binding residue prediction by combining structural and sequence information," Bioinf., vol. 12, no. 1, May 2011, Art no. 207.
[26]
R. Liu and J. Hu, "Computational prediction of heme-binding residues by exploiting residue interaction network," PLoS One, vol. 6, no. 10, Oct. 2011, Art. no. e25560.
[27]
Y. Xiong, J. Liu, W. Zhang, and T. Zeng, "Prediction of heme binding residues from protein sequences with integrative sequence profiles," Proteome Sci., vol. 10, no. 1, Jun. 2012, Art. no. S20.
[28]
D. J. Yu, J. Hu, J. Yang, H. B. Shen, J. Tang, and J. Y. Yang, "Designing template-free predictor for targeting protein-ligand binding sites with classifier ensemble and spatial clustering," IEEE/ACM Trans. Comput. Biol. Bioinf., vol. 10, no. 4, pp. 994-1008, Jul./Aug. 2013.
[29]
N. V. Chawla, N. Japkowicz, and A. Kotcz, "Editorial: Special issue on learning from imbalanced data sets," ACM SIGKDD Explorations Newslett., vol. 6, no. 1, pp. 1-6, Jun. 2004.
[30]
J. Zhang, X. Zhao, P. Sun, B. Gao, and Z. Ma, "Conformational B-cell epitopes prediction from sequences using cost-sensitive ensemble classifiers and spatial clustering," BioMed Res. Int., vol. 2014, Jun. 2014, Art. no. 689219.
[31]
X. Y. Liu, J. Wu, and Z. H. Zhou, "Exploratory undersampling for class-imbalance learning," IEEE Trans. Syst. Man. Cybern. Part B: Cybern., vol. 39, no. 2, pp. 539-550, Dec. 2008.
[32]
N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, "SMOTE: Synthetic minority over-sampling technique," J. Artif. Intell. Res., vol. 16, pp. 321-357, 2002.
[33]
N. D. Rubinstein, M. Itay, and P. Tal, "A machine-learning approach for predicting B-cell epitopes," Mol. Immunology, vol. 46, no. 5, pp. 840-847, Feb. 2009.
[34]
P. W. Rose, et al., "The RCSB protein data bank: New resources for research and education," Nucleic Acids Res., vol. 41, pp. 475-482, Nov. 2012.
[35]
S. Soga, D. Kuroda, H. Shirai, M. Kobori, and N. Hirayama, "Use of amino acid composition to predict epitope residues of individual antibodies," Protein Eng. Des. Selection, vol. 23, no. 6, pp. 441-448, Mar. 2010.
[36]
J. Zhang, X. Zhao, P. Sun, and Z. Ma, "PSNO: Predicting cysteine S-nitrosylation sites by incorporating various sequence-derived features into the general form of Chou's PseAAC," Int. J. Mol. Sci., vol. 15, no. 7, pp. 11204-11219, Jun. 2014.
[37]
H. C. Lee, T. Hon, C. Lan, and L. Zhang, "Structural environment dictates the biological significance of heme-responsive motifs and the role of Hsp90 in the activation of the heme activator protein Hap1," Mol. Cellular Biol., vol. 23, no. 16, pp. 5857-5866, Aug. 2003.
[38]
M. Kull, "Involvement of heme regulatory motif in heme-mediated ubiquitination and degradation of IRP2," Mol. Cell, vol. 19, no. 2, pp. 171-181, 2005.
[39]
J. Igarashi, M. Murase, A. Iizuka, F. Pichierri, M. Martinkova, and T. Shimizu, "Elucidation of the heme binding site of hemeregulated eukaryotic initiation factor 2alpha kinase and the role of the regulatory motif in heme sensing by spectroscopic and catalytic studies of mutant proteins," J. Biol. Chemistry, vol. 283, no. 27, pp. 18782-18791, May 2008.
[40]
L. Yi, P. M. Jenkins, L. I. Leichert, U. Jakob, J. R. Martens, and S.W. Ragsdale, "Heme regulatory motifs in heme oxygenase-2 form a thiol/disulfide redox switch that responds to the cellular redox state," J. Biol. Chemistry, vol. 284, no. 31, pp. 20556-20561, May 2009.
[41]
Z. Chen, Y. Z. Chen, X. F. Wang, C. Wang, R. X. Yan, and Z. Zhang, "Prediction of ubiquitination sites by using the composition of k-spaced amino acid pairs," PloS One, vol. 6, no. 7, Jul. 2011, Art. no. e22930.
[42]
H. Zhang, et al., "Critical assessment of high-throughput standalone methods for secondary structure prediction," Briefings Bioinf., vol. 12, no. 6, pp. 672-688, Jan. 2011.
[43]
J. Mihel, M. ¿iki_c, S. Tomi_c, B. Jeren, and K. Vlahovi¿ek, "PSAIA-protein structure and interaction analyzer," BMC Struct. Biol., vol. 8, no. 1, Apr. 2008, Art. no. 21.
[44]
M. Paoli, J. Marles-Wright, and A. Smith, "Structure-function relationships in heme-proteins," DNA Cell Biol., vol. 21, no. 4, pp. 271-280, 2002.
[45]
Y. Sasakura, T. Yoshimura-Suzuki, H. Kurokawa, and T. Shimizu, "Structure--Function relationships of EcDOS, a heme-regulated phosphodiesterase from escherichia coli," Accounts Chemical Res., vol. 39, no. 1, pp. 37-43, Apr. 2006.
[46]
K. P. Holbourn, K. R. Acharya, and B. Perbal, "The CCN family of proteins: Structure--Function relationships," Trends Biochemical Sci., vol. 33, no. 10, pp. 461-473, Oct. 2008.
[47]
J. Zhang, P. Sun, X. Zhao, and Z. Ma, "PECM: Prediction of extracellular matrix proteins using the concept of Chou's pseudo amino acid composition," J. Theoretical Biol., vol. 363, pp. 412-418, Dec. 2014.
[48]
J. Zhang, W. Chen, P. Sun, X. Zhao, and Z. Ma, "Prediction of protein solvent accessibility using PSO-SVR with multiple sequence-derived features and weighted sliding window scheme," BioData Mining, vol. 8, no. 1, Jan. 2015, Art. no. 3.
[49]
W. Kabsch and C. Sander, "Dictionary of protein secondary structure: Pattern recognition of hydrogen-bonded and geometrical features," Biopolymers, vol. 22, no. 12, pp. 2577-2637, 1983.
[50]
D. J. Yu, et al., "TargetATPsite: A template-free method for ATP-binding sites prediction with residue evolution image sparse representation and classifier ensemble," J. Comput. Chemistry, vol. 34, no. 11, pp. 974-985, Apr. 2013.
[51]
G. C. Cawley and N. L. C. Talbot, "On over-fitting in model selection and subsequent selection Bias in performance evaluation," J. Mach. Learn. Res., vol. 11, no. 1, pp. 2079-2107, 2010.
[52]
V. Vacic, M. L. Iakoucheva, and P. Radivojac, "Two Sample Logo: A graphical representation of the differences between two sets of sequence alignments," Bioinf., vol. 22, no. 12, pp. 1536- 1537, Apr. 2006.
[53]
V. Vacic, V. N. Uversky, A. K. Dunker, and S. Lonardi, "Composition Profiler: A tool for discovery and visualization of amino acid composition differences," Bioinf., vol. 8, no. 1, Jun. 2007, Art. no. 211.
[54]
B. Alberts, et al., Essential Cell Biology. New York, NY, USA: Garland Science, 2013.
[55]
R. H. Lopes, "Kolmogorov--Smirnov test," in International Encyclopedia of Statistical Science, M. Lovric, Ed. Berlin, Germany: Springer-Verlag, pp. 718-720, Dec. 2014.
[56]
H. Javadzadegan, N. Nezami, K. Ghobadi, A. Sadighi, A. A. Abolfathi, and A. N. Nader, "High-sensitivity C-reactive protein (hs-CRP) and tumor necrotizing factor-alpha (TNF-alpha) after onand off- pump coronary artery bypass grafting," HSR Proc. Intensive Care Cardiovasc Anesth, vol. 2, no. 1, pp. 27-33, 2010.
[57]
G. Rätsch, T. Onoda, and K. R. Müller, "Soft margins for AdaBoost," Mach. Learn., vol. 42, no. 3, pp. 287-320, 2001.
[58]
N. Eswar, D. Eramian, B. Webb, M. Y. Shen, and A. Sali, "Protein structure modeling with modeller," Methods Mol. Biol., vol. 426, no. 1137, pp. 145-159, 2008.
[59]
N. Eswar, et al., "Comparative protein structure modeling using modeller," in Current Protocols Bioinformatics. Hoboken, NJ, USA: Wiley, 2006, pp. 5.6.1-5.6.30.
  1. HEMEsPred: Structure-Based Ligand-Specific Heme Binding Residues Prediction by Using Fast-Adaptive Ensemble Learning Scheme

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image IEEE/ACM Transactions on Computational Biology and Bioinformatics
      IEEE/ACM Transactions on Computational Biology and Bioinformatics  Volume 15, Issue 1
      January 2018
      352 pages

      Publisher

      IEEE Computer Society Press

      Washington, DC, United States

      Publication History

      Published: 01 January 2018
      Published in TCBB Volume 15, Issue 1

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 27
        Total Downloads
      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 01 Jan 2025

      Other Metrics

      Citations

      View Options

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media