[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Software Tool Article

Biological network analysis with CentiScaPe: centralities and experimental dataset integration

[version 1; peer review: 2 approved with reservations]
PUBLISHED 01 Jul 2014
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

This article is included in the Cytoscape gateway.

Abstract

The growing dimension and complexity of available experimental data generating biological networks has increased the need for tools allowing to categorize nodes by their topological relevance in biological networks. Here we present CentiScaPe, a Cytoscape app specifically designed to calculate centrality indexes for the identification of the most important nodes of a network. CentiScaPe is a comprehensive suite of algorithms dedicated to network nodes centrality analysis, computing several centralities for undirected, directed and weighted networks. The results of the topological analysis can be also integrated with data sets from lab experiments, such as expression or phosphorylation levels of the proteins represented in the network, using the graphical features of the tool. This opens a new perspective in the analysis of biological networks, since integration of topological analysis with lab experimental data can increase the predictive power of a bioinformatical analysis.

Introduction

Biological processes can be displayed as networks where the nodes represent biological entities, and edges represent interactions between these entities. Several kinds of biological networks have been introduced, including metabolic networks, gene networks, signal transduction networks, and protein-protein interaction networks1. Such networks are a static representation of the dynamics of biological processes, where molecular interactions give rise to cascades of reactions, called pathways, determining the life processes of living organisms. Even if the scientific community is far from being able to simulate the dynamic behavior of such pathways, important information can be extracted from the topological analysis of biological networks2,3 since the structure of a network can affect its function4. In this context, several global parameters are commonly used to describe the properties of the whole networks, while centralities5 are parameters allowing the identification of the most important nodes which could be key regulators in the biological process being studied.

In this paper we present CentiScaPe 2.1, a Cytoscape app6,7 for network centralities analysis. While the built in Analyze Network tool is oriented to characterize the global behavior of the network. Provided with several global network statistics, CentiScaPe is designed to identify the most relevant nodes, and it provides a more complete set of centralities. This new version introduces the computation of centralities for directed and weighted networks not available in any other Cytoscape app. As in the previous version, it computes Average Distance, Diameter, Degree, Stress, Betweenness, Radiality, Closeness, Centroid Value, and Eccentricity. Other parameters as Eigenvector, Bridging centrality and Edge Betweenness have been added, making it the most complete app for network centrality analysis.

A web version of CentiScaPe, FastCentiScaPe, is also available (http://www.cbmc.it/fastcent/), which performs very fast computations for large networks sending the network to a multiprocessor server. The centrality analysis results are sent to the user by e-mail in xgmml format.

CentiScaPe’s main goal is to produce results that should drive further lab experiments, as the high score nodes identified by the computation can be considered as potential targets for drug development and new experiments.

Methods and implementation

To calculate all the centralities, the computation of the shortest path between each pair of nodes in the graph is needed. The algorithm for the shortest path is the Dijkstra algorithm8, that has been adjusted in order to compute all the shortest paths pairs (needed for Stress and Betweenness computation)9 and to use also edge direction and edge weight in the shortest path identification. A good description of this and other centrality algorithms can be found in Koschützki et al.5. CentiScaPe 2.1 introduces the computation of centralities for directed networks, networks where the edges are considered to have a direction. Consequently, some nodes cannot be reached by others: given two nodes s and t, it is possible that there is no path from s to t, or from t to s or both. Several centrality parameters are based on the computation of the shortest path between each couple of nodes and are not defined when there are two nodes not connected by a path. In this case CentiScaPe consider the distance from s to t equal to infinity. The centrality definitions have been modified to consider this case (see supplementary materials: CentralitiesTutorial), so the directed centralities can also be used to compute centralities for disconnected networks, i.e. networks where some nodes cannot be reached by others. This gives the use of centrality analysis great flexibility.

In the case of weighted networks the edges are supposed to have a numerical integer attribute depending on experimental data or on some feature of the network. This numerical value is treated as a distance in the computation of the shortest path between two nodes. In an unweighted network the distance between two node connected by an edge is equals to 1, and the distance of two generic nodes is the number of edge of the shortest path connecting them. In a weighted network the distance is the sum of the attribute values of the edges connecting the nodes, so the shortest path is not necessary the one with the lowest number of edges, but the one with the least distance.

CentiScaPe is written in Java as a Cytoscape app, in order to exploit all the excellent features of Cytoscape and to reach the largest number of users. It has a multi-thread core that can exploit multiprocessor architecture. The Java library JFreechart10 has been used for some of the graphic features.

Results and discussion

The Main use of CentiScaPe is to rank the nodes of a network depending on their topological and experimental relevance. The numerical results are saved as node, edge or network attributes in the Cytoscape attributes browser, depending on the kind of parameters, so all the Cytoscape features for managing attributes are supported. After the computation the centralities are treated as normal Cytoscape attributes. CentiScaPe can be used in undirected networks11, in directed networks and in weighted networks.

Centralities for directed networks (see Supplementary Files: CentralitiesTutorial) are useful in the case of metabolic networks where the direction is from substrates and reactants to the products of the chemical reactions and in signal transduction networks, where the direction depends on the flux of information. Considering direction in the computation of centralities can lead to different and more precise results than the undirected version.

As example, in Figure 1 the computation of the directed and undirected Stress applied to a network of Oncogenes is shown (see Supplementary Files: Oncogenes.txt and Oncogenes_edge_directions.txt). Results of both the computations are shown. The image, obtained using Cytoscape, represents the different Stress values using the color and size of nodes. The node’s size represents the value obtained using the directed Stress, so the bigger the node the higher the value; the color represents the values obtained using the undirected Stress: red is used for the highest values, green for the lowest values. For example a large green node is interesting because it means that a node with a high value of the new algorithm has a low value using CentiScaPe. While analyzing the oncogenes network we saw that the large red node, AKT1, shows how its stress values are high using both algorithms. However the green medium-sized node, FANCE or RAF1, shows how, using undirected Stress, we obtain a low value for stress, but using the new algorithm we obtain a high stress value. The opposite situation is found in the third highlighted node, the small yellow node, RB1, in the right bottom corner. Here, the value computed with undirected Stress is not very high, similar to the red node, but the value computed with the directed Stress is very low. This can be explained by suggesting that AKT1 is a very important node, that is essential for maintaining connections within this network. For RB1 and CREB1 the situation is not very clear because we have opposite situations. If we use CentiScape, for FANCE, we assume that this node is not essential, but using the directed values it appears to be very central.

eed4b08c-ce97-4bce-9907-f0016e8825f0_figure1.gif

Figure 1. Stress computed on oncogenes.

Size represents directed Stress, color undirected Stress (green=low, red=high).

Second important features of the new version of CentiScaPe is the possibility of computing centralities for weighted networks, networks where the edges are provided with an attribute that can be interpreted as a distance between the two connected nodes.

In the network depicted in Figure 2 we have a distance (dist) attribute for each edge. We have dist(A,B)=2, dist(B,C)=3 and dist(A,C)=7. Since A and C are connected by a single edge, in an unweighted computation, the distance from A to C is equal to 1. But if the attributes of the edges are considered as distances, the shortest path between A and C is the one passing through B (=2+3=5) since it is shorter than the one connecting A directly to C (=7). The computation of weighted shortest paths will result in completely different values compared to a situation where the weight is not considered. The user should take care that the weight is used in cases where close nodes are more important than distant nodes. So, depending on the meaning of the attributes, one can use the value or its reciprocal. For example if the attribute represents the speed of a reaction instead of a distance, the reciprocal should be used.

eed4b08c-ce97-4bce-9907-f0016e8825f0_figure2.gif

Figure 2. A weighted network.

If the weight of the edge is not considered the shortest path from A to C is the directed edge, and the distance is 1. If the weights are considered the shortest path from A to C is the one passing through B, and the distance is equals to 5.

An example where weighted networks centrality analysis is used, is provided by Currie et al.12 where an euclidean distance is given to each edge depending on the difference between the phosphorylation level of the proteins connected by that edge.

All the graphical features of the previous version of CentiScaPe, such as the plot by node, the plot by centrality and the boolean-based result panel have been maintained in the new version. A complete guide can be found in Scardoni et al.11 or in the CentiScaPe userguide available from the website.

Conclusions

CentiScaPe 2.1 has been enriched with new centrality parameters such as Eigenvector, Bridging centrality and Edge Betweenness centrality, and with the possibility to analyze directed and weighted networks. It allows integrating centrality-based network analysis with experimental data. The results of the computation can be used and exported as Cytoscape attributes, allowing the user to exploit all the other features of Cytoscape and its apps. Compared to the built-in Analyze Network tool of Cytoscape, CentiScaPe is an excellent integrative tool allowing the identification of potential target nodes from both the topological and the experimental point of view, and can be considered as an essential instrument for the characterizations of nodes in order to drive further experiments.

Software availability

Software available from the Cytoscape App Store: http://apps.cytoscape.org/apps/centiscape

Latest source code: https://bitbucket.org/giovanniscardoni/centiscapepublic/

Source code as at the time of publication: https://bitbucket.org/F1000Research/centiscapepublic-archive

Archived source code as at the time of publication: http://dx.doi.org/10.5281/zenodo.1065213

License: Lesser GNU Public License 3.0: https://www.gnu.org/licenses/lgpl.html

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 01 Jul 2014
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Scardoni G, Tosadori G, Faizan M et al. Biological network analysis with CentiScaPe: centralities and experimental dataset integration [version 1; peer review: 2 approved with reservations]. F1000Research 2014, 3:139 (https://doi.org/10.12688/f1000research.4477.1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 01 Jul 2014
Views
79
Cite
Reviewer Report 11 Jul 2014
Ankush Sharma, Laboratory for Genomics,Transcriptomics and Proteomics, High Performance Computing and Networking Institute, National Research Council, Naples, Italy 
Approved with Reservations
VIEWS 79
In this research article, the authors describe the CentiScaPe app for Cytoscape (and its web version FastCentiScaPe) for computing the several centrality parameters in networks, analyzing the elemental level importance of each node in the network, based on network topology. ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Sharma A. Reviewer Report For: Biological network analysis with CentiScaPe: centralities and experimental dataset integration [version 1; peer review: 2 approved with reservations]. F1000Research 2014, 3:139 (https://doi.org/10.5256/f1000research.4788.r5341)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
71
Cite
Reviewer Report 07 Jul 2014
Ferenc Jordan, Centre for Computational and Systems Biology, University of Trento, Trento, Italy 
Approved with Reservations
VIEWS 71
Analysing several centrality measures in parallel is important and there is a need for this, however, it would be even nicer to compare them (see http://www.sciencedirect.com/science/article/pii/S0304380007001184 ) and discuss better their biological relevance (i.e. what is Eigenvector centrality good for?). ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Jordan F. Reviewer Report For: Biological network analysis with CentiScaPe: centralities and experimental dataset integration [version 1; peer review: 2 approved with reservations]. F1000Research 2014, 3:139 (https://doi.org/10.5256/f1000research.4788.r5358)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

Comments on this article Comments (0)

Version 2
VERSION 2 PUBLISHED 01 Jul 2014
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.