Professional Documents
Culture Documents
Introduction to Network
Analysis in Systems Biology
Avi Maayan, Ph.D.
Department of Pharmacology and
Systems Therapeutics
Systems Biology Center New York (SBCNY)
A1
A2
B1
E1
A1
A2
B1
E1
C1
E2
E3
D1
C1
E2
E3
D1
D2
A3
D3
C2
D2
A3
D3
C2
A1.2
A1.1
A2
B1
E1
E3
D1
A- gene ontology
B- protein-protein interactions
(undirected graphs)
c
d
A1
A2
B1
E1
C1
E2
E3
D1
C1
E2
D2
A3
D3
C2
D2
A3
D3.1
D3.2
C2.2
C2.1
e
D2
E2
A2
A1.1
A1.2
B1
E1
D1
C2.1
C2.2
A3
E3
D3.1
D3.2
C1
http://en.wikipedia.org
Metabolic
Networks
Glycolysis
Cell Signaling
Pathways
Gi/o Pathway
Nodes are proteins, metabolites, lipids,
second messengers, or peptides
Interactions designate information flow, can
be activation or inhibition, and are direct and
physical
Signaling
pathways are not
isolated and can
be merged into
large networks
Kinase-Substrate Network
Protein kinasesubstrate networks
are directed bipartite
graphs that connect
kinases to their
substrates through
protein
phosphorylation
10
Worm
14
Identified disease
genes
Assessed overlap with
literature-based
interactions
Used GO annotation
Blue- literature
Red- Y2H screen (~78% verified by Co-IP)
15
16
17
18
19
20
Drug-Target Networks
Drugs can be connected to their known protein targets
22
http://vlado.fmf.uni-lj.si/pub/networks/pajek/
23
24
Summary
Different types of biological intracellular molecular
networks can be represented by different types of
graphs
Networks can be created from collecting interactions
published in many papers, or networks can be
reconstructed directly from data
Protein interaction networks and cell signaling
networks can be connected to drugs and diseases
Network representation can be used to integrate
different datasets using genes as anchors
25
Introduction to Network
Analysis in Systems Biology
Avi Maayan, Ph.D.
Department of Pharmacology and
Systems Therapeutics
Systems Biology Center New York (SBCNY)
26
27
29
Clustering Coefficient
30
31
Watts DJ, Strogatz SH. Collective dynamics of 'small-world' networks. Nature. 1998 Jun 4;393(6684):440-2.
Barabasi, Albert and colleagues found that many real networks including the
Internet and the WWW are scale-free. This means that the connectivity
distribution of nodes fits a power-law.
Barabasis group analyzed databases of metabolic networks in lower organisms
and the protein-protein interactions map of the yeast proteome inferred from highthroughput yeast-2-hybrid screens. All shown to have scale-free connectivity
distribution.
32
33
34
35
37
Graphlets motifs in
undirected networks
Evolutionary conservation of
motif constituents in the
yeast protein interaction
network
S Wuchty et al.
Nature Genetics
35, 176 179 (2003)
39
40
Ma'ayan et al. Phys Rev E Stat Nonlin Soft Matter Phys. 2006 73:061912
Power-law distribution of branched pathways
A topology common for systems that need to make discrete
decisions based on a continues complex state of the environment
42
43
44
45
46
MacArthur, Sanchez-Garcia and Maayan, Phys. Rev. Lett. 104, 168701 (2010)
47
Summary
Real networks are small world and scale free
Simple algorithms can recreate the structure of real
networks
Shuffled networks are created for statistical control
Network motifs and graphlets define the topology at
the microscopic level
Real biological regulatory networks have date-andparty hubs, hubs are either multi or single site,
pathways branching follows a power-law, signaling
networks display bow-tie structure, bifans are highly
enriched, feedback loops are depleted and nested to
48
provide dynamical stability.
Introduction to Network
Analysis in Systems Biology
Avi Maayan, Ph.D.
Department of Pharmacology and
Systems Therapeutics
Systems Biology Center New York (SBCNY)
49
50
51
52
+
Computational Modeling
Experiments
(High-content)
53
54
Protein-DNA Arrays:
Measuring Transcription Factor Activation
20 min
DMSO
AP-2
RAR
CREB
MYB
STAT3
PAX6
55
signal
Transcription
Factor
Consensus promoter sequence
Transcription
Factor
Consensus promoter sequence
Genes2Networks
29,317
List of
TFs
_____
_____
_____
_____
18,675
7,241
Vidal
Stelzl
Filter
6,149
4,242
3,155
Integrator
Unfiltered
Dataset
Filtered
Dataset
3,121
Genes2Networks
Output
subnetwork
1,418
1,059
242
Significant
Intermediates
56
Algorithm
Output
Subnetwork
connecting
the seed
nodes
Table with
ranked
intermediate
proteins
Genes2Networks
Web Interface
- Hash function for fast loading
of the datasets
- Implementation of AJAX
allows changing the page
without reloading
- GraphViz, Overlib, and
PerlMagic library utilization
http://actin.pharm.mssm.edu/genes2networks
58
59
60
Experimental Validation
Noonans Symptoms
- Heart Defects
- Distinct Facial Features
- Learning Difficulties
- Bruising and Bleeding
- Mild up regulation in the MAPK pathway (gain of function mutations)
- Four disease genes were identified in about 60% of patients
62
Use known disease genes to build a network around these genes to identify new
genes/nodes that could be additional disease genes
Cordeddu V, Di Schiavi E, Pennacchio LA, Ma'ayan A, et al. Nat Genet. 41:1022 (2009)
63
65
66
KEA- kinase-substrate interaction database and webbased system for kinase enrichment analysis
http://amp.pharm.mssm.edu/lib/kea.jsp
Lachmann and Maayan. Bioinformatics 11, 87 (2010)
67
118
107
35286
>150
69
Lachmann A, Xu H, Krishnan J, Berger SI, Mazloom AR, and Maayan A. ChEA: Transcription Factor
Regulation Inferred from Integrating Genome-Wide ChIP-X Experiments. Bioinformatics, 26, 2438-44 (2010)
Summary
Prior knowledge networks can be used to predict
function of proteins, protein interactions and disease
genes
Different algorithms can be used to connect seed
lists of proteins with known interactions from prior
knowledge networks
Network analysis can be used to develop hypotheses
for functional experiments by combining highthroughput profiling data with prior knowledge
networks
70