Module1 Science of Network Analysis
Module1 Science of Network Analysis
Module1 Science of Network Analysis
University of Pittsburgh
Both
Fairly
Course details
Highly
technical course
Grade
Homework (15%)
Pop-up quizzes (15%)
One
http://www.sis.pitt.edu/~kpele/telcom2125.html
Textbook
There
possible references
The
A graph
Graph terminology
A
When
Depending
we denote an edge between vertices i and j by (i,j) then the complete network can be spec
ving the value of n and a list of all the edges. For example, the network in Fig. 6.1a has n
Example
es and edges (1,2), (1,5), (2,3), (2,4), (3,4), (3,5), and (3,6). Such a specification is calle
list. Edge lists are sometimes used to store the structure of networks on computers, bu
ematical developments like those in this chapter they are rather cumbersome.
re 6.1: Two small networks. (a) A simple graph, i.e., one having no multiedges or self-ed
8
network
with both multiedges and self-edges.
The
Aij
1,
0,
Example
10
nts to notice about the adjacency matrix are that, first, for a network with no self-e
Observations
For
For
If
Multiedges
11
Weighted networks
Some
In
12
Directed networks
In
relations
networks/graphs
The
13
are
captured
through
directed
Example
ected graph, also called a digraph for short, is a network in which each
nting from one vertex to another. Such edges are themselves called
represented by lines with arrows on themsee Fig. 6.2.
(6.6)
etwork.
A small directed network with arrows indicating the directions
14
Co-citation coupling
The
k1
k1
C AA
15
Co-citation coupling
The
Cii A Aik
k1
2
ik
k1
16
Bibliographic coupling
The
k1
k1
B A A
17
the
corresponding
Bibliographic coupling
The
k1
k1
The
Notes on coupling
While
They
While
19
Acyclic directed
networks
A cycle in a directed network.
A
can depict
a network
as in Fig.
6.3,those
with thewith
vertices time Networks without we
cycles
are such
called
acyclic,
while
to top of the picture in this caseso that all the edges representing
An
20
21
precisely, since the acyclic graph does not have self loops,
the diagonal elements are zero strictly triangular matrix
All
22
Bipartite networks
In
The
23
Bij
1,
0,
One-mode projections
We
One
24
ach of those four is connected to each of the others in the one-mode projection b
on membership in that group. (Such a clique of four vertices is visible in the ce
projection in Fig. 6.5.) Thus the projection is, generically, the union of a number
r each group in the original bipartite network. The same goes for the other proje
oups.
Projections
25
The total number of groups Pij that i and j belong is given by:
g
k1
k1
The
Trees
Connected,
27
Trees
Since
28
Degree
The
ki Aij
j1
n
n
And the number of 1edges
of1angraph
is given by:
k A
2
2
i
i1
Mean
29
ij
i1 j1
degree c of a vertex
n in an undirected graph is:
1
2m
c ki
n i1
n
Connectance
The
Connectance
n 1
n(n1)
2 2
2m
c
n(n1) n1
is called dense
A network for which the density tends to zero as n is called sparse
30
k Aij
in
i
j1
The
j1
31
k Aji
out
i
i1
ij
Paths
A
Length
path
When a path traverses the same edge e two times, e is counted
twice
32
33
A = UKUT
U
Ar (UKU T )r UK rU T
Lr TrAr Tr(UK rU T ) Tr(U TUK r ) TrK r ir
i
34
We
A=QTQT
Q
is orthogonal
T is upper triangular
o Its diagonal elements are its eigenvalues
*They are equal with those of A (why?)
35
Geodesic paths
A
By
Diameter
36
Other definitions are extensively used in the literature as well, such as, the
average value of all geodesic paths in the network etc.
Self avoiding
Konisberg Bridge Problem
37
Components
A
network
there is no
Note, however, that the vertex labels must be chosen correctly to pro
appearance of blocks in the adjacency matrix depends on the vertice
represented by adjacent rows and columns and choices of labels t
produce non-block-diagonal matrices, even though the choice of la
of the network itself. Thus, depending on the labeling, it may
for which structure
there
pairs
thatcomp
obvious fromexists
the adjacency
matrix of
that avertices
network has separate
exist them
computer is
algorithms,
suchdisconnected
as the breadth-first search algorithm
path between
called
that can take a network with arbitrary vertex labels and quickly determi
Component
38
Assuming
Imposing
39
y if there is a directed path between them, i.e., a path in which we follow edges onl
d direction.
It would be useful to define components for directed networks based on s
Visually
ths, but this raises some problems. It is certainly possible for there to be a directed
x A to vertex B but no path back from B to A. Should we then consider A and B t
Are they in the same component or not?
13: Components in a directed network. This network has two weakly conne
s of four vertices each, and five strongly connected components (shaded).
40
Out-components
The
All
All
41
In-components
The
All
As
42
Theor number
of6.10independent
paths
between
pair of
vertices
edges, as in Fig.
for instance (page 140).
If we restrict
ourselves toaindependent
paths, then the number of paths between a given pair of vertices is much smaller. The number of
vertices is called connectivity of the vertices
independent paths between vertices gives a simple measure of how strongly the vertices are
connected
to one another,
and has been the topi c of much study in the graph theory literature.
Edgeand vertex-connectivity
43
Cut set
A
44
Mengers theorem
The
45
Max-flow/min-cut theorem
In
Minimum edge cut set is a cut set such that the sum of the
weights on the edges has the minimum possible value
The
46
Diffusion
Suppose
The
Diffusion
Using
d
CLequation
0
diffusion
dt
48
d
C(A D)
dt
We
Lij ij ki Aij
(t) i (t)vi
i1
We
d i (t)vi
n
n d (t)v
Lvi ivi n d (t)
i
i
i
i1
CL i (t)vi 0 (
C i (t)Lvi ) 0 (
Ci i (t))vi 0
dt
dt
dt
i1
i1
i1
Since
49
d i (t)
Cii 0, i i (t) i (0)eCit
dt
Let
one end of an edge annotated with 1 and the other with 2. Then
we can define the mxn edge incidence matrix
BkiBkj
50
Hence,
L=BTB
Consider
T
T
T T
T
Lvi Hence
i vi
v
L
v
v
B
B
v
v
v
(B
v
)
(B
v
)
i
i
i
i
i
i
i
i
i
i
i
i
i
0
i
It
51
First component has n1 vertices, second has n2 and the c-th has nc
The Laplacian (just as the adjacent matrix) can be written in a block
diagonal form
There are c eigenvectors of L with eigenvalue
zero vectors that have ones in all positions
corresponding to vertices in single components
and zero elsewhere. E.g.,
The
52
Random walks
A
Let
53
pi (t)
j
Aij
kj
1
pj (t1) p(t) AD p(t1)
Random walks
Since
1
pi () pj () p AD p
j kj
The
54
this case we know that there is only one null eigenvalue, which
corresponds to an eigenvector with all components equal:
If
D p 1 p D1 pi ki
In a connected
network the probability that a random walk will be found in a
1
vertex is proportional to its degree
ki
ki
pi
kj 2m
55
56
t[ pv(t) pv(t1)]
t0
For
In
any iv
pi (t)
j
Aij
kj
pj (t1)
Aij
j (v)
pj (t1)
the sum there are no terms Avj and hence we can write:
AD1 p(t1)
[ AD1 ]t p(0)
p(t)
p(t)
Where vector p does not include the entry for v and matrices A
and D do not include the v-th row and column
57
Substituting
T
T
T
1 1 p(t1)]
t[ pu(t) pu(t1)] t[1 1 p(t)
t1 [ p(t1)
p(t)]
t0
t0
t0
t0
T
1 t1
1 t
T
1 t1
1 t
(AD ) p(0)]
t1 p(0)[(
t1 [( AD ) p(0)
AD ) ( A D ) ]
t0
1 [I AD1 ]1 p(0)
58
t0
we further simplify:
get:
T
1
1 DL p(0)
59