Introduction to graph theory 图论/脑网络基础
Source: Connected Brain
Figure above: Bullmore E, Sporns O. Complex brain networks: graph theoretical analysis of structural and functional systems.[J]. Nature Reviews Neuroscience, 2009, 10(3):186-198.
Graph measures
A graph G consisting of a set of vertices V and a set of edges E can be characterized by several measures, some relatively simple, others quite complex. Here we describe measures for unweighted networks. For most of these measures weighted versions have now been described as well.
- Degree
One of the most important and elementary measures is the degree, often indicated by k. The degree of a vertex is the number of connections or edges it has. The probability that a randomly chosen vertex will have degree k is given by the degree distribution, indicated by P(k). The form of the degree distribution provides important information about the structure of the network. As described below, different types of graphs have their own characteristic degree distribution.
- Clustering coefficient
The clustering coefficient of a vertex is the probability that the neighors of this vertex (all other vertices to which it is connected by an edge) are also connected to each other. The clustering coefficient of a vertex ranges between 0 and 1. The average clustering coefficient C of the whole network is the average of the clustering coefficients of all individual vertices. The clustering coefficient is considered to be a measure of the local connectivity or “cliqueness” of a graph. High clustering is associated with robustness of a network, that is resilience against random network damage.
- Motifs and modules
The clustering coefficient is a special case of a more general graph property referred to as motifs. The clustering coefficient depends upon the presence of the triangle motif, consisting of three vertices fully connected by edges. Other more complex motifs exists as well, and their presence in graphs can be quantified. Subgraphs that consist of sets of vertices that are more strongly connected to each other than to the rest of the network are called modules. Identification of modules within complex networks is important, since modules often correspond to different functional aspects of the networks, and modules may also be important for the way normal and abnormal activity can spread through the network. It is also possible to define modules within modules. Networks with such a structure are said to have a hierarchical modularity. The concept of a module is a statistical one. Different definitions of modularity exist, the most well known being the modularity as defined by Newman. Alternatively, modularity can also be defined in terms of the eigenvalues and eigenvectors of the graphs matrix. Please note that the clustering coefficient, motifs and modules are descriptions of network structure at increasingly larger scales.
- Pathlength and efficiency
Whereas clustering reflects local network structure, the shortest pathlength reflects the level of global integration in the network. A shortest path between two nodes A and B is the path between A and B with the smallest number of edges. The average shortest path L of a network is the average of all shortest paths between all pairs of vertices. The diameter of a graph is the longest of all shortest paths. Related to the idea of the average shortest path is that of global efficiency, which is the inverse of the average shortest path. The local efficiency of a particular vertex is the inverse of the average shortest path connecting all neighbors of that vertex.
- Assortativity
Related to the notion of degree is the concept of mixing or assortativity. If vertices with a high degree tend to be connected to other vertices with a high degree, and vertices with a low degree to other low degree vertices, the graph is said to be assortative. An assortative graph has a positive degree correlation. In a disassortative graph, the degree correlation is negative, and high degree vertices tend to connect to low degree vertices and vice versa.
- Centrality and hubs
Another important concept related to that of node degree is centrality. Centrality refers to the relative importance of a node or vertex within the network. Node degree is in fact one, relatively simple measure of centrality. A more sophisticated measure of centrality is betweenness. The betweenness centrality of a particular vertex is the fraction of shortest paths in the network that pass through this vertex. In a similar way, betweenness centrality can also be defined for edges (edge centrality). Another concept of centrality is based upon graph spectral analysis. Eigenvector centrality of a vertex is the value of the vector component, where the vector is vector that corresponds to the largest eigenvalue (spectral radius) of the adjacency matrix. If the notion of hub centrality is combined with the definition of modules it becomes possible to classify hubs. Hubs that are mainly connected to other vertices in the same module are referred to as provincial hubs; hubs that are mainly connected to vertices in other modules are called connector hubs. Provincial and connector hubs may play different functional roles within a network.
- Graph spectral analysis
Graph spectral analysis is an interesting alternative way to characterize the adjacency matrix of a graph and its related Laplacian matrix. The Laplacian matrix contains the node degree as diagonal elements, and -1 for all cells corresponding to existing edges and 0 for cells corresponding to absent edges. If the adjacency and Laplacian matrix are symmetrical, and eigenvalue / eigenvector analysis can be performed, resulting in a series of eigenvalues and corresponding eigenvectors. It is assumed that the series of eigenvalues and eigenvectors represent all information present in the graph. Some graph spectral measures have special significance. The largest eigenvalue of the adjacency matrix is called the spectral radius and is inversely related to the synchronization threshold of dynamical processes on the graph. In addition, the values of its corresponding vector are a measure of centrality. The spectral gap (the difference between the largest and the second largest eigenvalue) provides information on how rapid the synchronous state is reached. The second smallest eigenvalue of the Laplacian matrix is called the algebraic connectivity. It is a measure of network robustness. If the algebraic connectivity is 0, the network consists of at least two disconnected components. The ratio of the largest and the second smallest eigenvalues of the Laplacian matrix is a measure of the stability of the synchronous state of a dynamical process on the network. The information contained in graph spectral analysis can also be used to identify modules within the network.
Models of complex networks
Models are extremely important in modern network theory. It can be argued that the discovery of models for very large networks with a mixture of randomness and order lies at the heart of the transition from conventional graph theory to the modern science of networks. Here we describe three prototype models that illustrate many of the key principles.
- Random graphs
The oldest model of complex networks is that of random graphs as introduced by Rapoport and analyzed in detail by Erdos and Renyi. In a random graph G(V,E) edges between any pair of vertices exist with a probability p. The properties of the random graph have been studies extensively and many important mathematical results have been obtained. For instance, if p is increased from 0 to 1, the size of the largest connected component in the graph will undergo a phase transition at p=0.5. Random graphs have a low clustering coefficient, a small average shortest path length, no assortativity, a narrow degree distribution and not real hubs. While random graphs can explain some properties of real complex networks, notably the short distances between any two nodes, they fail in other respects. In particular, random graphs cannot explain the ubiquitous presence of clustering, modularity and hubs in real networks. Some of these problems were solved by the introduction of more sophisticated models at the end of the nineties in the last century.
- Small-world networks
Watts and Strogatz (1998) considered a network on a ring, where each vertex is connected to k neighbors, k/2 in the clockwise direction, and k/2 counter clockwise. This is an ordered, lattice-like network. It has a high clustering coefficient, a long average shortest path length, no modularity or assortativity, a symmetric degree distribution, and no hubs. Next, with a probability p, edges are disconnected and attached to a randomly chosen other vertex. For p = 1 all edges are reconnected, and a random network is obtained, with all corresponding properties such as low clustering and short path lengths. The interesting region is for intermediate values of p. Even for small but non zero values of p, with only a small fraction of rewired edges, the path length already drops to very low values, while the clustering coefficient still maintains its original high values. This type of network, that combines high clustering with short pathlengths, is called a small-world network. Despite its apparent simplicity it is in fact a very powerful model of many real networks that often display the same combination of high clustering and short pathlengths. However, the Watts and Strogatz (WS) model fails to explain other important properties of natural networks such as modularity and broad degree distributions with hub like nodes. This last problem was solved in another model.
- Scale-free networks
Barabasi and Albert proposed a model of a growing network. At each iteration, a new vertex is added, and it is connected to existing vertices with a probability that depends upon the degree of that node. As a consequence, nodes with a high degree are more likely to receive more connections, increasing their degree even further. This is an example of positive feedback or preferential attachment. The most interesting feature of the model is the shape of its degree distribution. After a sufficient number of iterations the degree distribution becomes a power law: P(k) = kgamma, where gamma = 3. This power law distribution reflects the presence of large number of highly connected nodes or hubs. Networks with a power law degree distribution are referred to as scale-free (SF). In contrast to WS networks, scale-free networks can explain the presence of hubs in networks, and suggest a growth scenario that gives rise to these hubs. For these reasons SF models have become very important in modern network research. However, even SF models have their limitations: they do not explain clustering very well, they are not assortative, and have no real modules.
Introduction to graph theory 图论/脑网络基础的更多相关文章
- 图论介绍(Graph Theory)
1 图论概述 1.1 发展历史 第一阶段: 1736:欧拉发表首篇关于图论的文章,研究了哥尼斯堡七桥问题,被称为图论之父 1750:提出了拓扑学的第一个定理,多面体欧拉公式:V-E+F=2 第二阶段( ...
- ACM学习历程—NPU1045 2015年陕西省程序设计竞赛网络预赛(热身赛)C题 Graph Theory(递推 && 组合数学 && 大数)
Description In graph theory, a matching or independent edge set in a graph G = (V , E) is a set of e ...
- The Beginning of the Graph Theory
The Beginning of the Graph Theory 是的,这不是一道题.最近数论刷的实在是太多了,我要开始我的图论与树的假期生活了. 祝愿我吧??!ShuraK...... poj18 ...
- 迈向高阶:优秀Android程序员必知必会的网络基础
1.前言 网络通信一直是Android项目里比较重要的一个模块,Android开源项目上出现过很多优秀的网络框架,从一开始只是一些对HttpClient和HttpUrlConnection简易封装使用 ...
- Codeforces 1109D. Sasha and Interesting Fact from Graph Theory
Codeforces 1109D. Sasha and Interesting Fact from Graph Theory 解题思路: 这题我根本不会做,是周指导带飞我. 首先对于当前已经有 \(m ...
- 2018 Multi-University Training Contest 4 Problem L. Graph Theory Homework 【YY】
传送门:http://acm.hdu.edu.cn/showproblem.php?pid=6343 Problem L. Graph Theory Homework Time Limit: 2000 ...
- Graph Theory
Description Little Q loves playing with different kinds of graphs very much. One day he thought abou ...
- 网络基础,tpc,udp
一 , 网络基础相关知识 1. 架构 (重点) C / S 架构 : client 客户端(APP) 和 server 服务器端 能充分发挥pc机的性能 B / S 架构 : browser 浏览器 ...
- An Introduction to Measure Theory and Probability
目录 Chapter 1 Measure spaces Chapter 2 Integration Chapter 3 Spaces of integrable functions Chapter 4 ...
随机推荐
- RHEL6.4 + Oracle 11g DG测试环境快速搭建参考
环境现状: 两台虚拟主机A和B: 1. A机器已安装ASM存储的Oracle 11g 实例 参考:http://www.cnblogs.com/jyzhao/p/4332410.html 2 ...
- 利用C#开发移动跨平台Hybrid App(一):从Native端聊Hybrid的实现
0x00 前言 前一段时间分别读了两篇博客,分别是叶小钗兄的<浅谈Hybrid技术的设计与实现>以及徐磊哥的<从技术经理的角度算一算,如何可以多快好省的做个app>.受到了很多 ...
- 如何在SSM项目配置springMVC校验框架validator
1.在springMVC配置文件配置添加如下信息 <!-- 表单验证框架 --> <bean id="validator" class="org.spr ...
- JavaWeb_day01_HTTP_Servlet
本文为博主辛苦总结,希望自己以后返回来看的时候理解更深刻,也希望可以起到帮助初学者的作用. 转载请注明 出自 : luogg的博客园 谢谢配合! JavaWeb_day01 HTTP协议 HTTP(H ...
- 最小生成树计数 bzoj 1016
最小生成树计数 (1s 128M) award [问题描述] 现在给出了一个简单无向加权图.你不满足于求出这个图的最小生成树,而希望知道这个图中有多少个不同的最小生成树.(如果两颗最小生成树中至少有一 ...
- Oracle Blob数据保存为文件
好久不写文,最近得空写一点.Oracle数据库国内用户量主要在企业上,其中有一种byte的存储称为Blob,并不能直接看. 有时候为了调试需要,可以通过: ,)) ; 这种sql去转为字符串查看,但是 ...
- C标准头文件<math.h>
定义域错误可以理解为超出了函数的适用范围,如果发生了定义域错误,设errno为EDOM 如果结果不能表示为double值,则发生值域错误,如果结果上溢,则函数返回HUGE_VAL的值,设errno为E ...
- Linux常用命令(一)
Linux常用命令 1. pwd查看当前路径(Print Working Directory) [root@CentOS ~]# pwd/root 2. cd .. 返回上一级 .. 表示上一级 ...
- dubbo连接zookeeper注册中心因为断网导致线程无限等待问题【转】
最近维护的系统切换了网络环境,由联通换成了电信网络,因为某些过滤规则导致系统连不上zookeeper服务器(应用系统机器在深圳,网络为电信线路,zookeeper服务器在北京,网络为联通线路),因为我 ...
- [JS,Canvas]日历时钟
[JS,Canvas]日历时钟 Html: <!doctype html> <html> <head> <meta charset="UTF-8&q ...