Community detection and graph partitioning

Document Type:Thesis

Subject Area:Physics

Document 1

Newman Department of Physics and Center for the Study of Complex Systems, University of Michigan, Ann Arbor, MI 48109 ABSTRACT A variety of methods have been brought up for community detection in networks. This first section illustrates two most used types of inference strategies which can be mapped out on a standard minimum cut graph partition directly. The second section highlights the importance of community structure, by determining its robustness when small perturbations are made in the network. The third section is based on finding a good community network in a variety of networks which are basically large. The fourth section is basically about the overlapping and or disjoint of generative networks. A Poisson model of this version is utilized. The poison method is used because it provided a straight forward analysis.

Sign up to view the full document!

An important question is asked in order to lay emphasis on the statistical inference model. What must be the parameters, if an assumed observed type of network is generated in relation to our model? The question here is the determination of the ωin and ωout, which are used in the generation of the network and also which vertices fall in the various groups. An answer to the question above is what is needed in getting the true estimate of the community structure. This is based on the Laplacian spectral bisection strategy which is used in the partitioning of graphs and was introduced by Fiedler. The Fieldler vector is determined by the approximation of the minimum cut division of the network into parts which are of specified sizes.

Sign up to view the full document!

This vector is referred to as the eigenvector of this graph of the Laplacian matrix, L, which is in correspondence with the smallest eigenvalue. n*n symmetrical matrix illustrates the Laplacian graph, where L=D-A. A is referred to as the adjacency matrix and then D is referred to as the n*n matrix. This method utilizes a simulated annealing technique which is used in combination with a parallel tempering scheme for glassy systems. This observation can be utilized in the identification of a community structure of the type that is only found in random graphs and which specifically occurs due to fluctuations. The method is used also to rule out random graphs. From these experiments and studies, it is now clear that when very minute changes are made to a network, for instance, the removal or addition or several edges or vertices, can result in the general value of the modularity of various partition of the network.

Sign up to view the full document!

QUANTIFICATION OF NETWORK ROBUSTNESS The major areas of concern in this subtopic are based on the quantification and the various permutations of the network which eventually result in the change of the community structure. This method illustrates a true metric value on community assignments. Finding community structure in very large networks A variety of methods and proposals are made which can be used in the identification and evaluation of structure in large networks. They are however not implemented since they prove to have very high computational costs. A good proposal, however, has been implemented which is based on a hierarchical agglomeration algorithm which is used for the detection of community structure and is very fast as compared to other forms or algorithms.

Sign up to view the full document!

The running time of this algorithm based on a network with m edges and n vertices is O (d, n). The following equation is then obtained after the substitution of the summation of the edges and vertices Cv, I, j Cw, to give, The operation and calculation of the above algorithms, inculcate the finding of the various changes that occur in Q, which occasionally occur from the combination and or amalgamation of the pairs of communities. An efficient and principled method for detecting communities in networks In the analysis of network data, the aspect of network community detection becomes a fundamental issue and problem, especially for groups that are overlapping and or has disjoint, or in other words has densely populated interconnected nodes.

Sign up to view the full document!

When these nodes overlap, there is a problem of detection of network data. A principled statistical approach can be utilized for networks that are generative based models. A method that can be used in the analysis of millions of nodes, in fast and reasonable running time is what is needed for such community networks. The equation below, therefore, is used to aid in obtaining the probability of generating a graph, denoted as G, with Aij adjacency matrix properties, After dropping and rearranging all the multiplicative and additive constants, the following log is derived, In the maximization of the expression, by differentiation, we can first apply an inequality known as Jenses’ in the following format, IMPLEMENTATION The implementation of the above-outlined method of detection can be utilized in various strategies.

Sign up to view the full document!

One, it can be implemented directly, in the form of an algorithm that is computer-based, which is used for finding overlapping communities and can work well for networks that are of moderate sizes, for instance, over ten thousand vertices. The implementation of the above method for large networks is main,y due to their run time and memory usage. The memory of the algorithm is mainly evaluated by the amount of space that is needed to store the available parameters. Working with these parameters, we can work with an average number Kiz, which has ended with z color, and are connected to the I vertex, hence, We can also go ahead and find the calculation of the average number of Kz of edges with the z color and which have a summation on all the vertices, hence, Synthetic and Real-world networks In the detection of community networks, both synthetic and real-world network systems can be utilized.

Sign up to view the full document!

From $10 to earn access

Only on Studyloop

Original template

Downloadable