Structures and organization in complex systems Minimal paths between communities induced by geographical networks Henrique Ferraz de Arruda∗ Institute of Mathematics and Computer Science at São Carlos, University of São Paulo, São Carlos, SP, Brazil César Henrique Comin† and Luciano da Fontoura Costa‡ Institute of Physics at São Carlos, University of São Paulo, São Carlos, SP, Brazil Inthisworkweinvestigatethebetweennesscentralityingeographicalnetworksanditsrelationship with network communities. We show that vertices with large betweenness define what we call 5 characteristic betweenness paths in both modeled and real-world geographical networks. We define 1 a geographical network model that possess a simple topology while still being able to present such 0 betweennesspaths. Usingthismodel,weshowthatsuchpathsrepresentpathwaysbetweenentryand 2 exitpointsofhighlyconnectedregions,orcommunities,ofgeographicalnetworks. Bydefininganew ct network,containinginformationaboutcommunityadjacenciesintheoriginalnetwork,wedescribea O meanstocharacterizethemesoscaleconnectivityprovidedbysuchcharacteristicbetweennesspaths. 9 PACSnumbers: 89.75.Fb 1 ] I. INTRODUCTION tweenness centrality, and the movement of people in a h city have also been shown to be power-law distributed, p meaning that a few paths concentrate most of the city - c As a consequence of their potential to represent real traffic. o systems, complex networks have become the subject of Given that a hierarchical structure of pathways has s growing interest in recent years [1]. Some examples of . been observed in a range of systems, it is interesting to s systems that have been studied using complex networks c studythereasonswhytheyareformed,theirimportance are electric power grids [2], airline routes [3], the World- i for the correct operation of the system, their influence s WideWeb[4],amongothers[5]. Networkswhosevertices y in the network flow, and in which kind of geographi- h are embedded in a space with well-defined coordinates cal networks the betweenness paths can be found. To p and their connectivity is related to the distance between better understand the hierarchical structures, it is nec- [ the vertices are named geographical networks [6]. Ex- essary to study how the network topology is inducing amples of geographical networks include street, airline, 2 thosepaths. Furthermore,theknowledgeaboutbetween- v and neural networks. Models to generate artificial net- nesspathsandtheircharacteristicscanbeusedforbetter 8 works with geographic dependency have been created [7] planning the system. 2 in order to better understand the real systems. 7 In this work, we study street networks composed by 2 The spatial influence on the connectivity gives rise to two hierarchical levels: a main structure forming the 0 many interesting characteristics [8]. In a previous work backboneofthenetwork,andsecondaryconnectionssup- . 1 about the influence of gene expression on neuronal char- ported by such backbone. We considered networks gen- 0 acteristics [9], we computed the betweenness centrality eratedfromgeographicmodelsandreal-worldstreetnet- 5 [10] of neuronal networks. While visualizing those neu- works. From these networks, we identified connected 1 ronal networks we observed that the vertices having the vertices with high betweenness centrality. We observed : v highest betweenness centrality values were connected to- that these vertices provide a good covering of the topol- Xi gether creating chains of vertices, which here we call be- ogy, in the sense that any network vertex is aways near tweenness paths. The presence of betweenness paths in at least one or more characteristic betweenness paths. r a theneuronalnetworksuggestsatwo-levelhierarchyofthe We found that such paths tend to be related with the system structure, where the betweenness paths form the community structure [19] of the network. So, to bet- main pathways for information flow and the remaining ter understand the characteristic betweenness paths and paths distribute this information locally. Such hierarchi- its relationship with communities, we created a random calorganizationhasalsobeenobservedforroadnetworks geographic model and used community detection to di- [11–15]. In [16] its argued that road networks should be vide the space in well-defined regions. In order to study planned in different scales, according to their purpose, the relationship between such communities and their ge- functionormanagementpolicies. Suchscalesrangefrom ographical positions, we created a sub-summed commu- the entire city level down to local streets used for prop- nitynetwork,whereeachvertexisadifferentcommunity ertyaccessorpedestrianmovement. Accordingto[17,18] and the connection weights are based on the amount of thetrafficinroadnetworks,modeledbyapplyingthebe- shortest paths crossing the community border. 2 This paper is organized as follows. Section II presents thismeasurecanquantifytheloadofeachvertex[27]. It the basic network theory concepts that are used in this is also used in the cascading-failure dynamics [28]. paper. In Section III we discuss betweenness paths in In addition to the centrality measures, another impor- network models and real cases. In Section IV we create tant concept studied in complex networks theory is the a network model to better understand the betweenness communitystructureofthesystem. Whereacommunity paths. InSectionV,wedescribetherelationshipbetween isdefinedasasetofverticesdenselyinterconnected, and geographic and topological measurements. with few connections to vertices outside this set. This property can be found in many real networks such as so- cial networks [29], metabolic networks [30], and in the II. BASIC CONCEPTS structure of cities [31]. In city networks, those commu- nities are shown to be subdivisions of urban structure There aremany ways tocomputenetworkcentralities, emerging urban elements and their interactions [31]. each being developed to evaluate different network char- A common approach to quantify how well-defined are acteristics [7]. the communities in the network is the modularity mea- In order to locally quantify the network centrality, it sure [19]. The modularity is represented as is possible to measure the degree k for each vertex i. i (cid:18) (cid:19) Furthermore, in weighted networks, we can consider the Q= 1 (cid:88) A − kikj δ(c ,c ), (4) edges to define a measurement called strength [8], which 2m ij 2m i j i,j is computed for each vertex i as (cid:88) wheremisthenumberofnetworkedges,Aisthenetwork S = w , (1) i ij adjacency matrix, k and k are the vertex degree i and i j j∈ri j, respectively, c and c are the communities of vertex i j where ri is the set of neighbors of vertex i and wij is the i and j, respectively, and δ(ci,cj) is 0 if ci (cid:54)= cj or 1 if weight of the edge connecting i to j. ci =cj. The network centrality can also be computed globally, Networkcommunitiescanbedetectedthroughtheop- that is, considering the whole network structure. One timization of modularity. For a comprehensive review of important global centrality measure is the stress central- modularity optimization methods, please refere to [32]. ity (SC ) [20]. This measure is defined for each vertex u In this work we use the fast greedy [33] algorithm to de- u as tect the communities. (cid:88) SC = σ(i,u,j), (2) u ij III. CHARACTERISTIC BETWEENNESS PATHS where σ(i,u,j) represents the number of shortest paths between the vertices i and j crossing the vertex u. If In this section we define and analyze the character- one vertex is removed from the network it can affect the istic betweenness paths. We compute the betweenness stress of all other vertices. centrality of all the vertices. Most of these vertices are Anotherimportantglobalmeasurementisthebetween- connected, creating chains of vertices. Thus, most of nesscentrality,whichalsoconsiderstheamountofshort- them have degree two. We define a betweenness path as estpathscrossingavertices[10]. Thismeasureisdefined a sub-network considering only the the highest between- as ness centrality values. In other words we selected only (cid:88)σ(i,u,j) verticeswithbetweennesscentralityhigherorequalthan B = , (3) u σ(i,j) athresholdandcreatedasub-networkcomposedbysuch ij vertices. In order to do so, we compute the betweenness where σ(i,u,j) represents the number of shortest paths centrality of the vertices in a network and divide them between the vertices i and j crossing the vertex u and intotwogroups. Verticeshavingbetweennesslargerthan σ(i,j) is the total amount of shortest paths between i athresholdT arecalledcharacteristic vertices, whilethe and j. The betweenness centrality can be implemented rest of vertices in the network are called secondary ver- using a fast algorithm [21, 22], where the betweenness tices. centrality is computed and the result is used to compute Wewillshowfournetworksinwhichthecharacteristic the stress centrality. paths can be observed. Two of these networks are artifi- Betweenness centrality is frequently used to study cially generated, and the other two are street networks. flowsinnetworks[23–25]andcanbeusedinseveralappli- In the first model, known as Waxman [34], the vertices cations. Insocialnetworksitcanbedescribedintermsof are organized randomly in a plane. For each vertex, the its message communications, considering that each mes- probability to connect with all other vertices follows an sage take the shortest path [26]. In power grid networks, exponential decay of the distance between them. In the 3 second model, known as random geometric graph [35], 1.0 the vertices are randomly organized and the connections are created only when the geographical distance is lower than a fixed value. We compared the network models 0.8 with two real street networks: from San Joaquin County in USA and Oldenburg city in Germany [36]. s e In order to find a reasonable value for the threshold, ertic0.6 we computed the fraction of characteristic vertices ac- ofv cordingtosomeselectedvaluesofbetweennessthreshold. on cti0.4 The fraction is found by dividing the number of charac- Fra teristic vertices by the size of the network. We perform these tests using the street networks, the results can be Oldenburg (giant) 0.2 Oldenburg (characteristic vertices) seen in Figure 1. Moreover, we also plot in Figure 1 San Joaquin (giant) the largest connected component (giant component) of San Joaquin (characteristic vertices) characteristic vertices. The majority of the characteris- 0.0 100 101 102 103 104 105 106 107 108 tic vertices are included in the giant component for all Betweennessthreshold different threshold values. Furthermore, the curves are similarforbothcities. Thereisnoindicationofaproper FIG.1. (Coloronline)Theplotshowsthefractionofcharac- threshold for defining the characteristic vertices. There- teristic vertices against the betweenness threshold. For each fore, we empirically selected a threshold for which 20% networkwealsoshowthenumberofcharacteristicverticesin of the network vertices become characteristic vertices. If the giant component. we consider the subnetwork formed by such vertices and their respective connections it is possible to show that street networks this result can indicate a tendency that for each analyzed network an interesting structure is ob- the volume of cars is bigger in some streets than in oth- tained. Such structure contains a chain-like connectivity ers if we consider that all drivers are using the shortest betweenthecharacteristicvertices,meaningthatthema- paths for the formation of high flow streets. jority of vertices have degree two. Such chain-like struc- In every example we could observe the characteristic turecontainsmostoftheflowinthenetwork,represented betweenness paths and it was possible to identify more by the betweenness, and can be seen as the backbone of than one component with high betweenness centrality in the original network. The secondary vertices rely upon thesamenetwork. Thecomponentsizesweredifferentfor this backbone for proper communication with the rest of each network. In the random geometric model, the giant the system. The characteristic betweenness paths were component represents 85% of all vertices in the charac- also found in other works, but their origins were not an- teristicpath. Intheothernetworksthegiantcomponent alyzed [12, 13, 17]. represents more than 98% of all vertices in the paths. In order to illustrate the betweenness paths, we simu- We analyze the coverage of the characteristic paths. lated the Waxman network with 1707 vertices and aver- In order to do so, we measured the shortest distance age degree 2.20 (The network is shown in Figure 2(a)). between each secondary vertex i and preferential paths The random geometric graph has 4883 vertices and av- (represented as d ), because the shorter the distances i erage degree 6.18 (as can be shown in Figure 2(b)). The are, the better are the paths coverage. In order to pro- street network from San Joaquin has 14503 vertices and vide a fair comparison between different networks, the average degree 2.75, and the network representing Old- calculated distances are normalized by the average path enburgstreetshas2873verticesandaveragedegree2.57. length, l, of each network. Therefore, we define the nor- TheOldenburgnetworkisshownFigure2(c),whereeach malized shortest path distance, Θ , as i colorrepresentsadifferentcommunity. TheSanJoaquin d network is not shown in Figure 2 because the network Θ = i. is very large, which hinders a proper visualization of its i l paths and communities. Wegenerate60samplesofeachmodel. Inthiscase,we For each network we computed the betweenness cen- fitthemodelvariablesinordertocomparewiththeOld- trality and highlighted the 20% of vertices with largest enburg network (modularity 0.92). The samples of Wax- betweenness values, which defines the characteristic be- man model have 2976.70±16.61 vertices, average degree tweenness paths. The characteristic betweenness paths 2.71±0.04, and modularity 0.79±0.01. The samples of occurredinallexamples,whicharethecharacteristicver- random geographic model have 2892.33±58.46 vertices, tices. By visual inspection, in each network the charac- average degree 4.50±0.09, and modularity 0.97±0.001. teristicverticesseemtodefineabackbonestructurehav- In order to do a comparison considering the statistical ing many two-degree vertices. Such backbone is formed variations of the models, these results are shown as his- by what we call the characteristic betweenness paths. In tograms in Figure 3. We observed that the fraction of 4 (a) (b) (c) FIG. 2. (Color online) Geographical networks, where black circles and highlighted edges show the preferential paths. Item (a) represents the network of Waxman model, item (b) represents the network of the random geometric model, and item (c) represents the network of Oldenburg streets, where each color represents a different community. 5 vertices with the fewer distance (first bin in the his- Random geometric model 0.9 togram) is longer in the random geometric model than 0.8 0.7 the Waxman model comparing the two models, we ob- cy0.6 n servethatsecondaryverticestendtobeclosertocharac- ue0.5 q0.4 teristic paths in the random geometric graph model, but Fre0.3 0.2 in this network the longest distance is small than that 0.1 case (as can be seen in Figures 3(a) and (b)). Further- 0.00.0 0.2 0.4 0.6 0.8 1.0 more, the standard deviation is smaller in two networks Θ forbothmodels,showingthatthiseffectisnotastatisti- (a) calartifactcausedbyfluctuation. Theshortestdistances werealsoanalyzedforthestreetnetworks(ascanbeseen 0.9 Waxman in Figures 3(c) and (d)). The dashed lines represent the 0.8 0.7 y average values of Θ, which we call the coverage of the c0.6 n e0.5 characteristicpaths. ThecoverageofOldenburgislarger qu0.4 thanforSanJoaquin,whichmaybeaconsequenceofthe Fre0.3 0.2 higher complexity of Oldenburg’s topology [36]. 0.1 0.0 0.0 0.2 0.4 0.6 0.8 1.0 Θ IV. CHARACTERISTIC PATHS MODEL (b) Oldenburg Inordertobetterunderstandtheemergenceofcharac- 0.9 0.8 teristic paths in geographic networks, we define a model 0.7 y c0.6 abletogeneratenetworkswithasimplelattice-liketopol- n e0.5 u ogy, but still containing a well-defined backbone struc- q0.4 ture. The initial structure of the network is a 2D regu- Fre00..32 0.1 lar lattice, where the vertices are organized in lines and 0.0 0.0 0.2 0.4 0.6 0.8 1.0 equally spaced columns. First we defined the size, L, of Θ thenetwork(numberoflinesandcolumns)andtwovari- (c) ables, the maximum distance to connect two vertices as d andaprobabilityxtoremoverandomlytheedges MAX San Joaquin 0.9 for each vertex. The algorithm follows three steps: 1- 0.8 The vertices are organized as a square grid with toroidal y0.7 c0.6 boundary condition; 2- For each pair of vertices i and j en0.5 u whenthedistancedij islessorequalthandMAX anedge Freq00..43 connectingitoj iscreated;3-Eachedgeisremovedwith 0.2 0.1 probability x. 0.0 0.0 0.2 0.4 0.6 0.8 1.0 In order to visualize the betweenness paths in the Θ model presented in this section, we created a network (d) using the parameter L = 200, so that the network has N = 40000. The connectivity range was set to √ FIG. 3. The histograms show the normalized shortest path dMAX = 2. Therefore, each vertex of the network distance, Θ, between each secondary vertex and preferential has degree 8. We randomly remove the connections with paths. We consider the (a) random geometric model, (b) probability x=0.65. Waxman model, (c) the Oldenburg network and (d) the San We measured the betweenness centrality for each net- Joaquinnetwork. Thedashedlinesrepresentthecoverageand errorbarsinitems(a)and(b)showsthestandarddeviation. work vertex and the results can be seen in Figure 4(a). We can observe vertices with higher values (characteris- ticvertices)generatingpreferentialpathsinthenetwork, in a similar manner as main streets. Our hypothesis is to infer the statistical variation of the model, we created that the network communities are inducing the forma- 60 network samples. We used the parameter L = 56 tion of betweenness paths through connections between and we randomly removed the connections with proba- them. In Figure 4(b) we show the network communities bility x = 0.7. These parameters were set so that the and characteristic paths (indicated in black). The result generated networks have a size and average degree as suggests that the paths are going through communities. close as possible to compare with the Oldenburg net- To compare the characteristic betweenness paths dis- work. The network measurements are: number of ver- played by our network model and real networks, we exe- tices 2829.95 ± 21.34, average degree 2.59 ± 0.03, and cuted the same tests as in the previous section. In order modularity 0.94±0.002. 6 200 200 2.2e+07 2.0e+07 B e 150 1.8e+07 tw 150 e 1.5e+07 e n n 1.2e+07 e 100 ss 100 1.0e+07 c e n 7.5e+06 tr a 50 5.0e+06 lity 50 2.5e+06 0 0 0.0e+00 0 50 100 150 200 0 50 100 150 200 (a) (b) FIG. 4. (Color online) Item (a) presents a heat map showing the betweenness centrality of vertices belonging to a simple realizationofourmodel. Thecolorblueshowsregionswithoutvertices. Initem(b)eachcolorrepresentsadifferentcommunity, some distinct, non-adjacent, communities are associated to the same color due to color palette limitations.. The color black shows the preferential paths. The color white shows regions without vertices. 0.9 0.06 0.8 0.05 Pathsmodel 0.7 ncy0.6 ncy0.04 Randomgeo. ue0.5 ue0.03 Waxman Freq00..43 Freq0.02 0.2 0.01 0.1 0.00.0 0.2 0.4 0.6 0.8 1.0 0.000.00 0.05 0.10 0.15 0.20 Θ 〈Θ〉 FIG. 5. Histogram of normalized shortest path distance, Θ, FIG.6. Histogramofthecoveragefor60generatednetworks between each secondary vertex and preferential betweenness from each considered model. The dashed line represents the paths. coverage in Oldenburg’s network. Our model has similarities and differences from the Toquantifyournetworkcommunitiesintermsoftheir street networks and the other models. The giant com- respective degree and betweenness centrality, we con- ponet represents (85±8)% of all characteristic vertices. sidered two types of vertices, the geographical border Regarding the shortest distance between secondary ver- communityverticesandtheinternalcommunityvertices. ticesandpreferentialpaths(Figure5),thesecondaryver- The first type represents vertices that have at least one tices tend to be more distant from the backbone than in neighbor in the grid belonging to a different community, the Oldenburg and Waxman networks. no matter if they are connected or not. The second type Wecomparedtheaverageofshortestdistancebetween represents a vertex that is not a geographical border. In the secondary vertices and the preferential betweenness order to define each vertex type, we considered all eight pathsinnetworkmodelsandtheOldenburgnetwork. For geographical neighbors, of the vertices. In some analy- each generated network we calculate coverage, which we ses we also considered the topological border community represent by (cid:104)Θ(cid:105), and in Figure 6 we show the distri- vertices, which are the vertices connected with one or bution of (cid:104)Θ(cid:105) for the generated networks. The dashed more vertices from different community. We generated a line shows the coverage for Oldenburg. The coverage for networkwithL=200andaveragedegree2.92according Oldenburgisclosesttothemeasuredvaluesforthechar- toourmodel. Usingthisnetworkwecomputedthreedif- acteristic path model. Besides, the standard deviation is ferent comparisons. In the first, we compare the average low, which confirms that the betweenness paths have a degree between internal vertices and border community similarstructureinallgeneratednetworks. Therefore,in vertices (as can be seen in Figure 7(a)). In this compar- the following we further analyze the betweenness paths ison it is possible to observe that the average degree in using a single realization of our model. theinternalcommunityverticesishigherthantheborder 7 communityvertices. Thereby,wecanaffirmthatthever- V. COMMUNITY ADJACENCY NETWORK tex degree is influencing how the community is formed. Inordertoobservetherelationshipbetweencommuni- In order to describe how the vertices are connecting tiesandtheirimportance,consideringgeographicalchar- communities, we measured the average degree in the ge- acteristics, we created a network based on the topolog- ographicalcommunityborderandinthetopologicalcom- ical adjacency between communities of our characteris- munityborder,representedinFigure7(b). In98%ofthe tic paths model. Through this model it is possible to communities, the average degree of vertices belonging to compare the geographical characteristics (e.g., the com- the geographical border is lower than for vertices con- munityarea)withtopologicalcharacteristics,suchasthe tained in the topological border. degree of a vertex, which represents the number of com- munitiesconnectedwithagivencommunity. Inthisnet- We made a similar comparison, between geographic work, each vertex represents a community. A weight andtopologicalborders,butnowmeasuringthebetween- equal to SC + SC (defined in Equation 2) is associ- ness centrality instead of the degree (as can be seen in i j ated to each edge connecting communities i and j. The Figure 7(c)). This analysis can show if connections be- resulting network is undirected. In Figure 11 we show a tween communities (topological border) are generating a typical realization of this network. It is possible to ob- characteristic betweenness paths. Figure 7 shows that serve small communities having strong connections, in- the average geographical border betweenness is lower dicating that betweenness paths are also present in such than the topological border betweenness for all commu- small communities. On the other hand, there are big nities. Thus, in average, the vertices where the paths communities with weak connections, showing that few arecrossingthecommunitybordershavehigherbetween- preferential paths are crossing them. nesscentralityanddegreethanthegeographicalborders. Wecomparedlocaltopologycharacteristicsofthecom- Someverticesofcharacteristicpathsarepartofthetopo- munity adjacency network with geometrical characteris- logical border. In other words, they are connecting the tics (the scatter plots can be seen in Figure 12). Fig- communities. ures 12(a), (b) and (c) show the degree as a function of area, perimeter and diameter, respectively and Fig- The results indicate that the paths represent routes of ures 12(d), (e) and (f) show the same measures using efficient communication between the communities. This the vertex strength instead of the degree. The degree idea is illustrated in Figure 8. Vertices included in the is highly correlated with the geographical characteristics highlighted gray region belong to the same community. of communities. On the other hand, the strength has Thecommunityhasthreeverticesonitstopologicalbor- little relationship with the communities geometry. This der and the shortest paths connecting them, shown in happens because the strength is uncorrelated with the red, are creating the preferential paths with higher be- geographical characteristics of communities. tweenness centrality. In order to validate these results, we created an artifi- VI. CONCLUSION cial geographic network with fixed communities. In this network, each community is a regular lattice, where the verticesareconnectedwiththeireightnearestneighbors, The structure of a geographical network has a direct andthecommunitiesareinterconnectedthroughsomese- impactonhowthesystemcanbenavigated. Ouranalysis lectedvertices. Figure9showsthenetworkandhighlight revealed that betweenness paths are a common charac- the highest values of betweenness centrality (the charac- teristicofgeographicalnetworks,sincetheycanbefound teristic vertices). In this test, it is possible to observe in geographical models as well as real world networks. that the characteristic paths are connecting the commu- We characterized the betweenness paths in terms of the nities. Besides, through this model, it is possible to note distance between each network secondary vertex and the that the paths are not created at random, that is, their closest betweenness path. Such measurement can be re- emergence have a well-defined mechanism. lated to the covering provided by the paths to reach the entire network. We showed that in both the geographi- It is also interesting to analyze if the number of short- calmodelsandthestreetnetworksthebetweennesspaths est paths connecting pairs of communities are related to provideagoodcoveringofthesystem. Inordertobetter the sizes of the communities geographical border. In or- understandtheestablishmentofthecharacteristicpaths, der to do so, we plot the perimeter of adjacency between we proposed a model to generate geographical networks communitiesagainstthesumofthevertexstressconnect- havingasimpletopologybutthatcanstilloriginatesuch ing such communities, which can be seen in Figure 10. paths. Besides,wecalculatedthePearsoncorrelationcoefficient Itwasfoundthatthecharacteristicbetweennesspaths between the measurements, obtaining a value of 0.23. have a strong relationship with the presence of well- Thus, these measurements are uncorrelated. defined communities in the networks. We compared 8 5.5 5.5 1e+07 5.0 Internalvertices 5.0 Topological border Topological border Geographical border Geographical border 8e+06 Geographical border 4.5 4.5 s s gree4.0 gree4.0 enne6e+06 De 3.5 De 3.5 we4e+06 3.0 3.0 Bet 2.5 2.5 2e+06 2.0 2.0 0 20 40 60 80 100 0 20 40 60 80 100 0 20 40 60 80 100 Community Community Community (a) (b) (c) FIG.7. (Coloronline)Image(a)showsthedegreecomparisonbetweeninternalverticesandbordercommunityvertices,image (b)showsthedegreecomparisonbetweentopologicalandgeographicalborder,andimage(c)showsthebetweennesscentrality comparison between topological and geographical border. The community indexes were ordered according to the degree of internal community vertices. FIG. 8. (Color online) The highlighted region represents a community and the red squares represent characteristic ver- tices crossing the community. FIG.9. (Coloronline)Networkcreatedbyconnectinglattice- likecommunities. Eachcommunityisrepresentedbyadiffer- the vertices degrees and betweenness centrality among ent color and the characteristic paths are highlighted. geographical and topological borders of the communi- ties, as well as the internal vertices of the communities. This analysis revealed that the betweenness characteris- y tic paths are usually induced by the communities, in the enc 60 ρ = 0.23 ac 50 sense that such paths provide a communication between dj A 40 entry and exit points of the communities. of 30 The number of shortest paths passing through the er et 20 topological border of adjacent communities seems to be m independentofthegeometryoftheinterfacebetweenthe Peri 100 communities. This was revealed by the absence of corre- 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 Pathstress(sum) 1e14 lation between the total stress of vertices connecting ad- jacent communities and the perimeter of the adjacency. FIG. 10. Scatter plot showing for each community the sum- In order to better understand the relationship between mation of all stress connecting vertices from a community to communities and characteristic betweenness paths, we anotheragainsttheperimeterofadjacency. ThePearsoncor- defined a new structure based on the vertices between- relation (ρ) is shown inside the plot. ness and the community organization of the network be- 9 [2] A.E.Motter,S.A.Myers,M.Anghel, andT.Nishikawa, Nat Phys 9, 191 (2013). [3] O. Lordana, J. M. Sallana, P. Simoa, and D. Gonzalez- Prietob, Transportation Research Part E: Logistics and Transportation Review 68, 155 (2014). [4] R. Albert and A.-L. Barabási, Rev. Mod. Phys. 74, 47 (2002). [5] M. E. J. Newman, SIAM Review 45, 167 (2003). [6] M. Barthélemy, Physics Reports 499, 1 (2011). [7] S. Boccaletti, V. Latora, Y. Moreno, M. Chavez, and D.-U. Hwang, Phys. Rep. 424, 175 (2006). [8] L. da F. Costa, F. A. Rodrigues, G. Travieso, and V.P.R.Boas,AdvancesinPhysics56,167(2007),cond- mat/0505185. [9] H.F.deArruda,C.H.Comin,M.Miazaki,M.P.Viana, andL.daF.Costa,Journalofneurosciencemethods245, 1 (2015). [10] L. Freeman, Sociometry 40, 35 (1977). [11] J.F.Bigotte,D.Krass,A.P.Antunes, andO.Berman, TransportationResearchPartA:PolicyandPractice44, 506 (2010). [12] P. Crucitti, V. Latora, and S. Porta, Phys. Rev. E 73, 036125 (2006). [13] P. Crucitti, V. Latora, and S. Porta, Chaos: An In- terdisciplinary Journal of Nonlinear Science 16, 015113 FIG. 11. (Color online) Heat map representing the commu- (2006). nityadjacencynetworkfromcommunities,wherethecolorof [14] S.Scellato,A.Cardillo,V.Latora, andS.Porta,TheEu- edges represent the weight (black is zero and white is one). ropeanPhysicalJournalB-CondensedMatterandCom- plex Systems 50, 221 (2006). [15] B.M.YerraandD.M.Levinson,TheAnnalsofRegional ing studied, which we called community network. By Science 39, 541 (2005). characterizingthismetastructure,weconcludedthatge- [16] V. Eppell, B. A. McClurg, and J. M. Bunker, (2001). [17] S.Lämmer,B.Gehlsen, andD.Helbing,PhysicaA:Sta- ographical characteristics can influence the connectivity tistical Mechanics and its Applications 363, 89 (2006). between communities, which takes place along between- [18] G. Chowell, J. M. Hyman, S. Eubank, and C. Castillo- ness characteristic paths. Chavez, Physical Review E 68, 066102 (2003). Itremainsanopenquestionifother,non-geographical, [19] M.E.J.Newman,ProceedingsoftheNationalAcademy network models can also present such characteristic be- of Sciences 103, 8577 (2006). tweennesspaths. Also,theproperimpactthatsuchpaths [20] A.Shimbel,BulletinofMathematicalBiophysics15,501 may have on a dynamics taking place on the system is (1953). [21] U. Brandes, Journal of Mathematical Sociology 25 also an interesting aspect for future studies. (2001). [22] U. Brandes, Social Networks 30, 136 (2008). [23] M. E. J. Newman, Social networks 27, 39 (2005). VII. ACKNOWLEDGMENTS [24] S. P. Borgatti, Social networks 27, 55 (2005). [25] L. C. Freeman, S. P. Borgatti, and D. R. White, Social networks 13, 141 (1991). Henrique Ferraz de Arruda thanks CNPq (Grant no. [26] M. E. J. Newman, Networks: An Introduction (Oxford 132363/2013-5) and CAPES for financial support. Ce- University Press, Inc., New York, NY, USA, 2010). sar Henrique Comin acknowledges FAPESP (Grant no. [27] A. E. Motter and Y.-C. Lai, Physical Review E 66, 11/22639-8) for financial support. Luciano da Fontoura 065102 (2002). Costa is grateful to CNPq (Grant no. 307333/2013-2), [28] B.Mirzasoleiman,M.Babaei,M.Jalili, andM.A.Safari, Physical Review E 84, 046114 (2011). FAPESP (Grants no. 11/50761-2), and NAP-PRP-USP [29] A. Arenas, L. Danon, A. Diaz-Guilera, P. M. Gleiser, for sponsorship. The authors thanks Filipi Nascimento and R. Guimera, The European Physical Journal B- Silva for fruitful discussions. CondensedMatterandComplexSystems38,373(2004). [30] R. Guimera and L. A. N. Amaral, Nature 433, 895 (2005). [31] C. Zhong, S. M. Arisona, X. Huang, M. Batty, and G.Schmitt,InternationalJournalofGeographicalInfor- ∗ [email protected] mation Science 28, 2178 (2014). † [email protected] [32] S. Fortunato, Physics Reports 486, 75 (2010). ‡ [email protected] [33] A.Clauset,M.E.J.Newman, andC.Moore,Phys.Rev. [1] S. H. Strogatz, Nature 410, 268 (2001). E 70, 066111 (2004). 10 60 700 ρ = 0.85 200 ρ = 0.82 600 ρ = 0.87 50 500 er 150 er 40 Area 430000 Perimet 100 Diamet 2300 200 50 10 100 0 2 4 6 8 10 2 4 6 8 10 2 4 6 8 10 Degree Degree Degree (a) (b) (c) ρ = 0.17 60 700 300 ρ = 0.12 ρ = 0.17 50 600 250 Area 423500000000 Perimeter 211005000 Diameter 42310000 100 50 0 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 Strength Strength Strength (d) (e) (f) FIG. 12. Scatter plots comparing geographical and topological measurements in the community networks. The items (a), (b), and (c) represent the unweighted degree against geographical characteristics and the items (d), (e), and (f) represent the strength against geographical characteristics. The Pearson correlation, ρ, is shown inside each plot. [34] B.M.Waxman,IEEEJournalonSelectedAreasinCom- [36] T. Brinkhoff, Geoinformatica 6, 153 (2002). munications 6, 1617 (1988). [35] M. Penrose, Random Geometric Graphs, Oxford studies in probability (Oxford University Press, 2003).