ebook img

Navigation in non-uniform density social networks PDF

0.08 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Navigation in non-uniform density social networks

Navigationinnon-uniform density socialnetworks Yanqing Hu, Yong Li, Zengru Di, Ying Fan∗ Department of Systems Science, School of Management and Center for Complexity Research, Beijing Normal University, Beijing 100875, China (Dated:January6,2011) Recentempiricalinvestigationssuggestauniversalscalinglawforthespatialstructureofsocialnetworks.Itis foundthattheprobabilitydensitydistributionofanindividualtohaveafriendatdistancedscalesasP(d)∝d−1. Since population density is non-uniform in real social networks, a scale invariant friendship network(SIFN) basedontheaboveempiricallawisintroducedtocapturethisphenomenon. Weprovethetimecomplexityof navigationin2-dimensionalSIFNisatmostO(log4n).Intherealsearchingexperiment,individualsoftenresort 1 toextrainformationbesidesgeographylocation.Thus,real-worldsearchingprocessmaybeseenasaprojection 1 ofnavigationinak-dimensionalSIFN(k>2).Therefore,wealsodiscusstherelationshipbetweenhighandlow 0 dimensionalSIFN.Particularly,weprovea2-dimensionalSIFNistheprojectionofa3-dimensionalSIFN.As 2 amatteroffact,thisresultcanalsobegeneratedtoanyk-dimensionalSIFN. n a PACSnumbers: J 5 I. INTRODUCTION operators have provided exact solutions respectively for the ] asymptoticbehaviorofKleinberg’snavigationmodel[20,21]. h More recently, the navigation probolem with a total cost re- p Tounderstandthestructureofthesocialnetworksinwhich striction has also been discussed, where the cost denotesthe - we live is a very interesting problem. As part of the re- c lengthofthelong-rangeconnections[22,23]. cent surge of interest in networks, there have been active o research about social networks[1–6]. Besides some well Meanwhile, recent empirical investigations suggest a uni- s . knowncommonpropertiessuchas small-worldand commu- versal spatial scaling law on social networks. Liben-Nowell s c nitystructure[7–9],muchattentionhasbeendedicatedtonav- et al explored the role of geography alone in routing mes- i igationinrealsocialnetworks. sageswithintheLiveJournalsocialnetwork[24]. Theyfound s y In the 1960s, Milgram and his co-workers conducted the thattheprobabilitydensityfunction(PDF)ofgeographicdis- h firstsmall-worldexperiment[10]. Randomlychosenindivid- tance d between friendship was about P(d) ∝ d−1. Adamic p and Ada also observed the P(d) ∝ d−1 law when investigat- ualsintheUnitedStateswereaskedtosendalettertoapartic- [ ingtheHewlett-PackardLabsemailnetwork[25]. Lambiotte ularrecipientusingonlyfriendsoracquaintances.Theresults 2 oftheexperimentrevealthattheaveragenumberofinterme- et al analyzed the statistical properties of a communication v diatestepsinasuccessfulchainisaboutsix. Sincethen,“six networkconstructedfromtherecordsofamobilephonecom- 5 pany [26]. Their empiricalresults showed that the probabil- degreesofseparation”hasbecamethesubjectofbothexper- 4 ity that two people u and v living at a geographic distance imentalandtheoreticalresearch[11, 12]. Recently, Doddset 8 d(u,v)wereconnectedbyalinkwasproportionaltod(u,v)−2. alcarriedoutanexperimentstudyinaglobalsocialnetwork 1 Becausethenumberofnodeshavingdistanced toanygiven . consistingabout60,000emailusers[13]. Theyestimatedthat 0 nodeisproportionaltodin2-dimensionalworld,sotheprob- social navigation can reach their targets in a median of five 1 abilityforanindividualtohavea friendatdistanced should to seven steps, which is similar to the results of Milgram’s 0 be P(d) ∝ d · d−2 = d−1. More recently, Goldenberg et al 1 experiment. studied the effect of IT revolutionon social interactions[27]. : The first theoretical navigation model was proposed by v ThroughanalyzinganextensivedatasetoftheFacebookon- Kleinberg[14,15]. Heintroducedann×nlatticetomodelso- Xi cialnetworks. Inadditiontothelinksbetweennearestneigh- linesocialnetwork,theypointedoutthatsocialcommunica- tiondecreaseinverselywiththedistancedfollowingthescal- r bors, each node u is connected to a random node v with a a probabilityproportionaltod(u,v)−r,whered(u,v)denotesthe inglawP(d)∝d−1aswell. lattice distance between u and v. Kleinberg has proved that SuchasintheLiveJournalsocialnetwork,populationden- the optimal navigation can be obtained when the power-law sityisnon-uniforminrealsocialnetworks[24]. Todealwith exponentr equals to d, where d is the dimensionalityof the the navigation problem with non-uniform population den- lattice, and the time complexityof navigation in that case is sity, a scale invariant friendship network (SIFN for short) at most O(log2n). Since then, muchattention has been ded- model based on the above spatial scaling law P(d) ∝ d−1 icated to Kleinberg’snavigationmodel[16–18]. Robersonet of social networks is proposed in this paper. We prove the al. studiedthenavigationprobleminfractalnetworks,where time complexityof navigationin a 2-dimensionalSIFN is at they proved that r = d was also the optimal power-law ex- most O(log4n), which indicates social networks is naviga- ponentin the fractal case[19]. Carmi, Cartozo and their co- ble. Dodds et al have pointed out that individuals often re- sort to extra information such as education and professional informationbesides geographylocationin the realsearching experiment[13]. Considering this phenomenon, navigation ∗[email protected] process in real world may be seen as the projection of nav- 2 igation in a higher dimensionalSIFN. Therefore, we further discuss the relationship between high and low dimensional SIFN. Particularly, we prove that a 2-dimensionalSIFN can be seen as the projection of any k-dimensionalSIFN(k > 2) throughtheoreticalanalysis. II. NAVIGATIONINNON-UNIFORMDENSITYSOCIAL NETWORKS Todealwiththenon-uniformpopulationdensityinrealso- cialnetworks,wedividethewholepopulationintosmallareas andgivethefollowingtwoassumptions. First,thepopulation densityisuniformin eachsmallarea. Second,theminimum populationdensityamongtheareasism,whilethemaximum is M. We set m > 0 to guaranteethat a searchingalgorithm FIG.1:Twostrategiesofsendingmessageina2-dimensionalSIFN. Strategy A, send the message directly to target t from the current can always make some progress toward any target at every messageholder usingKleinberg’sgreedyroutingstrategy. Ateach stepofthechain. step,themessageissenttooneofitsneighborswhoismostcloseto Like Kleinberg’s network (KN for short) and Liben- thetargetinthesenseoflatticedistance. StrategyB,themessageis Nowell’srank-basedfriendshipnetwork(RFN forshort), we firstsenttoagivennode jusingKleinberg’sgreedystrategyandthen employann×nlatticetoconstructSIFN.Withoutlossofgen- tothetargetnodetusingthesamestrategy.Supposewestartfroma erality,weassumeeachnodeuhasqdirectedlong-rangecon- sourcenode s,afteronestep,themessagereachesnodes A and B 1 1 nections,whereqisaconstant[15]. Togeneratealong-range respectivelywithstrategyAandB. Consider B asthenewsource 1 connectionof nodeu, we first randomlychoosea distance d node, thenweshould get A and B respectively withstrategiesA 2 2 according to the observed scaling law P(d) ∝ d−1 in social andBinthenextstep. networks. Thenrandomlychooseanodevfromthenodeset, whose elements have the same lattice distance d to node u, andcreateadirectedlong-rangeconnectionfromutov. The in social networks compared with KN and RFN. Further, latticeisassumedtobelargeenoughthatthelong-rangecon- Pr (u,v,k),Pr (u,v) and Pr (u,v) can be quite differ- KN SIFN RFN nectionswillnotoverlap. entwhenthepopulationdensityisnon-uniform. For simplicity, we set q = 1. Let S denote the set of all Since our 2-dimensional SIFN captures the non-uniform nodes, then the probability that u chooses v as its long-rang population density property in the real social networks, we connectioninSIFNcanbegivenbyeq.(1). purposefullydividethenavigationprocessintotwostagesfor simplicity. First send messages inside a small area and then 1 d(u,v)−1 Pr (u,v)= (1) amongthe areas. To analyze the time complexityof naviga- SIFN c(u,v) n d−1 d=1 tion in a 2-dimensional SIFN, we first compare the follow- where c(u,v) = |{x|d(u,x) = d(u,v),Px ∈ S}| and d(u,v) de- ing two searchingstrategies as shownin FIG.1. StrategyA, send the message directly to target t from the current mes- notes the lattice distance between nodes u and v. Likewise, sageholderusingKleinberg’sgreedyroutingstrategy.Ateach theprobabilitythatuchoosesvasitslong-rangconnectionin step, themessageissentto oneofitsneighborswhoismost KNandRFNaregivenrespectivelybyeq.(2)andeq.(3). close to the target in the sense of lattice distance. Strategy d(u,v)−r B, the message is first sent to a given node j using Klein- PrKN(u,v,r)= d(u,w)−r (2) berg’sgreedystrategyandthentothetargetnodet usingthe w,u samestrategy.ItcanbeprovedthatstrategyAperformsbetter P thanstrategyBonaverage.Supposewestartsendingmessage PrRFN(u,v)= ranrkaun(kv)(−w1)−1 (3) rferospmecthtievesloyurwceithnosdtreaste,gthyeAmeasnsdagBeraefatcehreosnneosdteespA. 1ItanisdaBl1- w,u u ways correctthat lattice distance d(A ,t) ≤ d(B ,t), because P 1 1 where rank (v) = |{w|d(u,w) < d(u,v),x ∈ S}| denotes greedyroutingstrategyalwayschoosethenodemostcloseto u the number of nodes within distance d(u,v) to node u in targettfromitsneighbors.Accordingtotheresultsof[18,21], RFN[15, 24]. Notice that, the number of nodes with a dis- the longer the distance between a source and a given tar- tanced(u,v)inak-dimensional(k > 1)latticeisproportional get, the more is the expected steps. Thus we should have to d(u,v)k−1. Thus, a node u connectsto nodev with proba- T(A1 → t) ≤ T(B1 → t), where T(A1 → t) and T(B1 → t) bilityproportionaltod(u,v)−a doesnotmean P(d) ∝ d−a but denotetheexpecteddeliverytime to targett from A1 and B1 P(d) ∝ d−a+k−1 instead. Therefore, Pr (u,v,k), Pr (u,v) respectively. KN SIFN andPr (u,v)areexactlythesameforanyk-dimensionallat- LetT(s → j → t)denotetheexpecteddeliverytimefrom RFN ticebasednetworkwhenpopulationdensityisuniform.How- stotviaatransportnode j,thenwehaveT(s→ t) ≤ T(s→ ever, SIFN always satisfies the empirical results P(d) ∝ d−1 B →t).ConsiderB asanewsourcenode,thenmessagewill 1 1 3 reach A and B with strategies A and B respectivelyin the complexityof navigationin SIFN with strategy B is at most 2 2 next step. Following the same deduction, we have T(B → O(M log4n).However,actualnavigationprocessinrealworld 1 m A → t) ≤ T(B → B → t). Repeat this process until shouldbecarriedoutregardlessoftheabovetwoassumptions, 2 1 2 the message reaches the given node j with strategy B, then whichindicatesindividualsshouldusestrategyA. Basedon weshouldhaveamonotoneincreasingsequenceofexpected the above analysis, strategy A performs better than strategy deliverytime{T(s → B → t),T(s → B → t),···,T(s → B on average. Therefore, the time complexityof navigation 1 2 j → t) }. Therefore,we can obtainT(s → t) ≤ T(s → j → in2-dimensionalSIFNisatmostO(log4n)withnon-uniform t), which means strategy A is better than strategy B. This populationdensity. analysiscanbeextendedtoanyk-dimensionalSIFN. BasedonthefirstassumptionandthefactthatSIFNisiden- ticaltoKNwhenpopulationdensityisuniform,theexpected III. RELATIONSHIPBETWEENHIGHANDLOW steps spent in each small area using Kleinberg greedy algo- DIMENSIONALSIFN rithmisatmostO(log2n).Considereachsmallareaasanode, wewillgetanew2-dimensionalweightedlattice. Theweight Theempiricalresultsshowindividualsalwaysresorttoex- (population)of the nodes is between m and M based on the tra information such as profession and education informa- secondassumption.Thuswehave tion besides the target’s geography location when routing messages[13]. Then, real navigation process in social net- m M c d−1 ≤Pr′ (u,v)≤c d−1 (4) worksmaybemodeledwithahigherdimensionalSIFN.Inthe M SIFN m following, we will discuss the relationship between the high and low dimensional SIFN and prove that a 2-dimensional where c is a constant and Pr′ (u,v) represents the proba- SIFN SIFN can be obtained by any k-dimensional SIFN (k > 2). bility that area u is connected to area v in the new weighted Particularly,wewillprovidethetheoreticanalysisforthecase lattice. wherek = 3. Theanalysiscanbegeneratedtoanyk dimen- We say that the executionof greedyalgorithm is in phase sionalcases. j (j > 0) when the lattice distance from the currentnode to WeemployearandomvariableD todenotethefriendship target t is greater than 2j and at most 2j+1. Obviously, we 3 distanceina3-dimensionalSIFN.Forsimplicity,acontinuous have expressionsis used. Since, the long-rangeconnectionsin 3- n dimensionalSIFNsatisfiestheaboveempiricallaw,thePDF n d−1 ≤1+ x−1dx=1+logn<2logn. (5) ofD3canbeexpressedby Z Xd=1 1 P(D =d)= 1 1,d ≤d≤d (9) 3 lnd −lnd d m M Further, we define B as the node set whose elements are M m j within lattice distance 2j +2j+1 < 2j+2 to u. Let|B | denote whered andd denotetheminimumandmaximumdistance j m M thenumberofnodesinsetB ,weshouldhave respectivelyinthe3-dimensionalSIFN. j Wecanobtaina2-dimensionalnetworkmodelifweproject 2j a3-dimensionalSIFNtoa2-dimensionalworld. Similarly,a |Bj|>1+ i>22j−1. (6) randomvariable D2 isusedto denotethefriendshipdistance Xi=1 inthenew2-dimensionalnetworkmodel. Itisnotdifficultto understandthattheconditionfora2-dimensionalSIFNshould Supposethatthemessageholderiscurrentlyinphase j,then be the PDF of D satisfies P(d) ∝ d−1. Since D is the pro- theprobabilitythatthenodeisconnectedbyalong-rangelink 2 2 toanodeinphase j−1isatleast(Mm−12logn·4·22j+4)−1. jection of D3, then D2 can be seen as the productof D3 and X. HererandomvariableXisindependentonD anditsPDF Theprobabilityψ(x)toreachthenextphase j−1inmorethan 3 canbegivenbyeq.(10). xstepscanbegivenby ψ(x)=(1−(Mm−12logn·4·22j+4)−1)x (7) 1 P(X = x)= ,0≤ x≤λ (10) λ andtheaveragenumberofstepsrequiredtoreachphase j−1 where0≤λ≤1. Finally,thePDFofD canbewrittenas is 2 ∞ m i−1 256Mlogn < x>= (1− ) = . (8) 0, d≤0 tOSoi(tnaMmclelnotuhgme2bnine)i.rtioaflXi=vs1taelpuseroef2q5ju6iirsMeadtlotmogonrsetalcohgtnh,ethtaernmgethteisexaptemctoesdt P(D2 =d)=0dlnM,dd1dM−m−λddMl(1nlMλnd−dmdM,m−lndm), 0ddm<>λdd<M≤dλd≤mλdMλ (11) As a matter of fact, it means that we are using strategy B When taking account of real social networks, d is large M tosendmessagein2-dimensionalSIFNwhenthenavigation enoughthattheterm 1 willapproachitslimitof0. Mean- process is divided into the above 2 stages. Thus, the time while, the term d λ cdaMnλ be neglected when compared with m 4 d λ,becauseλ ≤ 1andd isrelativelysmall. ThusthePDF tion density. It has been proved that the time complexity of M m ofD canbesimplifiedintoP(d) ∝ d−1,whichisidenticalto navigationin2-dimensionalSIFNisatmostO(log4n),which 2 thatofD ina3-dimensionalSIFN. correspondstotheupperbondofnavigationinrealsocialnet- 3 Through theoretical analysis, we have proved a 2- works.Giventhefactthatindividualsarealwaysrestrictedon dimensional SIFN can be seen as the projection of a 3- the 2-dimensionalgeographyworld even they possess infor- dimensional SIFN. Likewise, we can get a 2-dimensional mationofthehigherdimensions,actualsearchingprocesscan SIFN from any k-dimensional(k > 2) SIFN. Notice that in- beseenasaprojectionofnavigationinahigherk-dimensional dividualsare always restricted on the 2-dimensionalgeogra- SIFN.Throughtheoreticalanalysis,weprovethattheprojec- phyworldeventheypossessextrainformationfromotherdi- tionofahigherk-dimensionalSIFNresultsina2-dimensional mensions.Thus,real-worldsearchingprocessmaybeseenas SIFN.Therefore,SIFNmodelmayexplainthenavigabilityof theprojectionofnavigationinahighdimensionalSIFN.Our realsocialnetworkseventakeaccountoftheinformationfrom analysisindicatethatSIFNmodelmayexplainthenavigabil- higherdimensionsotherthangeographydimensions. ity of real social networkseven take accountof the fact that individualsalways resortto extra informationin real search- ingexperiments. V. ACKNOWLEDGEMENT IV. CONCLUSION WethankProf.ShlomoHavlinforsomeusefuldiscussions. ThisworkispartiallysupportedbytheFundamentalResearch Recentinvestigationssuggestthatthe probabilitydistribu- Funds for the Central Universities and NSFC under Grants tion of having a friend at distance d scales as P(d) ∝ d−1. No.70771011and No. 60974084and NCET-09-0228. Yan- We propose an SIFN model based on this spatial property qing Hu is supportedby Scientific Research Foundationand to deal with navigation problem with non-uniform popula- ExcellentPh.DProjectofBeijingNormalUniversity. [1] J. Kleinberg, The Convergence of Social and Technological micPerspective,Proceedingsofthethirty-secondannualACM Networks,CommunicationsoftheACM.november 51,66-72 symposiumonTheoryofcomputing,163-170(2000). (2008). [16] C.Martel,AnalyzingKleinberg’s(andother)small-worldmod- [2] D.Brockmann,L.Hufnagel,andT.Geisel,Thescalinglawsof els, Proceedings of the twenty-third annual ACM symposium humantravel,Nature(London)439462-465(2006). onPrinciplesofdistributedcomputing,179-188(2004). [3] M. F. Shlesinger, Random walks: Follow the money, Nature [17] O. Sandberg, Searching in a Small World, Licentiate thesis, Physics2,69-70(2006). ChalmersUniversity,(2005). [4] M. C. Gonzalez, C. A. Hidalgo, and A.-L. Barabasi, Under- [18] L.Barrire,P.Fraigniaud,E.Kranakis,andD.Krizanc,Efficient standing individual human mobility patterns, Nature(London) routing in networks with long range contacts (Springer,New 453,779-782(2008). York,2001) [5] S.H.Strogatz, Exploring complex networks, Nature(London) [19] M. R. Roberson and D. Ben-Avraham, Kleinberg navigation 410,268-276(2001). infractalsmall-worldnetworks,PhysicalReviewE74,17101 [6] R.AlbertandA.-LBarabasi,Statisticalmechanicsofcomplex (2006). networks,ReviewsofModernPhysics74,47-97(2002). [20] C.CarettaCartozoandP.DeLosRios,Extendednavigabilityof [7] M.GirvanandM.E.J.Newman,Communitystructureinsocial smallworldnetworks: exactresultsandnewinsights,Physical andbiologicalnetworks,ProceedingsoftheNationalAcademy ReviewLetters102,238703(2009). ofSciences99,7821-7826(2004). [21] S.Carmi,S.Carter,J.Sun,andD.Ben-Avraham,Asymptotic [8] M.E.J.Newman, Thestructureandfunctionofcomplexnet- behavioroftheKleinbergmodel,PhysicalReviewLetters102, works,SIAMRev.45,167-256(2003). 238702(2009). [9] D. J. Watts, P. S. Dodds, and M. E. J. Newman, Identity and [22] G. Li, S. D. S. Reis2 A. A. Moreira, S. Havlin, H. E. Stan- searchinsocialnetworks,Science296,1302-1305(2002). ley1 andJ.S.Andrade, Jr., Towards design principlesfor op- [10] S.Milgram,Thesmallworldproblem,Psychologytoday2,60- timaltransportnetworks,PhysicalReviewLetters104,018701 67(1967). (2010). [11] J. Guare, Six degrees of separation: A play, (Vintage Books, [23] Y.Li,D.Zhou, Y.Hu, J.ZhangandZ.Di,Exactsolutionfor NewYork,1990) optimal navigation with total cost restriction, EPL 92, 58002 [12] F.Macrae,MicrosoftprovesyouAREjustsixdegreesofsepa- (2010). rationfromanyoneintheworld,Mailonline,ScienceandTech, [24] D. Liben-Nowell, J. Novak, R. Kumar, P. Raghavan, and A. 04thAugust,(2008). Tomkins, Geographic routinginsocial networks, Proceedings [13] P. S. Dodds, R. Muhamad and D.J. Watts, An experimental oftheNationalAcademyofSciences102,11623-11628(2005). study of search in global social networks, Science 301, 827 - [25] L.AdamicandE.Adar,Howtosearchasocialnetwork,Social 829(2003). Networks27,187-203(2005). [14] J.Kleinberg,Navigationinasmallworld,Nature(London)406, [26] R. Lambiotte, V. D. Blondel, C. de Kerchove ,E. Huens , 845-845(2000). C. Prieur , Z. Smoreda, and P. V. Dooren, Geographical dis- [15] J. Kleinberg, The Small-World Phenomenon: An Algorith- persal of mobile communication networks, Physica A 387, 5 5317C5325(2008). arXiv:0906.3202v2. [27] J. Goldenberg and M. Levy, Distance is not dead: So- cial interaction and geographical distance in the internet era,

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.