IEEE/ACMTRANSACTIONSONNETWORKING,VOL.13,NO.1,FEBRUARY2005 121 Fair Distributed Congestion Control in Multirate Multicast Networks SaswatiSarkar,Member,IEEE,and LeandrosTassiulas, Member,IEEE Abstract—Westudyfairnessofresourceallocationinmultirate, rate of transmission per session is likely to either overwhelm multicast networks. In multirate networks, different receivers of the slow receivers or starve the fast ones, in absence of addi- the same multicast session can receive service at different rates. tional provisions. For real-time traffic, multirate transmission Wedevelopamathematicalframeworktomodelthemaxminfair canbeusedtocounternetworkheterogeneity.Thereceiversof allocation of bandwidth with minimum and maximum rate con- straints.Wepresentanecessaryandsufficientconditionforarate thesamesessionareallowedtoreceiveatdifferentservicerates. allocationtobemaxminfairinamultirate,multicastnetwork.We Wewoulddiscussseveralmultirateencodingschemeslater,but propose a distributed algorithm for computing the maxmin fair we mention one of the possibilities now. A source encodes its ratesallocatedtovarioussource–destinationpairs.Thisalgorithm signalinseverallayers,andtheselayerscanbecombinedatthe hasalowmessageexchangeoverhead,andisguaranteedtocon- decoderforsignalreconstruction.Theselayersaretransmitted vergetothemaxminfairratesinfinitetime. asseparatemulticastgroups,andreceiversadapttocongestion Index Terms—Algorithms, complexity theory, fairness, multi- byjoiningandleavingthesegroups[8].Precisionoftherecon- cast. structionimproveswiththenumberoflayersreceived.Thislay- eredtransmissionschemehasbeenusedforbothvideo[21]and I. INTRODUCTION audio[4]transmissionsovertheinternetandhaspotentialsfor MULTICASTING provides an efficient way of transmit- useinATMnetworksaswell[11]. Layer bandwidth may be flexible or predetermined. In the ting information from a sender to a set of receivers. A former,layerbandwidthcanbetunedtocloselymatchanyde- single source node or a collection of source nodes send iden- sired receiver rates with fine granularity [15]. The feasible set tical messages simultaneously to multiple destination nodes. canbeassumedtobecontinuousinthiscase.Inthelatter,layer Single destination or unicast and broadcast to the entire net- bandwidths are predetermined and have coarse granularity. A workarespecialcasesofmulticast.Multicastingreducesband- receiver either receives a layer fully or does not receive the width consumption as message replication takes place at only layeratall.Itcannotpartiallysubscribetoalayer.Effectively, theforkingnodes.Thisisparticularlyusefulforreal-timemul- the network can only allocate a discrete set of rates to the re- tiparty communications like audio or video teleconferencing, ceivers,whereasacontinuoussetofratescanbeallocatedwhen video-on-demandservices,distancelearning,etc.,astheseap- receiverscan subscribe to fractional layers.We study the con- plicationsconsumealotofbandwidth.Wewouldstudyresource tinuouscasehereandhavestudiedthediscretecasein[20].We allocationforreal-timemulticastapplications. discuss how to attain a continuous allocation of rates in Sec- Therearetwopossibletransmissionmodesinmulticastnet- tion IV. worksforlosstolerantreal-timetrafficlikeaudio,video,etc.In We study fair allocation of rates in multirate, multicast net- one,allreceiversinthesamesessionreceiveinformationatthe works.Max-minfairnessisawellaccepteddefinitionoffairness samerate.However,thisuniratetransmissionhassevereshort- [3].Abandwidthallocationismaxminfair,ifnoreceivercanbe comingsformulticastnetworks.Thisisbecauseofnetworkhet- allocatedahigherserviceratewithoutloweringthatofanother erogeneity. A single session may havemany destinations. The receiver having equal or lower rate. The objective is to serve pathstodifferentdestinationsmayhavedifferentbandwidthca- every receiver of every session at a fair rate. The service rate pacities,e.g.,onemayconsistofmultimegabitlinks,suchas, ofareceivershoulddependonlyonthecongestioninthepath (45Mb/s)andanothermayhavea128kb/sISDNline.Asingle leadingtothereceiver,anditsprocessingcapability.Attaining thisobjectiveiscomplicatedinmulticastnetworksasdifferent Manuscript received June 19, 2002; revised June 23, 2003; approved by receivershavedifferentpathsandprocessingcapabilities. IEEE/ACM TRANSACTIONS ON NETWORKING Editor M. Grossglauser. The workofS.SarkarwassupportedinpartbytheNationalScienceFoundation We first review the previous work in this area. We have under Grants ANI01-06984, NCR02-38340, and CNS04-35506. This paper proposed a routing and scheduling policy which stabilizes the waspresentedinpartattheIEEEGLOBECOM1999,RiodeJaneiro,Brazil, system,ifthenetworkcanaccommodateallthetrafficdemands andtheIEEEINFOCOM2000,TelAviv,Israel. S. Sarkar is with the Department of Electrical and Systems Engineering, [18].However,resourcelimitationsmaynotallowfulfillingall UniversityofPennsylvania,Philadelphia,PA19104USA(e-mail:swati@ee. trafficdemands,particularlyforbandwidthexpensivereal-time upenn.edu). applications.Fairnessofresourceallocationbecomesimportant L. Tassiulas is with the Computer Engineering and Telecommunications Department,UniversityofThessaly,38334Volos,Greece,andalsowiththe in such a scenario. Tzeng et al. have investigated the problem DepartmentofElectricalandComputerEngineeringandInstituteforSystems of fair allocation of bandwidth to multicast sessions under the Research, University of Maryland, College Park, MD 20742 USA (e-mail: constraint that all receivers of the same session must receive [email protected]). DigitalObjectIdentifier10.1109/TNET.2004.842234 service at the same rate [22]. However, under this unirate 1063-6692/$20.00©2005IEEE 122 IEEE/ACMTRANSACTIONSONNETWORKING,VOL.13,NO.1,FEBRUARY2005 TABLE I SUMMARYOFSYMBOLSUSEDTHROUGHOUTTHEPAPER transmission, due to network heterogeneity, service rate may associatedwithasourceanddestinationsetpair ,where not match the path bandwidth and the processing capability isthesourcenodeofthesessionand isthesetofdestination of every receiver. Fairness properties of a multicast network nodes.Thereisatreeassociatedwitheachsessionthatcarries improvesifmultiratetransmissionisusedinsteadofsinglerate thetrafficofthesession.Thetreecanbeestablishedduringcon- transmission[17].Chiungetal.[6]advocatesimulcast,butthat nection establishment phase in connection-oriented networks, does not utilize bandwidth efficiently as it requires multiple orcanbeestablishedbysomewellknownmulticastroutingpro- transmission of the same information. Some well known net- tocollikeDVMRP[7],MOSPF[14],CBT[2],andPIM[8]in work protocols proposed for fair allocation of rates in layered connectionlessnetworks. transmission, RLM (Receiver-driven Layered Multicast) [16] We call every source destination pair of a session a virtual and LVMR (Layered Video Multicast with Retransmissions) session.Ifasession hassource anddestinationset ,where [12] improve fairness among members of the same session, , then it corresponds to virtual sessions, but do not distribute the bandwidth fairly among members of . For example, in Fig. 1, session 1 has two differentsessions [13].Li etal. [13]suggesta schemefor fair receivers, , , and two virtual sessions, allocation of layers for multisession layered video multicast and .Ourobjectiveistoachievethemaxminfair to rectify this defect in RLM and LVMR. The authors present rate allocation for the virtual sessions. Every virtual session empiricalevidencethattheschemecansharebandwidthfairly (source–destinationpair)hasaminimumandamaximumrate. with TCP and improves inter-session fairness for networks Thesearedecidedfromapplicationrequirements.Forexample, withmultiplevideosessionssharingonlyonelink.Rubenstein the reception quality may be poor for a high fidelity video et al. [17] propose a centralized algorithm for computing the transmission if the bandwidth is below a certain threshold, maxminfairratesinamultiratemulticastnetwork.Centralized whichconstitutestheminimumrateforthereceiver. algorithmscannotbeimplementedinlargenetworks. Let there be virtual sessions in the net- InSectionII,weformulatetheproblemoffairallocationof work. Rate allocation is an -dimensional vector bandwidth in multirate multicast networks. In Section III-A, , with we present an algorithm for computing the maxmin fair beingtherateallocatedtothe thvirtualsessionofthe th rates in an arbitrary network with any number of multicast session.Forsimplicity,henceforth,wewilluseasingleindex. sessions. This algorithm requires only local information at A rate allocation is feasible if the following any node in the network and is thus amenable to distributed conditionshold. computation. In Section III-B, we present a framework for 1. , where and are, respectively, scalable distributed implementations of the above algorithm. the minimum and the maximum rates of virtual session Our analytical results guarantee that the distributed algorithm , , converges to the maxmin fair rates in finite time, and presents 2. Let denote the set of sessions passing through link worst case convergence time bounds. Our algorithm can be , denote the set of virtual sessions of session used in internet and ATM networks. We have incorporated passingthroughlink and denotethecapacityoflink minimum rate constraints keeping in mind ATM networks. In .Therateallocatedtosession inlink , ,underrateal- SectionIII-C,weinvestigateusingsimulationstheperformance location isthemaximumoftheratesallocatedtothevir- of the distributed algorithm. In Section IV, we discuss several tualsessionsin ,i.e., .Then features of the fairness framework. We conclude in Section V. the total bandwidth consumed by all sessions traversing We have summarized the notations used throughout the paper link cannotexceed ’scapacity,i.e., . in Symbol Table I. Unless otherwise stated, the proofs can be Inotherwords, foundintheAppendix. capacitycondition II. NETWORKMODEL Weconsideranarbitrarytopologynetworkwith multicast Fig. 1 illustrates an example network with a few capacity and sessions.Amulticastsessionisidentifiedbyanumber ,andis maximumandminimumrateconstraints.Weassumeherethat SARKARANDTASSIULAS:FAIRDISTRIBUTEDCONGESTIONCONTROLINMULTIRATEMULTICASTNETWORKS 123 • Thevirtualsession hasthemaximumrate amongstall virtualsessionsofthesamesessiontravelingthelink,i.e., ,where isthesessionofthevirtualses- sion . • If any other virtual session traversing link has a rate higherthan thatofvirtual session , then ’srate isless thanorequaltotheminimumrateofsomevirtualsession in .Thus,if then . RefertoFig.1foranexampleofbottlenecklinks. Fig.1. Thenumbersinbrackets()denotethecapacitiesoftherespectivelinks. Thecapacityconstraintforlinke ismax(r ;r )+r (cid:20)7andthatforlink Wenowcomparethedefinitionofabottlenecklinkwiththat e isr +r (cid:20) 6:5.Theminimumandthemaximumrateconstraintsare intheunicastcontext.Firstassumethattheminimumratecon- 4 (cid:20) r (cid:20) 5,1 (cid:20) r (cid:20) 1and0 (cid:20) r (cid:20) 5.Themaxminfairratevector straintsdonotexist.Inaunicastnetwork,alinkisbottlenecked is(4,3.5,3).Underthemaxminfairrateallocation,linke isbottlenecked w.r.t.virtualsessions(v;u )and(v;u )ande isbottleneckedw.r.t.virtual w.r.t.asessionifitscapacityisfullyutilizedandnootherses- session(v;u ).Linke isnotbottleneckedw.r.t.anyvirtualsessionbecause siontraversingthelinkhasagreaterrate.Inamultiratemulticast itscapacityisnotfullyutilized.Consideranotherfeasibleratevector(4,4.5,2). network,thedefinitionforabottlenecklinkcanbeobtainedby Nownolinkisbottleneckedw.r.t.virtualsession(v;u ). replacing session with virtual session in the above definition. Nowassumethattheminimumrateconstraintsexist.Intheuni- theminimumraterequirementsofallsessionscanbesatisfied. cast context,let be the bottleneck link of session . Thenthe Ifnot,thenthenetworkcaneithernotacceptthesessionorlower capacityoflink isfullyutilizedlikeinthemulticastcase.How- theminimumrateguarantee. ever,asession traversing canhavegreaterbandwidththan , Afeasiblerateallocationvector ismaxminfairifitsatisfies butthen ’sratemustequal ’sminimumrequiredrate.Inthe the following property with respect to any other feasible rate multicast case, let be the bottleneck link of virtual session . allocationvector :ifthereexists suchthatthe thcomponent Nowthedifferencewiththeunicastcaseisthatavirtualsession of isstrictlygreaterthanthatof ,thenthereexists traversinglink canhavegreaterbandwidththan ,and ’srate suchthatthe thcomponentof , islessthanorequaltothe canbe greaterthan ’sminimum requiredrate.Theconstraint thcomponentof , andthe thcomponentof inthiscaseisthat ’sratemustbelessthantheminimumrate isstrictlylessthanthe thcomponentof .The requirementofitssessioninlink , ,andvirtualsessions bandwidthallocationsaccordingto arelesseventhanthose and mustbelongtodifferentsessions.ConsiderFig.1foran accordingto insomesense.RefertoFig.1foranexampleof example.Let havecapacity6.5unitsnow.Lettheminimum amaxminfairallocation. raterequirementsbe(0,4,0).Consideranallocation(4,4,2.5). Lemma1: Maxminfairallocationexistsuniquely. Link is bottlenecked with respect tovirtual session 3. Note Proof of lemma 1: We will present an algorithm which that .However,virtualsessions1and attainsthemaxminfairallocationin iterations(Theorem1). 3areindifferentsessions,and . Thisprovestheexistenceresult. Lemma 2 (Bottleneck Lemma): A feasible rate vector is Suppose two different rate allocations and are both maxminfairiffeveryvirtualsessionhasabottlenecklink. maxminfair.Thereexistsacomponent suchthat . Remark: TheBottleneckLemmaservesasatestformaxmin Without loss of generality . Since is maxmin fairnessofafeasiblerateallocationvector.Itindicatesthatifa fair, there exists such that and . Thus, ratevectorismaxminfair,thentherateofavirtualsession isat . Since , from the maxmin fairness of least forsomelink initspathiftherearenominimum , there exists such that and . Thus, rate requirements. In presence of minimum rate requirements, . Continuing similar arguments, we get an infinite this lower bound becomes , sequence such that , where isthesetofsessionstraversinglink whosesession and . It follows that the elements in this sequence ratesinlink aregreaterthan ’srate. are all distinct. This contradicts the fact that the number of componentsisfinite. Henceforth, we shall ignore the maximum rate constraints. III. A DISTRIBUTED SCHEME FOR COMPUTATION OF This does not cause any loss of generality because maximum THEMAXMINFAIRRATES rateconstraintscanbeincorporatedbyaddingartificiallinksbe- We now present a distributed scheme for computing the tweenreceiversandtherestofthenetwork.Thecapacityofan maxmin fair rates. We first describe the basic algorithm and artificiallinkequalsthemaximumrateofthereceiver.Thesize thendiscussitsdistributedimplementation. of the augmented network and hence the computational com- plexityforthefairratesissimilartothatofthegivennetwork. Next, we introduce the concept of bottleneck links. A link A. BasicAlgorithm is said to be bottlenecked with respect to a virtual session Wedescribethebasicalgorithmhere.Wefirstpresentadefi- traversing ifthefollowingconditionshold. nition.InTableII,wehavesummarizedothernotationsusedin • Capacity of link is fully utilized, i.e., the sum of the thisalgorithm. ratesallocatedtothesessionstravelingthelinkequalsthe Avirtualsession issaturatedunderratevector ifthere capacityofthelink: . existsa link inits pathsuchthat ’scapacityis fullyutilized 124 IEEE/ACMTRANSACTIONSONNETWORKING,VOL.13,NO.1,FEBRUARY2005 TABLE II SUMMARYOFSYMBOLSUSEDINTHERATECOMPUTATIONALGORITHM and hasthemaximumrateamongstallvirtualsessionsofits saturated sessions passing through sessionsthattraverse ,i.e., link , . 10. Go to step (2). and Wenowdescribethealgorithm’soperation.Themaxminfair Asessionissaturatedinalink ifallthevirtualsessionsofthe bandwidth is computed via an iterative procedure. The algo- sessiontravelinglink aresaturated. rithm classifies every virtual session as either saturated or un- Now,wepresentthealgorithm. saturated.Initially,allvirtualsessionsareunsaturated,andtheir status change from unsaturation to saturation. Ignore the min- imumrateconstraintsfortheinitialintuition.Atthebeginning 1. , , , ofaniteration ,everylink computesa“fairshare”forevery link , virtual virtual session traversing (“link control parameter” ) as session . per step 3. This fair share is assigned to every virtual session 2. traversing iftherearenobandwidthconstraintsonotherlinks. 3. For every link in the network, com- Thebandwidthrestrictionsofotherlinksareconsideredasfol- pute the link control parameter . lows. A virtual session is allocated a rate equal to the min- If , then is the maximum imumofthelinkcontrolparametersonitspath.Inpresenceof possible , which satisfies the equa- minimum rate requirements, a link additionally computes the tion, , sessionlinkparameter, foreverysession traversingthe else . Here, link,as .Thereafter,avirtualsession .1 isallocatedarateequaltotheminimumofitssessionlinkpa- 4. Compute for all virtual ses- rameters in its path. Now, is the bandwidth allocation in sions , where , iteration . The algorithm subsequently checks the saturation if , else . conditionforeachunsaturatedvirtualsession.Itturnsoutthat 5. For every link in the net- whenavirtualsessionissaturateditsbandwidthismaxminfair, work compute the session rate in anditsbandwidthallocationdoesnotchangesubsequently.The link , for every session in , algorithmterminatesifallthevirtualsessionsaresaturated,oth- . erwisethereisatleastonemoreiteration.Inthelattercase,the 6. Compute the set of virtual sessions algorithmmakescomputationswhichareusedinthenextiter- unsaturated after the th iteration, ation.Wedescribethemnow.Thebandwidthconsumedbythe s.t. saturated sessions in a link, if any, are computed. This band- and . width is subtracted from the link capacity, and the link con- 7. If , i.e., all virtual sessions trol parameters are recomputed in the next iteration using this are saturated, the algorithm termi- residual capacity. We illustrate the operation of the algorithm nates, else go to next step. withanexample. 8. For every link , compute the set of Example 3.1.1: Consider the network of Fig. 1. The max- unsaturated sessions passing through imumrateconstraintsdonotexist.Linkcontrolparametersare link at the end of the th itera- asfollows: , , , , tion: , .Now,thesessionlinkcontrolparam- . eters are as follows. , , , 9. For every link , for which , , , , , compute the bandwidth consumed by the . Computing the s as per step 4, we have , , . Observe that virtual 1This computation needs to be done for all unsatu- rated sessions traversing link l only. sessions 1 and 3 are saturated, while virtual session 2 is not. SARKARANDTASSIULAS:FAIRDISTRIBUTEDCONGESTIONCONTROLINMULTIRATEMULTICASTNETWORKS 125 .Thus,session2issaturatedonalllinks.Session1 updatethesaturationstatusofthesessionsandthereceiversat isunsaturatedononlythoselinkswhichareonthepathofvir- theintermediatenodesandthereceivers,respectively. tualsession2. , , Initially,allthesessionsandvirtualsessionsareunsaturated. if . , and . Receivers send backward rate packets toward the respective Computationsforthenextiterationareasfollows: , sourcewithlargeratevalues.Linkscomputetheirlinkcontrol , ,and fortherestofthe parameters. The intermediate nodes modify the rate values in links. Now, , , . Thus, the backward rate packets as they travel toward the source. . , . Now virtual session The rate value in a backward rate packet is decreased to the 2isalsosaturated.So andthealgorithmterminates. minimumofthelinkcontrolparameterandthecurrentvaluein Theratesobtaineduponterminationare(4,3.5,3). theratepacket.Thenodesrecordthesemodifiedratevaluesas Theorem1: Thealgorithmyieldsamaxminfairratealloca- thesessionlinkrates.Afterthismodification,nodesmergethe tioninatmost iterations,where isthenumberofvirtual backwardratepacketsofasession.Themergedbackwardrate sessions. packet is transmitted toward the source with a rate value that The intuition behind the result is as follows: maxmin fair equals the maximum of those in the individual backward rate sharingimpliesthatifthereare sessionssharingalink,each packets. sessionshouldgeta“fairshare”ofthelinkbandwidth.Ifases- A sessionsource generates aforward rate packetafter itre- sionisconstrainedtohavearatelessthanitsfairsharebecauseit ceivesabackwardratepacket,withthesameratevalueasinthe isassignedalowerbandwidthonanotherlink,thentheresidual incoming backward rate packet, and transmits it downstream. bandwidth is split fairly among other sessions. This is exactly Onceaforwardratepacketreachesanode,thenodeupdatesthe whatthealgorithmdoes. session link rates in each of its outgoing links, i.e., sets them Let be the setof links. Everystep ofthis algorithm has a equaltotheminimumoftheratevalueintheincomingforward complexity of . Since the algorithm must terminate ratepacketandthecurrentsessionlinkrateintheincominglink. in iterations, the overall complexity of this algorithm is Thismodifiedvalueisthecurrentiterationsessionlinkrateof . thebasicalgorithm.Thenodethenmulticaststheforwardrate Thealgorithmterminatesinatmost iterationsinaspecial packetineachofitsoutgoinglinks onthepath ofthesession, case,i.e.,ifallthevirtualsessionsofthesamesessionsharing witharatevalueequaltothatinthemodifiedsessionlinkrate a link have the same minimum rate requirement ( if field of the session. When a receiver receives a forward rate and )(Lemma3).Thisconditionon packet, it records its rate and sends a probe packet toward the minimumratesalwaysholdsinunicastnetworksbecauseevery sourcetoqueryitssaturationstatus. sessionhasonlyonevirtualsession,andinmulticastnetworks Anintermediatenodedeterminesthesaturationstatusofthe withoutanyminimumraterequirements. sessioninthelinkwhenitreceivesaprobepacket.Anodere- Lemma3: Thealgorithmterminatesinatmost ceives probe packets of a session in each of its outgoing links iterations, if for all , such that and on the path of the session. It merges all of these into a single . probe packet, and sends it upstream. The content of the probe packet reflects the saturation status of the session in the links B. DistributedImplementationoftheBasicAlgorithm originatingfromthenode[19]. Weoutlinethedistributedimplementationofthebasicalgo- Whenasourcereceivesaprobepacket,itgeneratesa“satura- rithm presented in the previous subsection. The details can be tion”oran“unsaturation”message.Thenatureofthemessage foundintechnicalreport[19].Wewillexploitthefactsthat:1) dependsonthecontentsoftheprobepacket[19].Theinterme- the computation of the session link parameters of sessions in diatenodesandthereceiversupdatethesaturationstatusofthe any link needs information only about the saturation status of sessionsbasedonthenatureofthismessage.Areceiversends thesessionstraversingthelinkandthepreviousiterationrates a backward rate packet upstream on receiving an unsaturation oftheunsaturatedsessionstraversingthelink,and2)avirtual messagebutdoesnotsendanyfurthercontrolmessageifitre- session(receiver)candetermineitsrateandsaturationstatusif ceivesasaturationmessage.Thebackwardratepacketsstarta itknowsthesessionlinkratesandthebandwidthutilizationsin newiteration. thelinksonitspath. Complexity: The distributed implementation terminates in Every node maintains an information record for each of its unitsoftimewhere isthemaximumroundtripdelay outgoinglinks .Therecordmaintainsthefollowingentriesfor fromasourcetoareceiverinthenetwork.Thecomplexitycan everysession traversinglink :1)minimumsessionlinkrate: be analyzed as follows. Let a receiver send a backward rate ;2)sessionlinkrate;3)ratebit;and4)saturationentry.Every packet at time . The forward rate packet returns to the linkalsostoresitslinkcontrolparameter.Notethatthestorage receiverin unitsoftime.Itsendsaprobeafterreceivingthe in a record does not maintain separate information about the forwardratepacket.Thesaturationmessagereachesthereceiver virtualsessionsofthesession.Nodesinterchangecontrolmes- in unitsoftime.Thus,oneiterationiscompletedin units sagesduringthedistributedcomputation.Thecontrolmessages oftime.AsTheorem1indicates,anynodeneedstoperformat are:1)backwardratepackets;2)forwardratepackets;3)probe most iterations.Notethattheterminationtimedoesnotex- packets;and4)saturation/unsaturationmessages.Thefirsttwo plicitlydependonthenumberoflinks,whereasthecomplexity areusedtoupdatethesessionlinkratesattheintermediatenodes of the basic distributed algorithm is . However, the andthereceiverratesatthereceivers.Thelasttwoareusedto roundtripdelay dependsonthenumberoflinksinthepath 126 IEEE/ACMTRANSACTIONSONNETWORKING,VOL.13,NO.1,FEBRUARY2005 of a session. The convergence time for the distributed imple- mentation does not depend on the total number of links in the networkasmanyofthelinkcomputationscanbeperformedin parallel.Theroundtripdelaycombinesthepropagationandthe processing delays. The worst case bound of can be- come large for a large network. However,as our experimental resultsindicate,fasterconvergenceisattainedinpractice. C. SimulationResults Usingsimulation,weevaluatetheperformanceofthescheme presentedinSectionIII-Binadynamicnetworkwheremulticast groupmembershipschange.Simulationresultsindicatethatthe convergence time is much less than the analytical worst case bound of , and also corroborate that the control mes- Fig. 2. We study the relative computation error in a dynamic network sageoverheadislow. with frequent session membership changes. Mean time between the session Weconsidera15-session400-noderandomnetworkforex- membershipchangesis171.33s. perimentalevaluation.Nodesarepointsona20 20grid.There exists an edge between any two nodes with a probability 0.2). Thus, even in presence of frequent session membership that depends on the euclidean distance between the nodes changes, the computed rate of an average receiver is close to , where is the decay constant. We as- themaxminfairrate.Inthisfigure,areceiver’saveragerateis sume . We adopt this edge probability model because alwaysabove80%ofitsmaxminfairrate.Themaximumrela- distantnodesarelesslikelytohaveanedgebetweenthem.The tiveerrorissomewhathigher,indicatingthattheconvergenceis sourceofeverysessionhasbeenselecteduniformly.Anodeis slowerforafewreceivers. areceiverofasessionwithagivenprobabilitywhichwas0.02 Fig. 3(a) plots the percentage of receivers which attain the inthiscase.Thesessionrouteconsistsoftheshortestpathsbe- maxminfair rate before the nextchange versusthe mean time tween the source and the destinations. Propagation delay in a betweenchanges.Asexpected,thispercentageislowforahigh link is equal to the euclidean distance between the end nodes frequencyofmembershipchange,andincreaseswithdecrease ofthelink.Maximumround-trippropagationdelayinthisnet- in the frequency of membership change. However, on an av- work is 45.54 s. We ignore processing delays. Every link is erage,around80%receiversconvergetotheirmaxminfairrates, bidirectional.Receiversjoinandleavethesessionsrandomlyas whentheaveragetimebetweenchangesequal500s. per Poisson random processes, and do not have any minimum Fig.3(b)presentsthecurveofaverageconvergencetimefor raterequirement.Linkoccupancy(i.e.,thenumberofsessions thereceiverswhichconvergetotheirmaxminfairrates,inbe- traversingalink)changesasreceiversjoinandleave.Maximum tweentwomembershipchangeeventsversusthemeantimebe- number of receivers at any time is 96 and when all these re- tween changes. This convergence time is low for a high fre- ceiversarepresenttheaveragelinkoccupancyis1.118557and quencyofmembershipchange,becausewhensessionmember- themaximumnumberofsessionstraversingalinkis5. ship changes rapidly, only the receivers who need little time First, we examine the convergence time of the algorithm in to converge to the maxmin fair rates, actually converge to the thisdynamicnetwork.Thecomputationrestartswheneverthere maxminfairrates,andothersdonotconverge.Asthefrequency isasessionmembershipchange.Thecomputedrateforauser of membership change decreases, most of the receivers’ rates may not converge to its maxmin fair rate before the next ses- convergetothemaxminfairvaluesbetweenthechangeevents, sion membership change.Thisoccurswhen there arefrequent and the convergence time represents the convergence time of changes. So we studied the discrepancy from the maxmin fair an average receiver in the network. When the mean time be- rateatanytime.Ifthemaxminfairrateofareceiver attime tweenchangesequal2259.34s,around95.6%receivers’rates is ,andthecomputedrateattime is ,thenrelative convergetothemaxminfairvaluesbetweenchangesonanav- computationerrorforreceiver is attime erage.Themeanconvergencetimeoftheconvergingreceivers .Fig.2plotsamovingaverageofthemaximumoftherelative in this case is 102.81 s. This represents the convergence time errors ofthe receiversand the averageofthe relative errorsof of an average receiver, because the percentage of converging thereceivers.Themaximumandtheaveragearetakenoverall onesisquitehigh.Onanaverage,around47receiversaremem- currentlyactivereceivers,andthemovingaverageiscomputed bersofdifferentsessionsatanytime.Thus,worstcaseconver- with a window of 5000 s. The algorithm restarts and the rela- gencetimeof is4280.76.Similarly,convergencetimeis tivecomputationerrorincreasesto1everytimethereisases- 155.89 s in the same random network with 96 receivers. The sion membership change. Subsequently, the relative computa- worst case bound is 8743.68 in this case. Thus, convergence tionerrordecreasessteadilytillthenextchangeevent.Sincethe time is normally significantly better than the analytical bound timewindowislarge,themovingaverageoftheerrorsdoesnot of . droptozero.Themovingaverageexhibitssharpincreaseover Nowweexaminethemessageexchangeoverhead.Wemea- the periods with more frequent session membership changes. sure the number of control message bits traversing a link per However,averageerrorsaresmallingeneral(alwayslessthan second. The control message comprises of rate packets, probe SARKARANDTASSIULAS:FAIRDISTRIBUTEDCONGESTIONCONTROLINMULTIRATEMULTICASTNETWORKS 127 changeslessfrequently,alargernumberofreceiverssaturatein betweenthechanges,andareceiverstopsexchangingmessages afteritsaturates.So,foralowfrequencyofmembershipchange, receiversdonotsendcontrolmessagesforlongperiodsoftimes between the changes. Fig. 3(c) indicates that the message ex- change complexity is indeed low for this algorithm. This plot doesnotshowthebytesconsumedintheTCPandIPheaders. IfIP headersare considered(20 octet[5]), then the maximum overheadisaround58b/sinsteadof10,whichisstilllow. IV. DISCUSSION Inthissection,wediscusscertaingenericfeaturesofthefair rateallocationprocedure. Allocation of Rates: First we discuss how to attain contin- uousrates.Wewouldalsodiscusshowtoallocatethefairrates oncetheyarecomputed.Severalapproachesarepossible. 1) Eachsourcetransmitsatarateequaltothemaximumof thefairratesallocatedtoitsreceivers.Asourceknowsthis maximum value from the distributed computation. Rate adaptive video gateways are used at the forking points totranscodethesignalintoalowerbitratesuchthatthe rateineverylinkisequaltothemaxminfairsessionlink rate[1],[21].Thesegatewayscontroltheratebydropping framesasnecessary.Activenetworkarchitecture[23]pro- videsaframeworkfordeploymentofrateadaptivevideo gatewayswithinthenetwork. 2) Asecondsolutionistouselayeredencoding[4],[9],[24]. An information stream is partitioned in to a base layer, comprisingtheinformationneededtorepresentthelowest fidelitymediaandanumberofenhancementlayers.Some numberoftheseenhancementlayersarecombinedbythe decoder with the base layer to recover a signal of incre- mentalfidelity.Oftentimes,layerbandwidthcanbetuned to provide the desired rates, e.g., by using an embedded code. In an embedded code, any prefix of a valid code- word is a valid codeword and if the code is chosen ap- propriately, the prefix codeword corresponds to a lower quality version of the longer codeword. Hence, one can triviallyformalayeredcodefromanembeddedcodeby breakingupthelongercodeatarbitraryboundaries.More- over,onecangenerateasmanylayersasdesiredandtune theratesofthedifferentlayerssincetheembeddedcode canbebrokenatarbitrarypoints.McCanne[15]presents Fig.3. Westudytheconvergencespeedandthecontrolimformationexchange alowcomplexityvideocodecwhichusesaninstanceof complexity for a dynamic network with session membership changes. The distributed computation does not always converge in between the changes, an embeddedcode,PVH(progressivevideowithhybrid on account of frequent session membership changes. (a) plots the average transform),andisthusamenabletodynamicratecontrol. percentageofreceiversattainingtheirrespectivemaxminfairratesinbetween Oncethefairratesarecomputed,thesourcepartitionsits thechanges.(b)plotstheaverageconvergencetimeofthesereceivers.(c)plots themaximumcontrolinformationsentperunittimeinanylink,themaximum signaltoformasmanylayersastherearedistinctreceiver is taken over all links. This control information comprises of rate packets, ratesandtunesthelayerbandwidthtomatchthereceiver probepacketsandsaturation/unsaturationmessages. rates. Each layer is sent as a separate multicast group, andareceiver’sfairratedeterminesthenumberoflayers packetsandsaturation/unsaturationmessages.Weplotthemax- deliveredtoareceiver.Thetotalnumberoflayerstrans- imum of this overhead per second against the mean time be- mittedacrossalinkequalsthemaximumnumberoflayers tweenmembershipchangesinFig.3(c).Themaximumistaken allotted to the session receivers downstream. Layers are over all links. This maximum is always less than 10 b/s, and dropped at the forking points selectively. The advantage decreases further with a decrease in the frequency of session ofthisapproachisthatitisanend-to-endone,whilethe membershipchange.Thisisbecauseasthesessionmembership previousrequiresnetworkparticipation. 128 IEEE/ACMTRANSACTIONSONNETWORKING,VOL.13,NO.1,FEBRUARY2005 We would like to mention that neither of these approaches have short lives, but operates very well if sessions have long is crucial for attaining the fair rates. We described the above lives,andsessionmembershipschangelessfrequently.So,the approachesaspossibilitiesforattaininganydesiredbandwidth target applications are video conferencing, distance learning, granularity.Thebasicassumptioninourcaseisthattheband- etc.,andtheapplicationdomainscanbeintranetsandVPNs. width can be allotted in reasonably fine granularity. If a per- An interesting direction for future research will be to adapt fect match between the layer bandwidth and the maxmin fair thealgorithmforapplicationswithalargenumberofreceivers receiver rates is not feasible, then the allocated rates can still that join and leave frequently, e.g., a CNN live telecast of an approximatethecomputedvalues.Thereceiverscansubscribe Olympicevent.Wepresentpossibleapproacheshere.Receivers toasmanylayersaspermittedbythecomputedfairrate.Forin- do not have minimum rate constraints in these applications. stance,ifthesourceofsession1inExample3.1.1.(Fig.1)can Maxminfairratesarecomputedperiodically.Ifareceiverleaves transmit unit bandwidth layers only, then receivers and anapplicationinbetween,thenthemaxminfairratesofthere- will be allotted four and three layers, respectively. If the layer mainingonesarenotaltered.Thismaycauseunder-utilization granularityisfine,thentheapproximationwillbesatisfactory. oflinkbandwidth.Theamountofunder-utilizationwilldepend If the layer granularity is coarse, then the assumption of a on the frequency of the computation and the frequency of the continuous feasible set is not valid, and discrete feasible sets leaveevents.Notethatthelinkbandwidthwillbeunder-utilized mustbeconsidered.Fairnessinadiscretefeasiblesetisvastly onlyinthoselinkswherethereceiverwhichlefthadthehighest differentfromfairnessinacontinuousfeasibleset,andneither bandwidthamongallreceiversinitssession.Considerthecase isaspecialcaseoftheother.Forexample,theusualnotionsof whena receiverjoins anapplication.Assumethatthe receiver fairnesseitherdonotexist(e.g.,maxminfairness)orarecom- hasadedicatedbandwidthinitslastlink,andotherexistingre- putationallyNP-hardinthediscretecase,whilethesearepoly- ceiversofthesameapplicationshareotherlinksinitspath.This nomialcomplexitycomputableinthecontinuouscase.Weaker istypicallythecase,e.g.,quiteafewuserssubscribetoaCNN notionsoffairnesslikemaximalfairnessneedtobeusedinthe event in each domain. Maxmin fair bandwidth of the new re- discretecase.Ifthegranularityisfine,thenthecomputationof ceiveristheminimumofitsdedicatedbandwidthinitslastlink themaxminfairallocationwithacontinuoussetassumptionand andthebandwidthallocatedtotheapplicationinthelinkbefore subsequentapproximationasdiscussedbeforegivesabetterap- thelast.Themaxminfairbandwidthoftheremainingreceivers proximation of the maxmin fair rates than the weaker notion. arethesameasbefore.Ifthelastlinkisasharedmedium,e.g., Also, the computation strategy in the discrete case is different an ethernet connection, then it is likely that a few other users from that in the continuous case. We address the discrete sce- usingthesharedmediumhavealreadysubscribedtotheapplica- narioin[20]. tion.Inthiscase,duetothemulticastnatureofthetransmission, Scalability:Therateallocationneedstoconsidervirtualses- thenewreceivercanbeassignedthesamebandwidthastheex- sionrates.Nevertheless,thedistributedalgorithmisdesignedso isting receivers without introducing any additional load in the thatthesystemdoesnotneedseparateentriesforseparatevir- links.Thechallengewillbetoaccommodatethecaseswherea tualsessionstraversingalink.Thus,theroutersareobliviousof receiveristheonlyoneofitsapplicationinalinkwhichisnot thenumberandidentityofthereceiversdownstreamofalink. thelasthop.Inthesecaseswhicharelikelytobeinfrequent,the Also,theamountofcontrolmessagesofasessioninalinkinan residualbandwidthintheapplicablelinkscanbeusedtoserve iterationisindependentofthenumberofreceiversdownstream thenewreceiverintheintervalsbetweenthecomputations.Nor- of the link. mally, the links haveunused bandwidth to accommodate tran- The system needs per-session states at the intermediate sients.ThenewreceiverscanalsoconductRLM-likelayerjoin nodes. Arguing in the lines of Grossglausser and Bolot [10], experimentstodecidetheamountofsubscriptionintheinterim implementing a multicast/multilayer service requires per-ses- period.Theexistingreceiverscanutilizetheresidualbandwidth sion state in the routers anyway. So, the incremental cost of generatedbytheleaveeventsbythesamemechanism.Thedif- maintainingsomemoreinformationforeachsessionandusing ferencebetweenthebandwidthallocatedtothereceiversandthe this additional information in the fair rate allocation policy is maxminfairallocationwilldependonthefrequencyofcompu- much smaller than that in the unicast case. However, if these tationandthenetworktopology. additional session states become an issue, then this policycan Wenowpresenttechniquesforattainingthemaxminfairrates be used in the VPNs and intranets, and state aggregation may withpartialrecomputationintheeventoftopologychanges.Let beresortedtointhebackbones.Thealgorithmwillprovidethe thealgorithmexecuteits thiteration2whenthereisachange. samefairnessguaranteesifindividualsessionshavebandwidth Wefirstdetermineavalue . guarantees from the backbone. In this case, the backbone will • Whenareceiver leaves, . be treated as a single link with only one session and capacity • Whenareceiver joins, ( is equalingthebandwidthguaranteeforthesession. theupdatedsetofsessionstraversingalink ). IntermediateFeasibility:Theintermediateratesarealways • Ifthecapacityofalink changes, . feasible (Lemma 7). So, during the computation, sources can Ineachofthesecases,anyvirtualsessionwithrategreaterthan stilltransmitatintermediaterates,withoutcausingcongestion. orequalto isconsideredunsaturated.Thesaturationstateand DynamicVersion:Thecomputationsmustrestartifthenet- the rate of any other virtual session is not changed. In the last work changes, i.e., if a new receiver joins a session, or an ex- case,if increases,and isthe onlybottlenecklinkofsome istingreceiverleavesasession,oriftheavailablelinkcapacity changes.Sothisalgorithmisnotappropriatewhenthesessions 2Algorithmmayalsohaveterminated.Inthatcasek=M. SARKARANDTASSIULAS:FAIRDISTRIBUTEDCONGESTIONCONTROLINMULTIRATEMULTICASTNETWORKS 129 virtual session , the saturation state of is set unsaturated as Ifalllinkson ’spathsatisfythefirsttwoconditions,then ’s well. ratecanbeincreasedwithoutdecreasingthatofanyothervirtual Thealgorithmcontinues(startsifithadterminated)withthe session.Ifsomelinkson ’spathsatisfyonlythelastproperty, updatedsaturationstatesandratevalues.Thealgorithmcanuse then to increase ’s rate, we must decrease the rate of some the previously computed rates and the saturation states of all virtualsessionshavinggreaterratethanthatof .Howeverwe virtual sessions with rates less than . The rates of the other canstillincrease ’srate ,withoutdecreasingthatofanyother virtual sessions are not used as they are rendered unsaturated. virtualsessionhavingratelessthanthatof andstillmaintain The output still converges to the maxmin fair rates. The intu- feasibility.Thus, isnotamaxminfairratevector. itionbehindtheresultisthatanychangeinthemaxminfairrate Let beafeasibleratevectorsuchthateachvirtualsession of a virtual session , , does not affect the rates of receivers has a bottleneck link. Consider any other feasible rate vector thathavemaxminfairrateslowerthan ,butmayaffectthose . Let there exist a virtual session such that . Let ofreceiverswithhighermaxminfairrates.Thisisbecausethe be a bottleneck link for virtual session . maxminfairrateofareceiverisdeterminedbythatofferedby . The last equality follows from the property of a its bottleneck link, and a receiver does not share its bottle- bottlenecklink.Since isabottlenecklinkw.r.t.virtualsession neck link with any other receiver which has maxmin fair rate ,itscapacityisfullyutilized,i.e., .Fromthe higherthanthatof .Thus,ifthefrequencyofdeparture/arrival feasibility of , . Since , isthe sameforallreceivers,then 50%ofthecomputationcan and (sincevirtualsession traversesthroughlink bere-usedonanaverage. ), it follows that there exists a session such that . Thealgorithmcanalsoterminateprematurelyafterafewit- Fromfeasibilityof , .Thus, .Thereexists erationssoastoreducethecomputationcomplexity.Wepresent virtual session such that , . Thus, someboundsontherateofvirtualsessionsintheintermediate .Fromthelastconditionforalinktobebottleneckw.r.t. steps for this purpose. The proof for theorem 1 shows that at avirtualsession, .Now .Thefirst least one virtual session saturates in every iteration. A virtual inequality follows since . The second inequality and sessionattainsitsmaxminfairratebythetimeitsaturates.Thus, thelastequalityhavebeenarguedbefore.Thus,if ,then atleast virtualsessionsattaintheirmaxminfairratesattheend thereexistsavirtualsession suchthat .Hence, of iterations.Iftherearenominimumrateconstraints,thenit isamaxminfairratevector. followsfromthe prooffor Lemma3thatat theend ofthe th iterationtherate ofanyvirtualsession isupperbounded APPENDIX B byitsmaxminfairrateandlowerboundedbytheminimumofits PROOFOFCORRECTNESSANDTERMINATIONGUARANTEEFOR maxminfairrateandthe thsmallestcomponentinthemaxmin THEALGORITHMFORCOMPUTATIONOFMAXMINFAIRRATES fairallocation.Thus,partialconvergenceisattainedifthealgo- (THEOREM1) rithmterminatesprematurely. Weoutlinetheproofasfollows.Weassumethatthesetoffea- sibleratevectorsisnonempty.Thefirstpartoftheproofshows V. CONCLUSION thattheoutputofthisalgorithmismaxminfair.Forthis,wefirst Wehavepresentedaquantitativeframeworkthatcanmodel showthatthelinkcontrolparametersincreasewitheveryitera- thefairallocationofbandwidthinthemulticastscenario,while tion(Lemma4).Thus,thevirtualsessionratesandthesession consideringbothfairnessamongthemembersofthesameand ratesdonotdecreaseinsubsequentiterations(Lemma5).Using different sessions. We have presented an algorithm for com- this,weshowthattherateallocationattheendofeveryiteration putingthemaxminfairratesinarbitrarymulticastnetworkswith isfeasible(Lemma7).Nextweshowthat,ifavirtualsessionsat- minimum rate requirements. We have also presented a frame- uratesinthe thiteration,ithasabottlenecklinkinsubsequent workforadistributedimplementationofthealgorithm. iterations(Lemma8).Sincethealgorithmterminatesonlywhen allvirtualsessionssaturate,eachvirtualsessionhasabottleneck linkwhenthealgorithmterminates.Therateallocationuponter- APPENDIX A minationisalsofeasiblebyLemma7.Thus,maxminfairnessof PROOFOFBOTTLENECKLEMMA therateallocationuponterminationfollowsfromtheBottleneck Proofoflemma2: Letavirtualsession nothaveabot- Lemma.ForthesecondpartofthetheoremweshowinLemma tlenecklinkunderafeasibleratevector .If traversesalink , 9thatthealgorithmterminatesinatmost iterations. thenatleastoneofthefollowingholds. Lemma4: If andthealgorithmhasnotterminatedin 1) Link hassomeunusedcapacity. iterations, . 2) Therateofsomeothervirtualsessionofthesamesession Proofoflemma4: Weprovebyinduction.Let .The as traversing isgreater thanthatof ,i.e.,bandwidth algorithmcannotterminatein0iteration. , . consumedbythesessionof inlink isgreaterthan ’s If , then , and the lemma holds rate. forlink anditeration1.Let .Sincethefeasibleset 3) Asession traversinglink consumesgreaterbandwidth of rate vectors is nonempty, . , for than in link . This bandwidth is greater than the min- everysession andlink .Thus, ,satisfiestheinequality imum rate of session in link . Virtual session .Since isthemaximumpos- doesnotbelongtosession . sible whichsatisfiestheaboveinequality, . 130 IEEE/ACMTRANSACTIONSONNETWORKING,VOL.13,NO.1,FEBRUARY2005 Theequalityfollowsfromtheinitializationof .Thus,the Remark: We will use this lemma in proofs of feasibility of lemmaholdsfor . rateallocations(Lemma7),thefactthateverysaturatedvirtual Letthelemmaholdforiterations and .We session has a bottleneck link (Lemma 8) and the fact that the showthatthelemmaholdsforthe thiteration.If , algorithmterminatesinfinitenumberofiterations(Theorem9). then . Let . Consider any virtual Proof of lemma 6: We prove (3) by induction. session traversing through link , i.e., . If .Thelastinequalityfollows , . Let since and , since .Thus, , . .Thus,(3)holdsfor . ,since , Let(3)holdfor .If , . and by induction hypothesis. If , Since ( and , ), .Thus, frominductionhypothesis . forall (1) Thus (2) since byLemma Thus,(3)holdsfor .Thus,(3)holdsforall byinduc- from tion.Now,(4)followsfrom(3)andthedefinitionof . Lemma7: Therateallocationattheendofthe thiteration isfeasible, . Proofoflemma7: Weprovebyinduction.Wefirstprove the lemma for . Since the set of feasible rate vectors is nonempty,arateallocationwhereeachvirtualsession ’srate equalsitsminimumrate,satisfiesthecapacityconstraints.Thus, satisfies the capacity constraints. Since , satisfiestheminimumraterequirements.Thus, isfeasible andthelemmaholdsfor . Lettherateallocationattheendofthe thiteration, ,be feasible.Considerthe thiteration. . The first inequality follows from Lemma 5 and the last from the feasibility of . Thus, , for all virtual Thus, satisfies the inequality sessions . .Clearly isthemaximum if possible value of which satisfies the inequality and hence there exists a and . Hence, the sincethen resultfollowsfrominduction. Thus if Lemma5: if forallvirtualsessions . if forallsessions andlinks . if (5) Proof of lemma 5: Let . Now, , If ,from(5), . , , for all virtual sessions . The last inequality followssince .Thus, .Thus, theresultholdsfor .Let .If , fromthefeasibilityof . If , . If , . . Thus, . Thus, .Since ,and , .Thus, .Hence .Thesecondpartofthelemmafollowsfrom thefirstandthefactthat . Lemma 6: forallvirtualsessions from ofLemma and links and (3) Thus forallsessions links and (4)