ebook img

Quality-aware Scheduling for Key-value Data Stores PDF

102 Pages·2015·2.995 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Quality-aware Scheduling for Key-value Data Stores

SPRINGER BRIEFS IN COMPUTER SCIENCE Chen Xu Aoying Zhou Quality-aware Scheduling for Key-value Data Stores 123 SpringerBriefs in Computer Science Series editors Stan Zdonik, Brown University, Providence, USA Shashi Shekhar, University of Minnesota, Minneapolis, USA Jonathan Katz, University of Maryland, College Park, USA Xindong Wu, University of Vermont, Burlington, USA Lakhmi C. Jain, University of South Australia, Adelaide, Australia David Padua, University of Illinois Urbana-Champaign, Urbana, USA Xuemin (Sherman) Shen, University of Waterloo, Waterloo, Canada Borko Furht, Florida Atlantic University, Boca Raton, USA V.S. Subrahmanian, University of Maryland, College Park, USA Martial Hebert, Carnegie Mellon University, Pittsburgh, USA Katsushi Ikeuchi, University of Tokyo, Tokyo, Japan Bruno Siciliano, Università di Napoli Federico II, Napoli, Italy Sushil Jajodia, George Mason University, Fairfax, USA Newton Lee, Newton Lee Laboratories, LLC, Tujunga, USA More information about this series at http://www.springer.com/series/10028 Chen Xu Aoying Zhou (cid:129) Quality-aware Scheduling for Key-value Data Stores 123 ChenXu AoyingZhou Institute for Data ScienceandEngineering Institute for Data ScienceandEngineering EastChinaNormal University EastChinaNormal University Shanghai Shanghai China China ISSN 2191-5768 ISSN 2191-5776 (electronic) SpringerBriefs inComputer Science ISBN978-3-662-47305-4 ISBN978-3-662-47306-1 (eBook) DOI 10.1007/978-3-662-47306-1 LibraryofCongressControlNumber:2015940432 SpringerHeidelbergNewYorkDordrechtLondon ©TheAuthor(s)2015 Thisworkissubjecttocopyright.AllrightsarereservedbythePublisher,whetherthewholeorpart of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission orinformationstorageandretrieval,electronicadaptation,computersoftware,orbysimilarordissimilar methodologynowknownorhereafterdeveloped. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publicationdoesnotimply,evenintheabsenceofaspecificstatement,thatsuchnamesareexemptfrom therelevantprotectivelawsandregulationsandthereforefreeforgeneraluse. The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authorsortheeditorsgiveawarranty,expressorimplied,withrespecttothematerialcontainedhereinor foranyerrorsoromissionsthatmayhavebeenmade. Printedonacid-freepaper Springer-VerlagGmbHBerlinHeidelbergispartofSpringerScience+BusinessMedia (www.springer.com) Preface Key-value stores widely served as data platforms for various web applications, ranging from Facebook social networking based on Cassandra to Amazon online shopping hosted by Dynamo, providing a distributed solution to cloud computing andbigdatamanagement.Inmodernwebapplications,userexperiencesatisfaction determines the applications’ success rate. In order to answer web queries quickly, key-valuestoresgenerallyemployweakconsistencymodel.Thismodelrelaxesthe data consistency to improve system performance on query response. However, a drawback isthat data accessed by users might be stale. Hence, there isan intrinsic trade-off between query latency and data consistency, which has become a key factor in the design of large-scale data management systems. In particular, data consistency expresses as the freshness of data accessed by queriesatalocalnode.Clearly,thelatency/consistencytrade-offatnodelevelboils downtofindingthetrade-offbetweenquerylatency(i.e.,QualityofService(QoS)) and data freshness (i.e., Quality of Data (QoD)). In real application, different web queries or users would have different expectations in terms of QoS and QoD. Hence, how to optimize QoS and QoD by scheduling queries and updates in key-value stores becomes an important research issue. Our book comprehensively illustrates quality-aware scheduling in key-value stores. In addition, this work provides scheduling strategies and a prototype framework of quality-aware scheduler as well as a demonstration of online appli- cations. The book offers a rich blend of theory and practice. It is suitable for students, researchers, and practitioners interested in distributed systems, NoSQL key-value stores, and scheduling. Shanghai, China Chen Xu April 2015 Aoying Zhou v Acknowledgments This work is partially supported by the National Natural Science Foundation of China under grant Nos. 61332006 and 61232002, and the National 863 Program under grant No. 2015AA011508. The authors conducted the research at Shanghai Collaborative Innovation Center of Trustworthy Software for Internet of Things (ZF1213). During the process of writing this book, we refer to the scientific researches of many scholars and experts. Hence, we would like to express our heartfelt thanks to them. Over time, we have been accompanied by a number of people who helped to shape the outcome of this book in one way or another. Hereby, we want to take this opportunity to convey our gratitudes. We thank Fan Xia, Tao Zhu, and A/Prof. Minqi Zhou at East China Normal University for their countless discussions and comments. Meanwhile, we want to thank Dr. Manohar Kaul at Technische Universität Berlin, Prof. Kun Yue at Yunnan University, and the editors at Springer for their valuable feedback to improvethepresentationofthisbook.Inparticular,partoftheresearchinthisbook was done during Chen Xu’s visiting at The University of Queensland hosted by Prof.XiaofangZhouandDr.MohamedA.Sharaf.Wewouldliketothankthemfor their significant collaborations and insights. Also, we worked with many great colleagues at Institute for Data Science and Engineering,EastChinaNormalUniversity,onaday-to-daybasis.Thankyouguys for the great work atmosphere, your sense of humor, and for making the frequent long working hours not feel that bad. Lastbutnotleast,wewouldliketothankourownfamiliesforhavingourback. Thanks for all of your kind understandings and great supports. vii Contents 1 Introduction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.1 Application Scenarios . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.2 The Research Significance and Challenges. . . . . . . . . . . . . . . 5 1.3 Implementation Framework . . . . . . . . . . . . . . . . . . . . . . . . . 7 1.4 Overview of the Book. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 References. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 2 Literature and Research Review . . . . . . . . . . . . . . . . . . . . . . . . . 11 2.1 Metrics for Quality-Aware Scheduling. . . . . . . . . . . . . . . . . . 11 2.1.1 QoS Metrics. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 2.1.2 QoD Metrics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 2.2 Quality-Aware Scheduling in Data Management System . . . . . 15 2.2.1 Quality-Aware Scheduling in RTDBMS . . . . . . . . . . 15 2.2.2 Quality-Aware Scheduling in DSMS. . . . . . . . . . . . . 17 2.2.3 Quality-Aware Scheduling in RDBMS . . . . . . . . . . . 17 2.2.4 Quality-Aware Scheduling in Key-Value Stores . . . . . 19 2.3 Summary. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20 References. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 3 Problem Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 3.1 Background Knowledge. . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 3.1.1 Data Organization. . . . . . . . . . . . . . . . . . . . . . . . . . 26 3.1.2 Data Replication and Consistency. . . . . . . . . . . . . . . 26 3.1.3 User Queries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28 3.1.4 System Updates: State-Transfer Versus Operation-Transfer . . . . . . . . . . . . . . . . . . . . . . . . . 29 3.2 Problem Statement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30 3.2.1 QoS Penalty. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31 3.2.2 QoD Penalty . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32 3.2.3 Combined Penalty. . . . . . . . . . . . . . . . . . . . . . . . . . 33 ix x Contents 3.3 Summary. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34 References. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34 4 Scheduling for State-Transfer Updates. . . . . . . . . . . . . . . . . . . . . 37 4.1 On-Demand (OD) Mechanism . . . . . . . . . . . . . . . . . . . . . . . 37 4.1.1 WSJF-OD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 4.2 Hybrid On-Demand (HOD) Mechanism. . . . . . . . . . . . . . . . . 40 4.2.1 WSJF-HOD. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41 4.3 Freshness/Tardiness (FIT) Mechanism. . . . . . . . . . . . . . . . . . 41 4.3.1 WSJF-FIT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44 4.4 Adaptive Freshness/Tardiness (AFIT) Mechanism. . . . . . . . . . 45 4.4.1 Query Routing . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46 4.4.2 Query Selection . . . . . . . . . . . . . . . . . . . . . . . . . . . 48 4.4.3 WSJF-AFIT. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51 4.5 Popularity-Aware Mechanism. . . . . . . . . . . . . . . . . . . . . . . . 52 4.5.1 Populairty-Aware WSJF-OD . . . . . . . . . . . . . . . . . . 53 4.5.2 Populairty-Aware WSJF-HOD . . . . . . . . . . . . . . . . . 53 4.5.3 Popularity-Aware WSJF-FIT . . . . . . . . . . . . . . . . . . 54 4.5.4 Popularity-Aware WSJF-AFIT . . . . . . . . . . . . . . . . . 54 4.6 Experimental Study. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55 4.6.1 Baseline Policies. . . . . . . . . . . . . . . . . . . . . . . . . . . 55 4.6.2 Parameter Setting . . . . . . . . . . . . . . . . . . . . . . . . . . 56 4.6.3 Impact of Query Arrival Rate. . . . . . . . . . . . . . . . . . 58 4.6.4 Impact of Update Cost . . . . . . . . . . . . . . . . . . . . . . 59 4.6.5 Impact of Different QoS and QoD Preferences. . . . . . 60 4.6.6 Impact of Popularity . . . . . . . . . . . . . . . . . . . . . . . . 61 4.7 Summary. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62 References. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62 5 Scheduling for Operation-Transfer Updates. . . . . . . . . . . . . . . . . 65 5.1 Hybrid On-Demand (HOD) Mechanism. . . . . . . . . . . . . . . . . 65 5.1.1 WSJF-HOD. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66 5.2 Freshness/Tardiness (FIT) Mechanism. . . . . . . . . . . . . . . . . . 67 5.2.1 WSJF-FIT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72 5.3 Popularity-Aware Mechanism. . . . . . . . . . . . . . . . . . . . . . . . 73 5.3.1 Popularity-Aware WSJF-HOD . . . . . . . . . . . . . . . . . 73 5.3.2 Popularity-Aware WSJF-FIT . . . . . . . . . . . . . . . . . . 74 5.4 Experimental Study. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77 5.4.1 Parameter Setting . . . . . . . . . . . . . . . . . . . . . . . . . . 77 5.4.2 Impact of Update Arrival Rate . . . . . . . . . . . . . . . . . 78 5.4.3 Impact of Popularity and Approximation . . . . . . . . . . 79 5.5 Summary. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80 References. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81 Contents xi 6 AQUAS: A Quality-Aware Scheduler. . . . . . . . . . . . . . . . . . . . . . 83 6.1 System Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83 6.1.1 System Goals. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84 6.1.2 System Design. . . . . . . . . . . . . . . . . . . . . . . . . . . . 85 6.2 System Performance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87 6.2.1 Benchmark. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87 6.2.2 Evaluation Result . . . . . . . . . . . . . . . . . . . . . . . . . . 88 6.3 A Demonstration on MicroBlogging Application. . . . . . . . . . . 91 6.3.1 Timeline Queries in AQUAS . . . . . . . . . . . . . . . . . . 91 6.3.2 A Case Study. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91 6.4 Summary. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94 References. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94 7 Conclusion and Future Work . . . . . . . . . . . . . . . . . . . . . . . . . . . 95 7.1 Conclusion. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95 7.2 Future Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97 References. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.