Chapman & Hall/CRC Big Data Series : Big Data Management and Processing (1) PDF

489 Pages·2017·23.472 MB·English

Preview Chapman & Hall/CRC Big Data Series : Big Data Management and Processing (1)

Computer Science & Engineering
Chapman & Hall/CRC Big Data Series

From the Foreword

“Big Data Management and Processing is [a] state-of-the-art book that deals with a wide range of topical themes in the field of Big Data. The book, which probes many issues related to this exciting and rapidly growing field, covers processing, management, analytics, and applications... [It] is a very valuable addition to the literature. It will serve as a source of up-to-date research in this continuously developing area. The book also provides an opportunity for researchers to explore the use of advanced computing technologies and their impact on enhancing our capabilities to conduct more sophisticated studies.”
Sartaj Sahni, University of Florida, USA

“Big Data Management and Processing covers the latest Big Data research results in processing, analytics, management and applications. Both fundamental insights and representative applications are provided. This book is a timely and valuable resource for students, researchers and seasoned practitioners in Big Data fields.”
Hai Jin, Huazhong University of Science and Technology, China

Big Data Management and Processing explores a range of big data related issues and their impact on the design of new computing systems. The twenty-one chapters were carefully selected and feature contributions from several outstanding researchers. The book endeavors to strike a balance between theoretical and practical coverage of innovative problem solving techniques for a range of platforms. It serves as a repository of paradigms, technologies, and applications that target different facets of big data computing systems.

The first part of the book explores energy and resource management issues, as well as legal compliance and quality management for Big Data. It covers In-Memory computing and In-Memory data grids, as well as co-scheduling for high performance computing applications. The second part of the book includes comprehensive coverage of Hadoop and Spark, along with security, privacy, and trust challenges and solutions.

The latter part of the book covers mining and clustering in Big Data, and includes applications in genomics, hospital big data processing, and vehicular cloud computing. The book also analyzes funding for Big Data projects.

Big Data Management and Processing

Edited by
Kuan-Ching Li, Guangzhou University, China, and Providence University, Taiwan
Hai Jiang, Arkansas State University, USA
Albert Y. Zomaya, University of Sydney, Australia

CRC Press
Taylor & Francis Group
6000 Broken Sound Parkway NW, Suite 300
Boca Raton, FL 33487-2742

© 2017 by Taylor & Francis Group, LLC
CRC Press is an imprint of Taylor & Francis Group, an Informa business

No claim to original U.S. Government works

Printed on acid-free paper

International Standard Book Number-13: 978-1-4987-6807-8 (Hardback)

This book contains information obtained from authentic and highly regarded sources. Reasonable efforts have been made to publish reliable data and information, but the author and publisher cannot assume responsibility for the validity of all materials or the consequences of their use. The authors and publishers have attempted to trace the copyright holders of all material reproduced in this publication and apologize to copyright holders if permission to publish in this form has not been obtained. If any copyright material has not been acknowledged please write and let us know so we may rectify in any future reprint.

Except as permitted under U.S. Copyright Law, no part of this book may be reprinted, reproduced, transmitted, or utilized in any form by any electronic, mechanical, or other means, now known or hereafter invented, including photocopying, microfilming, and recording, or in any information storage or retrieval system, without written permission from the publishers.

For permission to photocopy or use material electronically from this work, please access www.copyright.com (http://www.copyright.com/) or contact the Copyright Clearance Center, Inc. (CCC), 222 Rosewood Drive, Danvers, MA 01923, 978-750-8400. CCC is a not-for-profit organization that provides licenses and registration for a variety of users. For organizations that have been granted a photocopy license by the CCC, a separate system of payment has been arranged.

Trademark Notice: Product or corporate names may be trademarks or registered trademarks, and are used only for identification and explanation without intent to infringe.

Visit the Taylor & Francis Web site at http://www.taylorandfrancis.com
and the CRC Press Web site at http://www.crcpress.com

Contents

Foreword
Preface
Acknowledgments
Editors
Contributors

Chapter 1  Big Data: Legal Compliance and Quality Management
Paolo Balboni and Theodora Dragan

Chapter 2  Energy Management for Green Big Data Centers
Chonglin Gu, Hejiao Huang, and Xiaohua Jia

Chapter 3  The Art of In-Memory Computing for Big Data Processing
Mihaela-Andreea Vasile and Florin Pop

Chapter 4  Scheduling Nested Transactions on In-Memory Data Grids
Junwhan Kim, Roberto Palmieri, and Binoy Ravindran

Chapter 5  Co-Scheduling High-Performance Computing Applications
Guillaume Aupy, Anne Benoit, Loic Pottier, Padma Raghavan, Yves Robert, and Manu Shantharam

Chapter 6  Resource Management for MapReduce Jobs Performing Big Data Analytics
Norman Lim and Shikharesh Majumdar

Chapter 7  Tyche: An Efficient Ethernet-Based Protocol for Converged Networked Storage
Pilar González-Férez and Angelos Bilas

Chapter 8  Parallel Backpropagation Neural Network for Big Data Processing on Many-Core Platform
Boyang Li and Chen Liu

Chapter 9  SQL-on-Hadoop Systems: State-of-the-Art Exploration, Models, Performances, Issues, and Recommendations
Alfredo Cuzzocrea, Rim Moussa, and Soror Sahri

Chapter 10  One Platform Rules All: From Hadoop 1.0 to Hadoop 2.0 and Spark
Xiongpai Qin and Keqin Li

Chapter 11  Security, Privacy, and Trust for User-Generated Content: The Challenges and Solutions
Yuhong Liu, Yu Wang, and Nam Ling

Chapter 12  Role of Real-Time Big Data Processing in the Internet of Things
Miyuru Dayarathna, Paul Fremantle, Srinath Perera, and Sriskandarajah Suhothayan

Chapter 13  End-to-End Security Framework for Big Sensing Data Streams
Deepak Puthal, Surya Nepal, Rajiv Ranjan, and Jinjun Chen

Chapter 14  Considerations on the Use of Custom Accelerators for Big Data Analytics
Vito Giovanni Castellana, Antonino Tumeo, Marco Minutoli, Marco Lattuada, and Fabrizio Ferrandi

Chapter 15  Complex Mining from Uncertain Big Data in Distributed Environments: Problems, Definitions, and Two Effective and Efficient Algorithms
Alfredo Cuzzocrea, Carson Kai-Sang Leung, Fan Jiang, and Richard Kyle MacKinnon

Chapter 16  Clustering in Big Data
Min Chen, Simone A. Ludwig, and Keqin Li

Chapter 17  Large Graph Computing Systems
Chengwen Wu, Guangyan Zhang, Keqin Li, and Weimin Zheng

Chapter 18  Big Data in Genomics
Huaming Chen, Jiangning Song, Jun Shen, and Lei Wang

Chapter 19  Maximizing the Return on Investment in Big Data Projects: An Approach Based upon the Incremental Funding of Project Development
Antonio Juarez Alencar, Mauro Penha Bastos, Eber Assis Schmitz, Monica Ferreira da Silva, and Petros Sotirios Stefaneas

Chapter 20  Parallel Data Mining and Applications in Hospital Big Data Processing
Jianguo Chen, Zhuo Tang, Kenli Li, and Keqin Li

Chapter 21  Big Data in the Parking Lot
Ryan Florin, Syedmeysam Abolghasemi, Aida Ghazi Zadeh, and Stephan Olariu

Index

Foreword

Big Data Management and Processing (edited by Li, Jiang, and Zomaya) is a state-of-the-art book that deals with a wide range of topical themes in the field of Big Data. The book, which probes many issues related to this exciting and rapidly growing field, covers processing, management, analytics, and applications.

The many advances in Big Data research that we witness today are brought about because of the many developments we see in algorithms, high-performance computing, databases, data mining, machine learning, and so on. These developments are discussed in this book. The book also showcases some of the interesting applications and technologies that are still evolving and that will lead to some serious breakthroughs in the coming few years.

I believe that Big Data Management and Processing is a very valuable addition to the literature. It will serve as a source of up-to-date research in this continuously developing area. The book also provides an opportunity for researchers to explore the use of advanced computing technologies and their impact on enhancing our capabilities to conduct more sophisticated studies.

I expect that Big Data Management and Processing will be well received by the research and development community. It should prove very beneficial for researchers and graduate students focusing on Big Data and will serve as a very useful reference for practitioners and application developers.

Sartaj Sahni
University of Florida

Preface

The scope of Big Data today spans many aspects and it is not limited to main computing components (e.g., processors, storage devices, and visualization facilities) alone, but it expands into a much larger range of issues related to management and policy. Also, “Big Data” can mean “Big Energy,” because of the pressure that data places on a variety of infrastructures needed to host, manage, and transport data. This in turn raises various monetary, environmental, and system performance concerns.

Recent advances in software and hardware technologies have improved the handling of big data. However, there still remain many issues that are pertinent to the overloading that happens due to the processing of massive amounts of data, which calls for the development of various software and hardware solutions as well as new algorithms that are more capable of processing data.

This book, Big Data Management and Processing, seeks to provide an opportunity for researchers to explore a range of big data-related issues and their impact on the design of new computing systems. The book is quite timely, since the field of big data computing as a whole is undergoing rapid changes on a daily basis. Vast literature exists today on such data processing paradigms and frameworks and their implications for a wide range of distributed platforms.

The book is intended to be a virtual roundtable of several outstanding researchers that one might invite to attend a conference on big data computing systems. Of course, the list of topics that is explored here is by no means exhaustive, but most of the conclusions provided here should be extended to the other computing platforms that are not covered here. There was a decision to limit the number of chapters while providing more pages for contributed authors to express their ideas, so that the book remains manageable within a single volume.

It is also hoped that the topics covered will get the readers to think of the implications of such new ideas on the developments in their own fields. The book endeavors to strike a balance between theoretical and practical coverage of innovative problem-solving techniques for a range of platforms. The book is intended to be a repository of paradigms, technologies, and applications that target the different facets of big data computing systems.

The 21 chapters are carefully selected to provide a wide scope with minimal overlap between the chapters so as to reduce duplications. Each contributor was asked that his/her chapter should cover review material as well as current developments. In addition, the choice of authors was made so as to select authors who are leaders in the respective disciplines.
