Yunjun Gao Kyuseok Shim Zhiming Ding Peiquan Jin Zujie Ren Yingyuan Xiao An Liu Shaojie Qiao (Eds.) 1 0 9 Web-Age 7 S C Information Management N L WAIM 2013 International Workshops: HardBD, MDSP, BigEM, TMSN, LQPM, BDMS Beidaihe, China, June 2013, Proceedings 123 Lecture Notes in Computer Science 7901 CommencedPublicationin1973 FoundingandFormerSeriesEditors: GerhardGoos,JurisHartmanis,andJanvanLeeuwen EditorialBoard DavidHutchison LancasterUniversity,UK TakeoKanade CarnegieMellonUniversity,Pittsburgh,PA,USA JosefKittler UniversityofSurrey,Guildford,UK JonM.Kleinberg CornellUniversity,Ithaca,NY,USA AlfredKobsa UniversityofCalifornia,Irvine,CA,USA FriedemannMattern ETHZurich,Switzerland JohnC.Mitchell StanfordUniversity,CA,USA MoniNaor WeizmannInstituteofScience,Rehovot,Israel OscarNierstrasz UniversityofBern,Switzerland C.PanduRangan IndianInstituteofTechnology,Madras,India BernhardSteffen TUDortmundUniversity,Germany MadhuSudan MicrosoftResearch,Cambridge,MA,USA DemetriTerzopoulos UniversityofCalifornia,LosAngeles,CA,USA DougTygar UniversityofCalifornia,Berkeley,CA,USA GerhardWeikum MaxPlanckInstituteforInformatics,Saarbruecken,Germany Yunjun Gao Kyuseok Shim Zhiming Ding Peiquan Jin Zujie Ren Yingyuan Xiao An Liu Shaojie Qiao (Eds.) Web-Age Information Management WAIM 2013 International Workshops: HardBD, MDSP, BigEM, TMSN, LQPM, BDMS Beidaihe, China, June 14-16, 2013 Proceedings 1 3 VolumeEditors YunjunGao,ZhejiangUniversity,Hangzhou,China E-mail:[email protected] KyuseokShim,SeoulNationalUniversity,Korea E-mail:[email protected] ZhimingDing,ChineseAcademyofSciences,Beijing,China E-mail:[email protected] PeiquanJin,UniversityofScienceandTechnologyofChina,Hefei,China E-mail:[email protected] ZujieRen,HangzhouDianziUniversity,China E-mail:[email protected] YingyuanXiao,TianjinUniversityofTechnology,China E-mail:[email protected] AnLiu,UniversityofScienceandTechnologyofChina,Hefei,China E-mail:[email protected] ShaojieQiao,SouthwestJiaotongUniversity,Chengdu,China E-mail:[email protected] ISSN0302-9743 e-ISSN1611-3349 ISBN978-3-642-39526-0 e-ISBN978-3-642-39527-7 DOI10.1007/978-3-642-39527-7 SpringerHeidelbergDordrechtLondonNewYork LibraryofCongressControlNumber:2013942401 CRSubjectClassification(1998):H.3,H.4,H.2.8,H.2.4,C.2.1,E.1,F.2.2 LNCSSublibrary:SL3–InformationSystemsandApplication,incl.Internet/Web andHCI ©Springer-VerlagBerlinHeidelberg2013 Thisworkissubjecttocopyright.AllrightsarereservedbythePublisher,whetherthewholeorpartof thematerialisconcerned,specificallytherightsoftranslation,reprinting,reuseofillustrations,recitation, broadcasting,reproductiononmicrofilmsorinanyotherphysicalway,andtransmissionorinformation storageandretrieval,electronicadaptation,computersoftware,orbysimilarordissimilarmethodology nowknownorhereafterdeveloped.Exemptedfromthislegalreservationarebriefexcerptsinconnection withreviewsorscholarlyanalysisormaterialsuppliedspecificallyforthepurposeofbeingenteredand executedonacomputersystem,forexclusiveusebythepurchaserofthework.Duplicationofthispublication orpartsthereofispermittedonlyundertheprovisionsoftheCopyrightLawofthePublisher’slocation, initscurrentversion,andpermissionforusemustalwaysbeobtainedfromSpringer.Permissionsforuse maybeobtainedthroughRightsLinkattheCopyrightClearanceCenter.Violationsareliabletoprosecution undertherespectiveCopyrightLaw. Theuseofgeneraldescriptivenames,registerednames,trademarks,servicemarks,etc.inthispublication doesnotimply,evenintheabsenceofaspecificstatement,thatsuchnamesareexemptfromtherelevant protectivelawsandregulationsandthereforefreeforgeneraluse. Whiletheadviceandinformationinthisbookarebelievedtobetrueandaccurateatthedateofpublication, neithertheauthorsnortheeditorsnorthepublishercanacceptanylegalresponsibilityforanyerrorsor omissionsthatmaybemade.Thepublishermakesnowarranty,expressorimplied,withrespecttothe materialcontainedherein. Typesetting:Camera-readybyauthor,dataconversionbyScientificPublishingServices,Chennai,India Printedonacid-freepaper SpringerispartofSpringerScience+BusinessMedia(www.springer.com) Preface Web-AgeInformationManagement(WAIM)isaleadinginternationalconference for researchers,practitioners, developers, and users to share and exchange their cutting-edgeideas,results,experiences,techniques,andtoolsinconnectionwith all aspects of Web data management. The conference invites original research papers on the theory, design, and implementation of Web-based information systems. As the 14th event in the increasingly popular series, WAIM 2013 was held in Beidaihe, China, during June 14–16, 2013. Along with the main conference, WAIM workshops intend to provide inter- national groups of researchers with a forum for the discussion and exchange of researchresults contributing to the main themes of the WAIM conference. This WAIM2013workshopvolumecontainsthe papersacceptedforthe followingsix workshopsthatwereheldinconjunctionwithWAIM2013.Thesesixworkshops wereselectedafterapubliccall-for-proposalsprocess,eachofwhichfocusesona specific area that contributes to the main themes of the WAIM conference. The six workshops were as follows: – The International Workshop on Big Data Management on Emerging Hard- ware (HardBD 2013) – TheSecondInternationalWorkshoponMassiveDataStorageandProcessing (MDSP 2013) – The First International Workshop on Emergency Management in Big Data Age (BigEM 2013) – TheInternationalWorkshoponTrajectoryMininginSocialNetworks(TMSN 2013) – The First International Workshop on Location-Based Query Processing in Mobile Environments (LQPM 2013) – The First International Workshop on Big Data Management and Service (BDMS 2013) All the organizersof the previous WAIM conferences and workshopshave made WAIMavaluabletrademark,andweareproudtocontinuetheirwork.Wewould like to express our thanks and acknowledgments to all the workshop organizers and Program Committee members who contributed to making the workshop programsuch a success. They put a tremendous amount of effort into soliciting and selecting researchpapers with a balance of high quality, novelty, and appli- cations. They also followeda rigorousreview process. A total of 37 papers were accepted. Last but not least, we are grateful to the main conference organiz- ers and the local Organizing Committee for their great support and wonderful arrangements. Yunjun Gao Kyuseok Shim HardBD 2013 Workshop Organizers’ Message Data properties and hardware characteristics are two key aspects for efficient data management. A clear trend in the first aspect, data properties, is the in- creasing demand to manage and process Big Data, characterized by the fast evolution of “Big Data Systems,” where nearly every aspect of both enterprise and consumer services is being driven by data processing and analysis. Exam- ples of big data systems include NoSQL storage systems, Hadoop/MapReduce, data analytics platforms, search and indexing platforms, and messaging infras- tructures. These systems address needs for structured and unstructured data across a wide spectrum of domains such as the Web, social networks, enter- prise,cloud,mobile,sensornetworks,multimedia/streaming,cyber-physicaland high-performancesystems,andformultipleapplicationareassuchashealthcare, transportation, and scientific computing. Atthesametime,hardwarecharacteristicsareundergoingrapidchanges,im- posing new challenges for an efficient utilization of hardware resources. Recent trendsincludestorage-classmemory,massivemulti-coreprocessingsystems,very large main memory systems, fast networking components, big computing clus- ters, and large data centers that consume massive amounts of energy. It is clear that many aspects of data management need to evolve with these trends. Uti- lizing new hardware technologies for efficient big data management is of urgent importance. The First International Workshop on Big Data Management over Emerging Hardware(HardBD2013)washeldonJune14,2013,atBeidaiheinconjunction with The 14th International Conference on Web-Age Information Management (WAIM2013).Theoverallgoalofthe workshopistobringtogetherresearchers, practitioners,systemadministrators,systemprogrammers,andothersinterested in sharing and presenting their perspectives on the effective management of big data over new hardware platforms, and also to discuss and identify future directions and challenges in this area. The workshopattractedsix submissions. All submissions were peer reviewed by at least three Program Committee members to ensure that high-quality pa- perswereselected.Onthebasisofthereviews,theProgramCommitteeselected three papers for inclusion in the workshop proceedings (acceptation rate 50%). The final program of the workshop also consisted of three invited talks. One of them was from Alibaba, presented by Zhenkun Yang, and the other two were from academia, presented by Changsheng Xie (Huazhong University of Science and Technology) and Nong Xiao (National University of Defense Technology). VIII HardBD 2013 Workshop Organizers’ Message The Program Committee of the workshop consisted of 15 experienced re- searchers and experts. We would like to thank the valuable contributions of all theProgramCommitteemembersduringthepeerreviewprocess.Also,wewould like to acknowledgethe WAIM 2013WorkshopChairs for their greatsupport of HardBD 2013, and the support from the Natural Science Foundation of China (No.60833005). Xiaofeng Meng Theo H¨arder Peiquan Jin Binsheng He HardBD 2013 Workshop Organization General Co-chairs Xiaofeng Meng Renmin University of China, China Theo H¨arder Technical University of Kaiserslautern, Germany Program Co-chairs Peiquan Jin UniversityofScience andTechnologyofChina, China Binsheng He Nanyang Technological University, Singapore Publicity Chair Yi Ou Technical University of Kaiserslautern, Germany Program Committee Bin Cui Peking University, China Bin He IBM Almaden Research, USA Sang-Wook Kim Hanyang University, Korea Ioannis Koltsidas IBM Research - Zurich, Switzerland Ziyu Lin Xia’Men University, China Yi Ou TU Kaiserslautern, Germany Ilia Petrov Reutlingen University, Germany Vijayan Prabhakaran Microsoft Research, USA Jianliang Xu Hong Kong Baptist University, SAR China MDSP 2013 Workshop Organizers’ Message On behalf of the Program Chairs for MDSP 2013, consisting of two General Co-chairs and two Program Co-chairs, we are pleased to present you with this volume. It contains the papers accepted for presentation in the workshop pro- gramofthe14thInternationalConferenceonWeb-AgeInformationManagement held in Beidaihe, China, during June 14–16, 2013. This was the second International Conference on Massive Data Storage and Processing (MDSP). In all, 21 papers were submitted to the MDSP program, from which eight were accepted for presentationand inclusion in the conference proceedings. An acceptance rate of 40% makes MDSP one of the most selective workshops of WAIM 2013. We would like to thank all the authors of submitted papers for choosing MDSP 2013 for the presentation of their research results. Because of the high quality of the submitted papers, selecting the eight papers for the main confer- encewasaverydifficulttask.WearedeeplyindebtedtothefourprogramChairs and16ProgramCommitteemembersfortheirconscientiousandimpartialjudg- ment and for the time and effort they contributed in preparation of this year’s conference. All Area Chairs and reviewers are listed on the following pages. Theorganizersoftheconferenceareveryhappywiththeresponsetoourcall for papers, noting the interest of the data storage and processing community in this field.The workshopwascomposedofeightpapersselectedforpresentation, covering a wide range of topics and showing interesting experiences. A brief summary of all the contributions, classified in three main areas, is presented below. • “Adaptive Sequential Prefetching for Multiple Streams” submitted by Yong Li, Dan Feng, ZhanShi, andQing Liu fromHuazhong University of Science and Technology. The authors presented an adaptive sequential prefetching algorithmcalledASPM,solvingtheun-fairnessandperformancedegradation introduced by streams with diverse access rates. • “Research of Network Coding Data Collection Mechanism Based on The Rough Routing in Wireless Multi-hop Network” submitted by Jian Wan, Ligang He, and Wei Zhang et al. from Hangzhou Dianzi University. The authors proposed a rough routing-based data collection mechanism used in wirelesssensornetworkcalledBRRCD,whichaimstorestrainthecliffeffect to some extent. • “Incremental Truth Discovery for Information from Multiple Data Sources submitted by Li Jia, Hongzhi Wang, Jianzhong Li” and Hong Gao from HarbinInstitute ofTechnology.Theauthorspresentedanincrementalstrat- egy for discovering truth in multisource integration using boosting-like en- semble classifiers. XII MDSP 2013 Workshop Organizers’ Message • “Simdedup:ANewDeduplicationSchemeBasedonSimhash”submittedby WenbinYaoandPengdiYefromBeijingUniversityofPostsandTelecommu- nications. The authors presented a near-exact deduplication scheme named Simdedup, which exploits file similarity and chunk locality to improve the accuracy of deduplication. • “InfoMall: A Large-Scale Storage System for Web Archiving” submitted by Lian’en Huang, Jinping Li, and Xiaoming Li from Peking University Shen- zhen Graduate School. The authors proposed a system designed for storing massive Web pages effectively and efficiently. • “Sequential Record Based Compression for Massive Image Storage in Database1” submitted by Ziyun Ma, Xiaonian Wang, and Ping Jiang et al. fromTongjiUniversity.Theauthorsproposedanimagecompressionscheme thatislearntfromvideocompressiontoremovetemporalandspatialredun- dancy in the image sequence. • “Continuous,OnlineAnomaly RegionDetection andTrackinginNetworks” submitted by Shuiyuan Xie, Xiuli Ma, and Shiwei Tang from Peking Uni- versity. The authors presented a frameworkto detect and track an anomaly region continuously. • “Event Matching Algorithm to Collaborate Sensor Network and the Cloud through a Middleware Service” submitted by Mohammad Hasmat Ullah, Sung-Soon Par, and Gyeong Hun Kim from the Department of Computer Science and Engineering,Anyang University, South Koreaand Gluesys Co., Ltd. The authors proposed a content-based event-matching algorithm to analyzesubscriptionsandmatchpropercontenteasilytoconveyWSN-driven data to the subscribers. Wewouldliketothankeveryonewhohelpedus.Wegreatlyappreciatetheadvice and support by the WAIM 2013 General Co-chairs, Xiaofeng Meng (Renmin University of China) and Huan Liu (Arizona State University, USA), Program Co-chairs,JianyongWang(TsinghuaUniversity,China)andHuiXiong(Rutgers University, USA), and WorkshopsChairs Yunjun Gao(Zhejiang University)and Kyuseok Shim (Seoul National University, South Korea). Weisong Shi Yunjun Gao Weiping Wan Zujie Ren