ebook img

Recent Developments in Machine Learning and Data Analytics: IC3 2018 PDF

525 Pages·2019·22.779 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Recent Developments in Machine Learning and Data Analytics: IC3 2018

Advances in Intelligent Systems and Computing 740 Jugal Kalita · Valentina Emilia Balas  Samarjeet Borah · Ratika Pradhan Editors Recent Developments in Machine Learning and Data Analytics IC3 2018 Advances in Intelligent Systems and Computing Volume 740 Series editor Janusz Kacprzyk, Polish Academy of Sciences, Warsaw, Poland e-mail: [email protected] The series “Advances in Intelligent Systems and Computing” contains publications on theory, applications, and design methods of Intelligent Systems and Intelligent Computing. Virtually all disciplines such as engineering, natural sciences, computer and information science, ICT, economics, business,e-commerce,environment,healthcare,lifesciencearecovered.Thelistoftopicsspansallthe areasofmodernintelligentsystemsandcomputingsuchas:computationalintelligence,softcomputing includingneuralnetworks,fuzzysystems,evolutionarycomputingandthefusionoftheseparadigms, social intelligence, ambient intelligence, computational neuroscience, artificial life, virtual worlds and society, cognitive science and systems, Perception and Vision, DNA and immune based systems, self-organizing and adaptive systems, e-Learning and teaching, human-centered and human-centric computing, recommender systems, intelligent control, robotics and mechatronics including human-machine teaming,knowledge-based paradigms,learning paradigms,machineethics,intelligent data analysis, knowledge management, intelligent agents, intelligent decision making and support, intelligentnetworksecurity,trustmanagement,interactiveentertainment,Webintelligenceandmultimedia. Thepublicationswithin“AdvancesinIntelligentSystemsandComputing”areprimarilyproceedings ofimportantconferences,symposiaandcongresses.Theycoversignificantrecentdevelopmentsinthe field,bothofafoundationalandapplicablecharacter.Animportantcharacteristicfeatureoftheseriesis theshortpublicationtimeandworld-widedistribution.Thispermitsarapidandbroaddisseminationof researchresults. AdvisoryBoard Chairman NikhilR.Pal,IndianStatisticalInstitute,Kolkata,India e-mail:[email protected] Members RafaelBelloPerez,UniversidadCentral“MartaAbreu”deLasVillas,SantaClara,Cuba e-mail:[email protected] EmilioS.Corchado,UniversityofSalamanca,Salamanca,Spain e-mail:[email protected] HaniHagras,UniversityofEssex,Colchester,UK e-mail:[email protected] LászlóT.Kóczy,SzéchenyiIstvánUniversity,Győr,Hungary e-mail:[email protected] VladikKreinovich,UniversityofTexasatElPaso,ElPaso,USA e-mail:[email protected] Chin-TengLin,NationalChiaoTungUniversity,Hsinchu,Taiwan e-mail:[email protected] JieLu,UniversityofTechnology,Sydney,Australia e-mail:[email protected] PatriciaMelin,TijuanaInstituteofTechnology,Tijuana,Mexico e-mail:[email protected] NadiaNedjah,StateUniversityofRiodeJaneiro,RiodeJaneiro,Brazil e-mail:[email protected] NgocThanhNguyen,WroclawUniversityofTechnology,Wroclaw,Poland e-mail:[email protected] JunWang,TheChineseUniversityofHongKong,Shatin,HongKong e-mail:[email protected] Moreinformationaboutthisseriesathttp://www.springer.com/series/11156 Jugal Kalita Valentina Emilia Balas (cid:129) Samarjeet Borah Ratika Pradhan (cid:129) Editors Recent Developments in Machine Learning and Data Analytics IC3 2018 123 Editors Jugal Kalita Samarjeet Borah CollegeofEngineeringandAppliedScience Department ofComputer Applications University of Colorado ColoradoSprings Sikkim Manipal University ColoradoSprings, CO,USA Sikkim,India Valentina EmiliaBalas Ratika Pradhan Automation andApplied Informatics Department ofComputer Applications AurelVlaicu University of Arad Sikkim Manipal University Arad,Romania Sikkim,India ISSN 2194-5357 ISSN 2194-5365 (electronic) Advances in Intelligent Systems andComputing ISBN978-981-13-1279-3 ISBN978-981-13-1280-9 (eBook) https://doi.org/10.1007/978-981-13-1280-9 LibraryofCongressControlNumber:2018945899 ©SpringerNatureSingaporePteLtd.2019 Thisworkissubjecttocopyright.AllrightsarereservedbythePublisher,whetherthewholeorpart of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission orinformationstorageandretrieval,electronicadaptation,computersoftware,orbysimilarordissimilar methodologynowknownorhereafterdeveloped. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publicationdoesnotimply,evenintheabsenceofaspecificstatement,thatsuchnamesareexemptfrom therelevantprotectivelawsandregulationsandthereforefreeforgeneraluse. The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authorsortheeditorsgiveawarranty,expressorimplied,withrespecttothematerialcontainedhereinor for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictionalclaimsinpublishedmapsandinstitutionalaffiliations. ThisSpringerimprintispublishedbytheregisteredcompanySpringerNatureSingaporePteLtd. Theregisteredcompanyaddressis:152BeachRoad,#21-01/04GatewayEast,Singapore189721, Singapore Preface Recent Developments in Machine Learning and Data Analytics is a collection of research findings of the Second International Conference on Computing and Communication. The conference is centered upon the theme of machine learning and data analytics. The works incorporated in this volume can roughly be divided into three parts, namely data analytics, natural language processing, and soft computing. The fol- lowing section contains a brief information about the various contributions to this volume. In the first paper, Aski et al. provide an architectural overview of IoT-enabled ubiquitous healthcare data acquisition and monitoring system for personal and medical usage powered by cloud application. The next one is also an IoT-based paper where Ishita Chakraborty, Anannya Chakraborty, and Prodipto Das discuss sensor selection and data fusion approach for IoT applications. An overview of HadoopMapReduce,Spark,andscalablegraphprocessingarchitectureisprovided by Talan et al. in their paper. On the other hand, Hore et al. discuss a machine intelligence-based approach to analyze social trend toward girl child in India. Analyzing class-imbalanced data is found to be a difficult task always. In the next paper, an improvement in boosting method for class-imbalanced datasets is dis- cussed by Kumar et al. In their paper, Ambika Choudhury and Deepak Gupta provide a survey on medical diagnosis of diabetes using machine learning tech- niques.Anotherclassification-relatedissueondiabetesdataispresentedbySantosh Kumar Majhi in his research work. Findings on the usefulness of big data tech- nologies for statistical homicide dataset are discussed by Askew et al. The next research work discusses a journal recommendation system through content-based filtering approach. Another content-based filtering and collaborative filtering technique for movie recommendation is provided by Bharti et al in their research work. The next 13 contributions are from the domain of natural language processing. The first work of such kind discusses a word-sense disambiguation for Assamese language. In addition to this, other two works are found for Assamese language where Choudhury et al. present a context-sensitive spelling checker for Assamese v vi Preface language and Mirzanur Rahman and Shikhar Kumar Sarma discuss a hybrid approachtoanalyzethemorphologyofanAssameseword.Thenextworkpresents an aptitude question paper generator and answer verification system. Ghosh et al. discuss affinity maturation of homophones in a word-level speech recognition in their work. This follows a discussion on feature map reduction in CNN for hand- written digit recognition. Multilingual text localization from camera-captured images is presented by Dutta et al. The technique is based on foreground homo- geneity analysis. Jajoo et al. propose script identification from camera-captured multiscriptscenetextcomponentsintheirresearchwork.Again,Khanetal.present adistancetransform-basedstrokefeaturedescriptorfortext–non-textclassification. The volume also includes a contribution on emotion mining. This follows two worksonNepalilanguage,whereThapaetal.discussafingerspellingrecognition for Nepali sign language and Yajnik et al. present a work on parsing in Nepali language. Anumberofcontributionsarefoundwhichcanroughlybecategorizedunderthe domain of soft computing. Mishra et al. discuss a BFS-NB hybrid model in intrusion detection system, whereas Saikia et al. propose an effective alert corre- lation method in their research work. An application of ensemble random forest classifier for detecting financial statement manipulation of listed Indian companies is discussed by Hiral Patel and co-authors. Another security-related paper is dis- cussedondynamicshiftinggeneticnon-adjacentformellipticcurveDiffie–Hellman key exchange procedure for IoT heterogeneous network. This follows few classification-related works, where Vijaya et al. discuss fuzzy clustering with ensemble classification techniques to improve the customer churn prediction in telecommunication sector and Ahmed et al. propose a technique to remove the bottleneck of FP tree. Additional works include elephant herding algorithm, improved K-NN algorithm through class discernibility and cohesiveness, a reduction-level fusion of PCA and random projection for high-dimensional data, a stable clustering algorithm for mobile ad hoc networks (MANETs), and interval-valued complex fuzzy concept lattice and its granular decomposition. Umesh Gupta and Deepak Gupta discuss their findings on twin-bounded support vectormachinebasedonL2-norm.Aworktoperformnaturalscenelabelingusing neuralnetworksispresentedbyDasetal.Kalvapallietalpresenttheirfindingsona city-scale transportation system using XGBoost. A selfish controlled scheme in an opportunistic mobile communication network is presented by Moirangthem Tiken Singh and Surajit Borkotokey. Few quality works on image processing techniques are also included in this volume. The first part of these papers discusses a fusion-based underwater image enhancement using weight map techniques. The next work proposes an algorithm for automatic segmentation of pancreas histo- logical images for glucose intolerance identification. An edge detection technique using ACO with PSO for the noisy image is discussed by Aditya Gautam and Mantosh Biswas in their research work. Another work discusses improved con- volutional neural networks for hyperspectral image classification. Mohanraj et al. present a neural network-based approach for face recognition. A method on auto- matedvisioninspectionsystemforcylindrical metallic componentsisproposedby Preface vii Govindaraj and co-authors in their research work. The volume also includes a research work on gene selection of microarray datasets. A case study on geo-statistical modeling of remote sensing data for forest carbon estimation is presented by Kumar et al. Finally, it includes a study of DC–DC converters with MPPT for stand-alone solar water pumping. IC32018representsaglobalforumforresearchoncomputationalapproachesto learning. It includes mostly the current works and research findings from various researchlaboratories,universities,andinstitutionsandmayleadtothedevelopment ofmarket-demandedproducts.Theworksreportsubstantiveresultsonawiderange of learning methods applied to a variety of learning problems. It provides solid support via empirical studies, theoretical analysis, or comparison to psychological phenomena.Thevolumeincludesworkstoshowhowtoapplylearningmethodsto solve important application problems as well as how machine learning research is conducted. The volume editorsare very thankful to all theauthors, contributors, reviewers, and the publisher for making this effort a successful one. Colorado Springs, USA Jugal Kalita Arad, Romania Valentina Emilia Balas Sikkim, India Samarjeet Borah Sikkim, India Ratika Pradhan Contents IoTEnabledUbiquitousHealthcareDataAcquisitionandMonitoring System for Personal and Medical Usage Powered by Cloud Application: An Architectural Overview . . . . . . . . . . . . . . . . . . . . . . . . 1 Vidhaydhar J. Aski, Shubham Sanjay Sonawane and Ujjwal Soni Sensor Selection and Data Fusion Approach for IoT Applications. . . . . 17 Ishita Chakraborty, Anannya Chakraborty and Prodipto Das An Overview of Hadoop MapReduce, Spark, and Scalable Graph Processing Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35 Pooja P. Talan, Kartik U. Sharma, Pratiksha P. Nawade and Karishma P. Talan Analyzing Social Trend Towards Girl Child in India: A Machine Intelligence-Based Approach . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43 Sirshendu Hore and Tanmay Bhattacharya Improvement in Boosting Method by Using RUSTBoost Technique for Class Imbalanced Data. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51 Ashutosh Kumar, Roshan Bharti, Deepak Gupta and Anish Kumar Saha A Survey on Medical Diagnosis of Diabetes Using Machine Learning Techniques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67 Ambika Choudhury and Deepak Gupta How Effective Is the Moth-Flame Optimization in Diabetes Data Classification . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79 Santosh Kumar Majhi Evaluating Big Data Technologies for Statistical Homicide Dataset . . . . 89 Roland Askew, Sreenivas Sremath Tirumala and G. Anjan Babu Journal Recommendation System Using Content-Based Filtering . . . . . 99 Sonal Jain, Harshita Khangarot and Shivank Singh ix x Contents Recommending Top N Movies Using Content-Based Filtering and Collaborative Filtering with Hadoop and Hive Framework . . . . . . 109 Roshan Bharti and Deepak Gupta WSD for Assamese Language . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119 Pranjal Protim Borah, Gitimoni Talukdar and Arup Baruah Aptitude Question Paper Generator and Answer Verification System. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129 Meghna Saikia, Saini Chakraborty, Suranjan Barman and Sarat Kr. Chettri Affinity Maturation of Homophones in Word-Level Speech Recognition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137 P. Ghosh, T. S. Chingtham and M. K. Ghose Feature Map Reduction in CNN for Handwritten Digit Recognition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143 Sinjan Chakraborty, Sayantan Paul, Ram Sarkar and Mita Nasipuri Multi-lingual Text Localization from Camera Captured Images Based on Foreground Homogenity Analysis. . . . . . . . . . . . . . . . . . . . . . 149 Indra Narayan Dutta, Neelotpal Chakraborty, Ayatullah Faruk Mollah, Subhadip Basu and Ram Sarkar Script Identification from Camera-Captured Multi-script Scene Text Components . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 159 Madhuram Jajoo, Neelotpal Chakraborty, Ayatullah Faruk Mollah, Subhadip Basu and Ram Sarkar Implementation of BFS-NB Hybrid Model in Intrusion Detection System . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167 Sushruta Mishra, Chandrakanta Mahanty, Shreela Dash and Brojo Kishore Mishra Context-Sensitive Spelling Checker for Assamese Language . . . . . . . . . 177 Ranjan Choudhury, Nabamita Deb and Kishore Kashyap Distance Transform-Based Stroke Feature Descriptor for Text Non-text Classification . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 189 Tauseef Khan and Ayatullah Faruk Mollah A Hybrid Approach to Analyze the Morphology of an Assamese Word. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 201 Mirzanur Rahman and Shikhar Kumar Sarma Netizen’s Perspective on a Recent Scam in India—An Emotion Mining Approach. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 211 Aviroop Mukherjee, Agnivo Ghosh and Tanmay Bhattacharya

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.