HANDBOOK OF RESEARCH FOR BIG DATA Concepts and Techniques HANDBOOK OF RESEARCH FOR BIG DATA Concepts and Techniques Edited by Brojo Kishore Mishra, PhD Vivek Kumar, PhD Sanjaya Kumar Panda, PhD Prayag Tiwari, PhD First edition published 2022 Apple Academic Press Inc. CRC Press 1265 Goldenrod Circle, NE, 6000 Broken Sound Parkway NW, Palm Bay, FL 32905 USA Suite 300, Boca Raton, FL 33487-2742 USA 4164 Lakeshore Road, Burlington, 2 Park Square, Milton Park, ON, L7L 1A4 Canada Abingdon, Oxon, OX14 4RN UK © 2022 by Apple Academic Press, Inc. Apple Academic Press exclusively co-publishes with CRC Press, an imprint of Taylor & Francis Group, LLC Reasonable efforts have been made to publish reliable data and information, but the authors, editors, and publisher cannot assume responsibility for the validity of all materials or the consequences of their use. The authors, editors, and publishers have attempted to trace the copyright holders of all material reproduced in this publication and apologize to copyright holders if permission to publish in this form has not been obtained. If any copyright material has not been acknowledged, please write and let us know so we may rectify in any future reprint. Except as permitted under U.S. Copyright Law, no part of this book may be reprinted, reproduced, transmitted, or utilized in any form by any electronic, mechanical, or other means, now known or hereafter invented, including photocopying, microfilming, and recording, or in any information storage or retrieval system, without written permission from the publishers. For permission to photocopy or use material electronically from this work, access www.copyright.com or contact the Copyright Clearance Center, Inc. (CCC), 222 Rosewood Drive, Danvers, MA 01923, 978-750-8400. For works that are not available on CCC please contact [email protected] Trademark notice: Product or corporate names may be trademarks or registered trademarks and are used only for identification and explanation without intent to infringe. Library and Archives Canada Cataloguing in Publication Title: Handbook of research for big data : concepts and techniques / edited by Brojo Kishore Mishra, PhD, Vivek Kumar, PhD, Sanjaya Kumar Panda, PhD, Prayag Tiwari, PhD. Names: Mishra, Brojo Kishore, 1979- editor. | Kumar, Vivek (Researcher in natural language processing), editor. | Panda, Sanjaya Kumar, editor. | Tiwari, Prayag, editor. Description: First edition. | Includes bibliographical references and index. Identifiers: Canadiana (print) 20210339160 | Canadiana (ebook) 20210339209 | ISBN 9781771889803 (hardcover) | ISBN 9781774639283 (softcover) | ISBN 9781003144526 (ebook) Subjects: LCSH: Big data. Classification: LCC QA76.9.B45 H36 2022 | DDC 005.7—dc23 Library of Congress Cataloging-in-Publication Data Names: Mishra, Brojo Kishore, 1979- editor. Title: Handbook of research for big data : concepts and techniques / edited by Brojo Kishore Mishra, PhD, Vivek Kumar, PhD, Sanjaya Kumar Panda, PhD, Prayag Tiwari, PhD. Description: First edition. | Palm Bay, FL : Apple Academic Press, [2022] | Includes bibliographical references and index. | Summary: "Data has become a valuable asset like never before. Today the challenge is not a shortage of data but the need for techniques and methods capable enough to be able to glean valuable insights from the fast-flowing mass of Big Data. This new volume, Handbook of Research for Big Data: Concepts and Techniques, helps to meet the challenge of managing and using Big Data by presenting new research on various technological advances in the field. The chapters in the book present information on important applications, concepts, and technologies for Big Data in the present industry and market scenario. It looks at research domain issues and their solutions as well as various research case studies, research plans, methodologies, and related data sets for the four Vs: volume, velocity, variety, and veracity. Chapters discuss Big Data in governance, transportation, disaster management, epidemiology, and more. The book covers design and analysis of reconfigurable computing of SoC for IoT, data mining techniques and applications, the use of natural language processing in big data, and more. The volume is a valuable resource for researchers from both academia and industry to learn about and enhance their knowledge and skills in the broad area of big data computing and applications"-- Provided by publisher. Identifiers: LCCN 2021047270 (print) | LCCN 2021047271 (ebook) | ISBN 9781771889803 (hardback) | ISBN 9781774639283 (paperback) | ISBN 9781003144526 (ebook) Subjects: LCSH: Big data. Classification: LCC QA76.9.B45 H3648 2022 (print) | LCC QA76.9.B45 (ebook) | DDC 005.7--dc23/eng/20211007 LC record available at https://lccn.loc.gov/2021047270 LC ebook record available at https://lccn.loc.gov/2021047271 ISBN: 978-1-77188-980-3 (hbk) ISBN: 978-1-77463-928-3 (pbk) ISBN: 978-1-00314-452-6 (ebk) About the Editors Brojo Kishore Mishra, PhD Professor, Computer Science and Engineering Department, GIET University, Gunupur, Odisha, India Brojo Kishore Mishra, PhD, is Professor in the Computer Science and Engineering Department at the Gandhi Institute of Engineering and Technology University (GIET), Gunupur, Odisha, India. He has published more than 30 research papers in national and international conference proceedings, over 25 research papers in peer-reviewed journals, and over 20 book chapters, and has authored two books and edited three books to date. His research interests include data mining and big data analysis, machine learning, soft computing, and evolutionary computation. He received his PhD degree in Computer Science from the Berhampur University, Brahmapur, Odisha, India. Vivek Kumar, PhD Marie Sklodowska-Curie Researcher, University of Cagliari, Italy & Philips Research, The Netherlands Vivek Kumar, PhD, is a researcher in the NLP field. He formerly worked as a Research Engineer for a SHiP (search for hidden particles) project of CERN extended with NUST-MISIS. He has also rendered his services to the Embassy of India in Moscow to the education and defense sector for strengthening Indo-Russian bilateral relations. He is a member of the Defense and Security Software Engineers Association, Italy. He has authored several publications and is also a reviewer, editor, and TP member of several conferences and journals of IEEE, ACM, Springer, Elsevier, MDPI, and IGI-Global. His research interests include machine learning, deep learning, natural language processing, and sentiment analysis applied in the healthcare domain. He received his MS degree from NUST-MiSIS, Russian Federation, in 2007.1 vi About the Editors Sanjaya Kumar Panda, PhD Assistant Professor and Head, Department of Computer Science & Engineering, Indian Institute of Information Technology, Design and Manufacturing, Kurnool, Andhra Pradesh, India Sanjaya Kumar Panda, PhD, is Assistant Professor and Head of the Department of Computer Science & Engineering at the Indian Institute of Information Technology, Design and Manufacturing, Kurnool, Andhra Pradesh, India. He formerly worked as Assistant Professor in the Department of IT at VSSUT, Burla, Odisha, India. He received a PhD degree from IIT (ISM) Dhanbad, Jharkhand, India; his MTech degree from NIT, Rourkela, Odisha, India; and a BTech degree from VSSUT, Burla, Odisha, India in CSE. He received two silver medal awards for the best graduate and best postgraduate in CSE. Other awards include an institution award, IEEE brand ambassador designation, and SGSITS national award for the best research work by a young teacher of engineering college for the year 2017. He was also a faculty with maximum publishing in CSI publications award, young IT professional award (2017 and 2016), young scientist award, CSI paper presenter award at an international conference, and CSI distinguished speaker award. He has published more than 60 papers in reputed journals and conferences. He is a member of IEEE, an associate member of IEI, a life member of ISTE, and a life member of CSI, IAENG, IACSIT, UACEE, ACEEE, and SDIWC. His current research interests include recommender systems, cloud computing, big data analytics, grid computing, fault tolerance, and load balancing. He has delivered several invited talks and has chaired sessions at many national and international conferences and workshops. He acted as a reviewer for many reputed journals. He also acted as guest editor in many international journals. Prayag Tiwari, PhD Marie Sklodowska Curie Researcher, University of Padova, Italy Prayag Tiwari, PhD, is currently Marie Sklodowska Curie Researcher with the University of Padua, Italy. He was previously a Research Assistant with NUST MISIS, and he has had teaching and industrial work experience. He has several About the Editors vii publications in journals, book series, and conferences of IEEE, ACM, Springer, Elsevier, MDPI, Taylor and Francis, and IGI-Global. His research interests include machine learning, deep learning, quantum-inspired machine learning, and information retrieval. He received his MSc degree from NUST MISIS, Moscow. He is currently pursuing a PhD degree at the University of Padova, Italy. Contents Contributors .............................................................................................................xi Abbreviations .........................................................................................................xiii Preface ..................................................................................................................xvii 1. Big Data in Governance in India: Case Studies ...........................................1 Mukesh Choubisa 2. Design and Analysis of Reconfigurable Computing of SoC for IoT Applications ..............................................................................29 Ipseeta Nanda and Nibedita Adikari 3. A Review of Different Data Mining Techniques Used in Big Data Applications ...................................................................................59 Chandrakanta Mahanty, Devpriya Panda, and Brojo Kishore Mishra 4. Big Data Applications in Transportation Systems Using the Internet of Things .........................................................................................91 Sunil Kumar Gautam, Riddhi B. Prajapati, and Hari Om 5. Overview of Big Data and Natural Language Processing: A Powerful Combination for Research .....................................................113 Bishwa Ranjan Das and Brojo Kishore Mishra 6. An Insight into Big Data and Its Pertinence .............................................137 Lopamudra Hota and Prasant Kumar Dash 7. Big Data Science: Models and Approaches, Characteristics, Challenges, and Applications .....................................................................157 Riyanshi Gupta, Kartik Krishna Bhardwaj, and Deepak Kumar Sharma 8. Conceptual Frameworks for Big Data Visualization: Discussion on Models, Methods, and Artificial Intelligence for Graphical Representations of Data ...........................................................197 Cherilyn Conner, Jim Samuel, Myles Garvey, Yana Samuel, and Andrey Kretinin 9. Role of Machine Learning in Big Data Peregrination .............................235 Aradhana Behura and Sanjaya Kumar Panda 10. Artificial Neural Networks: Fundamentals, Design, and Applications .... 277 Nishant Kashyap, Anjana Mishra, and Brojo Kishore Mishra