ebook img

Big Data Governance: Modern Data Management Principles for Hadoop, NoSQL & Big Data Analytics PDF

174 Pages·2015·2.77 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Big Data Governance: Modern Data Management Principles for Hadoop, NoSQL & Big Data Analytics

BIG DATA GOVERNANCE: Modern Data Management Principles for Hadoop, NoSQL & Big Data Analytics Peter K. Ghavami, PhD Copyright © 2016 Peter Ghavami All rights reserved. ISBN: 1519559720 ISBN-13: 978-1519559722 BIG DATA GOVERNANCE: Modern Data Management Principles for Hadoop, NoSQL & Big Data Analytics Peter K. Ghavami, PhD Peter K. Ghavami, Ph.D. [email protected] Copyright © 2016 Peter K. Ghavami, Washington, D.C. All rights reserved. This publication is protected by copyright, and permission must be obtained from the copyright holder prior to any prohibited reproduction, storage in a retrieval system, or transmission in any form or by any means, electronic, mechanical, photocopying, recording or likewise. For information regarding permissions, write to or email to: Peter K. Ghavami, Ph.D. at [email protected] The author and publisher have taken care in preparations of this book, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for the incidental or consequential damages in connection with or arising out of the use of the information or designs contained herein. Keywords: 1. Big Data Analytics, 2. Data governance, 3. Hadoop, 4. Data Security, 5. Data Management, 6. Data Lifecycle Management Copyright © 2016 Peter Ghavami All rights reserved. ISBN: 1519559720 ISBN-13: 978-1519559722 BIG DATA GOVERNANCE Modern Data Management Principles for Hadoop, NoSQL & Big Data Analytics Peter K. Ghavami, PhD First Edition 2016 Acknowledgements This book was only possible as a result of my collaboration with many world renowned data scientists, researchers, CIOs and leading technology innovators who have taught me a tremendous deal about scientific research, innovation and more importantly about the value of collaboration. To all of them I owe a huge debt of gratitude. Peter Ghavami January 2016 To my beautiful wife Massi, whose unwavering love and support make these accomplishments possible and worth pursuing. CONTENTS INTRODUCTION Purpose SECTION I: INTRODUCTION TO BIG DATA Introduction to Big Data The Three Dimensions of Analytics The Distinction between BI and Analytics Analytics Platform Framework Data Connection layer Data Management layer Analytics Layer Presentation Layer The Diverse Applications of Big Data Data Management Body of Knowledge (DMBOK) DMBOK2: What is it? Eight Reasons for DMBOK2 Data Maturity Model (DMM) SECTION II: BIG DATA GOVERNANCE FUNDAMENTALS Introduction Top 10 Data Breaches What is Governance? Why Big Data Governance? Data Steward Responsibilities Corporate Governance Big Data Governance Certifications Case for Big Data Governance Strategic Data Governance, or Tactical Data Governance? TOGAF View of Data Governance Data Lake vs. Data Warehouse History of Hadoop Hadoop Overview HDFS Overview MapReduce Security Tools for Hadoop The Components of Big Data Governance Myths about Big Data & Hadoop Lake Enterprise Data Governance Directive: Data Governance is More than Risk Management First Steps Toward Big Data Governance What Your Data Governance Model should address: Data Governance Tools Big Data Governance Framework: A Lean & Effective Model Organization Data Quality Management Metadata Management Compliance, Security, Privacy Policies The Enterprise Big Data Governance Pyramid Introduction to Big Data Governance Rules Organization Data Governance Council Data Stewardship Authority: Policy, decision-making, governance Governance Activities: Monitoring, Support, and more… Users: Developers, Data Scientists, End-users Master Data Management Meta Data Management Lake Data Classification Security, Privacy & Compliance Apply Tiered Data Management Model Big Data Security Policy Big Data Security Policy Big Data Security – Access Controls Big Data Security: Key Policies Data Usage Agreement Policy Security Operations Policies Information Lifecycle Management Quality Management Big Data Quality & Monitoring Data Classification Rules Data Quality Policies Metadata Best Practices Big Data Governance Rules: Best Practices Sample Data Governance & Management Tools Data Governance in the GRC Context The Costs of Poor Data Governance Big Data Governance Budget Planning What Other Companies are doing? SECTION III: BIG DATA GOVERNANCE BEST PRACTICES Data Governance Best Practices Data Protection Sensitive Data Protection Data Sharing Considerations Adherence to Governance Policies Data Protection at the High Level Low Level Data Access Protection Security Architecture for Data Lake Data Lake Classification Hadoop Lake Data Classification and Governance Policies Metadata Rules Data Structure Design Sandbox Functionality Overview Detailed Access Policies Split Data Design SECTION IV: BIG DATA GOVERNANCE FRAMEWORK PROGRAM Big Data Governance Framework Program Overview Integrity of Information and Data Resource and Assets Benefits of Compliance and Risks of non-compliance Definitions I. Organization II. Metadata Management III. Data Classification IV. Big Data Security, Privacy & Compliance V. Data Usage Agreement (DUA) VI. Security Operations Considerations & Policies VII. Information Lifecycle Management VIII. Quality Standards Data Quality Reporting Summary

Description:
Data is the new Gold and Analytics is the machinery to mine, mold and mint it. Data analytics has become core to business and decision making. The rapid increase in data volume, velocity and variety, known as big data, offers both opportunities and challenges. While open source solutions to store bi
See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.