OCITA Spring Event Mike King Enterprise Technologist, Big Data Wright Patterson AFB; May 19, 2016 Acronym Key - Part 1 • VLDB – Very Large Database • CDH – Cloudera Distribution for Hadoop • PK – Primary Key • EDH – Enterprise Data Hub • AK – Alternate Key • EDW – Enterprise Data Warehouse • COTS – Commercial Off-the-Shelf • ETL – Extract, Transform & Load • KV – Key value • ELK – Elastic Search, Logstash & Kibanna • JSON – Java Script Object Notation • XML – eXtensible Markup Language • BSON – Binary Structured Object Notation • SQL – Structured Query Language • iOT – internet of things • CRM – Customer Relationship Management • JDBC – Java DataBase Connectivity • TPC – Transaction Performance Council Acronym Key - Part 2 • SOA – Service Oriented Architecture • BDE – Big Data Extensions (Vmware) • API – Application Programming Interface • FTE – Full Time Equivalent • CSV – Comma Separated Values • SIEM – Security Information Event Management • RDBMS – Relational DataBase Management System • MQ – Message Queuing • MPP – Massively Parallel Platform • ERP – Enterprise Resource Planning • ML – Machine Learning • HA – High Availability • CoE – Center of Excellence • DBA – DataBase Administrator • HTTP – HyperText Transfer Protocol • DWFT – DataWarehouse Fast Track • HDFS – Hadoop Distributed File System • *aaS – anything as-a Service Big Data 4 Dell - Internal Use - Confidential Confidential Dell - Internal Use - Confidential Trends Affecting Big Data Technology • Virtualization: App, CM, Mgt, Client Tools • Automation Consumption pattern • Integration The profession • Cloud • Tools • Analytics for all, & all… – Varying needs – Three types • Data Science • *aaS • Skills demand – I, a, p, s, DB – Needs Data tsunami – <You Name It> – Roles • iOT – How to fill • Training • Data, data & more data • Mobile Confidential Dell - Internal Use - Confidential Big Data is really complex data, with needs that extend beyond the existing tool chain Relational data Application data Sensor data (Database) MS Excel and Facebook LinkedIn Photos MS Access PDF, Word and text Twitter Videos files Different data types • Large volumes • Varying speeds Confidential Dell - Internal Use - Confidential Confidential Dell - Internal Use - Confidential Customer Success Stories Confidential Dell - Internal Use - Confidential Customer lifetime value of Big Data UK – online services G500 SI-Telco • Jan 2013: 200 nodes • Jan 2013 = 150 nodes Always ran Hadoop – saw mega • Primary use case: Top Secret growth from Jan 2013: 200 nodes Government Work • Primary use case: Web 2.0 as core • Growth: Jan 2014 = +150 nodes • Growth: Jan 2015 = +800 nodes US-based Telco Financial Services • Feb 2013 – 42 node POCs • Nov 2013: 12 nodes POC • Primary use case: Log Files, • Primary use case: Log Files, Fraud BDaaS, & Churn Analysis Analysis, 360 Customer View • Growth: March 2015- +2200 nodes • Growth: March 2015 = +220 nodes Confidential Dell - Internal Use - Confidential
Description: