
EMC Data Computing Appliance Getting Started Guide (for EMC use only)

104 Pages·2014·4.88 MB·English


EMC Data Computing Appliance Getting Started Guide
Appliance Version 2.x

APPLIES TO THE FOLLOWING VERSIONS:
- 2.0.0.0
- 2.0.1.0
- 2.0.2.0
- 2.0.3.0
- 2.0.4.0

EMC Confidential - for EMC use only

Copyright © 2014 EMC Corporation. All rights reserved.
Published June, 2014

EMC believes the information in this publication is accurate as of its publication date. The information is subject to change without notice.

THE INFORMATION IN THIS PUBLICATION IS PROVIDED “AS IS.” EMC CORPORATION MAKES NO REPRESENTATIONS OR WARRANTIES OF ANY KIND WITH RESPECT TO THE INFORMATION IN THIS PUBLICATION, AND SPECIFICALLY DISCLAIMS IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.

Use, copying, and distribution of any EMC software described in this publication requires an applicable software license.

For the most up-to-date listing of EMC product names, see EMC Corporation Trademarks on EMC.com. All other trademarks used herein are the property of their respective owners.

EMC DCA Getting Started Guide - Contents

Preface
    About This Guide
    Document Conventions
        Text Conventions
        Command Syntax Conventions
    Getting Support
        Product information
        Technical support
Chapter 1: About EMC DCA
    About the DCA
        Power Connection Requirements when Plugging in a New Rack
        Connecting New Racks to the Power Supply
        DCA Configurations
        Rack Types
        About the Network Configuration
    DCA Modules and Master Servers
        Master Servers
        GPDB Modules
        Data Integration Accelerator Modules
        HD Compute Modules
        Hadoop Master and Worker Modules
    About Greenplum Database
        About the Master Servers
        About the Segment Hosts
Chapter 2: Preparing the Data Center Environment
    Confirming Site Requirements
        Floor Space Requirements
        DCA Rack Dimensions
        Power Cord Specifications
        Environmental Requirements
        Air Quality Requirements
    Optional Securing Brackets
        Anti-Tip Bracket
        Anti-Move Bracket
        Seismic Restraint Bracket
    Cabinet Positioning
    Package Dimensions and Clearance
Chapter 3: Planning for a Multiple Rack DCA
Chapter 4: Gathering Site-Specific Information
    VLAN Overlay
    Planning for Remote Support - ESRS and Dialhome
Chapter 5: DCA Administration
    DCA utilities
        Description
        Options
        ConnectEMC Dial Home Capability
        Greenplum Command Center
        Pivotal Command Center
        Greenplum Database Email and SNMP Alerting
    SNMP on the DCA
    DCA MIB information
        MIB Locations
        MIB Contents
        View MIB
    Integrate DCA MIB with environment
        Change the SNMP community string
        Set an SNMP Trap Sink
    General Database Maintenance Tasks
        Routine Vacuum and Analyze
        Routine Reindexing
        Managing Greenplum Database Log Files
    Next Steps
Chapter 6: Power Down the DCA
Chapter 7: Next Steps
    Documentation Resources
    Providing User Access to Greenplum Database
    Creating Databases and Loading Data
Appendix A: Red Hat Enterprise Linux End User License Agreement
Appendix B: Apache Hadoop End User License Agreement
Glossary

Preface

This guide is intended for EMC personnel, partners, database and system administrators, and customers to plan for installing a new Greenplum Data Computing Appliance (DCA) into a data center. This guide provides an overview of the system, information on data center requirements, a checklist of items needed for software configuration, and links to relevant documentation for use in the next steps of deployment. This guide also contains an overview of the appliance configuration.

Verify that the requirements listed in this document are satisfied before performing a DCA installation.
• About This Guide
• Document Conventions
• Getting Support

About This Guide

This guide assumes knowledge of Linux/UNIX system administration, database management systems, database administration, and structured query language (SQL).

This guide contains the following chapters and appendices:
• Chapter 1, “About EMC DCA” explains the architecture, components, and configuration of Greenplum Database on the DCA.
• Chapter 2, “Preparing the Data Center Environment” describes site requirements for the DCA, securing brackets, cabinet positioning, and package dimensions and clearance.
• Chapter 3, “Planning for a Multiple Rack DCA” contains information required to plan for a multiple rack DCA.
• Chapter 4, “Gathering Site-Specific Information” contains a site requirements checklist, a plan for Hadoop networking, and information on remote support.
• Chapter 5, “DCA Administration” describes the general database maintenance tasks and the tools available to diagnose, monitor, and troubleshoot a Greenplum Database system running on the Greenplum Data Computing Appliance.
• Chapter 6, “Power Down the DCA” explains how to power down the DCA safely.
• Chapter 7, “Next Steps” explains the next steps for implementing your data warehouse requirements in Greenplum Database.
• “Glossary” defines Greenplum Database components and terminology.

Document Conventions

The following conventions are used throughout the Greenplum Database documentation to help you identify certain types of information.
• Text Conventions
• Command Syntax Conventions

Text Conventions

Table 0.1 Text Conventions

Text Convention: bold
Usage: Button, menu, tab, page, and field names in GUI applications
Examples: Click Cancel to exit the page without saving your changes.

Text Convention: italics
Usage: New terms where they are defined. Database objects, such as schema, table, or column names.
Examples: The master instance is the postgres process that accepts client connections. Catalog information for Greenplum Database resides in the pg_catalog schema.

Text Convention: monospace
Usage: File names and path names. Programs and executables. Command names and syntax. Parameter names.
Examples: Edit the postgresql.conf file. Use gpstart to start Greenplum Database.

Text Convention: monospace italics
Usage: Variable information within file paths and file names. Variable information within command syntax.
Examples: /home/gpadmin/config_file
          COPY tablename FROM 'filename'

Text Convention: monospace bold
Usage: Used to call attention to a particular part of a command, parameter, or code snippet.
Examples: Change the host name, port, and database name in the JDBC connection URL: jdbc:postgresql://host:5432/mydb

Text Convention: UPPERCASE
Usage: Environment variables. SQL commands. Keyboard keys.
Examples: Make sure that the Java /bin directory is in your $PATH.
          SELECT * FROM my_table;
          Press CTRL+C to escape.

Command Syntax Conventions

Table 0.2 Command Syntax Conventions

Text Convention: { }
Usage: Within command syntax, curly braces group related command options. Do not type the curly braces.
Examples: FROM { 'filename' | STDIN }

Text Convention: [ ]
Usage: Within command syntax, square brackets denote optional arguments. Do not type the brackets.
Examples: TRUNCATE [ TABLE ] name

Text Convention: ...
Usage: Within command syntax, an ellipsis denotes repetition of a command, variable, or option. Do not type the ellipsis.
Examples: DROP TABLE name [, ...]

Text Convention: |
Usage: Within command syntax, the pipe symbol denotes an “OR” relationship. Do not type the pipe symbol.
Examples: VACUUM [ FULL | FREEZE ]

Text Convention: $ system_command
                 # root_system_command
                 => gpdb_command
                 =# su_gpdb_command
Usage: Denotes a command prompt - do not type the prompt symbol. $ and # denote terminal command prompts. => and =# denote Greenplum Database interactive program command prompts (psql or gpssh, for example).
Examples: $ createdb mydatabase
          # chown gpadmin -R /datadir
          => SELECT * FROM mytable;
          =# SELECT * FROM pg_database;

Getting Support

EMC support, product, and licensing information can be obtained as follows.
Product information

For DCA product-specific documentation, release notes, or software updates, go to the EMC Online Support site at http://support.emc.com, click Support By Product, and search for Data Computing Appliance.

Technical support

For technical support, go to http://support.emc.com. The Support page includes several support options, including an option to request service. Note that to open a service request, you must have a valid support agreement. Please contact your EMC sales representative for details about obtaining a valid support agreement or with questions about your account.

1. About EMC DCA

The EMC DCA is a self-contained data warehouse solution that integrates all of the database software, servers, and switches necessary to perform big data analytics. The DCA is a turn-key, easily installed data warehouse solution that provides extreme query and loading performance for analyzing large data sets. The DCA integrates Greenplum Database, data loading, and Hadoop software with compute, storage, and network components. The DCA is delivered racked and ready for immediate data loading and query execution.

This chapter includes the following sections:
• About the DCA
• DCA Modules and Master Servers

About the DCA

This section explains the hardware components and specifications of the Greenplum Data Computing Appliance.
• Power Connection Requirements when Plugging in a New Rack
• Connecting New Racks to the Power Supply
• DCA Configurations
• Rack Types
• About the Network Configuration

Power Connection Requirements when Plugging in a New Rack

If a rack in your DCA cluster contains more than one module (each module is four servers), four power cords are required. If a rack contains a single module (four servers), two power cords are required.
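The power cord rule above can be written as a small lookup. This is a minimal sketch, assuming the rule depends only on the number of modules in a rack. The function name power_cords_for_rack is hypothetical and is not part of any DCA utility.

```python
def power_cords_for_rack(modules_in_rack: int) -> int:
    """Return the number of power cords a rack needs, per the rule above.

    Hypothetical planning helper; it is not a DCA tool.
    """
    if modules_in_rack < 1:
        # The guide only covers racks that hold at least one module.
        raise ValueError("a rack must contain at least one module")
    # A single module (four servers) needs two cords; more than one needs four.
    return 2 if modules_in_rack == 1 else 4


if __name__ == "__main__":
    for modules in (1, 2, 4):
        print(modules, "module(s):", power_cords_for_rack(modules), "power cords")
```

A site planner could call this once per rack when counting the power feeds to request from the data center.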
Connecting New Racks to the Power Supply

When installing a new rack, the power source must be connected. When upgrading a rack from one module to two, three, or four modules, the power distribution panel (PDP) to power distribution unit (PDU) connections may need to be re-routed. The customer power feeds connect to PDPs, which feed PDUs. The switches and servers connect to PDUs.

DCA Configurations

The DCA is built from server increments called modules. The supported configurations are:
• All DCA configurations include all required switches and two master nodes for cluster management.
• Greenplum Database (GPDB) DCA (can be GPDB-only or a mix of GPDB and other types of servers):
    • Requires a minimum of one GPDB module in the System Rack, occupying the lowest rack position
    • A GPDB module consists of four Intel 2U 24-drive servers
    • Maximum GPDB modules per rack: four modules (sixteen 24-drive servers)
• Hadoop-only DCA (applies to DCA version 2.0.1.0 and later):
    • Minimum Hadoop configuration: one hdw module + one hdm module
    • A Hadoop Worker module (hdw) consists of four 2U Intel 12-drive servers
    • A Hadoop Master module (hdm) consists of four 2U Intel 12-drive servers
• Hadoop Compute configuration:
    • Four HDC modules
• There are three different DIA modules:
    • Two 1U, six-drive Intel servers
    • Two 2U, twelve-drive Intel servers
    • Two 2U, twenty-four-drive Intel servers

Minimum GPDB configuration

The minimum Greenplum Database (GPDB)-based DCA consists of a single GPDB module. The maximum GPDB configuration is 48 modules occupying 12 racks.

Figure 1.1 Minimum GPDB configuration
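The limits stated in this section imply some simple sizing arithmetic for a GPDB-only DCA. The sketch below is an illustration only. It assumes each rack is filled to the stated maximum of four GPDB modules before another rack is added, and it counts module servers only (not master nodes or switches). The function and constant names are hypothetical.

```python
import math

SERVERS_PER_MODULE = 4      # a GPDB module is four servers
MAX_MODULES_PER_RACK = 4    # maximum GPDB modules per rack
MAX_GPDB_MODULES = 48       # maximum GPDB configuration (12 racks)


def gpdb_footprint(modules: int) -> dict:
    """Estimate server and rack counts for a GPDB-only DCA.

    Hypothetical helper: assumes racks are packed to the per-rack maximum.
    """
    if not 1 <= modules <= MAX_GPDB_MODULES:
        raise ValueError(f"modules must be between 1 and {MAX_GPDB_MODULES}")
    return {
        "modules": modules,
        "servers": modules * SERVERS_PER_MODULE,
        "racks": math.ceil(modules / MAX_MODULES_PER_RACK),
    }


if __name__ == "__main__":
    for count in (1, 5, 48):
        print(gpdb_footprint(count))
```

Under these assumptions, 48 modules map to 12 racks, which agrees with the maximum configuration stated above.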

