Grid Data Management Systems & Services Data Grid Management Systems – Part I Arun Jagatheesan, Reagan Moore Grid Services for Structured data –Part II Paul Watson, Norman Paton VLDB Tutorial Berlin, 2003 VLDB 2003 Berlin Part I: Data Grid Management Systems Arun Jagatheesan Reagan Moore {arun, moore}@sdsc.edu San Diego Supercomputer Center University of California, San Diego http://www.npaci.edu/DICE/SRB/ VLDB Tutorial Berlin, 2003 VLDB 2003 Berlin Tutorial Part I Outline (cid:127) Concepts (cid:127) Introduction to Grid Computing (cid:127) Proliferation of Data Grids (cid:127) Data Grid Concepts (cid:127) Practice (cid:127) Real life use cases SDSC Storage Resource Broker (SRB) (cid:127) Hands on Session (cid:127) Research (cid:127) Active Datagrid Collections (cid:127) Data Grid Management Systems (DGMS) (cid:127) Open Research Issues VLDB 2003 Berlin 3 Distributed Computing © Images courtesy of Computer History Museum VLDB 2003 Berlin 4 Distributed Data Management (cid:127) Data collecting (cid:127) Sensor systems, object ring buffers and portals (cid:127) Data organization (cid:127) Collections, manage data context (cid:127) Data sharing (cid:127) Data grids, manage heterogeneity (cid:127) Data publication (cid:127) Digital libraries, support discovery (cid:127) Data preservation (cid:127) Persistent archives, manage technology evolution (cid:127) Data analysis (cid:127) Processing pipelines, manage knowledge extraction VLDB 2003 Berlin 5 What is a Grid? “Coordinated resource sharing and problem solving in dynamic, multi-institutional virtual organizations” Ian Foster, ANL What is Middleware? Software that manages distributed state information for results of remote services Reagan Moore, SDSC VLDB 2003 Berlin 6 Data Grids (cid:127) A datagrid provides the coordinated management mechanisms for data distributed across remote resources. (cid:127) Data Grid (cid:127) Coordinated sharing of information storage (cid:127) Logical name space for location independent identifiers (cid:127) Abstractions for storage repositories, information repositories, and access APIs (cid:127) Computing grid and the datagrid part of the Grid. (cid:127) Data generation versus data management VLDB 2003 Berlin 7 Tutorial Part I Outline (cid:127) Concepts (cid:127) Introduction to Grid Computing Are data grids in production use? (cid:127) Proliferation of Data Grids How are they (cid:127) Data Grid Concepts applied? (cid:127) Practice (cid:127) Real life use cases SDSC Storage Resource Broker (SRB) (cid:127) Hands on Session (cid:127) Research (cid:127) Active Datagrid Collections (cid:127) Data Grid Management Systems (DGMS) (cid:127) Open Research Issues VLDB 2003 Berlin 8 Storage Resource Broker at SDSC More features, 60 Terabytes and counting VLDB 2003 Berlin 9 NSF Infrastructure Programs (cid:127) Partnership for Advanced Computational Infrastructure - PACI (cid:127) Data grid - Storage Resource Broker (cid:127) Distributed Terascale Facility - DTF/ETF (cid:127) Compute, storage, network resources (cid:127) Digital Library Initiative, Phase II - DLI2 (cid:127) Publication, discovery, access (cid:127) Information Technology Research projects - ITR (cid:127) SCEC Southern California Earthquake Center (cid:127) GEON GeoSciences Network (cid:127) SEEK Science Environment for Ecological Knowledge (cid:127) GriPhyN Grid Physics Network (cid:127) NVO National Virtual Observatory (cid:127) National Middleware Initiative - NMI (cid:127) Hardening of grid technology (security, job execution, grid services) (cid:127) National Science Digital Library - NSDL (cid:127) Support for education curricula modules VLDB 2003 Berlin 10
Description: