Hortonworks Data Platform with IBM Spectrum Scale: Reference Guide for Building an Integrated Solution

An IBM Redpaper publication

Published on 26 June 2018

ISBN-10: 0738456969
ISBN-13: 9780738456966
IBM Form #: REDP-5448-01


Authors: R. Sandeep Patil, Wei G. Gong, Pallavi Galgali, Piyush Chaudhary, Muthu Muthiah, Yong ZY Zheng and Larry Coyne

Abstract

This IBM® Redpaper™ publication provides guidance on building an enterprise-grade data lake by using IBM Spectrum Scale™ and Hortonworks Data Platform to perform in-place Hadoop or Spark-based analytics. It covers the benefits of the integrated solution, describes the deployment models, and outlines the considerations for implementing them.

Hortonworks Data Platform (HDP) is a leading Hadoop and Spark distribution. HDP addresses the complete needs of data-at-rest, powers real-time customer applications, and delivers robust analytics that accelerate decision making and innovation.

IBM Spectrum Scale™ is flexible and scalable software-defined file storage for analytics workloads. Enterprises around the globe have deployed IBM Spectrum Scale to form large data lakes and content repositories for high-performance computing (HPC) and analytics workloads. It can scale both performance and capacity without bottlenecks.

Table of Contents

Hortonworks Data Platform

IBM Spectrum Scale

Integrated solution overview

Component diagram

Deployment diagram

Deployment models

Shared Storage model

Shared Nothing Storage model

System configuration

HDP and IBM Spectrum Scale frequently asked questions

 
