Integration of IBM Platform Symphony and IBM InfoSphere BigInsights

An IBM Redpaper publication

Published 30 April 2013

IBM Form #: REDP-5006-00
(36 pages)

Authors: Dino Quintero, Ricardo Dobelin Barros, Ashraf Gomaa, José Higino, Archana Kumar, Majid Ouassir, Adam Parker, Joanna Wong

Abstract

Hadoop can store big data and analyze it to unlock answers. IBM® InfoSphere® BigInsights™ is built on top of open source Hadoop and extends it with advanced analytic tools and other value-added capabilities. InfoSphere BigInsights helps organizations of all sizes manage more efficiently the vast amounts of data that consumers and businesses create every day. At its core, Hadoop is a distributed computing environment that manages the execution of distributed jobs and tasks on a cluster. As with any distributed computing environment, the Hadoop software needs to provide facilities for resource management, scheduling, remote execution, and exception handling. Although Hadoop provides basic capabilities in these areas, IBM Platform Computing has been working on these problems and refining its solutions to them for twenty years.

This IBM Redpaper™ publication describes the integration of IBM Platform Symphony® 5.2 and IBM InfoSphere BigInsights 1.4 in an IBM System x® cluster. IBM Platform Symphony is a low-latency scheduling solution that supports true multitenancy and sophisticated workload management capabilities.

Table of contents

IBM Platform Symphony
Environment
Configuring InfoSphere BigInsights
Installing IBM Platform Symphony Advanced Edition
Integrating IBM Platform Symphony and InfoSphere BigInsights
Additional configuration for IBM Platform Symphony
Benchmark tests
Adding users
Adding nodes
Troubleshooting
