Deploy Tall Arrays to a Spark Enabled Hadoop Cluster

Create and execute MATLAB^® applications with tall arrays against a Spark™ enabled Hadoop^® cluster

Supported Platform: Linux^® only.

Deploying a MATLAB application that contains tall arrays against a Spark enabled Hadoop cluster consists of two parts:

Creating and packaging a standalone application in the MATLAB desktop environment.
Executing the standalone application on a Spark enabled Hadoop cluster from a Linux shell.

For a complete example on deploying tall arrays to a Spark enabled Hadoop cluster, see Deploy Tall Arrays to a Spark Enabled Hadoop Cluster. You can follow the same instructions to deploy tall array Spark applications to CLOUDERA^® CDH.

Classes

matlab.mapreduce.DeploySparkMapReducer Configure a MATLAB tall array application with Spark parameters as key-value pairs

Functions

mapreducer Define execution environment for mapreduce or tall arrays

Topics

Apache Spark Basics
Learn basic Apache^® Spark concepts and see how these concepts relate to deploying MATLAB applications to Spark.
Deploy Tall Arrays to a Spark Enabled Hadoop Cluster
Try an example on how to deploy a tall array MATLAB application to a Spark enabled Hadoop cluster.