Using Talend Data Mapper on Spark

 

Subscription onlyThis content is available for Talend Academy subscription users only. Open use case - EN

 

Prerequisites

Talend Data Mapper Essentials, Talend Data Integration Basics, Talend Big Data Basics

Third-party software

Hadoop cluster

Description

 

 

This use case introduces you to how to use Talend Data Mapper strengths on Spark batch and Spark Streaming.

 

Talend Data Mapper offers multiple specialized components that allow you to process hierarchical files at the speed of Spark.

 

Throughout this use case, you will create Big Data batch and streaming Jobs, invoke TDM maps from these Jobs to transform hierarchical files and streams of hierarchical records.