Working with large data volumes

 

Subscription only: This content is available for Talend Academy subscription users only.

 

Talend Data Preparation can use a database as a source for creating datasets. In this Talend Data Preparation for Developers training module, you create a dataset from a MySQL database stored on your virtual machine, then build a small preparation for that dataset. Because the database contains a substantial number of rows, you also use sampling and export features that are available only for large data volumes.

 

Catalog: Data governance
Languages: EN
Format: Presentation, hands-on practices
Roles: Data governance developer
Badge track:
Learning plan: Talend Data Preparation for Developers

Hands-on tasks
  • Create a dataset from a MySQL database 

  • Progressively apply filters to a large dataset to get the most accurate data sample for your preparation 

  • Export a sample of cleansed data 

  • Export the full, cleansed dataset