CASD DATA TECH Webinar: Spark in cluster and local mode

The next webinar will focus on Spark in local and cluster mode, and will take place on April 30 from 11:00 to 12:30.

Spark is a tool that enables you to process large volumes of data efficiently by taking advantage of parallelization.

This webinar will cover the following topics:
• How Spark is called from other languages (APIs)
• How Spark distributes processing across workers depending on data location, in local or cluster mode
• Types of transformations in Spark
• Examples of Spark actions (show, count, collect…)
• “Lazy evaluation” principle
• Spark resource management

To register (and receive the connection link): click here