The next webinar will focus on Spark in local and cluster mode, and will take place on April 30 from 11:00 to 12:30.
Spark is a tool that enables you to process large volumes of data efficiently, taking advantage of parallelization.
This webinar will cover the following topics:
• How Spark is called from other languages (APIs)
• How Spark distributes processing across workers depending on data location, in local or cluster mode
• Types of transformations in Spark
• Examples of Spark actions (show, count, collect…)
• “Lazy evaluation” principle
• Spark resource management
To register (and receive the connection link): click here