Making Apache Spark™ Better with Delta Lake

Audience:

Topic:

Apache Spark™ is the dominant processing framework for big data. Delta Lake adds reliability to Spark so your analytics and machine learning initiatives have ready access to quality, reliable data. This presentation covers the use of Delta Lake to enhance data reliability for Spark environments.

Topics:

The role of Apache Spark in big data processing
Use of data lakes as an important part of the data architecture
Data lake reliability challenges
How Delta Lake helps provide reliable data for Spark processing
Specific improvements improvements that Delta Lake adds
The ease of adopting Delta Lake for powering your data lake

Presentation:

SCaLE18x - Making-Apache-Spark-Better-with-Delta-Lake.pdf

Room:

Room 104

Time:

Saturday, March 7, 2020 - 13:30 to 14:30

Learn about the steps we’re taking to mitigate the risk against Coronavirus at SCALE 18x.

Making Apache Spark™ Better with Delta Lake