Apache Spark

Original author(s): Matei Zaharia
Developer(s): Apache Spark
Initial release: May 26, 2014
Stable release: 3.5.4 (Scala 2.13) / December 20, 2024
Repository: Spark Repository
Written in: Scala[1]
Operating system: Microsoft Windows, macOS, Linux
Available in: Scala, Java, SQL, Python, R, C#, F#
Type: Data analytics, machine learning algorithms
License: Apache License 2.0
Website: spark.apache.org

Apache Spark is an open-source unified analytics engine for large-scale data processing. Spark provides an interface for programming clusters with implicit data parallelism and fault tolerance. It was originally developed at the University of California, Berkeley's AMPLab starting in 2009. In 2013, the Spark codebase was donated to the Apache Software Foundation, which has maintained it since.

  1. "Spark Release 2.0.0". "MLlib in R: SparkR now offers MLlib APIs [..] Python: PySpark now offers many more MLlib algorithms"
