« Apache Spark » : différence entre les versions

Dernière version du 30 août 2024 à 14:53

Définition

Spark ou Apache Spark est un cadre open source de calcul distribué. Il s'agit d'un ensemble d'outils et de composants logiciels structurés selon une architecture définie.

Développé à l'université de Californie à Berkeley par AMPLab, Spark est aujourd'hui un projet de la fondation Apache. Ce produit est un cadre applicatif de traitements de mégadonnées (big data) pour effectuer des analyses complexes à grande échelle.

Français

Apache Spark

Spark

Anglais

Apache Spark

Spark

Sources

Source : Wikipedia

Source : 277 Data Science Key Terms, Explained

@@ Ligne 1 : / Ligne 1 : @@
-== en construction ==
+== Définition ==
-[[Catégorie:Vocabulary]]
+Spark ou Apache Spark est un cadre open source de calcul distribué. Il s'agit d'un ensemble d'outils et de composants logiciels structurés selon une architecture définie.
-[[Catégorie:Mégadonnées]]
-[[Catégorie:Intelligence artificielle‏‎]]
-== Définition ==
+Développé à l'université de Californie à Berkeley par AMPLab, Spark est aujourd'hui un projet de la fondation Apache. Ce produit est un cadre applicatif de traitements de mégadonnées (big data) pour effectuer des analyses complexes à grande échelle.
-...
 == Français ==
-...
+'''Apache Spark'''
+'''Spark'''
 == Anglais ==
 ''' Apache Spark'''
-Apache Spark is a powerful open-source processing engine built around speed, ease of use, and sophisticated analytics, with APIs in Java, Scala, Python, R, and SQL. Spark runs programs up to 100x faster than Apache Hadoop MapReduce in memory, or 10x faster on disk. It can be used to build data applications as a library, or to perform ad-hoc data analysis interactively. Spark powers a stack of libraries including SQL, DataFrames, and Datasets, MLlib for machine learning, GraphX for graph processing, and Spark Streaming. You can combine these libraries seamlessly in the same application. As well, Spark runs on a laptop, Apache Hadoop, Apache Mesos, standalone, or in the cloud. It can access diverse data sources including HDFS, Apache Cassandra, Apache HBase, and S3.
+'''Spark'''
+==Sources==
-(From Denny Lee and Jules Damji's Apache Spark Key Term's, Explained)
+[https://fr.wikipedia.org/wiki/Apache_Spark  Source : Wikipedia]
+[https://www.kdnuggets.com/2017/09/data-science-key-terms-explained.html  Source : 277 Data Science Key Terms, Explained]
-<small>
+[[Catégorie:ENGLISH]]
-[https://www.kdnuggets.com/2017/09/data-science-key-terms-explained.html  Source : 277 Data Science Key Terms, Explained]
+[[Catégorie:GRAND LEXIQUE FRANÇAIS]]

« Apache Spark » : différence entre les versions