Apache Spark is a unified analytics engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing. this tags contains basic tutorial on various Apache spark functions
Apache Spark is a Big data processing engine which has components like "Spark SQL", "Spark Mlib" & "Spark streaming", we generally uses Apache spark for processing big
This Tutorial explain what is Spark imputer, implement the Imputer and basic terminologies used while using the imputer.And strategies available in spark imputer.
Overview of this tutorial
* Replace the data with new value in Data Frame
* Filter the row values with basic conditions in Data Frame
* Type Casting the Column Value in Data Frame
To start
What is Apache Spark ?
Apache Spark is all referred as big data processing tool or framework developed under Apache. Spark has various inbuilt tool like SparkSQL, Spark Streaming,Spark Mllib,GraphX to handle