1 Answer

A DataFrame in Spark SQL is a Dataset organized into named columns. It is conceptually equivalent to a table in a relational database.

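In Java and Scala, a DataFrame is represented by a Dataset of Rows: in Scala, DataFrame is a type alias for Dataset[Row], while in Java you work with Dataset<Row> directly.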

We can create a DataFrame from an existing RDD, a Hive table, or other Spark data sources.
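
As a rough illustration of the RDD case, here is a minimal Scala sketch. It assumes Spark 2.x or later running in local mode. The Person case class, the object name DataFrameSketch, the sample rows, and the "people" view name are all made up for the example. The commented-out path and table name are placeholders, not real resources.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}

// Hypothetical record type, used only for this illustration.
case class Person(name: String, age: Int)

object DataFrameSketch {
  // Build a DataFrame from an existing RDD of case-class instances.
  def fromRdd(spark: SparkSession): DataFrame = {
    import spark.implicits._ // brings .toDF() into scope for RDDs of case classes
    val rdd = spark.sparkContext.parallelize(Seq(Person("Ann", 34), Person("Bob", 27)))
    rdd.toDF() // column names are taken from the case-class fields: name, age
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("dataframe-sketch")
      .master("local[*]") // assumption: local mode, for experimenting only
      .getOrCreate()

    val people = fromRdd(spark)
    people.printSchema()

    // Because a DataFrame behaves like a table, it can be queried with SQL.
    people.createOrReplaceTempView("people")
    spark.sql("SELECT name FROM people WHERE age > 30").show()

    // Other sources follow the same pattern (placeholder names):
    //   spark.read.json("path/to/people.json")
    //   spark.table("some_hive_table") // needs enableHiveSupport() on the builder

    spark.stop()
  }
}
```

The temporary view is what makes the "table in SQL" comparison concrete: once registered, the DataFrame can be queried by name like any other table.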

 

...