SparkSQL only examples that process William Shakespeare's plays. The text of William Shakespeare's plays are from https://old.datahub.io/dataset/william-shakespeare-plays .
SQL scripts:
$SPARK_HOME/bin/spark-sql -f length_of_play.sqlPython scripts:
$SPARK_HOME/bin/spark-submit length_of_play.py- Spark SQL, DataFrames and Datasets Guide: http://spark.apache.org/docs/latest/sql-programming-guide.html
- Spark SQL Reference: https://docs.databricks.com/spark/latest/spark-sql/index.html