Course curriculum

    1. Why Big Data

    2. Applications of PySpark

    3. Introduction to Instructor

    4. Introduction to Course

    5. Projects Overview

    1. Why Spark

    2. Hadoop Ecosystem

    3. Spark Architecture and Ecosystem

    4. Databricks Sign-Up

    5. Create Databricks Notebook

    6. Download Spark and Dependencies

    7. Java Setup on Windows

    8. Python Setup on Windows

    9. Spark Setup on Windows

    10. Hadoop Setup on Windows

    11. Running Spark on Windows

    12. Java Download on Mac

    13. Installing JDK on Mac

    14. Setting Java Home on Mac

    15. Java Check on Mac

    16. Installing Python on Mac

    17. Setup Spark on Mac

    1. Spark RDDs

    2. Creating Spark RDD

    3. Running Spark Code Locally

    4. RDD Map (Lambda)

    5. RDD Map (Simple Function)

    6. Quiz (Map)

    7. Solution 1 (Map)

    8. Solution 2 (Map)

    9. RDD FlatMap

    10. RDD Filter

    11. Quiz (Filter)

    12. Solution (Filter)

    13. RDD Distinct

    14. RDD GroupByKey

    15. RDD ReduceByKey

    16. Quiz (Word Count)

    17. Solution (Word Count)

    18. RDD (Count and CountByValue)

    19. RDD (saveAsTextFile)

    20. RDD (Partition)

    21. Finding Average-1

    22. Finding Average-2

    23. Quiz (Average)

    24. Solution (Average)

    25. Finding Min and Max

    26. Quiz (Min and Max)

    27. Solution (Min and Max)

    28. Project Overview

    29. Total Students

    30. Total Marks by Male and Female Students

    31. Total Passed and Failed Students

    32. Total Enrollments per Course

    33. Total Marks per Course

    34. Average Marks per Course

    35. Finding Minimum and Maximum Marks

    36. Average Age of Male and Female Students

    1. Introduction to Spark DFs

    2. Creating Spark DFs

    3. Spark Infer Schema

    4. Spark Provide Schema

    5. Create DF from RDD

    6. Rectifying the Error

    7. Select DF Columns

    8. Spark DF withColumn

    9. Spark DF withColumnRenamed and Alias

    10. Spark DF Filter rows

    11. Quiz (select, withColumn, filter)

    12. Solution (select, withColumn, filter)

    13. Spark DF (Count, Distinct, Duplicate)

    14. Quiz (Distinct, Duplicate)

    15. Solution (Distinct, Duplicate)

    16. Spark DF (sort, orderBy)

    17. Quiz (sort, orderBy)

    18. Solution (sort, orderBy)

    19. Spark DF (Group By)

    20. Spark DF (Group By - Multiple Columns and Aggregations)

    21. Spark DF (Group By - Visualization)

    22. Spark DF (Group By - Filtering)

    23. Quiz (Group By)

    24. Solution (Group By)

    25. Quiz (Word Count)

    26. Solution (Word Count)

    27. Spark DF (UDFs)

    28. Quiz (UDFs)

    29. Solution (UDFs)

    30. Solution (Cache and Persist)

    31. Spark DF (DF to RDD)

    32. Spark DF (Spark SQL)

    33. Spark DF (Write DF)

    34. Project Overview

    35. Project (Count and Select)

    36. Project (Group By)

    37. Project (Group By, Aggregations and Order By)

    38. Project (Filtering)

    39. Project (UDF and WithColumn)

    40. Project (Write)

    1. Collaborative Filtering

    2. Utility Matrix

    3. Explicit and Implicit Ratings

    4. Expected Results

    5. Dataset

    6. Joining Dataframes

    7. Train and Test Data

    8. ALS Model

    9. Hyperparameter Tuning and Cross-Validation

    10. Best Model and Evaluating Predictions

    11. Recommendations

    1. Introduction to Spark Streaming

    2. Spark Streaming with RDD

    3. Spark Streaming Context

    4. Spark Streaming Reading Data

    5. Spark Streaming Cluster Restart

    6. Spark Streaming RDD Transformations

    7. Spark Streaming DF

    8. Spark Streaming Display

    9. Spark Streaming DF Aggregations

About this course

  • $199.99
  • 187 lessons
  • 19.5 hours of video content