A Big Data Hadoop and Spark Project for Absolute Beginners
Udemy, last updated 9/2020

Hadoop, Spark, Python, PySpark, Scala, Dataproc, AWS S3 Data Lake, Glue, Athena

What you'll learn:
Big Data, Hadoop and Spark from scratch using Python and Scala. You will also learn how to use free cloud tools to get started with Hadoop and Spark programming in minutes. Additionally, you will find two bonus projects: an AWS data lake solution and a machine learning classification model.

Requirements:
Students should have some programming background and some knowledge of SQL queries.

Description:
A bank is launching a new credit card and wants to identify prospects it can target in its marketing campaign. It has received prospect data from various internal and third-party sources. The data has various issues, such as missing or unknown values in certain fields, and needs to be cleansed before any kind of analysis can be done. Because the data volume is huge, with billions of records, the bank has asked you to use Big Data Hadoop and Spark technology to cleanse, transform and analyze it.
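The cleansing step in the scenario above can be sketched in a few lines. This is a minimal illustration in plain Python, not the course's actual PySpark code. The field names (`age`, `job`, `salary`), the set of "missing" markers, and the rules (drop records without a valid age, default an unknown job to "unspecified" and a missing salary to 0) are all assumptions made for the example.

```python
from typing import Optional

# Hypothetical markers that count as "missing"; the real data may use others.
MISSING_MARKERS = {"", "unknown", "na", "null"}


def normalize(value: Optional[str]) -> Optional[str]:
    """Return the stripped value, or None if it is missing or unknown."""
    if value is None:
        return None
    stripped = value.strip()
    return None if stripped.lower() in MISSING_MARKERS else stripped


def cleanse(records: list[dict]) -> list[dict]:
    """Drop records without a usable age and fill in defaults elsewhere."""
    cleaned = []
    for record in records:
        age = normalize(record.get("age"))
        if age is None or not age.isdigit():
            continue  # assumed rule: a prospect without a valid age is unusable
        salary = normalize(record.get("salary"))
        cleaned.append({
            "age": int(age),
            "job": normalize(record.get("job")) or "unspecified",
            "salary": int(salary) if salary and salary.isdigit() else 0,
        })
    return cleaned


# Made-up sample rows, for illustration only.
prospects = [
    {"age": "34", "job": "admin", "salary": "52000"},
    {"age": "unknown", "job": "technician", "salary": "41000"},
    {"age": "45", "job": "unknown", "salary": ""},
]
cleaned_prospects = cleanse(prospects)
```

With billions of records, the same rules would normally be expressed as Spark DataFrame operations (for example, a filter followed by `DataFrame.na.fill`) so that the work is distributed across the cluster instead of running in a single Python loop.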

What you will learn:
- Big Data and Hadoop concepts
- How to create a free Hadoop and Spark cluster using Google Dataproc
- Hadoop hands-on: HDFS, Hive
- Why there was a need for Spark
- Python basics
- PySpark RDD: hands-on
- PySpark SQL, DataFrame: hands-on
- Project work using PySpark and Hive
- Scala basics
- Spark Scala DataFrame
- Project work using Spark Scala
- Google Colab environment
- Bonus project: applying Spark transformations to data stored in AWS S3 using Glue and viewing the data using Athena

Prerequisites:
- Some basic programming skills
- Some knowledge of SQL queries

Who this course is for:
- Beginners who want to learn Big Data, or experienced people who want to transition to a Big Data role

Filename: A Big Data Hadoop and Spark project for absolute beginners.zip
Filesize: 1.89 GB
Course content: https://www.udemy.com/course/big-data-hadoop-spark-project/