Syllabus: Data Mining with SQL, etc., Prof. Kurnicki, Hult International (Spring 2020)
Spring 2020: Data Mining With MySQL, NoSQL, Hadoop, Spark, and Hive
A deep dive into the principles and techniques of data mining and of driving business insight with NoSQL, Hadoop, Apache Spark, and Apache Hive. Topics include querying datasets with SQL in Dataiku Studio, machine learning with PySpark, querying data with the Hive Query Language (HQL), and creating business value from data stored in Hadoop (Spark environment). Dataiku Studio will be used to practice SQL and to run queries against PostgreSQL and MongoDB databases. Students will also learn how to use Python packages to connect to a Hadoop database.