AMS 524, Modern Computational Data Analytics

This course introduces tools for analyzing big data sets on server machines. It teaches how to store, preprocess, analyze, and visualize data arriving at high volume and velocity. In the first part of the course, we will cover programming in Python, from its basic libraries to more advanced methods for big data analytics and machine learning, with emphasis on implementation in Python and practical hands-on examples. Next, we will learn essential shell scripting and terminal commands for computations on server machines, and we will introduce database management systems and SQL querying. In the second part of the course, we will discuss code version control and collaboration with GitHub and GitHub Actions, microservices, containers (Docker and Kubernetes), API gateways, and other tools necessary in a professional data science pipeline.

Note: Instructor consent required

Summer, 3 credits, ABCF grading
May be repeated for credit