A foundations course in data science, emphasizing both concepts and techniques.
The course provides an overview of data analysis tasks and the associated
challenges, spanning data preprocessing, model building, model evaluation, and
visualization. The major areas of machine learning, such as unsupervised,
semi-supervised and supervised learning are covered by data analysis techniques
including classification, clustering, association analysis, anomaly detection,
and statistical testing. The course includes a series of assignments utilizing
practical datasets from diverse application domains, which are designed to
reinforce the concepts and techniques covered in lectures. A substantial project
related to one or more data sets culminates the course.