Nov 23, 2024  
2022-2023 Graduate Academic Catalog 
    
2022-2023 Graduate Academic Catalog [ARCHIVED CATALOG]

Add to Portfolio (opens a new window)

CS 6330 - Data Science

3 lecture hours 2 lab hours 4 credits
Course Description
Applied data science including data preparation, exploratory data analysis, data visualization, statistical testing, and predictive modeling.  Emphasis will be placed on extracting information from data sets that can be turned into actionable insights or interventions.  Problems and data sets may be selected by disciplines of interest to the students and expertise as appropriate. (prereq: CS 6230 )
Course Learning Outcomes
Upon successful completion of this course, the student will be able to:
  • Understand the basic process of data science and exploratory data analysis including modes of inquiry (hypothesis driven, data driven, and methods driven)
  • Identify, access, load, and prepare (clean) a data set for a given problem
  • Select, apply, and interpret appropriate visual and statistical methods to analyze distributions of individual variables and relationships between pairs of variables
  • Communicate findings through generated data visualizations and reports
  • Apply and interpret unsupervised learning models for exploratory data analysis
  • Generate appropriate supervised learning problem descriptions
  • Determine and apply appropriate experimental setup, evaluation metrics, and models for supervised learning problems
  • Engineer features for machine learning tasks
  • Perform and interpret feature selection to identify relationships between features and predicted variables
  • Apply methods to real-world data sets

Prerequisites by Topic
  • Application of the scientific method to data analysis and inference
  • Cleaning and managing data using functional programming libraries such as pandas
  • Using matrices and software libraries to structure data for analysis and manipulation
  • Manipulation of models of real-world problems based on data
  • Communicationg of data interpretations including generating meaningful data visualizations

Coordinator
Dr. RJ Nowling



Add to Portfolio (opens a new window)