### Overview

**This course provides an overview of bachelor-level statistics. You will review the concepts of descriptive and inferential statistics. You will use the statistical software package R on real data to gain insight in these topics.**

A strong foundation in mathematics is critical for success in all science and engineering disciplines. Whether you want to make a strong start to a master’s degree, prepare for more advanced courses, solidify your knowledge in a professional context or simply brush up on fundamentals, this course will get you up to speed.

In many engineering master’s programs, statistics is used quite intensively. As soon as you are dealing with real-life data, you will need to get an idea of what these data tell you and how you can visualize this (descriptive statistics). But you will also want to perform some analysis (inferential statistics): you may want to build a model that mimics reality, estimate some quantities, or test some hypotheses.

The statistics course in this series will help you refresh your knowledge on these topics. Along the way you will learn how to apply these concepts to datasets, using the statistical software R.

This course offers enough depth to cover the statistics you need to succeed in your engineering master’s or profession in areas such as machine learning, data science and more.

**This is a review course**

This self-contained course is modular, so you do not need to follow the entire course if you wish to focus on a particular aspect. As a review course you are expected to have previously studied or be familiar with most of the material. Hence the pace will be higher than in an introductory course.

This format is ideal for refreshing your bachelor level mathematics and letting you practice as much as you want. Through the Grasple platform, you will have access to plenty of exercises – often with different numbers – and receive intelligent, personal and immediate feedback.

### What You'll Learn:

- Make and interpret numerical and graphical summaries of datasets.
- Use various techniques to find estimators for unknown parameters and how to compare them.
- Construct and interpret confidence intervals, learn how to perform hypothesis testing in various settings, and know how these two concepts are related.
- Perform simple and multiple linear regression on quantitative and categorical variables.
- Apply certain procedures (resampling, bootstrapping, non-parametric approach) when confronted with non-standard situations.
- Use the R software package to perform all these tasks.

### Details

### Course Syllabus

Week 1: Descriptive statistics

- graphical summaries of datasets
- numerical summaries of datasets
- connection with probability theory

Week 2: Estimator theory

- quality of estimators
- methods to obtain estimators

Week 3: Hypothesis testing

- concepts
- how to perform a test in various settings

Week 4: Confidence intervals (CI)

- motivation
- how to construct a CI in various settings

Week 5: Linear regression

- simple and multiple linear regression
- categorical variables
- interpreting output

Week 6: Bootstrap and resampling

- parametric and non-parametric approach
- how to deal with non-standard situations

### Admission

This is a Massive Open Online Course (MOOC) that runs on edX.

### Prerequisites

Prior knowledge of all the material covered.

Some basic calculus will be used, along with some aspects of probability theory: computation of expectation and variance of a random variable with known PDF, the central limit theorem, Bayes’ theorem ... We expect you to be familiar with these topics.

This course is a review course. As such we expect that you are already familiar with some basic topics in statistics.