Introduction

Course structure

This course combines asynchronous teaching elements (e.g., texts and pre-recorded videos on this website) with synchronous elements (e.g., weekly in-person interactive sessions in the PC Lab). The syllabus consists of three main parts, as reflected by the structure of this website:

  1. Lecture notes: the lecture part will explain the theory behind the concepts and methods and provide you with example applications using the statistical software R.
  2. Individual assignments: the individual assignments require you to apply the acquired knowledge to new data sets.
  3. Group project: in the group you will design and conduct your own market research project and transfer the knowledge to a real business setting.

The general approach is that students will first familiarize themselves with the contents by going through the materials on their own. This self-study process is complemented with in-person weekly interactive sessions in the PC lab, which provide ample opportunities to ask questions and clarify points that require further discussion. The schedule for each of the three parts will be explained below.

Schedule

In-person lecture

The contents on this website are divided into weekly readings. To be able to follow the curriculum and complete the assignments, you need to read the materials assigned for the respective week. The relevant chapters are indicated in the table below. The weekly readings will be complemented with weekly interactive sessions in the PC lab, which provide you with an opportunity to ask questions about the assigned readings. Please note that you need to go through the materials on your own in the week before the respective session. For example, chapters 2, 3 & 4 will be discussed in the second session. The dates and times for the classroom sessions are indicated in the table below for each group separately. It is highly recommended to prepare questions or comments about the materials for these sessions that you think might be interesting and helpful to the class. As a preparation for the in-class discussions, you should go through the Learning check section at the end of each chapter. By working through these questions, you may self-assess your progress and identify knowledge gaps regarding the materials that were assigned for the previous week.


Lecture dates - Group A
Date Day Time Room Topics Chapters
Oct. 3 Monday 01:00PM - 05:00PM TC.-1.61 Introduction to the course
Basic concepts
1
Oct. 10 Monday 01:00PM - 05:00PM TC.-1.61 Introduction to R & R Markdown 2, 3, 4
Oct. 17 Monday 01:00PM - 05:00PM TC.-1.61 Introduction to inferential statistics
Hypothesis testing I
5, 6.1, 6.2, 6.3
Oct. 24 Monday 01:00PM - 04:00PM TC.5.03 Hypothesis testing II 6.4, 6.5, 6.6
Oct. 31 Monday 01:00PM - 05:00PM EA.6.026 Regression I 7.1, 7.2, 7.3
Nov. 7 Monday 01:00PM - 05:00PM TC.-1.61 Regression II 7.4, 7.5, 7.6
Nov. 14 Monday 01:00PM - 05:00PM TC.-1.61 Factor analysis
Cluster analysis
8, 9
Note: Dates and times are indicated for group A.

Lecture dates - Group B
Date Day Time Room Topics Chapters
Oct. 5 Wednesday 01:00PM - 05:00PM TC.-1.61 Introduction to the course
Basic concepts
1
Oct. 12 Wednesday 12:00PM - 04:00PM TC.-1.61 Introduction to R & R Markdown 2, 3, 4
Oct. 19 Wednesday 12:00PM - 04:00PM TC.-1.61 Introduction to inferential statistics
Hypothesis testing I
5, 6.1, 6.2, 6.3
Oct. 28 Friday 01:00PM - 05:00PM TC.5.03 Hypothesis testing II 6.4, 6.5, 6.6
Nov. 2 Wednesday 12:00PM - 04:00PM EA.6.026 Regression I 7.1, 7.2, 7.3
Nov. 9 Wednesday 12:00PM - 04:00PM TC.-1.61 Regression II 7.4, 7.5, 7.6
Nov. 16 Wednesday 12:00PM - 04:00PM TC.-1.61 Factor analysis
Cluster analysis
8, 9
Note: Dates and times are indicated for group B.


We understand that the self-study format might pose challenges to the learning process because we cannot troubleshoot in person outside of the classroom sessions. Remember that it is very unlikely that you are the only student encountering a particular problem. So please make use of the forum on Learn@WU (see below) to interact with your peers or ask us questions so that everyone else will benefit from the answer (there are no stupid questions!). In case you cannot get answers to address a specific problem, we will be available during the in-person classroom sessions for coaching.

Individual assignments

There will be 4 individual assignments complemented with optional coaching sessions. Note that the assignments need to be submitted in the R Markdown format (see chapter 10) via Learn@wu. There will be a coaching session dedicated to the R Markdown reporting format in the second session, when the first homework is assigned.

Your main point of contact for the individual assignments will be Daria Yudaeva (daria.yudaeva@wu.ac.at). You may contact Daria to schedule individual coaching sessions during the times indicated in the table below. These coaching sessions will be held online. However, please make sure that you have exhausted all other resources to solve a particular problem, such as the online tutorial, the forum on Learn@wu, and other web resources (see below) before you schedule a coaching session. If you feel that other students might have similar questions and would benefit from an answer to a particular question, you should post the question in the forum on Learn@wu.


Assignment schedule - Group A
Assignment Assigned Coaching Submission
Assignment 1: R Basics Oct. 10 Oct. 14, 09:00AM-01:00PM (online)* Oct. 16, 11:59PM
Assignment 2: Hypothesis testing Oct. 17 Oct. 28, 09:00AM-01:00PM (online)* Oct. 30, 11:59PM
Assignment 3: Regression Oct. 31 Nov. 11, 09:00AM-01:00PM (online)* Nov. 13, 11:59PM
Assignment 4: Factor and cluster analysis Nov. 14 Nov. 25, 09:00AM-01:00PM (online)* Nov. 27, 11:59PM
Note: Dates and times are indicated for group A.
Dates indicated with ’*’ are optional time slots for individual online video coaching sessions with Daria.
If you need to schedule a meeting, please contact Daria at least one day in advance ().

Assignment schedule - Group B
Assignment Assigned Coaching Submission
Assignment 1: R Basics Oct. 12 Oct. 14, 02:00PM-06:00PM (online)* Oct. 18, 11:59PM
Assignment 2: Hypothesis testing Oct. 19 Oct. 31, 12:00PM-04:00PM (online)* Nov. 01, 11:59PM
Assignment 3: Regression Nov. 2 Nov. 11, 02:00PM-06:00PM (online)* Nov. 15, 11:59PM
Assignment 4: Factor and cluster analysis Nov. 16 Nov. 25, 02:00PM-06:00PM (online)* Nov. 29, 11:59PM
Note: Dates and times are indicated for group B.
Dates indicated with ’*’ are optional time slots for individual online video coaching sessions with Daria.
If you need to schedule a meeting, please contact Daria at least one day in advance ().


Group project

The marketing research group project consists of multiple consecutive steps:

  • the design of a questionnaire
  • data collection using an online survey
  • data handling & data analysis
  • recording of a video with a presentation of you results

We will offer coaching sessions throughout this process, providing feedback and allowing you to ask questions. The submission dates and the dates and times for the coaching sessions are summarized in the table below.

Your main point of contact for the group project will be Daniel Winkler (daniel.winkler@wu.ac.at). You may contact Daniel at least one week in advance to schedule individual coaching sessions for your groups (in-person or online) during the following times:

  • Thursdays 09:00AM - 05:00PM
  • Fridays 01:00PM - 05:00PM

Again, please make sure that you have exhausted all other resources to solve a particular problem, such as the online tutorial, the forum on Learn@wu, and other web resources (see below) before you schedule a coaching session. If you feel that other students might have similar questions and would benefit from an answer to a particular question, you should post the question in the forum on Learn@wu.


Group project schedule - Group A
Component Date Room
Intro to Questionnaire design Nov. 21, 01:00PM - 05:00PM TC.-1.61
Questionnaire coaching* Nov. 28, 12:30PM - 04:30PM TC.-1.61
Data analysis coaching* Dec. 12, 01:00PM - 05:00PM D4.0.022
(Pre-recorded) presentation submission Jan. 6, 11:59PM
Note: Dates and times are indicated for group A.
Sessions indicated with ’*’ are group coaching sessions. Slots of 30 min. are assigned to each group within the indicated times.

Group project schedule - Group B
Component Date Room
Intro to Questionnaire design Nov. 23, 12:00PM - 04:00PM TC.-1.61
Questionnaire coaching* Nov. 30, 12:00PM - 04:00PM TC.-1.61
Data analysis coaching* Dec. 14, 02:00PM - 06:00PM EA.6.032
(Pre-recorded) presentation submission Jan. 6, 11:59PM
Note: Dates and times are indicated for group B.
Sessions indicated with ’*’ are group coaching sessions. Slots of 30 min. are assigned to each group within the indicated times.


Grading

Grading is based on the following components:

  • Market research group project (questionnaire design, data collection & analysis, reporting & presentations) [weight: 30%]
  • Individual take-home computer exercises (statistical analysis of data sets; 4 assignments accounting for 6% (A1), 8% (A2), 8% (A3), and 8% (A4)) [weight: 30%]
  • Final online exam (concepts & methods) [weight: 30%]
  • Class participation (quantity & quality of contributions during the weekly Q&A sessions, contributions in the online forum, etc.) [weight: 10%]

These grading components will be weighted with the respective weights to arrive at the final grade percentage.

The final exam will take place online at 10:00AM on December 22, 2022. The exam covers questions about the concepts and methods (no coding) and we will provide example exams from the previous years to give you an idea about what type of questions you can expect.

To ensure an equal contribution of group members for the group assignment, a peer assessment will be conducted among group members, which enters into the computation of the individual grades for the project. This means that the members of a group are required to assess other students regarding their relative contribution.

To successfully pass this course, your weighted final grade needs to exceed 60%.

Course materials

Main reference

The main reference for this course is this website along with the corresponding slides and the pre-recorded video lectures. The relevant materials for each week are indicated in the tables above. The aim of the materials is to condense the contents and direct your attention to the most relevant aspects. This should enable students to study the materials on their own and we can focus our attention during the classroom sessions on clarifying points that require further discussion.

At the end of each chapter, you will find a section with references. It is highly recommended that you consult these references for further clarification in case you require additional information on a topic.

Further readings

DSUR cover  ISL cover  R4mra cover  advr cover  tmwr cover  Rpacks cover Rpacks cover Rgraph cover

In addition to these lecture notes, there are many excellent books available (many of them for free) that focus on different aspects of R. In fact, there are so many free resources available by now that a team of R programmers has set up a website that provides an overview over the available resources by topic. You can find this overview here: Big Book of R.

In case you would like to learn more about the capabilities of R related to the contents of this course, I can particularly recommend the following books:

  • R for Data Science An excellent book by Hadley Wickham, which introduces you to R as a tool for doing data science, focusing on a consistent set of packages known as the tidyverse. [FREE online version]

  • An Introduction to Statistical Learning This book provides an introduction to statistical learning methods and covers basic methods (e.g., linear regression) as well as more advanced methods (e.g., Support Vector Machines). [FREE online version]

  • R for Marketing Research and Analytics A great book that is designed to teach R to marketing practitioners and data scientists.

  • Statistical Inference via Data Science Another great book covering topics around Statistical Inference. [FREE online version]

  • Text Mining with R This book explains how you can analyze unstructured data (texts) using R. [FREE online version]

  • Advanced R Another great book written by Hadley Wickham. Explains more advanced R concepts. [FREE online version]

  • Hands-On Machine Learning with R A great reference to learn about machine learning methods in R. The book favors a hands-on approach, growing an intuitive understanding of machine learning through concrete examples and little bit of theory.[FREE online version]

  • Hands-On Data Science for Marketing Another good reference regarding Data Science for Marketing. [FREE Code exercises]

  • R Markdown A great book about the reporting format ‘R Markdown’, which we will also use for the assignments in this course. [FREE Code exercises]

  • R Packages A book which teaches you how to make the most of R’s fantastic package system. [FREE online version]

  • R Graphics Cookbook A practical guide that provides more than 150 recipes to help you generate high-quality graphs quickly. [FREE online version]

  • Using R For Introductory Econometrics This book covers a nice introduction to R with a focus on the implementation of standard tools and methods used in econometrics. [FREE online version]

  • Data Science in a Box Another book covering topics around Data Science using R. [FREE online version]

  • Efficient R Programming A good reference to learn efficient workflows using R. [FREE online version]

  • Discovering Statistics Using R (Field, A., Miles, J., & Field Zoe, 2012, 1st Edtn.) This textbook offers an accessible and comprehensive introduction to statistics.

Discussion forum

We strongly encourage you to ask your questions via the online forum on the course page on the WU learning platform. The purpose of the forum is to allow you to discuss questions related to the contents with your class mates and us. Please make use of this forum as much as possible and ask questions if something remained unclear. Remember that there are no stupid questions! And if you know the answer to a question that is asked in the forum, it is also a good exercise to explain the concepts to your classmates.

DataCamp

Please also make use of the abundance of web resources. For students who would like to further train the materials covered in class, we recommend DataCamp, an online platform that offers interactive courses in data science at different levels. To facilitate the learning process you will obtain full access to the entire DataCamp course curriculum for the duration of the course. You will receive and invitation via your WU student email address.

Other web-resources

Contact

We are happy to answer your questions, so feel free to send us a short email - either to Daniel Winkler (daniel.winkler@wu.ac.at), Daria Yudaeva (daria.yudaeva@wu.ac.at), or myself (nils.wloemert@wu.ac.at). We will be available during the weekly sessions to clarify your questions. Daniel will be your point of contact for questions regarding the group project and Daria will be your point of contact for the individual assignments. However, please note that before you contact us, you should try to solve problems on your own first (e.g., by using the online tutorial, doing research online, asking questions via the online forum, or discussion with your class mates).

Acknowledgements

This tutorial is supported through Digital Learning Project Funding by WU Vienna. None of the materials covered in this tutorial are new. We intend to provide a summary of existing methods from a marketing research perspective and cite the corresponding sources. If you should have any comments or suggestions, please contact us through the github page of this course.

## Warning: package 'knitr' was built under R version 4.2.3