Data assignments
Here are the group and data set assignments (UPDATED Saturday 7 March 15h15). Each data set comes with an explanation that states which columns contain which variables and which one is the outcome variable. There is also a literature reference that you should be able to access from EPFL / vpn.epfl.edu. You can use it to guide your analyses, but you are free to do additional or different analyses if you want.
You can also use the reference paper as a guide to how you might write your report. You should include a short intro / background, including a clear statement of the problem of interest; a complete exploratory data analysis (EDA); a description of your model fitting and selection analysis; a description of your model assessment, with its justification and results; your final chosen model written in mathematical terms; relevant plots (they should be 'pretty'); and any conclusions addressing the problem of interest. You will also be evaluated on the quality of language and the overall presentation of your report.
If you hand in your report before the preliminary deadline, I will be able to give you feedback on how to improve it, which you can incorporate into your final submission.