Math 496: Post-Selection Inference
Spring 2020

Instructor: Todd Kuffner

Lecture: MWF 4:00-4:50pm

Course Description: Model selection is ubiquitous in modern statistical applications. When the model is chosen after viewing the data, classical procedures for statistical inference are no longer valid. The study of post-selection inference in the context of inference for linear regression coefficients after variable selection is one of the most popular and important topics in statistics today. In this course, we will explain the sources of the problem, discuss the different perspectives on what are the inferential targets and goals, and present cutting-edge solutions to the problem of post-selection inference. Paradigms to be studied include high-dimensional or post-regularization inference, simultaneous inference intended to control familywise error rates, and selective inference to control false discovery rates for selected parameters. The material will be taught at the level of advanced undergraduates, and is also suitable for graduate students having the necessary background.

Prerequisite: Math 493, Math 494, Math 439, and experience using R.

Textbook: There are no reference books on this topic, as it is very new. Students will be required to read articles published in peer-reviewed journals and/or on the arXiv. Lectures will fill in gaps in students' background knowledge.

Course Topics: Here is a list of potential topics. We will cover some subset of these, depending on time and background of the students:

Important Dates and Course Schedule:   Details will be posted on Canvas. I will probably update the table below later in the semester to detail what was covered for future reference.

Jan. 13
First day of classes
Jan. 20
No class (Martin Luther King Holiday)
Jan. 23
Last day to drop/add
March 9-13
No classes (Spring Break)
April 24
Last day of classes


Course Policies and Grades

Canvas: During the semester, all course-related materials and announcements will be posted to Canvas and/or sent by email to registered students.

Grades: Homework 35%, Paper Discussion 20%, Participation 15%, Final Project & Group Presentation 30%

Homework:
Roughly 1 homework for every 5-6 lectures. You may discuss problems with other students, but the solutions you submit must be entirely your own work. Explanations detailing the steps of proofs or other mathematical arguments are required for full credit. You are encouraged, but not required, to write your solutions in TeX/LaTeX, and submit the printed version. I will drop the lowest homework grade under the condition that you have submitted all homeworks and genuinely attempted all of the problems; I will not drop the lowest homework grade if you did not do this.

Homework assignments may include the following tasks: (i) data analysis and implementation of post-selection inference procedures in R (using the newest packages); (ii) designing simulation experiments and writing R code; (iii) mathematical derivations; (iv) reproducing simulations or analyses in academic papers; (v) writing critical analyses of studies published in applied journals; (vi) critical analysis of historical statistical literature.

Paper Discussion: For many of the academic papers that we examine during lectures, we will hold discussions during lecture. For each of these papers, I will ask two or three students (separately) to prepare brief comments and questions (amounting to about 3-5 minutes of speaking) to facilitate the class discussions; it's best if these students do not talk about the paper beforehand, to maximize the number of unique points of view. Each student will be asked to do this for 2 or 3 papers, depending on the actual pace of the course.

Participation: Attendance and participation are required for all lectures. Attendance is not enough. Participation includes: (i) reading the relevant paper or background material before lecture, and bringing it with you for reference; (ii) answering questions that I ask the class, and participating in the class discussions; (iii) providing a summary, definition, or result from the previous lecture when I ask you to.

Final Project & Group Presentation: Groups will be assigned after the drop/add deadline. It's a good idea to start on this project early in the semester, though it cannot be completed until late in the semester.

Final Course Grade: The letter grades for the course will be determined according to the following numerical grades on a 0-100 scale.
A+
impress me
B+
[87, 90)
C+
[77, 80)
D+
[67, 70)
F
[0,60)
A
93+
B
[83, 87)
C
[73, 77)
D
[63, 67)


A-
[90, 93)
B-
[80, 83)
C-
[70, 73)
D-
[60, 63)




Other Course Policies: Students are encouraged to look at the Faculty of Arts & Sciences policies.