Regression Modeling for Linguistic Data

by Sonderegger

ISBN: 9780262375030 | Copyright 2023

Click here to preview

Instructor Requests

Digital Exam/Desk Copy Print Desk Copy Ancillaries
Tabs

The first comprehensive textbook on regression modeling for linguistic data offers an incisive conceptual overview along with worked examples that teach practical skills for realistic data analysis.

In the first comprehensive textbook on regression modeling for linguistic data in a frequentist framework, Morgan Sonderegger provides graduate students and researchers with an incisive conceptual overview along with worked examples that teach practical skills for realistic data analysis. The book features extensive treatment of mixed-effects regression models, the most widely used statistical method for analyzing linguistic data.

Sonderegger begins with preliminaries to regression modeling: assumptions, inferential statistics, hypothesis testing, power, and other errors. He then covers regression models for non-clustered data: linear regression, model selection and validation, logistic regression, and applied topics such as contrast coding and nonlinear effects. The last three chapters discuss regression models for clustered data: linear and logistic mixed-effects models as well as model predictions, convergence, and model selection. The book's focused scope and practical emphasis will equip readers to implement these methods and understand how they are used in current work.

•The only advanced discussion of modeling for linguists
•Uses R throughout, in practical examples using real datasets
•Extensive treatment of mixed-effects regression models
•Contains detailed, clear guidance on reporting models
•Equal emphasis on observational data and data from controlled experiments
•Suitable for graduate students and researchers with computational interests across linguistics and cognitive science

Expand/Collapse All
Contents (pg. vii)
Preface (pg. xi)
1. Preliminaries (pg. 1)
1.1 Our R Toolset (pg. 1)
1.2 Our Approach (pg. 1)
1.3 Context (pg. 3)
2. Samples, Estimates, and Hypothesis Tests (pg. 7)
2.1 Preliminaries (pg. 7)
2.2 Point Estimation (pg. 9)
2.3 Uncertainty and Interval Estimation (pg. 14)
2.4 Hypothesis Testing (pg. 19)
2.5 Parametric and Nonparametric Tests (pg. 27)
2.6 Common Misconceptions about p-Values (pg. 33)
2.7 Reporting Hypothesis Tests (pg. 34)
2.8 Further Reading (pg. 36)
Exercises (pg. 37)
3. Effect Size, Power, and Error (pg. 39)
3.1 Preliminaries (pg. 39)
3.2 Effect Size (pg. 40)
3.3 Type I/II Errors and Power (pg. 47)
3.4 Error in Effect Size: Type M and Type S Errors (pg. 59)
3.5 Assumptions of Hypothesis Tests and Consequences (pg. 63)
3.6 Further Reading (pg. 68)
Exercises (pg. 68)
4. Linear Regression 1 (pg. 69)
4.1 Preliminaries (pg. 69)
4.2 Regression: General Introduction (pg. 70)
4.3 Simple Linear Regression (pg. 73)
4.4 Multiple Linear Regression (pg. 81)
4.5 Interactions (pg. 85)
4.6 Reporting a Linear Regression Model (pg. 91)
4.7 Further Reading (pg. 93)
Exercises (pg. 93)
5. Linear Regression 2 (pg. 95)
5.1 Preliminaries (pg. 95)
5.2 Linear Regression Assumptions (pg. 96)
5.3 Problems with the Errors (pg. 98)
5.4 Problems with the Model (pg. 102)
5.5 Transformations (pg. 106)
5.6 Problems with Predictors (pg. 113)
5.7 Problems with Observations (pg. 121)
5.8 Trade-Offs between Models (pg. 126)
5.9 Model Comparison (pg. 130)
5.10 Variable Selection (pg. 136)
5.11 Further Reading (pg. 145)
Exercises (pg. 146)
6. Categorical Data Analysis and Logistic Regression (pg. 147)
6.1 Preliminaries (pg. 147)
6.2 Categorical Data Analysis (pg. 149)
6.3 Odds and Odds Ratios (pg. 154)
6.4 Simple Logistic Regression (pg. 158)
6.5 Inference for Logistic Regression (pg. 163)
6.6 Goodness of Fit (pg. 166)
6.7 Multiple Logistic Regression (pg. 168)
6.8 Model Validation (pg. 177)
6.9 Reporting and Summarizing (pg. 184)
6.10 Further Reading (pg. 188)
Exercises (pg. 188)
7. Practical Regression Topics (pg. 191)
7.1 Preliminaries (pg. 191)
7.2 Multilevel Factors: Contrast Coding (pg. 193)
7.3 Omnibus and Post Hoc Tests (pg. 212)
7.4 Interpreting Interactions (pg. 218)
7.5 Nonlinear Effects (pg. 227)
7.6 Collinearity Diagnostics Revisited (pg. 238)
7.7 Further Reading (pg. 239)
Exercises (pg. 239)
8. Mixed-Effects Models 1: Linear Regression (pg. 241)
8.1 Preliminaries (pg. 241)
8.2 Motivation: Grouped Data (pg. 242)
8.3 Linear Mixed Models: Introduction (pg. 244)
8.4 Random Slopes (pg. 256)
8.5 Hypothesis Tests (pg. 267)
8.6 Model Summaries (pg. 278)
8.7 Random-Effect Correlations (pg. 280)
8.8 Model Predictions (pg. 291)
8.9 Reporting the Fitted Model (pg. 297)
8.10 Model Validation (pg. 298)
8.11 Further Reading (pg. 309)
Exercises (pg. 310)
9. Mixed-Effects Models 2: Logistic Regression (pg. 313)
9.1 Preliminaries (pg. 313)
9.2 Introduction (pg. 315)
9.3 Two Grouping Factors, Random Intercepts and Slopes (pg. 321)
9.4 Hypothesis Tests (pg. 327)
9.5 Model Summaries (pg. 332)
9.6 Model Validation (pg. 337)
9.7 Nonlinear and Factor Effects (pg. 340)
9.8 Variable Importance (pg. 350)
9.9 Reporting a Mixed-Effects Logistic Regression (pg. 354)
9.10 Further Reading (pg. 354)
Exercises (pg. 355)
10. Mixed-Effects Models 3: Practical and Advanced Topics (pg. 357)
10.1 Preliminaries (pg. 357)
10.2 More on Random Effects (pg. 360)
10.3 Model Convergence (pg. 364)
10.4 Singular Models (pg. 374)
10.5 Model Selection (pg. 379)
10.6 Predictions and Uncertainty for Individual Levels (pg. 395)
10.7 Nonlinear Effects (pg. 403)
10.8 Power for Mixed-Effects Models (pg. 405)
10.9 Further Reading (pg. 407)
Exercises (pg. 407)
A. Appendix: Datasets (pg. 409)
A.1 transitions (pg. 409)
A.2 vot (pg. 409)
B. Appendix: R Packages (pg. 411)
Selected Abbreviations (pg. 413)
Bibliography (pg. 415)
Topic Index (pg. 427)
Function Index (pg. 437)

Morgan Sonderegger

Morgan Sonderegger is Associate Professor of Linguistics at McGill University.

eTextbook
Go paperless today! Available online anytime, nothing to download or install.

Features

  • Bookmarking
  • Note taking
  • Highlighting