class: center, middle, inverse, title-slide .title[ # Introduction to Multilevel Analysis ] .subtitle[ ## PSYC 575 ] .author[ ### Hok Chio (Mark) Lai ] .institute[ ### University of Southern California ] .date[ ### 2020/07/30 (updated: 2022-08-19) ] --- background-image: url(img/hierarchical_structure.png) ??? Hi everyone, welcome to PSYC 575, multilevel modeling. In the first week, you'll be hearing from me a quick introduction to multilevel analysis, and some basic concepts. Some of the concepts will be further elaborated on in future lectures, so don't worry if some of the ideas may not be very intuitive to you---we'll come back to them. So what is multilevel analysis? Multilevel analysis is a statistical analysis to analyze data that have some form of a hierarchical structure. A classic example is shown in this picture. Imagine you're collecting student data in Los Angeles, from multiple schools, and multiple school districts. Now, it is very likely that different schools will have different student demographics, so for example, these students here may be quite different from, say, these students here from another school. Now, at the school level, we may have these two schools from one school district, and these two other schools from another school district. Again, with the different geographic locations and policies of the school districts, these two clusters of schools may be different. Traditional regression analysis is not suitable for analyzing data with this kind of hierarchical structure, because it violates the "independence observation" assumption, which we will further discuss in week 2. Multilevel modeling, on the other hand, is well suited to analyze such data. --- # Learning Objectives ### Other names used for MLM ### Types of data structure MLM handles ??? In the next few videos, I'm going to talk about the history of multilevel modeling, abbreviated as MLM, and why it leads to some confusing names in the literature. I'm also going to provide more examples on the types of data MLM can handle. --- class: inverse, center, middle # Naming and History of MLM ??? The title of this course is multilevel modeling, but you may sometimes hear people referring to this kind of analysis by a different name. --- # Alternative names for MLM: - Hierarchical linear model: Education/Sociology/Psychology <sup>1</up> - Variance component model: Statistics <sup>2</up> .footnote[ [1]: [Raudenbush & Bryk, 1986](https://doi.org/10.2307/2112482) [2]: [Aitkin et al., 1981](https://doi.org/10.2307/2981826) ] -- - Mixed model/mixed-effect model: Statistics, biomedical - Random coefficient model: Econometrics ??? The reason is that there are several different names. This is not uncommon in statistics; indeed, some have made the joke that the more names are given to a concept, the more important the concept is, so this really says that multilevel modeling is a very important technique. But really, the different naming is mainly because people in different areas of research call them differently. We've seen HLM, which is commonly used in education, sociology, and psychology. The variance component model is used in statistics. Another very popular name for multilevel modeling in statistics and biomedical research is mixed model or mixed-effect model; indeed, this is how multilevel modeling is called in many software, including R, SAS, SPSS, and Stata. Finally, you may also see the term random coefficient model in economics and econometrics. In this course, we're going to use the term multilevel modeling, which is easier to understand and is also becoming the standard name in social and behavioral sciences. However, do realize that some other researchers may call it differently even though they are actually using the same technique. --- # History of MLM MLM naturally handles data coming from different "levels" -- Robinson (1950)<sup>1</sup> - State level: Correlation between % illiterate and % foreign born: * `\(r = -.53\)` - Individual level: Within-state correlation between illiteracy and foregin born status: * `\(r = .12\)` [1]: p. 354, https://doi.org/10.2307/2087176 ??? MLM is a technique that naturally handles data coming from different "levels". And it is important to know what the level is when interpreting results. One of an early example of analyzing data across different levels come from Robinson (1950), who used data from the 1930 Census of the United States. The figure here shows the relation between illiteracy, in terms of American English, at the state level. Each point here represents one of the 48 states, as in 1930 Hawaii and Alaska are not part of the union. As you can see, there seems to be a negative association, in that a state with more immigrants would actually have higher literacy in American English, which would seem counterintuitive. I suggest you pause the video to think about why this is the case. What's interesting here is that when you talk about associations between variables, they can be very different depending on what levels you're looking at. At the state level, we're talking about proportion of people who are illiterate and the proportion of people who are foreign-born. And the correlation of that, based on Robinson's account, was -.53. However, we may also be talking about whether an immigrant is more or less likely to be illiterate, in which case we're talking about a person-level association. In this case, we need to look at the correlation within a state. Robinson tells us that the correlation is positive, meaning that an immigrant is more likely to be illiterate. You will see many more examples whether the correlations are quite different across levels of analysis. These observations are part of what motivates people to develop methods to analyze such data. --- class: inverse, middle, center # Usage of MLM --- # Hierarchical Data Structure .pull-left[ Multiple units at a lower level nested within a unit at a higher level Level 1 | Level 2 --------| ------- Clients | Therapist Classrooms | School Employees | Organization People | Family Citizens | Country Measurements | Person ] ??? As we have seen earlier, a hierarchical structure is one where multiple lower-level units are nested within a higher level unit. There are plenty of examples in social and behavioral sciences, and here are some examples. Notice that by convention, we call the units at the lowest level as level 1. Here are some other examples. Note that although the lower level is usually persons, sometimes it can be things like classrooms nested within multiple schools. And the last one here is one where a person gives multiple measurement, which essentially covers all within-subject experiments, and longitudinal studies. -- .pull-right[ ## Network Graph
] ??? A useful way to represent hierarchical data structures is to use what is called a network graph. Here there are eight lower level units and three higher level units, and the lines show which lower level units are related to which upper level units, like here A is related to unit 1. I encourage you take a moment to convince yourself that all the examples in the table on the left can be represented by a network graph like the one here. --- class: middle, center <a title="Salvor / CC0" href="https://commons.wikimedia.org/wiki/File:Vistfr%C3%A6%C3%B0ikenning.svg"><img width="512" alt="Vistfræðikenning" src="https://upload.wikimedia.org/wikipedia/commons/thumb/f/f0/Vistfr%C3%A6%C3%B0ikenning.svg/512px-Vistfr%C3%A6%C3%B0ikenning.svg.png"></a> ??? One way to think about MLM is to consider Urie Bronfenbrenner's ecological systems theory. It is a statistical technique that is geared to study how contexts, be it families, institutions, and culture, influences individuals, and also brings in the time perspective. --- class: center, middle # Once you know that hierarchies exist, you see them everywhere. [Kreft & de Leeuw (1998, p. 1)](https://dx.doi.org/10.4135/9781849209366) .footnote[ Kreft, I. G., & de Leeuw, J. (1998). _Introducing multilevel modeling_. Sage. https://dx.doi.org/10.4135/9781849209366 ] ??? Because individuals are always situated in some contexts, the use of MLM is extremely common. To use the quote by Kreft and de Leeuw's classic textbook on MLM, "Once you know that hierarchies exist, you see them everywhere." --- class: middle, center # When it comes to regression, multilevel regression deserves to be the default approach [McElreath (2020, p. 400)](https://doi.org/10.1201/9780429029608) .footnote[ McElreath, R. (2020). _Statistical rethinking: A Bayesian course with examples with R and Stan (2nd ed.)_. Chapman & Hall/CRC. https://doi.org/10.1201/9780429029608 ] ??? Because of how common it is used, there are also scholars that propose that multilevel regression should be the default method for statistical analyses. --- # Sampling the Literature PsycINFO, 2022 May-Aug, with keywords ``` "multilevel model*" OR "mixed model*" OR "mixed-effect model*" OR "hierarchical linear model*" ``` 266 results .center[ ![](img/PsycINFO_search_2022.png) ] ??? To show the broad range of studies that used MLM, I quickly searched PsycINFO on studies in the past three months. As you can see, it has been used in many different areas of psychological research. --- Recent literature in 2022 class: middle - A lasso and a regression tree mixed-effect model with random effects for the level, the residual variance, and the autocorrelation - Crossover of resources within formal ties: How job seekers acquire psychological capital from employment counselors - On the basis of source: Impacts of individual differences on multiple-document integrated reading and writing tasks - The tone atlas of perceptual discriminability and perceptual distance: Four tone languages and five language groups - Translational research on caregiver reading and playing behaviors: Evidence from an in vivo community-based intervention throughout the covid-19 pandemic - How does goal orientation fuel hotel employees' innovative behaviors? A cross-level investigation - Effect of exposure to maternal diabetes during pregnancy on offspring's brain cortical thickness and neurocognitive functioning ??? Here are some of the titles, some of which may be relevant to your research. Indeed, you're encouraged to check out some articles in your area of research that use MLM, and you'll be doing exactly that in Homework 1. --- # Software .pull-left[ ## General stat software - R (**`lme4`**, **`brms`**, `nlme`) - SAS - SPSS - Stata ] .pull-right[ ## Specialized software - HLM - MLwiN - Mplus ] ??? The last two pages listed some software and resources that are useful for doing MLM. I encourage you to check out those resources. --- # Resources - Centre for Multilevel Modeling, University of Bristol http://www.bristol.ac.uk/cmm/learning/multilevel-models/ - Textbook examples (with syntax for different software), Institute for Digital Research and Education, UCLA https://stats.idre.ucla.edu/other/examples/ - GLMM FAQ by Ben Bolker https://bbolker.github.io/mixedmodels-misc/glmmFAQ.html - Curran-Bauer Analytics, Multilevel Modeling Archives https://curranbauer.org/category/news-and-updates/multilevel-modeling/ - Multilevel Modeling Discussion List (Listserv) https://www.jiscmail.ac.uk/cgi-bin/webadmin?A0=multilevel