I am trying to fit a novel hierarchical linear model, but the data structure makes me wonder whether this is even possible in R. My previous attempts at the model were incorrectly specified (oops), and now I'm not sure how to proceed. My coursework in HLM covered multilevel models and cross-classified models, but not a 3-level, doubly cross-classified model.
Level 1:
- Responses to dichotomously scored items. Categorical dependent variable, so I think I'm going to be using glmer(). (~1.5 million responses to items)
Level 2:
- Responses are nested within items - An item will have many responses (from different people), but a single response will not be linked to multiple items.
- Responses are also nested within testing instances - A test instance will have many responses (50), but a response cannot link to multiple test instances.
- Items are not nested within test instances and test instances are not nested within items. An item will appear at multiple test instances (every time someone takes Form A) and a test instance will be related to multiple items (each item on the test form).
Level 3:
- Items are nested within test forms - A form can have several items on it, but (in this case) items cannot appear on multiple forms.
- Testing instances are nested within people - A person can participate in several test instances but a testing instance can't be executed by multiple people.
- Testing instances are also nested within location - A location can have several test instances there, but a test instance can't occur at multiple locations.
- Test forms are not nested within people, people are not nested within test forms. A person can take multiple test forms and a test form can be taken by multiple people.
- People are not nested within location and locations are not nested within people - A person can take a test at multiple locations and several people can take a test at a single location.
- Test forms are not nested within location and locations are not nested within test forms - A test form can be used at multiple locations; a location can be used to administer many test forms.
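To make the structure above concrete, here is a minimal base-R sketch that builds a tiny toy data set with the same layout and checks which factors are nested and which are crossed. All IDs, sizes, and the helper `is_nested()` are hypothetical and only for illustration; they are not my real data.

```r
# Hypothetical toy design: 6 test instances, 4 people, 2 locations, 2 forms
instances <- data.frame(
  instance = paste0("T", 1:6),
  person   = paste0("P", c(1, 1, 2, 3, 3, 4)),
  location = paste0("L", c(1, 2, 2, 1, 1, 2)),
  form     = c("A", "B", "A", "B", "A", "B")
)

# Each item belongs to exactly one form (3 items per form)
items <- data.frame(
  item = paste0("I", 1:6),
  form = rep(c("A", "B"), each = 3)
)

# One row per response: every instance answers every item on its form
responses <- merge(instances, items, by = "form")

# A child factor is nested in a parent factor if every child level
# co-occurs with exactly one parent level
is_nested <- function(child, parent) {
  all(tapply(parent, child, function(p) length(unique(p))) == 1)
}

# Nested relationships described above
is_nested(responses$item, responses$form)          # TRUE
is_nested(responses$instance, responses$person)    # TRUE
is_nested(responses$instance, responses$location)  # TRUE

# Crossed relationships described above
is_nested(responses$item, responses$instance)      # FALSE
is_nested(responses$person, responses$location)    # FALSE
is_nested(responses$form, responses$location)      # FALSE
```

Running the same kind of check on the real ID columns would confirm whether the nesting I describe actually holds in the data before any model is fit.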
I hypothesize that some location variables may have an impact on performance on particular items, but I think that will be moderated by things like the ability of the person taking the test. I have explanatory variables at the location, student, and item levels that I'm interested in exploring, like noise level, GPA, and subject matter.
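For reference, this is roughly the kind of specification I have in mind, written as a sketch on simulated data rather than as a known-correct model. It assumes the lme4 package, uniquely coded IDs (so that nesting is implied by the data rather than by the formula), and made-up variable names (`correct`, `noise`, `gpa`); the effect sizes in the simulation are arbitrary.

```r
library(lme4)

set.seed(42)
n_person <- 60; n_loc <- 8; n_item <- 20

# Hypothetical level-3 tables with one covariate each and a true random effect
person   <- data.frame(person = factor(seq_len(n_person)),
                       gpa = rnorm(n_person), u_person = rnorm(n_person))
location <- data.frame(location = factor(seq_len(n_loc)),
                       noise = rnorm(n_loc))
item     <- data.frame(item = factor(seq_len(n_item)),
                       form = factor(rep(c("A", "B"), each = n_item / 2)),
                       u_item = rnorm(n_item))

# Two test instances per person, each at a random location with a random form
inst <- data.frame(
  instance = factor(seq_len(2 * n_person)),
  person   = rep(person$person, each = 2),
  location = factor(sample(n_loc, 2 * n_person, replace = TRUE),
                    levels = seq_len(n_loc)),
  form     = factor(sample(c("A", "B"), 2 * n_person, replace = TRUE),
                    levels = c("A", "B"))
)

# One row per response (instance x item on that instance's form)
d <- merge(inst, item, by = "form")
d <- merge(d, person, by = "person")
d <- merge(d, location, by = "location")

# Simulated dichotomous outcome with a noise-by-GPA interaction
eta <- 0.5 * d$gpa - 0.3 * d$noise + 0.2 * d$gpa * d$noise +
  d$u_person + d$u_item
d$correct <- rbinom(nrow(d), 1, plogis(eta))

# Crossed random intercepts; noise * gpa is the moderation hypothesis.
# With only a few forms, form is probably better as a fixed effect.
# A by-item slope, (1 + noise | item), would let the noise effect vary by item.
fit <- glmer(
  correct ~ noise * gpa + form +
    (1 | person) + (1 | instance) + (1 | location) + (1 | item),
  data = d, family = binomial,
  nAGQ = 0,  # faster, less accurate approximation; may matter at ~1.5M rows
  control = glmerControl(optimizer = "bobyqa")
)
summary(fit)
```

Because every instance, person, location, and item has a unique ID, lme4 works out the nested and crossed structure from the data itself, so the formula lists each grouping factor as a separate `(1 | ...)` term. Whether this is the right way to express the design is exactly what I am unsure about.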
Please let me know if you have any questions or suggestions.