As part of the eNAEP digital assessment delivery system, process data are collected. Process data represent student interactions with the assessment platform and assessment tasks; the data consist of time-stamped records of student actions or activities (e.g., highlighter use and answer selection) as well as automatically generated actions (e.g., switching to the next section due to time out). Process data have the potential to provide valuable insights into students’ response processes and behaviors as well as aid in evaluating how students interact with assessment items, system functions, and the exam delivery platform. Process data can be used as the primary data source or the auxiliary data source on a wide range of projects.
One potential research topic may involve investigating student writing behaviors in open-ended items. Writing aptitude and behaviors can be captured via certain linguistic features, such as students’ language use, mechanics, lexical choices, style, etc. Process data provide an interesting overview into how students build their responses. Mining process data for such features would allow us to better investigate student writing behaviors and their relationship with performance.
Methods of analysis will include natural language processing (NLP) and other computational linguistic methods. We expect the candidate to have strong statistical skills, strong programming skills in Python and/or R, some knowledge of linguistics, and the ability to work effectively with large-scale and complex data.
Selected process data projects from prior interns include:
- Analyzing response changes in constructed response items in NAEP mathematics grade 8 2022 process data
- Understanding students’ problem-solving processes via action sequence analyses
- Item profiling: Insights into characteristics and design of each item
- A social network analysis of answer change behavior using NAEP process data
- Insights from students’ computations using the calculator
- A comparison of student disengagement across years
- Exploration of whether items can be reconstructed from the process data