References

American Institutes for Research, & Cohen, J. (n.d.). AM. https://am.air.org/help/JSTree/MainFrame.asp
Benjamini, Y., & Hochberg, Y. (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society, Series B (Methodological), 57(1), 289–300. https://doi.org/10.1111/j.2517-6161.1995.tb02031.x
Binder, D. A. (1983). On the variances of asymptotically normal estimators from complex surveys. International Statistical Review/Revue Internationale de Statistique, 51(3), 279–292.
Caro, D. H., & Biecek, P. (2017). intsvy: An R package for analyzing international large-scale assessment data. Journal of Statistical Software, 81(7), 1–44. https://doi.org/10.18637/jss.v081.i07
Cohen, J. D., & Jiang, T. (1999). Comparison of partially measured latent traits across nominal subgroups. Journal of the American Statistical Association, 94(448), 1035–1044. http://www.jstor.org/stable/2669917
National Assessment of Educational Progress. (2018). NAEP technical documentation on the web. National Center for Education Statistics. https://nces.ed.gov/nationsreportcard/tdw/
Johnson, E. G., & Rust, K. F. (1992). Population inferences and variance estimation for NAEP data. Journal of Educational Statistics, 17(2), 175–190.
Judkins, D. R. (1990). Fay’s method for variance estimation. Journal of Official Statistics, 6(3), 223–239.
Korn, E. L., & Graubard, B. I. (1990). Simultaneous testing of regression coefficients with complex survey data: Use of Bonferroni t statistics. The American Statistician, 44(4), 270–276.
LaRoche, S., Joncas, M., & Foy, P. (2016). Sample design in TIMSS 2015. In M. O. Martin, I. V. S. Mullis, & M. Hooper (Eds.), Methods and procedures in TIMSS 2015 (pp. 3.1–3.37). http://timss.bc.edu/publications/timss/2015-methods/chapter-3.html
Lumley, T. (2004). Analysis of complex survey samples. Journal of Statistical Software, 9(8), 1–19. https://doi.org/10.18637/jss.v009.i08
Manuel, R., & Peterbauer, J. (2014). A package for complex surveys including plausible values (R package version 0.1-1). https://cran.r-project.org/src/contrib/Archive/svyPVpack/
Mislevy, R. J., Beaton, A., Kaplan, B. A., & Sheehan, K. (1992). Estimating population characteristics from sparse matrix samples of item responses. Journal of Educational Measurement, 29(2), 133–161.
Mulligan, G. M., et al. (2018). Findings from the fourth-grade round of the early childhood longitudinal study, kindergarten class of 2010–11 (ECLS-K:2011): First look (NCES 2018-094). National Center for Education Statistics. https://nces.ed.gov/pubs2018/2018094.pdf
Oberski, D. (2017). lavaan.survey: An R package for complex survey analysis of structural equation models. Journal of Statistical Software, 57(1), 1–27. https://doi.org/10.18637/jss.v057.i01
OECD. (2018). PISA 2018 technical report. Organisation for Economic Co-operation and Development (OECD). https://www.oecd.org/pisa/data/pisa2018technicalreport/
R Core Team. (2016). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org/
Robitzsch, A., & Oberwimmer, K. (2019). BIFIEsurvey: Tools for survey statistics in educational assessment (R package version 3.3-12). Federal Institute for Educational Research, Innovation and Development of the Austrian School System. https://CRAN.R-project.org/package=BIFIEsurvey
Rubin, D. B. (1987). Multiple imputation for nonresponse in surveys. Wiley.
Rust, K. F., & Rao, J. N. K. (1996). Replication methods for analyzing complex survey data. Statistical Methods in Medical Research: Special Issue on the Analysis of Complex Surveys, 5, 283–310.
Rutkowski, L., Gonzalez, E., Joncas, M., & von Davier, M. (2010). International large-scale assessment data: Issues in secondary analysis and reporting. Educational Researcher, 39(2), 142–151. https://doi.org/10.3102/0013189X10363170
Satterthwaite, F. E. (1946). An approximate distribution of estimates of variance components. Biometrics Bulletin, 2(6), 110–114.
Tourangeau, K., et al. (2015a). Early childhood longitudinal study, kindergarten class of 2010–11 (ECLS-K:2011): User’s manual for the ECLS-K:2011 kindergarten data file and electronic codebook, public version (NCES 2015-074). National Center for Education Statistics. https://nces.ed.gov/pubs2015/2015074.pdf
Tourangeau, K., et al. (2015b). Early childhood longitudinal study, kindergarten class of 2010–11 (ECLS-K:2011): User’s manual for the ECLS-K:2011 kindergarten–first grade data file and electronic codebook, public version (NCES 2015-078). National Center for Education Statistics. https://nces.ed.gov/pubs2015/2015078.pdf
Tourangeau, K., et al. (2017). Early childhood longitudinal study, kindergarten class of 2010–11 (ECLS-K:2011): User’s manual for the ECLS-K:2011 kindergarten–second grade data file and electronic codebook, public version (NCES 2017-285). National Center for Education Statistics. https://nces.ed.gov/pubs2017/2017285.pdf
Tourangeau, K., et al. (2018a). Early childhood longitudinal study, kindergarten class of 2010–11 (ECLS-K:2011): User’s manual for the ECLS-K:2011 kindergarten–third grade data file and electronic codebook, public version (NCES 2018-034). National Center for Education Statistics. https://nces.ed.gov/pubs2018/2018034.pdf
Tourangeau, K., et al. (2018b). Early childhood longitudinal study, kindergarten class of 2010–11 (ECLS-K:2011): User’s manual for the ECLS-K:2011 kindergarten–fourth grade data file and electronic codebook, public version (NCES 2018-032). National Center for Education Statistics. https://nces.ed.gov/pubs2018/2018032.pdf
Tourangeau, K., et al. (2019). Early childhood longitudinal study, kindergarten class of 2010–11 (ECLS-K:2011): User’s manual for the ECLS-K:2011 kindergarten–fifth grade data file and electronic codebook, public version (NCES 2019-051). National Center for Education Statistics. https://nces.ed.gov/pubs2019/2019051.pdf
Weirich, S., Haag, N., Hecht, M., Böhme, K., Siegle, T., & Lüdtke, O. (2014). Nested multiple imputation in large-scale assessments. Large-Scale Assessments in Education, 2, 1–18.
Weisberg, S. (1985). Applied linear regression. Wiley.
Welch, B. L. (1947). The generalization of “Student’s” problem when several different population variances are involved. Biometrika, 34(1/2), 28–35.
Wolter, K. (2007). Introduction to variance estimation (2nd ed.). Springer Science & Business Media.