--- title: "Flexible Latent Trait Metrics in Item Response Theory" author: "Leah Feuerstahler" date: "`r Sys.Date()`" output: rmarkdown::html_vignette vignette: > %\VignetteIndexEntry{Flexible Latent Trait Metrics} %\VignetteEngine{knitr::rmarkdown} %\VignetteEncoding{UTF-8} --- ```{r setup, include = FALSE} knitr::opts_chunk$set( collapse = TRUE, comment = "#>" ) library(flexmet) ``` The flexmet package provides utilities for use with the filtered monotonic polynomial (FMP) item response model. One of the unique features of the FMP model is the ability to transform the model to a user-specified metric. The FMP model can also transform the two-, three-, and four-parameter models, as well as generalized partial credit models. # The FMP Model A general form of the FMP model is specified using the composite function, \[ P(X_i = c | \theta) = \exp\left(\sum_{v=0}^c(b_{0i_{v}} + m_i(\theta))\right) / \left(\sum_{u=0}^{C_i - 1}\exp\left(\sum_{v=0}^u(b_{0i_{v}} + m_i(\theta))\right)\right) \] where $P$ indicates the probability of a response in category $c$, $c = 0, \ldots, C_i - 1$, $\theta$ is the latent trait parameter, $b_{iv}$ indicates an intercept for category $v$, $v = 1, \ldots, C_i - 1$, and $\sum_{v=0}^0(b_{iv} + m_i(\theta)) \equiv 0$. In addition, let \[ m_{i}(\theta)=b_{1i}\theta+b_{2i}\theta^{2}+\cdots+ b_{2k_{i}+1,i}\theta^{2k_{i}+1}, \] where $2k_{i}+1$ equals the order of the polynomial for item $i$, $k_{i}$ is a nonnegative integer, and $\boldsymbol{b}_{i}=(b_{0i_{1}},\ldots, b_{0i_{C_i-1}} b_{1i},\ldots,b_{2k_{i}+1,i})^{\prime}$ are item parameters that define the location and shape of the IRF. When $k = 0$, the general FMP model reduces to the two-parameter item response model (for binary item responses) or the generalized partial credit model (for polytomous item responses). For models with binary (0/1) item responses, flexmet also allows the use to include a lower asymptote parameter $c_i$ and upper asymptote parameter $d_i$ for the extended FMP model: \[ P_i(\theta)=c_i + (d_i - c_i)[1+\exp(-m_{i}(\theta))]^{-1}. \] The $c_i$ and $d_i$ parameters are unaffected by parameter transformations. # Transforming an Item Response Model Below is a worked example of how to transform a two-parameter model to the expected sum score metric. The original two-parameter metric is denoted $\theta$ and the expected sum score metric is denoted $\theta^\star$. This example uses the 23 two-parameter model low self-esteem parameter estimates reported in Table 7 of Reise & Waller (2003). First, we need to express the two-parameter model as an FMP model. The FMP model with $k=0$ is identical to the slope-intercept parameterization of the two-parameter model. The Reise & Waller parameters are expressed on the more familiar difficulty-discrimination parameterization of the FMP model. ```{r, autodep=TRUE} ## example parameters from Table 7 of Reise & Waller (2003) a <- c(0.57, 0.68, 0.76, 0.72, 0.69, 0.57, 0.53, 0.64, 0.45, 1.01, 1.05, 0.50, 0.58, 0.58, 0.60, 0.59, 1.03, 0.52, 0.59, 0.99, 0.95, 0.39, 0.50) b <- c(0.87, 1.02, 0.87, 0.81, 0.75, -0.22, 0.14, 0.56, 1.69, 0.37, 0.68, 0.56, 1.70, 1.20, 1.04, 1.69, 0.76, 1.51, 1.89, 1.77, 0.39, 0.08, 2.02) ## convert from difficulties and discriminations to FMP parameters b1 <- 1.702 * a b0 <- - 1.702 * a * b bmat <- cbind(b0, b1) ``` The transformation from $\theta$ to $\theta^\star$ is defined by the test response function, which is the sum of item response functions: \[ \theta^\star =\sum_iP_i(\theta) \] There is usually not a closed form expression for $\theta$ as a function of $\theta^\star$. In addition, to transform the FMP item parmaeters, $\theta$ must be expressed as a polynomial function of $\theta^\star$: \[ \theta = t_0 + t_1\theta^\star + t_2\theta^{\star2} + \cdots + t_{2k_\theta+1}\theta^{\star 2k_\theta+1}. \] In this example, the metric transformation is known exactly, but it can be approximated by a monotonic polynomial. To approximate the metric transformation, we can generate a large number of observations and fit a monotonic polynomial function to the simulated values. ```{r} # generate a large number of theta and TRF (thetastar) values theta <- seq(-3, 5, length = 5000) TRF <- rowSums(irf_fmp(theta = theta, b = bmat)) ``` Monotonic polynomial regression using the MonoPoly package can be used to approximate the metric transformation coefficients $\boldsymbol{t}=(t_0,t_1,\ldots,t_{2k_\theta+1})^\prime$. We can fit a sequence of $k_\theta$ values to find a good choice for the polynomial degree. ```{r} fmp0 <- MonoPoly::monpol(theta ~ TRF, K = 0) fmp1 <- MonoPoly::monpol(theta ~ TRF, K = 1) fmp2 <- MonoPoly::monpol(theta ~ TRF, K = 2) fmp3 <- MonoPoly::monpol(theta ~ TRF, K = 3) fmp4 <- MonoPoly::monpol(theta ~ TRF, K = 4) ``` Choose a "good enough" polynomial degree by looking at the residual sum of squares and by viewing patterns of residuals. ```{r} fmp0$RSS fmp1$RSS fmp2$RSS fmp3$RSS fmp4$RSS ``` ```{r, fig.height = 6, fig.width = 7} cols <- c("#E41A1C", "#377EB8", "#4DAF4A", "#984EA3", "#FF7F00") par(lwd = 2) curve(0*x, xlim = c(0, 22), ylim = c(-1, 1), col = "darkgray", xlab = "Expected Sum Score", ylab = "Residuals of Polynomial Approximation") points(TRF, residuals(fmp0), type = 'l', col = cols[1], lty = 2) points(TRF, residuals(fmp1), type = 'l', col = cols[2], lty = 3) points(TRF, residuals(fmp2), type = 'l', col = cols[3], lty = 2) points(TRF, residuals(fmp3), type = 'l', col = cols[4], lty = 1) points(TRF, residuals(fmp4), type = 'l', col = cols[5], lty = 3) legend("bottomright", legend = c(expression(paste(italic(k[theta])," = 0")), expression(paste(italic(k[theta])," = 1")), expression(paste(italic(k[theta])," = 2")), expression(paste(italic(k[theta])," = 3")), expression(paste(italic(k[theta])," = 4"))), col = cols, lty = c(2, 3, 2, 1, 3), bty = "n") ``` Suppose we choose to retain the $k_\theta = 3$ approximation. Then, the metric transformation vector equals, ```{r} (tvec <- coef(fmp3)) ``` and the transformed item parameters equal ```{r} bstarmat <- t(apply(bmat, 1, transform_b, tvec = tvec)) ## inspect transformed parameters signif(head(bstarmat), 2) ``` We can check that the transformation worked by plotting the test response function for the transformed model. If successful, this is a straight line because the latent trait $\theta^\star$ should be as close as possible to the expected sum score. ```{r, fig.height=5, fig.width=5, fig.align="center"} par(pty = "s") curve(rowSums(irf_fmp(x, bmat = bstarmat)), xlim = c(0, 23), ylim = c(0, 23), xlab = expression(paste(theta,"*")), ylab = "Expected Sum Score") abline(0, 1, col = 2) ``` The bstarmat parameters can then be used as item parameters for subsequent analyses, such as trait score estimation.