TY - JOUR
T1 - Generating Multivariate Ordinal Data via Entropy Principles
AU - Lee, Yen
AU - Kaplan, David
N1 - Publisher Copyright:
© 2018, The Psychometric Society.
PY - 2018/3/1
Y1 - 2018/3/1
N2 - When conducting robustness research where the focus of attention is on the impact of non-normality, the marginal skewness and kurtosis are often used to set the degree of non-normality. Monte Carlo methods are commonly applied to conduct this type of research by simulating data from distributions with skewness and kurtosis constrained to pre-specified values. Although several procedures have been proposed to simulate data from distributions with these constraints, no corresponding procedures have been applied for discrete distributions. In this paper, we present two procedures based on the principles of maximum entropy and minimum cross-entropy to estimate the multivariate observed ordinal distributions with constraints on skewness and kurtosis. For these procedures, the correlation matrix of the observed variables is not specified but depends on the relationships between the latent response variables. With the estimated distributions, researchers can study robustness not only focusing on the levels of non-normality but also on the variations in the distribution shapes. A simulation study demonstrates that these procedures yield excellent agreement between specified parameters and those of estimated distributions. A robustness study concerning the effect of distribution shape in the context of confirmatory factor analysis shows that shape can affect the robust χ2 and robust fit indices, especially when the sample size is small, the data are severely non-normal, and the fitted model is complex.
AB - When conducting robustness research where the focus of attention is on the impact of non-normality, the marginal skewness and kurtosis are often used to set the degree of non-normality. Monte Carlo methods are commonly applied to conduct this type of research by simulating data from distributions with skewness and kurtosis constrained to pre-specified values. Although several procedures have been proposed to simulate data from distributions with these constraints, no corresponding procedures have been applied for discrete distributions. In this paper, we present two procedures based on the principles of maximum entropy and minimum cross-entropy to estimate the multivariate observed ordinal distributions with constraints on skewness and kurtosis. For these procedures, the correlation matrix of the observed variables is not specified but depends on the relationships between the latent response variables. With the estimated distributions, researchers can study robustness not only focusing on the levels of non-normality but also on the variations in the distribution shapes. A simulation study demonstrates that these procedures yield excellent agreement between specified parameters and those of estimated distributions. A robustness study concerning the effect of distribution shape in the context of confirmatory factor analysis shows that shape can affect the robust χ2 and robust fit indices, especially when the sample size is small, the data are severely non-normal, and the fitted model is complex.
KW - Discrete data
KW - Entropy
KW - Non-normal data generation
UR - http://www.scopus.com/inward/record.url?scp=85040776200&partnerID=8YFLogxK
U2 - 10.1007/s11336-018-9603-3
DO - 10.1007/s11336-018-9603-3
M3 - Article
C2 - 29359242
AN - SCOPUS:85040776200
SN - 0033-3123
VL - 83
SP - 156
EP - 181
JO - Psychometrika
JF - Psychometrika
IS - 1
ER -