Generating Multivariate Ordinal Data via Entropy Principles

Yen Lee*, David Kaplan

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

When conducting robustness research where the focus of attention is on the impact of non-normality, the marginal skewness and kurtosis are often used to set the degree of non-normality. Monte Carlo methods are commonly applied to conduct this type of research by simulating data from distributions with skewness and kurtosis constrained to pre-specified values. Although several procedures have been proposed to simulate data from distributions with these constraints, no corresponding procedures have been applied for discrete distributions. In this paper, we present two procedures based on the principles of maximum entropy and minimum cross-entropy to estimate the multivariate observed ordinal distributions with constraints on skewness and kurtosis. For these procedures, the correlation matrix of the observed variables is not specified but depends on the relationships between the latent response variables. With the estimated distributions, researchers can study robustness not only focusing on the levels of non-normality but also on the variations in the distribution shapes. A simulation study demonstrates that these procedures yield excellent agreement between specified parameters and those of estimated distributions. A robustness study concerning the effect of distribution shape in the context of confirmatory factor analysis shows that shape can affect the robust χ2 and robust fit indices, especially when the sample size is small, the data are severely non-normal, and the fitted model is complex.

Original languageEnglish
Pages (from-to)156-181
Number of pages26
JournalPsychometrika
Volume83
Issue number1
DOIs
StatePublished - 1 Mar 2018

Keywords

  • Discrete data
  • Entropy
  • Non-normal data generation

Cite this