抄録
It is often said that correlation coefficients computed from categorical variables are biased and thus should not be used. However, practitioners often ignore this longstanding caveat from statisticians. Although some studies have examined the bias, the true extent is still unknown. This study is an extensive attempt to determine the range and degree of the biases. In our simulation, continuous variables were categorized according to various thresholds and used to compute Pearson’s r. The results indicated that there were more serious biases than highlighted in previous studies. The results also revealed that increasing data size did not reduce the biases. Possible ways to cope with the biases are discussed.
本文言語 | 英語 |
---|---|
ページ(範囲) | 389-399 |
ページ数 | 11 |
ジャーナル | Behaviormetrika |
巻 | 46 |
号 | 2 |
DOI | |
出版ステータス | 出版済み - 2019 10月 1 |