Decline of Pearson’s r with categorization of variables: a large-scale simulation

Takahiro Onoshima, Kenpei Shiina, Takashi Ueda, Saori Kubo

研究成果: ジャーナルへの寄稿学術論文査読

1 被引用数 (Scopus)

抄録

It is often said that correlation coefficients computed from categorical variables are biased and thus should not be used. However, practitioners often ignore this longstanding caveat from statisticians. Although some studies have examined the bias, the true extent is still unknown. This study is an extensive attempt to determine the range and degree of the biases. In our simulation, continuous variables were categorized according to various thresholds and used to compute Pearson’s r. The results indicated that there were more serious biases than highlighted in previous studies. The results also revealed that increasing data size did not reduce the biases. Possible ways to cope with the biases are discussed.

本文言語英語
ページ(範囲)389-399
ページ数11
ジャーナルBehaviormetrika
46
2
DOI
出版ステータス出版済み - 2019 10月 1

フィンガープリント

「Decline of Pearson’s r with categorization of variables: a large-scale simulation」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル