TY - JOUR
T1 - Inference in Sparsity-Induced Weak Factor Models
AU - Uematsu, Yoshimasa
AU - Yamagata, Takashi
N1 - Funding Information:
This work was supported by JSPS KAKENHI (grant numbers 18K01545, 19K13665, 20H01484, and 20H05631). An earlier version of this article was circulated under the title, “Inference in weak factor models”. The authors thank the editor Jianqing Fan, the associate editor, and three anonymous referees for their valuable comments and suggestions. The authors appreciate Kun Chen giving helpful comments and modification of the R package, rrpack.
Publisher Copyright:
© 2021 American Statistical Association.
PY - 2022
Y1 - 2022
N2 - In this article, we consider statistical inference for high-dimensional approximate factor models. We posit a weak factor structure, in which the factor loading matrix can be sparse and the signal eigenvalues may diverge more slowly than the cross-sectional dimension, N. We propose a novel inferential procedure to decide whether each component of the factor loadings is zero or not, and prove that this controls the false discovery rate (FDR) below a preassigned level, while the power tends to unity. This “factor selection” procedure is primarily based on a debiased version of the sparse orthogonal factor regression (SOFAR) estimator; but is also applicable to the principal component (PC) estimator. After the factor selection, the resparsified SOFAR and sparsified PC estimators are proposed and their consistency is established. Finite sample evidence supports the theoretical results. We apply our method to the FRED-MD dataset of macroeconomic variables and the monthly firm-level excess returns which constitute the S&P 500 index. The results give very strong statistical evidence of sparse factor loadings under the identification restrictions and exhibit clear associations of factors and categories of the variables. Furthermore, our method uncovers a very weak but statistically significant factor in the residuals of Fama-French five factor regression.
AB - In this article, we consider statistical inference for high-dimensional approximate factor models. We posit a weak factor structure, in which the factor loading matrix can be sparse and the signal eigenvalues may diverge more slowly than the cross-sectional dimension, N. We propose a novel inferential procedure to decide whether each component of the factor loadings is zero or not, and prove that this controls the false discovery rate (FDR) below a preassigned level, while the power tends to unity. This “factor selection” procedure is primarily based on a debiased version of the sparse orthogonal factor regression (SOFAR) estimator; but is also applicable to the principal component (PC) estimator. After the factor selection, the resparsified SOFAR and sparsified PC estimators are proposed and their consistency is established. Finite sample evidence supports the theoretical results. We apply our method to the FRED-MD dataset of macroeconomic variables and the monthly firm-level excess returns which constitute the S&P 500 index. The results give very strong statistical evidence of sparse factor loadings under the identification restrictions and exhibit clear associations of factors and categories of the variables. Furthermore, our method uncovers a very weak but statistically significant factor in the residuals of Fama-French five factor regression.
KW - Approximate factor models
KW - Debiased SOFAR estimator
KW - FDR and power
KW - Multiple testing
KW - Resparsification
UR - http://www.scopus.com/inward/record.url?scp=85121879337&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85121879337&partnerID=8YFLogxK
U2 - 10.1080/07350015.2021.2003203
DO - 10.1080/07350015.2021.2003203
M3 - Article
AN - SCOPUS:85121879337
SN - 0735-0015
VL - 41
SP - 126
EP - 139
JO - Journal of Business and Economic Statistics
JF - Journal of Business and Economic Statistics
IS - 1
ER -