TY - JOUR
T1 - Bivariate symmetric Heckman models and their characterization
AU - Saulo, Helton
AU - Vila, Roberto
AU - Cordeiro, Shayane S.
AU - Leiva, Víctor
N1 - Funding Information:
The authors would like to thank the Editor, Associate Editor and referees for their comments which led to improve the presentation of this article. The present research was funded partially by (i) CNPq, Brazil (grant number 309674/2020-4) , Brazil (H. Saulo); (ii) Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES), Brazil , Finance Code 001 (S.S. Cordeiro); and (iii) FONDECYT, Chile grant number 1200525 (V. Leiva and H. Saulo) from the National Agency for Research and Development (ANID), Chile of the Chilean government under the Ministry of Science, Technology, Knowledge, and Innovation. All authors read and approved the final manuscript.
Publisher Copyright:
© 2022 Elsevier Inc.
PY - 2022
Y1 - 2022
N2 - A sample selection bias problem arises when a variable of interest or response is correlated with a latent variable. This problem is presented when the response variable has part of its observations censored. The Heckman sample selection model is based on the bivariate normality assumption and fits both response and latent variables. Recently, this assumption has been relaxed to more flexible models based on the Student-t distribution, which has appealing statistical properties. In this article, we introduce an extention of the Heckman sample selection model to the wide class of symmetric distributions. In the new class of sample selection models, covariates are used to describe its dispersion and correlation parameters explaining heteroscedasticity and sample selection bias, respectively. We derive mathematical and statistical properties of the introduced model, and estimate its parameters with the maximum likelihood method. The case of the bivariate Heckman-Student-t model, as a special member of the family of symmetric Heckman models, is analyzed. Monte Carlo simulations are performed to assess the statistical behavior of the estimation method. Two real data sets are analyzed to illustrate our results.
AB - A sample selection bias problem arises when a variable of interest or response is correlated with a latent variable. This problem is presented when the response variable has part of its observations censored. The Heckman sample selection model is based on the bivariate normality assumption and fits both response and latent variables. Recently, this assumption has been relaxed to more flexible models based on the Student-t distribution, which has appealing statistical properties. In this article, we introduce an extention of the Heckman sample selection model to the wide class of symmetric distributions. In the new class of sample selection models, covariates are used to describe its dispersion and correlation parameters explaining heteroscedasticity and sample selection bias, respectively. We derive mathematical and statistical properties of the introduced model, and estimate its parameters with the maximum likelihood method. The case of the bivariate Heckman-Student-t model, as a special member of the family of symmetric Heckman models, is analyzed. Monte Carlo simulations are performed to assess the statistical behavior of the estimation method. Two real data sets are analyzed to illustrate our results.
KW - Heckman models
KW - Multivariate elliptically contoured distributions
KW - Symmetric distributions
KW - Varying correlation and dispersion
UR - http://www.scopus.com/inward/record.url?scp=85138590555&partnerID=8YFLogxK
U2 - 10.1016/j.jmva.2022.105097
DO - 10.1016/j.jmva.2022.105097
M3 - Article
AN - SCOPUS:85138590555
JO - Journal of Multivariate Analysis
JF - Journal of Multivariate Analysis
SN - 0047-259X
M1 - 105097
ER -