Dynamic analysis of complex panel count data
Abstract
Panel count data occur in many fields including clinical, demographical and industrial studies and an extensive literature has been established for their regression analysis. However, most of the existing methods apply only to the situations where both covariates and their effects are constant or one of them may be time-dependent. In the first part of this dissertation, we consider a situation where both covariates and their effects may be time-dependent and an estimating equation-based approach is developed for estimating those time-varying effects. In the method, B-spline functions are employed to approximate time-dependent coefficients and the asymptotic properties of the proposed estimators are established. To assess the performance of the proposed approach, an extensive simulation study is conducted and suggests that it works well in practical situations. An application to the China Health and Nutrition Survey (CHNS) study is provided. In practice, there could exist more than one type of event of interest, such as two types of tumor recurrence, leading to multivariate panel count data. The second part of this dissertation considers marginal mean model for multivariate panel count data with time-dependent coefficient and covariate effects, which has limited previous research. Based on the conditional estimating equation method developed for time-dependent covariates, we approximate the coefficients by B-splines, hence allow both coefficients and covariates to be time-dependent. Simulation studies show that the proposed estimation procedures work well for practical situations. The methodology is again applied to the China Health and Nutrition Survey (CHNS) study. When we consider time-varying covariates and coefficients effects, most of the previous study focused on the proportional mean model because the likelihood function under the rate model involves intractable integration. However, the rate model is more realistic and efficient. Hence, in the third part of this dissertation, we propose a semi-parametric MLE method under the rate model for panel count data with time-dependent covariates and time-varying effects. B-spline functions are employed again to approximate time-dependent coefficients and an efficient Expectation-Maximization-type algorithm is developed to overcome the computational difficulty. The resulting estimators are shown to be consistent and asymptotically efficient. Monte Carlo simulation studies demonstrate that the proposed method enjoys desirable finite-sample properties. An application to The Young Women's Project (YWP) is provided.
Degree
Ph. D.