
Mode Regression in the Secondary Analysis of Case-Control Data

CAO Rui1, TIAN Maozai1,2,3

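  1. Center for Applied Statistics, School of Statistics, Renmin University of China, Beijing 100872; 2. School of Statistics and Information, Xinjiang University of Finance and Economics, Urumqi 830012; 3. School of Statistics, Lanzhou University of Finance and Economics, Lanzhou 730020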
  • Online: 2019-06-25  Published: 2019-10-10

CAO Rui, TIAN Maozai. Mode Regression in the Secondary Analysis of Case-Control Data[J]. Journal of Systems Science and Mathematical Sciences (系统科学与数学), 2019, 39(6): 954-976.


Case-control studies are widely used in epidemiology and other fields. Case-control data can be used not only to identify risk factors for a disease, but also for secondary analysis, that is, to explore the relationships among the risk factors associated with the disease. Existing methods in the literature focus mainly on mean regression and quantile regression in secondary analysis. The mode, as the value most likely to occur in the data, is an important parameter for describing the central location of the data and an important complement to the mean and quantiles. Taking the features of case-control data into account, this paper therefore proposes a mode regression method based on estimating equations for secondary analysis and discusses the asymptotic properties of the estimator. Monte Carlo simulation results show that the proposed estimation method is more effective and more widely applicable than other methods. Finally, a breast cancer data set is used to illustrate the performance of the proposed method.
