嘉宾介绍
主题介绍
Genetical genomics data provide promising opportunities for integrative analysis of gene expression and genotype data. Lin et al. (2015) recently proposed an instrumental variables (IV) regression framework to select important genes with high dimensional genetical genomics data. The IV regression solves the problem of endogeneity issue caused by potential correlation of gene expressions and the error terms, hence improves the performance of gene selection. As genes function in networks to fulfill their joint task, incorporating network or graph structures in a regression model can further improve gene selection performance. Furthermore, gene expressions can be nonlinearly regulated or modified by environmental variables. In this work, we propose a graph constrained penalized nonlinear IV regression framework to solve the endogeneity issue and to improve the selection performance via considering gene network structures. We propose a two-step estimation procedure by adopting a network constrained regularization method to obtain better variable selection and estimation, and further establish the selection consistency. Simulation and real data analysis are conducted to show the utility of the method.
This is a joint work with Bin Gao and Xu Liu.
—— 点击加载更多 ——
收起