A Framework for Efficient N-Way Interaction Testing in Case/Control Studies With Categorical Data

Bibliographic Details
Main Authors: Aristos Aristodimou, Athos Antoniades, Efthimios Dardiotis, Eleni Loizidou, George Spyrou, Christina Votsi, Christodoulou Kyproula, Marios Pantzaris, Nikolaos Grigoriadis, Georgios Hadjigeorgiou, Theodoros Kyriakides, Constantinos Pattichi
Format: article
Language: EN
Published: IEEE 2021
Online Access: https://doaj.org/article/37424223ba2649ab913818a0b0e64cf5
Description
Summary: Goal: Most common diseases are influenced by multiple gene interactions and by interactions with the environment. An exhaustive search for such interactions is computationally expensive and must address the multiple testing problem. A four-step framework is proposed for the efficient identification of n-way interactions. Methods: The framework was applied to a Multiple Sclerosis dataset with 725 subjects and 147 tagging SNPs. The first two steps of the framework are quality control and feature selection. The third step uses clustering and binary-encodes the features. The final step performs the n-way interaction testing. Results: The feature space was reduced to 7 SNPs and, with the proposed binary encoding, more 2-SNP and 3-SNP interactions were identified than with the initial encoding. Conclusions: The framework selects informative features and, with the proposed binary encoding, identifies more n-way interactions by increasing the power of the statistical analysis.
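
To give a sense of scale for the multiple testing problem mentioned in the summary, the short sketch below counts the candidate 2-SNP and 3-SNP combinations before feature selection (147 SNPs) and after it (7 SNPs). The function names are hypothetical, and the Bonferroni correction is an assumption used only for illustration; the record does not state which correction the authors applied.

```python
from math import comb


def n_way_tests(num_features: int, max_order: int) -> dict[int, int]:
    """Count the candidate k-way feature combinations for k = 2..max_order."""
    return {k: comb(num_features, k) for k in range(2, max_order + 1)}


def bonferroni_threshold(alpha: float, num_tests: int) -> float:
    """Per-test significance level under a Bonferroni correction.

    Assumption: Bonferroni is used here only as a simple illustration;
    the record does not say which correction the framework uses.
    """
    return alpha / num_tests


# Feature counts taken from the summary: 147 tagging SNPs before
# feature selection, 7 SNPs after it.
before = n_way_tests(147, 3)
after = n_way_tests(7, 3)

for label, counts in (("147 SNPs", before), ("7 SNPs", after)):
    total = sum(counts.values())
    print(label, counts, "per-test alpha:", bonferroni_threshold(0.05, total))
```

Under these assumptions, an exhaustive search over all 147 SNPs would cover 10,731 pairs and 518,665 triples, whereas the reduced set of 7 SNPs gives 21 pairs and 35 triples, so the per-test significance threshold is far less strict after feature selection.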