Efficient replication of over 180 genetic associations with self-reported medical data.

While the cost and speed of generating genomic data have come down dramatically in recent years, the slow pace of collecting medical data for large cohorts continues to hamper genetic research. Here we evaluate a novel online framework for obtaining large amounts of medical information from a recont...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Joyce Y Tung, Chuong B Do, David A Hinds, Amy K Kiefer, J Michael Macpherson, Arnab B Chowdry, Uta Francke, Brian T Naughton, Joanna L Mountain, Anne Wojcicki, Nicholas Eriksson
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2011
Materias:
R
Q
Acceso en línea:https://doaj.org/article/6b0856aecd7845b09963675ce77be0ce
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:6b0856aecd7845b09963675ce77be0ce
record_format dspace
spelling oai:doaj.org-article:6b0856aecd7845b09963675ce77be0ce2021-11-18T06:47:47ZEfficient replication of over 180 genetic associations with self-reported medical data.1932-620310.1371/journal.pone.0023473https://doaj.org/article/6b0856aecd7845b09963675ce77be0ce2011-01-01T00:00:00Zhttps://www.ncbi.nlm.nih.gov/pmc/articles/pmid/21858135/?tool=EBIhttps://doaj.org/toc/1932-6203While the cost and speed of generating genomic data have come down dramatically in recent years, the slow pace of collecting medical data for large cohorts continues to hamper genetic research. Here we evaluate a novel online framework for obtaining large amounts of medical information from a recontactable cohort by assessing our ability to replicate genetic associations using these data. Using web-based questionnaires, we gathered self-reported data on 50 medical phenotypes from a generally unselected cohort of over 20,000 genotyped individuals. Of a list of genetic associations curated by NHGRI, we successfully replicated about 75% of the associations that we expected to (based on the number of cases in our cohort and reported odds ratios, and excluding a set of associations with contradictory published evidence). Altogether we replicated over 180 previously reported associations, including many for type 2 diabetes, prostate cancer, cholesterol levels, and multiple sclerosis. We found significant variation across categories of conditions in the percentage of expected associations that we were able to replicate, which may reflect systematic inflation of the effects in some initial reports, or differences across diseases in the likelihood of misdiagnosis or misreport. We also demonstrated that we could improve replication success by taking advantage of our recontactable cohort, offering more in-depth questions to refine self-reported diagnoses. Our data suggest that online collection of self-reported data from a recontactable cohort may be a viable method for both broad and deep phenotyping in large populations.Joyce Y TungChuong B DoDavid A HindsAmy K KieferJ Michael MacphersonArnab B ChowdryUta FranckeBrian T NaughtonJoanna L MountainAnne WojcickiNicholas ErikssonPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 6, Iss 8, p e23473 (2011)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Joyce Y Tung
Chuong B Do
David A Hinds
Amy K Kiefer
J Michael Macpherson
Arnab B Chowdry
Uta Francke
Brian T Naughton
Joanna L Mountain
Anne Wojcicki
Nicholas Eriksson
Efficient replication of over 180 genetic associations with self-reported medical data.
description While the cost and speed of generating genomic data have come down dramatically in recent years, the slow pace of collecting medical data for large cohorts continues to hamper genetic research. Here we evaluate a novel online framework for obtaining large amounts of medical information from a recontactable cohort by assessing our ability to replicate genetic associations using these data. Using web-based questionnaires, we gathered self-reported data on 50 medical phenotypes from a generally unselected cohort of over 20,000 genotyped individuals. Of a list of genetic associations curated by NHGRI, we successfully replicated about 75% of the associations that we expected to (based on the number of cases in our cohort and reported odds ratios, and excluding a set of associations with contradictory published evidence). Altogether we replicated over 180 previously reported associations, including many for type 2 diabetes, prostate cancer, cholesterol levels, and multiple sclerosis. We found significant variation across categories of conditions in the percentage of expected associations that we were able to replicate, which may reflect systematic inflation of the effects in some initial reports, or differences across diseases in the likelihood of misdiagnosis or misreport. We also demonstrated that we could improve replication success by taking advantage of our recontactable cohort, offering more in-depth questions to refine self-reported diagnoses. Our data suggest that online collection of self-reported data from a recontactable cohort may be a viable method for both broad and deep phenotyping in large populations.
format article
author Joyce Y Tung
Chuong B Do
David A Hinds
Amy K Kiefer
J Michael Macpherson
Arnab B Chowdry
Uta Francke
Brian T Naughton
Joanna L Mountain
Anne Wojcicki
Nicholas Eriksson
author_facet Joyce Y Tung
Chuong B Do
David A Hinds
Amy K Kiefer
J Michael Macpherson
Arnab B Chowdry
Uta Francke
Brian T Naughton
Joanna L Mountain
Anne Wojcicki
Nicholas Eriksson
author_sort Joyce Y Tung
title Efficient replication of over 180 genetic associations with self-reported medical data.
title_short Efficient replication of over 180 genetic associations with self-reported medical data.
title_full Efficient replication of over 180 genetic associations with self-reported medical data.
title_fullStr Efficient replication of over 180 genetic associations with self-reported medical data.
title_full_unstemmed Efficient replication of over 180 genetic associations with self-reported medical data.
title_sort efficient replication of over 180 genetic associations with self-reported medical data.
publisher Public Library of Science (PLoS)
publishDate 2011
url https://doaj.org/article/6b0856aecd7845b09963675ce77be0ce
work_keys_str_mv AT joyceytung efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT chuongbdo efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT davidahinds efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT amykkiefer efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT jmichaelmacpherson efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT arnabbchowdry efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT utafrancke efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT briantnaughton efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT joannalmountain efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT annewojcicki efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT nicholaseriksson efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
_version_ 1718424355277373440