Artículo de Revisión

Statistical inference was introduced by Fisher and Neyman-Pearson more than 90 years ago to define the probability that the difference in results between several groups is due to randomness or is a real, “significant” difference. The usual procedure is to test the probability (P)...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autor principal: DOMENECH,RAÚL J
Lenguaje:Spanish / Castilian
Publicado: Sociedad Médica de Santiago 2018
Materias:
Acceso en línea:http://www.scielo.cl/scielo.php?script=sci_arttext&pid=S0034-98872018001001184
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:scielo:S0034-98872018001001184
record_format dspace
spelling oai:scielo:S0034-988720180010011842019-03-07Artículo de RevisiónDOMENECH,RAÚL J Biostatistics Confidence Intervals Reproducibility of Results Statistical inference was introduced by Fisher and Neyman-Pearson more than 90 years ago to define the probability that the difference in results between several groups is due to randomness or is a real, &#8220;significant&#8221; difference. The usual procedure is to test the probability (P) against the null hypothesis that there is no real difference except because of the inevitable sampling variability. If this probability is high we accept the null hypothesis and infer that there is no real difference, but if P is low (P < 0.05) we reject the null hypothesis and infer that there is, a &#8220;significant&#8221; difference. However, a large amount of discoveries using this method are not reproducible. Statisticians have defined the deficiencies of the method and warned the researchers that P is a very unreliable measure. Two uncertainties of the &#8220;significance&#8221; concept are described in this review: a) The inefficacy of a P value to discard the null hypothesis; b) The low probability to reproduce a P value after an exact replication of the experiment. Due to the discredit of &#8220;significance&#8221; the American Statistical Association recently stated that P values do not provide a good measure of evidence for a hypothesis. Statisticians recommend to never use the word &#8220;significant&#8221; because it is misleading. Instead, the exact P value should be stated along with the effect size and confidence intervals. Nothing greater than P = 0.001 should be considered as a demonstration that something was discovered. Currently, several alternatives are being studied to replace the classical concepts.info:eu-repo/semantics/openAccessSociedad Médica de SantiagoRevista médica de Chile v.146 n.10 20182018-12-01text/htmlhttp://www.scielo.cl/scielo.php?script=sci_arttext&pid=S0034-98872018001001184es10.4067/S0034-98872018001001184
institution Scielo Chile
collection Scielo Chile
language Spanish / Castilian
topic Biostatistics
Confidence Intervals
Reproducibility of Results
spellingShingle Biostatistics
Confidence Intervals
Reproducibility of Results
DOMENECH,RAÚL J
Artículo de Revisión
description Statistical inference was introduced by Fisher and Neyman-Pearson more than 90 years ago to define the probability that the difference in results between several groups is due to randomness or is a real, &#8220;significant&#8221; difference. The usual procedure is to test the probability (P) against the null hypothesis that there is no real difference except because of the inevitable sampling variability. If this probability is high we accept the null hypothesis and infer that there is no real difference, but if P is low (P < 0.05) we reject the null hypothesis and infer that there is, a &#8220;significant&#8221; difference. However, a large amount of discoveries using this method are not reproducible. Statisticians have defined the deficiencies of the method and warned the researchers that P is a very unreliable measure. Two uncertainties of the &#8220;significance&#8221; concept are described in this review: a) The inefficacy of a P value to discard the null hypothesis; b) The low probability to reproduce a P value after an exact replication of the experiment. Due to the discredit of &#8220;significance&#8221; the American Statistical Association recently stated that P values do not provide a good measure of evidence for a hypothesis. Statisticians recommend to never use the word &#8220;significant&#8221; because it is misleading. Instead, the exact P value should be stated along with the effect size and confidence intervals. Nothing greater than P = 0.001 should be considered as a demonstration that something was discovered. Currently, several alternatives are being studied to replace the classical concepts.
author DOMENECH,RAÚL J
author_facet DOMENECH,RAÚL J
author_sort DOMENECH,RAÚL J
title Artículo de Revisión
title_short Artículo de Revisión
title_full Artículo de Revisión
title_fullStr Artículo de Revisión
title_full_unstemmed Artículo de Revisión
title_sort artículo de revisión
publisher Sociedad Médica de Santiago
publishDate 2018
url http://www.scielo.cl/scielo.php?script=sci_arttext&pid=S0034-98872018001001184
work_keys_str_mv AT domenechraulj articuloderevision
_version_ 1718437028959354880