Artículo de Revisión
Statistical inference was introduced by Fisher and Neyman-Pearson more than 90 years ago to define the probability that the difference in results between several groups is due to randomness or is a real, “significant” difference. The usual procedure is to test the probability (P)...
Guardado en:
Autor principal: | |
---|---|
Lenguaje: | Spanish / Castilian |
Publicado: |
Sociedad Médica de Santiago
2018
|
Materias: | |
Acceso en línea: | http://www.scielo.cl/scielo.php?script=sci_arttext&pid=S0034-98872018001001184 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:scielo:S0034-98872018001001184 |
---|---|
record_format |
dspace |
spelling |
oai:scielo:S0034-988720180010011842019-03-07Artículo de RevisiónDOMENECH,RAÚL J Biostatistics Confidence Intervals Reproducibility of Results Statistical inference was introduced by Fisher and Neyman-Pearson more than 90 years ago to define the probability that the difference in results between several groups is due to randomness or is a real, “significant” difference. The usual procedure is to test the probability (P) against the null hypothesis that there is no real difference except because of the inevitable sampling variability. If this probability is high we accept the null hypothesis and infer that there is no real difference, but if P is low (P < 0.05) we reject the null hypothesis and infer that there is, a “significant” difference. However, a large amount of discoveries using this method are not reproducible. Statisticians have defined the deficiencies of the method and warned the researchers that P is a very unreliable measure. Two uncertainties of the “significance” concept are described in this review: a) The inefficacy of a P value to discard the null hypothesis; b) The low probability to reproduce a P value after an exact replication of the experiment. Due to the discredit of “significance” the American Statistical Association recently stated that P values do not provide a good measure of evidence for a hypothesis. Statisticians recommend to never use the word “significant” because it is misleading. Instead, the exact P value should be stated along with the effect size and confidence intervals. Nothing greater than P = 0.001 should be considered as a demonstration that something was discovered. Currently, several alternatives are being studied to replace the classical concepts.info:eu-repo/semantics/openAccessSociedad Médica de SantiagoRevista médica de Chile v.146 n.10 20182018-12-01text/htmlhttp://www.scielo.cl/scielo.php?script=sci_arttext&pid=S0034-98872018001001184es10.4067/S0034-98872018001001184 |
institution |
Scielo Chile |
collection |
Scielo Chile |
language |
Spanish / Castilian |
topic |
Biostatistics Confidence Intervals Reproducibility of Results |
spellingShingle |
Biostatistics Confidence Intervals Reproducibility of Results DOMENECH,RAÚL J Artículo de Revisión |
description |
Statistical inference was introduced by Fisher and Neyman-Pearson more than 90 years ago to define the probability that the difference in results between several groups is due to randomness or is a real, “significant” difference. The usual procedure is to test the probability (P) against the null hypothesis that there is no real difference except because of the inevitable sampling variability. If this probability is high we accept the null hypothesis and infer that there is no real difference, but if P is low (P < 0.05) we reject the null hypothesis and infer that there is, a “significant” difference. However, a large amount of discoveries using this method are not reproducible. Statisticians have defined the deficiencies of the method and warned the researchers that P is a very unreliable measure. Two uncertainties of the “significance” concept are described in this review: a) The inefficacy of a P value to discard the null hypothesis; b) The low probability to reproduce a P value after an exact replication of the experiment. Due to the discredit of “significance” the American Statistical Association recently stated that P values do not provide a good measure of evidence for a hypothesis. Statisticians recommend to never use the word “significant” because it is misleading. Instead, the exact P value should be stated along with the effect size and confidence intervals. Nothing greater than P = 0.001 should be considered as a demonstration that something was discovered. Currently, several alternatives are being studied to replace the classical concepts. |
author |
DOMENECH,RAÚL J |
author_facet |
DOMENECH,RAÚL J |
author_sort |
DOMENECH,RAÚL J |
title |
Artículo de Revisión |
title_short |
Artículo de Revisión |
title_full |
Artículo de Revisión |
title_fullStr |
Artículo de Revisión |
title_full_unstemmed |
Artículo de Revisión |
title_sort |
artículo de revisión |
publisher |
Sociedad Médica de Santiago |
publishDate |
2018 |
url |
http://www.scielo.cl/scielo.php?script=sci_arttext&pid=S0034-98872018001001184 |
work_keys_str_mv |
AT domenechraulj articuloderevision |
_version_ |
1718437028959354880 |