Artículo de Revisión

Statistical inference was introduced by Fisher and Neyman-Pearson more than 90 years ago to define the probability that the difference in results between several groups is due to randomness or is a real, “significant” difference. The usual procedure is to test the probability (P)...

Descripción completa

Guardado en:

Detalles Bibliográficos
Autor principal:	DOMENECH,RAÚL J
Lenguaje:	Spanish / Castilian
Publicado:	Sociedad Médica de Santiago 2018
Materias:	Biostatistics Confidence Intervals Reproducibility of Results
Acceso en línea:	http://www.scielo.cl/scielo.php?script=sci_arttext&pid=S0034-98872018001001184
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

id	oai:scielo:S0034-98872018001001184
record_format	dspace
spelling	oai:scielo:S0034-988720180010011842019-03-07Artículo de RevisiónDOMENECH,RAÚL J Biostatistics Confidence Intervals Reproducibility of Results Statistical inference was introduced by Fisher and Neyman-Pearson more than 90 years ago to define the probability that the difference in results between several groups is due to randomness or is a real, “significant” difference. The usual procedure is to test the probability (P) against the null hypothesis that there is no real difference except because of the inevitable sampling variability. If this probability is high we accept the null hypothesis and infer that there is no real difference, but if P is low (P < 0.05) we reject the null hypothesis and infer that there is, a “significant” difference. However, a large amount of discoveries using this method are not reproducible. Statisticians have defined the deficiencies of the method and warned the researchers that P is a very unreliable measure. Two uncertainties of the “significance” concept are described in this review: a) The inefficacy of a P value to discard the null hypothesis; b) The low probability to reproduce a P value after an exact replication of the experiment. Due to the discredit of “significance” the American Statistical Association recently stated that P values do not provide a good measure of evidence for a hypothesis. Statisticians recommend to never use the word “significant” because it is misleading. Instead, the exact P value should be stated along with the effect size and confidence intervals. Nothing greater than P = 0.001 should be considered as a demonstration that something was discovered. Currently, several alternatives are being studied to replace the classical concepts.info:eu-repo/semantics/openAccessSociedad Médica de SantiagoRevista médica de Chile v.146 n.10 20182018-12-01text/htmlhttp://www.scielo.cl/scielo.php?script=sci_arttext&pid=S0034-98872018001001184es10.4067/S0034-98872018001001184
institution	Scielo Chile
collection	Scielo Chile
language	Spanish / Castilian
topic	Biostatistics Confidence Intervals Reproducibility of Results
spellingShingle	Biostatistics Confidence Intervals Reproducibility of Results DOMENECH,RAÚL J Artículo de Revisión
description	Statistical inference was introduced by Fisher and Neyman-Pearson more than 90 years ago to define the probability that the difference in results between several groups is due to randomness or is a real, “significant” difference. The usual procedure is to test the probability (P) against the null hypothesis that there is no real difference except because of the inevitable sampling variability. If this probability is high we accept the null hypothesis and infer that there is no real difference, but if P is low (P < 0.05) we reject the null hypothesis and infer that there is, a “significant” difference. However, a large amount of discoveries using this method are not reproducible. Statisticians have defined the deficiencies of the method and warned the researchers that P is a very unreliable measure. Two uncertainties of the “significance” concept are described in this review: a) The inefficacy of a P value to discard the null hypothesis; b) The low probability to reproduce a P value after an exact replication of the experiment. Due to the discredit of “significance” the American Statistical Association recently stated that P values do not provide a good measure of evidence for a hypothesis. Statisticians recommend to never use the word “significant” because it is misleading. Instead, the exact P value should be stated along with the effect size and confidence intervals. Nothing greater than P = 0.001 should be considered as a demonstration that something was discovered. Currently, several alternatives are being studied to replace the classical concepts.
author	DOMENECH,RAÚL J
author_facet	DOMENECH,RAÚL J
author_sort	DOMENECH,RAÚL J
title	Artículo de Revisión
title_short	Artículo de Revisión
title_full	Artículo de Revisión
title_fullStr	Artículo de Revisión
title_full_unstemmed	Artículo de Revisión
title_sort	artículo de revisión
publisher	Sociedad Médica de Santiago
publishDate	2018
url	http://www.scielo.cl/scielo.php?script=sci_arttext&pid=S0034-98872018001001184
work_keys_str_mv	AT domenechraulj articuloderevision
_version_	1718437028959354880

Artículo de Revisión

Ejemplares similares