Metodologias para aprendizado de maquina
5x2cv Thomas G Dietterich. Approximate statistical tests for com- paring supervised classification learning algorithms. Neural computation, 10(7):1895–1923, 1998.
Demsar: Demsar, Janez. “Statistical comparisons of classifiers over multiple data sets.” The Journal of Machine Learning Research 7 (2006): 1-30.
extensoes: Garcia, Salvador, and Francisco Herrera. “An Extension on” Statistical Comparisons of Classifiers over Multiple Data Sets” for all Pairwise Comparisons.” Journal of machine learning research 9.12 (2008). e García S. and Herrera, F. (2010) Advanced Nonparametric Tests for Multiple Comparison in the Design of Experiments in Computational Intelligence and Data Mining: Experimental Analysis of Power. Information Sciences, 180, 2044-2064.
implementacao: R https://cran.r-project.org/web/packages/PMCMRplus/index.html, e https://github.com/b0rxa/scmamp python https://github.com/sherbold/autorank
bayesian non-parametric (2 clasificadores) Benavoli, Alessio, et al. “Time for a change: a tutorial for comparing multiple classifiers through Bayesian analysis.” The Journal of Machine Learning Research 18.1 (2017): 2653-2688.
implementacao https://github.com/BayesianTestsML/tutorial
ajuste de p-valor. R https://www.rdocumentation.org/packages/stats/versions/3.6.2/topics/p.adjust , python https://www.statsmodels.org/dev/generated/statsmodels.stats.multitest.multipletests.html