With the raw results downloaded, these analyses can be recreated using development.MultipleClassifierEvaluation if you really want to; hopefully the comments in that class are enough to get it working. Due to time constraints, train results were not created for every classifier in every scenario. If it complains that train results cannot be found (while the test results can) for the analysis you are trying to run, set the boolean testResultsOnly = true.