| » ASP Competition | |
|
» Login
|
ErrataErrors were found in encodings and/or checkers of three benchmarks after the competition took place. Below we discuss these errors in detail and their possible impact on the outcome of the competition. The errors on this page have been identified by Broes De Cat and Johan Wittocx. We offer our sincere apologies for these errors.
Impact on the competition resultsIn all three cases, the errors in the encodings make the benchmark problems computationally easier for the ASP solvers that use them. In the case of Labyrinth, the Asparagus encoding solves a variant of the benchmark that is computationally simpler than the actual benchmark. In the case of EdgeMatching, the same can be said for the encodings of Asparagus and of DLV.In the three benchmarks, teams that did not use the erroneous encodings had a clear disadvantage. These are:
The impact is possibly larger in the track "Decision problems in NP", to which Labyrinth and EdgeMatching belong. If we drop these 2 benchmarks and we compare total number of solved instances, then Potassco still leads with a good margin, followed by single system teams IDP, Claspfolio and Cmodels: here, IDP overtakes Claspfolio and Cmodels. This does not guarantee that IDP would have been first single system in this track. Indeed, rankings were made on the basis of weighted sums of number of solved instances for individual benchmarks and these weights depended on various parameters. The only fully correct way to recompute rankings would be to rerun all solvers on all instances of the reduced set of benchmarks. We have decided against this. Lessons for the futureIn our opinion, a modeling competition like this ASP competition is valuable and gives valuable information on various aspect of modeling and solving (see the competition report).Unfortunately, in such a competition, there is an inherent risk that bugs go unnoticed or even occur in the programs that should check for correctness. Based on our experience, here are some ideas that would have led to an early detection of the above problems and could help to reduce the risk for errors in future modeling competitions:
|