Performance ESFOR-04

The following tests were run on a Pentium IV 2.4 Ghz with 512 MB RAM under SUSe Linux 9.0 on the TPTP version 2.6. The Time/Memory columns give the average time spent on the solved problems - but due to sharing the test computer with other processes these results are not highly reliable. The detailed results are given sorted by name and rating in the Details column. Detailed explanations for the configurations and results are given in the ESFOR paper.

Horn problems without equality

The following tests were run on the 753 Horn problems without equality. The time limit is 300 s, the memory limit 500 MB. The version of Darwin used is the same as the one mentioned in the ESFOR workshop paper listed above.
Configuration Solved Time Memory Theorem Non Theorem Out of Memory Details
Default 591 6.5 4.0 570 21 0 name/rating
with Discrimination trees 592 6.0 3.9 571 21 0 name/rating
without Subsume 591 6.4 4.0 570 21 0 name/rating
without Term Database 589 5.8 3.9 568 21 1 name/rating
best inactive candidate 600 7.6 4.6 579 21 1 name/rating

non-Horn problems without equality

The following tests were run on the 1172 non-Horn problems without equality. The time limit is 300 s, the memory limit 500 MB. Note: the reason for unsound derivations are buggy TPTP problems.
Configuration Solved Time Memory Theorem Non Theorem Out of Memory Details
Default 803 4.1 3.2 520 283 0 name/rating
with Discrimination trees 801 3.7 4.0 519 282 0 name/rating
without Subsume 802 4.1 3.3 520 282 0 name/rating
without Term Database 803 4.0 4.1 520 283 0 name/rating
best inactive candidate 802 4.0 3.2 519 283 0 name/rating
Dynamic Backtracking 804 3.9 3.4 517 287 0 name/rating
with +v 730 9.0 6.4 482 248 32 name/rating

CASC18

The following tests were run on some of the problem sets used at CASC18. The time limit is 500 s, the memory limit 500 MB. See also the results of the CASC18 competition. Note: the reason for unsound derivations are buggy TPTP problems.
Problems Configuration Solved Time Memory Details
hne / 35 Default 18 24.0 17.3 name/rating
hne / 35 best inactive candidate 19 23.0 20.6 name/rating
heq / 35 Default 9 23.6 12.8 name/rating
heq / 35 best inactive candidate 9 24.2 17.9 name/rating
eps / 35 Default 28 11.1 4.8 name/rating
eps / 35 best inactive candidate 28 10.0 4.8 name/rating
eps / 35 Dynamic Backtracking 30 3.8 6.7 name/rating
eps / 35 with +v 29 26.9 12.6 name/rating
ept / 35 Default 33 16.8 4.5 name/rating
ept / 35 best inactive candidate 33 18.1 4.6 name/rating
ept / 35 Dynamic Backtracking 32 9.9 4.4 name/rating
ept / 35 with +v 28 7.1 4.7 name/rating
nne / 35 Default 16 7.4 3.1 name/rating
nne / 35 best inactive candidate 17 19.7 4.0 name/rating
nne / 35 Dynamic Backtracking 16 7.7 3.6 name/rating
nne / 35 with +v 11 6.8 4.8 name/rating
sne / 35 Default 9 14.5 2.1 name/rating
sne / 35 best inactive candidate 9 14.1 2.1 name/rating
sne / 35 Dynamic Backtracking 9 15.5 4.0 name/rating
sne / 35 with +v 6 0.0 2.5 name/rating

CASC19

The following tests were run on some of the problem sets used at CASC19. The time limit is 500 s, the memory limit 500 MB. See also the results of the CASC19 competition. Note: the reason for unsound derivations are buggy TPTP problems.
Problems Configuration Solved Time Memory Details
hne / 20 Default 10 64.9 37.4 name/rating
hne / 20 best inactive candidate 10 33.8 32.4 name/rating
heq / 20 Default 0 0.0 0.0 name/rating
heq / 20 best inactive candidate 0 0.0 0.0 name/rating
eps / 35 Default 31 6.2 8.1 name/rating
eps / 35 best inactive candidate 31 5.4 8.2 name/rating
eps / 35 Dynamic Backtracking 31 5.0 7.6 name/rating
eps / 35 with +v 31 28.5 16.6 name/rating
ept / 35 Default 31 5.6 4.5 name/rating
ept / 35 best inactive candidate 32 19.1 5.8 name/rating
ept / 35 Dynamic Backtracking 31 7.6 3.5 name/rating
ept / 35 with +v 30 29.3 5.9 name/rating
nne / 20 Default 12 16.5 6.4 name/rating
nne / 20 best inactive candidate 13 31.0 6.0 name/rating
nne / 20 Dynamic Backtracking 10 16.8 5.4 name/rating
nne / 20 with +v 8 53.9 16.5 name/rating
sne / 35 Default 3 0.0 4.0 name/rating
sne / 35 best inactive candidate 3 0.0 4.0 name/rating
sne / 35 Dynamic Backtracking 3 0.0 2.0 name/rating
sne / 35 with +v 2 0.0 2.0 name/rating


Alexander Fuchs
Last modified: Sun Mar 20 16:49:12 CST 2005