Overview of performance values

The following statistics were calculated from the performance values of each algorithm:

obs (number of observations within performance values)
nas (number of NAs, i.e., missing values, within performance values)
min (minimum), mean (arithmetic mean), max (maximum), sd (standard deviation)
qu_1st (1st quartile = lower quartile = 25%-quantile)
med (median = 50%-quantile)
qu_3rd (3rd quartile = upper quartile = 75%-quantile)
coeff_var (coefficient of variation = standard deviation / arithmetic mean)

	obs	min	qu_1st	med	mean	qu_3rd	max	sd	coeff_var
AIGSolve	825	0	0.14	5	5168.46	18000	18000	8127.47	1.57251
aqua.f3v	825	0	0.07	13.99	7493.27	18000	18000	8868.71	1.18356
aqua.s2v	825	0	0.07	11.49	7449.54	18000	18000	8861.25	1.1895
aqua.s3o	825	0	0.07	15.01	7557.3	18000	18000	8880.78	1.17513
caqe.minisat	825	0	0.19	4.41	5451.17	18000	18000	8256.02	1.51454
caqe.picosat	825	0	0.19	4.39	5148.1	18000	18000	8116.24	1.57655
depqbf.v1	825	0.02	0.21	45.81	8063.03	18000	18000	8944.51	1.10932
depqbf.v2	825	0.02	0.21	2.41	4860.7	18000	18000	7977.53	1.64123
depqbf.v3	825	0	0.14	9.11	6521.44	18000	18000	8637.14	1.32442
ghostq.cegar	825	0.06	0.52	8.2	5253.99	18000	18000	8169.1	1.55484
ghostq.plain	825	0.05	0.39	6.95	5623.91	18000	18000	8330.12	1.4812
hiqqer1	825	0.03	0.06	1.24	5381.75	18000	18000	8230.07	1.52926
hiqqer1ldsq	825	0.02	0.05	1.16	5489.64	18000	18000	8278	1.50793
hiqqer3	825	0.03	0.06	1.45	5535.28	18000	18000	8295.1	1.49859
iprover.QBF	825	0.04	11.36	18000	10422.9	18000	18000	8876.46	0.851628
iprover.QBF.bloqqer	825	0.04	3.19	18000	10942.3	18000	18000	8781.78	0.802555
qesto	825	0	0.03	1.4	5320.67	18000	18000	8198.2	1.54082
qestos	825	0	0.03	0.68	6507.1	18000	18000	8647.68	1.32896
qsts	825	0	0.04	0.56	5722.74	18000	18000	8380.4	1.4644
rareqs	825	0	0.02	0.54	4053.54	212.79	18000	7503.13	1.85101
squeezebf.struqs	825	0	3.18	18000	10269.9	18000	18000	8900.05	0.866615
struqs.10	825	0	1.94	18000	10204.6	18000	18000	8908.96	0.873031
xb.bid.qsts	825	0	0.19	3.95	3666.71	131.6	18000	7225.59	1.9706
xb.qsts	825	0	0.18	3.12	4644	18000	18000	7859.44	1.69239

Summary of the runstatus per algorithm

The following table summarizes the runstatus of each algorithm over all instances (in %).

	ok	timeout
AIGSolve	71.394	28.606
aqua.f3v	58.424	41.576
aqua.s2v	58.667	41.333
aqua.s3o	58.061	41.939
caqe.minisat	69.818	30.182
caqe.picosat	71.515	28.485
depqbf.v1	55.273	44.727
depqbf.v2	73.091	26.909
depqbf.v3	63.879	36.121
ghostq.cegar	70.909	29.091
ghostq.plain	68.848	31.152
hiqqer1	70.182	29.818
hiqqer1ldsq	69.576	30.424
hiqqer3	69.333	30.667
iprover.QBF	42.182	57.818
iprover.QBF.bloqqer	39.273	60.727
qesto	70.545	29.455
qestos	63.879	36.121
qsts	68.242	31.758
rareqs	77.576	22.424
squeezebf.struqs	43.030	56.970
struqs.10	43.394	56.606
xb.bid.qsts	79.758	20.242
xb.qsts	74.303	25.697

Dominated Algorithms

Here, you'll find an overview of dominating/dominated algorithms:
None of the algorithms was superior to any of the other.

An algorithm (A) is considered to be superior to an other algorithm (B), if it has at least an equal performance on all instances (compared to B) and if it is better on at least one of them. A missing value is automatically a worse performance. However, instances which could not be solved by either one of the algorithms, were not considered for the dominance relation.

Visualisations

Important note w.r.t. some of the following plots:
If appropriate, we imputed performance values for failed or censored runs. We used max + 0.3 * (max - min), in case of minimization problems, or min - 0.3 * (max - min), in case of maximization problems.
In addition, a small noise is added to the imputed values (except for the cluster matrix, based on correlations, which is shown at the end of this page).

Boxplots of performance values

Imputing the performance values of failed or censored runs (as described in the red note at the beginning of this section):

Discarding the performance values of failed or censored runs:

## Warning: Removed 7168 rows containing non-finite values (stat_boxplot).

Estimated densitities of performance values

Imputing the performance values of failed or censored runs (as described in the red note at the beginning of this section):

Discarding the performance values of failed or censored runs:

Estimated cumulative distribution functions of performance values

Imputing the performance values of failed runs (as described in the red note at the beginning of this section):

Discarding the performance values of failed or censored runs:

Scatterplot matrix of the performance values

The figure underneath shows pairwise scatterplots of the performance values.

Imputing the performance values of failed and censored runs (as described in the red note at the beginning of this section):

Clustering algorithms based on their correlations

The following figure shows the correlations of the ranks of the performance values. Per default it will show the correlation coefficient of spearman. Missing values were imputed prior to computing the correlation coefficients. The algorithms are ordered in a way that similar (highly correlated) algorithms are close to each other. Per default the clustering is based on hierarchical clustering, using Ward's method.