LLAMA results

All results were produced by using the cross-validation splits in the repository with 10 folds and 1 repetitions.
The best values within a type (i.e., baseline (except for vbs), classif, regr and cluster) and performance measure (i.e., Percentage solved, PAR10, MCP) are colored green. Furthermore, the three best values over all groups within a performance measure are colored pink, the absolute best one is red.

The performance is measured in three different ways.

algo model succ par10 mcp
baseline vbs 0.843 8187.518 0.000
baseline singleBest 0.717 14605.904 718.386
baseline singleBestByPar 0.717 14605.904 718.386
baseline singleBestBySuccesses 0.717 14605.904 718.386
regr lm 0.717 14608.378 711.428
regr rpart 0.693 15694.222 747.190
regr randomForest 0.750 12872.154 473.871

The following default feature steps were used for model building:

Pre, Basic, KLB, CG, code, AST

Number of presolved instances: 0

The cost for using the feature steps (adapted for presolving) is: 0 or on average: 0

The feature steps correspond to the following 50 / 115 instance features:

nvarsOrig, nclausesOrig, nvars, nclauses, reducedVars,
reducedClauses, vars_clauses_ratio, POSNEG_RATIO_CLAUSE_mean, POSNEG_RATIO_CLAUSE_coeff_variation, POSNEG_RATIO_CLAUSE_min,
POSNEG_RATIO_CLAUSE_max, POSNEG_RATIO_CLAUSE_entropy, VCG_CLAUSE_mean, VCG_CLAUSE_coeff_variation, VCG_CLAUSE_min,
VCG_CLAUSE_max, VCG_CLAUSE_entropy, UNARY, BINARYp, TRINARYp,
VCG_VAR_mean, VCG_VAR_coeff_variation, VCG_VAR_min, VCG_VAR_max, VCG_VAR_entropy,
POSNEG_RATIO_VAR_mean, POSNEG_RATIO_VAR_stdev, POSNEG_RATIO_VAR_min, POSNEG_RATIO_VAR_max, POSNEG_RATIO_VAR_entropy,
HORNY_VAR_mean, HORNY_VAR_coeff_variation, HORNY_VAR_min, HORNY_VAR_max, HORNY_VAR_entropy,
horn_clauses_fraction, VG_mean, VG_coeff_variation, VG_min, VG_max,
CG_mean, CG_coeff_variation, CG_min, CG_max, CG_entropy,
cluster_coeff_mean, cluster_coeff_coeff_variation, cluster_coeff_min, cluster_coeff_max, cluster_coeff_entropy

The feature steps correspond to the following 75 / 75 algorithm features:

Lines..Average., Lines..Total., Size..Average., Size..Total., Number.of.files,
Cyclomatic..Average., Cyclomatic..Total., Max.Indent..Average., Max.Indent..Total., nb_nodes,
nb_edges, degree_min, degree_max, degree_mean, degree_variance,
degree_entropy, transitivity, clustering_min, clustering_max, clustering_mean,
clustering_variance, path_min, paths_max, path_mean, path_variance,
path_entropy, Stmt, Type, Decl, Attribute,
Operator, Literal, edge_ss, edge_st, edge_sd,
edge_sa, edge_so, edge_sl, edge_ts, edge_tt,
edge_td, edge_ta, edge_to, edge_tl, edge_ds,
edge_dt, edge_dd, edge_da, edge_do, edge_dl,
edge_as, edge_at, edge_ad, edge_aa, edge_ao,
edge_al, edge_os, edge_ot, edge_od, edge_oa,
edge_oo, edge_ol, edge_ls, edge_lt, edge_ld,
edge_la, edge_lo, edge_ll, op_short, op_int,
op_long, op_long_long, op_float, op_double, op_bit

Algorithm and Feature Subset Selection

In order to get a better insight of the scenarios, forward selections have been applied to the solvers and features to determine whether small subsets achieve comparable performances. Following this approach, we reduced the number of solvers from 18 to 6, resulting in a PAR10 score of 12297.623 for the reduced model. Analogously, the model that was generated based on 2 of the originally 167 features resulted in a PAR10 score of 12200.994. Below, you can find the list of the selected features and solvers:

Selected Features:
SP_bias_q25, Decl

Selected Solvers:
EBGlucose_1.0, Lingeling_587f_fixed_, QuteRSat_2011.05.12_fixed_, SAT09referencesolverprecosat_236, glucose_2,
minisathackreferenceminisat_2.2.0