Evaluating Component Solver Contributions to Portfolio-Based Algorithm Selectors

Similar documents
Monomino-Domino Tatami Coverings

The Role of EIA for Decision- Making and Formulation of Alternatives Evaluation of Influence

Received: 26 th Feb-2014 Revised: 29 th March-2014 Accepted: 30 th March-2014 Research article

Parallelization of an Urban Microscale Flow Model (urbanstream)

ON-LINE SENSOR CALIBRATION MONITORING AND FAULT DETECTION FOR CHEMICAL PROCESSES

Scholars Research Library. The Role of Plant Clinic in Protecting Vertical Urban Green Spaces in Tehran

Implementation and testing of a model for the calculation of equilibrium between components of a refrigeration installation

Minimax: A Multiwinner Election Procedure

Clemens Felsmann 1, Jean Lebrun 2, Vincent Lemort 2 and Aad Wijsman 3 ABSTRACT INTRODUCTION METHODOLOGY

Supporting Information

Testy Testing Makes for Better Transformation

Predicting Narrow-band and Wideband Speech Quality with WB-PESQ and TOSQA

NEW CD WARP CONTROL SYSTEM FOR THE CORRUGATING INDUSTRY

Prediction of Soil Infiltration Rate Based on Sand Content of Soil

Finned Heat Sinks for Cooling Outdoor Electronics under Natural Convection

Performance of Water-in-Glass Evacuated Tube Solar Water Heaters

Virtual Household Refrigerators at Steady-State and Transient Conditions. Numerical Model and Experimental Validation.

Domino Tatami Covering is NP-complete

TECHNICAL BRIEF BACKGROUND THE BIDGELY SOLUTION. Bidgely Hybrid Model. 100% Appliance Itemization

Brief Synopsis of the Japanese MOE Sponsored Evaluation of Wind Turbine Noise

Computer Modelling and Simulation of a Smart Water Heater

Where Technology Shapes Solutions. Alarm management : Wasn t that problem already solved years ago?

Optimized Finned Heat Sinks for Natural Convection Cooling of Outdoor Electronics

TESTS OF ADSIL COATING

Applied Data Science: Using Machine Learning for Alarm Verification

A new model for CEN/TC 89 Thermal performance of buildings and building components

Development of Small-diameter Tube Heat Exchanger: Circuit Design and Performance Simulation

BC Hydro Energy Star Appliances

Global Journal of Engineering Science and Research Management

Enhancement of COP using Nanoadditives in Domestic Refrigerator

TYPE APPROVAL CERTIFICATE

Analysis of Landscape Character for Visual Resource Management 1

Working Paper Series 09/2008. The strategic role of the plant in international networks: a longitudinal study

CAN THE CONE CALORIMETER BE USED TO PREDICT FULL SCALE HEAT AND SMOKE RELEASE CABLE TRAY RESULTS FROM A FULL SCALE TEST PROTOCOL?

Official Journal of the European Union. (Non-legislative acts) REGULATIONS

Performance Comparison of Hydronic Secondary Loop Heat Pump and Conventional Air-Source Heat Pump ABSTRACT 1. INTRODUCTION

Plant Performance 2017 User Symposium

Available online at Energy Procedia 6 (2011) MEDGREEN 2011-LB

Avirup Sil and Alexander Yates (Temple University) Ernest Cronin, Penghai Nie, Yinfei Yang (St. Joseph s University) Ana-Maria Popescu (Yahoo!

TYPE APPROVAL CERTIFICATE

Virginia TrueAllele Validation Study: Casework Comparison. One person, one genotype. DNA data. Cybergenetics

Intrusion Detection System: Facts, Challenges and Futures. By Gina Tjhai 13 th March 2007 Network Research Group

R&D for the improvement of O&M in CSP plants. Dr. Marcelino Sánchez. - November,

Landscape-scale simulation for terrestrial population modelling and ERA

How Important Are Bacteria for the Vase Life of Cut Gerbera Flowers?

Methodology and Application of Pattern Mining in Multiple Alarm Flood Sequences

HOT IN HERE: OFFICE OCCUPANT THERMAL COMFORT IN LAWRENCE HALL

Faster GPS via the Sparse Fourier Transform. Haitham Hassanieh Fadel Adib Dina Katabi Piotr Indyk

Real time Video Fire Detection using Spatio-Temporal Consistency Energy

Information Effects on Residential Energy Conservation: A Japanese Experiment

Computers and Chemical Engineering

Application Note. Application Note for BAYEX

Contents. 1 Planning vs. problem solving. 2 Planning in the situation calculus. 3 STRIPS formalism. 4 Non-linear planning. 5 The POP algorithm

OPERATION OF RADIATION PORTAL MONITORS OF SPECIAL NUCLEAR MATERIALS AT NUCLEAR SITES IN TERMS OF RELIABILITY

Ecography. Supplementary material

Recent BRANZFIRE enhancements and validation

AS SEEN IN YOUR MOST TRUSTED FRANCHISE RESOURCE. March April Building A Concept. The Harding siblings launch a new brand of affordable luxury

Thermal Properties and Temperature

femto newsletter 2013/4 Femto on Jetfire Top Partner award 3 Bullet impact 6 Femto on JetFire 4 Happy holidays 8

CONVENTION 192 PATIENTS 384 PATIENTS BREAKTHROUGH CONCEPT

AN ANALYSIS OF THE ACTUAL THERMAL PLUMES OF KITCHEN APPLIANCES DURING COOKING MODE

Modelling of Split Condenser Heat Pump with Limited Set of Plate Heat Exchanger Dimensions

2. HEAT EXCHANGERS MESA

COP-Based Performance Evaluation of Domestic Refrigerators using Accelerated Flow Evaporators

How to Use Fire Risk Assessment Tools to Evaluate Performance Based Designs

A WEB-BASED SIMULATION TOOL ON THE PERFORMANCE OF DIFFERENT ROOFING SYSTEMS

Sensor Placement Guidance for Small Utilities

INSTITUTO DE INGENIERÍA ENERGÉTICA. (Institute for Energy Engineering) Research Publications

Advanced Pattern Recognition for Anomaly Detection Chance Kleineke/Michael Santucci Engineering Consultants Group Inc.

FESTIVAL AS AN AGENT FOR BEHAVIOR CHANGE: A STUDY OF GARDEN FESTIVAL IN MALAYSIA

PERFORMANCE EVALUATION OF GAS COOLER/ CONDENSER FOR CO2 REFRIGERATION SYSTEMS

Field Evaluation of Pre-Commercial Residential Gas Heat Pump Water Heaters

Playing in the Dirt: Discovering Soil

FLACS-Fire. Djurre Siccama Gexcon R&D and Software VP Products. FLUG 3 rd - 4 th. November 2015, Shanghai China

White Paper: Video/Audio Analysis Technology. hanwhasecurity.com

Heat Transfer in Evacuated Tubular Solar Collectors

HOUSEHOLD APPLIANCES

Air Flow Study inside the Supermarket Refrigeration System

LANDSCAPE VISUAL QUALITY ASSESSMENT IN WASHTENAW COUNTY, MI

Organic Production of Tobacco Transplants: The. Nitrogen Dilemma

Impacts of an Energy Star Promotion

FIMD: Fine-grained Device-free Motion Detection

MODELLING AND OPTIMIZATION OF DIRECT EXPANSION AIR CONDITIONING SYSTEM FOR COMMERCIAL BUILDING ENERGY SAVING

Study and development of an innovative 3G/4G wireless network analysis tool

MATERIALS AND METHODS

Field Trial of Residential Ammonia-Water Absorption Heat Pump Water Heaters

Kimberly Rollins, Professor, Department of Economics, University of Nevada, Reno

TrueAllele Technology Computer interpretation of DNA evidence

Feasibility Study of Condensation Heat Exchanger with Helical Tubes for a Passive Auxiliary Feedwater System

Keywords: Triple tube heat exchanger (TTHE), hot water, cold water, CFD simulation.

Study into the operation of a convector heater at low temperatures

Benefits of Enhanced Event Analysis in. Mark Miller

VALIDATION OF FINITE ELEMENT MODELS OF INJURY RISK IN VEHICLE-ROADSIDE BARRIER CRASHES

Variable Refrigerant Flow

2016 FIRST SOLAR ENERGY SERVICES VITALS

A Statistical Analysis of a Liquid Desiccant Dehumidifier/Regenerator in an Air Conditioning System

Statistical Analysis of Criteria and Key Aspects for Urban Design Quality Assessment of Built Environment

Philips Semiconductors

Heat Transfer Enhancement using Herringbone wavy & Smooth Wavy fin Heat Exchanger for Hydraulic Oil Cooling

BULLETIN POLL: HOME SAFETY 2012 COMPARISON OF THOSE YEARS AND THOSE AGE 50+

Transcription:

Evaluating Component Solver Contributions to Portfolio-Based Algorithm Selectors Lin Xu, Frank Hutter, Holger Hoos, Kevin Leyton-Brown BETA Lab Department of Computer Science University of British Columbia Canada

SAT Competitions help... establish benchmarks Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 2

SAT Competitions help... establish benchmarks assess state of the art Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 2

SAT Competitions help... establish benchmarks assess state of the art promote solvers, solver development Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 2

Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 3

2009: 5 of 27 medals 2011: 30 of 54 medals Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 3

What is the state of the art in SAT solving? single best solver, SBS (= winner of competition category)? Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 4

What is the state of the art in SAT solving? single best solver, SBS (= winner of competition category)? virtual best solver, VBS (= oracle) over winners of competition categories? Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 4

What is the state of the art in SAT solving? single best solver, SBS (= winner of competition category)? virtual best solver, VBS (= oracle) over all solvers from competition? Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 4

What is the state of the art in SAT solving? single best solver, SBS (= winner of competition category)? virtual best solver, VBS (= oracle) over all solvers from competition? portfolio-based selector over all solvers from competition? Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 4

Meta-algorithmic techniques rule! instance-based solver selection (e.g., SATzilla, 3S) Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 5

Meta-algorithmic techniques rule! instance-based solver selection (e.g., SATzilla, 3S) parallel solver portfolios (e.g., ManySAT, ppfolio, plingeling) Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 5

Meta-algorithmic techniques rule! instance-based solver selection (e.g., SATzilla, 3S) parallel solver portfolios (e.g., ManySAT, ppfolio, plingeling) sequential solver schedules (used in SATzilla, 3S) Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 5

Meta-algorithmic techniques rule! instance-based solver selection (e.g., SATzilla, 3S) parallel solver portfolios (e.g., ManySAT, ppfolio, plingeling) sequential solver schedules (used in SATzilla, 3S) SATzilla-2009: 3+2 = 5/27 medals in 2009 SAT Competition Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 5

Meta-algorithmic techniques rule! instance-based solver selection (e.g., SATzilla, 3S) parallel solver portfolios (e.g., ManySAT, ppfolio, plingeling) sequential solver schedules (used in SATzilla, 3S) SATzilla-2009: 3+2 = 5/27 medals in 2009 SAT Competition ppfolio: 5+5+6 = 16/54 medals in 2011 SAT Competition 3S: 2+1+3 = 7/54 medals in 2011 SAT Competition Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 5

Meta-algorithmic techniques rule! instance-based solver selection (e.g., SATzilla, 3S) parallel solver portfolios (e.g., ManySAT, ppfolio, plingeling) sequential solver schedules (used in SATzilla, 3S) SATzilla-2009: 3+2 = 5/27 medals in 2009 SAT Competition ppfolio: 5+5+6 = 16/54 medals in 2011 SAT Competition 3S: 2+1+3 = 7/54 medals in 2011 SAT Competition Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 5

When does portfolio-based selection work well? several strong & weakly/un-correlated component solvers Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 6

When does portfolio-based selection work well? several strong & weakly/un-correlated component solvers informative & cheaply computable features Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 6

When does portfolio-based selection work well? several strong & weakly/un-correlated component solvers informative & cheaply computable features effective selector construction technique informative set of training data Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 6

How to improve the state of the art in SAT solving? several strong & weakly/un-correlated component solvers informative & cheaply computable features effective selector construction technique informative set of training data Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 7

How to improve the state of the art in SAT solving? several strong & weakly/un-correlated component solvers informative & cheaply computable features effective selector construction technique informative set of training data Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 7

Goals in (non-portfolio) solver development: (A) better all-round performance (required to do well in competition under current scoring) Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 8

Goals in (non-portfolio) solver development: (A) better all-round performance (required to do well in competition under current scoring) (B) better performance on certain types of instances (rewarded under purse-based scoring, van Gelder et al., 2005) Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 8

Goals in (non-portfolio) solver development: (A) better all-round performance (required to do well in competition under current scoring) (B) better performance on certain types of instances (rewarded under purse-based scoring, van Gelder et al., 2005) state of the art (SOTA) = portfolio-based selector (B) is more effective in improving SOTA Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 8

SOTA portfolio strongest portfolio-based solver that can be fully automatically constructed from available solvers Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 9

SOTA portfolio strongest portfolio-based solver that can be fully automatically constructed from available solvers Marginal contribution of solver S to SOTA portfolio P difference in performance of P with and without S (trained separately) Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 9

SOTA portfolio strongest portfolio-based solver that can be fully automatically constructed from available solvers Marginal contribution of solver S to SOTA portfolio P difference in performance of P with and without S (trained separately) frequency of selecting S Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 9

SOTA portfolio strongest portfolio-based solver that can be fully automatically constructed from available solvers Marginal contribution of solver S to SOTA portfolio P difference in performance of P with and without S (trained separately) frequency of selecting S fraction of instances solved by S Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 9

SOTA portfolio strongest portfolio-based solver that can be fully automatically constructed from available solvers Marginal contribution of solver S to SOTA portfolio P difference in performance of P with and without S (trained separately) frequency of selecting S fraction of instances solved by S contribution of S to VBS Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 9

How SATzilla Works Instance Feature extractor Algorithm selector Minimal cost feature extractor Feature cost predictor Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 10

How SATzilla Works Instance Feature extractor Algorithm selector Minimal cost feature extractor Feature cost predictor SATzilla 2011 uses... cost-sensitive decision forests for every pair of solvers Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 10

How SATzilla Works Instance Feature extractor Algorithm selector Minimal cost feature extractor Feature cost predictor SATzilla 2011 uses... cost-sensitive decision forests for every pair of solvers voting to select solver to be run Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 10

Empirical Analysis all instances from 2011 SAT Competition: 300 Application; 300 Crafted; 300 Random Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 11

Empirical Analysis all instances from 2011 SAT Competition: 300 Application; 300 Crafted; 300 Random candidate solvers from 2011 SAT Competition: for determining VBS and SBS: all solvers from Phase 2 of competition: 31 Application; 25 Crafted; 17 Random Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 11

Empirical Analysis all instances from 2011 SAT Competition: 300 Application; 300 Crafted; 300 Random candidate solvers from 2011 SAT Competition: for determining VBS and SBS: all solvers from Phase 2 of competition: 31 Application; 25 Crafted; 17 Random for building SATzilla: all sequential, non-portfolio solvers from Phase 2: 18 Application; 15 Crafted; 9 Random Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 11

Empirical Analysis all instances from 2011 SAT Competition: 300 Application; 300 Crafted; 300 Random candidate solvers from 2011 SAT Competition: for determining VBS and SBS: all solvers from Phase 2 of competition: 31 Application; 25 Crafted; 17 Random for building SATzilla: all sequential, non-portfolio solvers from Phase 2: 18 Application; 15 Crafted; 9 Random SATzilla assessed by 10-fold cross validation Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 11

SATzilla 2011 Performance (Inst. Solved) Solver Application Crafted Random VBS 84.7% 76.3% 82.2% SATzilla 2011 75.3% 66.0% 80.8% SATzilla 2009 70.3% 63.0% 80.3% Gold medalist (SBS) 71.7% 54.3% 68.0% Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 12

SATzilla 2011 vs 3S Fair Comparison same 26 candidate solvers, features, training data 5000 sec cutoff time same machine, instance set for testing Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 13

SATzilla 2011 vs 3S Fair Comparison same 26 candidate solvers, features, training data 5000 sec cutoff time same machine, instance set for testing SATzilla 2011 3S SBS = MXC09 VBS Inst. Solved 68.3% 67.4% 38.2% 76.9% PAR10 16 166 16 442 31 185 11 836 (combined results for all 3 categories) Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 13

SATzilla 2011 vs 3S Fair Comparison same 26 candidate solvers, features, training data 5000 sec cutoff time same machine, instance set for testing SATzilla 2011 3S SBS = MXC09 VBS Inst. Solved 68.3% 67.4% 38.2% 76.9% PAR10 16 166 16 442 31 185 11 836 (combined results for all 3 categories) Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 13

SATzilla 2011 vs 3S Fair Comparison same 26 candidate solvers, features, training data 5000 sec cutoff time same machine, instance set for testing SATzilla 2011 3S SBS = MXC09 VBS Inst. Solved 68.3% 67.4% 38.2% 76.9% PAR10 16 166 16 442 31 185 11 836 (combined results for all 3 categories) Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 13

SATzilla 2011 Performance (Inst. Solved) Solver Application Crafted Random VBS 84.7% 76.3% 82.2% SATzilla 2011 75.3% 66.0% 80.8% SATzilla 2009 70.3% 63.0% 80.3% Gold medalist (SBS) 71.7% 54.3% 68.0% Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 14

Performance of Individual Solvers Application RestartSAT Cirminisat Minisat EBMinisat Minisatagile Glueminisat LR GL SHR Precosat Lingeling MPhaseSAT64 Contrasat Minisat_psm Rcl Glucose1 Glucose2 EBGlucose CryptoMinisat QuteRSat 0 2 60 80 100 Percentage Solved 5000 CPU sec cutoff Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 15

Correlation of Solver Performance Application RestartSAT Cirminisat Minisat EBMinisat Minisatagile Glueminisat LR GL SHR Precosat Lingeling MPhaseSAT64 Contrasat Minisat_psm Rcl Glucose1 Glucose2 EBGlucose CryptoMinisat QuteRSat RestartSAT Cirminisat Minisat EBMinisat Minisatagile Glueminisat LR GL SHR Precosat Lingeling MPhaseSAT64 Contrasat Minisat_psm Rcl Glucose1 Glucose2 EBGlucose CryptoMinisat QuteRSat darker = higher Spearman correlation coefficient Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 16

Correlation of Solver Performance Random Sparrow EagleUP Gnovelty+2 TNM Sattime11 Adaptg2wsat11 MPhaseSAT_M March_rw March_hi Sparrow EagleUP Gnovelty+2 TNM Sattime11 Adaptg2wsat11 MPhaseSAT_M March_rw March_hi darker = higher Spearman correlation coefficient Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 17

Solver Selection Frequency in SATzilla 2011 Application Glucose2 (Backup) Solved by Presolvers Glucose2 Glueminisat QuteRSat Precosat Other Solvers Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 18

Instances Solved by SATzilla 2011 Components Application Glucose2 (Backup) Unsolved Glucose2 (Pre1) Other Solvers Glucose2 Glueminisat (Pre1) Glueminisat QuteRSat EBGlucose EBGlucose (Pre1) Precosat Minisat psm Minisat psm (Pre1) Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 19

Marginal Contribution of Components Application RestartSAT Cirminisat Minisat EBMinisat Minisatagile Glueminisat LR GL SHR Precosat Lingeling MPhaseSAT64 Contrasat Minisat_psm Rcl Glucose1 Glucose2 EBGlucose CryptoMinisat QuteRSat 0 2 4 6 8 10 Marginal Contribution (%) Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 20

Instances Solved vs Marginal Contribution of Components Application 10 Marginal Contribution 8 6 4 2 0 0 10 20 30 40 50 60 % Solved by Component Solver Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 21

Instances Solved vs Marginal Contribution of Components Application 10 Marginal Contribution 8 6 4 2 MPhaseSAT64 Glueminisat 0 0 10 20 30 40 50 60 % Solved by Component Solver Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 21

Instances Solved vs Marginal Contribution of Components Crafted 10 Marginal Contribution 8 6 4 2 0 0 10 20 30 40 50 60 % Solved by Component Solver Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 22

Instances Solved vs Marginal Contribution of Components Crafted 10 Marginal Contribution 8 6 4 2 Sattime MPhaseSAT Sol Clasp2 0 0 10 20 30 40 50 60 % Solved by Component Solver Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 22

Instances Solved vs Marginal Contribution of Components Crafted 10 Marginal Contribution 8 6 4 2 Sattime MPhaseSAT Sol Clasp2 Joint contributions: - 2 Clasp variants = 6.3% - 2 Sattime variants = 5.4% 0 0 10 20 30 40 50 60 % Solved by Component Solver Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 22

Instances Solved vs Marginal Contribution of Components Random 10 Marginal Contribution 8 6 4 2 0 0 10 20 30 40 50 60 % Solved by Component Solver Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 23

Instances Solved vs Marginal Contribution of Components Random 10 Marginal Contribution 8 6 4 2 Sparrow EagleUP March_rw 0 0 10 20 30 40 50 60 % Solved by Component Solver Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 23

Instances Solved vs Marginal Contribution of Components Random Marginal Contribution 10 8 6 4 2 Joint contributions: - 2 March variants = 4% - 6 LS solvers = 22.5% Sparrow EagleUP March_rw 0 0 10 20 30 40 50 60 % Solved by Component Solver Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 23

Conclusions State of the arts in SAT solving: portfolio-based algorithm selectors Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 24

Conclusions State of the arts in SAT solving: portfolio-based algorithm selectors Use marginal contributions to SOTA portfolio to assess value of solvers for improving state of the art. Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 24

Conclusions State of the arts in SAT solving: portfolio-based algorithm selectors Use marginal contributions to SOTA portfolio to assess value of solvers for improving state of the art. To promote development of strong, uncorrelated solvers: Give formal recognition to solvers contributing most to SOTA portfolio(s). Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 24

Conclusions State of the arts in SAT solving: portfolio-based algorithm selectors Use marginal contributions to SOTA portfolio to assess value of solvers for improving state of the art. To promote development of strong, uncorrelated solvers: Give formal recognition to solvers contributing most to SOTA portfolio(s). Evaluate portfolio-based solvers separately. Xu, Hutter, Hoos, Leyton-Brown: Evaluating Portfolio-Based Algorithm Selectors 24