All Constraints Experiment

PreviousMetric Failure NextMultisolution

Last updated 12 months ago

All Constraints Experiment

Early in the experimentation process, users often want to understand the relationship between parameters and metrics. In particular, users may want to study which parameter regions consistently yield high-performing models. By conducting an experiment with all , SigOpt users can efficiently search for many high-performing models as defined through constraints on each of the metrics under analysis. All-Constraint experiments focus on diverse parameter configurations, increasing the chances of finding models that meet business goals.

Diversity Accelerates Model Development

Let us go through an example. Suppose we want to classify chess end-games for White King and Rook against Black King. We use the UCI dataset known as created by Michael Bain and Arthur van Hoff at the Turing Institute, Glasgow, UK. We are interested in performing hyperparameter tuning of XGBoost models. We will use the following parameter space in our experiments:

list_of_parameters = [
  dict(name="num_boost_round", bounds=dict(min=1, max=200), type="int"),
  dict(name="eta", bounds=dict(min=-5, max=0), type="double"),
  dict(name="gamma", bounds=dict(min=0, max=5), type="double"),
  dict(name="max_depth", bounds=dict(min=1, max=32), type="int"),
  dict(name="min_child_weight", bounds=dict(min=1, max=5), type="double"),
]

Defining our metrics

Now, let us say that we want to search for models with a high F1 score and low model complexity. We then define two metrics: f1_score and the actual average_depth of the model. We are interested in models that achieve higher than 0.8 of f1_score and have average_depth lower than 10. It is also a good idea to store other metrics to inspect the models further. For example, we can keep track of each model's inference_time on the test set.

If we are confident that f1_score and average_depth capture everything about our problem, we can run a to search for the Pareto Efficient Frontier points. The minimum-performance thresholds can (optionally) be incorporated as .

xgb_multimetric_threshold = [
  {"name": "f1_score", "strategy": "optimize", "objective": "maximize", "threshold": 0.8},
  {"name": "average_depth", "strategy": "optimize", "objective": "minimize", "threshold": 10},
]

Our new All-Constraint experiment looks very similar, but replaces the optimize strategy with the constraint strategy.

xgb_all_constraints = [
  {"name": "f1_score", "strategy": "constraint", "objective": "maximize", "threshold": 0.8},
  {"name": "average_depth", "strategy": "constraint", "objective": "minimize", "threshold": 10},
]

SigOpt allows our users to store additional metrics for consideration during analysis of the experiment. These should be defined during model creation as well.

xgb_stored_metrics = [
  {"name": "inference_time", "strategy": "store"},
  {"name": "precision", "strategy": "store"},
  {"name": "recall", "strategy": "store"},
]

Running our experiment

With the above lists of parameters and metrics, we can easily create SigOpt experiments:

experiment_meta = dict(
  name="chess xgboost_experiment",
  parameters=list_of_parameters,
  metrics=active_metrics + xgb_stored_metrics,  # active_metrics is xgb_multimetric_threshold or xgb_all_constraints
  budget=150,
  parallel_bandwidth=1,
)

experiment = conn.experiments.create(**experiment_meta)
print(f"Created experiment: https://app.sigopt.com/experiment/{experiment.id}")

Dealing with unforeseen requirements

An All-Constraint experiment finds more points that satisfy the user's constraints, at the cost of a less well-defined Pareto frontier. Why is this valuable? Suppose that, after this experiment, we talk to other stakeholders of our project; now they explicitly state that low inference time is critical for our application. Instead of rerunning this experiment (which could take a while), we decide to revisit our current results. Below we display the results after filtering the points by inference time (less than 0.1s).

Since our Multimetric experiment had a limited goal (highest f1_score and lowest average_depth), all models failed to achieve low inference time. All-Constraint experiments recognize that other goals may exist, and they search for a diverse range of outcomes to service future demands. Specifically, note that:

The All-Constraint experiment found nine viable models, whereas the Multimetric experiment did not find models with low inference time.
None of the points from our earlier Pareto Efficient frontier met this prediction time requirement.

Analyzing parameters

The value of an All-Constraint experiment is most striking when we use our Parallel Coordinate plot. See the comparison to the Multimetric experiment below when we filter the models by inference_time. Notice that only models with low num_boost_round remain active.

There are some useful insights to be gained here about the parameters and the resulting metric values.

High num_boost_round yields high F1 score -- this is not surprising, but our Multimetric experiment learns this and then spends its energy exploiting that information to make a better Pareto frontier.
In contrast, All-Constraint finds models with low num_boost_round. That is critical for producing models with good performance and faster inference time.
For the full range of satisfactory models, all models require eta (learning rate) values between [-1.5, 0].
Most viable models have gamma values less than 3.
All-Constraint finds more models with lower max_depth than Multimetric, especially between values 5 and 15.
The entire range of min_child_weight values seems to produce acceptable results -- the metrics seem unaffected by this parameter alone. However, for satisfactory models, it looks like max_depth and min_child_weight are inversely correlated.

Conceptualizing the Value of an All-Constraint Experiment

Creating an All-Constraint Experiment

Core Module

from sigopt import Connection

conn = Connection(client_token="USER_TOKEN")
experiment = conn.experiments().create(
  name="All-constraint experiment",
  parameters=[
    dict(
      name="num_boost_round",
      bounds=dict(
        min=1,
        max=200,
        ),
      type="int"
      ),
    dict(
      name="eta",
      bounds=dict(
        min=-5,
        max=0,
        ),
      type="double"
      ),
    dict(
      name="gamma",
      bounds=dict(
        min=0,
        max=5,
        ),
      type="double"
      ),      
    ],
  metrics=[
    dict(
      name="f1_score",
      objective="maximize",
      strategy="constraint",      
      threshold=0.8,
      ),
    dict(
      name="average_depth",
      objective="minimize",
      strategy="constraint",      
      threshold=10,
      )
    ],
  observation_budget=65,
  parallel_bandwidth=2,
  )

print("Created experiment: https://app.sigopt.com/experiment/" + experiment.id)

EXPERIMENT=curl -s -X POST https://api.sigopt.com/v1/experiments -u "$SIGOPT_API_TOKEN": \
    -H "Content-Type: application/json" -d "`cat experiment_meta.json`"

JSON file defining the Experiment:

experiment_meta.json

{
  "name": "All-constraint experiment",
  "parameters": [
    {
      "name": "num_boost_round",
      "bounds": {
        "min": 1,
        "max": 200
      },
      "type": "int"
    },
    {
      "name": "eta",
      "bounds": {
        "min": -5,
        "max": 0
      },
      "type": "double"
    },
    {
      "name": "gamma",
      "bounds": {
        "min": 0,
        "max": 5
      },
      "type": "double"
    }
  ],
  "metrics": [
    {
      "name": "f1_score",
      "objective": "maximize",
      "strategy": "constraint",
      "threshold": 0.8
    },
    {
      "name": "average_depth",
      "objective": "minimize",
      "strategy": "constraint",
      "threshold": 10
    }
  ],
  "observation_budget": 65,
  "parallel_bandwidth": 2
}

import com.sigopt.SigOpt;
import com.sigopt.exception.SigoptException;
import com.sigopt.model.*;

import java.util.Arrays;

public class YourSigoptExperiment {
  public static Experiment createExperiment() throws SigoptException {
    Experiment experiment = Experiment.create()
      .data(
        new Experiment.Builder()
          .name("All-constraint experiment")
          .parameters(java.util.Arrays.asList(
            new Parameter.Builder()
              .name("eta")
              .bounds(new Bounds.Builder()
                .min(-5)
                .max(0)
                .build())
              .type("double")
              .build(),
            new Parameter.Builder()
              .name("gamma")
              .bounds(new Bounds.Builder()
                .min(0)
                .max(5)
                .build())
              .type("double")
              .build(),
            new Parameter.Builder()
              .name("num_boost_round")
              .bounds(new Bounds.Builder()
                .min(1)
                .max(200)
                .build())
              .type("int")
              .build()
            ))
          .metrics(java.util.Arrays.asList(
            new Metric.Builder()
              .name("average_depth")
              .objective("minimize")
              .strategy("constraint")
              .threshold(10)
              .build(),
            new Metric.Builder()
              .name("f1_score")
              .objective("maximize")
              .strategy("constraint")
              .threshold(0.8)
              .build()
            ))
          .observationBudget(65)
          .parallelBandwidth(2)
          .type("offline")
          .build()
        )
      .call();
    return experiment;
  }

AI Module

experiment = sigopt.create_experiment(
  name="All-constraint experiment",
  parameters=[
    dict(
      name="num_boost_round",
      bounds=dict(
        min=1,
        max=200,
      ),
      type="int",
    ),
    dict(
      name="eta",
      bounds=dict(
        min=-5,
        max=0,
      ),
      type="double",
    ),
    dict(
      name="gamma",
      bounds=dict(
        min=0,
        max=5,
      ),
      type="double",
    ),
  ],
  metrics=[
    dict(
      name="f1_score",
      objective="maximize",
      strategy="constraint",
      threshold=0.8,
    ),
    dict(
      name="average_depth",
      objective="minimize",
      strategy="constraint",
      threshold=10,
    ),
  ],
  budget=65,
  parallel_bandwidth=2,
)

print("Created experiment: https:/app.sigopt.com/experiment/" + experiment.id)

name: All-constraint experiment
parameters:
  - name: num_boost_round
    bounds:
      min: 1
      max: 200
    type: int
  - name: eta
    bounds:
      min: -5
      max: 0
    type: double
  - name: gamma
    bounds:
      min: 0
      max: 5
    type: double
metrics:
  - name: f1_score
    objective: maximize
    strategy: constraint
    threshold: 0.8
  - name: average_depth
    objective: minimize
    strategy: constraint
    threshold: 10
budget: 65
parallel_bandwidth: 2

As your Experiment executes, report the metric values to the corresponding Run:

# Report the metric values for a SigOpt Run
run.log_metric("f1_score", 0.803)
run.log_metric("average_depth", 2.78)

Selecting and Updating the Metric Thresholds

Core Module

experiment = conn.experiments(experiment.id).update(
  metrics=[
    dict(
      name="f1_score",
      threshold=0.85,
      ),
    dict(
      name="average_depth",
      threshold=8.0,
      )
    ]
  )

EXPERIMENT=`curl -s -X PUT https://api.sigopt.com/v1/experiments/EXPERIMENT_ID -u "$SIGOPT_API_TOKEN": \
  -H 'Content-Type: application/json' \
  -d "{\"metrics\":[{\"name\":\"f1_score\",\"threshold\":0.85},{\"name\":\"average_depth\",\"threshold\":8.0}]}"`

Experiment.update(experiment.getId())
  .data(
    new Experiment.Builder()
      .metrics(java.util.Arrays.asList(
        new Metric.Builder()
          .name("f1_score")
          .objective("maximize")
          .strategy("constraint")
          .threshold(0.85)
          .build(),
        new Metric.Builder()
          .name("average_depth")
          .objective("miminize")
          .strategy("constraint")
          .threshold(8.0)
          .build()
        ))
      .build()
    )
  .call();

Limitations

Budget must be set when an All-Constraint experiment is created.
The maximum number of constraint metrics is 4.
The maximum number of dimensions for All-Constraint is 50.

PreviousMetric Failure NextMultisolution

Last updated 12 months ago