Create an experiment - Weights & Biases Documentation

Use the W&B Python SDK to track machine learning experiments. You can then review the results in an interactive dashboard or export your data to Python for programmatic access with the W&B Public API. This guide describes how to use W&B building blocks to create a W&B Experiment so you can capture hyperparameters, log metrics, and save model artifacts for later analysis and comparison. This page is for you if you train machine learning models in Python and want to track your work with W&B.

Create a W&B Experiment

Create a W&B Experiment in four steps:

Initialize a W&B Run.
Capture a dictionary of hyperparameters.
Log metrics inside your training loop.
Log an artifact to W&B.

The following sections describe each step in detail.

Initialize a W&B run

A run is the basic unit of computation tracked by W&B, and every experiment begins by creating one. Use wandb.init() to create a W&B Run. The following snippet creates a run in a W&B project named "cat-classification" with the description "My first experiment" to help identify this run. The tags "baseline" and "paper1" indicate that this run is a baseline experiment intended for a future paper publication.

import wandb

with wandb.init(
    project="cat-classification",
    notes="My first experiment",
    tags=["baseline", "paper1"],
) as run:
    ...

wandb.init() returns a Run object.

W&B adds runs to pre-existing projects if that project already exists when you call wandb.init(). For example, if you already have a project called "cat-classification", that project continues to exist and isn’t deleted. Instead, W&B adds a new run to that project.

Capture a dictionary of hyperparameters

Save a dictionary of hyperparameters such as learning rate or model type. The model settings you capture in config are useful later to organize and query your results.

with wandb.init(
    ...,
    config={"epochs": 100, "learning_rate": 0.001, "batch_size": 128},
) as run:
    ...

For more information about how to configure an experiment, see Configure Experiments.

Log metrics inside your training loop

Log metrics during training to monitor progress and compare runs in the W&B dashboard. Call run.log() to log metrics about each training step such as accuracy and loss.

model, dataloader = get_model(), get_data()

for epoch in range(run.config.epochs):
    for batch in dataloader:
        loss, accuracy = model.training_step()
        run.log({"accuracy": accuracy, "loss": loss})

For more information about different data types you can log with W&B, see Log Data During Experiments.

Log an artifact to W&B

Optionally log a W&B Artifact. Artifacts let you version datasets and models.

# You can save any file or even a directory. This example assumes
# the model has a save() method that outputs an ONNX file.
model.save("path_to_model.onnx")
run.log_artifact("path_to_model.onnx", name="trained-model", type="model")

Learn more about Artifacts or about versioning models in Registry.

Complete example

The following script combines the previous code snippets into a complete experiment that initializes a run, captures configuration, logs metrics, and saves a model artifact:

import wandb

with wandb.init(
    project="cat-classification",
    notes="",
    tags=["baseline", "paper1"],
    # Record the run's hyperparameters.
    config={"epochs": 100, "learning_rate": 0.001, "batch_size": 128},
) as run:
    # Set up model and data.
    model, dataloader = get_model(), get_data()

    # Run your training while logging metrics to visualize model performance.
    for epoch in range(run.config["epochs"]):
        for batch in dataloader:
            loss, accuracy = model.training_step()
            run.log({"accuracy": accuracy, "loss": loss})

    # Upload the trained model as an artifact.
    model.save("path_to_model.onnx")
    run.log_artifact("path_to_model.onnx", name="trained-model", type="model")

Next steps: visualize your experiment

After your run completes, the metrics, configuration, and artifacts you logged are available in the W&B App. Use the W&B Dashboard as a central place to organize and visualize results from your machine learning models. With a few clicks, construct interactive charts such as parallel coordinates plots, parameter importance analyses, and additional chart types.

For more information about how to view experiments and specific runs, see Visualize results from experiments.

Best practices

The following are some suggested guidelines to consider when you create experiments:

Finish your runs: Use wandb.init() in a with statement to automatically mark the run as finished when the code completes or raises an exception.
- In Jupyter notebooks, it might be more convenient to manage the Run object yourself. In this case, you can explicitly call finish() on the Run object to mark it complete:
  # In a notebook cell: run = wandb.init() # In a different cell: run.finish()
Config: Track hyperparameters, architecture, dataset, and anything else you’d like to use to reproduce your model. These show up in columns. Use config columns to group, sort, and filter runs dynamically in the app.
Project: A project is a set of experiments you can compare together. Each project gets a dedicated dashboard page, and you can turn on and off different groups of runs to compare different model versions.
Notes: Set a quick commit message directly from your script. Edit and access notes in the Overview section of a run in the W&B App.
Tags: Identify baseline runs and favorite runs. You can filter runs using tags. You can edit tags later on the Overview section of your project’s dashboard on the W&B App.
Create multiple run sets to compare experiments: When you compare experiments, create multiple run sets to make metrics easy to compare. You can toggle run sets on or off on the same chart or group of charts.

The following code snippet demonstrates how to define a W&B Experiment using the previous best practices:

import wandb

config = {
    "learning_rate": 0.01,
    "momentum": 0.2,
    "architecture": "CNN",
    "dataset_id": "cats-0192",
}

with wandb.init(
    project="detect-cats",
    notes="tweak baseline",
    tags=["baseline", "paper1"],
    config=config,
) as run:
    ...

For more information about available parameters when defining a W&B Experiment, see the wandb.init() API docs in the API Reference Guide.

​Create a W&B Experiment

​Initialize a W&B run

​Capture a dictionary of hyperparameters

​Log metrics inside your training loop

​Log an artifact to W&B

​Complete example

​Next steps: visualize your experiment

​Best practices

Create a W&B Experiment

Initialize a W&B run

Capture a dictionary of hyperparameters

Log metrics inside your training loop

Log an artifact to W&B

Complete example

Next steps: visualize your experiment

Best practices