A minimal Python library to stop your experiments from being a hot mess. Because "it worked on my machine" is not a valid scientific publication.
expyro
is your new lab assistant π§βπ¬ that automatically organizes your chaos: configurations, results, plots, and even that random log file you swear you'll look at later.
- ποΈ Structured Experiment Tracking: No more
final_final_v2_test.py
files. Each run gets its own fancy, timestamped folder. Look organized, even if you're not. - π― Type Safety: Your config isn't just a bunch of random numbers. It's a well-defined bunch of random numbers! Thanks, type hints!
- β»οΈ Reproducibility: Relive the magic (or the horror) of any past run. Perfect for when your advisor asks "can we get the results from last Tuesday?".
- π Artifact Generation: Automatically save your beautiful plots and tables. Make your future thesis-writing self cry tears of joy.
- πΎ Data Capture: Easily dump any other file (models, logs, a screenshot of your error) right into the experiment's folder.
Get the core package and become 10x more organized instantly:
pip install expyro
Level up your experiment-fu with optional extras:
# For making pretty, pretty plots (matplotlib)
pip install "expyro[matplotlib]"
# For turning results into sweet, sweet tables (pandas)
pip install "expyro[pandas]"
# I want it ALL! π€
pip install "expyro[all]"
Decorate your experiment function. It's like putting a lab coat on it.
from dataclasses import dataclass
from pathlib import Path
import expyro
# Step 1: Define your config. This is your recipe.
@dataclass
class TrainConfig:
learning_rate: float = 0.01 # The spice of life
batch_size: int = 32 # The bigger, the better (until it crashes)
epochs: int = 10 # The "are we there yet?" parameter
# Step 2: Declare your experiment. Give it a home ("runs/") and a name.
# Your experiment must take exactly one argument as a config.
# The input and output must be typed.
@expyro.experiment(root=Path("runs"), name="my_awesome_experiment")
def train_model(config: TrainConfig) -> dict[str, float]:
# Your brilliant (or "it should work") experiment logic goes here.
final_loss = 0.1 * config.learning_rate
final_accuracy = 0.9
# Return whatever you want to remember
return {"final_loss": final_loss, "final_accuracy": final_accuracy}
Call your experiment. Watch the magic happen.
if __name__ == "__main__":
cfg = TrainConfig(learning_rate=0.01, batch_size=32, epochs=10)
run = train_model(cfg) # This saves everything! You're welcome.
print(f"Run completed! Data is chilling in: {run.path}")
Automatically save plots and tables. Impress everyone.
import matplotlib.pyplot as plt
import pandas as pd
# Artist function: Takes config & result, returns a masterpiece (figure) or even a nested string dict of masterpieces
def create_plot(config: TrainConfig, result: dict) -> plt.Figure:
fig, ax = plt.subplots()
ax.bar(["Loss", "Accuracy"], [result["final_loss"], result["final_accuracy"]])
ax.set_title("How Did We Do?")
return fig
# Analyst function: Takes config & result, returns a sweet, sweet table (or a nested string dict of tables)
def create_table(config: TrainConfig, result: dict) -> pd.DataFrame:
return pd.DataFrame([{"metric": k, "value": v} for k, v in result.items()])
# Stack decorators like a pro! The order is bottom-up.
@expyro.plot(create_plot, file_format="pdf") # Save a high-res PDF
@expyro.table(create_table) # Save a CSV table
@expyro.experiment(root=Path("runs"), name="fancy_experiment")
def train_and_analyze(config: TrainConfig) -> dict:
# ... your code ...
return {"final_loss": 0.1, "final_accuracy": 0.9}
Use hook
to save anything else right into the experiment's folder.
@expyro.experiment(root=Path("runs"), name="experiment_with_everything")
def train_with_logging(config: TrainConfig) -> dict:
# Save a log file
with expyro.hook("logs/training_log.txt", "w") as f:
f.write(f"Let's hope this LR {config.learning_rate} works...\n")
f.write("Epoch 1: Loss=0.5 π¬\n")
f.write("Epoch 2: Loss=0.2 π\n")
# Save a model file (pytorch example)
# with expyro.hook("best_model.pt", "wb") as f:
# torch.save(model.state_dict(), f)
return {"final_loss": 0.1}
Iterate over past runs like a data archaeologist.
# Your experiment is now also a container for all its runs!
my_experiment = train_model # This is your decorated function
print("Behold, all my past runs:")
for run in my_experiment: # π Iterate over everything!
print(f"Run {run.path.name}: Config={run.config}, Result={run.result}")
# Load a specific run from its path
that_one_run = my_experiment["2024-05-27/12:30:45.123 abcdef00"]
print(f"Ah yes, the run where loss was: {that_one_run.result['final_loss']}")
Hereβs how expyro
organizes your brilliance:
runs/
βββ my_awesome_experiment/ # Your experiment name
βββ 2024-05-27/ # The date (so you know when you did the work)
βββ 12:30:45.123 abcdef00/ # Time & unique ID (so you can find it)
β βββ config.pickle # ποΈ Your configuration, pickled.
β βββ result.pickle # π Your results, also pickled.
β βββ artifacts/
β β βββ plots/ # π¨ Home for your beautiful graphs
β β β βββ create_plot.pdf
β β βββ tables/ # π Home for your elegant tables
β β βββ create_table.csv
β βββ data/ # πΎ Your custom files live here (from `hook`)
β βββ logs/
β βββ training_log.txt
βββ 14:22:10.456 1a2b3c4d/ # Another run! You've been busy!
βββ config.pickle
βββ result.pickle
Prefer the command line life? expyro
scans your project for decorated experiments and hands each one its own
subcommand. It's like giving every lab rat a keyboard. π
# Run a fresh experiment
expyro my_awesome_experiment run --learning-rate 0.01 --batch-size 32
# Reproduce an old run with the exact same config
expyro my_awesome_experiment reproduce "2024-05-27/12:30:45.123 abcdef00"
# Redo an artifact when you forgot to save that plot π¨
expyro my_awesome_experiment redo plots "2024-05-27/12:30:45.123 abcdef00"
Why so many verbs? Because reproducibility is king π:
run
starts a brand-new adventure and saves everything.reproduce
reruns an experiment with the original config, giving you a carbon-copy run for free.redo
regenerates plots or tables for an existing run, so you can tweak your visuals without touching the science.
All from the shell, all consistent, all reproducible. π
For detailed information for your specific setup, run
expyro --help
from the root directory of your project.
MIT License. Go forth and experiment! Just maybe use this library first.
Now go forth and reproduce! π