Tool Hut
AI developer infrastructure

Weights & Biases

MLOps platform — experiment tracking, model evaluation, fine-tuning monitoring.

Add Weights & Biases to your hut →

Weights & Biases (W&B) is the standard MLOps platform for experiment tracking and model evaluation. When training or fine-tuning a model, W&B logs every run's metrics, hyperparameters, gradients, and outputs to a shareable dashboard — so you can compare runs, spot regressions, and reproduce results. Used by researchers at OpenAI, Google, Hugging Face, and most ML teams in production. The LLM tools (Weave) extend this to prompt tracking, evaluation, and tracing for production AI applications.

Free for personal and academic use. Teams starts at $50/user/mo. Most often compared to MLflow (open-source, self-hostable) and Comet — W&B's edge is the richest visualisation and collaboration features and the broadest adoption in the ML research community.

Made byWeights & Biases
PricingFree (personal) · Teams $50/user/mo · Enterprise (custom)
Best forExperiment tracking, fine-tuning monitoring, model evaluation, MLOps teams

Alternatives to Weights & Biases