#deployment

8 posts tagged with "deployment"

Deploying and Serving Fine-tuned Models

Merge adapters for single-tenant deployments. Keep them separate for multi-tenant ones. The serving architecture depends on how many customizations you're running.

Running Fine-tuned Models in Production

Fine-tuning a model is the easy part. Running it in production with checkpoints, evals, rollback, and serving is the hard part. Here's the full picture.

Safe Rollouts for LLM Changes

Model changes are high-risk deployments. Route 1% of traffic to the new model, compare outputs, then expand gradually. Here's the playbook.

The Checklist Before You Deploy

12 things to check before your LLM goes to production. Most teams skip at least half. That's how incidents happen.