Model selector
Default models per plan
Auto-assigned:
- Free. Lightweight model. Fast and capable.
- Starter, Growth. Mid-tier. Better reasoning.
- Business. Higher-capability. Best for complex queries.
- Enterprise. Custom or bring-your-own.
Override on Business
For Business workspaces:
- Settings > Model Selector.
- Pick from available list.
- Save.
Override applies within 30 seconds to new conversations.
Bring your own model (Enterprise)
For Enterprise customers:
- Self-hosted open-source models. Llama, Mistral, etc.
- Private cloud deployments (your AWS/GCP).
- Custom fine-tuned models.
Configure via Enterprise onboarding.
Comparing models
Use Compare Models in Chat Playground:
- Run same query against 2 to 3 models.
- See quality, latency, cost differences.
- Pick based on your priority.
Cost vs quality trade-offs
Roughly:
- Lightweight (Free tier). Fastest, lowest cost per token. Good for simple FAQs.
- Mid-tier (Growth). Better at multi-step reasoning.
- High-capability (Business). Best for complex queries, long context.
For high-volume bots: prefer mid-tier unless quality requires higher.
Per-skill model override
Some skills can use different models:
subscription_managercan pin a deterministic model for billing accuracy.content_recommendercan use a faster model for low-stakes suggestions.
Configure under each skill's Advanced > Model.
Model versioning
When AskVault rolls out a new version:
- Default model auto-updates to latest unless pinned.
- Pin specific version on Business and above for stability.
- Notification 30 days before retiring old versions.
Limits
- Available models. Plan-dependent.
- Override granularity. Workspace or per-skill.
- Model change rate. Up to 10 per day (anti-thrashing).
Common pitfalls
Switched to a lower model, quality dropped. Revert in 30 seconds; or fine-tune the system prompt for the lower model.
Latency jumped after model switch. Higher-capability models often slower. Trade-off.
Cost spike. Higher-tier model means more tokens per query. Audit usage post-switch.
FAQ
Can I see which model is currently active?
Yes under Settings > Model Selector > Status.
Will switching models lose conversation context?
No. Past messages preserve; the next turn uses the new model.
Does switching affect cost immediately?
Yes. Next query bills at new model rate.