Pricing
Deploy accurate, fast, secure, and cost-efficient models tuned to the data that matters most to your business. We offer three deployment options to give you full control over your data and models.
On-demand
Pay as you go, new users get $300 free credit
$0.50/1M tokens
$1/tuning step
$1/tuning step
$0.50/1M inference tokens - one price for input, output, and JSON output
$1/tuning step - scale number of steps based on your data
Linear multiplier - burst tuning across multiple GPUs or nodes for faster performance (e.g. x3 for 3 GPUs)
Try the full lifecycle: choose a model, RAG/prompt tune, memory tune, evaluate, and run inference
Access to top open source models like Llama 3.1, Mistral v0.3, and Phi 3
Runs on Lamini’s optimized compute platform, generating state-of-the-art MoME models
Reserved
Don't have your own GPUs? Get dedicated GPUs from Lamini's cluster
Custom
Run on reserved GPUs from Lamini
Unlimited tuning and inference
Unmatched inference throughput
Full evaluation suite
Access to world-class ML experts
Enterprise support
Self-managed
Run Lamini in your own secure environment (VPC, on-prem, air-gapped)
Custom
Run Lamini on your own GPUs
No internet access needed
Pay per software license
Full evaluation suite
Access to world-class ML experts
Enterprise support
Free
Sed sed risus pretium quam vulputate.
$0/year
Upto 10 projects
Customizable dashboard
Upto 50 tasks
Upto 1 GB storage
Starter
Sed sed risus pretium quam vulputate.
$250/year
Upto 10 projects
Customizable dashboard
Upto 50 tasks
Upto 1 GB storage
Unlimited proofings
Unlimited proofings
Pro
Sed sed risus pretium quam vulputate.
$400/year
Upto 10 projects
Customizable dashboard
Upto 50 tasks
Upto 1 GB storage
Unlimited proofings
Unlimited custom fields
Unlimited milestones
Unlimited timeline
Trusted by Fortune 500 & Leading startups