Pricing
Plans for startups and enterprises
Deploy accurate, fast, secure, and cost-efficient agents specialized on the data that matters to your business.
On-demand
$0.50/1M tokens
or $1/tuning step
Pay as you go, new users get $300 free credit.
$0.50/1M inference tokens - one price for input, output, and JSON output
$1/tuning step - scale number of steps based on your data
Linear price multiplier - burst tuning across multiple GPUs or nodes for faster performance (e.g., 3x for 3 GPUs)
Try the full lifecycle: choose a model, RAG/prompt tune, memory tune, evaluate, and run inference
Access to top open-source models like Llama 3.1, Mistral v0.3, and Phi 3
Runs on Lamini’s optimized compute platform, generating state-of-the-art MoME models
Get started
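As a rough illustration of how the On-demand rates combine, here is a minimal Python sketch. The rates and the $300 credit come from the plan above; the function names are hypothetical, and the sketch assumes the linear multiplier applies only to the tuning-step price and that the free credit covers both inference and tuning. Treat it as an estimate, not a billing tool.

```python
# Rates copied from the On-demand plan above.
PRICE_PER_MILLION_TOKENS = 0.50  # USD, same price for input and output tokens
PRICE_PER_TUNING_STEP = 1.00     # USD per tuning step on a single GPU
FREE_CREDIT = 300.00             # USD credit for new users


def estimate_on_demand_cost(inference_tokens: int, tuning_steps: int, gpus: int = 1) -> float:
    """Estimate the USD cost before credit (hypothetical helper).

    Assumption: the linear multiplier scales only the tuning-step price,
    e.g. 3 GPUs means each tuning step costs 3x.
    """
    if inference_tokens < 0 or tuning_steps < 0 or gpus < 1:
        raise ValueError("tokens and steps must be >= 0, gpus must be >= 1")
    inference_cost = inference_tokens / 1_000_000 * PRICE_PER_MILLION_TOKENS
    tuning_cost = tuning_steps * PRICE_PER_TUNING_STEP * gpus
    return inference_cost + tuning_cost


def amount_due_after_credit(cost: float, credit: float = FREE_CREDIT) -> float:
    """Subtract the free credit, never going below zero (assumed behavior)."""
    return max(0.0, cost - credit)


if __name__ == "__main__":
    # 20M inference tokens plus 100 tuning steps burst across 3 GPUs.
    cost = estimate_on_demand_cost(20_000_000, 100, gpus=3)
    print(f"Estimated cost: ${cost:.2f}")
    print(f"Due after credit: ${amount_due_after_credit(cost):.2f}")
```

Under these assumptions, the example works out to $10 of inference plus $300 of tuning, so a new user's credit would cover all but $10.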
Reserved
Custom
Don't have your own GPUs? Get dedicated GPUs from Lamini's cluster.
Run on reserved GPUs from Lamini
Unlimited tuning and inference
Unmatched inference throughput
Full evaluation suite
Enterprise support
Contact us
Popular with startups
Self-managed
Custom
Run Lamini in your own secure environment (VPC, on-prem, air-gapped)
Run Lamini on your own GPUs
No internet access needed
Pay per software license
Full evaluation suite
Access to world-class ML experts
Enterprise support
Contact us
Special pricing available for startups
Get started with $300 in free credit
Partner with our team of AI experts to build your LLM application
Get access to our reserved GPUs
Trusted by Fortune 500 and leading startups
100%
Accuracy for content classification
1,200+ hours
Of manual work saved annually
“Lamini's classifier SDK is easy to use... Once [the tuned LLM] was ready, we tested it, and it was so easy to deploy to production. It allowed us to move really rapidly.”
Chris Lu
CTO
Support
Frequently asked questions
Is there a way to try Lamini for free?
Yes, you can try us for free. Just sign up and get $300 in free credit.
Do you offer special pricing for startups?
Yes, we do. Please contact us for more details.
How do I size the number of GPUs?
Increasing the number of GPUs will speed up your job by approximately 1.5x per GPU. Lamini will automatically reschedule your long-running jobs, even if they're only scheduled on one GPU.
How much data do you need to start?
To start, you will need an initial evaluation dataset of about 20-40 input-output pairs. As you iterate, add more data until you reach the accuracy your use case requires.
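To make "input-output pairs" concrete, here is a minimal Python sketch of a sanity check for a starter evaluation set. The "input" and "output" field names and the check_eval_set helper are hypothetical, chosen for illustration rather than taken from Lamini's SDK; only the 20-pair lower bound comes from the answer above.

```python
from typing import Dict, List

# Lower end of the "about 20-40 pairs" guidance from the answer above.
MIN_PAIRS = 20


def check_eval_set(pairs: List[Dict[str, str]]) -> List[str]:
    """Return a list of human-readable problems; an empty list means it looks fine.

    Hypothetical helper: each pair is assumed to be a dict with non-empty
    "input" and "output" strings.
    """
    problems = []
    for i, pair in enumerate(pairs):
        if not pair.get("input", "").strip():
            problems.append(f"pair {i}: missing or empty input")
        if not pair.get("output", "").strip():
            problems.append(f"pair {i}: missing or empty output")
    if len(pairs) < MIN_PAIRS:
        problems.append(f"only {len(pairs)} pairs; about {MIN_PAIRS}-40 are suggested to start")
    return problems


if __name__ == "__main__":
    # Placeholder data standing in for real questions and expected answers.
    starter_set = [
        {"input": f"Example question {n}", "output": f"Expected answer {n}"}
        for n in range(25)
    ]
    for problem in check_eval_set(starter_set) or ["no problems found"]:
        print(problem)
```

The check deliberately has no upper limit, since the answer above describes adding more data as you iterate.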
Do you offer any volume discounts?
Not for Lamini On-Demand. If you want to run a large volume of jobs or data, contact us about Lamini Reserved or Self-managed for better pricing.