Pricing

Plans for startups and enterprises

Deploy accurate, fast, secure, and cost-efficient agents specialized on the data that matters to your business.

On-demand

$0.50/1M tokens
or $1/tuning step
Pay as you go. New users get $300 in free credit.
$0.50/1M inference tokens - one price for input, output, and JSON output
$1/tuning step - scale number of steps based on your data
Linear multiplier - burst tuning across multiple GPUs or nodes for faster performance, priced in proportion to the GPU count (e.g. x3 for 3 GPUs)
Try the full lifecycle: choose a model, RAG/prompt tune, memory tune, evaluate, and run inference
Access to top open source models like Llama 3.1, Mistral v0.3, and Phi 3
Runs on Lamini’s optimized compute platform, generating state-of-the-art MoME models
Get started
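
As a rough illustration of how the On-demand rates above combine, here is a minimal sketch in Python. The function name `estimate_on_demand_cost` and its parameters are hypothetical, not part of any Lamini SDK. It also assumes the linear multiplier applies only to tuning steps and that the $300 credit is simply subtracted from the total; check both against your actual invoice.

```python
# Rates quoted on this page for Lamini On-demand.
INFERENCE_USD_PER_MILLION_TOKENS = 0.50
TUNING_USD_PER_STEP = 1.00
FREE_CREDIT_USD = 300.00  # offered to new users


def estimate_on_demand_cost(
    inference_tokens: int,
    tuning_steps: int,
    gpus: int = 1,
    apply_free_credit: bool = False,
) -> float:
    """Estimate an On-demand bill in USD (hypothetical helper, not a Lamini API)."""
    if inference_tokens < 0 or tuning_steps < 0 or gpus < 1:
        raise ValueError("token and step counts must be >= 0, and gpus must be >= 1")

    # One price for input, output, and JSON output tokens.
    inference_cost = inference_tokens / 1_000_000 * INFERENCE_USD_PER_MILLION_TOKENS

    # Assumption: the linear multiplier (x3 for 3 GPUs) scales the tuning price only.
    tuning_cost = tuning_steps * TUNING_USD_PER_STEP * gpus

    total = inference_cost + tuning_cost
    if apply_free_credit:
        # Assumption: the credit offsets the bill and cannot make it negative.
        total = max(0.0, total - FREE_CREDIT_USD)
    return round(total, 2)


if __name__ == "__main__":
    # 10M inference tokens plus 100 tuning steps burst across 3 GPUs.
    print(estimate_on_demand_cost(10_000_000, 100, gpus=3))
    print(estimate_on_demand_cost(10_000_000, 100, gpus=3, apply_free_credit=True))
```

Under these assumptions, 10M inference tokens cost $5 and 100 tuning steps on 3 GPUs cost $300, so a new user's free credit would cover nearly the whole job.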

Reserved

Custom
Don't have your own GPUs? Get dedicated GPUs from Lamini's cluster.
Run on reserved GPUs from Lamini
Unlimited tuning and inference
Unmatched inference throughput
Full evaluation suite
Enterprise support
Contact us

Self-managed

Custom
Run Lamini in your own secure environment (VPC, on-prem, air-gapped)
Run Lamini on your own GPUs
No internet access needed
Pay per software license
Full evaluation suite
Access to world-class ML experts
Enterprise support
Contact us

Special pricing available for startups

Get started with $300 in free credit
Partner with our team of AI experts to build your LLM application
Get access to our reserved GPUs

Trusted by Fortune 500 and leading startups

100%
Accuracy for content classification
1,200+ hours
Of manual work saved annually

"Lamini's classifier SDK is easy to use... Once [the tuned LLM] was ready, we tested it, and it was so easy to deploy to production. It allowed us to move really rapidly."

Chris Lu
CTO
Support

Frequently asked questions

Is there a way to try Lamini for free?
Yes, you can try us for free. Just sign up and get $300 in free credit.
Do you offer special pricing for startups?
Yes, we do. Please contact us for more details.
How do I size the number of GPUs?
Increasing the number of GPUs will speed up your job by approximately 1.5x per GPU. Lamini will automatically reschedule your long-running jobs, even if they’re only scheduled on 1 GPU.
How much data do you need to start?
For an initial evaluation data set, you will need about 20-40 input-output pairs to start. As you iterate, you will add more data until you achieve the level of accuracy required for your use case.
Do you offer any volume discounts?
Not for Lamini On-Demand. If you want to run a large volume of jobs or data, contact us about Lamini Reserved or Self-managed for better pricing.
Lamini helps enterprises reduce hallucinations by 95%, enabling them to build smaller, faster LLMs and agents based on their proprietary data. Lamini can be deployed in secure environments, whether on-premises (even air-gapped) or in a VPC, so your data remains private.

© 2024 Lamini Inc. All rights reserved.