Lamini - Enterprise LLM Platform

Free

Sed sed risus pretium quam vulputate.

$0/year

Upto 10 projects

Customizable dashboard

Upto 50 tasks

Upto 1 GB storage

Get started

Starter

Sed sed risus pretium quam vulputate.

$250/year

Upto 10 projects

Customizable dashboard

Upto 50 tasks

Upto 1 GB storage

Unlimited proofings

Get started

Pro

Sed sed risus pretium quam vulputate.

$400/year

Upto 10 projects

Customizable dashboard

Upto 50 tasks

Upto 1 GB storage

Unlimited proofings

Unlimited custom fields

Unlimited milestones

Unlimited timeline

Get started

Pricing

Plans for startups and enterprises

Deploy accurate, fast, secure, and cost-efficient agents specialized on the data that matters to your business. Sign up and get $300 free credit.

On-demand

Inference:
$0.50/1M tokens
Tuning:
$0.50/step

Pay as you go, new users get $300 free credit.

$0.50/1M inference tokens - one price for input, output, and JSON output

$0.50/tuning step - scale number of steps based on your data

Linear multiplier - burst tuning across multiple GPUs or nodes for faster performance (e.g. x3 for 3 GPUs)

Try the full lifecycle: choose a model, RAG/prompt tune, memory tune, evaluate, and run inference

Access to top open source models like Llama 3.1, Mistral v0.3, and Phi 3

Runs on Lamini’s optimized compute platform, generating state-of-the-art MoME models

Get started

Reserved

Custom

Don't have your own GPUs? Get dedicated GPUs from Lamini's cluster.

Run on reserved GPUs from Lamini

Unlimited tuning and inference

Unmatched inference throughput

Full evaluation suite

Enterprise support

Popular with startups

Self-managed

Custom

Run Lamini in your own secure environment (VPC, on-prem, air-gapped)

Run Lamini on your own GPUs

No internet access needed

Pay per software license

Full evaluation suite

Access to world-class ML experts

Enterprise support

Special pricing available for startups

Get started with $300 in free credit

Partner with our team of AI experts to build your LLM application

Get access to our reserved GPUs

Request demo

Trusted by Fortune 500 and leading startups

100%

Accuracy for content classification

1200+h

Of manual work saved annually

"Lamini's classifier SDK is easy to use... Once [the tuned LLM] was ready, we tested it, and it was so easy to deploy to production. It allowed us to move really rapidly.”

Chris Lu

CTO

Support

Frequently asked questions

Is there a way to try Lamini for free?

Yes, you can try us for free. Just sign up and get $300 in free credit.

Do you offer special pricing for startups?

Yes, we do. Please contact us for more details.

How do I size the number of GPUs?

Increasing the number of GPUs will speed up your job by approximately 1.5x per GPU. Lamini will automatically reschedule your long running jobs, even if they’re only scheduled on 1 GPU.

How much data do you need to start?

For an initial evaluation data set, you will need about 20-40 input-output pairs to start. As you iterate, you will add more data until you achieve the level of accuracy required for your use case.

Do you offer any volume discounts?

Not for Lamini On-Demand. If you want to run a large volume of jobs or data, contact us about Lamini Reserved or Self-managed for better pricing.