Kimi is an AI assistant from Moonshot AI, a Chinese lab. It is known for handling very long context (long documents and conversations) and, through the open-weight Kimi K2 model, for strong agentic and coding ability.

The web and mobile apps are free with usage limits. The API is low-cost and usage-based, and the Kimi K2 open weights cost only your own compute to self-host.

Kimi K2 is Moonshot AI's open-weight mixture-of-experts model, tuned for agentic tasks and coding. Its weights are published, so you can call it by API or run it on your own infrastructure.

Can I self-host Kimi?

You can self-host the Kimi K2 open weights, which run under common runtimes. The consumer Kimi app and the hosted API run on Moonshot's own infrastructure.

Is Kimi safe for business data?

It depends how you run it. The hosted app and API operate from China, with the data-residency and governance questions that follow. Self-hosting the K2 open weights keeps inference inside your own environment, which is the path to prefer for sensitive or regulated data.

How does Kimi compare to DeepSeek or Qwen?

All three are capable Chinese models with open-weight options. Kimi is known for long context and the agentic K2 model; DeepSeek for low-cost transparent reasoning; Qwen for the widest range of sizes and multilingual coverage. For data-sensitive work, self-host whichever you choose.

Kimi guide

What is Kimi?

Kimi is an AI assistant built by Moonshot AI, a Chinese lab, first known for reading very long documents in a single conversation and more recently for its open-weight Kimi K2 model, a large mixture-of-experts model tuned for agentic tasks and coding.

You use Kimi through its web and mobile apps, through an OpenAI-compatible API, or, for K2, by self-hosting the published open weights. It handles long files, web search, and tool use, and is strong in both Chinese and English.

Kimi is worth evaluating when you need to reason over long inputs, want capable open weights you can host, or are comparing low-cost frontier-class models, while accounting for the data-governance questions of a China-based hosted service.

Strengths

What it's best for

Reading and reasoning over very long documents, transcripts, and codebases in one pass.
Agentic and coding work through the open-weight Kimi K2 model, which is tuned for multi-step tool use.
Self-hosting: K2's weights are openly published, so you can run inference in your own environment.
Bilingual Chinese and English tasks where strong coverage of both matters.
Cost-sensitive evaluation of a capable model, via a low-cost API or your own compute.

Limits

Where it falls short

Sensitive or regulated data on the hosted app and API, which run on China-based infrastructure. Self-hosting the open weights avoids this.
Topics subject to Chinese content restrictions, which the hosted model may avoid or steer.
Native image, audio, or video generation. Pair it with a dedicated generation model.
Teams needing the mature enterprise support and SLAs of the large US providers.

How to use it

Getting started

Sign up at kimi.com (or the mobile app) and you can chat for free. Upload long PDFs, spreadsheets, or code files and ask questions across the whole document set.

Turn on web search when you need current information; without it, Kimi answers from training data.

How to use it

Kimi K2 and the API

For building, the Moonshot API is OpenAI-compatible, so most SDKs work by changing the base URL and key. Kimi K2 is the model to reach for on agentic and coding workloads.

K2's open weights are published on hubs like Hugging Face and run under common runtimes, so a data-sensitive team can self-host rather than call the hosted service.

How to use it

Getting better answers

Give Kimi the full document and a precise question rather than a summary; its strength is keeping long context in view.

For agentic runs with K2, describe the tools and the goal clearly and let it plan the steps. Check the data-governance path before sending confidential inputs to the hosted API.

Pricing

What Kimi costs

Approximate, in USD, as of January 2026. Prices change often. Confirm on the official site before you rely on them.

Web and mobile app

Free assistant with web search and long-document upload, subject to limits.

Open weights (Kimi K2)

$0 (self-host)

Download and run K2 yourself; you pay only for your own compute.

API

Low, usage-based

OpenAI-compatible, priced per million tokens by model.

Visit the official Kimi site

Try it

Example prompts

Copy these into Kimi as starting points, then adapt them to your task.

Long-document analysisCopy prompt

Here is a long report. Summarize it in one paragraph, list the five key findings with the page they come from, and flag any claim the document does not support.

Agentic coding with K2Copy prompt

Using Kimi K2, plan and implement this feature step by step. List the files you would change, make the edits, and explain each change. Flag anything you are unsure about.

Swap-in testCopy prompt

Rewrite this OpenAI API call to use the Moonshot (Kimi) API instead, changing only the base URL and model name and keeping everything else the same.

Governance checkCopy prompt

We are considering Kimi for an internal tool that touches customer data. List the data-governance questions to answer before using the hosted API, and what changes if we self-host the K2 open weights.

FAQ

Kimi
common questions.

Direct answers to the questions we get asked the most. If yours isn't covered, write to the team.

Contact the team

Kimi

What it's best for

Where it falls short

Getting started

Kimi K2 and the API

Getting better answers

What Kimi costs

Example prompts

Kimi
common questions.

Related guides

DeepSeek

Qwen

GLM (Z.ai)

Putting AI into production?

What it's best for

Where it falls short

Getting started

Kimi K2 and the API

Getting better answers

What Kimi costs

Example prompts

Kimicommon questions.

What is Kimi?

Is Kimi free to use?

What is Kimi K2?

Can I self-host Kimi?

Is Kimi safe for business data?

How does Kimi compare to DeepSeek or Qwen?

Related guides

DeepSeek

Qwen

GLM (Z.ai)

Putting AI into production?

Kimi
common questions.