# DeepSeek V4

Open DeepSeek V4 model family for million-token context, coding, reasoning, and agent workflows.

## Summary
DeepSeek V4 is DeepSeek's current open model family, with V4-Pro and V4-Flash variants surfaced through DeepSeek's official API docs and DeepSeek AI's Hugging Face release pages.


## Guide
DeepSeek V4 is the current DeepSeek family to evaluate after DeepSeek-R1. For OpenAgent readers, the important question is not whether V4 is new; it is whether V4-Pro or V4-Flash improves the specific coding, tool, long-context, or agent workload you plan to run.

### What it is
DeepSeek V4 is an open model resource to evaluate by workload, serving path, context behavior, license terms, and how reliably it supports the agent or local AI tasks you actually plan to run.

### Why it matters
Model pages get stale quickly. DeepSeek-R1 was a major open reasoning baseline, but teams choosing models in mid-2026 should also test V4 because it targets long context, coding, tool calls, JSON output, and agentic workflows that sit closer to real production use.

### How it works
Run DeepSeek V4 on a fixed prompt set from your own workflow. Compare quality, latency, context handling, retry behavior, deployment path, and license fit against nearby open models before adopting it.


## Use Cases
- Agent executor model: Use V4-Flash as a candidate for cheaper high-volume agent steps such as extraction, summarization, and simple tool planning.
- Coding and reasoning escalation: Use V4-Pro as a candidate when the workflow needs deeper reasoning, code review, planning, or complex technical analysis.
- Long-context evaluation: Use V4 to test whether million-token context improves repository, document, or memory-heavy workflows without creating new failure modes.

## Alternatives
- DeepSeek-R1 remains a reasoning baseline vs DeepSeek-R1: Keep R1 in comparisons when you want a known reasoning reference, but add V4 when evaluating current DeepSeek model behavior.
- Compare against Qwen, Kimi, GLM, Gemma, and Mistral vs other open model families: V4 should earn adoption through workload tests, not release momentum alone.

### Getting Started
- Read DeepSeek's V4 preview release: https://api-docs.deepseek.com/news/news260424
- Review DeepSeek API model pricing: https://api-docs.deepseek.com/quick_start/pricing
- Inspect the official Hugging Face collection: https://huggingface.co/collections/deepseek-ai/deepseek-v4

### FAQ
- What should I check before using DeepSeek V4?
  - Run DeepSeek V4 on a fixed prompt set from your own workflow. Compare quality, latency, context handling, retry behavior, deployment path, and license fit against nearby open models before adopting it.
- Is DeepSeek V4 open source?
  - DeepSeek V4 is listed with MIT based on the official source links in this profile. Re-check the repository, model card, or docs before production use.
- Who should evaluate DeepSeek V4?
  - DeepSeek V4 is most worth evaluating for developers testing open models for coding, reasoning, and agent workflows.
- Is DeepSeek V4 newer than DeepSeek-R1?
  - Yes. This page treats DeepSeek-R1 as an older reasoning baseline and DeepSeek V4 as the current DeepSeek model family to evaluate for coding, long context, tool calls, and agent workflows.
- Should I use V4-Pro or V4-Flash first?
  - Start with V4-Flash for cost-sensitive or high-throughput tasks, then test V4-Pro on harder coding and reasoning prompts where quality gains might justify the cost.
- Is DeepSeek V4 suitable for image input?
  - Verify the current official docs before relying on image input. This profile focuses on the official text, long-context, JSON output, and tool-call signals surfaced in DeepSeek's V4 docs.
## Why It Matters
DeepSeek V4 matters because open model evaluation has moved beyond older reasoning baselines. Builders now need to compare newer long-context, tool-friendly, and agent-oriented DeepSeek models against Qwen, Kimi, GLM, Mistral, and closed hosted models on their own workloads.


## Best For
- Developers testing open models for coding, reasoning, and agent workflows
- Teams comparing V4-Pro quality against lower-cost V4-Flash throughput
- Builders who need long-context model options with official API and open-weight reference links

## Not For
- Teams that need a simple consumer chatbot with no model or provider evaluation
- Builders who cannot review model cards, license terms, safety behavior, and serving cost before adoption
- Use cases that require verified image input support rather than text, tool calls, JSON output, and long-context reasoning

## What It Actually Does
- Two-model V4 family: DeepSeek's official V4 preview describes V4-Pro as the stronger variant and V4-Flash as the faster, more economical option.
  - Why it matters: Agent builders can route easier executor tasks to Flash and reserve Pro for higher-value reasoning or coding work.
- Million-token context direction: The official V4 release and pricing pages position V4 around very long context, thinking and non-thinking modes, JSON output, and tool calls.
  - Why it matters: Long-context reliability matters for agents that read large repositories, documents, logs, or task histories.
- Official API plus open-weight references: DeepSeek points readers to official API model IDs and a Hugging Face collection for the V4 open-weight release path.
  - Why it matters: Teams can compare hosted API behavior with model-card and open-weight constraints before building production routing.

## Typical Use Cases
- Coding agent model routing: Test V4-Flash for cheaper planning and executor steps, then escalate to V4-Pro when deeper reasoning or code review quality is required.
- Long-context research and analysis: Evaluate how V4 handles large documents, repository context, meeting archives, or task traces before trusting million-token claims.
- Open-model comparison baseline: Compare DeepSeek V4 against Qwen, Kimi, GLM, Mistral, Gemma, and DeepSeek-R1 with the same prompts, latency budget, and license review.

## How It Compares
- Choose DeepSeek V4 when you need the current DeepSeek family vs DeepSeek-R1: R1 remains useful as a reasoning baseline, but V4 is the newer DeepSeek family to test for long context, coding, tool calls, and current API behavior.
- Use V4-Flash before V4-Pro when cost and throughput matter vs V4-Pro-only routing: Flash should be evaluated first for simpler agent steps; Pro is more appropriate when quality gains justify the higher-cost path.
- Do not rank it from benchmark headlines alone vs generic leaderboard selection: Run your own coding traces, tool-call prompts, retrieval context, and safety checks before putting V4 into a product loop.

## Command Line
### Use the official API model IDs
Keep your existing DeepSeek API base URL and change the model ID for a controlled V4 evaluation.

```bash
model="deepseek-v4-flash" # or "deepseek-v4-pro"
```

## Facts
- Category: models
- Resource type: model
- Open source: yes
- License: MIT
- Last verified: 2026-06-02

## Capabilities
- local-inference
- tool-calling

## Structured Use Case Tags
- local-ai
- self-hosted-ai

## Getting Started
- Read the DeepSeek V4 preview release: https://api-docs.deepseek.com/news/news260424
- Check official models and pricing: https://api-docs.deepseek.com/quick_start/pricing
- Open the Hugging Face V4 collection: https://huggingface.co/collections/deepseek-ai/deepseek-v4
- Verify the license page before deployment: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro-Base/blob/main/LICENSE

## Links
- Homepage: https://www.deepseek.com/
- Docs: https://api-docs.deepseek.com/news/news260424
- Demo: https://huggingface.co/collections/deepseek-ai/deepseek-v4
- Source: https://api-docs.deepseek.com/quick_start/pricing
- Source: https://api-docs.deepseek.com/updates/
- Source: https://huggingface.co/deepseek-ai/DeepSeek-V4-Flash
- Source: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro-Base/blob/main/LICENSE

## Structured Outputs
- JSON: https://www.openagent.bot/models/deepseek-v4.json
- Markdown: https://www.openagent.bot/models/deepseek-v4.md
- Canonical: https://www.openagent.bot/models/deepseek-v4
