Support for deepseek-v4-pro/flash Preview and optimal configuration for reasoning models #2437

gtonu · 2026-06-09T07:42:41Z

gtonu
Jun 9, 2026

Hi everyone,
With DeepSeek's official release of the DeepSeek V4 model family, including deepseek-v4-pro and deepseek-v4-flash, I would like to inquire if the community plans to introduce native configuration support for these variants.

Given their aggressive pricing, 1M token context capacity, and optimized agentic coding benchmarks, integrating these variants would offer a highly cost-efficient alternative for PR reviews.

I am currently running pr-agent via GitHub Actions with the following workflow setup and configuration:

GitHub Actions Workflow:

pr-agent-job:
    if: \${{ github.event.sender.type != 'Bot' }}
    runs-on: ubuntu-latest

    name: Calling Pr-agent
    steps:
      - name: Checkout
        uses: actions/checkout@v4
        
      - name: Setup Pr-agent
        uses: Codium-ai/pr-agent@e13da4fdda9903c8c7d1c9ba22f671b43f56039b
        env:
          OPENAI_KEY: \${{ secrets.OPENAI_API_KEY }}
          GITHUB_TOKEN: \${{ secrets.GITHUB_TOKEN }}

PR-Agent Config:

[config]
model = "gpt-4o-mini-2024-07-18"
fallback_models = []
#model_reasoning = ""
#model_weak = ""
temperature = 0.2
max_tokens = 1500

Context

DeepSeek V4 Pro (deepseek-v4-pro): Exceptional for heavy code reasoning, multi-file context tracking, and high-complexity agentic workflows.
DeepSeek V4 Flash (deepseek-v4-flash): Extremely low latency and low cost, perfect for quick PR summarizations, changelog generation, or as a fast utility fallback.

Currently, pr-agent accommodates custom OpenAI-compatible endpoints, but explicit naming configurations and specific handling for DeepSeek's native reasoning_content (interleaved thinking blocks) are necessary to maximize review quality and avoid token formatting errors.

Questions for the Community

Model Support & Identifiers: Are deepseek-v4-pro and deepseek-v4-flash already natively supported in PR-Agent? If so, what are the correct provider prefixes and identifiers to use in the configuration?
Reasoning Orchestration: If I decide to transition to these new models, how should I ideally structure the model_reasoning and model_weak parameters using the DeepSeek V4 family (e.g., mapping Pro to reasoning and Flash to weak)?
Task Allocation: How does PR-Agent internally decide which tasks to offload to model_reasoning versus the primary model when both are defined?
Performance vs. Efficiency: In terms of code review quality, is it better to route all tasks to a single strong model like deepseek-v4-pro, or does splitting tasks across model, model_reasoning, and model_weak yield a significant functional benefit?
Workaround Routing: If native integration isn't fully ready, what is the recommended fallback configuration to securely route these models using the general openai or custom endpoint provider setup in our current workflow?

Thank you for your help!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for deepseek-v4-pro/flash Preview and optimal configuration for reasoning models #2437

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Support for deepseek-v4-pro/flash Preview and optimal configuration for reasoning models #2437

Uh oh!

gtonu Jun 9, 2026

Context

Questions for the Community

Replies: 0 comments

gtonu
Jun 9, 2026