Model Selection Guide
This document will help you understand the most suitable models to use in different scenarios in NekroAgent, and provide detailed performance, price, and applicability analysis. Currently, it mainly provides model selection information supplied by NekroAgent Official Relay, and will gradually add models from other sources.
Rating Description
In the recommended models, we use the following rating standards:
| Rating | Corresponding Level | Description |
|---|---|---|
| 👑 | ⭐⭐⭐⭐⭐ | Excellent |
| 🥇 | ⭐⭐⭐⭐ | Outstanding |
| 🥈 | ⭐⭐⭐ | Good |
| 🥉 | ⭐⭐ | Average |
| ⚪ | ⭐ | Poor |
Note
The following recommendations are for reference only. The same model from different sources may have differences in final performance due to channel conversion strategies, different configuration settings, concurrency situations, current status, etc. We encourage you to try multiple models based on actual usage, including those not in the following form, to choose the model that best suits you!
The models in the following tables are from NekroAgent Official Relay - Available Model List. If you think there is a significant difference between the following tables and actual experience, you are welcome to contact us for feedback. We will continuously maintain and update the tables to better match actual experience
For information on deprecated & discontinued models, please see Model Deprecations
NekroAgent Main Application
Chat Conversation Process
The chat session process of NekroAgent (excluding plugin functions) is mainly affected by three configuration items: Main Model Group (USE_MODEL_GROUP), Debug/Agent Migration Model Group (DEBUG_MIGRATION_MODEL_GROUP), and Fallback Model Group (FALLBACK_MODEL_GROUP). The specific scheduling strategy is as follows:
- When a conversation process starts, the model in the
Main Model Groupis first used for generation - When the code generated by the
Main Model Grouptriggers Agent type methods or produces program errors, subsequent calls in this process all use the model in theDebug/Agent Migration Model Groupfor iteration - If either the
Main Model GrouporDebug/Agent Migration Model Groupmodel call fails, the model in theFallback Model Groupis used for generation - If the
Fallback Model Groupalso fails to call, the response process ends in failure
Below is the list of recommended models for Chat Conversation Process:
This list updated on December 10, 2025
| Model Name | Quality | Speed | Stability | Cost-Effectiveness | Vision | Built-in Thinking | Notes |
|---|---|---|---|---|---|---|---|
| claude-4-5-sonnet-latest | 👑 | 🥈 | 🥈 | 🥈 | 👁️ | ❌ | Anthropic's latest flagship model, with the strongest comprehensive capabilities but limited supply, suitable as the main model |
| gemini-3-pro-preview | 👑 | 🥉 | 🥇 | 🥉 | 👁️ | 🧠 | Google's latest high-quality flagship model, with powerful agent and coding capabilities, supports thinking signature and thinking levels ⚠️ Preview model |
| gemini-2.5-pro | 👑 | 🥇 | 🥇 | 🥈 | 👁️ | 🧠 | Best overall experience, with good performance in language ability, logic ability, and other aspects, has adaptive thinking ability, suitable as the main model ⚠️ Expected to be discontinued as early as June 2026, recommend switching to gemini-3-pro |
| gpt-4.1 | 🥇 | 🥈 | 🥇 | 🥈 | 👁️ | ❌ | Newer flagship GPT model, with obvious AI characteristics but decent logical ability |
| claude-4-5-haiku | 🥈 | 🥇 | 🥇 | 🥉 | 👁️ | ❌ | Anthropic's small model, comparable to gpt-4o-mini level |
| gemini-2.5-flash | 🥇 | 🥇 | 🥇 | 👑 | 👁️ | ❌ | High cost-effectiveness, fast speed, with balanced logical and language abilities, suitable as the main model ⚠️ Expected to be discontinued as early as June 2026 |
| gemini-2.5-flash-thinking | 🥇 | 🥈 | 🥉 | 👑 | 👁️ | 🧠 | Slightly faster built-in thinking model, with high generation quality but fluctuating generation speed ⚠️ Expected to be discontinued as early as June 2026 |
| deepseek-chat (v3) | 🥇 | 🥉 | 🥇 | 🥈 | ❌ | ❌ | Classic domestic model, excellent Chinese ability, distinctive language style |
| doubao-1.5-vision-pro-32k-250115 | 🥈 | 🥈 | 👑 | 🥈 | 👁️ | ❌ | Domestic model provided by ByteDance, good stability, strong multimodal ability, stable price, suitable as a backup model |
| gemini-2.0-flash | 🥈 | 👑 | 🥇 | 🥇 | 👁️ | ❌ | Extremely low-cost and fast small model, recommended to use with external chain of thought, can also be used as an iterative model ⚠️ Expected to be discontinued as early as February 2026, recommend switching to gemini-2.5-flash |
| gpt-4o | 🥇 | 🥈 | 🥇 | 🥈 | 👁️ | ❌ | GPT-generated content has a strong AI flavor, suitable for productivity use |
| gpt-4o-mini | 🥈 | 🥈 | 🥇 | 🥇 | 👁️ | ❌ | GPT-generated content has a strong AI flavor, suitable for productivity use |
| grok-3 | 🥈 | 🥈 | 🥇 | 🥉 | 👁️ | ❌ | Language model launched by xAI, with fewer restrictions and lower AI flavor, suitable for conversation |
Note:
- In NekroAgent, the
External Chain of Thoughtswitch of the model first used in the conversation process (usually the main model) will affect the use of chain of thought in subsequent calls of this conversation process. For example, if the main model enablesExternal Chain of Thought, the iteration/debug model will also have the effect ofenabling external chain of thought - Generally, models that support
Built-in Thinkingare not recommended to enableExternal Chain of Thought, otherwise it may reduce model generation speed - Due to the implementation of the prompt iteration mechanism, it is not recommended to mix models that
support visionanddo not support vision, otherwise it may lead to request format errors
Plugin Development
The generation modification suggestion model in NekroAgent's Plugin Editor uses the Plugin Code Generation Model Group (PLUGIN_GENERATE_MODEL_GROUP) to generate code solutions for user needs. It is recommended to use models with strong coding capabilities and high quality. Below is the list of recommended models:
| Model Name | Quality | Speed | Stability | Cost-Effectiveness | Vision | Thinking | Notes |
|---|---|---|---|---|---|---|---|
| claude-4-5 | 👑 | 🥈 | 🥈 | 🥈 | 👁️ | 🧠 | Anthropic's latest high-quality flagship coding model |
| gemini-3-pro-preview | 👑 | 🥉 | 🥈 | 🥉 | 👁️ | 🧠 | Google's latest high-quality flagship model, excellent performance in the programming field, with powerful agent and coding capabilities ⚠️ Preview model |
| gemini-2.5-pro | 👑 | 🥇 | 🥇 | 🥈 | 👁️ | 🧠 | Google's previous generation flagship model, excellent performance in the programming field, with adaptive thinking ability ⚠️ Expected to be discontinued as early as June 2026, recommend switching to gemini-3-pro |
After the generation model generates modification suggestions, we also need to use the Plugin Code Application Model Group (PLUGIN_APPLY_MODEL_GROUP) to apply the modification suggestions in the current plugin editor. It is recommended to use models with strong prompt compliance and fast generation speed. Below is the list of recommended models:
| Model Name | Quality | Speed | Stability | Cost-Effectiveness | Vision | Thinking | Notes |
|---|---|---|---|---|---|---|---|
| gemini-2.5-flash | 🥈 | 👑 | 🥇 | 🥈 | 👁️ | ❌ | ⚠️ Expected to be discontinued as early as June 2026 |
Built-in Plugins
Emoticon Pack Plugin
The emoticon pack plugin needs to use a Vector Embedding Model to provide emoticon search capability. It is strongly recommended to use the text-embedding-v3 model:
| Model Name | Quality | Speed | Stability | Cost-Effectiveness | Vision | Dimensions | Notes |
|---|---|---|---|---|---|---|---|
| text-embedding-v3 | 👑 | 👑 | 👑 | 👑 | ❌ | 1024 | Very cheap and efficient text embedding model provided by Alibaba Cloud |
| multimodal-embedding-v1 | 👑 | 🥇 | 👑 | 👑 | ✅ | 1024 | Multimodal embedding model provided by Alibaba Cloud, but with many input restrictions, only recommended for special use |
Drawing (Learn to Draw)
The drawing plugin supports OpenAI standard drawing API (such as DALL-E 3) and any OpenAI chat completion API that supports conversation-generated images. Below is the list of recommended models:
| Model Name | Quality | Speed | Stability | Cost-Effectiveness | Image-to-Image | Format | Notes |
|---|---|---|---|---|---|---|---|
| gemini-2.5-flash-image-preview | 🥇 | 🥇 | 🥈 | 🥇 | ✅ | Chat mode | Gemini 2.5 drawing model ⚠️ Will be closed on January 15, 2026 |
| gemini-3-pro-image-preview | 🥇 | 🥇 | 🥈 | 🥉 | ✅ | Chat mode | Gemini 3 drawing model, recommended to use with advanced drawing plugin (market) |
| sora_image | 👑 | ⚪ | 🥇 | 🥈 | ✅ | Chat mode | Consistent with ChatGPT official website 4o drawing, high compliance but very slow |
| Kolors | 🥈 | 👑 | 👑 | 🥇 | ✅ | Image generation mode | Domestic drawing model, with single style and偏向 CG style |
Notes
- Model performance may change over time with updates
- Price information is for reference only, actual prices are subject to official quotations
- It is recommended to regularly evaluate model selection based on actual usage
- Experimental Models (exp/preview): These models are experimental and may be updated or closed at any time. It is recommended that:
- Regularly follow Google Gemini API Version Notes for the latest updates
- Prepare backup solutions when using in production environments
- Prioritize using stable version (GA) models
- Some preview models will automatically redirect to stable versions. It is recommended to directly use stable version model names to avoid delays caused by redirection
- Model Redirection: Some discontinued preview models will automatically redirect to corresponding stable versions, for example:
gemini-2.5-pro-preview-03-25→gemini-2.5-progemini-2.5-pro-preview-05-06→gemini-2.5-pro
Important Note
When using any generative artificial intelligence service, be sure to comply with relevant terms of service and laws and regulations
