generative Big Model Official API
price system
In this era of openness and sharing, OpenAI leads a revolution in artificial intelligence. Now, we announce to the world that we have fully supported all models of OpenAI, for example, supporting GPT-4-ALL, GPT-4-multimodal, GPT-4-gizmo-*, etc. as well as a variety of home-grown big models. Most excitingly, we have introduced the more powerful and influential GPT-4o to the world!




OpenAI
OpenAI Plus
Midjourney
Suno
Luma
Claude
Moonshot
Google Internet company
Baidu's online shop, zhidao.baidu.com
Aritongyi, one of the indigenous peoples of Taiwan
OpenAI
Model name | typology | Billing Type | Final price | Model Price Calculation Multiplier | clarification |
---|---|---|---|---|---|
gpt-4o-mini 🔥 | visual model | pay per volume |
Input Tokens: $0.00045/K tokens Output Tokens: $0.0018/K tokens |
Input magnification: 0.225 Complementary magnification: 4 |
The gpt-4o-mini is mainly cheap and good. Has 128K contexts and outputs up to 16K tokens and October 2023 knowledge cutoffs per request. tool_calls and function_calls are supported for models in the 1106 series and above. |
gpt-4o-mini-2024-07-18 🔥 | visual model | pay per volume |
Input Tokens: $0.00045/K tokens Output Tokens: $0.0018/K tokens |
Input magnification: 0.225 Complementary magnification: 4 |
The gpt-4o-mini is mainly cheap and good. Has 128K contexts and outputs up to 16K tokens and October 2023 knowledge cutoffs per request. tool_calls and function_calls are supported for models in the 1106 series and above. |
gpt-4o 🔥 | visual model | pay per volume |
Input Tokens: $0.015/K tokens Output Tokens: $0.045/K tokens |
Input magnification: 7.5 Complementary magnification: 3 |
GPT-4o is OpenAI's most advanced multimodal model, faster and cheaper than GPT-4 Turbo, with enhanced visual capabilities. With 128K contexts and a knowledge cutoff of October 2023. 1106 series and above models support tool_calls and function_calls. |
gpt-4o-2024-05-13 🔥 | visual model | pay per volume |
Input Tokens: $0.015/K tokens Output Tokens: $0.045/K tokens |
Input magnification: 7.5 Complementary magnification: 3 |
GPT-4o is OpenAI's most advanced multimodal model, faster and cheaper than GPT-4 Turbo, with enhanced visual capabilities. With 128K contexts and a knowledge cutoff of October 2023. 1106 series and above models support tool_calls and function_calls. |
gpt-4-turbo 🔥 | visual model | pay per volume |
Input Tokens: $0.03/K tokens Output Tokens: $0.09/K tokens |
Input magnification: 15 Complementary magnification: 3 |
GPT-4 Turbo with Vision is the latest generation of models. It is more powerful, updated with a knowledge deadline of April 2023, introduces a 128k context window, accepts text or image input and outputs text, and solves puzzles more accurately than any previous model.The 1106 series and above models support tool_calls and function_calls. |
gpt-4-turbo-2024-04-09 🔥 | visual model | pay per volume |
Input Tokens: $0.03/K tokens Output Tokens: $0.09/K tokens |
Input magnification: 15 Complementary magnification: 3 |
GPT-4 Turbo with Vision is the latest generation model. It is more powerful, updated with a knowledge deadline of April 2023, introduces a 128k context window, accepts text or image input and outputs text, and solves puzzles more accurately than any previous model. |
dall-e-3 🔥 | Drawings, dialogues | pay per volume |
Standard. - 1024×1024: $0.16 - 1024×1792 / 1792×1024: $0.32 HD. - 1024×1024: $0.32 - 1024×1792 / 1792×1024: $0.48 |
Input magnification: 64 Complementary multiplier: 1 |
DALL-E-3 drawing with support for setting quality and size parameters, compatible with chat format calls. Generate novel images and artworks directly. |
gpt-4-vision-compatible | visual model | pay per volume |
Input Tokens: $0.03/K tokens Output Tokens: $0.06/K tokens |
Input magnification: 15 Complementary magnification: 2 |
Specific to this site! Compatible chat prompt with URL parameter to read images, compatible prompt format (image URL + space + prompt support multiple images): img_url prompt. official original vision need to modify the code according to the official documentation. |
gpt-4-vision-preview | visual model | pay per volume |
Input Tokens: $0.03/K tokens Output Tokens: $0.09/K tokens |
Input magnification: 15 Complementary magnification: 3 |
vision preview requires code changes according to the official documentation. |
gpt-4-1106-preview | dialogues | pay per volume |
Input Tokens: $0.03/K tokens Output Tokens: $0.09/K tokens |
Input magnification: 15 Complementary magnification: 3 |
The latest gpt-4-1106-preview, also known as gpt-4-turbo, is 67% cheaper than gpt-4, supports 128k contexts, supports TOOLS, and has an April 2023 knowledge deadline. |
gpt-4-0125-preview | dialogues | pay per volume |
Input Tokens: $0.03/K tokens Output Tokens: $0.09/K tokens |
Input magnification: 15 Complementary magnification: 3 |
Latest gpt-4-0125-preview, upgraded version of gpt-4-1106-preview, stronger code generation capability, reduce model "lazy" phenomenon, fix the problem of non-English UTF-8 generation. |
gpt-4-turbo-preview | dialogues | pay per volume |
Input Tokens: $0.03/K tokens Output Tokens: $0.09/K tokens |
Input magnification: 15 Complementary magnification: 3 |
Upgraded version of gpt-4-turbo-preview, stronger code generation capability, reduce model 'laziness' phenomenon, fix non-English UTF-8 generation problem. |
gpt-4 | dialogues | pay per volume |
Input Tokens: $0.09/K tokens Output Tokens: $0.18/K tokens |
Input magnification: 45 Complementary magnification: 2 |
Pure official GPT4 series, series model support function_call. |
gpt-4-0613 | dialogues | pay per volume |
Input Tokens: $0.09/K tokens Output Tokens: $0.18/K tokens |
Input magnification: 45 Complementary magnification: 2 |
Pure official GPT4 series, 0613 series models support function_call. |
gpt-4-0314 (obsolete) | dialogues | pay per volume |
Input Tokens: $0.09/K tokens Output Tokens: $0.18/K tokens |
Input magnification: 45 Complementary magnification: 2 |
Pure Official GPT4 Series. |
gpt-4-32k | dialogues | pay per volume |
Input Tokens: $0.06/K tokens Output Tokens: $0.12/K tokens |
Input magnification: 30 Complementary magnification: 2 |
Pure Official GPT4 32K Series. |
gpt-4-32k-0613 | dialogues | pay per volume |
Input Tokens: $0.06/K tokens Output Tokens: $0.12/K tokens |
Input magnification: 30 Complementary magnification: 2 |
Pure official GPT4 1.5B series, 0613 series models support function_call. |
gpt-4-32k-0314 | dialogues | pay per volume |
Input Tokens: $0.06/K tokens Output Tokens: $0.12/K tokens |
Input magnification: 30 Complementary magnification: 2 |
Pure official GPT4 1.5B series. |
gpt-3.5-turbo | dialogues | pay per volume |
Input Tokens: $0.0015/K tokens Output Tokens: $0.0045/K tokens |
Input magnification: 0.75 Complementary magnification: 3 |
Pure official high speed GPT3.5 series, support function_call. |
gpt-3.5-turbo-0613 | dialogues | pay per volume |
Input Tokens: $0.0045/K tokens Output Tokens: $0.006/K tokens |
Input magnification: 2.25 Complementary multiplier: 1.333333333333333333 |
Pure official high speed GPT3.5 series, support function_call. |
gpt-3.5-turbo-0301 | dialogues | pay per volume |
Input Tokens: $0.0045/K tokens Output Tokens: $0.006/K tokens |
Input magnification: 2.25 Complementary multiplier: 1.333333333333333333 |
Pure official high speed GPT3.5 series with tools_call support. |
gpt-3.5-turbo-1106 | dialogues | pay per volume |
Input Tokens: $0.003/K tokens Output Tokens: $0.006/K tokens |
Input magnification: 1.5 Complementary magnification: 2 |
Pure official high speed GPT3.5 series with tools_call support. |
gpt-3.5-turbo-0125 | dialogues | pay per volume |
Input Tokens: $0.0015/K tokens Output Tokens: $0.0045/K tokens |
Input magnification: 0.75 Complementary magnification: 3 |
Pure official high speed GPT3.5 series with tools_call support. |
gpt-3.5-turbo-instruct | dialogues | pay per volume |
Input Tokens: $0.0045/K tokens Output Tokens: $0.006/K tokens |
Input magnification: 2.25 Complementary multiplier: 1.333333333333333333 |
Pure official high speed GPT3.5 series, support function_call. |
gpt-3.5-turbo-16k | dialogues | pay per volume |
Input Tokens: $0.009/K tokens Output Tokens: $0.012/K tokens |
Input magnification: 4.5 Complementary multiplier: 1.333333333333333333 |
Pure official high speed GPT3.5 16K series, support function_call. |
gpt-3.5-turbo-16k-0613 | dialogues | pay per volume |
Input Tokens: $0.009/K tokens Output Tokens: $0.012/K tokens |
Input magnification: 4.5 Complementary multiplier: 1.333333333333333333 |
Pure official high speed GPT3.5 16K series, support function_call. |
dall-e, dall-e-2 | Drawings, dialogues | pay per volume |
1024×1024: $0.045 512×512: $0.0405 256×256: $0.036 |
Input magnification: 18 Complementary multiplier: 1 |
DALL-E drawing, support for setting quality and size parameters, compatible with chat format calls. |
tts-1 | text-to-speech | pay per volume |
Input Tokens: $0.06/K characters Output Tokens: $0.06/K characters |
Input magnification: 30 Complementary multiplier: 1 |
Text-to-speech model TTS with support for setting tones, the standard tts-1 model offers the lowest latency but lower quality than the tts-1-hd model. |
tts-1-1106 | text-to-speech | pay per volume |
Input Tokens: $0.06/K characters Output Tokens: $0.06/K characters |
Input magnification: 30 Complementary multiplier: 1 |
Text-to-speech model TTS with support for setting tones, the standard tts-1 model offers the lowest latency but lower quality than the tts-1-hd model. |
tts-1-hd | text-to-speech | pay per volume |
Input Tokens: $0.06/K characters Output Tokens: $0.06/K characters |
Input magnification: 30 Complementary multiplier: 1 |
Text-to-speech model TTS with support for setting tones. |
tts-1-hd-1106 | text-to-speech | pay per volume |
Input Tokens: $0.06/K characters Output Tokens: $0.06/K characters |
Input magnification: 30 Complementary multiplier: 1 |
Text-to-speech model TTS with support for setting tones. |
whisper-1 | speech-to-text | pay per volume |
Input Tokens: $0.06/K tokens Output Tokens: $0.06/K tokens |
Input magnification: 30 Complementary multiplier: 1 |
Whisper can transcribe speech to text and translate multiple languages into English. |
Embedding | Vector embedding | pay per volume |
Input Tokens: Official Multiplier x 3 Output Tokens. |
Input magnification: - Complementary multiplier: - |
All official all embedded models are supported. |
Other OpenAI base models | – | pay per volume |
Input Tokens: Aligned to official documentation Output Tokens. |
Input magnification: - Complementary multiplier: - |
All official models (except Fine-tuning, Assistants) are supported. |
OpenAI Plus
Model name | typology | Billing Type | Final price | clarification |
---|---|---|---|---|
gpt-4-gizmo | dialogues | volumetric billing | Input magnification: 15 Complementary magnification: 2 |
You can call all the GPTs in the official website, you need to get the id of the gpts, model: gpt-4-gizmo-(gizmo_id). how to get the gizmo_id: after creating the gpts with the gizmo_id, you can see the g-xxxx in the sharing link, e.g.: gpt-4-gizmog-123456. you can also go to the GPTs Or you can go to GPTs store and search for the gpts you want, copy the gizmo_id. |
gpt-4-all | dialogues | volumetric billing | Input magnification: 15 Complementary magnification: 2 |
GPT All model, a collection of official GPT-4, networking, map reading, drawing functions, code interpreter all in one, the file link can be put prompt any location. |
gpt-4o-all | dialogues | volumetric billing | Input magnification: 15 Complementary magnification: 2 |
GPT All model, a collection of official GPT-4, networking, map reading, drawing functions, code interpreter all in one, the file link can be put prompt any location. |
search-gpts | search | volumetric billing | Input magnification: 15 Complementary magnification: 2 |
Searches the gpts interface and returns the raw json data. |
search-gpts-chat | dialogues | volumetric billing | Input magnification: 15 Complementary magnification: 2 |
Chat search gpts, compatible with openai format, returns Markdown typeset data. |
Midjourney
Model name | typology | Billing Type | Final price | clarification |
---|---|---|---|---|
mj-chat | dialogues | per-passage billing | $0.25/session | Based on the agent implementation, it can be called directly Chat way, for example: prompt format (image URL + space + prompt + custom parameters): img_url prompt -niji 5 Local repainting is not supported. |
midjourney | sculpture | per-passage billing | – | Support all Midjourney operations Mj V6, local repainting, face change. Support switching painting mode fast, turbo, relax. can be accessed by yourself, support Midjourney proxy plus as well as Midjourney proxy interface protocol, if your project does not support the above way. Has integrated the picture of the domestic anti-generation + Discord domestic anti-generation + Chinese translation interface, the use of super 120 U.S. dollars official account to enhance concurrency. |
Suno
Model name | typology | Billing Type | Final price | clarification |
---|---|---|---|---|
suno-v3 | dialogues | per-passage billing | $0.3/session | SunoAI introduces the Bunsen burner model, which supports inspiration mode, lyrics mode, and generates pure music. One request generates 2 songs with the same lyrics and different styles, and also generates MVs and covers. The work is commercially available. OpenAI's Chat format has been integrated for calling, just like using gpt. Example question: Generate a rock song about love. |
suno-v3.5 | dialogues | per-passage billing | $0.3/session | SunoAI introduces the Bunsen burner model, which supports inspiration mode, lyrics mode, and generates pure music. One request generates 2 songs with the same lyrics and different styles, and also generates MVs and covers. The work is commercially available. OpenAI's Chat format has been integrated for calling, just like using gpt. Example question: Generate a rock song about love. |
suno_music | API Asynchronous Tasks | per-passage billing | $0.1/session | Suno API, you can create a Suno official website with the same platform, support all the operations of the official website. This model is for generating songs, support custom mode, inspiration mode, continuation, compatible with GoAmz. |
suno_lyrics | API Asynchronous Tasks | per-passage billing | $0.01/session | Suno API, you can create a Suno official website with the same platform, support all the operations of the official website. This model is for generating lyrics and is compatible with GoAmz. |
Luma
Model name | typology | Billing Type | Final price | clarification |
---|---|---|---|---|
luma-video | dialogues | per-passage billing | $0.2/session | Luma AI literate video, Chat format, supports uploading reference images, used in the same way as GPT 3.5, example question: dancing cat. |
luma_video_api | API Asynchronous Tasks | per-passage billing | $0.54/session | Luma AI API literate video, support for uploading 2 reference images; you can build the same platform as the Luma official website, support for all the operations of the official website; compatible with GoAmz; SVIP, SSVIP grouping for the paid account version, the price is slightly higher, the queue is prioritized without watermarks; VIP, VVIP grouping for the free account version, the price is slightly lower. |
Claude
Model name | typology | Billing Type | Final price | Model Price Calculation Multiplier | clarification |
---|---|---|---|---|---|
claude-3-opus-20240229 | dialogues | pay per volume | Input Tokens: $0.045/K tokens Output Tokens: $0.225/K tokens |
Input magnification: 22.5 Complementary magnification: 5 |
The latest version of the Claude model with state-of-the-art language processing, support for 200K contexts, reading images. |
claude-3-sonnet-20240229 | dialogues | pay per volume | Input Tokens: $0.009/K tokens Output Tokens: $0.045/K tokens |
Input magnification: 4.5 Complementary magnification: 5 |
The latest version of the Claude model with state-of-the-art language processing, support for 200K contexts, reading images. |
claude-3-5-sonnet-20240620 | dialogues | pay per volume | Input Tokens: $0.009/K tokens Output Tokens: $0.045/K tokens |
Input magnification: 4.5 Complementary magnification: 5 |
The latest version of the Claude model, claude-3-5, features state-of-the-art language processing, supports 200K contexts, and reads images. |
claude-3-haiku-20240307 | dialogues | pay per volume | Input Tokens: $0.002/K tokens Output Tokens: $0.01/K tokens |
Input magnification: 1 Complementary magnification: 5 |
The latest version of the Claude model with state-of-the-art language processing, support for 200K contexts, reading images. |
Moonshot
Model name | typology | Billing Type | Final price | Model Price Calculation Multiplier | clarification |
---|---|---|---|---|---|
moonshot-v1-128k | dialogues | pay per volume | Input Tokens: $0.12/K tokens Output Tokens: $0.12/K tokens |
Input magnification: 60 Complementary multiplier: 1 |
is a 100 billion parameter language model from Moonshot AI with excellent semantic understanding, command following, and text generation. It supports 128K context windows. |
moonshot-v1-32k | dialogues | pay per volume | Input Tokens: $0.048/K tokens Output Tokens: $0.048/K tokens |
Input magnification: 24 Complementary multiplier: 1 |
is a 100 billion parameter language model from Moonshot AI with excellent semantic understanding, command following, and text generation. It supports 32K context windows. |
moonshot-v1-8k | dialogues | pay per volume | Input Tokens: $0.024/K tokens Output Tokens: $0.024/K tokens |
Input magnification: 12 Complementary multiplier: 1 |
Moonshot AI has released a language model with hundreds of billions of parameters, with excellent semantic understanding, instruction following and text generation capabilities. It supports 8K context windows and is suitable for short text real-time interaction scenarios. |
Google Internet company
Model name | typology | Billing Type | Final price | Model Price Calculation Multiplier | clarification |
---|---|---|---|---|---|
gemini-1.0-pro-001 | dialogues | pay per volume | Input Tokens: $0.002/K tokens Output Tokens: $0.006/K tokens |
Input magnification: 1 Complementary magnification: 3 |
Google introduces the Gemini 1.0 family of models, which outperform state-of-the-art (SoTA) models in terms of multimodal AI capabilities. |
gemini-1.0-pro-vision-001 | dialogues | pay per volume | Input Tokens: $0.002/K tokens Output Tokens: $0.006/K tokens |
Input magnification: 1 Complementary magnification: 3 |
Google has introduced the Gemini 1.0 family of models, which outperforms state-of-the-art (SoTA) models in terms of multimodal AI capabilities and supports reading images. |
gemini-pro | dialogues | pay per volume | Input Tokens: $0.002/K tokens Output Tokens: $0.006/K tokens |
Input magnification: 1 Complementary magnification: 3 |
The most general, and would be the most widely used of Google's LLM models, this model could correspond to an OpenAI GPT-3.5 level model. |
gemini-pro-vision | dialogues | pay per volume | Input Tokens: $0.002/K tokens Output Tokens: $0.006/K tokens |
Input magnification: 1 Complementary magnification: 3 |
The most general, and would be the most widely used of Google's LLM models, this model could correspond to an OpenAI GPT-3.5 level model with support for reading images. |
gemini-1.0-pro-002 | dialogues | pay per volume | Input Tokens: $0.002/K tokens Output Tokens: $0.006/K tokens |
Input magnification: 1 Complementary magnification: 3 |
– |
gemini-1.5-pro | dialogues | pay per volume | Input Tokens: $0.008/K tokens Output Tokens: $0.024/K tokens |
Input Magnification: 4 Complementary magnification: 3 |
– |
gemini-1.5-flash-preview-0514 | dialogues | pay per volume | Input Tokens: $0.008/K tokens Output Tokens: $0.024/K tokens |
Input Magnification: 4 Complementary magnification: 3 |
– |
gemini-1.5-flash | dialogues | pay per volume | Input Tokens: $0.008/K tokens Output Tokens: $0.024/K tokens |
Input Magnification: 4 Complementary magnification: 3 |
– |
gemini-1.5-pro-preview-0514 | dialogues | pay per volume | Input Tokens: $0.008/K tokens Output Tokens: $0.024/K tokens |
Input Magnification: 4 Complementary magnification: 3 |
– |
gemini-experimental | dialogues | pay per volume | Input Tokens: $0.002/K tokens Output Tokens: $0.006/K tokens |
Input magnification: 1 Complementary magnification: 3 |
– |
google-palm | dialogues | per-passage billing | $0.006/session | – | – |
Baidu's online shop, zhidao.baidu.com
Model name | typology | Billing Type | Final price | Model Price Calculation Multiplier | clarification |
---|---|---|---|---|---|
ERNIE-Bot-4 | dialogues | pay per volume | Input Tokens: $0.258/K tokens Output Tokens: $0.258/K tokens | Input Multiplier: 129 Completion multiplier: 1 | Version 4.0 of Baidu's self-developed Wenxin industrial-grade knowledge-enhanced large language model realizes a comprehensive upgrade of the basic model, with significant improvements in comprehension, generation, logic and memory capabilities relative to ERNIE 3.5, and supports 5K input + 2K output. |
ERNIE-4.0-8K | dialogues | pay per volume | Input Tokens: $0.03408/K tokens Output Tokens: $0.03408/K tokens | Input Multiplier: 17.04 Completion Multiplier: 1 | Version 4.0 of Baidu's self-developed Wenxin industrial-grade knowledge-enhanced large language model realizes a comprehensive upgrade of the basic model, with significant improvements in comprehension, generation, logic and memory capabilities relative to ERNIE 3.5, and supports 5K input + 2K output. |
ERNIE-Lite-8K-0308 | dialogues | pay per volume | Input Tokens: $0.001/K tokens Output Tokens: $0.002/K tokens | Input Multiplier: 0.5 Completion multiplier: 2 | ERNIE-Bot-8K is Baidu's flagship large-scale language model, covering a huge amount of Chinese and English corpus, with a strong universal ability to meet most of the requirements of dialog Q&A, creation and plug-in application scenarios; it supports automatic docking of Baidu search plug-ins to ensure the timeliness of Q&A information, and supports 5K input + 2K output. |
ERNIE-Lite-8K-0922 | dialogues | pay per volume | Input Tokens: $0.0022/K tokens Output Tokens: $0.0044/K tokens | Input Multiplier: 1.1 Completion multiplier: 2 | Baidu's self-developed lightweight large language model, taking into account the excellent model effect and reasoning performance, suitable for low-computing power AI acceleration card reasoning use. |
ERNIE-Speed-128K | dialogues | pay per volume | Input Tokens: $0.0012/K tokens Output Tokens: $0.0024/K tokens | Input Multiplier: 0.6 Completion multiplier: 2 | Baidu's self-developed lightweight large language model, taking into account the excellent model effect and reasoning performance, suitable for low-computing power AI acceleration card reasoning use. |
ERNIE-Speed-8K | dialogues | pay per volume | Input Tokens: $0.0012/K tokens Output Tokens: $0.0024/K tokens | Input Multiplier: 0.6 Completion multiplier: 2 | Baidu's self-developed lightweight large language model, taking into account the excellent model effect and reasoning performance, suitable for low-computing power AI acceleration card reasoning use. |
Aritongyi, one of the indigenous peoples of Taiwan
Model name | typology | Billing Type | Final price | Model Price Calculation Multiplier | clarification |
---|---|---|---|---|---|
qwen-plus | dialogues | pay per volume | Input Tokens: $0.04/K tokens Output Tokens: $0.04/K tokens |
Input magnification: 20 Complementary multiplier: 1 |
Tongyi Thousand Questions is a super large-scale language model that supports input in different languages such as Chinese and English. Suitable for text creation, text processing, programming assistance, translation services, dialog simulation. |
qwen-turbo | dialogues | pay per volume | Input Tokens: $0.016/K tokens Output Tokens: $0.016/K tokens |
Input magnification: 8 Complementary multiplier: 1 |
Tongyi Thousand Questions is a super large-scale language model that supports input in different languages such as Chinese and English. Suitable for text creation, text processing, programming assistance, translation services, dialog simulation. |
qwen-max | dialogues | pay per volume | Input Tokens: $0.24/K tokens Output Tokens: $0.24/K tokens |
Input magnification: 120 Complementary multiplier: 1 |
Tongyi Qianqi 100 billion level super-large-scale language model, supporting Chinese, English and other different language input. Suitable for text creation, text processing, programming assistance, translation services, and dialog simulation. |
qwen-max-1201 | dialogues | pay per volume | Input Tokens: $0.24/K tokens Output Tokens: $0.24/K tokens |
Input magnification: 120 Complementary multiplier: 1 |
Tongyi Qianqi 100 billion level super-large-scale language model, supporting Chinese, English and other different language input. Suitable for text creation, text processing, programming assistance, translation services, and dialog simulation. |
qwen-max-longcontext | dialogues | pay per volume | Input Tokens: $0.24/K tokens Output Tokens: $0.24/K tokens |
Input magnification: 120 Complementary multiplier: 1 |
Tongyi Qianqi 100 billion level super-large-scale language model, supporting Chinese, English and other different language input. Suitable for text creation, text processing, programming assistance, translation services, and dialog simulation. |
lit. record wisdom and say clearly
Deepseek
Baichuan Intelligence
lit. zero-one million things
rapid telecommunications starburst
Bing Bing
360 AI
Tencent hybrid
Other models
lit. record wisdom and say clearly
Model name | typology | Billing Type | Final price | Model Price Calculation Multiplier | clarification |
---|---|---|---|---|---|
glm-3-turbo | dialogues | per-passage billing | $0.002/session | – | Wisdom Spectrum AI generalized large model for scenarios that require a high amount of knowledge, reasoning ability, and creativity, such as advertising copywriting, novel writing, knowledge-based writing, code generation, and so on. |
glm-4 | dialogues | pay per volume | Input Tokens: $0.06/K tokens Output Tokens: $0.06/K tokens |
Input magnification: 30 Complementary multiplier: 1 |
Smart Spectrum AI generalized big models that provide more powerful Q&A and text generation capabilities. Suitable for complex dialog interactions and deep content creation and design scenarios. |
glm-4v | dialogues | pay per volume | Input Tokens: $0.06/K tokens Output Tokens: $0.06/K tokens |
Input magnification: 30 Complementary multiplier: 1 |
The Smart Spectrum AI generalized large model, which realizes the deep fusion of visual language features, supports various types of image understanding tasks such as visual question and answer, image captioning, visual localization, and complex target detection. |
Deepseek
Model name | typology | Billing Type | Final price | Model Price Calculation Multiplier | clarification |
---|---|---|---|---|---|
deepseek-chat | dialogues | pay per volume | Input Tokens: $0.001/K tokens Output Tokens: $0.002/K tokens |
Input magnification: 0.5 Complementary magnification: 2 |
The strongest open source MoE model DeepSeek-V2, the world's first model to compete with GPT-4-Turbo in terms of code and math capabilities, and ranked second globally in several lists of code and math; DeepSeek is a liberal arts student; supports 32K contexts. |
deepseek-coder | dialogues | pay per volume | Input Tokens: $0.001/K tokens Output Tokens: $0.002/K tokens |
Input magnification: 0.5 Complementary magnification: 2 |
While possessing the world's leading code and math capabilities, DeepSeek-Coder-V2 also has good general performance, ranking in the first echelon of China in terms of Chinese and English general capabilities; DeepSeek-Coder is a science student; supports 32K contexts. |
Baichuan Intelligence
Model name | typology | Billing Type | Final price | Model Price Calculation Multiplier | clarification |
---|---|---|---|---|---|
Baichuan2-53B | dialogues | pay per volume | Input Tokens: $0.022/K tokens Output Tokens: $0.022/K tokens |
Input magnification: 11 Complementary multiplier: 1 |
Comprehensively upgrading the capabilities of Baichuan1-53B, Baichuan2-53B not only improves mathematical and logical reasoning significantly, but also greatly reduces modeling illusions through high-quality data systems and search enhancements. |
Baichuan2-Turbo | dialogues | pay per volume | Input Tokens: $0.012/K tokens Output Tokens: $0.012/K tokens |
Input magnification: 6 Complementary multiplier: 1 |
Baichuan Intelligence takes Baichuan2 big model as the core and deeply integrates search enhancement technology with the big model. |
EBaichuan2-Turbo-192k | dialogues | pay per volume | Input Tokens: $0.022/K tokens Output Tokens: $0.022/K tokens |
Input magnification: 11 Complementary multiplier: 1 |
With a 192k ultra-long context window, Baichuan Intelligence takes the Baichuan2 big model as the core and deeply integrates search enhancement technology with the big model. |
lit. zero-one million things
Model name | typology | Billing Type | Final price | Model Price Calculation Multiplier | clarification |
---|---|---|---|---|---|
yi-34b-chat-0205 | dialogues | pay per volume | Input Tokens: $0.006/K tokens Output Tokens: $0.006/K tokens |
Input magnification: 3 Complementary multiplier: 1 |
Based on the open-source version of the deeply optimized version, the command compliance ability is improved by nearly 30%, and the delay of model reply is greatly reduced. Applicable to chat, Q&A, dialog, collaboration, translation and other scenarios. |
yi-34b-chat-200k | dialogues | pay per volume | Input Tokens: $0.024/K tokens Output Tokens: $0.024/K tokens |
Input magnification: 12 Complementary multiplier: 1 |
200K long context, supports processing about 20w ~ 30w Chinese characters (about 1 Harry Potter book) or English words. It is suitable for multi-document content understanding, massive data analysis and mining, and cross-domain knowledge fusion applications. |
yi-vl-plus | dialogues | pay per volume | Input Tokens: $0.012/K tokens Output Tokens: $0.012/K tokens |
Input magnification: 6 Complementary multiplier: 1 |
Supports 1024*1024 high-resolution image input with image Q&A, chart comprehension, OCR, and visual reasoning capabilities. Suitable for content analysis of complex charts and screenshots, including information recognition, extraction, understanding, and reasoning. |
rapid telecommunications starburst
Model name | typology | Billing Type | Final price | Model Price Calculation Multiplier | clarification |
---|---|---|---|---|---|
SparkDesk | dialogues | pay per volume | Input Tokens: $0.04/K tokens Output Tokens: $0.04/K tokens |
Input magnification: 20 Complementary multiplier: 1 |
It is a new generation of cognitive intelligence big model launched by KU Xunfei, with cross-domain knowledge and language comprehension, able to understand and perform tasks based on natural dialog, providing language understanding, knowledge quiz, logical reasoning, math problem solving, code understanding. |
SparkDesk-v1.1 | dialogues | pay per volume | Input Tokens: $0.04/K tokens Output Tokens: $0.04/K tokens |
Input magnification: 20 Complementary multiplier: 1 |
It is a new generation of cognitive intelligence big model launched by KU Xunfei, with cross-domain knowledge and language comprehension, able to understand and perform tasks based on natural dialog, providing language understanding, knowledge quiz, logical reasoning, math problem solving, code understanding. |
SparkDesk-v2.1 | dialogues | pay per volume | Input Tokens: $0.04/K tokens Output Tokens: $0.04/K tokens |
Input magnification: 20 Complementary multiplier: 1 |
It is a new generation of cognitive intelligence big model launched by KU Xunfei, with cross-domain knowledge and language comprehension, able to understand and perform tasks based on natural dialog, providing language understanding, knowledge quiz, logical reasoning, math problem solving, code understanding. |
SparkDesk-v3.1 | dialogues | pay per volume | Input Tokens: $0.04/K tokens Output Tokens: $0.04/K tokens |
Input magnification: 20 Complementary multiplier: 1 |
It is a new generation of cognitive intelligence big model launched by KU Xunfei, with cross-domain knowledge and language comprehension, able to understand and perform tasks based on natural dialog, providing language understanding, knowledge quiz, logical reasoning, math problem solving, code understanding. |
SparkDesk-v3.5 | dialogues | pay per volume | Input Tokens: $0.04/K tokens Output Tokens: $0.04/K tokens |
Input magnification: 20 Complementary multiplier: 1 |
It is a new generation of cognitive intelligence big model launched by KU Xunfei, with cross-domain knowledge and language comprehension, able to understand and perform tasks based on natural dialog, providing language understanding, knowledge quiz, logical reasoning, math problem solving, code understanding. |
Bing Bing
Model name | typology | Billing Type | Final price | Model Price Calculation Multiplier | clarification |
---|---|---|---|---|---|
Precise | dialogues | pay per volume | Input Tokens: $0.01/K tokens Output Tokens: $0.01/K tokens |
Input magnification: 5 Complementary multiplier: 1 |
NewBing Precision Mode with networking capabilities and no painting. |
Balanced | dialogues | pay per volume | Input Tokens: $0.01/K tokens Output Tokens: $0.01/K tokens |
Input magnification: 5 Complementary multiplier: 1 |
NewBing Balanced mode with networking capabilities, no painting. |
Creative | dialogues | pay per volume | Input Tokens: $0.01/K tokens Output Tokens: $0.01/K tokens |
Input magnification: 5 Complementary multiplier: 1 |
NewBing Creation Mode, with networking capabilities, cannot draw. |
360 AI
Model name | typology | Billing Type | Final price | Model Price Calculation Multiplier | clarification |
---|---|---|---|---|---|
360GPT_S2_V9 | dialogues | pay per volume | Input Tokens: $0.024/K tokens Output Tokens: $0.024/K tokens |
Input magnification: 12 Complementary multiplier: 1 |
360Brain is a cognitive generalized big model independently developed and trained by 360. With 360's technical accumulation in the field of artificial intelligence and the first-mover advantage of large model training, 360 Brain now has a hundred billion parameter scale, with ten core capabilities such as generative creation, multi-round dialog, logical reasoning, and hundreds of segmented functions, which can cover all the scenarios of large model application. |
Tencent hybrid
Model name | typology | Billing Type | Final price | Model Price Calculation Multiplier | clarification |
---|---|---|---|---|---|
hunyuan | dialogues | pay per volume | Input Tokens: $0.2/K tokens Output Tokens: $0.2/K tokens |
Input magnification: 100 Complementary multiplier: 1 |
Developed by Tencent, the Big Language Model features powerful Chinese authoring capabilities, logical reasoning in complex contexts, and reliable task execution. |
Other models
Model name | typology | Billing Type | Final price | Model Price Calculation Multiplier | clarification |
---|---|---|---|---|---|
stable-diffusion | dialogues | per-passage billing | $0.01/session | – | Advanced image generation and processing model specializing in creating realistic visual effects. |
llama-3-70b | dialogues | per-passage billing | $0.01/session | – | The latest Meta Llama 3 model with 7 billion parameters. |
llama-3-8b | dialogues | per-passage billing | $0.01/session | – | The latest Meta Llama 3 model with 700 million parameters. |
llama-2-70b (13b, 7b) | dialogues | per-passage billing | $0.006/session | – | High-capacity Llama models for complex analytical and predictive tasks. |
code-llama-34b (13b, 7b) | dialogues | per-passage billing | $0.006/session | – | Llama model designed for programming and code analysis with advanced code understanding. |
mixtral-8x7b | dialogues | per-passage billing | $0.01/session | – | About the same as GPT 3.5. |
mistral-medium | dialogues | per-passage billing | $0.01/session | – | Close to gpt-4 performance, faster, 32k context. |
mixtral-8x22b | dialogues | per-passage billing | $0.01/session | – | 8×2.2 billion parameter Mixtral model for large-scale data analysis and machine learning tasks. |
chirp-v2-xxl-alpha | – | per-passage billing | $0.3/session | – | – |
chirp-v3-0 | – | per-passage billing | $0.3/session | – | – |
claude-1-100k | – | per-passage billing | $0.01/session | – | – |
claude-1.3-100k | – | per-passage billing | $0.01/session | – | – |
claude-2 | – | per-passage billing | $0.02/session | – | – |
claude-2-100k | – | per-passage billing | $0.02/session | – | – |
code-llama-13b | – | per-passage billing | $0.006/session | – | – |
code-llama-7b | – | per-passage billing | $0.006/session | – | – |
domo-img-to-video | – | per-passage billing | $0.6/session | – | – |
domo-video-to-video | – | per-passage billing | $0.6/session | – | – |
gpt-4-dalle | – | per-passage billing | $0.1/session | – | – |
gpt-4-v | – | per-passage billing | $0.1/session | – | – |
llama-2-13b | – | per-passage billing | $0.006/session | – | – |
llama-2-7b | – | per-passage billing | $0.006/session | – | – |
luma_video_download_api | – | per-passage billing | $0.001/session | – | – |
luma_video_extend_api | – | per-passage billing | $0.54/session | – | – |
net-gpt-4-0125-preview | – | per-passage billing | $0.1/session | – | – |
net-gpt-4-0314 | – | per-passage billing | $0.1/session | – | – |
net-gpt-4-0613</td> | – | per-passage billing | $0.1/session | – | – |
net-gpt-4-1106-preview | – | per-passage billing | $0.1/session | – | – |
net-gpt-4-32k | – | per-passage billing | $0.1/session | – | – |
net-gpt-4-turbo | – | per-passage billing | $0.1/session | – | – |
net-gpt-4-turbo-preview | – | per-passage billing | $0.1/session | – | – |
net-gpt-4o | – | per-passage billing | $0.6/session | – | – |
pika-text-to-video | – | per-passage billing | $0.5/session | – | – |
BLOOMZ-7B | – | pay per volume | Input Tokens: $0.0012/K tokens Output Tokens: $0.0012/K tokens | Input magnification: 0.6 Complementary multiplier: 1 | – |
Bing | – | pay per volume | Input Tokens: $0.01/K tokens Output Tokens: $0.01/K tokens | Input magnification: 5 Complementary multiplier: 1 | – |
ChatPro | – | pay per volume | Input Tokens: $0.028/K tokens Output Tokens: $0.028/K tokens | Input magnification: 14 Complementary multiplier: 1 | – |
ChatStd | – | pay per volume | Input Tokens: $0.0028/K tokens Output Tokens: $0.0028/K tokens | Input magnification: 1.4 Complementary multiplier: 1 | – |
ERNIE-3.5-4K-0205 | – | pay per volume | Input Tokens: $0.00341/K tokens Output Tokens: $0.00341/K tokens | Input magnification: 1.704 Complementary multiplier: 1 | – |
ERNIE-3.5-8K | – | pay per volume | Input Tokens: $0.00341/K tokens Output Tokens: $0.00341/K tokens | Input magnification: 1.704 Complementary multiplier: 1 | – |
ERNIE-3.5-8K-0205 | – | pay per volume | Input Tokens: $0.00682/K tokens Output Tokens: $0.00682/K tokens | Input magnification: 3.408 Complementary multiplier: 1 | – |
ERNIE-3.5-8K-1222 | – | pay per volume | Input Tokens: $0.00341/K tokens Output Tokens: $0.00341/K tokens | Input magnification: 1.704 Complementary multiplier: 1 | – |
ERNIE-Bot | – | pay per volume | Input Tokens: $0.024/K tokens Output Tokens: $0.024/K tokens | Input magnification: 12 Complementary multiplier: 1 | – |
ERNIE-Bot-8K | – | pay per volume | Input Tokens: $0.048/K tokens Output Tokens: $0.048/K tokens | Input magnification: 24 Complementary multiplier: 1 | – |
ERNIE-Bot-turbo | – | pay per volume | Input Tokens: $0.016/K tokens Output Tokens: $0.016/K tokens | Input magnification: 8 Complementary multiplier: 1 | – |
ERNIE-Tiny-8K | – | pay per volume | Input Tokens: $0.0003/K tokens Output Tokens: $0.0003/K tokens | Input magnification: 0.15 Complementary multiplier: 1 | – |
Embedding-V1 | – | pay per volume | Input Tokens: $0.00029/K tokens Output Tokens: $0.00029/K tokens | Input magnification: 0.1429 Complementary multiplier: 1 | – |
PaLM-2 | – | pay per volume | Input Tokens: $0.002/K tokens Output Tokens: $0.002/K tokens | Input magnification: 1 Complementary multiplier: 1 | – |
abab5.5 | – | pay per volume | Input Tokens: $0.02/K tokens Output Tokens: $0.02/K tokens | Input magnification: 10 Complementary multiplier: 1 | – |
abab5.5-chat | – | pay per volume | Input Tokens: $0.032/K tokens Output Tokens: $0.032/K tokens | Input magnification: 16 Complementary multiplier: 1 | – |
abab5.5s | – | pay per volume | Input Tokens: $0.0066/K tokens Output Tokens: $0.0066/K tokens | Input magnification: 3.3 1 | – |
abab5.5s-chat | – | pay per volume | Input Tokens: $0.01/K tokens Output Tokens: $0.01/K tokens | Input magnification: 5 Complementary multiplier: 1 | – |
abab6 | – | pay per volume | Input Tokens: $0.134/K tokens Output Tokens: $0.134/K tokens | Input magnification: 67 Complementary multiplier: 1 | – |
abab6-chat | – | pay per volume | Input Tokens: $0.204/K tokens Output Tokens: $0.204/K tokens | Input magnification: 102 Complementary multiplier: 1 | – |
ada | – | pay per volume | Input Tokens: $0.04/K tokens Output Tokens: $0.04/K tokens | Input magnification: 20 Complementary multiplier: 1 | – |
ali-stable-diffusion-v1.5 | – | pay per volume | Input Tokens: $0.032/K tokens Output Tokens: $0.032/K tokens | Input magnification: 16 Complementary multiplier: 1 | – |
ali-stable-diffusion-xl | – | pay per volume | Input Tokens: $0.032/K tokens Output Tokens: $0.032/K tokens | Input magnification: 16 Complementary multiplier: 1 | – |
babbage | – | pay per volume | Input Tokens: $0.02/K tokens Output Tokens: $0.02/K tokens | Input magnification: 10 Complementary multiplier: 1 | – |
bge-large-8k | – | pay per volume | Input Tokens: $0.0006/K tokens Output Tokens: $0.0006/K tokens | Input magnification: 0.3 Complementary multiplier: 1 | – |
bge-large-en | – | pay per volume | Input Tokens: $0.0006/K tokens Output Tokens: $0.0006/K tokens | Input magnification: 0.3 Complementary multiplier: 1 | – |
bge-large-zh | – | pay per volume | Input Tokens: $0.0006/K tokens Output Tokens: $0.0006/K tokens | Input magnification: 0.3 Complementary multiplier: 1 | – |
chatglm_lite | – | pay per volume | Input Tokens: $0.004/K tokens Output Tokens: $0.004/K tokens | Input magnification: 2 Complementary multiplier: 1 | – |
chatglm_pro | – | pay per volume | Input Tokens: $0.02/K tokens Output Tokens: $0.02/K tokens | Input magnification: 10 Complementary multiplier: 1 | – |
chatglm_std | – | pay per volume | Input Tokens: $0.01/K tokens Output Tokens: $0.01/K tokens | Input magnification: 5 Complementary multiplier: 1 | – |
chatglm_turbo | – | pay per volume | Input Tokens: $0.01/K tokens Output Tokens: $0.01/K tokens | Input magnification: 5 Complementary multiplier: 1 | – |
claude-instant-1 | – | pay per volume | Input Tokens: $0.00163/K tokens Output Tokens: $0.00489/K tokens | Input magnification: 0.815 Complementary magnification: 3 | – |
code-davinci-edit-001 | – | pay per volume | Input Tokens: $0.02/K tokens Output Tokens: $0.02/K tokens | Input magnification: 10 Complementary multiplier: 1 | – |
curie | – | pay per volume | Input Tokens: $0.02/K tokens Output Tokens: $0.02/K tokens | Input magnification: 10 Complementary multiplier: 1 | – |
dall-e-2 | – | pay per volume | Resolution Price 1024×1024 $0.045 512 x 512 $0.0405 256 x 256 $0.036 | Input magnification: 18 Complementary multiplier: 1 | – |
davinci | – | pay per volume | Input Tokens: $0.02/K tokens Output Tokens: $0.02/K tokens | Input magnification: 10 Complementary multiplier: 1 | – |
embedding-bert-512-v1 | – | pay per volume | Input Tokens: $0.00014/K tokens Output Tokens: $0.00014/K tokens | Input magnification: 0.0715 Complementary multiplier: 1 | – |
embedding_s1_v1 | – | pay per volume | Input Tokens: $0.00014/K tokens Output Tokens: $0.00014/K tokens | Input magnification: 0.0715 Complementary multiplier: 1 | – |
gemini-1.5-flash-latest | – | pay per volume | Input Tokens: $0.008/K tokens Output Tokens: $0.024/K tokens | Input Magnification: 4
| – |
gemini-1.5-pro-latest | – | pay per volume | Input Tokens: $0.008/K tokens Output Tokens: $0.024/K tokens | Input Magnification: 4 Complementary magnification: 3 | – |
gpt-4-1106-vision-preview | – | pay per volume | Input Tokens: $0.03/K tokens Output Tokens. | Input magnification: 15 Complementary multiplier: - | – |
net-gpt-3.5-turbo-0301 | – | pay per volume | Input Tokens: $0.0007/K tokens Output Tokens: $0.0007/K tokens | Input magnification: 0.35 Complementary multiplier: 1 | – |
net-gpt-3.5-turbo-0613 | – | pay per volume | Input Tokens: $0.0007/K tokens Output Tokens: $0.0007/K tokens | Input magnification: 0.35 Complementary multiplier: 1 | – |
net-gpt-3.5-turbo-16k | – | pay per volume | Input Tokens: $0.0007/K tokens Output Tokens: $0.0007/K tokens | Input magnification: 0.35 Complementary multiplier: 1 | – |
net-gpt-3.5-turbo-16k-0613 | – | pay per volume | Input Tokens: $0.0007/K tokens Output Tokens: $0.0007/K tokens | Input magnification: 0.35 Complementary multiplier: 1 | – |
net-gpt-3.5-turbo-instruct | – | pay per volume | Input Tokens: $0.0015/K tokens Output Tokens: $0.0015/K tokens | Input magnification: 0.75 Complementary multiplier: 1 | – |
semantic_similarity_s1_v1 | – | pay per volume | Input Tokens: $0.00014/K tokens Output Tokens: $0.00014/K tokens | Input magnification: 0.0715 Complementary multiplier: 1 | – |
text-ada-001 | – | pay per volume | Input Tokens: $0.0008/K tokens Output Tokens: $0.0008/K tokens | Input magnification: 0.4 Complementary multiplier: 1 | – |
text-babbage-001 | – | pay per volume | Input Tokens: $0.001/K tokens Output Tokens: $0.001/K tokens | Input magnification: 0.5 Complementary multiplier: 1 | – |
text-curie-001 | – | pay per volume | Input Tokens: $0.004/K tokens Output Tokens: $0.004/K tokens | Input magnification: 2 Complementary multiplier: 1 | – |
text-davinci-002 | – | pay per volume | Input Tokens: $0.04/K tokens Output Tokens: $0.04/K tokens | Input magnification: 20 Complementary multiplier: 1 | – |
text-davinci-003 | – | pay per volume | Input Tokens: $0.04/K tokens Output Tokens: $0.04/K tokens | Input magnification: 20 Complementary multiplier: 1 | – |
text-davinci-edit-001 | – | pay per volume | Input Tokens: $0.04/K tokens Output Tokens: $0.04/K tokens | Input magnification: 20 Complementary multiplier: 1 | – |
text-embedding-3-large | – | pay per volume | Input Tokens: $0.00054/K tokens Output Tokens: $0.00054/K tokens | Input magnification: 0.27 Complementary multiplier: 1 | – |
text-embedding-3-small | – | pay per volume | Input Tokens: $0.0002/K tokens Output Tokens: $0.0002/K tokens | Input magnification: 0.1 Complementary multiplier: 1 | – |
text-embedding-ada-002 | – | pay per volume | Input Tokens: $0.0004/K tokens Output Tokens: $0.0004/K tokens | Input magnification: 0.2 Complementary multiplier: 1 | – |
text-embedding-v1 | – | pay per volume | Input Tokens: $0.0004/K tokens Output Tokens: $0.0004/K tokens | Input magnification: 0.2 Complementary multiplier: 1 | – |
text-moderation-latest | – | pay per volume | Input Tokens: $0.0004/K tokens Output Tokens: $0.0004/K tokens | Input magnification: 0.2 Complementary multiplier: 1 | – |
text-moderation-stable | – | pay per volume | Input Tokens: $0.0004/K tokens Output Tokens: $0.0004/K tokens | Input magnification: 0.2 Complementary multiplier: 1 | – |
text-search-ada-doc-001 | – | pay per volume | Input Tokens: $0.04/K tokens Output Tokens: $0.04/K tokens | Input magnification: 20 | – |
“`