AI Models Prices
* Prices are per 1M tokens in USD
Model | Version | Provider | Pricing mode | Input price * | Output price * |
---|---|---|---|---|---|
omni-moderation-latest | omni-moderation-latest | openai | standard | 0.00 | 0.00 |
omni-moderation-latest-intents | omni-moderation-latest-intents | openai | standard | 0.00 | 0.00 |
omni-moderation-2024-09-26 | omni-moderation-2024-09-26 | openai | standard | 0.00 | 0.00 |
gpt-4 | gpt-4 | openai | standard | 30.00 | 60.00 |
gpt-4.1 | gpt-4.1 | openai | standard | 2.00 | 8.00 |
gpt-4.1 | gpt-4.1 | openai | batch | 1.00 | 4.00 |
gpt-4.1-2025-04-14 | gpt-4.1-2025-04-14 | openai | standard | 2.00 | 8.00 |
gpt-4.1-2025-04-14 | gpt-4.1-2025-04-14 | openai | batch | 1.00 | 4.00 |
gpt-4.1-mini | gpt-4.1-mini | openai | standard | 0.40 | 1.60 |
gpt-4.1-mini | gpt-4.1-mini | openai | batch | 0.20 | 0.80 |
gpt-4.1-mini-2025-04-14 | gpt-4.1-mini-2025-04-14 | openai | standard | 0.40 | 1.60 |
gpt-4.1-mini-2025-04-14 | gpt-4.1-mini-2025-04-14 | openai | batch | 0.20 | 0.80 |
gpt-4.1-nano | gpt-4.1-nano | openai | standard | 0.10 | 0.40 |
gpt-4.1-nano | gpt-4.1-nano | openai | batch | 0.05 | 0.20 |
gpt-4.1-nano-2025-04-14 | gpt-4.1-nano-2025-04-14 | openai | standard | 0.10 | 0.40 |
gpt-4.1-nano-2025-04-14 | gpt-4.1-nano-2025-04-14 | openai | batch | 0.05 | 0.20 |
gpt-4o | gpt-4o | openai | standard | 2.50 | 10.00 |
gpt-4o | gpt-4o | openai | batch | 1.25 | 5.00 |
watsonx/ibm/granite-3-8b-instruct | watsonx/ibm/granite-3-8b-instruct | watsonx | standard | 200.00 | 200.00 |
gpt-4o-search-preview-2025-03-11 | gpt-4o-search-preview-2025-03-11 | openai | standard | 2.50 | 10.00 |
gpt-4o-search-preview-2025-03-11 | gpt-4o-search-preview-2025-03-11 | openai | batch | 1.25 | 5.00 |
gpt-4o-search-preview | gpt-4o-search-preview | openai | standard | 2.50 | 10.00 |
gpt-4o-search-preview | gpt-4o-search-preview | openai | batch | 1.25 | 5.00 |
gpt-4.5-preview | gpt-4.5-preview | openai | standard | 75.00 | 150.00 |
gpt-4.5-preview | gpt-4.5-preview | openai | batch | 37.50 | 75.00 |
gpt-4.5-preview-2025-02-27 | gpt-4.5-preview-2025-02-27 | openai | standard | 75.00 | 150.00 |
gpt-4.5-preview-2025-02-27 | gpt-4.5-preview-2025-02-27 | openai | batch | 37.50 | 75.00 |
gpt-4o-audio-preview | gpt-4o-audio-preview | openai | standard | 2.50 | 10.00 |
gpt-4o-audio-preview-2024-12-17 | gpt-4o-audio-preview-2024-12-17 | openai | standard | 2.50 | 10.00 |
gpt-4o-audio-preview-2024-10-01 | gpt-4o-audio-preview-2024-10-01 | openai | standard | 2.50 | 10.00 |
gpt-4o-mini-audio-preview | gpt-4o-mini-audio-preview | openai | standard | 0.15 | 0.60 |
gpt-4o-mini-audio-preview-2024-12-17 | gpt-4o-mini-audio-preview-2024-12-17 | openai | standard | 0.15 | 0.60 |
gpt-4o-mini | gpt-4o-mini | openai | standard | 0.15 | 0.60 |
gpt-4o-mini | gpt-4o-mini | openai | batch | 0.08 | 0.30 |
gpt-4o-mini-search-preview-2025-03-11 | gpt-4o-mini-search-preview-2025-03-11 | openai | standard | 0.15 | 0.60 |
gpt-4o-mini-search-preview-2025-03-11 | gpt-4o-mini-search-preview-2025-03-11 | openai | batch | 0.08 | 0.30 |
gpt-4o-mini-search-preview | gpt-4o-mini-search-preview | openai | standard | 0.15 | 0.60 |
gpt-4o-mini-search-preview | gpt-4o-mini-search-preview | openai | batch | 0.08 | 0.30 |
gpt-4o-mini-2024-07-18 | gpt-4o-mini-2024-07-18 | openai | standard | 0.15 | 0.60 |
gpt-4o-mini-2024-07-18 | gpt-4o-mini-2024-07-18 | openai | batch | 0.08 | 0.30 |
codex-mini-latest | codex-mini-latest | openai | standard | 1.50 | 6.00 |
o1-pro | o1-pro | openai | standard | 150.00 | 600.00 |
o1-pro | o1-pro | openai | batch | 75.00 | 300.00 |
o1-pro-2025-03-19 | o1-pro-2025-03-19 | openai | standard | 150.00 | 600.00 |
o1-pro-2025-03-19 | o1-pro-2025-03-19 | openai | batch | 75.00 | 300.00 |
o1 | o1 | openai | standard | 15.00 | 60.00 |
o1-mini | o1-mini | openai | standard | 1.10 | 4.40 |
computer-use-preview | computer-use-preview | azure | standard | 3.00 | 12.00 |
o3 | o3 | openai | standard | 10.00 | 40.00 |
o3-2025-04-16 | o3-2025-04-16 | openai | standard | 10.00 | 40.00 |
o3-mini | o3-mini | openai | standard | 1.10 | 4.40 |
o3-mini-2025-01-31 | o3-mini-2025-01-31 | openai | standard | 1.10 | 4.40 |
o4-mini | o4-mini | openai | standard | 1.10 | 4.40 |
o4-mini-2025-04-16 | o4-mini-2025-04-16 | openai | standard | 1.10 | 4.40 |
o1-mini-2024-09-12 | o1-mini-2024-09-12 | openai | standard | 3.00 | 12.00 |
o1-preview | o1-preview | openai | standard | 15.00 | 60.00 |
o1-preview-2024-09-12 | o1-preview-2024-09-12 | openai | standard | 15.00 | 60.00 |
o1-2024-12-17 | o1-2024-12-17 | openai | standard | 15.00 | 60.00 |
chatgpt-4o-latest | chatgpt-4o-latest | openai | standard | 5.00 | 15.00 |
gpt-4o-2024-05-13 | gpt-4o-2024-05-13 | openai | standard | 5.00 | 15.00 |
gpt-4o-2024-05-13 | gpt-4o-2024-05-13 | openai | batch | 2.50 | 7.50 |
gpt-4o-2024-08-06 | gpt-4o-2024-08-06 | openai | standard | 2.50 | 10.00 |
gpt-4o-2024-08-06 | gpt-4o-2024-08-06 | openai | batch | 1.25 | 5.00 |
gpt-4o-2024-11-20 | gpt-4o-2024-11-20 | openai | standard | 2.50 | 10.00 |
gpt-4o-2024-11-20 | gpt-4o-2024-11-20 | openai | batch | 1.25 | 5.00 |
gpt-4o-realtime-preview-2024-10-01 | gpt-4o-realtime-preview-2024-10-01 | openai | standard | 5.00 | 20.00 |
gpt-4o-realtime-preview | gpt-4o-realtime-preview | openai | standard | 5.00 | 20.00 |
gpt-4o-realtime-preview-2024-12-17 | gpt-4o-realtime-preview-2024-12-17 | openai | standard | 5.00 | 20.00 |
gpt-4o-mini-realtime-preview | gpt-4o-mini-realtime-preview | openai | standard | 0.60 | 2.40 |
gpt-4o-mini-realtime-preview-2024-12-17 | gpt-4o-mini-realtime-preview-2024-12-17 | openai | standard | 0.60 | 2.40 |
gpt-4-turbo-preview | gpt-4-turbo-preview | openai | standard | 10.00 | 30.00 |
gpt-4-0314 | gpt-4-0314 | openai | standard | 30.00 | 60.00 |
gpt-4-0613 | gpt-4-0613 | openai | standard | 30.00 | 60.00 |
gpt-4-32k | gpt-4-32k | openai | standard | 60.00 | 120.00 |
gpt-4-32k-0314 | gpt-4-32k-0314 | openai | standard | 60.00 | 120.00 |
gpt-4-32k-0613 | gpt-4-32k-0613 | openai | standard | 60.00 | 120.00 |
gpt-4-turbo | gpt-4-turbo | openai | standard | 10.00 | 30.00 |
gpt-4-turbo-2024-04-09 | gpt-4-turbo-2024-04-09 | openai | standard | 10.00 | 30.00 |
gpt-4-1106-preview | gpt-4-1106-preview | openai | standard | 10.00 | 30.00 |
gpt-4-0125-preview | gpt-4-0125-preview | openai | standard | 10.00 | 30.00 |
gpt-4-vision-preview | gpt-4-vision-preview | openai | standard | 10.00 | 30.00 |
gpt-4-1106-vision-preview | gpt-4-1106-vision-preview | openai | standard | 10.00 | 30.00 |
gpt-3.5-turbo | gpt-3.5-turbo | openai | standard | 1.50 | 2.00 |
gpt-3.5-turbo-0301 | gpt-3.5-turbo-0301 | openai | standard | 1.50 | 2.00 |
gpt-3.5-turbo-0613 | gpt-3.5-turbo-0613 | openai | standard | 1.50 | 2.00 |
gpt-3.5-turbo-1106 | gpt-3.5-turbo-1106 | openai | standard | 1.00 | 2.00 |
gpt-3.5-turbo-0125 | gpt-3.5-turbo-0125 | openai | standard | 0.50 | 1.50 |
gpt-3.5-turbo-16k | gpt-3.5-turbo-16k | openai | standard | 3.00 | 4.00 |
gpt-3.5-turbo-16k-0613 | gpt-3.5-turbo-16k-0613 | openai | standard | 3.00 | 4.00 |
ft:gpt-3.5-turbo | ft:gpt-3.5-turbo | openai | standard | 3.00 | 6.00 |
ft:gpt-3.5-turbo | ft:gpt-3.5-turbo | openai | batch | 1.50 | 3.00 |
ft:gpt-3.5-turbo-0125 | ft:gpt-3.5-turbo-0125 | openai | standard | 3.00 | 6.00 |
ft:gpt-3.5-turbo-1106 | ft:gpt-3.5-turbo-1106 | openai | standard | 3.00 | 6.00 |
ft:gpt-3.5-turbo-0613 | ft:gpt-3.5-turbo-0613 | openai | standard | 3.00 | 6.00 |
ft:gpt-4-0613 | ft:gpt-4-0613 | openai | standard | 30.00 | 60.00 |
ft:gpt-4o-2024-08-06 | ft:gpt-4o-2024-08-06 | openai | standard | 3.75 | 15.00 |
ft:gpt-4o-2024-08-06 | ft:gpt-4o-2024-08-06 | openai | batch | 1.88 | 7.50 |
ft:gpt-4o-2024-11-20 | ft:gpt-4o-2024-11-20 | openai | standard | 3.75 | 15.00 |
ft:gpt-4o-mini-2024-07-18 | ft:gpt-4o-mini-2024-07-18 | openai | standard | 0.30 | 1.20 |
ft:gpt-4o-mini-2024-07-18 | ft:gpt-4o-mini-2024-07-18 | openai | batch | 0.15 | 0.60 |
ft:davinci-002 | ft:davinci-002 | text-completion-openai | standard | 2.00 | 2.00 |
ft:davinci-002 | ft:davinci-002 | text-completion-openai | batch | 1.00 | 1.00 |
ft:babbage-002 | ft:babbage-002 | text-completion-openai | standard | 0.40 | 0.40 |
ft:babbage-002 | ft:babbage-002 | text-completion-openai | batch | 0.20 | 0.20 |
text-embedding-3-large | text-embedding-3-large | openai | standard | 0.13 | 0.00 |
text-embedding-3-large | text-embedding-3-large | openai | batch | 0.07 | 0.00 |
text-embedding-3-small | text-embedding-3-small | openai | standard | 0.02 | 0.00 |
text-embedding-3-small | text-embedding-3-small | openai | batch | 0.01 | 0.00 |
text-embedding-ada-002 | text-embedding-ada-002 | openai | standard | 0.10 | 0.00 |
text-embedding-ada-002-v2 | text-embedding-ada-002-v2 | openai | standard | 0.10 | 0.00 |
text-embedding-ada-002-v2 | text-embedding-ada-002-v2 | openai | batch | 0.05 | 0.00 |
text-moderation-stable | text-moderation-stable | openai | standard | 0.00 | 0.00 |
text-moderation-007 | text-moderation-007 | openai | standard | 0.00 | 0.00 |
text-moderation-latest | text-moderation-latest | openai | standard | 0.00 | 0.00 |
gpt-4o-transcribe | gpt-4o-transcribe | openai | standard | 2.50 | 10.00 |
gpt-4o-mini-transcribe | gpt-4o-mini-transcribe | openai | standard | 1.25 | 5.00 |
gpt-4o-mini-tts | gpt-4o-mini-tts | openai | standard | 2.50 | 10.00 |
azure/gpt-4o-mini-tts | azure/gpt-4o-mini-tts | azure | standard | 2.50 | 10.00 |
azure/computer-use-preview | azure/computer-use-preview | azure | standard | 3.00 | 12.00 |
azure/gpt-4o-audio-preview-2024-12-17 | azure/gpt-4o-audio-preview-2024-12-17 | azure | standard | 2.50 | 10.00 |
azure/gpt-4o-mini-audio-preview-2024-12-17 | azure/gpt-4o-mini-audio-preview-2024-12-17 | azure | standard | 2.50 | 10.00 |
azure/gpt-4.1 | azure/gpt-4.1 | azure | standard | 2.00 | 8.00 |
azure/gpt-4.1 | azure/gpt-4.1 | azure | batch | 1.00 | 4.00 |
azure/gpt-4.1-2025-04-14 | azure/gpt-4.1-2025-04-14 | azure | standard | 2.00 | 8.00 |
azure/gpt-4.1-2025-04-14 | azure/gpt-4.1-2025-04-14 | azure | batch | 1.00 | 4.00 |
azure/gpt-4.1-mini | azure/gpt-4.1-mini | azure | standard | 0.40 | 1.60 |
azure/gpt-4.1-mini | azure/gpt-4.1-mini | azure | batch | 0.20 | 0.80 |
azure/gpt-4.1-mini-2025-04-14 | azure/gpt-4.1-mini-2025-04-14 | azure | standard | 0.40 | 1.60 |
azure/gpt-4.1-mini-2025-04-14 | azure/gpt-4.1-mini-2025-04-14 | azure | batch | 0.20 | 0.80 |
azure/gpt-4.1-nano | azure/gpt-4.1-nano | azure | standard | 0.10 | 0.40 |
azure/gpt-4.1-nano | azure/gpt-4.1-nano | azure | batch | 0.05 | 0.20 |
azure/gpt-4.1-nano-2025-04-14 | azure/gpt-4.1-nano-2025-04-14 | azure | standard | 0.10 | 0.40 |
azure/gpt-4.1-nano-2025-04-14 | azure/gpt-4.1-nano-2025-04-14 | azure | batch | 0.05 | 0.20 |
azure/o3 | azure/o3 | azure | standard | 10.00 | 40.00 |
azure/o3-2025-04-16 | azure/o3-2025-04-16 | azure | standard | 10.00 | 40.00 |
azure/o4-mini | azure/o4-mini | azure | standard | 1.10 | 4.40 |
azure/gpt-4o-mini-realtime-preview-2024-12-17 | azure/gpt-4o-mini-realtime-preview-2024-12-17 | azure | standard | 0.60 | 2.40 |
azure/eu/gpt-4o-mini-realtime-preview-2024-12-17 | azure/eu/gpt-4o-mini-realtime-preview-2024-12-17 | azure | standard | 0.66 | 2.64 |
azure/us/gpt-4o-mini-realtime-preview-2024-12-17 | azure/us/gpt-4o-mini-realtime-preview-2024-12-17 | azure | standard | 0.66 | 2.64 |
azure/gpt-4o-realtime-preview-2024-12-17 | azure/gpt-4o-realtime-preview-2024-12-17 | azure | standard | 5.00 | 20.00 |
azure/us/gpt-4o-realtime-preview-2024-12-17 | azure/us/gpt-4o-realtime-preview-2024-12-17 | azure | standard | 5.50 | 22.00 |
azure/eu/gpt-4o-realtime-preview-2024-12-17 | azure/eu/gpt-4o-realtime-preview-2024-12-17 | azure | standard | 5.50 | 22.00 |
azure/gpt-4o-realtime-preview-2024-10-01 | azure/gpt-4o-realtime-preview-2024-10-01 | azure | standard | 5.00 | 20.00 |
azure/us/gpt-4o-realtime-preview-2024-10-01 | azure/us/gpt-4o-realtime-preview-2024-10-01 | azure | standard | 5.50 | 22.00 |
azure/eu/gpt-4o-realtime-preview-2024-10-01 | azure/eu/gpt-4o-realtime-preview-2024-10-01 | azure | standard | 5.50 | 22.00 |
azure/o4-mini-2025-04-16 | azure/o4-mini-2025-04-16 | azure | standard | 1.10 | 4.40 |
azure/o3-mini-2025-01-31 | azure/o3-mini-2025-01-31 | azure | standard | 1.10 | 4.40 |
azure/us/o3-mini-2025-01-31 | azure/us/o3-mini-2025-01-31 | azure | standard | 1.21 | 4.84 |
azure/us/o3-mini-2025-01-31 | azure/us/o3-mini-2025-01-31 | azure | batch | 0.61 | 2.42 |
azure/eu/o3-mini-2025-01-31 | azure/eu/o3-mini-2025-01-31 | azure | standard | 1.21 | 4.84 |
azure/eu/o3-mini-2025-01-31 | azure/eu/o3-mini-2025-01-31 | azure | batch | 0.61 | 2.42 |
azure/o3-mini | azure/o3-mini | azure | standard | 1.10 | 4.40 |
azure/o1-mini | azure/o1-mini | azure | standard | 1.21 | 4.84 |
azure/o1-mini-2024-09-12 | azure/o1-mini-2024-09-12 | azure | standard | 1.10 | 4.40 |
azure/us/o1-mini-2024-09-12 | azure/us/o1-mini-2024-09-12 | azure | standard | 1.21 | 4.84 |
azure/us/o1-mini-2024-09-12 | azure/us/o1-mini-2024-09-12 | azure | batch | 0.61 | 2.42 |
azure/eu/o1-mini-2024-09-12 | azure/eu/o1-mini-2024-09-12 | azure | standard | 1.21 | 4.84 |
azure/eu/o1-mini-2024-09-12 | azure/eu/o1-mini-2024-09-12 | azure | batch | 0.61 | 2.42 |
azure/o1 | azure/o1 | azure | standard | 15.00 | 60.00 |
azure/o1-2024-12-17 | azure/o1-2024-12-17 | azure | standard | 15.00 | 60.00 |
azure/us/o1-2024-12-17 | azure/us/o1-2024-12-17 | azure | standard | 16.50 | 66.00 |
azure/eu/o1-2024-12-17 | azure/eu/o1-2024-12-17 | azure | standard | 16.50 | 66.00 |
azure/codex-mini-latest | azure/codex-mini-latest | azure | standard | 1.50 | 6.00 |
azure/o1-preview | azure/o1-preview | azure | standard | 15.00 | 60.00 |
azure/o1-preview-2024-09-12 | azure/o1-preview-2024-09-12 | azure | standard | 15.00 | 60.00 |
azure/us/o1-preview-2024-09-12 | azure/us/o1-preview-2024-09-12 | azure | standard | 16.50 | 66.00 |
azure/eu/o1-preview-2024-09-12 | azure/eu/o1-preview-2024-09-12 | azure | standard | 16.50 | 66.00 |
azure/gpt-4.5-preview | azure/gpt-4.5-preview | azure | standard | 75.00 | 150.00 |
azure/gpt-4.5-preview | azure/gpt-4.5-preview | azure | batch | 37.50 | 75.00 |
azure/gpt-4o | azure/gpt-4o | azure | standard | 2.50 | 10.00 |
azure/global/gpt-4o-2024-11-20 | azure/global/gpt-4o-2024-11-20 | azure | standard | 2.50 | 10.00 |
azure/gpt-4o-2024-08-06 | azure/gpt-4o-2024-08-06 | azure | standard | 2.50 | 10.00 |
azure/global/gpt-4o-2024-08-06 | azure/global/gpt-4o-2024-08-06 | azure | standard | 2.50 | 10.00 |
azure/gpt-4o-2024-11-20 | azure/gpt-4o-2024-11-20 | azure | standard | 2.75 | 11.00 |
azure/us/gpt-4o-2024-11-20 | azure/us/gpt-4o-2024-11-20 | azure | standard | 2.75 | 11.00 |
azure/eu/gpt-4o-2024-11-20 | azure/eu/gpt-4o-2024-11-20 | azure | standard | 2.75 | 11.00 |
azure/gpt-4o-2024-05-13 | azure/gpt-4o-2024-05-13 | azure | standard | 5.00 | 15.00 |
azure/global-standard/gpt-4o-2024-08-06 | azure/global-standard/gpt-4o-2024-08-06 | azure | standard | 2.50 | 10.00 |
azure/us/gpt-4o-2024-08-06 | azure/us/gpt-4o-2024-08-06 | azure | standard | 2.75 | 11.00 |
azure/eu/gpt-4o-2024-08-06 | azure/eu/gpt-4o-2024-08-06 | azure | standard | 2.75 | 11.00 |
azure/global-standard/gpt-4o-2024-11-20 | azure/global-standard/gpt-4o-2024-11-20 | azure | standard | 2.50 | 10.00 |
azure/global-standard/gpt-4o-mini | azure/global-standard/gpt-4o-mini | azure | standard | 0.15 | 0.60 |
azure/gpt-4o-mini | azure/gpt-4o-mini | azure | standard | 0.17 | 0.66 |
azure/gpt-4o-mini-2024-07-18 | azure/gpt-4o-mini-2024-07-18 | azure | standard | 0.17 | 0.66 |
azure/us/gpt-4o-mini-2024-07-18 | azure/us/gpt-4o-mini-2024-07-18 | azure | standard | 0.17 | 0.66 |
azure/eu/gpt-4o-mini-2024-07-18 | azure/eu/gpt-4o-mini-2024-07-18 | azure | standard | 0.17 | 0.66 |
azure/gpt-4-turbo-2024-04-09 | azure/gpt-4-turbo-2024-04-09 | azure | standard | 10.00 | 30.00 |
azure/gpt-4-0125-preview | azure/gpt-4-0125-preview | azure | standard | 10.00 | 30.00 |
azure/gpt-4-1106-preview | azure/gpt-4-1106-preview | azure | standard | 10.00 | 30.00 |
azure/gpt-4-0613 | azure/gpt-4-0613 | azure | standard | 30.00 | 60.00 |
azure/gpt-4-32k-0613 | azure/gpt-4-32k-0613 | azure | standard | 60.00 | 120.00 |
azure/gpt-4-32k | azure/gpt-4-32k | azure | standard | 60.00 | 120.00 |
azure/gpt-4 | azure/gpt-4 | azure | standard | 30.00 | 60.00 |
azure/gpt-4-turbo | azure/gpt-4-turbo | azure | standard | 10.00 | 30.00 |
azure/gpt-4-turbo-vision-preview | azure/gpt-4-turbo-vision-preview | azure | standard | 10.00 | 30.00 |
azure/gpt-35-turbo-16k-0613 | azure/gpt-35-turbo-16k-0613 | azure | standard | 3.00 | 4.00 |
azure/gpt-35-turbo-1106 | azure/gpt-35-turbo-1106 | azure | standard | 1.00 | 2.00 |
azure/gpt-35-turbo-0613 | azure/gpt-35-turbo-0613 | azure | standard | 1.50 | 2.00 |
azure/gpt-35-turbo-0301 | azure/gpt-35-turbo-0301 | azure | standard | 0.20 | 2.00 |
azure/gpt-35-turbo-0125 | azure/gpt-35-turbo-0125 | azure | standard | 0.50 | 1.50 |
azure/gpt-3.5-turbo-0125 | azure/gpt-3.5-turbo-0125 | azure | standard | 0.50 | 1.50 |
azure/gpt-35-turbo-16k | azure/gpt-35-turbo-16k | azure | standard | 3.00 | 4.00 |
azure/gpt-35-turbo | azure/gpt-35-turbo | azure | standard | 0.50 | 1.50 |
azure/gpt-3.5-turbo | azure/gpt-3.5-turbo | azure | standard | 0.50 | 1.50 |
azure/gpt-3.5-turbo-instruct-0914 | azure/gpt-3.5-turbo-instruct-0914 | azure_text | standard | 1.50 | 2.00 |
azure/gpt-35-turbo-instruct | azure/gpt-35-turbo-instruct | azure_text | standard | 1.50 | 2.00 |
azure/gpt-35-turbo-instruct-0914 | azure/gpt-35-turbo-instruct-0914 | azure_text | standard | 1.50 | 2.00 |
azure/mistral-large-latest | azure/mistral-large-latest | azure | standard | 8.00 | 24.00 |
azure/mistral-large-2402 | azure/mistral-large-2402 | azure | standard | 8.00 | 24.00 |
azure/command-r-plus | azure/command-r-plus | azure | standard | 3.00 | 15.00 |
azure/ada | azure/ada | azure | standard | 0.10 | 0.00 |
azure/text-embedding-ada-002 | azure/text-embedding-ada-002 | azure | standard | 0.10 | 0.00 |
azure/text-embedding-3-large | azure/text-embedding-3-large | azure | standard | 0.13 | 0.00 |
azure/text-embedding-3-small | azure/text-embedding-3-small | azure | standard | 0.02 | 0.00 |
azure/standard/1024-x-1024/dall-e-3 | azure/standard/1024-x-1024/dall-e-3 | azure | standard | - | 0.00 |
azure/hd/1024-x-1024/dall-e-3 | azure/hd/1024-x-1024/dall-e-3 | azure | standard | - | 0.00 |
azure/standard/1024-x-1792/dall-e-3 | azure/standard/1024-x-1792/dall-e-3 | azure | standard | - | 0.00 |
azure/standard/1792-x-1024/dall-e-3 | azure/standard/1792-x-1024/dall-e-3 | azure | standard | - | 0.00 |
azure/hd/1024-x-1792/dall-e-3 | azure/hd/1024-x-1792/dall-e-3 | azure | standard | - | 0.00 |
azure/hd/1792-x-1024/dall-e-3 | azure/hd/1792-x-1024/dall-e-3 | azure | standard | - | 0.00 |
azure/standard/1024-x-1024/dall-e-2 | azure/standard/1024-x-1024/dall-e-2 | azure | standard | - | 0.00 |
azure_ai/deepseek-r1 | azure_ai/deepseek-r1 | azure_ai | standard | 1.35 | 5.40 |
azure_ai/deepseek-v3 | azure_ai/deepseek-v3 | azure_ai | standard | 1.14 | 4.56 |
azure_ai/deepseek-v3-0324 | azure_ai/deepseek-v3-0324 | azure_ai | standard | 1.14 | 4.56 |
azure_ai/jamba-instruct | azure_ai/jamba-instruct | azure_ai | standard | 0.50 | 0.70 |
azure_ai/mistral-nemo | azure_ai/mistral-nemo | azure_ai | standard | 0.15 | 0.15 |
azure_ai/mistral-medium-2505 | azure_ai/mistral-medium-2505 | azure_ai | standard | 0.40 | 2.00 |
azure_ai/mistral-large | azure_ai/mistral-large | azure_ai | standard | 4.00 | 12.00 |
azure_ai/mistral-small | azure_ai/mistral-small | azure_ai | standard | 1.00 | 3.00 |
azure_ai/mistral-small-2503 | azure_ai/mistral-small-2503 | azure_ai | standard | 1.00 | 3.00 |
azure_ai/mistral-large-2407 | azure_ai/mistral-large-2407 | azure_ai | standard | 2.00 | 6.00 |
azure_ai/mistral-large-latest | azure_ai/mistral-large-latest | azure_ai | standard | 2.00 | 6.00 |
azure_ai/ministral-3b | azure_ai/ministral-3b | azure_ai | standard | 0.04 | 0.04 |
azure_ai/Llama-3.2-11B-Vision-Instruct | azure_ai/Llama-3.2-11B-Vision-Instruct | azure_ai | standard | 0.37 | 0.37 |
azure_ai/Llama-3.3-70B-Instruct | azure_ai/Llama-3.3-70B-Instruct | azure_ai | standard | 0.71 | 0.71 |
azure_ai/Llama-4-Scout-17B-16E-Instruct | azure_ai/Llama-4-Scout-17B-16E-Instruct | azure_ai | standard | 0.20 | 0.78 |
azure_ai/Llama-4-Maverick-17B-128E-Instruct-FP8 | azure_ai/Llama-4-Maverick-17B-128E-Instruct-FP8 | azure_ai | standard | 1.41 | 0.35 |
azure_ai/Llama-3.2-90B-Vision-Instruct | azure_ai/Llama-3.2-90B-Vision-Instruct | azure_ai | standard | 2.04 | 2.04 |
azure_ai/Meta-Llama-3-70B-Instruct | azure_ai/Meta-Llama-3-70B-Instruct | azure_ai | standard | 1.10 | 0.37 |
azure_ai/Meta-Llama-3.1-8B-Instruct | azure_ai/Meta-Llama-3.1-8B-Instruct | azure_ai | standard | 0.30 | 0.61 |
azure_ai/Meta-Llama-3.1-70B-Instruct | azure_ai/Meta-Llama-3.1-70B-Instruct | azure_ai | standard | 2.68 | 3.54 |
azure_ai/Meta-Llama-3.1-405B-Instruct | azure_ai/Meta-Llama-3.1-405B-Instruct | azure_ai | standard | 5.33 | 16.00 |
azure_ai/Phi-4-mini-instruct | azure_ai/Phi-4-mini-instruct | azure_ai | standard | 0.08 | 0.30 |
azure_ai/Phi-4-multimodal-instruct | azure_ai/Phi-4-multimodal-instruct | azure_ai | standard | 0.08 | 0.32 |
azure_ai/Phi-4 | azure_ai/Phi-4 | azure_ai | standard | 0.13 | 0.50 |
azure_ai/Phi-3.5-mini-instruct | azure_ai/Phi-3.5-mini-instruct | azure_ai | standard | 0.13 | 0.52 |
azure_ai/Phi-3.5-vision-instruct | azure_ai/Phi-3.5-vision-instruct | azure_ai | standard | 0.13 | 0.52 |
azure_ai/Phi-3.5-MoE-instruct | azure_ai/Phi-3.5-MoE-instruct | azure_ai | standard | 0.16 | 0.64 |
azure_ai/Phi-3-mini-4k-instruct | azure_ai/Phi-3-mini-4k-instruct | azure_ai | standard | 0.13 | 0.52 |
azure_ai/Phi-3-mini-128k-instruct | azure_ai/Phi-3-mini-128k-instruct | azure_ai | standard | 0.13 | 0.52 |
azure_ai/Phi-3-small-8k-instruct | azure_ai/Phi-3-small-8k-instruct | azure_ai | standard | 0.15 | 0.60 |
azure_ai/Phi-3-small-128k-instruct | azure_ai/Phi-3-small-128k-instruct | azure_ai | standard | 0.15 | 0.60 |
azure_ai/Phi-3-medium-4k-instruct | azure_ai/Phi-3-medium-4k-instruct | azure_ai | standard | 0.17 | 0.68 |
azure_ai/Phi-3-medium-128k-instruct | azure_ai/Phi-3-medium-128k-instruct | azure_ai | standard | 0.17 | 0.68 |
azure_ai/cohere-rerank-v3-multilingual | azure_ai/cohere-rerank-v3-multilingual | azure_ai | standard | 0.00 | 0.00 |
azure_ai/cohere-rerank-v3-english | azure_ai/cohere-rerank-v3-english | azure_ai | standard | 0.00 | 0.00 |
azure_ai/Cohere-embed-v3-english | azure_ai/Cohere-embed-v3-english | azure_ai | standard | 0.10 | 0.00 |
azure_ai/Cohere-embed-v3-multilingual | azure_ai/Cohere-embed-v3-multilingual | azure_ai | standard | 0.10 | 0.00 |
azure_ai/embed-v-4-0 | azure_ai/embed-v-4-0 | azure_ai | standard | 0.12 | 0.00 |
babbage-002 | babbage-002 | text-completion-openai | standard | 0.40 | 0.40 |
davinci-002 | davinci-002 | text-completion-openai | standard | 2.00 | 2.00 |
gpt-3.5-turbo-instruct | gpt-3.5-turbo-instruct | text-completion-openai | standard | 1.50 | 2.00 |
gpt-3.5-turbo-instruct-0914 | gpt-3.5-turbo-instruct-0914 | text-completion-openai | standard | 1.50 | 2.00 |
claude-instant-1 | claude-instant-1 | anthropic | standard | 1.63 | 5.51 |
mistral/mistral-tiny | mistral/mistral-tiny | mistral | standard | 0.25 | 0.25 |
mistral/mistral-small | mistral/mistral-small | mistral | standard | 0.10 | 0.30 |
mistral/mistral-small-latest | mistral/mistral-small-latest | mistral | standard | 0.10 | 0.30 |
mistral/mistral-medium | mistral/mistral-medium | mistral | standard | 2.70 | 8.10 |
mistral/mistral-medium-latest | mistral/mistral-medium-latest | mistral | standard | 0.40 | 2.00 |
mistral/mistral-medium-2505 | mistral/mistral-medium-2505 | mistral | standard | 0.40 | 2.00 |
mistral/mistral-medium-2312 | mistral/mistral-medium-2312 | mistral | standard | 2.70 | 8.10 |
mistral/mistral-large-latest | mistral/mistral-large-latest | mistral | standard | 2.00 | 6.00 |
mistral/mistral-large-2411 | mistral/mistral-large-2411 | mistral | standard | 2.00 | 6.00 |
mistral/mistral-large-2402 | mistral/mistral-large-2402 | mistral | standard | 4.00 | 12.00 |
mistral/mistral-large-2407 | mistral/mistral-large-2407 | mistral | standard | 3.00 | 9.00 |
mistral/pixtral-large-latest | mistral/pixtral-large-latest | mistral | standard | 2.00 | 6.00 |
mistral/pixtral-large-2411 | mistral/pixtral-large-2411 | mistral | standard | 2.00 | 6.00 |
mistral/pixtral-12b-2409 | mistral/pixtral-12b-2409 | mistral | standard | 0.15 | 0.15 |
mistral/open-mistral-7b | mistral/open-mistral-7b | mistral | standard | 0.25 | 0.25 |
mistral/open-mixtral-8x7b | mistral/open-mixtral-8x7b | mistral | standard | 0.70 | 0.70 |
mistral/open-mixtral-8x22b | mistral/open-mixtral-8x22b | mistral | standard | 2.00 | 6.00 |
mistral/codestral-latest | mistral/codestral-latest | mistral | standard | 1.00 | 3.00 |
mistral/codestral-2405 | mistral/codestral-2405 | mistral | standard | 1.00 | 3.00 |
mistral/open-mistral-nemo | mistral/open-mistral-nemo | mistral | standard | 0.30 | 0.30 |
mistral/open-mistral-nemo-2407 | mistral/open-mistral-nemo-2407 | mistral | standard | 0.30 | 0.30 |
mistral/open-codestral-mamba | mistral/open-codestral-mamba | mistral | standard | 0.25 | 0.25 |
mistral/codestral-mamba-latest | mistral/codestral-mamba-latest | mistral | standard | 0.25 | 0.25 |
mistral/devstral-small-2505 | mistral/devstral-small-2505 | mistral | standard | 0.10 | 0.30 |
mistral/mistral-embed | mistral/mistral-embed | mistral | standard | 0.10 | - |
deepseek/deepseek-reasoner | deepseek/deepseek-reasoner | deepseek | standard | 0.55 | 2.19 |
deepseek/deepseek-chat | deepseek/deepseek-chat | deepseek | standard | 0.27 | 1.10 |
codestral/codestral-latest | codestral/codestral-latest | codestral | standard | 0.00 | 0.00 |
codestral/codestral-2405 | codestral/codestral-2405 | codestral | standard | 0.00 | 0.00 |
text-completion-codestral/codestral-latest | text-completion-codestral/codestral-latest | text-completion-codestral | standard | 0.00 | 0.00 |
text-completion-codestral/codestral-2405 | text-completion-codestral/codestral-2405 | text-completion-codestral | standard | 0.00 | 0.00 |
xai/grok-beta | xai/grok-beta | xai | standard | 5.00 | 15.00 |
xai/grok-2-vision-1212 | xai/grok-2-vision-1212 | xai | standard | 2.00 | 10.00 |
xai/grok-2-vision-latest | xai/grok-2-vision-latest | xai | standard | 2.00 | 10.00 |
xai/grok-2-vision | xai/grok-2-vision | xai | standard | 2.00 | 10.00 |
xai/grok-3 | xai/grok-3 | xai | standard | 3.00 | 15.00 |
xai/grok-3-beta | xai/grok-3-beta | xai | standard | 3.00 | 15.00 |
xai/grok-3-fast-beta | xai/grok-3-fast-beta | xai | standard | 5.00 | 25.00 |
xai/grok-3-fast-latest | xai/grok-3-fast-latest | xai | standard | 5.00 | 25.00 |
xai/grok-3-mini-beta | xai/grok-3-mini-beta | xai | standard | 0.30 | 0.50 |
xai/grok-3-mini-fast-beta | xai/grok-3-mini-fast-beta | xai | standard | 0.60 | 4.00 |
xai/grok-3-mini-fast-latest | xai/grok-3-mini-fast-latest | xai | standard | 0.60 | 4.00 |
xai/grok-vision-beta | xai/grok-vision-beta | xai | standard | 5.00 | 15.00 |
xai/grok-2-1212 | xai/grok-2-1212 | xai | standard | 2.00 | 10.00 |
xai/grok-2 | xai/grok-2 | xai | standard | 2.00 | 10.00 |
xai/grok-2-latest | xai/grok-2-latest | xai | standard | 2.00 | 10.00 |
deepseek/deepseek-coder | deepseek/deepseek-coder | deepseek | standard | 0.14 | 0.28 |
groq/deepseek-r1-distill-llama-70b | groq/deepseek-r1-distill-llama-70b | groq | standard | 0.75 | 0.99 |
groq/llama-3.3-70b-versatile | groq/llama-3.3-70b-versatile | groq | standard | 0.59 | 0.79 |
groq/llama-3.3-70b-specdec | groq/llama-3.3-70b-specdec | groq | standard | 0.59 | 0.99 |
groq/llama-guard-3-8b | groq/llama-guard-3-8b | groq | standard | 0.20 | 0.20 |
groq/llama2-70b-4096 | groq/llama2-70b-4096 | groq | standard | 0.70 | 0.80 |
groq/llama3-8b-8192 | groq/llama3-8b-8192 | groq | standard | 0.05 | 0.08 |
groq/llama-3.2-1b-preview | groq/llama-3.2-1b-preview | groq | standard | 0.04 | 0.04 |
groq/llama-3.2-3b-preview | groq/llama-3.2-3b-preview | groq | standard | 0.06 | 0.06 |
groq/llama-3.2-11b-text-preview | groq/llama-3.2-11b-text-preview | groq | standard | 0.18 | 0.18 |
groq/llama-3.2-11b-vision-preview | groq/llama-3.2-11b-vision-preview | groq | standard | 0.18 | 0.18 |
groq/llama-3.2-90b-text-preview | groq/llama-3.2-90b-text-preview | groq | standard | 0.90 | 0.90 |
groq/llama-3.2-90b-vision-preview | groq/llama-3.2-90b-vision-preview | groq | standard | 0.90 | 0.90 |
groq/llama3-70b-8192 | groq/llama3-70b-8192 | groq | standard | 0.59 | 0.79 |
groq/llama-3.1-8b-instant | groq/llama-3.1-8b-instant | groq | standard | 0.05 | 0.08 |
groq/llama-3.1-70b-versatile | groq/llama-3.1-70b-versatile | groq | standard | 0.59 | 0.79 |
groq/llama-3.1-405b-reasoning | groq/llama-3.1-405b-reasoning | groq | standard | 0.59 | 0.79 |
groq/meta-llama/llama-4-scout-17b-16e-instruct | groq/meta-llama/llama-4-scout-17b-16e-instruct | groq | standard | 0.11 | 0.34 |
groq/meta-llama/llama-4-maverick-17b-128e-instruct | groq/meta-llama/llama-4-maverick-17b-128e-instruct | groq | standard | 0.20 | 0.60 |
groq/mistral-saba-24b | groq/mistral-saba-24b | groq | standard | 0.79 | 0.79 |
groq/mixtral-8x7b-32768 | groq/mixtral-8x7b-32768 | groq | standard | 0.24 | 0.24 |
groq/gemma-7b-it | groq/gemma-7b-it | groq | standard | 0.07 | 0.07 |
groq/gemma2-9b-it | groq/gemma2-9b-it | groq | standard | 0.20 | 0.20 |
groq/llama3-groq-70b-8192-tool-use-preview | groq/llama3-groq-70b-8192-tool-use-preview | groq | standard | 0.89 | 0.89 |
groq/llama3-groq-8b-8192-tool-use-preview | groq/llama3-groq-8b-8192-tool-use-preview | groq | standard | 0.19 | 0.19 |
groq/qwen-qwq-32b | groq/qwen-qwq-32b | groq | standard | 0.29 | 0.39 |
cerebras/llama3.1-8b | cerebras/llama3.1-8b | cerebras | standard | 0.10 | 0.10 |
cerebras/llama3.1-70b | cerebras/llama3.1-70b | cerebras | standard | 0.60 | 0.60 |
cerebras/llama-3.3-70b | cerebras/llama-3.3-70b | cerebras | standard | 0.85 | 1.20 |
cerebras/qwen-3-32b | cerebras/qwen-3-32b | cerebras | standard | 0.40 | 0.80 |
friendliai/meta-llama-3.1-8b-instruct | friendliai/meta-llama-3.1-8b-instruct | friendliai | standard | 0.10 | 0.10 |
friendliai/meta-llama-3.1-70b-instruct | friendliai/meta-llama-3.1-70b-instruct | friendliai | standard | 0.60 | 0.60 |
claude-instant-1.2 | claude-instant-1.2 | anthropic | standard | 0.16 | 0.55 |
claude-2 | claude-2 | anthropic | standard | 8.00 | 24.00 |
claude-2.1 | claude-2.1 | anthropic | standard | 8.00 | 24.00 |
claude-3-haiku-20240307 | claude-3-haiku-20240307 | anthropic | standard | 0.25 | 1.25 |
claude-3-5-haiku-20241022 | claude-3-5-haiku-20241022 | anthropic | standard | 0.80 | 4.00 |
claude-3-5-haiku-latest | claude-3-5-haiku-latest | anthropic | standard | 1.00 | 5.00 |
claude-3-opus-latest | claude-3-opus-latest | anthropic | standard | 15.00 | 75.00 |
claude-3-opus-20240229 | claude-3-opus-20240229 | anthropic | standard | 15.00 | 75.00 |
claude-3-sonnet-20240229 | claude-3-sonnet-20240229 | anthropic | standard | 3.00 | 15.00 |
claude-3-5-sonnet-latest | claude-3-5-sonnet-latest | anthropic | standard | 3.00 | 15.00 |
claude-3-5-sonnet-20240620 | claude-3-5-sonnet-20240620 | anthropic | standard | 3.00 | 15.00 |
claude-opus-4-20250514 | claude-opus-4-20250514 | anthropic | standard | 15.00 | 75.00 |
claude-sonnet-4-20250514 | claude-sonnet-4-20250514 | anthropic | standard | 3.00 | 15.00 |
claude-4-opus-20250514 | claude-4-opus-20250514 | anthropic | standard | 15.00 | 75.00 |
claude-4-sonnet-20250514 | claude-4-sonnet-20250514 | anthropic | standard | 3.00 | 15.00 |
claude-3-7-sonnet-latest | claude-3-7-sonnet-latest | anthropic | standard | 3.00 | 15.00 |
claude-3-7-sonnet-20250219 | claude-3-7-sonnet-20250219 | anthropic | standard | 3.00 | 15.00 |
claude-3-5-sonnet-20241022 | claude-3-5-sonnet-20241022 | anthropic | standard | 3.00 | 15.00 |
text-bison32k | text-bison32k | vertex_ai-text-models | standard | 0.13 | 0.13 |
text-bison32k@002 | text-bison32k@002 | vertex_ai-text-models | standard | 0.13 | 0.13 |
text-unicorn | text-unicorn | vertex_ai-text-models | standard | 10.00 | 28.00 |
text-unicorn@001 | text-unicorn@001 | vertex_ai-text-models | standard | 10.00 | 28.00 |
chat-bison | chat-bison | vertex_ai-chat-models | standard | 0.13 | 0.13 |
chat-bison@001 | chat-bison@001 | vertex_ai-chat-models | standard | 0.13 | 0.13 |
chat-bison@002 | chat-bison@002 | vertex_ai-chat-models | standard | 0.13 | 0.13 |
chat-bison-32k | chat-bison-32k | vertex_ai-chat-models | standard | 0.13 | 0.13 |
chat-bison-32k@002 | chat-bison-32k@002 | vertex_ai-chat-models | standard | 0.13 | 0.13 |
code-bison | code-bison | vertex_ai-code-text-models | standard | 0.13 | 0.13 |
code-bison@001 | code-bison@001 | vertex_ai-code-text-models | standard | 0.13 | 0.13 |
code-bison@002 | code-bison@002 | vertex_ai-code-text-models | standard | 0.13 | 0.13 |
code-bison32k | code-bison32k | vertex_ai-code-text-models | standard | 0.13 | 0.13 |
code-bison-32k@002 | code-bison-32k@002 | vertex_ai-code-text-models | standard | 0.13 | 0.13 |
code-gecko@001 | code-gecko@001 | vertex_ai-code-text-models | standard | 0.13 | 0.13 |
code-gecko@002 | code-gecko@002 | vertex_ai-code-text-models | standard | 0.13 | 0.13 |
code-gecko | code-gecko | vertex_ai-code-text-models | standard | 0.13 | 0.13 |
code-gecko-latest | code-gecko-latest | vertex_ai-code-text-models | standard | 0.13 | 0.13 |
codechat-bison@latest | codechat-bison@latest | vertex_ai-code-chat-models | standard | 0.13 | 0.13 |
codechat-bison | codechat-bison | vertex_ai-code-chat-models | standard | 0.13 | 0.13 |
codechat-bison@001 | codechat-bison@001 | vertex_ai-code-chat-models | standard | 0.13 | 0.13 |
codechat-bison@002 | codechat-bison@002 | vertex_ai-code-chat-models | standard | 0.13 | 0.13 |
codechat-bison-32k | codechat-bison-32k | vertex_ai-code-chat-models | standard | 0.13 | 0.13 |
codechat-bison-32k@002 | codechat-bison-32k@002 | vertex_ai-code-chat-models | standard | 0.13 | 0.13 |
gemini-pro | gemini-pro | vertex_ai-language-models | standard | 0.50 | 1.50 |
gemini-1.0-pro | gemini-1.0-pro | vertex_ai-language-models | standard | 0.50 | 1.50 |
gemini-1.0-pro-001 | gemini-1.0-pro-001 | vertex_ai-language-models | standard | 0.50 | 1.50 |
gemini-1.0-ultra | gemini-1.0-ultra | vertex_ai-language-models | standard | 0.50 | 1.50 |
gemini-1.0-ultra-001 | gemini-1.0-ultra-001 | vertex_ai-language-models | standard | 0.50 | 1.50 |
gemini-1.0-pro-002 | gemini-1.0-pro-002 | vertex_ai-language-models | standard | 0.50 | 1.50 |
gemini-1.5-pro | gemini-1.5-pro | vertex_ai-language-models | standard | 1.25 | 5.00 |
gemini-1.5-pro-002 | gemini-1.5-pro-002 | vertex_ai-language-models | standard | 1.25 | 5.00 |
gemini-1.5-pro-001 | gemini-1.5-pro-001 | vertex_ai-language-models | standard | 1.25 | 5.00 |
gemini-1.5-pro-preview-0514 | gemini-1.5-pro-preview-0514 | vertex_ai-language-models | standard | 0.08 | 0.31 |
gemini-1.5-pro-preview-0215 | gemini-1.5-pro-preview-0215 | vertex_ai-language-models | standard | 0.08 | 0.31 |
gemini-1.5-pro-preview-0409 | gemini-1.5-pro-preview-0409 | vertex_ai-language-models | standard | 0.08 | 0.31 |
gemini-1.5-flash | gemini-1.5-flash | vertex_ai-language-models | standard | 0.08 | 0.30 |
gemini-1.5-flash-exp-0827 | gemini-1.5-flash-exp-0827 | vertex_ai-language-models | standard | 0.00 | 0.00 |
gemini-1.5-flash-002 | gemini-1.5-flash-002 | vertex_ai-language-models | standard | 0.08 | 0.30 |
gemini-1.5-flash-001 | gemini-1.5-flash-001 | vertex_ai-language-models | standard | 0.08 | 0.30 |
gemini-1.5-flash-preview-0514 | gemini-1.5-flash-preview-0514 | vertex_ai-language-models | standard | 0.08 | 0.00 |
gemini-pro-experimental | gemini-pro-experimental | vertex_ai-language-models | standard | 0.00 | 0.00 |
gemini-flash-experimental | gemini-flash-experimental | vertex_ai-language-models | standard | 0.00 | 0.00 |
gemini-pro-vision | gemini-pro-vision | vertex_ai-vision-models | standard | 0.50 | 1.50 |
gemini-1.0-pro-vision | gemini-1.0-pro-vision | vertex_ai-vision-models | standard | 0.50 | 1.50 |
gemini-1.0-pro-vision-001 | gemini-1.0-pro-vision-001 | vertex_ai-vision-models | standard | 0.50 | 1.50 |
gemini-2.5-pro-exp-03-25 | gemini-2.5-pro-exp-03-25 | vertex_ai-language-models | standard | 1.25 | 10.00 |
gemini-2.0-pro-exp-02-05 | gemini-2.0-pro-exp-02-05 | vertex_ai-language-models | standard | 1.25 | 10.00 |
gemini-2.0-flash-exp | gemini-2.0-flash-exp | vertex_ai-language-models | standard | 0.15 | 0.60 |
gemini-2.0-flash-001 | gemini-2.0-flash-001 | vertex_ai-language-models | standard | 0.15 | 0.60 |
gemini-2.0-flash-thinking-exp | gemini-2.0-flash-thinking-exp | vertex_ai-language-models | standard | 0.00 | 0.00 |
gemini-2.0-flash-thinking-exp-01-21 | gemini-2.0-flash-thinking-exp-01-21 | vertex_ai-language-models | standard | 0.00 | 0.00 |
gemini/gemini-2.5-pro-exp-03-25 | gemini/gemini-2.5-pro-exp-03-25 | gemini | standard | 0.00 | 0.00 |
gemini/gemini-2.5-flash-preview-tts | gemini/gemini-2.5-flash-preview-tts | gemini | standard | 0.15 | 0.60 |
gemini/gemini-2.5-flash-preview-05-20 | gemini/gemini-2.5-flash-preview-05-20 | gemini | standard | 0.15 | 0.60 |
gemini/gemini-2.5-flash-preview-04-17 | gemini/gemini-2.5-flash-preview-04-17 | gemini | standard | 0.15 | 0.60 |
gemini-2.5-flash-preview-05-20 | gemini-2.5-flash-preview-05-20 | vertex_ai-language-models | standard | 0.15 | 0.60 |
gemini-2.5-flash-preview-04-17 | gemini-2.5-flash-preview-04-17 | vertex_ai-language-models | standard | 0.15 | 0.60 |
gemini-2.0-flash | gemini-2.0-flash | vertex_ai-language-models | standard | 0.10 | 0.40 |
gemini-2.0-flash-lite | gemini-2.0-flash-lite | vertex_ai-language-models | standard | 0.08 | 0.30 |
gemini-2.0-flash-lite-001 | gemini-2.0-flash-lite-001 | vertex_ai-language-models | standard | 0.08 | 0.30 |
gemini-2.5-pro-preview-06-05 | gemini-2.5-pro-preview-06-05 | vertex_ai-language-models | standard | 1.25 | 10.00 |
gemini-2.5-pro-preview-05-06 | gemini-2.5-pro-preview-05-06 | vertex_ai-language-models | standard | 1.25 | 10.00 |
gemini-2.5-pro-preview-03-25 | gemini-2.5-pro-preview-03-25 | vertex_ai-language-models | standard | 1.25 | 10.00 |
gemini-2.0-flash-preview-image-generation | gemini-2.0-flash-preview-image-generation | vertex_ai-language-models | standard | 0.10 | 0.40 |
gemini-2.5-pro-preview-tts | gemini-2.5-pro-preview-tts | vertex_ai-language-models | standard | 1.25 | 10.00 |
gemini/gemini-2.0-pro-exp-02-05 | gemini/gemini-2.0-pro-exp-02-05 | gemini | standard | 0.00 | 0.00 |
gemini/gemini-2.0-flash-preview-image-generation | gemini/gemini-2.0-flash-preview-image-generation | gemini | standard | 0.10 | 0.40 |
gemini/gemini-2.0-flash | gemini/gemini-2.0-flash | gemini | standard | 0.10 | 0.40 |
gemini/gemini-2.0-flash-lite | gemini/gemini-2.0-flash-lite | gemini | standard | 0.08 | 0.30 |
gemini/gemini-2.0-flash-001 | gemini/gemini-2.0-flash-001 | gemini | standard | 0.10 | 0.40 |
gemini/gemini-2.5-pro-preview-tts | gemini/gemini-2.5-pro-preview-tts | gemini | standard | 1.25 | 10.00 |
gemini/gemini-2.5-pro-preview-06-05 | gemini/gemini-2.5-pro-preview-06-05 | gemini | standard | 1.25 | 10.00 |
gemini/gemini-2.5-pro-preview-05-06 | gemini/gemini-2.5-pro-preview-05-06 | gemini | standard | 1.25 | 10.00 |
gemini/gemini-2.5-pro-preview-03-25 | gemini/gemini-2.5-pro-preview-03-25 | gemini | standard | 1.25 | 10.00 |
gemini/gemini-2.0-flash-exp | gemini/gemini-2.0-flash-exp | gemini | standard | 0.00 | 0.00 |
gemini/gemini-2.0-flash-lite-preview-02-05 | gemini/gemini-2.0-flash-lite-preview-02-05 | gemini | standard | 0.08 | 0.30 |
gemini/gemini-2.0-flash-thinking-exp | gemini/gemini-2.0-flash-thinking-exp | gemini | standard | 0.00 | 0.00 |
gemini/gemini-2.0-flash-thinking-exp-01-21 | gemini/gemini-2.0-flash-thinking-exp-01-21 | gemini | standard | 0.00 | 0.00 |
gemini/gemma-3-27b-it | gemini/gemma-3-27b-it | gemini | standard | 0.00 | 0.00 |
gemini/learnlm-1.5-pro-experimental | gemini/learnlm-1.5-pro-experimental | gemini | standard | 0.00 | 0.00 |
vertex_ai/claude-3-sonnet | vertex_ai/claude-3-sonnet | vertex_ai-anthropic_models | standard | 3.00 | 15.00 |
vertex_ai/claude-3-sonnet@20240229 | vertex_ai/claude-3-sonnet@20240229 | vertex_ai-anthropic_models | standard | 3.00 | 15.00 |
vertex_ai/claude-3-5-sonnet | vertex_ai/claude-3-5-sonnet | vertex_ai-anthropic_models | standard | 3.00 | 15.00 |
vertex_ai/claude-3-5-sonnet@20240620 | vertex_ai/claude-3-5-sonnet@20240620 | vertex_ai-anthropic_models | standard | 3.00 | 15.00 |
vertex_ai/claude-3-5-sonnet-v2 | vertex_ai/claude-3-5-sonnet-v2 | vertex_ai-anthropic_models | standard | 3.00 | 15.00 |
vertex_ai/claude-3-5-sonnet-v2@20241022 | vertex_ai/claude-3-5-sonnet-v2@20241022 | vertex_ai-anthropic_models | standard | 3.00 | 15.00 |
vertex_ai/claude-3-7-sonnet@20250219 | vertex_ai/claude-3-7-sonnet@20250219 | vertex_ai-anthropic_models | standard | 3.00 | 15.00 |
vertex_ai/claude-opus-4@20250514 | vertex_ai/claude-opus-4@20250514 | vertex_ai-anthropic_models | standard | 15.00 | 75.00 |
vertex_ai/claude-sonnet-4@20250514 | vertex_ai/claude-sonnet-4@20250514 | vertex_ai-anthropic_models | standard | 3.00 | 15.00 |
vertex_ai/claude-3-haiku | vertex_ai/claude-3-haiku | vertex_ai-anthropic_models | standard | 0.25 | 1.25 |
vertex_ai/claude-3-haiku@20240307 | vertex_ai/claude-3-haiku@20240307 | vertex_ai-anthropic_models | standard | 0.25 | 1.25 |
vertex_ai/claude-3-5-haiku | vertex_ai/claude-3-5-haiku | vertex_ai-anthropic_models | standard | 1.00 | 5.00 |
vertex_ai/claude-3-5-haiku@20241022 | vertex_ai/claude-3-5-haiku@20241022 | vertex_ai-anthropic_models | standard | 1.00 | 5.00 |
vertex_ai/claude-3-opus | vertex_ai/claude-3-opus | vertex_ai-anthropic_models | standard | 15.00 | 75.00 |
vertex_ai/claude-3-opus@20240229 | vertex_ai/claude-3-opus@20240229 | vertex_ai-anthropic_models | standard | 15.00 | 75.00 |
vertex_ai/meta/llama3-405b-instruct-maas | vertex_ai/meta/llama3-405b-instruct-maas | vertex_ai-llama_models | standard | 0.00 | 0.00 |
vertex_ai/meta/llama-4-scout-17b-16e-instruct-maas | vertex_ai/meta/llama-4-scout-17b-16e-instruct-maas | vertex_ai-llama_models | standard | 0.25 | 0.70 |
vertex_ai/meta/llama-4-scout-17b-128e-instruct-maas | vertex_ai/meta/llama-4-scout-17b-128e-instruct-maas | vertex_ai-llama_models | standard | 0.25 | 0.70 |
vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas | vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas | vertex_ai-llama_models | standard | 0.35 | 1.15 |
vertex_ai/meta/llama-4-maverick-17b-16e-instruct-maas | vertex_ai/meta/llama-4-maverick-17b-16e-instruct-maas | vertex_ai-llama_models | standard | 0.35 | 1.15 |
vertex_ai/meta/llama3-70b-instruct-maas | vertex_ai/meta/llama3-70b-instruct-maas | vertex_ai-llama_models | standard | 0.00 | 0.00 |
vertex_ai/meta/llama3-8b-instruct-maas | vertex_ai/meta/llama3-8b-instruct-maas | vertex_ai-llama_models | standard | 0.00 | 0.00 |
vertex_ai/meta/llama-3.2-90b-vision-instruct-maas | vertex_ai/meta/llama-3.2-90b-vision-instruct-maas | vertex_ai-llama_models | standard | 0.00 | 0.00 |
vertex_ai/mistral-large@latest | vertex_ai/mistral-large@latest | vertex_ai-mistral_models | standard | 2.00 | 6.00 |
vertex_ai/mistral-large@2411-001 | vertex_ai/mistral-large@2411-001 | vertex_ai-mistral_models | standard | 2.00 | 6.00 |
vertex_ai/mistral-large-2411 | vertex_ai/mistral-large-2411 | vertex_ai-mistral_models | standard | 2.00 | 6.00 |
vertex_ai/mistral-large@2407 | vertex_ai/mistral-large@2407 | vertex_ai-mistral_models | standard | 2.00 | 6.00 |
vertex_ai/mistral-nemo@latest | vertex_ai/mistral-nemo@latest | vertex_ai-mistral_models | standard | 0.15 | 0.15 |
vertex_ai/mistral-small-2503@001 | vertex_ai/mistral-small-2503@001 | vertex_ai-mistral_models | standard | 1.00 | 3.00 |
vertex_ai/mistral-small-2503 | vertex_ai/mistral-small-2503 | vertex_ai-mistral_models | standard | 1.00 | 3.00 |
vertex_ai/jamba-1.5-mini@001 | vertex_ai/jamba-1.5-mini@001 | vertex_ai-ai21_models | standard | 0.20 | 0.40 |
vertex_ai/jamba-1.5-large@001 | vertex_ai/jamba-1.5-large@001 | vertex_ai-ai21_models | standard | 2.00 | 8.00 |
vertex_ai/jamba-1.5 | vertex_ai/jamba-1.5 | vertex_ai-ai21_models | standard | 0.20 | 0.40 |
vertex_ai/jamba-1.5-mini | vertex_ai/jamba-1.5-mini | vertex_ai-ai21_models | standard | 0.20 | 0.40 |
vertex_ai/jamba-1.5-large | vertex_ai/jamba-1.5-large | vertex_ai-ai21_models | standard | 2.00 | 8.00 |
vertex_ai/mistral-nemo@2407 | vertex_ai/mistral-nemo@2407 | vertex_ai-mistral_models | standard | 3.00 | 3.00 |
vertex_ai/codestral@latest | vertex_ai/codestral@latest | vertex_ai-mistral_models | standard | 0.20 | 0.60 |
vertex_ai/codestral@2405 | vertex_ai/codestral@2405 | vertex_ai-mistral_models | standard | 0.20 | 0.60 |
vertex_ai/codestral-2501 | vertex_ai/codestral-2501 | vertex_ai-mistral_models | standard | 0.20 | 0.60 |
text-embedding-004 | text-embedding-004 | vertex_ai-embedding-models | standard | 0.10 | 0.00 |
gemini-embedding-001 | gemini-embedding-001 | vertex_ai-embedding-models | standard | 0.15 | 0.00 |
text-embedding-005 | text-embedding-005 | vertex_ai-embedding-models | standard | 0.10 | 0.00 |
text-multilingual-embedding-002 | text-multilingual-embedding-002 | vertex_ai-embedding-models | standard | 0.10 | 0.00 |
multimodalembedding | multimodalembedding | vertex_ai-embedding-models | standard | 0.80 | 0.00 |
multimodalembedding@001 | multimodalembedding@001 | vertex_ai-embedding-models | standard | 0.80 | 0.00 |
text-embedding-large-exp-03-07 | text-embedding-large-exp-03-07 | vertex_ai-embedding-models | standard | 0.10 | 0.00 |
textembedding-gecko | textembedding-gecko | vertex_ai-embedding-models | standard | 0.10 | 0.00 |
textembedding-gecko-multilingual | textembedding-gecko-multilingual | vertex_ai-embedding-models | standard | 0.10 | 0.00 |
textembedding-gecko-multilingual@001 | textembedding-gecko-multilingual@001 | vertex_ai-embedding-models | standard | 0.10 | 0.00 |
textembedding-gecko@001 | textembedding-gecko@001 | vertex_ai-embedding-models | standard | 0.10 | 0.00 |
textembedding-gecko@003 | textembedding-gecko@003 | vertex_ai-embedding-models | standard | 0.10 | 0.00 |
text-embedding-preview-0409 | text-embedding-preview-0409 | vertex_ai-embedding-models | standard | 0.01 | 0.00 |
text-multilingual-embedding-preview-0409 | text-multilingual-embedding-preview-0409 | vertex_ai-embedding-models | standard | 0.01 | 0.00 |
palm/chat-bison | palm/chat-bison | palm | standard | 0.13 | 0.13 |
palm/chat-bison-001 | palm/chat-bison-001 | palm | standard | 0.13 | 0.13 |
palm/text-bison | palm/text-bison | palm | standard | 0.13 | 0.13 |
palm/text-bison-001 | palm/text-bison-001 | palm | standard | 0.13 | 0.13 |
palm/text-bison-safety-off | palm/text-bison-safety-off | palm | standard | 0.13 | 0.13 |
palm/text-bison-safety-recitation-off | palm/text-bison-safety-recitation-off | palm | standard | 0.13 | 0.13 |
gemini/gemini-1.5-flash-002 | gemini/gemini-1.5-flash-002 | gemini | standard | 0.08 | 0.30 |
gemini/gemini-1.5-flash-001 | gemini/gemini-1.5-flash-001 | gemini | standard | 0.08 | 0.30 |
gemini/gemini-1.5-flash | gemini/gemini-1.5-flash | gemini | standard | 0.08 | 0.30 |
gemini/gemini-1.5-flash-latest | gemini/gemini-1.5-flash-latest | gemini | standard | 0.08 | 0.30 |
gemini/gemini-1.5-flash-8b | gemini/gemini-1.5-flash-8b | gemini | standard | 0.00 | 0.00 |
gemini/gemini-1.5-flash-8b-exp-0924 | gemini/gemini-1.5-flash-8b-exp-0924 | gemini | standard | 0.00 | 0.00 |
gemini/gemini-exp-1114 | gemini/gemini-exp-1114 | gemini | standard | 0.00 | 0.00 |
gemini/gemini-exp-1206 | gemini/gemini-exp-1206 | gemini | standard | 0.00 | 0.00 |
gemini/gemini-1.5-flash-exp-0827 | gemini/gemini-1.5-flash-exp-0827 | gemini | standard | 0.00 | 0.00 |
gemini/gemini-1.5-flash-8b-exp-0827 | gemini/gemini-1.5-flash-8b-exp-0827 | gemini | standard | 0.00 | 0.00 |
gemini/gemini-pro | gemini/gemini-pro | gemini | standard | 0.35 | 1.05 |
gemini/gemini-1.5-pro | gemini/gemini-1.5-pro | gemini | standard | 3.50 | 10.50 |
gemini/gemini-1.5-pro-002 | gemini/gemini-1.5-pro-002 | gemini | standard | 3.50 | 10.50 |
gemini/gemini-1.5-pro-001 | gemini/gemini-1.5-pro-001 | gemini | standard | 3.50 | 10.50 |
gemini/gemini-1.5-pro-exp-0801 | gemini/gemini-1.5-pro-exp-0801 | gemini | standard | 3.50 | 10.50 |
gemini/gemini-1.5-pro-exp-0827 | gemini/gemini-1.5-pro-exp-0827 | gemini | standard | 0.00 | 0.00 |
gemini/gemini-1.5-pro-latest | gemini/gemini-1.5-pro-latest | gemini | standard | 3.50 | 1.05 |
gemini/gemini-pro-vision | gemini/gemini-pro-vision | gemini | standard | 0.35 | 1.05 |
gemini/gemini-gemma-2-27b-it | gemini/gemini-gemma-2-27b-it | gemini | standard | 0.35 | 1.05 |
gemini/gemini-gemma-2-9b-it | gemini/gemini-gemma-2-9b-it | gemini | standard | 0.35 | 1.05 |
command-a-03-2025 | command-a-03-2025 | cohere_chat | standard | 2.50 | 10.00 |
command-r | command-r | cohere_chat | standard | 0.15 | 0.60 |
command-r-08-2024 | command-r-08-2024 | cohere_chat | standard | 0.15 | 0.60 |
command-r7b-12-2024 | command-r7b-12-2024 | cohere_chat | standard | 0.15 | 0.04 |
command-light | command-light | cohere_chat | standard | 0.30 | 0.60 |
command-r-plus | command-r-plus | cohere_chat | standard | 2.50 | 10.00 |
command-r-plus-08-2024 | command-r-plus-08-2024 | cohere_chat | standard | 2.50 | 10.00 |
command-nightly | command-nightly | cohere | standard | 1.00 | 2.00 |
command | command | cohere | standard | 1.00 | 2.00 |
rerank-v3.5 | rerank-v3.5 | cohere | standard | 0.00 | 0.00 |
rerank-english-v3.0 | rerank-english-v3.0 | cohere | standard | 0.00 | 0.00 |
rerank-multilingual-v3.0 | rerank-multilingual-v3.0 | cohere | standard | 0.00 | 0.00 |
rerank-english-v2.0 | rerank-english-v2.0 | cohere | standard | 0.00 | 0.00 |
rerank-multilingual-v2.0 | rerank-multilingual-v2.0 | cohere | standard | 0.00 | 0.00 |
embed-english-light-v3.0 | embed-english-light-v3.0 | cohere | standard | 0.10 | 0.00 |
embed-multilingual-v3.0 | embed-multilingual-v3.0 | cohere | standard | 0.10 | 0.00 |
embed-english-v2.0 | embed-english-v2.0 | cohere | standard | 0.10 | 0.00 |
embed-english-light-v2.0 | embed-english-light-v2.0 | cohere | standard | 0.10 | 0.00 |
embed-multilingual-v2.0 | embed-multilingual-v2.0 | cohere | standard | 0.10 | 0.00 |
embed-english-v3.0 | embed-english-v3.0 | cohere | standard | 0.10 | 0.00 |
replicate/meta/llama-2-13b | replicate/meta/llama-2-13b | replicate | standard | 0.10 | 0.50 |
replicate/meta/llama-2-13b-chat | replicate/meta/llama-2-13b-chat | replicate | standard | 0.10 | 0.50 |
replicate/meta/llama-2-70b | replicate/meta/llama-2-70b | replicate | standard | 0.65 | 2.75 |
replicate/meta/llama-2-70b-chat | replicate/meta/llama-2-70b-chat | replicate | standard | 0.65 | 2.75 |
replicate/meta/llama-2-7b | replicate/meta/llama-2-7b | replicate | standard | 0.05 | 0.25 |
replicate/meta/llama-2-7b-chat | replicate/meta/llama-2-7b-chat | replicate | standard | 0.05 | 0.25 |
replicate/meta/llama-3-70b | replicate/meta/llama-3-70b | replicate | standard | 0.65 | 2.75 |
replicate/meta/llama-3-70b-instruct | replicate/meta/llama-3-70b-instruct | replicate | standard | 0.65 | 2.75 |
replicate/meta/llama-3-8b | replicate/meta/llama-3-8b | replicate | standard | 0.05 | 0.25 |
replicate/meta/llama-3-8b-instruct | replicate/meta/llama-3-8b-instruct | replicate | standard | 0.05 | 0.25 |
replicate/mistralai/mistral-7b-v0.1 | replicate/mistralai/mistral-7b-v0.1 | replicate | standard | 0.05 | 0.25 |
replicate/mistralai/mistral-7b-instruct-v0.2 | replicate/mistralai/mistral-7b-instruct-v0.2 | replicate | standard | 0.05 | 0.25 |
replicate/mistralai/mixtral-8x7b-instruct-v0.1 | replicate/mistralai/mixtral-8x7b-instruct-v0.1 | replicate | standard | 0.30 | 1.00 |
openrouter/deepseek/deepseek-r1 | openrouter/deepseek/deepseek-r1 | openrouter | standard | 0.55 | 2.19 |
openrouter/deepseek/deepseek-chat | openrouter/deepseek/deepseek-chat | openrouter | standard | 0.14 | 0.28 |
openrouter/deepseek/deepseek-coder | openrouter/deepseek/deepseek-coder | openrouter | standard | 0.14 | 0.28 |
openrouter/microsoft/wizardlm-2-8x22b:nitro | openrouter/microsoft/wizardlm-2-8x22b:nitro | openrouter | standard | 1.00 | 1.00 |
openrouter/google/gemini-pro-1.5 | openrouter/google/gemini-pro-1.5 | openrouter | standard | 2.50 | 7.50 |
openrouter/google/gemini-2.0-flash-001 | openrouter/google/gemini-2.0-flash-001 | openrouter | standard | 0.10 | 0.40 |
openrouter/mistralai/mixtral-8x22b-instruct | openrouter/mistralai/mixtral-8x22b-instruct | openrouter | standard | 0.65 | 0.65 |
openrouter/cohere/command-r-plus | openrouter/cohere/command-r-plus | openrouter | standard | 3.00 | 15.00 |
openrouter/databricks/dbrx-instruct | openrouter/databricks/dbrx-instruct | openrouter | standard | 0.60 | 0.60 |
openrouter/anthropic/claude-3-haiku | openrouter/anthropic/claude-3-haiku | openrouter | standard | 0.25 | 1.25 |
openrouter/anthropic/claude-3-5-haiku | openrouter/anthropic/claude-3-5-haiku | openrouter | standard | 1.00 | 5.00 |
openrouter/anthropic/claude-3-haiku-20240307 | openrouter/anthropic/claude-3-haiku-20240307 | openrouter | standard | 0.25 | 1.25 |
openrouter/anthropic/claude-3-5-haiku-20241022 | openrouter/anthropic/claude-3-5-haiku-20241022 | openrouter | standard | 1.00 | 5.00 |
openrouter/anthropic/claude-3.5-sonnet | openrouter/anthropic/claude-3.5-sonnet | openrouter | standard | 3.00 | 15.00 |
openrouter/anthropic/claude-3.5-sonnet:beta | openrouter/anthropic/claude-3.5-sonnet:beta | openrouter | standard | 3.00 | 15.00 |
openrouter/anthropic/claude-3.7-sonnet | openrouter/anthropic/claude-3.7-sonnet | openrouter | standard | 3.00 | 15.00 |
openrouter/anthropic/claude-3.7-sonnet:beta | openrouter/anthropic/claude-3.7-sonnet:beta | openrouter | standard | 3.00 | 15.00 |
openrouter/anthropic/claude-3-sonnet | openrouter/anthropic/claude-3-sonnet | openrouter | standard | 3.00 | 15.00 |
openrouter/mistralai/mistral-large | openrouter/mistralai/mistral-large | openrouter | standard | 8.00 | 24.00 |
mistralai/mistral-small-3.1-24b-instruct | mistralai/mistral-small-3.1-24b-instruct | openrouter | standard | 0.10 | 0.30 |
openrouter/cognitivecomputations/dolphin-mixtral-8x7b | openrouter/cognitivecomputations/dolphin-mixtral-8x7b | openrouter | standard | 0.50 | 0.50 |
openrouter/google/gemini-pro-vision | openrouter/google/gemini-pro-vision | openrouter | standard | 0.13 | 0.38 |
openrouter/fireworks/firellava-13b | openrouter/fireworks/firellava-13b | openrouter | standard | 0.20 | 0.20 |
openrouter/meta-llama/llama-3-8b-instruct:free | openrouter/meta-llama/llama-3-8b-instruct:free | openrouter | standard | 0.00 | 0.00 |
openrouter/meta-llama/llama-3-8b-instruct:extended | openrouter/meta-llama/llama-3-8b-instruct:extended | openrouter | standard | 0.23 | 2.25 |
openrouter/meta-llama/llama-3-70b-instruct:nitro | openrouter/meta-llama/llama-3-70b-instruct:nitro | openrouter | standard | 0.90 | 0.90 |
openrouter/meta-llama/llama-3-70b-instruct | openrouter/meta-llama/llama-3-70b-instruct | openrouter | standard | 0.59 | 0.79 |
openrouter/openai/o1 | openrouter/openai/o1 | openrouter | standard | 15.00 | 60.00 |
openrouter/openai/o1-mini | openrouter/openai/o1-mini | openrouter | standard | 3.00 | 12.00 |
openrouter/openai/o1-mini-2024-09-12 | openrouter/openai/o1-mini-2024-09-12 | openrouter | standard | 3.00 | 12.00 |
openrouter/openai/o1-preview | openrouter/openai/o1-preview | openrouter | standard | 15.00 | 60.00 |
openrouter/openai/o1-preview-2024-09-12 | openrouter/openai/o1-preview-2024-09-12 | openrouter | standard | 15.00 | 60.00 |
openrouter/openai/o3-mini | openrouter/openai/o3-mini | openrouter | standard | 1.10 | 4.40 |
openrouter/openai/o3-mini-high | openrouter/openai/o3-mini-high | openrouter | standard | 1.10 | 4.40 |
openrouter/openai/gpt-4o | openrouter/openai/gpt-4o | openrouter | standard | 2.50 | 10.00 |
openrouter/openai/gpt-4o-2024-05-13 | openrouter/openai/gpt-4o-2024-05-13 | openrouter | standard | 5.00 | 15.00 |
openrouter/openai/gpt-4-vision-preview | openrouter/openai/gpt-4-vision-preview | openrouter | standard | 10.00 | 30.00 |
openrouter/openai/gpt-3.5-turbo | openrouter/openai/gpt-3.5-turbo | openrouter | standard | 1.50 | 2.00 |
openrouter/openai/gpt-3.5-turbo-16k | openrouter/openai/gpt-3.5-turbo-16k | openrouter | standard | 3.00 | 4.00 |
openrouter/openai/gpt-4 | openrouter/openai/gpt-4 | openrouter | standard | 30.00 | 60.00 |
openrouter/anthropic/claude-instant-v1 | openrouter/anthropic/claude-instant-v1 | openrouter | standard | 1.63 | 5.51 |
openrouter/anthropic/claude-2 | openrouter/anthropic/claude-2 | openrouter | standard | 11.02 | 32.68 |
openrouter/anthropic/claude-3-opus | openrouter/anthropic/claude-3-opus | openrouter | standard | 15.00 | 75.00 |
openrouter/google/palm-2-chat-bison | openrouter/google/palm-2-chat-bison | openrouter | standard | 0.50 | 0.50 |
openrouter/google/palm-2-codechat-bison | openrouter/google/palm-2-codechat-bison | openrouter | standard | 0.50 | 0.50 |
openrouter/meta-llama/llama-2-13b-chat | openrouter/meta-llama/llama-2-13b-chat | openrouter | standard | 0.20 | 0.20 |
openrouter/meta-llama/llama-2-70b-chat | openrouter/meta-llama/llama-2-70b-chat | openrouter | standard | 1.50 | 1.50 |
openrouter/meta-llama/codellama-34b-instruct | openrouter/meta-llama/codellama-34b-instruct | openrouter | standard | 0.50 | 0.50 |
openrouter/nousresearch/nous-hermes-llama2-13b | openrouter/nousresearch/nous-hermes-llama2-13b | openrouter | standard | 0.20 | 0.20 |
openrouter/mancer/weaver | openrouter/mancer/weaver | openrouter | standard | 5.63 | 5.63 |
openrouter/gryphe/mythomax-l2-13b | openrouter/gryphe/mythomax-l2-13b | openrouter | standard | 1.88 | 1.88 |
openrouter/jondurbin/airoboros-l2-70b-2.1 | openrouter/jondurbin/airoboros-l2-70b-2.1 | openrouter | standard | 13.88 | 13.88 |
openrouter/undi95/remm-slerp-l2-13b | openrouter/undi95/remm-slerp-l2-13b | openrouter | standard | 1.88 | 1.88 |
openrouter/pygmalionai/mythalion-13b | openrouter/pygmalionai/mythalion-13b | openrouter | standard | 1.88 | 1.88 |
openrouter/mistralai/mistral-7b-instruct | openrouter/mistralai/mistral-7b-instruct | openrouter | standard | 0.13 | 0.13 |
openrouter/mistralai/mistral-7b-instruct:free | openrouter/mistralai/mistral-7b-instruct:free | openrouter | standard | 0.00 | 0.00 |
openrouter/qwen/qwen-2.5-coder-32b-instruct | openrouter/qwen/qwen-2.5-coder-32b-instruct | openrouter | standard | 0.18 | 0.18 |
j2-ultra | j2-ultra | ai21 | standard | 15.00 | 15.00 |
jamba-1.5-mini@001 | jamba-1.5-mini@001 | ai21 | standard | 0.20 | 0.40 |
jamba-1.5-large@001 | jamba-1.5-large@001 | ai21 | standard | 2.00 | 8.00 |
jamba-1.5 | jamba-1.5 | ai21 | standard | 0.20 | 0.40 |
jamba-1.5-mini | jamba-1.5-mini | ai21 | standard | 0.20 | 0.40 |
jamba-1.5-large | jamba-1.5-large | ai21 | standard | 2.00 | 8.00 |
jamba-large-1.6 | jamba-large-1.6 | ai21 | standard | 2.00 | 8.00 |
jamba-mini-1.6 | jamba-mini-1.6 | ai21 | standard | 0.20 | 0.40 |
j2-mid | j2-mid | ai21 | standard | 10.00 | 10.00 |
j2-light | j2-light | ai21 | standard | 3.00 | 3.00 |
dolphin | dolphin | nlp_cloud | standard | 0.50 | 0.50 |
chatdolphin | chatdolphin | nlp_cloud | standard | 0.50 | 0.50 |
luminous-base | luminous-base | aleph_alpha | standard | 30.00 | 33.00 |
luminous-base-control | luminous-base-control | aleph_alpha | standard | 37.50 | 41.25 |
luminous-extended | luminous-extended | aleph_alpha | standard | 45.00 | 49.50 |
luminous-extended-control | luminous-extended-control | aleph_alpha | standard | 56.25 | 61.88 |
luminous-supreme | luminous-supreme | aleph_alpha | standard | 175.00 | 192.50 |
luminous-supreme-control | luminous-supreme-control | aleph_alpha | standard | 218.75 | 240.63 |
ai21.j2-mid-v1 | ai21.j2-mid-v1 | bedrock | standard | 12.50 | 12.50 |
ai21.j2-ultra-v1 | ai21.j2-ultra-v1 | bedrock | standard | 18.80 | 18.80 |
ai21.jamba-instruct-v1:0 | ai21.jamba-instruct-v1:0 | bedrock | standard | 0.50 | 0.70 |
ai21.jamba-1-5-large-v1:0 | ai21.jamba-1-5-large-v1:0 | bedrock | standard | 2.00 | 8.00 |
ai21.jamba-1-5-mini-v1:0 | ai21.jamba-1-5-mini-v1:0 | bedrock | standard | 0.20 | 0.40 |
amazon.rerank-v1:0 | amazon.rerank-v1:0 | bedrock | standard | 0.00 | 0.00 |
amazon.titan-text-lite-v1 | amazon.titan-text-lite-v1 | bedrock | standard | 0.30 | 0.40 |
amazon.titan-text-express-v1 | amazon.titan-text-express-v1 | bedrock | standard | 1.30 | 1.70 |
amazon.titan-text-premier-v1:0 | amazon.titan-text-premier-v1:0 | bedrock | standard | 0.50 | 1.50 |
amazon.titan-embed-text-v1 | amazon.titan-embed-text-v1 | bedrock | standard | 0.10 | 0.00 |
amazon.titan-embed-text-v2:0 | amazon.titan-embed-text-v2:0 | bedrock | standard | 0.20 | 0.00 |
amazon.titan-embed-image-v1 | amazon.titan-embed-image-v1 | bedrock | standard | 0.80 | 0.00 |
mistral.mistral-7b-instruct-v0:2 | mistral.mistral-7b-instruct-v0:2 | bedrock | standard | 0.15 | 0.20 |
mistral.mixtral-8x7b-instruct-v0:1 | mistral.mixtral-8x7b-instruct-v0:1 | bedrock | standard | 0.45 | 0.70 |
mistral.mistral-large-2402-v1:0 | mistral.mistral-large-2402-v1:0 | bedrock | standard | 8.00 | 24.00 |
mistral.mistral-large-2407-v1:0 | mistral.mistral-large-2407-v1:0 | bedrock | standard | 3.00 | 9.00 |
mistral.mistral-small-2402-v1:0 | mistral.mistral-small-2402-v1:0 | bedrock | standard | 1.00 | 3.00 |
bedrock/us-west-2/mistral.mixtral-8x7b-instruct-v0:1 | bedrock/us-west-2/mistral.mixtral-8x7b-instruct-v0:1 | bedrock | standard | 0.45 | 0.70 |
bedrock/us-east-1/mistral.mixtral-8x7b-instruct-v0:1 | bedrock/us-east-1/mistral.mixtral-8x7b-instruct-v0:1 | bedrock | standard | 0.45 | 0.70 |
bedrock/eu-west-3/mistral.mixtral-8x7b-instruct-v0:1 | bedrock/eu-west-3/mistral.mixtral-8x7b-instruct-v0:1 | bedrock | standard | 0.59 | 0.91 |
bedrock/us-west-2/mistral.mistral-7b-instruct-v0:2 | bedrock/us-west-2/mistral.mistral-7b-instruct-v0:2 | bedrock | standard | 0.15 | 0.20 |
bedrock/us-east-1/mistral.mistral-7b-instruct-v0:2 | bedrock/us-east-1/mistral.mistral-7b-instruct-v0:2 | bedrock | standard | 0.15 | 0.20 |
bedrock/eu-west-3/mistral.mistral-7b-instruct-v0:2 | bedrock/eu-west-3/mistral.mistral-7b-instruct-v0:2 | bedrock | standard | 0.20 | 0.26 |
bedrock/us-east-1/mistral.mistral-large-2402-v1:0 | bedrock/us-east-1/mistral.mistral-large-2402-v1:0 | bedrock | standard | 8.00 | 24.00 |
bedrock/us-west-2/mistral.mistral-large-2402-v1:0 | bedrock/us-west-2/mistral.mistral-large-2402-v1:0 | bedrock | standard | 8.00 | 24.00 |
bedrock/eu-west-3/mistral.mistral-large-2402-v1:0 | bedrock/eu-west-3/mistral.mistral-large-2402-v1:0 | bedrock | standard | 10.40 | 31.20 |
amazon.nova-micro-v1:0 | amazon.nova-micro-v1:0 | bedrock_converse | standard | 0.04 | 0.14 |
us.amazon.nova-micro-v1:0 | us.amazon.nova-micro-v1:0 | bedrock_converse | standard | 0.04 | 0.14 |
eu.amazon.nova-micro-v1:0 | eu.amazon.nova-micro-v1:0 | bedrock_converse | standard | 0.05 | 0.18 |
amazon.nova-lite-v1:0 | amazon.nova-lite-v1:0 | bedrock_converse | standard | 0.06 | 0.24 |
us.amazon.nova-lite-v1:0 | us.amazon.nova-lite-v1:0 | bedrock_converse | standard | 0.06 | 0.24 |
eu.amazon.nova-lite-v1:0 | eu.amazon.nova-lite-v1:0 | bedrock_converse | standard | 0.08 | 0.31 |
amazon.nova-pro-v1:0 | amazon.nova-pro-v1:0 | bedrock_converse | standard | 0.80 | 3.20 |
us.amazon.nova-pro-v1:0 | us.amazon.nova-pro-v1:0 | bedrock_converse | standard | 0.80 | 3.20 |
eu.amazon.nova-pro-v1:0 | eu.amazon.nova-pro-v1:0 | bedrock_converse | standard | 1.05 | 4.20 |
us.amazon.nova-premier-v1:0 | us.amazon.nova-premier-v1:0 | bedrock_converse | standard | 2.50 | 12.50 |
anthropic.claude-3-sonnet-20240229-v1:0 | anthropic.claude-3-sonnet-20240229-v1:0 | bedrock | standard | 3.00 | 15.00 |
bedrock/invoke/anthropic.claude-3-5-sonnet-20240620-v1:0 | bedrock/invoke/anthropic.claude-3-5-sonnet-20240620-v1:0 | bedrock | standard | 3.00 | 15.00 |
anthropic.claude-3-5-sonnet-20240620-v1:0 | anthropic.claude-3-5-sonnet-20240620-v1:0 | bedrock | standard | 3.00 | 15.00 |
anthropic.claude-opus-4-20250514-v1:0 | anthropic.claude-opus-4-20250514-v1:0 | bedrock_converse | standard | 15.00 | 75.00 |
anthropic.claude-sonnet-4-20250514-v1:0 | anthropic.claude-sonnet-4-20250514-v1:0 | bedrock_converse | standard | 3.00 | 15.00 |
anthropic.claude-3-7-sonnet-20250219-v1:0 | anthropic.claude-3-7-sonnet-20250219-v1:0 | bedrock_converse | standard | 3.00 | 15.00 |
anthropic.claude-3-5-sonnet-20241022-v2:0 | anthropic.claude-3-5-sonnet-20241022-v2:0 | bedrock | standard | 3.00 | 15.00 |
anthropic.claude-3-haiku-20240307-v1:0 | anthropic.claude-3-haiku-20240307-v1:0 | bedrock | standard | 0.25 | 1.25 |
anthropic.claude-3-5-haiku-20241022-v1:0 | anthropic.claude-3-5-haiku-20241022-v1:0 | bedrock | standard | 0.80 | 4.00 |
anthropic.claude-3-opus-20240229-v1:0 | anthropic.claude-3-opus-20240229-v1:0 | bedrock | standard | 15.00 | 75.00 |
us.anthropic.claude-3-sonnet-20240229-v1:0 | us.anthropic.claude-3-sonnet-20240229-v1:0 | bedrock | standard | 3.00 | 15.00 |
us.anthropic.claude-3-5-sonnet-20240620-v1:0 | us.anthropic.claude-3-5-sonnet-20240620-v1:0 | bedrock | standard | 3.00 | 15.00 |
us.anthropic.claude-3-5-sonnet-20241022-v2:0 | us.anthropic.claude-3-5-sonnet-20241022-v2:0 | bedrock | standard | 3.00 | 15.00 |
us.anthropic.claude-3-7-sonnet-20250219-v1:0 | us.anthropic.claude-3-7-sonnet-20250219-v1:0 | bedrock_converse | standard | 3.00 | 15.00 |
us.anthropic.claude-opus-4-20250514-v1:0 | us.anthropic.claude-opus-4-20250514-v1:0 | bedrock_converse | standard | 15.00 | 75.00 |
us.anthropic.claude-sonnet-4-20250514-v1:0 | us.anthropic.claude-sonnet-4-20250514-v1:0 | bedrock_converse | standard | 3.00 | 15.00 |
us.anthropic.claude-3-haiku-20240307-v1:0 | us.anthropic.claude-3-haiku-20240307-v1:0 | bedrock | standard | 0.25 | 1.25 |
us.anthropic.claude-3-5-haiku-20241022-v1:0 | us.anthropic.claude-3-5-haiku-20241022-v1:0 | bedrock | standard | 0.80 | 4.00 |
us.anthropic.claude-3-opus-20240229-v1:0 | us.anthropic.claude-3-opus-20240229-v1:0 | bedrock | standard | 15.00 | 75.00 |
eu.anthropic.claude-3-sonnet-20240229-v1:0 | eu.anthropic.claude-3-sonnet-20240229-v1:0 | bedrock | standard | 3.00 | 15.00 |
eu.anthropic.claude-3-5-sonnet-20240620-v1:0 | eu.anthropic.claude-3-5-sonnet-20240620-v1:0 | bedrock | standard | 3.00 | 15.00 |
eu.anthropic.claude-3-5-sonnet-20241022-v2:0 | eu.anthropic.claude-3-5-sonnet-20241022-v2:0 | bedrock | standard | 3.00 | 15.00 |
eu.anthropic.claude-3-7-sonnet-20250219-v1:0 | eu.anthropic.claude-3-7-sonnet-20250219-v1:0 | bedrock | standard | 3.00 | 15.00 |
eu.anthropic.claude-3-haiku-20240307-v1:0 | eu.anthropic.claude-3-haiku-20240307-v1:0 | bedrock | standard | 0.25 | 1.25 |
eu.anthropic.claude-opus-4-20250514-v1:0 | eu.anthropic.claude-opus-4-20250514-v1:0 | bedrock_converse | standard | 15.00 | 75.00 |
eu.anthropic.claude-sonnet-4-20250514-v1:0 | eu.anthropic.claude-sonnet-4-20250514-v1:0 | bedrock_converse | standard | 3.00 | 15.00 |
eu.anthropic.claude-3-5-haiku-20241022-v1:0 | eu.anthropic.claude-3-5-haiku-20241022-v1:0 | bedrock | standard | 0.25 | 1.25 |
eu.anthropic.claude-3-opus-20240229-v1:0 | eu.anthropic.claude-3-opus-20240229-v1:0 | bedrock | standard | 15.00 | 75.00 |
anthropic.claude-v1 | anthropic.claude-v1 | bedrock | standard | 8.00 | 24.00 |
bedrock/us-east-1/anthropic.claude-v1 | bedrock/us-east-1/anthropic.claude-v1 | bedrock | standard | 8.00 | 24.00 |
bedrock/us-west-2/anthropic.claude-v1 | bedrock/us-west-2/anthropic.claude-v1 | bedrock | standard | 8.00 | 24.00 |
bedrock/ap-northeast-1/anthropic.claude-v1 | bedrock/ap-northeast-1/anthropic.claude-v1 | bedrock | standard | 8.00 | 24.00 |
bedrock/eu-central-1/anthropic.claude-v1 | bedrock/eu-central-1/anthropic.claude-v1 | bedrock | standard | 8.00 | 24.00 |
anthropic.claude-v2 | anthropic.claude-v2 | bedrock | standard | 8.00 | 24.00 |
bedrock/us-east-1/anthropic.claude-v2 | bedrock/us-east-1/anthropic.claude-v2 | bedrock | standard | 8.00 | 24.00 |
bedrock/us-west-2/anthropic.claude-v2 | bedrock/us-west-2/anthropic.claude-v2 | bedrock | standard | 8.00 | 24.00 |
bedrock/ap-northeast-1/anthropic.claude-v2 | bedrock/ap-northeast-1/anthropic.claude-v2 | bedrock | standard | 8.00 | 24.00 |
bedrock/eu-central-1/anthropic.claude-v2 | bedrock/eu-central-1/anthropic.claude-v2 | bedrock | standard | 8.00 | 24.00 |
anthropic.claude-v2:1 | anthropic.claude-v2:1 | bedrock | standard | 8.00 | 24.00 |
bedrock/us-east-1/anthropic.claude-v2:1 | bedrock/us-east-1/anthropic.claude-v2:1 | bedrock | standard | 8.00 | 24.00 |
bedrock/us-west-2/anthropic.claude-v2:1 | bedrock/us-west-2/anthropic.claude-v2:1 | bedrock | standard | 8.00 | 24.00 |
bedrock/ap-northeast-1/anthropic.claude-v2:1 | bedrock/ap-northeast-1/anthropic.claude-v2:1 | bedrock | standard | 8.00 | 24.00 |
bedrock/eu-central-1/anthropic.claude-v2:1 | bedrock/eu-central-1/anthropic.claude-v2:1 | bedrock | standard | 8.00 | 24.00 |
anthropic.claude-instant-v1 | anthropic.claude-instant-v1 | bedrock | standard | 0.80 | 2.40 |
bedrock/us-east-1/anthropic.claude-instant-v1 | bedrock/us-east-1/anthropic.claude-instant-v1 | bedrock | standard | 0.80 | 2.40 |
bedrock/us-west-2/anthropic.claude-instant-v1 | bedrock/us-west-2/anthropic.claude-instant-v1 | bedrock | standard | 0.80 | 2.40 |
bedrock/ap-northeast-1/anthropic.claude-instant-v1 | bedrock/ap-northeast-1/anthropic.claude-instant-v1 | bedrock | standard | 2.23 | 7.55 |
bedrock/eu-central-1/anthropic.claude-instant-v1 | bedrock/eu-central-1/anthropic.claude-instant-v1 | bedrock | standard | 2.48 | 8.38 |
cohere.rerank-v3-5:0 | cohere.rerank-v3-5:0 | bedrock | standard | 0.00 | 0.00 |
cohere.command-text-v14 | cohere.command-text-v14 | bedrock | standard | 1.50 | 2.00 |
cohere.command-light-text-v14 | cohere.command-light-text-v14 | bedrock | standard | 0.30 | 0.60 |
cohere.command-r-plus-v1:0 | cohere.command-r-plus-v1:0 | bedrock | standard | 3.00 | 15.00 |
cohere.command-r-v1:0 | cohere.command-r-v1:0 | bedrock | standard | 0.50 | 1.50 |
cohere.embed-english-v3 | cohere.embed-english-v3 | bedrock | standard | 0.10 | 0.00 |
cohere.embed-multilingual-v3 | cohere.embed-multilingual-v3 | bedrock | standard | 0.10 | 0.00 |
us.deepseek.r1-v1:0 | us.deepseek.r1-v1:0 | bedrock_converse | standard | 1.35 | 5.40 |
meta.llama3-3-70b-instruct-v1:0 | meta.llama3-3-70b-instruct-v1:0 | bedrock_converse | standard | 0.72 | 0.72 |
meta.llama2-13b-chat-v1 | meta.llama2-13b-chat-v1 | bedrock | standard | 0.75 | 1.00 |
meta.llama2-70b-chat-v1 | meta.llama2-70b-chat-v1 | bedrock | standard | 1.95 | 2.56 |
meta.llama3-8b-instruct-v1:0 | meta.llama3-8b-instruct-v1:0 | bedrock | standard | 0.30 | 0.60 |
bedrock/us-east-1/meta.llama3-8b-instruct-v1:0 | bedrock/us-east-1/meta.llama3-8b-instruct-v1:0 | bedrock | standard | 0.30 | 0.60 |
bedrock/us-west-1/meta.llama3-8b-instruct-v1:0 | bedrock/us-west-1/meta.llama3-8b-instruct-v1:0 | bedrock | standard | 0.30 | 0.60 |
bedrock/ap-south-1/meta.llama3-8b-instruct-v1:0 | bedrock/ap-south-1/meta.llama3-8b-instruct-v1:0 | bedrock | standard | 0.36 | 0.72 |
bedrock/ca-central-1/meta.llama3-8b-instruct-v1:0 | bedrock/ca-central-1/meta.llama3-8b-instruct-v1:0 | bedrock | standard | 0.35 | 0.69 |
bedrock/eu-west-1/meta.llama3-8b-instruct-v1:0 | bedrock/eu-west-1/meta.llama3-8b-instruct-v1:0 | bedrock | standard | 0.32 | 0.65 |
bedrock/eu-west-2/meta.llama3-8b-instruct-v1:0 | bedrock/eu-west-2/meta.llama3-8b-instruct-v1:0 | bedrock | standard | 0.39 | 0.78 |
bedrock/sa-east-1/meta.llama3-8b-instruct-v1:0 | bedrock/sa-east-1/meta.llama3-8b-instruct-v1:0 | bedrock | standard | 0.50 | 1.01 |
meta.llama3-70b-instruct-v1:0 | meta.llama3-70b-instruct-v1:0 | bedrock | standard | 2.65 | 3.50 |
bedrock/us-east-1/meta.llama3-70b-instruct-v1:0 | bedrock/us-east-1/meta.llama3-70b-instruct-v1:0 | bedrock | standard | 2.65 | 3.50 |
bedrock/us-west-1/meta.llama3-70b-instruct-v1:0 | bedrock/us-west-1/meta.llama3-70b-instruct-v1:0 | bedrock | standard | 2.65 | 3.50 |
bedrock/ap-south-1/meta.llama3-70b-instruct-v1:0 | bedrock/ap-south-1/meta.llama3-70b-instruct-v1:0 | bedrock | standard | 3.18 | 4.20 |
bedrock/ca-central-1/meta.llama3-70b-instruct-v1:0 | bedrock/ca-central-1/meta.llama3-70b-instruct-v1:0 | bedrock | standard | 3.05 | 4.03 |
bedrock/eu-west-1/meta.llama3-70b-instruct-v1:0 | bedrock/eu-west-1/meta.llama3-70b-instruct-v1:0 | bedrock | standard | 2.86 | 3.78 |
bedrock/eu-west-2/meta.llama3-70b-instruct-v1:0 | bedrock/eu-west-2/meta.llama3-70b-instruct-v1:0 | bedrock | standard | 3.45 | 4.55 |
bedrock/sa-east-1/meta.llama3-70b-instruct-v1:0 | bedrock/sa-east-1/meta.llama3-70b-instruct-v1:0 | bedrock | standard | 4.45 | 5.88 |
meta.llama3-1-8b-instruct-v1:0 | meta.llama3-1-8b-instruct-v1:0 | bedrock | standard | 0.22 | 0.22 |
us.meta.llama3-1-8b-instruct-v1:0 | us.meta.llama3-1-8b-instruct-v1:0 | bedrock | standard | 0.22 | 0.22 |
meta.llama3-1-70b-instruct-v1:0 | meta.llama3-1-70b-instruct-v1:0 | bedrock | standard | 0.99 | 0.99 |
us.meta.llama3-1-70b-instruct-v1:0 | us.meta.llama3-1-70b-instruct-v1:0 | bedrock | standard | 0.99 | 0.99 |
meta.llama3-1-405b-instruct-v1:0 | meta.llama3-1-405b-instruct-v1:0 | bedrock | standard | 5.32 | 16.00 |
us.meta.llama3-1-405b-instruct-v1:0 | us.meta.llama3-1-405b-instruct-v1:0 | bedrock | standard | 5.32 | 16.00 |
meta.llama3-2-1b-instruct-v1:0 | meta.llama3-2-1b-instruct-v1:0 | bedrock | standard | 0.10 | 0.10 |
us.meta.llama3-2-1b-instruct-v1:0 | us.meta.llama3-2-1b-instruct-v1:0 | bedrock | standard | 0.10 | 0.10 |
eu.meta.llama3-2-1b-instruct-v1:0 | eu.meta.llama3-2-1b-instruct-v1:0 | bedrock | standard | 0.13 | 0.13 |
meta.llama3-2-3b-instruct-v1:0 | meta.llama3-2-3b-instruct-v1:0 | bedrock | standard | 0.15 | 0.15 |
us.meta.llama3-2-3b-instruct-v1:0 | us.meta.llama3-2-3b-instruct-v1:0 | bedrock | standard | 0.15 | 0.15 |
eu.meta.llama3-2-3b-instruct-v1:0 | eu.meta.llama3-2-3b-instruct-v1:0 | bedrock | standard | 0.19 | 0.19 |
meta.llama3-2-11b-instruct-v1:0 | meta.llama3-2-11b-instruct-v1:0 | bedrock | standard | 0.35 | 0.35 |
us.meta.llama3-2-11b-instruct-v1:0 | us.meta.llama3-2-11b-instruct-v1:0 | bedrock | standard | 0.35 | 0.35 |
meta.llama3-2-90b-instruct-v1:0 | meta.llama3-2-90b-instruct-v1:0 | bedrock | standard | 2.00 | 2.00 |
us.meta.llama3-2-90b-instruct-v1:0 | us.meta.llama3-2-90b-instruct-v1:0 | bedrock | standard | 2.00 | 2.00 |
us.meta.llama3-3-70b-instruct-v1:0 | us.meta.llama3-3-70b-instruct-v1:0 | bedrock_converse | standard | 0.72 | 0.72 |
meta.llama4-maverick-17b-instruct-v1:0 | meta.llama4-maverick-17b-instruct-v1:0 | bedrock_converse | standard | 0.24 | 0.97 |
meta.llama4-maverick-17b-instruct-v1:0 | meta.llama4-maverick-17b-instruct-v1:0 | bedrock_converse | batch | 0.12 | 0.49 |
us.meta.llama4-maverick-17b-instruct-v1:0 | us.meta.llama4-maverick-17b-instruct-v1:0 | bedrock_converse | standard | 0.24 | 0.97 |
us.meta.llama4-maverick-17b-instruct-v1:0 | us.meta.llama4-maverick-17b-instruct-v1:0 | bedrock_converse | batch | 0.12 | 0.49 |
meta.llama4-scout-17b-instruct-v1:0 | meta.llama4-scout-17b-instruct-v1:0 | bedrock_converse | standard | 0.17 | 0.66 |
meta.llama4-scout-17b-instruct-v1:0 | meta.llama4-scout-17b-instruct-v1:0 | bedrock_converse | batch | 0.09 | 0.33 |
us.meta.llama4-scout-17b-instruct-v1:0 | us.meta.llama4-scout-17b-instruct-v1:0 | bedrock_converse | standard | 0.17 | 0.66 |
us.meta.llama4-scout-17b-instruct-v1:0 | us.meta.llama4-scout-17b-instruct-v1:0 | bedrock_converse | batch | 0.09 | 0.33 |
sagemaker/meta-textgeneration-llama-2-7b | sagemaker/meta-textgeneration-llama-2-7b | sagemaker | standard | 0.00 | 0.00 |
sagemaker/meta-textgeneration-llama-2-7b-f | sagemaker/meta-textgeneration-llama-2-7b-f | sagemaker | standard | 0.00 | 0.00 |
sagemaker/meta-textgeneration-llama-2-13b | sagemaker/meta-textgeneration-llama-2-13b | sagemaker | standard | 0.00 | 0.00 |
sagemaker/meta-textgeneration-llama-2-13b-f | sagemaker/meta-textgeneration-llama-2-13b-f | sagemaker | standard | 0.00 | 0.00 |
sagemaker/meta-textgeneration-llama-2-70b | sagemaker/meta-textgeneration-llama-2-70b | sagemaker | standard | 0.00 | 0.00 |
sagemaker/meta-textgeneration-llama-2-70b-b-f | sagemaker/meta-textgeneration-llama-2-70b-b-f | sagemaker | standard | 0.00 | 0.00 |
together-ai-up-to-4b | together-ai-up-to-4b | together_ai | standard | 0.10 | 0.10 |
together-ai-4.1b-8b | together-ai-4.1b-8b | together_ai | standard | 0.20 | 0.20 |
together-ai-8.1b-21b | together-ai-8.1b-21b | together_ai | standard | 0.30 | 0.30 |
together-ai-21.1b-41b | together-ai-21.1b-41b | together_ai | standard | 0.80 | 0.80 |
together-ai-41.1b-80b | together-ai-41.1b-80b | together_ai | standard | 0.90 | 0.90 |
together-ai-81.1b-110b | together-ai-81.1b-110b | together_ai | standard | 1.80 | 1.80 |
together-ai-embedding-up-to-150m | together-ai-embedding-up-to-150m | together_ai | standard | 0.01 | 0.00 |
together-ai-embedding-151m-to-350m | together-ai-embedding-151m-to-350m | together_ai | standard | 0.02 | 0.00 |
together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | together_ai | standard | 0.18 | 0.18 |
together_ai/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo | together_ai/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo | together_ai | standard | 0.88 | 0.88 |
together_ai/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo | together_ai/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo | together_ai | standard | 3.50 | 3.50 |
together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo | together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo | together_ai | standard | 0.88 | 0.88 |
together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo-Free | together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo-Free | together_ai | standard | 0.00 | 0.00 |
together_ai/mistralai/Mixtral-8x7B-Instruct-v0.1 | together_ai/mistralai/Mixtral-8x7B-Instruct-v0.1 | together_ai | standard | 0.60 | 0.60 |
ollama/codegemma | ollama/codegemma | ollama | standard | 0.00 | 0.00 |
ollama/codegeex4 | ollama/codegeex4 | ollama | standard | 0.00 | 0.00 |
ollama/deepseek-coder-v2-instruct | ollama/deepseek-coder-v2-instruct | ollama | standard | 0.00 | 0.00 |
ollama/deepseek-coder-v2-base | ollama/deepseek-coder-v2-base | ollama | standard | 0.00 | 0.00 |
ollama/deepseek-coder-v2-lite-instruct | ollama/deepseek-coder-v2-lite-instruct | ollama | standard | 0.00 | 0.00 |
ollama/deepseek-coder-v2-lite-base | ollama/deepseek-coder-v2-lite-base | ollama | standard | 0.00 | 0.00 |
ollama/internlm2_5-20b-chat | ollama/internlm2_5-20b-chat | ollama | standard | 0.00 | 0.00 |
ollama/llama2 | ollama/llama2 | ollama | standard | 0.00 | 0.00 |
ollama/llama2:7b | ollama/llama2:7b | ollama | standard | 0.00 | 0.00 |
ollama/llama2:13b | ollama/llama2:13b | ollama | standard | 0.00 | 0.00 |
ollama/llama2:70b | ollama/llama2:70b | ollama | standard | 0.00 | 0.00 |
ollama/llama2-uncensored | ollama/llama2-uncensored | ollama | standard | 0.00 | 0.00 |
ollama/llama3 | ollama/llama3 | ollama | standard | 0.00 | 0.00 |
ollama/llama3:8b | ollama/llama3:8b | ollama | standard | 0.00 | 0.00 |
ollama/llama3:70b | ollama/llama3:70b | ollama | standard | 0.00 | 0.00 |
ollama/llama3.1 | ollama/llama3.1 | ollama | standard | 0.00 | 0.00 |
ollama/mistral-large-instruct-2407 | ollama/mistral-large-instruct-2407 | ollama | standard | 0.00 | 0.00 |
ollama/mistral | ollama/mistral | ollama | standard | 0.00 | 0.00 |
ollama/mistral-7B-Instruct-v0.1 | ollama/mistral-7B-Instruct-v0.1 | ollama | standard | 0.00 | 0.00 |
ollama/mistral-7B-Instruct-v0.2 | ollama/mistral-7B-Instruct-v0.2 | ollama | standard | 0.00 | 0.00 |
ollama/mixtral-8x7B-Instruct-v0.1 | ollama/mixtral-8x7B-Instruct-v0.1 | ollama | standard | 0.00 | 0.00 |
ollama/mixtral-8x22B-Instruct-v0.1 | ollama/mixtral-8x22B-Instruct-v0.1 | ollama | standard | 0.00 | 0.00 |
ollama/codellama | ollama/codellama | ollama | standard | 0.00 | 0.00 |
ollama/orca-mini | ollama/orca-mini | ollama | standard | 0.00 | 0.00 |
ollama/vicuna | ollama/vicuna | ollama | standard | 0.00 | 0.00 |
deepinfra/lizpreciatior/lzlv_70b_fp16_hf | deepinfra/lizpreciatior/lzlv_70b_fp16_hf | deepinfra | standard | 0.70 | 0.90 |
deepinfra/Gryphe/MythoMax-L2-13b | deepinfra/Gryphe/MythoMax-L2-13b | deepinfra | standard | 0.22 | 0.22 |
deepinfra/mistralai/Mistral-7B-Instruct-v0.1 | deepinfra/mistralai/Mistral-7B-Instruct-v0.1 | deepinfra | standard | 0.13 | 0.13 |
deepinfra/meta-llama/Llama-2-70b-chat-hf | deepinfra/meta-llama/Llama-2-70b-chat-hf | deepinfra | standard | 0.70 | 0.90 |
deepinfra/cognitivecomputations/dolphin-2.6-mixtral-8x7b | deepinfra/cognitivecomputations/dolphin-2.6-mixtral-8x7b | deepinfra | standard | 0.27 | 0.27 |
deepinfra/codellama/CodeLlama-34b-Instruct-hf | deepinfra/codellama/CodeLlama-34b-Instruct-hf | deepinfra | standard | 0.60 | 0.60 |
deepinfra/deepinfra/mixtral | deepinfra/deepinfra/mixtral | deepinfra | standard | 0.27 | 0.27 |
deepinfra/Phind/Phind-CodeLlama-34B-v2 | deepinfra/Phind/Phind-CodeLlama-34B-v2 | deepinfra | standard | 0.60 | 0.60 |
deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1 | deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1 | deepinfra | standard | 0.27 | 0.27 |
deepinfra/deepinfra/airoboros-70b | deepinfra/deepinfra/airoboros-70b | deepinfra | standard | 0.70 | 0.90 |
deepinfra/01-ai/Yi-34B-Chat | deepinfra/01-ai/Yi-34B-Chat | deepinfra | standard | 0.60 | 0.60 |
deepinfra/01-ai/Yi-6B-200K | deepinfra/01-ai/Yi-6B-200K | deepinfra | standard | 0.13 | 0.13 |
deepinfra/jondurbin/airoboros-l2-70b-gpt4-1.4.1 | deepinfra/jondurbin/airoboros-l2-70b-gpt4-1.4.1 | deepinfra | standard | 0.70 | 0.90 |
deepinfra/meta-llama/Llama-2-13b-chat-hf | deepinfra/meta-llama/Llama-2-13b-chat-hf | deepinfra | standard | 0.22 | 0.22 |
deepinfra/amazon/MistralLite | deepinfra/amazon/MistralLite | deepinfra | standard | 0.20 | 0.20 |
deepinfra/meta-llama/Llama-2-7b-chat-hf | deepinfra/meta-llama/Llama-2-7b-chat-hf | deepinfra | standard | 0.13 | 0.13 |
deepinfra/meta-llama/Meta-Llama-3-8B-Instruct | deepinfra/meta-llama/Meta-Llama-3-8B-Instruct | deepinfra | standard | 0.08 | 0.08 |
deepinfra/meta-llama/Meta-Llama-3-70B-Instruct | deepinfra/meta-llama/Meta-Llama-3-70B-Instruct | deepinfra | standard | 0.59 | 0.79 |
deepinfra/meta-llama/Meta-Llama-3.1-405B-Instruct | deepinfra/meta-llama/Meta-Llama-3.1-405B-Instruct | deepinfra | standard | 0.90 | 0.90 |
deepinfra/01-ai/Yi-34B-200K | deepinfra/01-ai/Yi-34B-200K | deepinfra | standard | 0.60 | 0.60 |
deepinfra/openchat/openchat_3.5 | deepinfra/openchat/openchat_3.5 | deepinfra | standard | 0.13 | 0.13 |
perplexity/codellama-34b-instruct | perplexity/codellama-34b-instruct | perplexity | standard | 0.35 | 1.40 |
perplexity/codellama-70b-instruct | perplexity/codellama-70b-instruct | perplexity | standard | 0.70 | 2.80 |
perplexity/llama-3.1-70b-instruct | perplexity/llama-3.1-70b-instruct | perplexity | standard | 1.00 | 1.00 |
perplexity/llama-3.1-8b-instruct | perplexity/llama-3.1-8b-instruct | perplexity | standard | 0.20 | 0.20 |
perplexity/llama-3.1-sonar-huge-128k-online | perplexity/llama-3.1-sonar-huge-128k-online | perplexity | standard | 5.00 | 5.00 |
perplexity/llama-3.1-sonar-large-128k-online | perplexity/llama-3.1-sonar-large-128k-online | perplexity | standard | 1.00 | 1.00 |
perplexity/llama-3.1-sonar-large-128k-chat | perplexity/llama-3.1-sonar-large-128k-chat | perplexity | standard | 1.00 | 1.00 |
perplexity/llama-3.1-sonar-small-128k-chat | perplexity/llama-3.1-sonar-small-128k-chat | perplexity | standard | 0.20 | 0.20 |
perplexity/llama-3.1-sonar-small-128k-online | perplexity/llama-3.1-sonar-small-128k-online | perplexity | standard | 0.20 | 0.20 |
perplexity/pplx-7b-chat | perplexity/pplx-7b-chat | perplexity | standard | 0.07 | 0.28 |
perplexity/pplx-70b-chat | perplexity/pplx-70b-chat | perplexity | standard | 0.70 | 2.80 |
perplexity/pplx-7b-online | perplexity/pplx-7b-online | perplexity | standard | 0.00 | 0.28 |
perplexity/pplx-70b-online | perplexity/pplx-70b-online | perplexity | standard | 0.00 | 2.80 |
perplexity/llama-2-70b-chat | perplexity/llama-2-70b-chat | perplexity | standard | 0.70 | 2.80 |
perplexity/mistral-7b-instruct | perplexity/mistral-7b-instruct | perplexity | standard | 0.07 | 0.28 |
perplexity/mixtral-8x7b-instruct | perplexity/mixtral-8x7b-instruct | perplexity | standard | 0.07 | 0.28 |
perplexity/sonar-small-chat | perplexity/sonar-small-chat | perplexity | standard | 0.07 | 0.28 |
perplexity/sonar-small-online | perplexity/sonar-small-online | perplexity | standard | 0.00 | 0.28 |
perplexity/sonar-medium-chat | perplexity/sonar-medium-chat | perplexity | standard | 0.60 | 1.80 |
perplexity/sonar-medium-online | perplexity/sonar-medium-online | perplexity | standard | 0.00 | 1.80 |
perplexity/sonar | perplexity/sonar | perplexity | standard | 1.00 | 1.00 |
perplexity/sonar-pro | perplexity/sonar-pro | perplexity | standard | 3.00 | 15.00 |
perplexity/sonar-reasoning | perplexity/sonar-reasoning | perplexity | standard | 1.00 | 5.00 |
perplexity/sonar-reasoning-pro | perplexity/sonar-reasoning-pro | perplexity | standard | 2.00 | 8.00 |
perplexity/sonar-deep-research | perplexity/sonar-deep-research | perplexity | standard | 2.00 | 8.00 |
fireworks_ai/accounts/fireworks/models/llama-v3p2-1b-instruct | fireworks_ai/accounts/fireworks/models/llama-v3p2-1b-instruct | fireworks_ai | standard | 0.10 | 0.10 |
fireworks_ai/accounts/fireworks/models/llama-v3p2-3b-instruct | fireworks_ai/accounts/fireworks/models/llama-v3p2-3b-instruct | fireworks_ai | standard | 0.10 | 0.10 |
fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct | fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct | fireworks_ai | standard | 0.10 | 0.10 |
fireworks_ai/accounts/fireworks/models/llama-v3p2-11b-vision-instruct | fireworks_ai/accounts/fireworks/models/llama-v3p2-11b-vision-instruct | fireworks_ai | standard | 0.20 | 0.20 |
fireworks_ai/accounts/fireworks/models/llama-v3p2-90b-vision-instruct | fireworks_ai/accounts/fireworks/models/llama-v3p2-90b-vision-instruct | fireworks_ai | standard | 0.90 | 0.90 |
fireworks_ai/accounts/fireworks/models/firefunction-v2 | fireworks_ai/accounts/fireworks/models/firefunction-v2 | fireworks_ai | standard | 0.90 | 0.90 |
fireworks_ai/accounts/fireworks/models/mixtral-8x22b-instruct-hf | fireworks_ai/accounts/fireworks/models/mixtral-8x22b-instruct-hf | fireworks_ai | standard | 1.20 | 1.20 |
fireworks_ai/accounts/fireworks/models/qwen2-72b-instruct | fireworks_ai/accounts/fireworks/models/qwen2-72b-instruct | fireworks_ai | standard | 0.90 | 0.90 |
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct | fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct | fireworks_ai | standard | 0.90 | 0.90 |
fireworks_ai/accounts/fireworks/models/yi-large | fireworks_ai/accounts/fireworks/models/yi-large | fireworks_ai | standard | 3.00 | 3.00 |
fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-instruct | fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-instruct | fireworks_ai | standard | 1.20 | 1.20 |
fireworks_ai/accounts/fireworks/models/deepseek-v3 | fireworks_ai/accounts/fireworks/models/deepseek-v3 | fireworks_ai | standard | 0.90 | 0.90 |
fireworks_ai/accounts/fireworks/models/deepseek-r1 | fireworks_ai/accounts/fireworks/models/deepseek-r1 | fireworks_ai | standard | 3.00 | 8.00 |
fireworks_ai/accounts/fireworks/models/deepseek-r1-basic | fireworks_ai/accounts/fireworks/models/deepseek-r1-basic | fireworks_ai | standard | 0.55 | 2.19 |
fireworks_ai/accounts/fireworks/models/deepseek-r1-0528 | fireworks_ai/accounts/fireworks/models/deepseek-r1-0528 | fireworks_ai | standard | 3.00 | 8.00 |
fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct | fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct | fireworks_ai | standard | 3.00 | 3.00 |
fireworks_ai/accounts/fireworks/models/llama4-maverick-instruct-basic | fireworks_ai/accounts/fireworks/models/llama4-maverick-instruct-basic | fireworks_ai | standard | 0.22 | 0.88 |
fireworks_ai/accounts/fireworks/models/llama4-scout-instruct-basic | fireworks_ai/accounts/fireworks/models/llama4-scout-instruct-basic | fireworks_ai | standard | 0.15 | 0.60 |
fireworks_ai/nomic-ai/nomic-embed-text-v1.5 | fireworks_ai/nomic-ai/nomic-embed-text-v1.5 | fireworks_ai-embedding-models | standard | 0.01 | 0.00 |
fireworks_ai/nomic-ai/nomic-embed-text-v1 | fireworks_ai/nomic-ai/nomic-embed-text-v1 | fireworks_ai-embedding-models | standard | 0.01 | 0.00 |
fireworks_ai/WhereIsAI/UAE-Large-V1 | fireworks_ai/WhereIsAI/UAE-Large-V1 | fireworks_ai-embedding-models | standard | 0.02 | 0.00 |
fireworks_ai/thenlper/gte-large | fireworks_ai/thenlper/gte-large | fireworks_ai-embedding-models | standard | 0.02 | 0.00 |
fireworks_ai/thenlper/gte-base | fireworks_ai/thenlper/gte-base | fireworks_ai-embedding-models | standard | 0.01 | 0.00 |
fireworks-ai-up-to-4b | fireworks-ai-up-to-4b | fireworks_ai | standard | 0.20 | 0.20 |
fireworks-ai-4.1b-to-16b | fireworks-ai-4.1b-to-16b | fireworks_ai | standard | 0.20 | 0.20 |
fireworks-ai-above-16b | fireworks-ai-above-16b | fireworks_ai | standard | 0.90 | 0.90 |
fireworks-ai-moe-up-to-56b | fireworks-ai-moe-up-to-56b | fireworks_ai | standard | 0.50 | 0.50 |
fireworks-ai-56b-to-176b | fireworks-ai-56b-to-176b | fireworks_ai | standard | 1.20 | 1.20 |
fireworks-ai-default | fireworks-ai-default | fireworks_ai | standard | 0.00 | 0.00 |
fireworks-ai-embedding-up-to-150m | fireworks-ai-embedding-up-to-150m | fireworks_ai-embedding-models | standard | 0.01 | 0.00 |
fireworks-ai-embedding-150m-to-350m | fireworks-ai-embedding-150m-to-350m | fireworks_ai-embedding-models | standard | 0.02 | 0.00 |
anyscale/mistralai/Mistral-7B-Instruct-v0.1 | anyscale/mistralai/Mistral-7B-Instruct-v0.1 | anyscale | standard | 0.15 | 0.15 |
anyscale/mistralai/Mixtral-8x7B-Instruct-v0.1 | anyscale/mistralai/Mixtral-8x7B-Instruct-v0.1 | anyscale | standard | 0.15 | 0.15 |
anyscale/mistralai/Mixtral-8x22B-Instruct-v0.1 | anyscale/mistralai/Mixtral-8x22B-Instruct-v0.1 | anyscale | standard | 0.90 | 0.90 |
anyscale/HuggingFaceH4/zephyr-7b-beta | anyscale/HuggingFaceH4/zephyr-7b-beta | anyscale | standard | 0.15 | 0.15 |
anyscale/google/gemma-7b-it | anyscale/google/gemma-7b-it | anyscale | standard | 0.15 | 0.15 |
anyscale/meta-llama/Llama-2-7b-chat-hf | anyscale/meta-llama/Llama-2-7b-chat-hf | anyscale | standard | 0.15 | 0.15 |
anyscale/meta-llama/Llama-2-13b-chat-hf | anyscale/meta-llama/Llama-2-13b-chat-hf | anyscale | standard | 0.25 | 0.25 |
anyscale/meta-llama/Llama-2-70b-chat-hf | anyscale/meta-llama/Llama-2-70b-chat-hf | anyscale | standard | 1.00 | 1.00 |
anyscale/codellama/CodeLlama-34b-Instruct-hf | anyscale/codellama/CodeLlama-34b-Instruct-hf | anyscale | standard | 1.00 | 1.00 |
anyscale/codellama/CodeLlama-70b-Instruct-hf | anyscale/codellama/CodeLlama-70b-Instruct-hf | anyscale | standard | 1.00 | 1.00 |
anyscale/meta-llama/Meta-Llama-3-8B-Instruct | anyscale/meta-llama/Meta-Llama-3-8B-Instruct | anyscale | standard | 0.15 | 0.15 |
anyscale/meta-llama/Meta-Llama-3-70B-Instruct | anyscale/meta-llama/Meta-Llama-3-70B-Instruct | anyscale | standard | 1.00 | 1.00 |
cloudflare/@cf/meta/llama-2-7b-chat-fp16 | cloudflare/@cf/meta/llama-2-7b-chat-fp16 | cloudflare | standard | 1.92 | 1.92 |
cloudflare/@cf/meta/llama-2-7b-chat-int8 | cloudflare/@cf/meta/llama-2-7b-chat-int8 | cloudflare | standard | 1.92 | 1.92 |
cloudflare/@cf/mistral/mistral-7b-instruct-v0.1 | cloudflare/@cf/mistral/mistral-7b-instruct-v0.1 | cloudflare | standard | 1.92 | 1.92 |
cloudflare/@hf/thebloke/codellama-7b-instruct-awq | cloudflare/@hf/thebloke/codellama-7b-instruct-awq | cloudflare | standard | 1.92 | 1.92 |
voyage/voyage-01 | voyage/voyage-01 | voyage | standard | 0.10 | 0.00 |
voyage/voyage-lite-01 | voyage/voyage-lite-01 | voyage | standard | 0.10 | 0.00 |
voyage/voyage-large-2 | voyage/voyage-large-2 | voyage | standard | 0.12 | 0.00 |
voyage/voyage-finance-2 | voyage/voyage-finance-2 | voyage | standard | 0.12 | 0.00 |
voyage/voyage-lite-02-instruct | voyage/voyage-lite-02-instruct | voyage | standard | 0.10 | 0.00 |
voyage/voyage-law-2 | voyage/voyage-law-2 | voyage | standard | 0.12 | 0.00 |
voyage/voyage-code-2 | voyage/voyage-code-2 | voyage | standard | 0.12 | 0.00 |
voyage/voyage-2 | voyage/voyage-2 | voyage | standard | 0.10 | 0.00 |
voyage/voyage-3-large | voyage/voyage-3-large | voyage | standard | 0.18 | 0.00 |
voyage/voyage-3 | voyage/voyage-3 | voyage | standard | 0.06 | 0.00 |
voyage/voyage-3-lite | voyage/voyage-3-lite | voyage | standard | 0.02 | 0.00 |
voyage/voyage-code-3 | voyage/voyage-code-3 | voyage | standard | 0.18 | 0.00 |
voyage/voyage-multimodal-3 | voyage/voyage-multimodal-3 | voyage | standard | 0.12 | 0.00 |
voyage/rerank-2 | voyage/rerank-2 | voyage | standard | 0.05 | 0.00 |
voyage/rerank-2-lite | voyage/rerank-2-lite | voyage | standard | 0.02 | 0.00 |
databricks/databricks-claude-3-7-sonnet | databricks/databricks-claude-3-7-sonnet | databricks | standard | 2.50 | 17.86 |
databricks/databricks-meta-llama-3-1-405b-instruct | databricks/databricks-meta-llama-3-1-405b-instruct | databricks | standard | 5.00 | 15.00 |
databricks/databricks-meta-llama-3-1-70b-instruct | databricks/databricks-meta-llama-3-1-70b-instruct | databricks | standard | 1.00 | 3.00 |
databricks/databricks-meta-llama-3-3-70b-instruct | databricks/databricks-meta-llama-3-3-70b-instruct | databricks | standard | 1.00 | 3.00 |
databricks/databricks-llama-4-maverick | databricks/databricks-llama-4-maverick | databricks | standard | 5.00 | 15.00 |
databricks/databricks-dbrx-instruct | databricks/databricks-dbrx-instruct | databricks | standard | 0.75 | 2.25 |
databricks/databricks-meta-llama-3-70b-instruct | databricks/databricks-meta-llama-3-70b-instruct | databricks | standard | 1.00 | 3.00 |
databricks/databricks-llama-2-70b-chat | databricks/databricks-llama-2-70b-chat | databricks | standard | 0.50 | 1.50 |
databricks/databricks-mixtral-8x7b-instruct | databricks/databricks-mixtral-8x7b-instruct | databricks | standard | 0.50 | 1.00 |
databricks/databricks-mpt-30b-instruct | databricks/databricks-mpt-30b-instruct | databricks | standard | 1.00 | 1.00 |
databricks/databricks-mpt-7b-instruct | databricks/databricks-mpt-7b-instruct | databricks | standard | 0.50 | 0.00 |
databricks/databricks-bge-large-en | databricks/databricks-bge-large-en | databricks | standard | 0.10 | 0.00 |
databricks/databricks-gte-large-en | databricks/databricks-gte-large-en | databricks | standard | 0.13 | 0.00 |
sambanova/Meta-Llama-3.1-8B-Instruct | sambanova/Meta-Llama-3.1-8B-Instruct | sambanova | standard | 0.10 | 0.20 |
sambanova/Meta-Llama-3.1-405B-Instruct | sambanova/Meta-Llama-3.1-405B-Instruct | sambanova | standard | 5.00 | 10.00 |
sambanova/Meta-Llama-3.2-1B-Instruct | sambanova/Meta-Llama-3.2-1B-Instruct | sambanova | standard | 0.04 | 0.08 |
sambanova/Meta-Llama-3.2-3B-Instruct | sambanova/Meta-Llama-3.2-3B-Instruct | sambanova | standard | 0.08 | 0.16 |
sambanova/Llama-4-Maverick-17B-128E-Instruct | sambanova/Llama-4-Maverick-17B-128E-Instruct | sambanova | standard | 0.63 | 1.80 |
sambanova/Llama-4-Scout-17B-16E-Instruct | sambanova/Llama-4-Scout-17B-16E-Instruct | sambanova | standard | 0.40 | 0.70 |
sambanova/Meta-Llama-3.3-70B-Instruct | sambanova/Meta-Llama-3.3-70B-Instruct | sambanova | standard | 0.60 | 1.20 |
sambanova/Meta-Llama-Guard-3-8B | sambanova/Meta-Llama-Guard-3-8B | sambanova | standard | 0.30 | 0.30 |
sambanova/Qwen3-32B | sambanova/Qwen3-32B | sambanova | standard | 0.40 | 0.80 |
sambanova/QwQ-32B | sambanova/QwQ-32B | sambanova | standard | 0.50 | 1.00 |
sambanova/Qwen2-Audio-7B-Instruct | sambanova/Qwen2-Audio-7B-Instruct | sambanova | standard | 0.50 | 100.00 |
sambanova/DeepSeek-R1-Distill-Llama-70B | sambanova/DeepSeek-R1-Distill-Llama-70B | sambanova | standard | 0.70 | 1.40 |
sambanova/DeepSeek-R1 | sambanova/DeepSeek-R1 | sambanova | standard | 5.00 | 7.00 |
sambanova/DeepSeek-V3-0324 | sambanova/DeepSeek-V3-0324 | sambanova | standard | 3.00 | 4.50 |
jina-reranker-v2-base-multilingual | jina-reranker-v2-base-multilingual | jina_ai | standard | 0.02 | 0.02 |
nscale/meta-llama/Llama-4-Scout-17B-16E-Instruct | nscale/meta-llama/Llama-4-Scout-17B-16E-Instruct | nscale | standard | 0.09 | 0.29 |
nscale/Qwen/Qwen2.5-Coder-3B-Instruct | nscale/Qwen/Qwen2.5-Coder-3B-Instruct | nscale | standard | 0.01 | 0.03 |
nscale/Qwen/Qwen2.5-Coder-7B-Instruct | nscale/Qwen/Qwen2.5-Coder-7B-Instruct | nscale | standard | 0.01 | 0.03 |
nscale/Qwen/Qwen2.5-Coder-32B-Instruct | nscale/Qwen/Qwen2.5-Coder-32B-Instruct | nscale | standard | 0.06 | 0.20 |
nscale/Qwen/QwQ-32B | nscale/Qwen/QwQ-32B | nscale | standard | 0.18 | 0.20 |
nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-70B | nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-70B | nscale | standard | 0.38 | 0.38 |
nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-8B | nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-8B | nscale | standard | 0.03 | 0.03 |
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | nscale | standard | 0.09 | 0.09 |
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | nscale | standard | 0.20 | 0.20 |
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | nscale | standard | 0.07 | 0.07 |
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | nscale | standard | 0.15 | 0.15 |
nscale/mistralai/mixtral-8x22b-instruct-v0.1 | nscale/mistralai/mixtral-8x22b-instruct-v0.1 | nscale | standard | 0.60 | 0.60 |
nscale/meta-llama/Llama-3.1-8B-Instruct | nscale/meta-llama/Llama-3.1-8B-Instruct | nscale | standard | 0.03 | 0.03 |
nscale/meta-llama/Llama-3.3-70B-Instruct | nscale/meta-llama/Llama-3.3-70B-Instruct | nscale | standard | 0.20 | 0.20 |
gemini-2.5-pro | gemini-2.5-pro | gemini | standard | 1.25 | 10.00 |
960 rows