AI Models Prices

* Prices are per 1M tokens in USD

Model Version Provider Pricing mode Input price * Output price *
omni-moderation-latest omni-moderation-latest openai standard 0.00 0.00
omni-moderation-latest-intents omni-moderation-latest-intents openai standard 0.00 0.00
omni-moderation-2024-09-26 omni-moderation-2024-09-26 openai standard 0.00 0.00
gpt-4 gpt-4 openai standard 30.00 60.00
gpt-4.1 gpt-4.1 openai standard 2.00 8.00
gpt-4.1 gpt-4.1 openai batch 1.00 4.00
gpt-4.1-2025-04-14 gpt-4.1-2025-04-14 openai standard 2.00 8.00
gpt-4.1-2025-04-14 gpt-4.1-2025-04-14 openai batch 1.00 4.00
gpt-4.1-mini gpt-4.1-mini openai standard 0.40 1.60
gpt-4.1-mini gpt-4.1-mini openai batch 0.20 0.80
gpt-4.1-mini-2025-04-14 gpt-4.1-mini-2025-04-14 openai standard 0.40 1.60
gpt-4.1-mini-2025-04-14 gpt-4.1-mini-2025-04-14 openai batch 0.20 0.80
gpt-4.1-nano gpt-4.1-nano openai standard 0.10 0.40
gpt-4.1-nano gpt-4.1-nano openai batch 0.05 0.20
gpt-4.1-nano-2025-04-14 gpt-4.1-nano-2025-04-14 openai standard 0.10 0.40
gpt-4.1-nano-2025-04-14 gpt-4.1-nano-2025-04-14 openai batch 0.05 0.20
gpt-4o gpt-4o openai standard 2.50 10.00
gpt-4o gpt-4o openai batch 1.25 5.00
watsonx/ibm/granite-3-8b-instruct watsonx/ibm/granite-3-8b-instruct watsonx standard 200.00 200.00
gpt-4o-search-preview-2025-03-11 gpt-4o-search-preview-2025-03-11 openai standard 2.50 10.00
gpt-4o-search-preview-2025-03-11 gpt-4o-search-preview-2025-03-11 openai batch 1.25 5.00
gpt-4o-search-preview gpt-4o-search-preview openai standard 2.50 10.00
gpt-4o-search-preview gpt-4o-search-preview openai batch 1.25 5.00
gpt-4.5-preview gpt-4.5-preview openai standard 75.00 150.00
gpt-4.5-preview gpt-4.5-preview openai batch 37.50 75.00
gpt-4.5-preview-2025-02-27 gpt-4.5-preview-2025-02-27 openai standard 75.00 150.00
gpt-4.5-preview-2025-02-27 gpt-4.5-preview-2025-02-27 openai batch 37.50 75.00
gpt-4o-audio-preview gpt-4o-audio-preview openai standard 2.50 10.00
gpt-4o-audio-preview-2024-12-17 gpt-4o-audio-preview-2024-12-17 openai standard 2.50 10.00
gpt-4o-audio-preview-2024-10-01 gpt-4o-audio-preview-2024-10-01 openai standard 2.50 10.00
gpt-4o-mini-audio-preview gpt-4o-mini-audio-preview openai standard 0.15 0.60
gpt-4o-mini-audio-preview-2024-12-17 gpt-4o-mini-audio-preview-2024-12-17 openai standard 0.15 0.60
gpt-4o-mini gpt-4o-mini openai standard 0.15 0.60
gpt-4o-mini gpt-4o-mini openai batch 0.08 0.30
gpt-4o-mini-search-preview-2025-03-11 gpt-4o-mini-search-preview-2025-03-11 openai standard 0.15 0.60
gpt-4o-mini-search-preview-2025-03-11 gpt-4o-mini-search-preview-2025-03-11 openai batch 0.08 0.30
gpt-4o-mini-search-preview gpt-4o-mini-search-preview openai standard 0.15 0.60
gpt-4o-mini-search-preview gpt-4o-mini-search-preview openai batch 0.08 0.30
gpt-4o-mini-2024-07-18 gpt-4o-mini-2024-07-18 openai standard 0.15 0.60
gpt-4o-mini-2024-07-18 gpt-4o-mini-2024-07-18 openai batch 0.08 0.30
codex-mini-latest codex-mini-latest openai standard 1.50 6.00
o1-pro o1-pro openai standard 150.00 600.00
o1-pro o1-pro openai batch 75.00 300.00
o1-pro-2025-03-19 o1-pro-2025-03-19 openai standard 150.00 600.00
o1-pro-2025-03-19 o1-pro-2025-03-19 openai batch 75.00 300.00
o1 o1 openai standard 15.00 60.00
o1-mini o1-mini openai standard 1.10 4.40
computer-use-preview computer-use-preview azure standard 3.00 12.00
o3 o3 openai standard 10.00 40.00
o3-2025-04-16 o3-2025-04-16 openai standard 10.00 40.00
o3-mini o3-mini openai standard 1.10 4.40
o3-mini-2025-01-31 o3-mini-2025-01-31 openai standard 1.10 4.40
o4-mini o4-mini openai standard 1.10 4.40
o4-mini-2025-04-16 o4-mini-2025-04-16 openai standard 1.10 4.40
o1-mini-2024-09-12 o1-mini-2024-09-12 openai standard 3.00 12.00
o1-preview o1-preview openai standard 15.00 60.00
o1-preview-2024-09-12 o1-preview-2024-09-12 openai standard 15.00 60.00
o1-2024-12-17 o1-2024-12-17 openai standard 15.00 60.00
chatgpt-4o-latest chatgpt-4o-latest openai standard 5.00 15.00
gpt-4o-2024-05-13 gpt-4o-2024-05-13 openai standard 5.00 15.00
gpt-4o-2024-05-13 gpt-4o-2024-05-13 openai batch 2.50 7.50
gpt-4o-2024-08-06 gpt-4o-2024-08-06 openai standard 2.50 10.00
gpt-4o-2024-08-06 gpt-4o-2024-08-06 openai batch 1.25 5.00
gpt-4o-2024-11-20 gpt-4o-2024-11-20 openai standard 2.50 10.00
gpt-4o-2024-11-20 gpt-4o-2024-11-20 openai batch 1.25 5.00
gpt-4o-realtime-preview-2024-10-01 gpt-4o-realtime-preview-2024-10-01 openai standard 5.00 20.00
gpt-4o-realtime-preview gpt-4o-realtime-preview openai standard 5.00 20.00
gpt-4o-realtime-preview-2024-12-17 gpt-4o-realtime-preview-2024-12-17 openai standard 5.00 20.00
gpt-4o-mini-realtime-preview gpt-4o-mini-realtime-preview openai standard 0.60 2.40
gpt-4o-mini-realtime-preview-2024-12-17 gpt-4o-mini-realtime-preview-2024-12-17 openai standard 0.60 2.40
gpt-4-turbo-preview gpt-4-turbo-preview openai standard 10.00 30.00
gpt-4-0314 gpt-4-0314 openai standard 30.00 60.00
gpt-4-0613 gpt-4-0613 openai standard 30.00 60.00
gpt-4-32k gpt-4-32k openai standard 60.00 120.00
gpt-4-32k-0314 gpt-4-32k-0314 openai standard 60.00 120.00
gpt-4-32k-0613 gpt-4-32k-0613 openai standard 60.00 120.00
gpt-4-turbo gpt-4-turbo openai standard 10.00 30.00
gpt-4-turbo-2024-04-09 gpt-4-turbo-2024-04-09 openai standard 10.00 30.00
gpt-4-1106-preview gpt-4-1106-preview openai standard 10.00 30.00
gpt-4-0125-preview gpt-4-0125-preview openai standard 10.00 30.00
gpt-4-vision-preview gpt-4-vision-preview openai standard 10.00 30.00
gpt-4-1106-vision-preview gpt-4-1106-vision-preview openai standard 10.00 30.00
gpt-3.5-turbo gpt-3.5-turbo openai standard 1.50 2.00
gpt-3.5-turbo-0301 gpt-3.5-turbo-0301 openai standard 1.50 2.00
gpt-3.5-turbo-0613 gpt-3.5-turbo-0613 openai standard 1.50 2.00
gpt-3.5-turbo-1106 gpt-3.5-turbo-1106 openai standard 1.00 2.00
gpt-3.5-turbo-0125 gpt-3.5-turbo-0125 openai standard 0.50 1.50
gpt-3.5-turbo-16k gpt-3.5-turbo-16k openai standard 3.00 4.00
gpt-3.5-turbo-16k-0613 gpt-3.5-turbo-16k-0613 openai standard 3.00 4.00
ft:gpt-3.5-turbo ft:gpt-3.5-turbo openai standard 3.00 6.00
ft:gpt-3.5-turbo ft:gpt-3.5-turbo openai batch 1.50 3.00
ft:gpt-3.5-turbo-0125 ft:gpt-3.5-turbo-0125 openai standard 3.00 6.00
ft:gpt-3.5-turbo-1106 ft:gpt-3.5-turbo-1106 openai standard 3.00 6.00
ft:gpt-3.5-turbo-0613 ft:gpt-3.5-turbo-0613 openai standard 3.00 6.00
ft:gpt-4-0613 ft:gpt-4-0613 openai standard 30.00 60.00
ft:gpt-4o-2024-08-06 ft:gpt-4o-2024-08-06 openai standard 3.75 15.00
ft:gpt-4o-2024-08-06 ft:gpt-4o-2024-08-06 openai batch 1.88 7.50
ft:gpt-4o-2024-11-20 ft:gpt-4o-2024-11-20 openai standard 3.75 15.00
ft:gpt-4o-mini-2024-07-18 ft:gpt-4o-mini-2024-07-18 openai standard 0.30 1.20
ft:gpt-4o-mini-2024-07-18 ft:gpt-4o-mini-2024-07-18 openai batch 0.15 0.60
ft:davinci-002 ft:davinci-002 text-completion-openai standard 2.00 2.00
ft:davinci-002 ft:davinci-002 text-completion-openai batch 1.00 1.00
ft:babbage-002 ft:babbage-002 text-completion-openai standard 0.40 0.40
ft:babbage-002 ft:babbage-002 text-completion-openai batch 0.20 0.20
text-embedding-3-large text-embedding-3-large openai standard 0.13 0.00
text-embedding-3-large text-embedding-3-large openai batch 0.07 0.00
text-embedding-3-small text-embedding-3-small openai standard 0.02 0.00
text-embedding-3-small text-embedding-3-small openai batch 0.01 0.00
text-embedding-ada-002 text-embedding-ada-002 openai standard 0.10 0.00
text-embedding-ada-002-v2 text-embedding-ada-002-v2 openai standard 0.10 0.00
text-embedding-ada-002-v2 text-embedding-ada-002-v2 openai batch 0.05 0.00
text-moderation-stable text-moderation-stable openai standard 0.00 0.00
text-moderation-007 text-moderation-007 openai standard 0.00 0.00
text-moderation-latest text-moderation-latest openai standard 0.00 0.00
gpt-4o-transcribe gpt-4o-transcribe openai standard 2.50 10.00
gpt-4o-mini-transcribe gpt-4o-mini-transcribe openai standard 1.25 5.00
gpt-4o-mini-tts gpt-4o-mini-tts openai standard 2.50 10.00
azure/gpt-4o-mini-tts azure/gpt-4o-mini-tts azure standard 2.50 10.00
azure/computer-use-preview azure/computer-use-preview azure standard 3.00 12.00
azure/gpt-4o-audio-preview-2024-12-17 azure/gpt-4o-audio-preview-2024-12-17 azure standard 2.50 10.00
azure/gpt-4o-mini-audio-preview-2024-12-17 azure/gpt-4o-mini-audio-preview-2024-12-17 azure standard 2.50 10.00
azure/gpt-4.1 azure/gpt-4.1 azure standard 2.00 8.00
azure/gpt-4.1 azure/gpt-4.1 azure batch 1.00 4.00
azure/gpt-4.1-2025-04-14 azure/gpt-4.1-2025-04-14 azure standard 2.00 8.00
azure/gpt-4.1-2025-04-14 azure/gpt-4.1-2025-04-14 azure batch 1.00 4.00
azure/gpt-4.1-mini azure/gpt-4.1-mini azure standard 0.40 1.60
azure/gpt-4.1-mini azure/gpt-4.1-mini azure batch 0.20 0.80
azure/gpt-4.1-mini-2025-04-14 azure/gpt-4.1-mini-2025-04-14 azure standard 0.40 1.60
azure/gpt-4.1-mini-2025-04-14 azure/gpt-4.1-mini-2025-04-14 azure batch 0.20 0.80
azure/gpt-4.1-nano azure/gpt-4.1-nano azure standard 0.10 0.40
azure/gpt-4.1-nano azure/gpt-4.1-nano azure batch 0.05 0.20
azure/gpt-4.1-nano-2025-04-14 azure/gpt-4.1-nano-2025-04-14 azure standard 0.10 0.40
azure/gpt-4.1-nano-2025-04-14 azure/gpt-4.1-nano-2025-04-14 azure batch 0.05 0.20
azure/o3 azure/o3 azure standard 10.00 40.00
azure/o3-2025-04-16 azure/o3-2025-04-16 azure standard 10.00 40.00
azure/o4-mini azure/o4-mini azure standard 1.10 4.40
azure/gpt-4o-mini-realtime-preview-2024-12-17 azure/gpt-4o-mini-realtime-preview-2024-12-17 azure standard 0.60 2.40
azure/eu/gpt-4o-mini-realtime-preview-2024-12-17 azure/eu/gpt-4o-mini-realtime-preview-2024-12-17 azure standard 0.66 2.64
azure/us/gpt-4o-mini-realtime-preview-2024-12-17 azure/us/gpt-4o-mini-realtime-preview-2024-12-17 azure standard 0.66 2.64
azure/gpt-4o-realtime-preview-2024-12-17 azure/gpt-4o-realtime-preview-2024-12-17 azure standard 5.00 20.00
azure/us/gpt-4o-realtime-preview-2024-12-17 azure/us/gpt-4o-realtime-preview-2024-12-17 azure standard 5.50 22.00
azure/eu/gpt-4o-realtime-preview-2024-12-17 azure/eu/gpt-4o-realtime-preview-2024-12-17 azure standard 5.50 22.00
azure/gpt-4o-realtime-preview-2024-10-01 azure/gpt-4o-realtime-preview-2024-10-01 azure standard 5.00 20.00
azure/us/gpt-4o-realtime-preview-2024-10-01 azure/us/gpt-4o-realtime-preview-2024-10-01 azure standard 5.50 22.00
azure/eu/gpt-4o-realtime-preview-2024-10-01 azure/eu/gpt-4o-realtime-preview-2024-10-01 azure standard 5.50 22.00
azure/o4-mini-2025-04-16 azure/o4-mini-2025-04-16 azure standard 1.10 4.40
azure/o3-mini-2025-01-31 azure/o3-mini-2025-01-31 azure standard 1.10 4.40
azure/us/o3-mini-2025-01-31 azure/us/o3-mini-2025-01-31 azure standard 1.21 4.84
azure/us/o3-mini-2025-01-31 azure/us/o3-mini-2025-01-31 azure batch 0.61 2.42
azure/eu/o3-mini-2025-01-31 azure/eu/o3-mini-2025-01-31 azure standard 1.21 4.84
azure/eu/o3-mini-2025-01-31 azure/eu/o3-mini-2025-01-31 azure batch 0.61 2.42
azure/o3-mini azure/o3-mini azure standard 1.10 4.40
azure/o1-mini azure/o1-mini azure standard 1.21 4.84
azure/o1-mini-2024-09-12 azure/o1-mini-2024-09-12 azure standard 1.10 4.40
azure/us/o1-mini-2024-09-12 azure/us/o1-mini-2024-09-12 azure standard 1.21 4.84
azure/us/o1-mini-2024-09-12 azure/us/o1-mini-2024-09-12 azure batch 0.61 2.42
azure/eu/o1-mini-2024-09-12 azure/eu/o1-mini-2024-09-12 azure standard 1.21 4.84
azure/eu/o1-mini-2024-09-12 azure/eu/o1-mini-2024-09-12 azure batch 0.61 2.42
azure/o1 azure/o1 azure standard 15.00 60.00
azure/o1-2024-12-17 azure/o1-2024-12-17 azure standard 15.00 60.00
azure/us/o1-2024-12-17 azure/us/o1-2024-12-17 azure standard 16.50 66.00
azure/eu/o1-2024-12-17 azure/eu/o1-2024-12-17 azure standard 16.50 66.00
azure/codex-mini-latest azure/codex-mini-latest azure standard 1.50 6.00
azure/o1-preview azure/o1-preview azure standard 15.00 60.00
azure/o1-preview-2024-09-12 azure/o1-preview-2024-09-12 azure standard 15.00 60.00
azure/us/o1-preview-2024-09-12 azure/us/o1-preview-2024-09-12 azure standard 16.50 66.00
azure/eu/o1-preview-2024-09-12 azure/eu/o1-preview-2024-09-12 azure standard 16.50 66.00
azure/gpt-4.5-preview azure/gpt-4.5-preview azure standard 75.00 150.00
azure/gpt-4.5-preview azure/gpt-4.5-preview azure batch 37.50 75.00
azure/gpt-4o azure/gpt-4o azure standard 2.50 10.00
azure/global/gpt-4o-2024-11-20 azure/global/gpt-4o-2024-11-20 azure standard 2.50 10.00
azure/gpt-4o-2024-08-06 azure/gpt-4o-2024-08-06 azure standard 2.50 10.00
azure/global/gpt-4o-2024-08-06 azure/global/gpt-4o-2024-08-06 azure standard 2.50 10.00
azure/gpt-4o-2024-11-20 azure/gpt-4o-2024-11-20 azure standard 2.75 11.00
azure/us/gpt-4o-2024-11-20 azure/us/gpt-4o-2024-11-20 azure standard 2.75 11.00
azure/eu/gpt-4o-2024-11-20 azure/eu/gpt-4o-2024-11-20 azure standard 2.75 11.00
azure/gpt-4o-2024-05-13 azure/gpt-4o-2024-05-13 azure standard 5.00 15.00
azure/global-standard/gpt-4o-2024-08-06 azure/global-standard/gpt-4o-2024-08-06 azure standard 2.50 10.00
azure/us/gpt-4o-2024-08-06 azure/us/gpt-4o-2024-08-06 azure standard 2.75 11.00
azure/eu/gpt-4o-2024-08-06 azure/eu/gpt-4o-2024-08-06 azure standard 2.75 11.00
azure/global-standard/gpt-4o-2024-11-20 azure/global-standard/gpt-4o-2024-11-20 azure standard 2.50 10.00
azure/global-standard/gpt-4o-mini azure/global-standard/gpt-4o-mini azure standard 0.15 0.60
azure/gpt-4o-mini azure/gpt-4o-mini azure standard 0.17 0.66
azure/gpt-4o-mini-2024-07-18 azure/gpt-4o-mini-2024-07-18 azure standard 0.17 0.66
azure/us/gpt-4o-mini-2024-07-18 azure/us/gpt-4o-mini-2024-07-18 azure standard 0.17 0.66
azure/eu/gpt-4o-mini-2024-07-18 azure/eu/gpt-4o-mini-2024-07-18 azure standard 0.17 0.66
azure/gpt-4-turbo-2024-04-09 azure/gpt-4-turbo-2024-04-09 azure standard 10.00 30.00
azure/gpt-4-0125-preview azure/gpt-4-0125-preview azure standard 10.00 30.00
azure/gpt-4-1106-preview azure/gpt-4-1106-preview azure standard 10.00 30.00
azure/gpt-4-0613 azure/gpt-4-0613 azure standard 30.00 60.00
azure/gpt-4-32k-0613 azure/gpt-4-32k-0613 azure standard 60.00 120.00
azure/gpt-4-32k azure/gpt-4-32k azure standard 60.00 120.00
azure/gpt-4 azure/gpt-4 azure standard 30.00 60.00
azure/gpt-4-turbo azure/gpt-4-turbo azure standard 10.00 30.00
azure/gpt-4-turbo-vision-preview azure/gpt-4-turbo-vision-preview azure standard 10.00 30.00
azure/gpt-35-turbo-16k-0613 azure/gpt-35-turbo-16k-0613 azure standard 3.00 4.00
azure/gpt-35-turbo-1106 azure/gpt-35-turbo-1106 azure standard 1.00 2.00
azure/gpt-35-turbo-0613 azure/gpt-35-turbo-0613 azure standard 1.50 2.00
azure/gpt-35-turbo-0301 azure/gpt-35-turbo-0301 azure standard 0.20 2.00
azure/gpt-35-turbo-0125 azure/gpt-35-turbo-0125 azure standard 0.50 1.50
azure/gpt-3.5-turbo-0125 azure/gpt-3.5-turbo-0125 azure standard 0.50 1.50
azure/gpt-35-turbo-16k azure/gpt-35-turbo-16k azure standard 3.00 4.00
azure/gpt-35-turbo azure/gpt-35-turbo azure standard 0.50 1.50
azure/gpt-3.5-turbo azure/gpt-3.5-turbo azure standard 0.50 1.50
azure/gpt-3.5-turbo-instruct-0914 azure/gpt-3.5-turbo-instruct-0914 azure_text standard 1.50 2.00
azure/gpt-35-turbo-instruct azure/gpt-35-turbo-instruct azure_text standard 1.50 2.00
azure/gpt-35-turbo-instruct-0914 azure/gpt-35-turbo-instruct-0914 azure_text standard 1.50 2.00
azure/mistral-large-latest azure/mistral-large-latest azure standard 8.00 24.00
azure/mistral-large-2402 azure/mistral-large-2402 azure standard 8.00 24.00
azure/command-r-plus azure/command-r-plus azure standard 3.00 15.00
azure/ada azure/ada azure standard 0.10 0.00
azure/text-embedding-ada-002 azure/text-embedding-ada-002 azure standard 0.10 0.00
azure/text-embedding-3-large azure/text-embedding-3-large azure standard 0.13 0.00
azure/text-embedding-3-small azure/text-embedding-3-small azure standard 0.02 0.00
azure/standard/1024-x-1024/dall-e-3 azure/standard/1024-x-1024/dall-e-3 azure standard - 0.00
azure/hd/1024-x-1024/dall-e-3 azure/hd/1024-x-1024/dall-e-3 azure standard - 0.00
azure/standard/1024-x-1792/dall-e-3 azure/standard/1024-x-1792/dall-e-3 azure standard - 0.00
azure/standard/1792-x-1024/dall-e-3 azure/standard/1792-x-1024/dall-e-3 azure standard - 0.00
azure/hd/1024-x-1792/dall-e-3 azure/hd/1024-x-1792/dall-e-3 azure standard - 0.00
azure/hd/1792-x-1024/dall-e-3 azure/hd/1792-x-1024/dall-e-3 azure standard - 0.00
azure/standard/1024-x-1024/dall-e-2 azure/standard/1024-x-1024/dall-e-2 azure standard - 0.00
azure_ai/deepseek-r1 azure_ai/deepseek-r1 azure_ai standard 1.35 5.40
azure_ai/deepseek-v3 azure_ai/deepseek-v3 azure_ai standard 1.14 4.56
azure_ai/deepseek-v3-0324 azure_ai/deepseek-v3-0324 azure_ai standard 1.14 4.56
azure_ai/jamba-instruct azure_ai/jamba-instruct azure_ai standard 0.50 0.70
azure_ai/mistral-nemo azure_ai/mistral-nemo azure_ai standard 0.15 0.15
azure_ai/mistral-medium-2505 azure_ai/mistral-medium-2505 azure_ai standard 0.40 2.00
azure_ai/mistral-large azure_ai/mistral-large azure_ai standard 4.00 12.00
azure_ai/mistral-small azure_ai/mistral-small azure_ai standard 1.00 3.00
azure_ai/mistral-small-2503 azure_ai/mistral-small-2503 azure_ai standard 1.00 3.00
azure_ai/mistral-large-2407 azure_ai/mistral-large-2407 azure_ai standard 2.00 6.00
azure_ai/mistral-large-latest azure_ai/mistral-large-latest azure_ai standard 2.00 6.00
azure_ai/ministral-3b azure_ai/ministral-3b azure_ai standard 0.04 0.04
azure_ai/Llama-3.2-11B-Vision-Instruct azure_ai/Llama-3.2-11B-Vision-Instruct azure_ai standard 0.37 0.37
azure_ai/Llama-3.3-70B-Instruct azure_ai/Llama-3.3-70B-Instruct azure_ai standard 0.71 0.71
azure_ai/Llama-4-Scout-17B-16E-Instruct azure_ai/Llama-4-Scout-17B-16E-Instruct azure_ai standard 0.20 0.78
azure_ai/Llama-4-Maverick-17B-128E-Instruct-FP8 azure_ai/Llama-4-Maverick-17B-128E-Instruct-FP8 azure_ai standard 1.41 0.35
azure_ai/Llama-3.2-90B-Vision-Instruct azure_ai/Llama-3.2-90B-Vision-Instruct azure_ai standard 2.04 2.04
azure_ai/Meta-Llama-3-70B-Instruct azure_ai/Meta-Llama-3-70B-Instruct azure_ai standard 1.10 0.37
azure_ai/Meta-Llama-3.1-8B-Instruct azure_ai/Meta-Llama-3.1-8B-Instruct azure_ai standard 0.30 0.61
azure_ai/Meta-Llama-3.1-70B-Instruct azure_ai/Meta-Llama-3.1-70B-Instruct azure_ai standard 2.68 3.54
azure_ai/Meta-Llama-3.1-405B-Instruct azure_ai/Meta-Llama-3.1-405B-Instruct azure_ai standard 5.33 16.00
azure_ai/Phi-4-mini-instruct azure_ai/Phi-4-mini-instruct azure_ai standard 0.08 0.30
azure_ai/Phi-4-multimodal-instruct azure_ai/Phi-4-multimodal-instruct azure_ai standard 0.08 0.32
azure_ai/Phi-4 azure_ai/Phi-4 azure_ai standard 0.13 0.50
azure_ai/Phi-3.5-mini-instruct azure_ai/Phi-3.5-mini-instruct azure_ai standard 0.13 0.52
azure_ai/Phi-3.5-vision-instruct azure_ai/Phi-3.5-vision-instruct azure_ai standard 0.13 0.52
azure_ai/Phi-3.5-MoE-instruct azure_ai/Phi-3.5-MoE-instruct azure_ai standard 0.16 0.64
azure_ai/Phi-3-mini-4k-instruct azure_ai/Phi-3-mini-4k-instruct azure_ai standard 0.13 0.52
azure_ai/Phi-3-mini-128k-instruct azure_ai/Phi-3-mini-128k-instruct azure_ai standard 0.13 0.52
azure_ai/Phi-3-small-8k-instruct azure_ai/Phi-3-small-8k-instruct azure_ai standard 0.15 0.60
azure_ai/Phi-3-small-128k-instruct azure_ai/Phi-3-small-128k-instruct azure_ai standard 0.15 0.60
azure_ai/Phi-3-medium-4k-instruct azure_ai/Phi-3-medium-4k-instruct azure_ai standard 0.17 0.68
azure_ai/Phi-3-medium-128k-instruct azure_ai/Phi-3-medium-128k-instruct azure_ai standard 0.17 0.68
azure_ai/cohere-rerank-v3-multilingual azure_ai/cohere-rerank-v3-multilingual azure_ai standard 0.00 0.00
azure_ai/cohere-rerank-v3-english azure_ai/cohere-rerank-v3-english azure_ai standard 0.00 0.00
azure_ai/Cohere-embed-v3-english azure_ai/Cohere-embed-v3-english azure_ai standard 0.10 0.00
azure_ai/Cohere-embed-v3-multilingual azure_ai/Cohere-embed-v3-multilingual azure_ai standard 0.10 0.00
azure_ai/embed-v-4-0 azure_ai/embed-v-4-0 azure_ai standard 0.12 0.00
babbage-002 babbage-002 text-completion-openai standard 0.40 0.40
davinci-002 davinci-002 text-completion-openai standard 2.00 2.00
gpt-3.5-turbo-instruct gpt-3.5-turbo-instruct text-completion-openai standard 1.50 2.00
gpt-3.5-turbo-instruct-0914 gpt-3.5-turbo-instruct-0914 text-completion-openai standard 1.50 2.00
claude-instant-1 claude-instant-1 anthropic standard 1.63 5.51
mistral/mistral-tiny mistral/mistral-tiny mistral standard 0.25 0.25
mistral/mistral-small mistral/mistral-small mistral standard 0.10 0.30
mistral/mistral-small-latest mistral/mistral-small-latest mistral standard 0.10 0.30
mistral/mistral-medium mistral/mistral-medium mistral standard 2.70 8.10
mistral/mistral-medium-latest mistral/mistral-medium-latest mistral standard 0.40 2.00
mistral/mistral-medium-2505 mistral/mistral-medium-2505 mistral standard 0.40 2.00
mistral/mistral-medium-2312 mistral/mistral-medium-2312 mistral standard 2.70 8.10
mistral/mistral-large-latest mistral/mistral-large-latest mistral standard 2.00 6.00
mistral/mistral-large-2411 mistral/mistral-large-2411 mistral standard 2.00 6.00
mistral/mistral-large-2402 mistral/mistral-large-2402 mistral standard 4.00 12.00
mistral/mistral-large-2407 mistral/mistral-large-2407 mistral standard 3.00 9.00
mistral/pixtral-large-latest mistral/pixtral-large-latest mistral standard 2.00 6.00
mistral/pixtral-large-2411 mistral/pixtral-large-2411 mistral standard 2.00 6.00
mistral/pixtral-12b-2409 mistral/pixtral-12b-2409 mistral standard 0.15 0.15
mistral/open-mistral-7b mistral/open-mistral-7b mistral standard 0.25 0.25
mistral/open-mixtral-8x7b mistral/open-mixtral-8x7b mistral standard 0.70 0.70
mistral/open-mixtral-8x22b mistral/open-mixtral-8x22b mistral standard 2.00 6.00
mistral/codestral-latest mistral/codestral-latest mistral standard 1.00 3.00
mistral/codestral-2405 mistral/codestral-2405 mistral standard 1.00 3.00
mistral/open-mistral-nemo mistral/open-mistral-nemo mistral standard 0.30 0.30
mistral/open-mistral-nemo-2407 mistral/open-mistral-nemo-2407 mistral standard 0.30 0.30
mistral/open-codestral-mamba mistral/open-codestral-mamba mistral standard 0.25 0.25
mistral/codestral-mamba-latest mistral/codestral-mamba-latest mistral standard 0.25 0.25
mistral/devstral-small-2505 mistral/devstral-small-2505 mistral standard 0.10 0.30
mistral/mistral-embed mistral/mistral-embed mistral standard 0.10 -
deepseek/deepseek-reasoner deepseek/deepseek-reasoner deepseek standard 0.55 2.19
deepseek/deepseek-chat deepseek/deepseek-chat deepseek standard 0.27 1.10
codestral/codestral-latest codestral/codestral-latest codestral standard 0.00 0.00
codestral/codestral-2405 codestral/codestral-2405 codestral standard 0.00 0.00
text-completion-codestral/codestral-latest text-completion-codestral/codestral-latest text-completion-codestral standard 0.00 0.00
text-completion-codestral/codestral-2405 text-completion-codestral/codestral-2405 text-completion-codestral standard 0.00 0.00
xai/grok-beta xai/grok-beta xai standard 5.00 15.00
xai/grok-2-vision-1212 xai/grok-2-vision-1212 xai standard 2.00 10.00
xai/grok-2-vision-latest xai/grok-2-vision-latest xai standard 2.00 10.00
xai/grok-2-vision xai/grok-2-vision xai standard 2.00 10.00
xai/grok-3 xai/grok-3 xai standard 3.00 15.00
xai/grok-3-beta xai/grok-3-beta xai standard 3.00 15.00
xai/grok-3-fast-beta xai/grok-3-fast-beta xai standard 5.00 25.00
xai/grok-3-fast-latest xai/grok-3-fast-latest xai standard 5.00 25.00
xai/grok-3-mini-beta xai/grok-3-mini-beta xai standard 0.30 0.50
xai/grok-3-mini-fast-beta xai/grok-3-mini-fast-beta xai standard 0.60 4.00
xai/grok-3-mini-fast-latest xai/grok-3-mini-fast-latest xai standard 0.60 4.00
xai/grok-vision-beta xai/grok-vision-beta xai standard 5.00 15.00
xai/grok-2-1212 xai/grok-2-1212 xai standard 2.00 10.00
xai/grok-2 xai/grok-2 xai standard 2.00 10.00
xai/grok-2-latest xai/grok-2-latest xai standard 2.00 10.00
deepseek/deepseek-coder deepseek/deepseek-coder deepseek standard 0.14 0.28
groq/deepseek-r1-distill-llama-70b groq/deepseek-r1-distill-llama-70b groq standard 0.75 0.99
groq/llama-3.3-70b-versatile groq/llama-3.3-70b-versatile groq standard 0.59 0.79
groq/llama-3.3-70b-specdec groq/llama-3.3-70b-specdec groq standard 0.59 0.99
groq/llama-guard-3-8b groq/llama-guard-3-8b groq standard 0.20 0.20
groq/llama2-70b-4096 groq/llama2-70b-4096 groq standard 0.70 0.80
groq/llama3-8b-8192 groq/llama3-8b-8192 groq standard 0.05 0.08
groq/llama-3.2-1b-preview groq/llama-3.2-1b-preview groq standard 0.04 0.04
groq/llama-3.2-3b-preview groq/llama-3.2-3b-preview groq standard 0.06 0.06
groq/llama-3.2-11b-text-preview groq/llama-3.2-11b-text-preview groq standard 0.18 0.18
groq/llama-3.2-11b-vision-preview groq/llama-3.2-11b-vision-preview groq standard 0.18 0.18
groq/llama-3.2-90b-text-preview groq/llama-3.2-90b-text-preview groq standard 0.90 0.90
groq/llama-3.2-90b-vision-preview groq/llama-3.2-90b-vision-preview groq standard 0.90 0.90
groq/llama3-70b-8192 groq/llama3-70b-8192 groq standard 0.59 0.79
groq/llama-3.1-8b-instant groq/llama-3.1-8b-instant groq standard 0.05 0.08
groq/llama-3.1-70b-versatile groq/llama-3.1-70b-versatile groq standard 0.59 0.79
groq/llama-3.1-405b-reasoning groq/llama-3.1-405b-reasoning groq standard 0.59 0.79
groq/meta-llama/llama-4-scout-17b-16e-instruct groq/meta-llama/llama-4-scout-17b-16e-instruct groq standard 0.11 0.34
groq/meta-llama/llama-4-maverick-17b-128e-instruct groq/meta-llama/llama-4-maverick-17b-128e-instruct groq standard 0.20 0.60
groq/mistral-saba-24b groq/mistral-saba-24b groq standard 0.79 0.79
groq/mixtral-8x7b-32768 groq/mixtral-8x7b-32768 groq standard 0.24 0.24
groq/gemma-7b-it groq/gemma-7b-it groq standard 0.07 0.07
groq/gemma2-9b-it groq/gemma2-9b-it groq standard 0.20 0.20
groq/llama3-groq-70b-8192-tool-use-preview groq/llama3-groq-70b-8192-tool-use-preview groq standard 0.89 0.89
groq/llama3-groq-8b-8192-tool-use-preview groq/llama3-groq-8b-8192-tool-use-preview groq standard 0.19 0.19
groq/qwen-qwq-32b groq/qwen-qwq-32b groq standard 0.29 0.39
cerebras/llama3.1-8b cerebras/llama3.1-8b cerebras standard 0.10 0.10
cerebras/llama3.1-70b cerebras/llama3.1-70b cerebras standard 0.60 0.60
cerebras/llama-3.3-70b cerebras/llama-3.3-70b cerebras standard 0.85 1.20
cerebras/qwen-3-32b cerebras/qwen-3-32b cerebras standard 0.40 0.80
friendliai/meta-llama-3.1-8b-instruct friendliai/meta-llama-3.1-8b-instruct friendliai standard 0.10 0.10
friendliai/meta-llama-3.1-70b-instruct friendliai/meta-llama-3.1-70b-instruct friendliai standard 0.60 0.60
claude-instant-1.2 claude-instant-1.2 anthropic standard 0.16 0.55
claude-2 claude-2 anthropic standard 8.00 24.00
claude-2.1 claude-2.1 anthropic standard 8.00 24.00
claude-3-haiku-20240307 claude-3-haiku-20240307 anthropic standard 0.25 1.25
claude-3-5-haiku-20241022 claude-3-5-haiku-20241022 anthropic standard 0.80 4.00
claude-3-5-haiku-latest claude-3-5-haiku-latest anthropic standard 1.00 5.00
claude-3-opus-latest claude-3-opus-latest anthropic standard 15.00 75.00
claude-3-opus-20240229 claude-3-opus-20240229 anthropic standard 15.00 75.00
claude-3-sonnet-20240229 claude-3-sonnet-20240229 anthropic standard 3.00 15.00
claude-3-5-sonnet-latest claude-3-5-sonnet-latest anthropic standard 3.00 15.00
claude-3-5-sonnet-20240620 claude-3-5-sonnet-20240620 anthropic standard 3.00 15.00
claude-opus-4-20250514 claude-opus-4-20250514 anthropic standard 15.00 75.00
claude-sonnet-4-20250514 claude-sonnet-4-20250514 anthropic standard 3.00 15.00
claude-4-opus-20250514 claude-4-opus-20250514 anthropic standard 15.00 75.00
claude-4-sonnet-20250514 claude-4-sonnet-20250514 anthropic standard 3.00 15.00
claude-3-7-sonnet-latest claude-3-7-sonnet-latest anthropic standard 3.00 15.00
claude-3-7-sonnet-20250219 claude-3-7-sonnet-20250219 anthropic standard 3.00 15.00
claude-3-5-sonnet-20241022 claude-3-5-sonnet-20241022 anthropic standard 3.00 15.00
text-bison32k text-bison32k vertex_ai-text-models standard 0.13 0.13
text-bison32k@002 text-bison32k@002 vertex_ai-text-models standard 0.13 0.13
text-unicorn text-unicorn vertex_ai-text-models standard 10.00 28.00
text-unicorn@001 text-unicorn@001 vertex_ai-text-models standard 10.00 28.00
chat-bison chat-bison vertex_ai-chat-models standard 0.13 0.13
chat-bison@001 chat-bison@001 vertex_ai-chat-models standard 0.13 0.13
chat-bison@002 chat-bison@002 vertex_ai-chat-models standard 0.13 0.13
chat-bison-32k chat-bison-32k vertex_ai-chat-models standard 0.13 0.13
chat-bison-32k@002 chat-bison-32k@002 vertex_ai-chat-models standard 0.13 0.13
code-bison code-bison vertex_ai-code-text-models standard 0.13 0.13
code-bison@001 code-bison@001 vertex_ai-code-text-models standard 0.13 0.13
code-bison@002 code-bison@002 vertex_ai-code-text-models standard 0.13 0.13
code-bison32k code-bison32k vertex_ai-code-text-models standard 0.13 0.13
code-bison-32k@002 code-bison-32k@002 vertex_ai-code-text-models standard 0.13 0.13
code-gecko@001 code-gecko@001 vertex_ai-code-text-models standard 0.13 0.13
code-gecko@002 code-gecko@002 vertex_ai-code-text-models standard 0.13 0.13
code-gecko code-gecko vertex_ai-code-text-models standard 0.13 0.13
code-gecko-latest code-gecko-latest vertex_ai-code-text-models standard 0.13 0.13
codechat-bison@latest codechat-bison@latest vertex_ai-code-chat-models standard 0.13 0.13
codechat-bison codechat-bison vertex_ai-code-chat-models standard 0.13 0.13
codechat-bison@001 codechat-bison@001 vertex_ai-code-chat-models standard 0.13 0.13
codechat-bison@002 codechat-bison@002 vertex_ai-code-chat-models standard 0.13 0.13
codechat-bison-32k codechat-bison-32k vertex_ai-code-chat-models standard 0.13 0.13
codechat-bison-32k@002 codechat-bison-32k@002 vertex_ai-code-chat-models standard 0.13 0.13
gemini-pro gemini-pro vertex_ai-language-models standard 0.50 1.50
gemini-1.0-pro gemini-1.0-pro vertex_ai-language-models standard 0.50 1.50
gemini-1.0-pro-001 gemini-1.0-pro-001 vertex_ai-language-models standard 0.50 1.50
gemini-1.0-ultra gemini-1.0-ultra vertex_ai-language-models standard 0.50 1.50
gemini-1.0-ultra-001 gemini-1.0-ultra-001 vertex_ai-language-models standard 0.50 1.50
gemini-1.0-pro-002 gemini-1.0-pro-002 vertex_ai-language-models standard 0.50 1.50
gemini-1.5-pro gemini-1.5-pro vertex_ai-language-models standard 1.25 5.00
gemini-1.5-pro-002 gemini-1.5-pro-002 vertex_ai-language-models standard 1.25 5.00
gemini-1.5-pro-001 gemini-1.5-pro-001 vertex_ai-language-models standard 1.25 5.00
gemini-1.5-pro-preview-0514 gemini-1.5-pro-preview-0514 vertex_ai-language-models standard 0.08 0.31
gemini-1.5-pro-preview-0215 gemini-1.5-pro-preview-0215 vertex_ai-language-models standard 0.08 0.31
gemini-1.5-pro-preview-0409 gemini-1.5-pro-preview-0409 vertex_ai-language-models standard 0.08 0.31
gemini-1.5-flash gemini-1.5-flash vertex_ai-language-models standard 0.08 0.30
gemini-1.5-flash-exp-0827 gemini-1.5-flash-exp-0827 vertex_ai-language-models standard 0.00 0.00
gemini-1.5-flash-002 gemini-1.5-flash-002 vertex_ai-language-models standard 0.08 0.30
gemini-1.5-flash-001 gemini-1.5-flash-001 vertex_ai-language-models standard 0.08 0.30
gemini-1.5-flash-preview-0514 gemini-1.5-flash-preview-0514 vertex_ai-language-models standard 0.08 0.00
gemini-pro-experimental gemini-pro-experimental vertex_ai-language-models standard 0.00 0.00
gemini-flash-experimental gemini-flash-experimental vertex_ai-language-models standard 0.00 0.00
gemini-pro-vision gemini-pro-vision vertex_ai-vision-models standard 0.50 1.50
gemini-1.0-pro-vision gemini-1.0-pro-vision vertex_ai-vision-models standard 0.50 1.50
gemini-1.0-pro-vision-001 gemini-1.0-pro-vision-001 vertex_ai-vision-models standard 0.50 1.50
gemini-2.5-pro-exp-03-25 gemini-2.5-pro-exp-03-25 vertex_ai-language-models standard 1.25 10.00
gemini-2.0-pro-exp-02-05 gemini-2.0-pro-exp-02-05 vertex_ai-language-models standard 1.25 10.00
gemini-2.0-flash-exp gemini-2.0-flash-exp vertex_ai-language-models standard 0.15 0.60
gemini-2.0-flash-001 gemini-2.0-flash-001 vertex_ai-language-models standard 0.15 0.60
gemini-2.0-flash-thinking-exp gemini-2.0-flash-thinking-exp vertex_ai-language-models standard 0.00 0.00
gemini-2.0-flash-thinking-exp-01-21 gemini-2.0-flash-thinking-exp-01-21 vertex_ai-language-models standard 0.00 0.00
gemini/gemini-2.5-pro-exp-03-25 gemini/gemini-2.5-pro-exp-03-25 gemini standard 0.00 0.00
gemini/gemini-2.5-flash-preview-tts gemini/gemini-2.5-flash-preview-tts gemini standard 0.15 0.60
gemini/gemini-2.5-flash-preview-05-20 gemini/gemini-2.5-flash-preview-05-20 gemini standard 0.15 0.60
gemini/gemini-2.5-flash-preview-04-17 gemini/gemini-2.5-flash-preview-04-17 gemini standard 0.15 0.60
gemini-2.5-flash-preview-05-20 gemini-2.5-flash-preview-05-20 vertex_ai-language-models standard 0.15 0.60
gemini-2.5-flash-preview-04-17 gemini-2.5-flash-preview-04-17 vertex_ai-language-models standard 0.15 0.60
gemini-2.0-flash gemini-2.0-flash vertex_ai-language-models standard 0.10 0.40
gemini-2.0-flash-lite gemini-2.0-flash-lite vertex_ai-language-models standard 0.08 0.30
gemini-2.0-flash-lite-001 gemini-2.0-flash-lite-001 vertex_ai-language-models standard 0.08 0.30
gemini-2.5-pro-preview-06-05 gemini-2.5-pro-preview-06-05 vertex_ai-language-models standard 1.25 10.00
gemini-2.5-pro-preview-05-06 gemini-2.5-pro-preview-05-06 vertex_ai-language-models standard 1.25 10.00
gemini-2.5-pro-preview-03-25 gemini-2.5-pro-preview-03-25 vertex_ai-language-models standard 1.25 10.00
gemini-2.0-flash-preview-image-generation gemini-2.0-flash-preview-image-generation vertex_ai-language-models standard 0.10 0.40
gemini-2.5-pro-preview-tts gemini-2.5-pro-preview-tts vertex_ai-language-models standard 1.25 10.00
gemini/gemini-2.0-pro-exp-02-05 gemini/gemini-2.0-pro-exp-02-05 gemini standard 0.00 0.00
gemini/gemini-2.0-flash-preview-image-generation gemini/gemini-2.0-flash-preview-image-generation gemini standard 0.10 0.40
gemini/gemini-2.0-flash gemini/gemini-2.0-flash gemini standard 0.10 0.40
gemini/gemini-2.0-flash-lite gemini/gemini-2.0-flash-lite gemini standard 0.08 0.30
gemini/gemini-2.0-flash-001 gemini/gemini-2.0-flash-001 gemini standard 0.10 0.40
gemini/gemini-2.5-pro-preview-tts gemini/gemini-2.5-pro-preview-tts gemini standard 1.25 10.00
gemini/gemini-2.5-pro-preview-06-05 gemini/gemini-2.5-pro-preview-06-05 gemini standard 1.25 10.00
gemini/gemini-2.5-pro-preview-05-06 gemini/gemini-2.5-pro-preview-05-06 gemini standard 1.25 10.00
gemini/gemini-2.5-pro-preview-03-25 gemini/gemini-2.5-pro-preview-03-25 gemini standard 1.25 10.00
gemini/gemini-2.0-flash-exp gemini/gemini-2.0-flash-exp gemini standard 0.00 0.00
gemini/gemini-2.0-flash-lite-preview-02-05 gemini/gemini-2.0-flash-lite-preview-02-05 gemini standard 0.08 0.30
gemini/gemini-2.0-flash-thinking-exp gemini/gemini-2.0-flash-thinking-exp gemini standard 0.00 0.00
gemini/gemini-2.0-flash-thinking-exp-01-21 gemini/gemini-2.0-flash-thinking-exp-01-21 gemini standard 0.00 0.00
gemini/gemma-3-27b-it gemini/gemma-3-27b-it gemini standard 0.00 0.00
gemini/learnlm-1.5-pro-experimental gemini/learnlm-1.5-pro-experimental gemini standard 0.00 0.00
vertex_ai/claude-3-sonnet vertex_ai/claude-3-sonnet vertex_ai-anthropic_models standard 3.00 15.00
vertex_ai/claude-3-sonnet@20240229 vertex_ai/claude-3-sonnet@20240229 vertex_ai-anthropic_models standard 3.00 15.00
vertex_ai/claude-3-5-sonnet vertex_ai/claude-3-5-sonnet vertex_ai-anthropic_models standard 3.00 15.00
vertex_ai/claude-3-5-sonnet@20240620 vertex_ai/claude-3-5-sonnet@20240620 vertex_ai-anthropic_models standard 3.00 15.00
vertex_ai/claude-3-5-sonnet-v2 vertex_ai/claude-3-5-sonnet-v2 vertex_ai-anthropic_models standard 3.00 15.00
vertex_ai/claude-3-5-sonnet-v2@20241022 vertex_ai/claude-3-5-sonnet-v2@20241022 vertex_ai-anthropic_models standard 3.00 15.00
vertex_ai/claude-3-7-sonnet@20250219 vertex_ai/claude-3-7-sonnet@20250219 vertex_ai-anthropic_models standard 3.00 15.00
vertex_ai/claude-opus-4@20250514 vertex_ai/claude-opus-4@20250514 vertex_ai-anthropic_models standard 15.00 75.00
vertex_ai/claude-sonnet-4@20250514 vertex_ai/claude-sonnet-4@20250514 vertex_ai-anthropic_models standard 3.00 15.00
vertex_ai/claude-3-haiku vertex_ai/claude-3-haiku vertex_ai-anthropic_models standard 0.25 1.25
vertex_ai/claude-3-haiku@20240307 vertex_ai/claude-3-haiku@20240307 vertex_ai-anthropic_models standard 0.25 1.25
vertex_ai/claude-3-5-haiku vertex_ai/claude-3-5-haiku vertex_ai-anthropic_models standard 1.00 5.00
vertex_ai/claude-3-5-haiku@20241022 vertex_ai/claude-3-5-haiku@20241022 vertex_ai-anthropic_models standard 1.00 5.00
vertex_ai/claude-3-opus vertex_ai/claude-3-opus vertex_ai-anthropic_models standard 15.00 75.00
vertex_ai/claude-3-opus@20240229 vertex_ai/claude-3-opus@20240229 vertex_ai-anthropic_models standard 15.00 75.00
vertex_ai/meta/llama3-405b-instruct-maas vertex_ai/meta/llama3-405b-instruct-maas vertex_ai-llama_models standard 0.00 0.00
vertex_ai/meta/llama-4-scout-17b-16e-instruct-maas vertex_ai/meta/llama-4-scout-17b-16e-instruct-maas vertex_ai-llama_models standard 0.25 0.70
vertex_ai/meta/llama-4-scout-17b-128e-instruct-maas vertex_ai/meta/llama-4-scout-17b-128e-instruct-maas vertex_ai-llama_models standard 0.25 0.70
vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas vertex_ai-llama_models standard 0.35 1.15
vertex_ai/meta/llama-4-maverick-17b-16e-instruct-maas vertex_ai/meta/llama-4-maverick-17b-16e-instruct-maas vertex_ai-llama_models standard 0.35 1.15
vertex_ai/meta/llama3-70b-instruct-maas vertex_ai/meta/llama3-70b-instruct-maas vertex_ai-llama_models standard 0.00 0.00
vertex_ai/meta/llama3-8b-instruct-maas vertex_ai/meta/llama3-8b-instruct-maas vertex_ai-llama_models standard 0.00 0.00
vertex_ai/meta/llama-3.2-90b-vision-instruct-maas vertex_ai/meta/llama-3.2-90b-vision-instruct-maas vertex_ai-llama_models standard 0.00 0.00
vertex_ai/mistral-large@latest vertex_ai/mistral-large@latest vertex_ai-mistral_models standard 2.00 6.00
vertex_ai/mistral-large@2411-001 vertex_ai/mistral-large@2411-001 vertex_ai-mistral_models standard 2.00 6.00
vertex_ai/mistral-large-2411 vertex_ai/mistral-large-2411 vertex_ai-mistral_models standard 2.00 6.00
vertex_ai/mistral-large@2407 vertex_ai/mistral-large@2407 vertex_ai-mistral_models standard 2.00 6.00
vertex_ai/mistral-nemo@latest vertex_ai/mistral-nemo@latest vertex_ai-mistral_models standard 0.15 0.15
vertex_ai/mistral-small-2503@001 vertex_ai/mistral-small-2503@001 vertex_ai-mistral_models standard 1.00 3.00
vertex_ai/mistral-small-2503 vertex_ai/mistral-small-2503 vertex_ai-mistral_models standard 1.00 3.00
vertex_ai/jamba-1.5-mini@001 vertex_ai/jamba-1.5-mini@001 vertex_ai-ai21_models standard 0.20 0.40
vertex_ai/jamba-1.5-large@001 vertex_ai/jamba-1.5-large@001 vertex_ai-ai21_models standard 2.00 8.00
vertex_ai/jamba-1.5 vertex_ai/jamba-1.5 vertex_ai-ai21_models standard 0.20 0.40
vertex_ai/jamba-1.5-mini vertex_ai/jamba-1.5-mini vertex_ai-ai21_models standard 0.20 0.40
vertex_ai/jamba-1.5-large vertex_ai/jamba-1.5-large vertex_ai-ai21_models standard 2.00 8.00
vertex_ai/mistral-nemo@2407 vertex_ai/mistral-nemo@2407 vertex_ai-mistral_models standard 3.00 3.00
vertex_ai/codestral@latest vertex_ai/codestral@latest vertex_ai-mistral_models standard 0.20 0.60
vertex_ai/codestral@2405 vertex_ai/codestral@2405 vertex_ai-mistral_models standard 0.20 0.60
vertex_ai/codestral-2501 vertex_ai/codestral-2501 vertex_ai-mistral_models standard 0.20 0.60
text-embedding-004 text-embedding-004 vertex_ai-embedding-models standard 0.10 0.00
gemini-embedding-001 gemini-embedding-001 vertex_ai-embedding-models standard 0.15 0.00
text-embedding-005 text-embedding-005 vertex_ai-embedding-models standard 0.10 0.00
text-multilingual-embedding-002 text-multilingual-embedding-002 vertex_ai-embedding-models standard 0.10 0.00
multimodalembedding multimodalembedding vertex_ai-embedding-models standard 0.80 0.00
multimodalembedding@001 multimodalembedding@001 vertex_ai-embedding-models standard 0.80 0.00
text-embedding-large-exp-03-07 text-embedding-large-exp-03-07 vertex_ai-embedding-models standard 0.10 0.00
textembedding-gecko textembedding-gecko vertex_ai-embedding-models standard 0.10 0.00
textembedding-gecko-multilingual textembedding-gecko-multilingual vertex_ai-embedding-models standard 0.10 0.00
textembedding-gecko-multilingual@001 textembedding-gecko-multilingual@001 vertex_ai-embedding-models standard 0.10 0.00
textembedding-gecko@001 textembedding-gecko@001 vertex_ai-embedding-models standard 0.10 0.00
textembedding-gecko@003 textembedding-gecko@003 vertex_ai-embedding-models standard 0.10 0.00
text-embedding-preview-0409 text-embedding-preview-0409 vertex_ai-embedding-models standard 0.01 0.00
text-multilingual-embedding-preview-0409 text-multilingual-embedding-preview-0409 vertex_ai-embedding-models standard 0.01 0.00
palm/chat-bison palm/chat-bison palm standard 0.13 0.13
palm/chat-bison-001 palm/chat-bison-001 palm standard 0.13 0.13
palm/text-bison palm/text-bison palm standard 0.13 0.13
palm/text-bison-001 palm/text-bison-001 palm standard 0.13 0.13
palm/text-bison-safety-off palm/text-bison-safety-off palm standard 0.13 0.13
palm/text-bison-safety-recitation-off palm/text-bison-safety-recitation-off palm standard 0.13 0.13
gemini/gemini-1.5-flash-002 gemini/gemini-1.5-flash-002 gemini standard 0.08 0.30
gemini/gemini-1.5-flash-001 gemini/gemini-1.5-flash-001 gemini standard 0.08 0.30
gemini/gemini-1.5-flash gemini/gemini-1.5-flash gemini standard 0.08 0.30
gemini/gemini-1.5-flash-latest gemini/gemini-1.5-flash-latest gemini standard 0.08 0.30
gemini/gemini-1.5-flash-8b gemini/gemini-1.5-flash-8b gemini standard 0.00 0.00
gemini/gemini-1.5-flash-8b-exp-0924 gemini/gemini-1.5-flash-8b-exp-0924 gemini standard 0.00 0.00
gemini/gemini-exp-1114 gemini/gemini-exp-1114 gemini standard 0.00 0.00
gemini/gemini-exp-1206 gemini/gemini-exp-1206 gemini standard 0.00 0.00
gemini/gemini-1.5-flash-exp-0827 gemini/gemini-1.5-flash-exp-0827 gemini standard 0.00 0.00
gemini/gemini-1.5-flash-8b-exp-0827 gemini/gemini-1.5-flash-8b-exp-0827 gemini standard 0.00 0.00
gemini/gemini-pro gemini/gemini-pro gemini standard 0.35 1.05
gemini/gemini-1.5-pro gemini/gemini-1.5-pro gemini standard 3.50 10.50
gemini/gemini-1.5-pro-002 gemini/gemini-1.5-pro-002 gemini standard 3.50 10.50
gemini/gemini-1.5-pro-001 gemini/gemini-1.5-pro-001 gemini standard 3.50 10.50
gemini/gemini-1.5-pro-exp-0801 gemini/gemini-1.5-pro-exp-0801 gemini standard 3.50 10.50
gemini/gemini-1.5-pro-exp-0827 gemini/gemini-1.5-pro-exp-0827 gemini standard 0.00 0.00
gemini/gemini-1.5-pro-latest gemini/gemini-1.5-pro-latest gemini standard 3.50 1.05
gemini/gemini-pro-vision gemini/gemini-pro-vision gemini standard 0.35 1.05
gemini/gemini-gemma-2-27b-it gemini/gemini-gemma-2-27b-it gemini standard 0.35 1.05
gemini/gemini-gemma-2-9b-it gemini/gemini-gemma-2-9b-it gemini standard 0.35 1.05
command-a-03-2025 command-a-03-2025 cohere_chat standard 2.50 10.00
command-r command-r cohere_chat standard 0.15 0.60
command-r-08-2024 command-r-08-2024 cohere_chat standard 0.15 0.60
command-r7b-12-2024 command-r7b-12-2024 cohere_chat standard 0.15 0.04
command-light command-light cohere_chat standard 0.30 0.60
command-r-plus command-r-plus cohere_chat standard 2.50 10.00
command-r-plus-08-2024 command-r-plus-08-2024 cohere_chat standard 2.50 10.00
command-nightly command-nightly cohere standard 1.00 2.00
command command cohere standard 1.00 2.00
rerank-v3.5 rerank-v3.5 cohere standard 0.00 0.00
rerank-english-v3.0 rerank-english-v3.0 cohere standard 0.00 0.00
rerank-multilingual-v3.0 rerank-multilingual-v3.0 cohere standard 0.00 0.00
rerank-english-v2.0 rerank-english-v2.0 cohere standard 0.00 0.00
rerank-multilingual-v2.0 rerank-multilingual-v2.0 cohere standard 0.00 0.00
embed-english-light-v3.0 embed-english-light-v3.0 cohere standard 0.10 0.00
embed-multilingual-v3.0 embed-multilingual-v3.0 cohere standard 0.10 0.00
embed-english-v2.0 embed-english-v2.0 cohere standard 0.10 0.00
embed-english-light-v2.0 embed-english-light-v2.0 cohere standard 0.10 0.00
embed-multilingual-v2.0 embed-multilingual-v2.0 cohere standard 0.10 0.00
embed-english-v3.0 embed-english-v3.0 cohere standard 0.10 0.00
replicate/meta/llama-2-13b replicate/meta/llama-2-13b replicate standard 0.10 0.50
replicate/meta/llama-2-13b-chat replicate/meta/llama-2-13b-chat replicate standard 0.10 0.50
replicate/meta/llama-2-70b replicate/meta/llama-2-70b replicate standard 0.65 2.75
replicate/meta/llama-2-70b-chat replicate/meta/llama-2-70b-chat replicate standard 0.65 2.75
replicate/meta/llama-2-7b replicate/meta/llama-2-7b replicate standard 0.05 0.25
replicate/meta/llama-2-7b-chat replicate/meta/llama-2-7b-chat replicate standard 0.05 0.25
replicate/meta/llama-3-70b replicate/meta/llama-3-70b replicate standard 0.65 2.75
replicate/meta/llama-3-70b-instruct replicate/meta/llama-3-70b-instruct replicate standard 0.65 2.75
replicate/meta/llama-3-8b replicate/meta/llama-3-8b replicate standard 0.05 0.25
replicate/meta/llama-3-8b-instruct replicate/meta/llama-3-8b-instruct replicate standard 0.05 0.25
replicate/mistralai/mistral-7b-v0.1 replicate/mistralai/mistral-7b-v0.1 replicate standard 0.05 0.25
replicate/mistralai/mistral-7b-instruct-v0.2 replicate/mistralai/mistral-7b-instruct-v0.2 replicate standard 0.05 0.25
replicate/mistralai/mixtral-8x7b-instruct-v0.1 replicate/mistralai/mixtral-8x7b-instruct-v0.1 replicate standard 0.30 1.00
openrouter/deepseek/deepseek-r1 openrouter/deepseek/deepseek-r1 openrouter standard 0.55 2.19
openrouter/deepseek/deepseek-chat openrouter/deepseek/deepseek-chat openrouter standard 0.14 0.28
openrouter/deepseek/deepseek-coder openrouter/deepseek/deepseek-coder openrouter standard 0.14 0.28
openrouter/microsoft/wizardlm-2-8x22b:nitro openrouter/microsoft/wizardlm-2-8x22b:nitro openrouter standard 1.00 1.00
openrouter/google/gemini-pro-1.5 openrouter/google/gemini-pro-1.5 openrouter standard 2.50 7.50
openrouter/google/gemini-2.0-flash-001 openrouter/google/gemini-2.0-flash-001 openrouter standard 0.10 0.40
openrouter/mistralai/mixtral-8x22b-instruct openrouter/mistralai/mixtral-8x22b-instruct openrouter standard 0.65 0.65
openrouter/cohere/command-r-plus openrouter/cohere/command-r-plus openrouter standard 3.00 15.00
openrouter/databricks/dbrx-instruct openrouter/databricks/dbrx-instruct openrouter standard 0.60 0.60
openrouter/anthropic/claude-3-haiku openrouter/anthropic/claude-3-haiku openrouter standard 0.25 1.25
openrouter/anthropic/claude-3-5-haiku openrouter/anthropic/claude-3-5-haiku openrouter standard 1.00 5.00
openrouter/anthropic/claude-3-haiku-20240307 openrouter/anthropic/claude-3-haiku-20240307 openrouter standard 0.25 1.25
openrouter/anthropic/claude-3-5-haiku-20241022 openrouter/anthropic/claude-3-5-haiku-20241022 openrouter standard 1.00 5.00
openrouter/anthropic/claude-3.5-sonnet openrouter/anthropic/claude-3.5-sonnet openrouter standard 3.00 15.00
openrouter/anthropic/claude-3.5-sonnet:beta openrouter/anthropic/claude-3.5-sonnet:beta openrouter standard 3.00 15.00
openrouter/anthropic/claude-3.7-sonnet openrouter/anthropic/claude-3.7-sonnet openrouter standard 3.00 15.00
openrouter/anthropic/claude-3.7-sonnet:beta openrouter/anthropic/claude-3.7-sonnet:beta openrouter standard 3.00 15.00
openrouter/anthropic/claude-3-sonnet openrouter/anthropic/claude-3-sonnet openrouter standard 3.00 15.00
openrouter/mistralai/mistral-large openrouter/mistralai/mistral-large openrouter standard 8.00 24.00
mistralai/mistral-small-3.1-24b-instruct mistralai/mistral-small-3.1-24b-instruct openrouter standard 0.10 0.30
openrouter/cognitivecomputations/dolphin-mixtral-8x7b openrouter/cognitivecomputations/dolphin-mixtral-8x7b openrouter standard 0.50 0.50
openrouter/google/gemini-pro-vision openrouter/google/gemini-pro-vision openrouter standard 0.13 0.38
openrouter/fireworks/firellava-13b openrouter/fireworks/firellava-13b openrouter standard 0.20 0.20
openrouter/meta-llama/llama-3-8b-instruct:free openrouter/meta-llama/llama-3-8b-instruct:free openrouter standard 0.00 0.00
openrouter/meta-llama/llama-3-8b-instruct:extended openrouter/meta-llama/llama-3-8b-instruct:extended openrouter standard 0.23 2.25
openrouter/meta-llama/llama-3-70b-instruct:nitro openrouter/meta-llama/llama-3-70b-instruct:nitro openrouter standard 0.90 0.90
openrouter/meta-llama/llama-3-70b-instruct openrouter/meta-llama/llama-3-70b-instruct openrouter standard 0.59 0.79
openrouter/openai/o1 openrouter/openai/o1 openrouter standard 15.00 60.00
openrouter/openai/o1-mini openrouter/openai/o1-mini openrouter standard 3.00 12.00
openrouter/openai/o1-mini-2024-09-12 openrouter/openai/o1-mini-2024-09-12 openrouter standard 3.00 12.00
openrouter/openai/o1-preview openrouter/openai/o1-preview openrouter standard 15.00 60.00
openrouter/openai/o1-preview-2024-09-12 openrouter/openai/o1-preview-2024-09-12 openrouter standard 15.00 60.00
openrouter/openai/o3-mini openrouter/openai/o3-mini openrouter standard 1.10 4.40
openrouter/openai/o3-mini-high openrouter/openai/o3-mini-high openrouter standard 1.10 4.40
openrouter/openai/gpt-4o openrouter/openai/gpt-4o openrouter standard 2.50 10.00
openrouter/openai/gpt-4o-2024-05-13 openrouter/openai/gpt-4o-2024-05-13 openrouter standard 5.00 15.00
openrouter/openai/gpt-4-vision-preview openrouter/openai/gpt-4-vision-preview openrouter standard 10.00 30.00
openrouter/openai/gpt-3.5-turbo openrouter/openai/gpt-3.5-turbo openrouter standard 1.50 2.00
openrouter/openai/gpt-3.5-turbo-16k openrouter/openai/gpt-3.5-turbo-16k openrouter standard 3.00 4.00
openrouter/openai/gpt-4 openrouter/openai/gpt-4 openrouter standard 30.00 60.00
openrouter/anthropic/claude-instant-v1 openrouter/anthropic/claude-instant-v1 openrouter standard 1.63 5.51
openrouter/anthropic/claude-2 openrouter/anthropic/claude-2 openrouter standard 11.02 32.68
openrouter/anthropic/claude-3-opus openrouter/anthropic/claude-3-opus openrouter standard 15.00 75.00
openrouter/google/palm-2-chat-bison openrouter/google/palm-2-chat-bison openrouter standard 0.50 0.50
openrouter/google/palm-2-codechat-bison openrouter/google/palm-2-codechat-bison openrouter standard 0.50 0.50
openrouter/meta-llama/llama-2-13b-chat openrouter/meta-llama/llama-2-13b-chat openrouter standard 0.20 0.20
openrouter/meta-llama/llama-2-70b-chat openrouter/meta-llama/llama-2-70b-chat openrouter standard 1.50 1.50
openrouter/meta-llama/codellama-34b-instruct openrouter/meta-llama/codellama-34b-instruct openrouter standard 0.50 0.50
openrouter/nousresearch/nous-hermes-llama2-13b openrouter/nousresearch/nous-hermes-llama2-13b openrouter standard 0.20 0.20
openrouter/mancer/weaver openrouter/mancer/weaver openrouter standard 5.63 5.63
openrouter/gryphe/mythomax-l2-13b openrouter/gryphe/mythomax-l2-13b openrouter standard 1.88 1.88
openrouter/jondurbin/airoboros-l2-70b-2.1 openrouter/jondurbin/airoboros-l2-70b-2.1 openrouter standard 13.88 13.88
openrouter/undi95/remm-slerp-l2-13b openrouter/undi95/remm-slerp-l2-13b openrouter standard 1.88 1.88
openrouter/pygmalionai/mythalion-13b openrouter/pygmalionai/mythalion-13b openrouter standard 1.88 1.88
openrouter/mistralai/mistral-7b-instruct openrouter/mistralai/mistral-7b-instruct openrouter standard 0.13 0.13
openrouter/mistralai/mistral-7b-instruct:free openrouter/mistralai/mistral-7b-instruct:free openrouter standard 0.00 0.00
openrouter/qwen/qwen-2.5-coder-32b-instruct openrouter/qwen/qwen-2.5-coder-32b-instruct openrouter standard 0.18 0.18
j2-ultra j2-ultra ai21 standard 15.00 15.00
jamba-1.5-mini@001 jamba-1.5-mini@001 ai21 standard 0.20 0.40
jamba-1.5-large@001 jamba-1.5-large@001 ai21 standard 2.00 8.00
jamba-1.5 jamba-1.5 ai21 standard 0.20 0.40
jamba-1.5-mini jamba-1.5-mini ai21 standard 0.20 0.40
jamba-1.5-large jamba-1.5-large ai21 standard 2.00 8.00
jamba-large-1.6 jamba-large-1.6 ai21 standard 2.00 8.00
jamba-mini-1.6 jamba-mini-1.6 ai21 standard 0.20 0.40
j2-mid j2-mid ai21 standard 10.00 10.00
j2-light j2-light ai21 standard 3.00 3.00
dolphin dolphin nlp_cloud standard 0.50 0.50
chatdolphin chatdolphin nlp_cloud standard 0.50 0.50
luminous-base luminous-base aleph_alpha standard 30.00 33.00
luminous-base-control luminous-base-control aleph_alpha standard 37.50 41.25
luminous-extended luminous-extended aleph_alpha standard 45.00 49.50
luminous-extended-control luminous-extended-control aleph_alpha standard 56.25 61.88
luminous-supreme luminous-supreme aleph_alpha standard 175.00 192.50
luminous-supreme-control luminous-supreme-control aleph_alpha standard 218.75 240.63
ai21.j2-mid-v1 ai21.j2-mid-v1 bedrock standard 12.50 12.50
ai21.j2-ultra-v1 ai21.j2-ultra-v1 bedrock standard 18.80 18.80
ai21.jamba-instruct-v1:0 ai21.jamba-instruct-v1:0 bedrock standard 0.50 0.70
ai21.jamba-1-5-large-v1:0 ai21.jamba-1-5-large-v1:0 bedrock standard 2.00 8.00
ai21.jamba-1-5-mini-v1:0 ai21.jamba-1-5-mini-v1:0 bedrock standard 0.20 0.40
amazon.rerank-v1:0 amazon.rerank-v1:0 bedrock standard 0.00 0.00
amazon.titan-text-lite-v1 amazon.titan-text-lite-v1 bedrock standard 0.30 0.40
amazon.titan-text-express-v1 amazon.titan-text-express-v1 bedrock standard 1.30 1.70
amazon.titan-text-premier-v1:0 amazon.titan-text-premier-v1:0 bedrock standard 0.50 1.50
amazon.titan-embed-text-v1 amazon.titan-embed-text-v1 bedrock standard 0.10 0.00
amazon.titan-embed-text-v2:0 amazon.titan-embed-text-v2:0 bedrock standard 0.20 0.00
amazon.titan-embed-image-v1 amazon.titan-embed-image-v1 bedrock standard 0.80 0.00
mistral.mistral-7b-instruct-v0:2 mistral.mistral-7b-instruct-v0:2 bedrock standard 0.15 0.20
mistral.mixtral-8x7b-instruct-v0:1 mistral.mixtral-8x7b-instruct-v0:1 bedrock standard 0.45 0.70
mistral.mistral-large-2402-v1:0 mistral.mistral-large-2402-v1:0 bedrock standard 8.00 24.00
mistral.mistral-large-2407-v1:0 mistral.mistral-large-2407-v1:0 bedrock standard 3.00 9.00
mistral.mistral-small-2402-v1:0 mistral.mistral-small-2402-v1:0 bedrock standard 1.00 3.00
bedrock/us-west-2/mistral.mixtral-8x7b-instruct-v0:1 bedrock/us-west-2/mistral.mixtral-8x7b-instruct-v0:1 bedrock standard 0.45 0.70
bedrock/us-east-1/mistral.mixtral-8x7b-instruct-v0:1 bedrock/us-east-1/mistral.mixtral-8x7b-instruct-v0:1 bedrock standard 0.45 0.70
bedrock/eu-west-3/mistral.mixtral-8x7b-instruct-v0:1 bedrock/eu-west-3/mistral.mixtral-8x7b-instruct-v0:1 bedrock standard 0.59 0.91
bedrock/us-west-2/mistral.mistral-7b-instruct-v0:2 bedrock/us-west-2/mistral.mistral-7b-instruct-v0:2 bedrock standard 0.15 0.20
bedrock/us-east-1/mistral.mistral-7b-instruct-v0:2 bedrock/us-east-1/mistral.mistral-7b-instruct-v0:2 bedrock standard 0.15 0.20
bedrock/eu-west-3/mistral.mistral-7b-instruct-v0:2 bedrock/eu-west-3/mistral.mistral-7b-instruct-v0:2 bedrock standard 0.20 0.26
bedrock/us-east-1/mistral.mistral-large-2402-v1:0 bedrock/us-east-1/mistral.mistral-large-2402-v1:0 bedrock standard 8.00 24.00
bedrock/us-west-2/mistral.mistral-large-2402-v1:0 bedrock/us-west-2/mistral.mistral-large-2402-v1:0 bedrock standard 8.00 24.00
bedrock/eu-west-3/mistral.mistral-large-2402-v1:0 bedrock/eu-west-3/mistral.mistral-large-2402-v1:0 bedrock standard 10.40 31.20
amazon.nova-micro-v1:0 amazon.nova-micro-v1:0 bedrock_converse standard 0.04 0.14
us.amazon.nova-micro-v1:0 us.amazon.nova-micro-v1:0 bedrock_converse standard 0.04 0.14
eu.amazon.nova-micro-v1:0 eu.amazon.nova-micro-v1:0 bedrock_converse standard 0.05 0.18
amazon.nova-lite-v1:0 amazon.nova-lite-v1:0 bedrock_converse standard 0.06 0.24
us.amazon.nova-lite-v1:0 us.amazon.nova-lite-v1:0 bedrock_converse standard 0.06 0.24
eu.amazon.nova-lite-v1:0 eu.amazon.nova-lite-v1:0 bedrock_converse standard 0.08 0.31
amazon.nova-pro-v1:0 amazon.nova-pro-v1:0 bedrock_converse standard 0.80 3.20
us.amazon.nova-pro-v1:0 us.amazon.nova-pro-v1:0 bedrock_converse standard 0.80 3.20
eu.amazon.nova-pro-v1:0 eu.amazon.nova-pro-v1:0 bedrock_converse standard 1.05 4.20
us.amazon.nova-premier-v1:0 us.amazon.nova-premier-v1:0 bedrock_converse standard 2.50 12.50
anthropic.claude-3-sonnet-20240229-v1:0 anthropic.claude-3-sonnet-20240229-v1:0 bedrock standard 3.00 15.00
bedrock/invoke/anthropic.claude-3-5-sonnet-20240620-v1:0 bedrock/invoke/anthropic.claude-3-5-sonnet-20240620-v1:0 bedrock standard 3.00 15.00
anthropic.claude-3-5-sonnet-20240620-v1:0 anthropic.claude-3-5-sonnet-20240620-v1:0 bedrock standard 3.00 15.00
anthropic.claude-opus-4-20250514-v1:0 anthropic.claude-opus-4-20250514-v1:0 bedrock_converse standard 15.00 75.00
anthropic.claude-sonnet-4-20250514-v1:0 anthropic.claude-sonnet-4-20250514-v1:0 bedrock_converse standard 3.00 15.00
anthropic.claude-3-7-sonnet-20250219-v1:0 anthropic.claude-3-7-sonnet-20250219-v1:0 bedrock_converse standard 3.00 15.00
anthropic.claude-3-5-sonnet-20241022-v2:0 anthropic.claude-3-5-sonnet-20241022-v2:0 bedrock standard 3.00 15.00
anthropic.claude-3-haiku-20240307-v1:0 anthropic.claude-3-haiku-20240307-v1:0 bedrock standard 0.25 1.25
anthropic.claude-3-5-haiku-20241022-v1:0 anthropic.claude-3-5-haiku-20241022-v1:0 bedrock standard 0.80 4.00
anthropic.claude-3-opus-20240229-v1:0 anthropic.claude-3-opus-20240229-v1:0 bedrock standard 15.00 75.00
us.anthropic.claude-3-sonnet-20240229-v1:0 us.anthropic.claude-3-sonnet-20240229-v1:0 bedrock standard 3.00 15.00
us.anthropic.claude-3-5-sonnet-20240620-v1:0 us.anthropic.claude-3-5-sonnet-20240620-v1:0 bedrock standard 3.00 15.00
us.anthropic.claude-3-5-sonnet-20241022-v2:0 us.anthropic.claude-3-5-sonnet-20241022-v2:0 bedrock standard 3.00 15.00
us.anthropic.claude-3-7-sonnet-20250219-v1:0 us.anthropic.claude-3-7-sonnet-20250219-v1:0 bedrock_converse standard 3.00 15.00
us.anthropic.claude-opus-4-20250514-v1:0 us.anthropic.claude-opus-4-20250514-v1:0 bedrock_converse standard 15.00 75.00
us.anthropic.claude-sonnet-4-20250514-v1:0 us.anthropic.claude-sonnet-4-20250514-v1:0 bedrock_converse standard 3.00 15.00
us.anthropic.claude-3-haiku-20240307-v1:0 us.anthropic.claude-3-haiku-20240307-v1:0 bedrock standard 0.25 1.25
us.anthropic.claude-3-5-haiku-20241022-v1:0 us.anthropic.claude-3-5-haiku-20241022-v1:0 bedrock standard 0.80 4.00
us.anthropic.claude-3-opus-20240229-v1:0 us.anthropic.claude-3-opus-20240229-v1:0 bedrock standard 15.00 75.00
eu.anthropic.claude-3-sonnet-20240229-v1:0 eu.anthropic.claude-3-sonnet-20240229-v1:0 bedrock standard 3.00 15.00
eu.anthropic.claude-3-5-sonnet-20240620-v1:0 eu.anthropic.claude-3-5-sonnet-20240620-v1:0 bedrock standard 3.00 15.00
eu.anthropic.claude-3-5-sonnet-20241022-v2:0 eu.anthropic.claude-3-5-sonnet-20241022-v2:0 bedrock standard 3.00 15.00
eu.anthropic.claude-3-7-sonnet-20250219-v1:0 eu.anthropic.claude-3-7-sonnet-20250219-v1:0 bedrock standard 3.00 15.00
eu.anthropic.claude-3-haiku-20240307-v1:0 eu.anthropic.claude-3-haiku-20240307-v1:0 bedrock standard 0.25 1.25
eu.anthropic.claude-opus-4-20250514-v1:0 eu.anthropic.claude-opus-4-20250514-v1:0 bedrock_converse standard 15.00 75.00
eu.anthropic.claude-sonnet-4-20250514-v1:0 eu.anthropic.claude-sonnet-4-20250514-v1:0 bedrock_converse standard 3.00 15.00
eu.anthropic.claude-3-5-haiku-20241022-v1:0 eu.anthropic.claude-3-5-haiku-20241022-v1:0 bedrock standard 0.25 1.25
eu.anthropic.claude-3-opus-20240229-v1:0 eu.anthropic.claude-3-opus-20240229-v1:0 bedrock standard 15.00 75.00
anthropic.claude-v1 anthropic.claude-v1 bedrock standard 8.00 24.00
bedrock/us-east-1/anthropic.claude-v1 bedrock/us-east-1/anthropic.claude-v1 bedrock standard 8.00 24.00
bedrock/us-west-2/anthropic.claude-v1 bedrock/us-west-2/anthropic.claude-v1 bedrock standard 8.00 24.00
bedrock/ap-northeast-1/anthropic.claude-v1 bedrock/ap-northeast-1/anthropic.claude-v1 bedrock standard 8.00 24.00
bedrock/eu-central-1/anthropic.claude-v1 bedrock/eu-central-1/anthropic.claude-v1 bedrock standard 8.00 24.00
anthropic.claude-v2 anthropic.claude-v2 bedrock standard 8.00 24.00
bedrock/us-east-1/anthropic.claude-v2 bedrock/us-east-1/anthropic.claude-v2 bedrock standard 8.00 24.00
bedrock/us-west-2/anthropic.claude-v2 bedrock/us-west-2/anthropic.claude-v2 bedrock standard 8.00 24.00
bedrock/ap-northeast-1/anthropic.claude-v2 bedrock/ap-northeast-1/anthropic.claude-v2 bedrock standard 8.00 24.00
bedrock/eu-central-1/anthropic.claude-v2 bedrock/eu-central-1/anthropic.claude-v2 bedrock standard 8.00 24.00
anthropic.claude-v2:1 anthropic.claude-v2:1 bedrock standard 8.00 24.00
bedrock/us-east-1/anthropic.claude-v2:1 bedrock/us-east-1/anthropic.claude-v2:1 bedrock standard 8.00 24.00
bedrock/us-west-2/anthropic.claude-v2:1 bedrock/us-west-2/anthropic.claude-v2:1 bedrock standard 8.00 24.00
bedrock/ap-northeast-1/anthropic.claude-v2:1 bedrock/ap-northeast-1/anthropic.claude-v2:1 bedrock standard 8.00 24.00
bedrock/eu-central-1/anthropic.claude-v2:1 bedrock/eu-central-1/anthropic.claude-v2:1 bedrock standard 8.00 24.00
anthropic.claude-instant-v1 anthropic.claude-instant-v1 bedrock standard 0.80 2.40
bedrock/us-east-1/anthropic.claude-instant-v1 bedrock/us-east-1/anthropic.claude-instant-v1 bedrock standard 0.80 2.40
bedrock/us-west-2/anthropic.claude-instant-v1 bedrock/us-west-2/anthropic.claude-instant-v1 bedrock standard 0.80 2.40
bedrock/ap-northeast-1/anthropic.claude-instant-v1 bedrock/ap-northeast-1/anthropic.claude-instant-v1 bedrock standard 2.23 7.55
bedrock/eu-central-1/anthropic.claude-instant-v1 bedrock/eu-central-1/anthropic.claude-instant-v1 bedrock standard 2.48 8.38
cohere.rerank-v3-5:0 cohere.rerank-v3-5:0 bedrock standard 0.00 0.00
cohere.command-text-v14 cohere.command-text-v14 bedrock standard 1.50 2.00
cohere.command-light-text-v14 cohere.command-light-text-v14 bedrock standard 0.30 0.60
cohere.command-r-plus-v1:0 cohere.command-r-plus-v1:0 bedrock standard 3.00 15.00
cohere.command-r-v1:0 cohere.command-r-v1:0 bedrock standard 0.50 1.50
cohere.embed-english-v3 cohere.embed-english-v3 bedrock standard 0.10 0.00
cohere.embed-multilingual-v3 cohere.embed-multilingual-v3 bedrock standard 0.10 0.00
us.deepseek.r1-v1:0 us.deepseek.r1-v1:0 bedrock_converse standard 1.35 5.40
meta.llama3-3-70b-instruct-v1:0 meta.llama3-3-70b-instruct-v1:0 bedrock_converse standard 0.72 0.72
meta.llama2-13b-chat-v1 meta.llama2-13b-chat-v1 bedrock standard 0.75 1.00
meta.llama2-70b-chat-v1 meta.llama2-70b-chat-v1 bedrock standard 1.95 2.56
meta.llama3-8b-instruct-v1:0 meta.llama3-8b-instruct-v1:0 bedrock standard 0.30 0.60
bedrock/us-east-1/meta.llama3-8b-instruct-v1:0 bedrock/us-east-1/meta.llama3-8b-instruct-v1:0 bedrock standard 0.30 0.60
bedrock/us-west-1/meta.llama3-8b-instruct-v1:0 bedrock/us-west-1/meta.llama3-8b-instruct-v1:0 bedrock standard 0.30 0.60
bedrock/ap-south-1/meta.llama3-8b-instruct-v1:0 bedrock/ap-south-1/meta.llama3-8b-instruct-v1:0 bedrock standard 0.36 0.72
bedrock/ca-central-1/meta.llama3-8b-instruct-v1:0 bedrock/ca-central-1/meta.llama3-8b-instruct-v1:0 bedrock standard 0.35 0.69
bedrock/eu-west-1/meta.llama3-8b-instruct-v1:0 bedrock/eu-west-1/meta.llama3-8b-instruct-v1:0 bedrock standard 0.32 0.65
bedrock/eu-west-2/meta.llama3-8b-instruct-v1:0 bedrock/eu-west-2/meta.llama3-8b-instruct-v1:0 bedrock standard 0.39 0.78
bedrock/sa-east-1/meta.llama3-8b-instruct-v1:0 bedrock/sa-east-1/meta.llama3-8b-instruct-v1:0 bedrock standard 0.50 1.01
meta.llama3-70b-instruct-v1:0 meta.llama3-70b-instruct-v1:0 bedrock standard 2.65 3.50
bedrock/us-east-1/meta.llama3-70b-instruct-v1:0 bedrock/us-east-1/meta.llama3-70b-instruct-v1:0 bedrock standard 2.65 3.50
bedrock/us-west-1/meta.llama3-70b-instruct-v1:0 bedrock/us-west-1/meta.llama3-70b-instruct-v1:0 bedrock standard 2.65 3.50
bedrock/ap-south-1/meta.llama3-70b-instruct-v1:0 bedrock/ap-south-1/meta.llama3-70b-instruct-v1:0 bedrock standard 3.18 4.20
bedrock/ca-central-1/meta.llama3-70b-instruct-v1:0 bedrock/ca-central-1/meta.llama3-70b-instruct-v1:0 bedrock standard 3.05 4.03
bedrock/eu-west-1/meta.llama3-70b-instruct-v1:0 bedrock/eu-west-1/meta.llama3-70b-instruct-v1:0 bedrock standard 2.86 3.78
bedrock/eu-west-2/meta.llama3-70b-instruct-v1:0 bedrock/eu-west-2/meta.llama3-70b-instruct-v1:0 bedrock standard 3.45 4.55
bedrock/sa-east-1/meta.llama3-70b-instruct-v1:0 bedrock/sa-east-1/meta.llama3-70b-instruct-v1:0 bedrock standard 4.45 5.88
meta.llama3-1-8b-instruct-v1:0 meta.llama3-1-8b-instruct-v1:0 bedrock standard 0.22 0.22
us.meta.llama3-1-8b-instruct-v1:0 us.meta.llama3-1-8b-instruct-v1:0 bedrock standard 0.22 0.22
meta.llama3-1-70b-instruct-v1:0 meta.llama3-1-70b-instruct-v1:0 bedrock standard 0.99 0.99
us.meta.llama3-1-70b-instruct-v1:0 us.meta.llama3-1-70b-instruct-v1:0 bedrock standard 0.99 0.99
meta.llama3-1-405b-instruct-v1:0 meta.llama3-1-405b-instruct-v1:0 bedrock standard 5.32 16.00
us.meta.llama3-1-405b-instruct-v1:0 us.meta.llama3-1-405b-instruct-v1:0 bedrock standard 5.32 16.00
meta.llama3-2-1b-instruct-v1:0 meta.llama3-2-1b-instruct-v1:0 bedrock standard 0.10 0.10
us.meta.llama3-2-1b-instruct-v1:0 us.meta.llama3-2-1b-instruct-v1:0 bedrock standard 0.10 0.10
eu.meta.llama3-2-1b-instruct-v1:0 eu.meta.llama3-2-1b-instruct-v1:0 bedrock standard 0.13 0.13
meta.llama3-2-3b-instruct-v1:0 meta.llama3-2-3b-instruct-v1:0 bedrock standard 0.15 0.15
us.meta.llama3-2-3b-instruct-v1:0 us.meta.llama3-2-3b-instruct-v1:0 bedrock standard 0.15 0.15
eu.meta.llama3-2-3b-instruct-v1:0 eu.meta.llama3-2-3b-instruct-v1:0 bedrock standard 0.19 0.19
meta.llama3-2-11b-instruct-v1:0 meta.llama3-2-11b-instruct-v1:0 bedrock standard 0.35 0.35
us.meta.llama3-2-11b-instruct-v1:0 us.meta.llama3-2-11b-instruct-v1:0 bedrock standard 0.35 0.35
meta.llama3-2-90b-instruct-v1:0 meta.llama3-2-90b-instruct-v1:0 bedrock standard 2.00 2.00
us.meta.llama3-2-90b-instruct-v1:0 us.meta.llama3-2-90b-instruct-v1:0 bedrock standard 2.00 2.00
us.meta.llama3-3-70b-instruct-v1:0 us.meta.llama3-3-70b-instruct-v1:0 bedrock_converse standard 0.72 0.72
meta.llama4-maverick-17b-instruct-v1:0 meta.llama4-maverick-17b-instruct-v1:0 bedrock_converse standard 0.24 0.97
meta.llama4-maverick-17b-instruct-v1:0 meta.llama4-maverick-17b-instruct-v1:0 bedrock_converse batch 0.12 0.49
us.meta.llama4-maverick-17b-instruct-v1:0 us.meta.llama4-maverick-17b-instruct-v1:0 bedrock_converse standard 0.24 0.97
us.meta.llama4-maverick-17b-instruct-v1:0 us.meta.llama4-maverick-17b-instruct-v1:0 bedrock_converse batch 0.12 0.49
meta.llama4-scout-17b-instruct-v1:0 meta.llama4-scout-17b-instruct-v1:0 bedrock_converse standard 0.17 0.66
meta.llama4-scout-17b-instruct-v1:0 meta.llama4-scout-17b-instruct-v1:0 bedrock_converse batch 0.09 0.33
us.meta.llama4-scout-17b-instruct-v1:0 us.meta.llama4-scout-17b-instruct-v1:0 bedrock_converse standard 0.17 0.66
us.meta.llama4-scout-17b-instruct-v1:0 us.meta.llama4-scout-17b-instruct-v1:0 bedrock_converse batch 0.09 0.33
sagemaker/meta-textgeneration-llama-2-7b sagemaker/meta-textgeneration-llama-2-7b sagemaker standard 0.00 0.00
sagemaker/meta-textgeneration-llama-2-7b-f sagemaker/meta-textgeneration-llama-2-7b-f sagemaker standard 0.00 0.00
sagemaker/meta-textgeneration-llama-2-13b sagemaker/meta-textgeneration-llama-2-13b sagemaker standard 0.00 0.00
sagemaker/meta-textgeneration-llama-2-13b-f sagemaker/meta-textgeneration-llama-2-13b-f sagemaker standard 0.00 0.00
sagemaker/meta-textgeneration-llama-2-70b sagemaker/meta-textgeneration-llama-2-70b sagemaker standard 0.00 0.00
sagemaker/meta-textgeneration-llama-2-70b-b-f sagemaker/meta-textgeneration-llama-2-70b-b-f sagemaker standard 0.00 0.00
together-ai-up-to-4b together-ai-up-to-4b together_ai standard 0.10 0.10
together-ai-4.1b-8b together-ai-4.1b-8b together_ai standard 0.20 0.20
together-ai-8.1b-21b together-ai-8.1b-21b together_ai standard 0.30 0.30
together-ai-21.1b-41b together-ai-21.1b-41b together_ai standard 0.80 0.80
together-ai-41.1b-80b together-ai-41.1b-80b together_ai standard 0.90 0.90
together-ai-81.1b-110b together-ai-81.1b-110b together_ai standard 1.80 1.80
together-ai-embedding-up-to-150m together-ai-embedding-up-to-150m together_ai standard 0.01 0.00
together-ai-embedding-151m-to-350m together-ai-embedding-151m-to-350m together_ai standard 0.02 0.00
together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo together_ai standard 0.18 0.18
together_ai/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo together_ai/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo together_ai standard 0.88 0.88
together_ai/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo together_ai/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo together_ai standard 3.50 3.50
together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo together_ai standard 0.88 0.88
together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo-Free together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo-Free together_ai standard 0.00 0.00
together_ai/mistralai/Mixtral-8x7B-Instruct-v0.1 together_ai/mistralai/Mixtral-8x7B-Instruct-v0.1 together_ai standard 0.60 0.60
ollama/codegemma ollama/codegemma ollama standard 0.00 0.00
ollama/codegeex4 ollama/codegeex4 ollama standard 0.00 0.00
ollama/deepseek-coder-v2-instruct ollama/deepseek-coder-v2-instruct ollama standard 0.00 0.00
ollama/deepseek-coder-v2-base ollama/deepseek-coder-v2-base ollama standard 0.00 0.00
ollama/deepseek-coder-v2-lite-instruct ollama/deepseek-coder-v2-lite-instruct ollama standard 0.00 0.00
ollama/deepseek-coder-v2-lite-base ollama/deepseek-coder-v2-lite-base ollama standard 0.00 0.00
ollama/internlm2_5-20b-chat ollama/internlm2_5-20b-chat ollama standard 0.00 0.00
ollama/llama2 ollama/llama2 ollama standard 0.00 0.00
ollama/llama2:7b ollama/llama2:7b ollama standard 0.00 0.00
ollama/llama2:13b ollama/llama2:13b ollama standard 0.00 0.00
ollama/llama2:70b ollama/llama2:70b ollama standard 0.00 0.00
ollama/llama2-uncensored ollama/llama2-uncensored ollama standard 0.00 0.00
ollama/llama3 ollama/llama3 ollama standard 0.00 0.00
ollama/llama3:8b ollama/llama3:8b ollama standard 0.00 0.00
ollama/llama3:70b ollama/llama3:70b ollama standard 0.00 0.00
ollama/llama3.1 ollama/llama3.1 ollama standard 0.00 0.00
ollama/mistral-large-instruct-2407 ollama/mistral-large-instruct-2407 ollama standard 0.00 0.00
ollama/mistral ollama/mistral ollama standard 0.00 0.00
ollama/mistral-7B-Instruct-v0.1 ollama/mistral-7B-Instruct-v0.1 ollama standard 0.00 0.00
ollama/mistral-7B-Instruct-v0.2 ollama/mistral-7B-Instruct-v0.2 ollama standard 0.00 0.00
ollama/mixtral-8x7B-Instruct-v0.1 ollama/mixtral-8x7B-Instruct-v0.1 ollama standard 0.00 0.00
ollama/mixtral-8x22B-Instruct-v0.1 ollama/mixtral-8x22B-Instruct-v0.1 ollama standard 0.00 0.00
ollama/codellama ollama/codellama ollama standard 0.00 0.00
ollama/orca-mini ollama/orca-mini ollama standard 0.00 0.00
ollama/vicuna ollama/vicuna ollama standard 0.00 0.00
deepinfra/lizpreciatior/lzlv_70b_fp16_hf deepinfra/lizpreciatior/lzlv_70b_fp16_hf deepinfra standard 0.70 0.90
deepinfra/Gryphe/MythoMax-L2-13b deepinfra/Gryphe/MythoMax-L2-13b deepinfra standard 0.22 0.22
deepinfra/mistralai/Mistral-7B-Instruct-v0.1 deepinfra/mistralai/Mistral-7B-Instruct-v0.1 deepinfra standard 0.13 0.13
deepinfra/meta-llama/Llama-2-70b-chat-hf deepinfra/meta-llama/Llama-2-70b-chat-hf deepinfra standard 0.70 0.90
deepinfra/cognitivecomputations/dolphin-2.6-mixtral-8x7b deepinfra/cognitivecomputations/dolphin-2.6-mixtral-8x7b deepinfra standard 0.27 0.27
deepinfra/codellama/CodeLlama-34b-Instruct-hf deepinfra/codellama/CodeLlama-34b-Instruct-hf deepinfra standard 0.60 0.60
deepinfra/deepinfra/mixtral deepinfra/deepinfra/mixtral deepinfra standard 0.27 0.27
deepinfra/Phind/Phind-CodeLlama-34B-v2 deepinfra/Phind/Phind-CodeLlama-34B-v2 deepinfra standard 0.60 0.60
deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1 deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1 deepinfra standard 0.27 0.27
deepinfra/deepinfra/airoboros-70b deepinfra/deepinfra/airoboros-70b deepinfra standard 0.70 0.90
deepinfra/01-ai/Yi-34B-Chat deepinfra/01-ai/Yi-34B-Chat deepinfra standard 0.60 0.60
deepinfra/01-ai/Yi-6B-200K deepinfra/01-ai/Yi-6B-200K deepinfra standard 0.13 0.13
deepinfra/jondurbin/airoboros-l2-70b-gpt4-1.4.1 deepinfra/jondurbin/airoboros-l2-70b-gpt4-1.4.1 deepinfra standard 0.70 0.90
deepinfra/meta-llama/Llama-2-13b-chat-hf deepinfra/meta-llama/Llama-2-13b-chat-hf deepinfra standard 0.22 0.22
deepinfra/amazon/MistralLite deepinfra/amazon/MistralLite deepinfra standard 0.20 0.20
deepinfra/meta-llama/Llama-2-7b-chat-hf deepinfra/meta-llama/Llama-2-7b-chat-hf deepinfra standard 0.13 0.13
deepinfra/meta-llama/Meta-Llama-3-8B-Instruct deepinfra/meta-llama/Meta-Llama-3-8B-Instruct deepinfra standard 0.08 0.08
deepinfra/meta-llama/Meta-Llama-3-70B-Instruct deepinfra/meta-llama/Meta-Llama-3-70B-Instruct deepinfra standard 0.59 0.79
deepinfra/meta-llama/Meta-Llama-3.1-405B-Instruct deepinfra/meta-llama/Meta-Llama-3.1-405B-Instruct deepinfra standard 0.90 0.90
deepinfra/01-ai/Yi-34B-200K deepinfra/01-ai/Yi-34B-200K deepinfra standard 0.60 0.60
deepinfra/openchat/openchat_3.5 deepinfra/openchat/openchat_3.5 deepinfra standard 0.13 0.13
perplexity/codellama-34b-instruct perplexity/codellama-34b-instruct perplexity standard 0.35 1.40
perplexity/codellama-70b-instruct perplexity/codellama-70b-instruct perplexity standard 0.70 2.80
perplexity/llama-3.1-70b-instruct perplexity/llama-3.1-70b-instruct perplexity standard 1.00 1.00
perplexity/llama-3.1-8b-instruct perplexity/llama-3.1-8b-instruct perplexity standard 0.20 0.20
perplexity/llama-3.1-sonar-huge-128k-online perplexity/llama-3.1-sonar-huge-128k-online perplexity standard 5.00 5.00
perplexity/llama-3.1-sonar-large-128k-online perplexity/llama-3.1-sonar-large-128k-online perplexity standard 1.00 1.00
perplexity/llama-3.1-sonar-large-128k-chat perplexity/llama-3.1-sonar-large-128k-chat perplexity standard 1.00 1.00
perplexity/llama-3.1-sonar-small-128k-chat perplexity/llama-3.1-sonar-small-128k-chat perplexity standard 0.20 0.20
perplexity/llama-3.1-sonar-small-128k-online perplexity/llama-3.1-sonar-small-128k-online perplexity standard 0.20 0.20
perplexity/pplx-7b-chat perplexity/pplx-7b-chat perplexity standard 0.07 0.28
perplexity/pplx-70b-chat perplexity/pplx-70b-chat perplexity standard 0.70 2.80
perplexity/pplx-7b-online perplexity/pplx-7b-online perplexity standard 0.00 0.28
perplexity/pplx-70b-online perplexity/pplx-70b-online perplexity standard 0.00 2.80
perplexity/llama-2-70b-chat perplexity/llama-2-70b-chat perplexity standard 0.70 2.80
perplexity/mistral-7b-instruct perplexity/mistral-7b-instruct perplexity standard 0.07 0.28
perplexity/mixtral-8x7b-instruct perplexity/mixtral-8x7b-instruct perplexity standard 0.07 0.28
perplexity/sonar-small-chat perplexity/sonar-small-chat perplexity standard 0.07 0.28
perplexity/sonar-small-online perplexity/sonar-small-online perplexity standard 0.00 0.28
perplexity/sonar-medium-chat perplexity/sonar-medium-chat perplexity standard 0.60 1.80
perplexity/sonar-medium-online perplexity/sonar-medium-online perplexity standard 0.00 1.80
perplexity/sonar perplexity/sonar perplexity standard 1.00 1.00
perplexity/sonar-pro perplexity/sonar-pro perplexity standard 3.00 15.00
perplexity/sonar-reasoning perplexity/sonar-reasoning perplexity standard 1.00 5.00
perplexity/sonar-reasoning-pro perplexity/sonar-reasoning-pro perplexity standard 2.00 8.00
perplexity/sonar-deep-research perplexity/sonar-deep-research perplexity standard 2.00 8.00
fireworks_ai/accounts/fireworks/models/llama-v3p2-1b-instruct fireworks_ai/accounts/fireworks/models/llama-v3p2-1b-instruct fireworks_ai standard 0.10 0.10
fireworks_ai/accounts/fireworks/models/llama-v3p2-3b-instruct fireworks_ai/accounts/fireworks/models/llama-v3p2-3b-instruct fireworks_ai standard 0.10 0.10
fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct fireworks_ai standard 0.10 0.10
fireworks_ai/accounts/fireworks/models/llama-v3p2-11b-vision-instruct fireworks_ai/accounts/fireworks/models/llama-v3p2-11b-vision-instruct fireworks_ai standard 0.20 0.20
fireworks_ai/accounts/fireworks/models/llama-v3p2-90b-vision-instruct fireworks_ai/accounts/fireworks/models/llama-v3p2-90b-vision-instruct fireworks_ai standard 0.90 0.90
fireworks_ai/accounts/fireworks/models/firefunction-v2 fireworks_ai/accounts/fireworks/models/firefunction-v2 fireworks_ai standard 0.90 0.90
fireworks_ai/accounts/fireworks/models/mixtral-8x22b-instruct-hf fireworks_ai/accounts/fireworks/models/mixtral-8x22b-instruct-hf fireworks_ai standard 1.20 1.20
fireworks_ai/accounts/fireworks/models/qwen2-72b-instruct fireworks_ai/accounts/fireworks/models/qwen2-72b-instruct fireworks_ai standard 0.90 0.90
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct fireworks_ai standard 0.90 0.90
fireworks_ai/accounts/fireworks/models/yi-large fireworks_ai/accounts/fireworks/models/yi-large fireworks_ai standard 3.00 3.00
fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-instruct fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-instruct fireworks_ai standard 1.20 1.20
fireworks_ai/accounts/fireworks/models/deepseek-v3 fireworks_ai/accounts/fireworks/models/deepseek-v3 fireworks_ai standard 0.90 0.90
fireworks_ai/accounts/fireworks/models/deepseek-r1 fireworks_ai/accounts/fireworks/models/deepseek-r1 fireworks_ai standard 3.00 8.00
fireworks_ai/accounts/fireworks/models/deepseek-r1-basic fireworks_ai/accounts/fireworks/models/deepseek-r1-basic fireworks_ai standard 0.55 2.19
fireworks_ai/accounts/fireworks/models/deepseek-r1-0528 fireworks_ai/accounts/fireworks/models/deepseek-r1-0528 fireworks_ai standard 3.00 8.00
fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct fireworks_ai standard 3.00 3.00
fireworks_ai/accounts/fireworks/models/llama4-maverick-instruct-basic fireworks_ai/accounts/fireworks/models/llama4-maverick-instruct-basic fireworks_ai standard 0.22 0.88
fireworks_ai/accounts/fireworks/models/llama4-scout-instruct-basic fireworks_ai/accounts/fireworks/models/llama4-scout-instruct-basic fireworks_ai standard 0.15 0.60
fireworks_ai/nomic-ai/nomic-embed-text-v1.5 fireworks_ai/nomic-ai/nomic-embed-text-v1.5 fireworks_ai-embedding-models standard 0.01 0.00
fireworks_ai/nomic-ai/nomic-embed-text-v1 fireworks_ai/nomic-ai/nomic-embed-text-v1 fireworks_ai-embedding-models standard 0.01 0.00
fireworks_ai/WhereIsAI/UAE-Large-V1 fireworks_ai/WhereIsAI/UAE-Large-V1 fireworks_ai-embedding-models standard 0.02 0.00
fireworks_ai/thenlper/gte-large fireworks_ai/thenlper/gte-large fireworks_ai-embedding-models standard 0.02 0.00
fireworks_ai/thenlper/gte-base fireworks_ai/thenlper/gte-base fireworks_ai-embedding-models standard 0.01 0.00
fireworks-ai-up-to-4b fireworks-ai-up-to-4b fireworks_ai standard 0.20 0.20
fireworks-ai-4.1b-to-16b fireworks-ai-4.1b-to-16b fireworks_ai standard 0.20 0.20
fireworks-ai-above-16b fireworks-ai-above-16b fireworks_ai standard 0.90 0.90
fireworks-ai-moe-up-to-56b fireworks-ai-moe-up-to-56b fireworks_ai standard 0.50 0.50
fireworks-ai-56b-to-176b fireworks-ai-56b-to-176b fireworks_ai standard 1.20 1.20
fireworks-ai-default fireworks-ai-default fireworks_ai standard 0.00 0.00
fireworks-ai-embedding-up-to-150m fireworks-ai-embedding-up-to-150m fireworks_ai-embedding-models standard 0.01 0.00
fireworks-ai-embedding-150m-to-350m fireworks-ai-embedding-150m-to-350m fireworks_ai-embedding-models standard 0.02 0.00
anyscale/mistralai/Mistral-7B-Instruct-v0.1 anyscale/mistralai/Mistral-7B-Instruct-v0.1 anyscale standard 0.15 0.15
anyscale/mistralai/Mixtral-8x7B-Instruct-v0.1 anyscale/mistralai/Mixtral-8x7B-Instruct-v0.1 anyscale standard 0.15 0.15
anyscale/mistralai/Mixtral-8x22B-Instruct-v0.1 anyscale/mistralai/Mixtral-8x22B-Instruct-v0.1 anyscale standard 0.90 0.90
anyscale/HuggingFaceH4/zephyr-7b-beta anyscale/HuggingFaceH4/zephyr-7b-beta anyscale standard 0.15 0.15
anyscale/google/gemma-7b-it anyscale/google/gemma-7b-it anyscale standard 0.15 0.15
anyscale/meta-llama/Llama-2-7b-chat-hf anyscale/meta-llama/Llama-2-7b-chat-hf anyscale standard 0.15 0.15
anyscale/meta-llama/Llama-2-13b-chat-hf anyscale/meta-llama/Llama-2-13b-chat-hf anyscale standard 0.25 0.25
anyscale/meta-llama/Llama-2-70b-chat-hf anyscale/meta-llama/Llama-2-70b-chat-hf anyscale standard 1.00 1.00
anyscale/codellama/CodeLlama-34b-Instruct-hf anyscale/codellama/CodeLlama-34b-Instruct-hf anyscale standard 1.00 1.00
anyscale/codellama/CodeLlama-70b-Instruct-hf anyscale/codellama/CodeLlama-70b-Instruct-hf anyscale standard 1.00 1.00
anyscale/meta-llama/Meta-Llama-3-8B-Instruct anyscale/meta-llama/Meta-Llama-3-8B-Instruct anyscale standard 0.15 0.15
anyscale/meta-llama/Meta-Llama-3-70B-Instruct anyscale/meta-llama/Meta-Llama-3-70B-Instruct anyscale standard 1.00 1.00
cloudflare/@cf/meta/llama-2-7b-chat-fp16 cloudflare/@cf/meta/llama-2-7b-chat-fp16 cloudflare standard 1.92 1.92
cloudflare/@cf/meta/llama-2-7b-chat-int8 cloudflare/@cf/meta/llama-2-7b-chat-int8 cloudflare standard 1.92 1.92
cloudflare/@cf/mistral/mistral-7b-instruct-v0.1 cloudflare/@cf/mistral/mistral-7b-instruct-v0.1 cloudflare standard 1.92 1.92
cloudflare/@hf/thebloke/codellama-7b-instruct-awq cloudflare/@hf/thebloke/codellama-7b-instruct-awq cloudflare standard 1.92 1.92
voyage/voyage-01 voyage/voyage-01 voyage standard 0.10 0.00
voyage/voyage-lite-01 voyage/voyage-lite-01 voyage standard 0.10 0.00
voyage/voyage-large-2 voyage/voyage-large-2 voyage standard 0.12 0.00
voyage/voyage-finance-2 voyage/voyage-finance-2 voyage standard 0.12 0.00
voyage/voyage-lite-02-instruct voyage/voyage-lite-02-instruct voyage standard 0.10 0.00
voyage/voyage-law-2 voyage/voyage-law-2 voyage standard 0.12 0.00
voyage/voyage-code-2 voyage/voyage-code-2 voyage standard 0.12 0.00
voyage/voyage-2 voyage/voyage-2 voyage standard 0.10 0.00
voyage/voyage-3-large voyage/voyage-3-large voyage standard 0.18 0.00
voyage/voyage-3 voyage/voyage-3 voyage standard 0.06 0.00
voyage/voyage-3-lite voyage/voyage-3-lite voyage standard 0.02 0.00
voyage/voyage-code-3 voyage/voyage-code-3 voyage standard 0.18 0.00
voyage/voyage-multimodal-3 voyage/voyage-multimodal-3 voyage standard 0.12 0.00
voyage/rerank-2 voyage/rerank-2 voyage standard 0.05 0.00
voyage/rerank-2-lite voyage/rerank-2-lite voyage standard 0.02 0.00
databricks/databricks-claude-3-7-sonnet databricks/databricks-claude-3-7-sonnet databricks standard 2.50 17.86
databricks/databricks-meta-llama-3-1-405b-instruct databricks/databricks-meta-llama-3-1-405b-instruct databricks standard 5.00 15.00
databricks/databricks-meta-llama-3-1-70b-instruct databricks/databricks-meta-llama-3-1-70b-instruct databricks standard 1.00 3.00
databricks/databricks-meta-llama-3-3-70b-instruct databricks/databricks-meta-llama-3-3-70b-instruct databricks standard 1.00 3.00
databricks/databricks-llama-4-maverick databricks/databricks-llama-4-maverick databricks standard 5.00 15.00
databricks/databricks-dbrx-instruct databricks/databricks-dbrx-instruct databricks standard 0.75 2.25
databricks/databricks-meta-llama-3-70b-instruct databricks/databricks-meta-llama-3-70b-instruct databricks standard 1.00 3.00
databricks/databricks-llama-2-70b-chat databricks/databricks-llama-2-70b-chat databricks standard 0.50 1.50
databricks/databricks-mixtral-8x7b-instruct databricks/databricks-mixtral-8x7b-instruct databricks standard 0.50 1.00
databricks/databricks-mpt-30b-instruct databricks/databricks-mpt-30b-instruct databricks standard 1.00 1.00
databricks/databricks-mpt-7b-instruct databricks/databricks-mpt-7b-instruct databricks standard 0.50 0.00
databricks/databricks-bge-large-en databricks/databricks-bge-large-en databricks standard 0.10 0.00
databricks/databricks-gte-large-en databricks/databricks-gte-large-en databricks standard 0.13 0.00
sambanova/Meta-Llama-3.1-8B-Instruct sambanova/Meta-Llama-3.1-8B-Instruct sambanova standard 0.10 0.20
sambanova/Meta-Llama-3.1-405B-Instruct sambanova/Meta-Llama-3.1-405B-Instruct sambanova standard 5.00 10.00
sambanova/Meta-Llama-3.2-1B-Instruct sambanova/Meta-Llama-3.2-1B-Instruct sambanova standard 0.04 0.08
sambanova/Meta-Llama-3.2-3B-Instruct sambanova/Meta-Llama-3.2-3B-Instruct sambanova standard 0.08 0.16
sambanova/Llama-4-Maverick-17B-128E-Instruct sambanova/Llama-4-Maverick-17B-128E-Instruct sambanova standard 0.63 1.80
sambanova/Llama-4-Scout-17B-16E-Instruct sambanova/Llama-4-Scout-17B-16E-Instruct sambanova standard 0.40 0.70
sambanova/Meta-Llama-3.3-70B-Instruct sambanova/Meta-Llama-3.3-70B-Instruct sambanova standard 0.60 1.20
sambanova/Meta-Llama-Guard-3-8B sambanova/Meta-Llama-Guard-3-8B sambanova standard 0.30 0.30
sambanova/Qwen3-32B sambanova/Qwen3-32B sambanova standard 0.40 0.80
sambanova/QwQ-32B sambanova/QwQ-32B sambanova standard 0.50 1.00
sambanova/Qwen2-Audio-7B-Instruct sambanova/Qwen2-Audio-7B-Instruct sambanova standard 0.50 100.00
sambanova/DeepSeek-R1-Distill-Llama-70B sambanova/DeepSeek-R1-Distill-Llama-70B sambanova standard 0.70 1.40
sambanova/DeepSeek-R1 sambanova/DeepSeek-R1 sambanova standard 5.00 7.00
sambanova/DeepSeek-V3-0324 sambanova/DeepSeek-V3-0324 sambanova standard 3.00 4.50
jina-reranker-v2-base-multilingual jina-reranker-v2-base-multilingual jina_ai standard 0.02 0.02
nscale/meta-llama/Llama-4-Scout-17B-16E-Instruct nscale/meta-llama/Llama-4-Scout-17B-16E-Instruct nscale standard 0.09 0.29
nscale/Qwen/Qwen2.5-Coder-3B-Instruct nscale/Qwen/Qwen2.5-Coder-3B-Instruct nscale standard 0.01 0.03
nscale/Qwen/Qwen2.5-Coder-7B-Instruct nscale/Qwen/Qwen2.5-Coder-7B-Instruct nscale standard 0.01 0.03
nscale/Qwen/Qwen2.5-Coder-32B-Instruct nscale/Qwen/Qwen2.5-Coder-32B-Instruct nscale standard 0.06 0.20
nscale/Qwen/QwQ-32B nscale/Qwen/QwQ-32B nscale standard 0.18 0.20
nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-70B nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-70B nscale standard 0.38 0.38
nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-8B nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-8B nscale standard 0.03 0.03
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B nscale standard 0.09 0.09
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B nscale standard 0.20 0.20
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B nscale standard 0.07 0.07
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B nscale standard 0.15 0.15
nscale/mistralai/mixtral-8x22b-instruct-v0.1 nscale/mistralai/mixtral-8x22b-instruct-v0.1 nscale standard 0.60 0.60
nscale/meta-llama/Llama-3.1-8B-Instruct nscale/meta-llama/Llama-3.1-8B-Instruct nscale standard 0.03 0.03
nscale/meta-llama/Llama-3.3-70B-Instruct nscale/meta-llama/Llama-3.3-70B-Instruct nscale standard 0.20 0.20
gemini-2.5-pro gemini-2.5-pro gemini standard 1.25 10.00

960 rows