RunPod

LLM Models on RunPod

Deploy and run popular LLMs behind your own custom API endpoint. Choose a model below to get started.

openai-community/gpt2

meta-llama/llama-3.1-8b-instruct

qwen/qwen2.5-7b-instruct

qwen/qwen2.5-7b-instruct-1m

distilbert/distilgpt2

meta-llama/llama-3.2-1b-instruct

qwen/qwen2.5-1.5b-instruct

qwen/qwen2.5-0.5b

deepseek-ai/deepseek-r1-distill-qwen-1.5b

mistralai/mistral-7b-instruct-v0.2

qwen/qwen2.5-3b-instruct

tinyllama/tinyllama-1.1b-chat-v1.0

deepseek-ai/deepseek-r1-distill-qwen-7b

meta-llama/meta-llama-3-8b-instruct

qwen/qwen2.5-0.5b-instruct

mistralai/mistral-small-24b-instruct-2501

deepseek-ai/deepseek-r1-distill-qwen-14b

microsoft/phi-2

meta-llama/llama-2-7b-hf

qwen/qwen2.5-14b-instruct

deepseek-ai/deepseek-r1-distill-llama-8b

mistralai/mistral-7b-v0.1

huggingfacetb/smollm2-360m-instruct

microsoft/phi-3-mini-4k-instruct

meta-llama/llama-3.2-3b

huggingfaceh4/zephyr-7b-beta

meta-llama/meta-llama-3-8b

qwen/qwen2.5-7b

microsoft/phi-4

mistralai/mistral-7b-instruct-v0.1

qwen/qwen2.5-3b

mistralai/mistral-7b-v0.3

meta-llama/llama-guard-3-8b

microsoft/phi-3.5-mini-instruct

qwen/qwen2.5-14b

tiiuae/falcon-7b-instruct

huggingfacetb/smollm2-135m-instruct

qwen/qwen2.5-math-1.5b

unsloth/meta-llama-3.1-8b-instruct

mixedbread-ai/mxbai-rerank-large-v2

deepseek-ai/deepseek-llm-7b-chat

qwen/qwen2.5-math-7b

qwen/qwq-32b-awq

huggingfacetb/smollm2-1.7b-instruct

lgai-exaone/exaone-deep-32b

jinaai/readerlm-v2

lgai-exaone/exaone-deep-2.4b

nousresearch/deephermes-3-llama-3-8b-preview

agentica-org/deepscaler-1.5b-preview

deepseek-ai/deepseek-llm-7b-base

defog/sqlcoder-7b-2

nousresearch/hermes-3-llama-3.1-8b

nvidia/llama-3.1-nemotron-nano-8b-v1

lgai-exaone/exaone-3.5-2.4b-instruct

ibm-granite/granite-3.1-8b-instruct

deepseek-ai/deepseek-coder-6.7b-instruct

ibm-granite/granite-3.2-8b-instruct

powerinfer/smallthinker-3b-preview

agentica-org/deepcoder-14b-preview

sakanaai/tinyswallow-1.5b

ibm-granite/granite-3.3-2b-instruct

probemedicalyonseimailab/medllama3-v20

ibm-granite/granite-3.2-2b-instruct

m-a-p/yue-s1-7b-anneal-en-cot

bllossom/llama-3.2-korean-bllossom-3b

m-a-p/yue-s2-1b-general

sbintuitions/sarashina2.2-3b-instruct-v0.1

latitudegames/wayfarer-12b

alamios/mistral-small-3.1-draft-0.5b

mistralai/codestral-22b-v0.1

mistralai/mistral-small-24b-base-2501

ai-mo/kimina-prover-preview-distill-1.5b

valdemardi/deepseek-r1-distill-qwen-32b-awq

kblueleaf/tipo-500m-ft

open-thoughts/openthinker-7b

lgai-exaone/exaone-deep-7.8b

unsloth/deepseek-r1-distill-llama-8b

orenguteng/llama-3.1-8b-lexi-uncensored-v2

nousresearch/hermes-3-llama-3.2-3b

mrfakename/mistral-small-3.1-24b-instruct-2503-hf

bigcode/starcoder

contactdoctor/bio-medical-llama-3-8b

numind/nuextract-1.5

allam-ai/allam-7b-instruct-preview

ibm-granite/granite-3.3-8b-instruct

mixedbread-ai/mxbai-rerank-base-v2

trillionlabs/trillion-7b-preview

pocketdoc/dans-personalityengine-v1.2.0-24b

deepcogito/cogito-v1-preview-llama-3b

kakaocorp/kanana-nano-2.1b-instruct

gryphe/mythomax-l2-13b

knoveleng/open-rs3

deepcogito/cogito-v1-preview-qwen-14b

aifeifei798/darkidol-llama-3.1-8b-instruct-1.2-uncensored

deepcogito/cogito-v1-preview-llama-8b

arcee-ai/arcee-blitz

meta-llama/codellama-7b-hf

cyberagent/deepseek-r1-distill-qwen-14b-japanese

aidc-ai/marco-o1

almawave/velvet-14b
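
Once one of the models above is deployed, a client talks to it through its custom API endpoint. The following is a minimal sketch of what such a call might look like, not a definitive client. The URL pattern, the `runsync` route, the bearer-token header, and the `{"input": {"prompt": ...}}` payload shape are all assumptions for illustration, as are the helper names `build_request` and `send`; check them against the RunPod documentation for your endpoint before relying on them.

```python
import json
import urllib.request

# Assumption: base URL and route are illustrative; confirm them in the RunPod docs.
BASE_URL = "https://api.runpod.ai/v2"


def build_request(endpoint_id: str, api_key: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for a deployed model endpoint."""
    # Assumption: the endpoint accepts a JSON body with an "input" object.
    payload = {"input": {"prompt": prompt}}
    return urllib.request.Request(
        url=f"{BASE_URL}/{endpoint_id}/runsync",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 60.0) -> dict:
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Building the request separately from sending it makes it possible to inspect the URL, headers, and body without a network call or a real API key. A call would then look like `send(build_request("your-endpoint-id", "your-api-key", "Hello"))`, where both identifiers are placeholders.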

Copyright © 2025. All rights reserved.