LLM Models on RunPod
Deploy and run popular LLM models with your own custom API endpoint. Choose a model below to get started.
openai-community/gpt2
meta-llama/llama-3.1-8b-instruct
mistralai/mistral-7b-instruct-v0.2
meta-llama/llama-3.2-1b-instruct
distilbert/distilgpt2
meta-llama/meta-llama-3-8b-instruct
qwen/qwen2.5-7b-instruct
deepseek-ai/deepseek-r1-distill-llama-8b
deepseek-ai/deepseek-r1-distill-qwen-1.5b
qwen/qwen2.5-0.5b-instruct
tinyllama/tinyllama-1.1b-chat-v1.0
qwen/qwen2.5-1.5b-instruct
deepseek-ai/deepseek-r1-distill-qwen-7b
meta-llama/llama-2-7b-hf
microsoft/phi-3-mini-4k-instruct
mistralai/mistral-small-24b-instruct-2501
mistralai/mistral-7b-instruct-v0.1
qwen/qwen2.5-3b-instruct
microsoft/phi-4
deepseek-ai/deepseek-r1-distill-qwen-14b
huggingfacetb/smollm2-360m-instruct
microsoft/phi-2
qwen/qwen2.5-14b-instruct
qwen/qwen2.5-0.5b
meta-llama/meta-llama-3-8b
huggingfacetb/smollm2-1.7b-instruct
microsoft/phi-3.5-mini-instruct
mistralai/mistral-7b-v0.1
qwen/qwen2.5-7b-instruct-1m
meta-llama/llama-3.2-3b
qwen/qwen2.5-7b
qwen/qwen2.5-math-7b
nousresearch/hermes-3-llama-3.2-3b
lgai-exaone/exaone-3.5-2.4b-instruct
huggingfacetb/smollm2-135m-instruct
m-a-p/yue-s1-7b-anneal-en-cot
qwen/qwen2.5-math-1.5b
mistralai/mistral-7b-v0.3
nousresearch/hermes-3-llama-3.1-8b
powerinfer/smallthinker-3b-preview
deepseek-ai/deepseek-llm-7b-chat
ibm-granite/granite-3.1-8b-instruct
deepseek-ai/deepseek-llm-7b-base
qwen/qwen2.5-14b
deepseek-ai/deepseek-coder-6.7b-instruct
bllossom/llama-3.2-korean-bllossom-3b
m-a-p/yue-s2-1b-general
agentica-org/deepscaler-1.5b-preview
mistralai/mistral-small-24b-base-2501
jinaai/readerlm-v2
valdemardi/deepseek-r1-distill-qwen-32b-awq
probemedicalyonseimailab/medllama3-v20
unsloth/deepseek-r1-distill-llama-8b
m-a-p/yue-s1-7b-anneal-en-icl
sakanaai/tinyswallow-1.5b-instruct
cyberagent/deepseek-r1-distill-qwen-14b-japanese
open-thoughts/openthinker-7b
latitudegames/wayfarer-12b
contactdoctor/bio-medical-llama-3-8b
orenguteng/llama-3.1-8b-lexi-uncensored-v2
allenai/llama-3.1-tulu-3-8b
mistralai/codestral-22b-v0.1
nousresearch/deephermes-3-llama-3-8b-preview
atlaai/selene-1-mini-llama-3.1-8b
ibm-granite/granite-3.2-8b-instruct-preview
kblueleaf/tipo-500m-ft
sometimesanotion/lamarck-14b-v0.7-rc4
aidc-ai/marco-o1
univa-bllossom/deepseek-llama3.1-bllossom-8b
huihui-ai/deepseek-r1-distill-qwen-14b-abliterated-v2
prithivmlmods/llama-8b-distill-cot
cognitivecomputations/dolphin3.0-r1-mistral-24b
allam-ai/allam-7b-instruct-preview
almawave/velvet-14b
huihui-ai/deepseek-r1-distill-llama-8b-abliterated
thefinai/fino1-8b
sentientagi/dobby-mini-unhinged-llama-3.1-8b
arcee-ai/arcee-maestro-7b-preview
cognitivecomputations/dolphin3.0-mistral-24b
lightblue/deepseek-r1-distill-qwen-7b-japanese
bsc-lt/salamandra-7b
kakaocorp/kanana-nano-2.1b-instruct
ibm-granite/granite-3.2-8b-instruct
vikhrmodels/qvikhr-2.5-1.5b-instruct-smpo
bespokelabs/bespoke-stratos-7b
sakanaai/tinyswallow-1.5b
arcee-ai/virtuoso-small-v2
axcxept/phi-4-deepseek-r1k-rl-ezo
ibm-granite/granite-3.2-2b-instruct
ilsp/llama-krikri-8b-instruct
m-a-p/yue-s1-7b-anneal-jp-kr-cot
huihui-ai/deepseek-r1-distill-qwen-7b-abliterated-v2
nvidia/acemath-7b-instruct
arcee-ai/arcee-blitz
knifeayumu/cydonia-v1.3-magnum-v4-22b
kakaocorp/kanana-nano-2.1b-base
voidful/llama-3.1-taide-r1-8b-chat
pocketdoc/dans-personalityengine-v1.2.0-24b
arcee-ai/virtuoso-lite
m-a-p/yue-s1-7b-anneal-zh-cot