cheahjs/free-llm-api-resources

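Summary

This document lists services and models that offer free access or credits for API-based large language models (LLMs). Users can access a wide range of open and proprietary models such as Gemma, Llama, and Mistral, but most services enforce strict limits, including shared quotas, request rate limits, and token limits. It also notes that data sent through some free tiers may be used for training, and that the list explicitly excludes services that are not legitimate, such as those that reverse engineer an existing chatbot.

Key Points

A list of resources offering free or credit-based access to a variety of recent LLMs (Gemma, Llama, Mistral, and others).
Most services apply shared quotas and strict usage limits (requests per second or minute, tokens per minute, monthly caps), so take care not to abuse them.
Data sent through a free tier may be used to train or improve models, which is an important privacy consideration.
Only legitimate API services are listed; services that are not legitimate, such as those that reverse engineer existing chatbots, are explicitly excluded.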

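This lists various services that provide free access or credits towards API-based LLM usage.

Note

Please don't abuse these services, or we might lose them.

Warning

This list explicitly excludes any services that are not legitimate (e.g., services that reverse engineer an existing chatbot).

Limits:

Models share a common quota.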

Gemma 3 12B Instruct
Gemma 3 27B Instruct
Gemma 3 4B Instruct
Hermes 3 Llama 3.1 405B
Llama 3.2 3B Instruct
Llama 3.3 70B Instruct
baidu/qianfan-ocr-fast:free
cognitivecomputations/dolphin-mistral-24b-venice-edition:free
google/gemma-3n-e2b-it:free
google/gemma-3n-e4b-it:free
google/gemma-4-26b-a4b-it:free
google/gemma-4-31b-it:free
inclusionai/ling-2.6-1t:free
liquid/lfm-2.5-1.2b-instruct:free
liquid/lfm-2.5-1.2b-thinking:free
minimax/minimax-m2.5:free
nvidia/nemotron-3-nano-30b-a3b:free
nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free
nvidia/nemotron-3-super-120b-a12b:free
nvidia/nemotron-nano-12b-v2-vl:free
nvidia/nemotron-nano-9b-v2:free
openai/gpt-oss-120b:free
openai/gpt-oss-20b:free
poolside/laguna-m.1:free
poolside/laguna-xs.2:free
qwen/qwen3-coder:free
qwen/qwen3-next-80b-a3b-instruct:free
tencent/hy3-preview:free
z-ai/glm-4.5-air:free
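
Because the models listed above share a common quota, a client may want to count requests against one pool rather than per model. The following is a minimal sketch of that bookkeeping; the `SharedQuota` class and the limit of 3 are hypothetical and chosen only for illustration, since the actual quota size is not stated here.

```python
from dataclasses import dataclass, field


@dataclass
class SharedQuota:
    """Tracks one request budget shared by every model of a provider."""

    limit: int  # hypothetical pool size; the real quota is not given in the list
    used: int = 0
    per_model: dict = field(default_factory=dict)

    def try_consume(self, model: str) -> bool:
        """Record one request for `model`; return False if the pool is empty."""
        if self.used >= self.limit:
            return False
        self.used += 1
        self.per_model[model] = self.per_model.get(model, 0) + 1
        return True

    @property
    def remaining(self) -> int:
        return self.limit - self.used


quota = SharedQuota(limit=3)
results = []
for model in [
    "openai/gpt-oss-20b:free",
    "qwen/qwen3-coder:free",
    "openai/gpt-oss-20b:free",
    "z-ai/glm-4.5-air:free",  # the shared pool is already exhausted here
]:
    results.append(quota.try_consume(model))
```

The last request is refused even though that model was never called, which is the practical consequence of a shared quota.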

Data is used for training when used outside of the UK/CH/EEA/EU.

Model Name	Model Limits
Gemini 3 Flash	250,000 tokens/minute, 20 requests/day, 5 requests/minute
...
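Phone number verification required. Models tend to be context window limited.

Limits: 40 requests/minute

The free tier (Experiment plan) requires opting into data training.
Requires phone number verification.

Limits (per model): 1 request/second, 500,000 tokens/minute, 1,000,000,000 tokens/month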
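
A limit of 1 request/second can be respected on the client side by spacing calls at least one second apart. The sketch below assumes a hypothetical `MinIntervalThrottle` helper with an injectable clock and sleep function (so the demo runs instantly); it only covers the per-second request limit, not the token limits.

```python
import time


class MinIntervalThrottle:
    """Spaces calls at least `min_interval` seconds apart."""

    def __init__(self, min_interval=1.0, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock
        self._sleep = sleep
        self._next_allowed = None  # earliest time the next call may start

    def wait(self) -> float:
        """Block until a call is allowed; return the seconds slept."""
        now = self._clock()
        delay = 0.0
        if self._next_allowed is not None and now < self._next_allowed:
            delay = self._next_allowed - now
            self._sleep(delay)
            now = self._clock()  # re-read the clock after sleeping
        self._next_allowed = now + self.min_interval
        return delay


# Demo with a fake clock: sleeping simply advances the fake time.
fake_now = [0.0]
throttle = MinIntervalThrottle(
    min_interval=1.0,
    clock=lambda: fake_now[0],
    sleep=lambda seconds: fake_now.__setitem__(0, fake_now[0] + seconds),
)
delays = []
for _ in range(3):
    delays.append(throttle.wait())  # call the API right after each wait()
```

With the default arguments the same class uses `time.monotonic` and `time.sleep`, so `throttle.wait()` can be placed directly before each real request.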

Currently free to use
Monthly subscription based
Requires phone number verification.

Limits: 30 requests/minute, 2,000 requests/day

Codestral
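
When a service applies two request windows at once, such as the 30 requests/minute and 2,000 requests/day limits above, a request should only be sent if both windows have room. The following is a rough sketch using sliding windows; `SlidingWindowLimiter` and `try_acquire` are hypothetical names, and the provider may count its windows differently (for example, fixed calendar days).

```python
from collections import deque


class SlidingWindowLimiter:
    """Allows at most `limit` events within any `window` seconds."""

    def __init__(self, limit: int, window: float):
        self.limit = limit
        self.window = window
        self._events = deque()  # timestamps of admitted requests

    def has_room(self, now: float) -> bool:
        # Drop timestamps that have aged out of the window.
        while self._events and now - self._events[0] >= self.window:
            self._events.popleft()
        return len(self._events) < self.limit

    def record(self, now: float) -> None:
        self._events.append(now)


def try_acquire(limiters, now: float) -> bool:
    """Admit a request only if every window has room, then record it in all."""
    if all(limiter.has_room(now) for limiter in limiters):
        for limiter in limiters:
            limiter.record(now)
        return True
    return False


limiters = [SlidingWindowLimiter(30, 60.0), SlidingWindowLimiter(2_000, 86_400.0)]
admitted = []
for second in range(31):  # one request per second for 31 seconds
    admitted.append(try_acquire(limiters, float(second)))
after_a_minute = try_acquire(limiters, 60.0)  # the first request has aged out
```

In this run the first 30 requests are admitted, the 31st is refused by the per-minute window, and a request at the 60-second mark is admitted again.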

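HuggingFace Serverless Inference is limited to models smaller than 10GB. Some popular models are supported even if they exceed 10GB.

Limits: $0.10/month in credits

Various open models across supported providers

Routes to various supported providers.

Limits: $5/month

An AI gateway with a built-in selection of models.

Free models may use data for improvement.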

Big Pickle Stealth
MiniMax M2.5 Free
Arcee Large Preview Free

Model Name	Model Limits
gpt-oss-120b	30 requests/minute, 60,000 tokens/minute, 900 requests/hour, 1,000,000 tokens/hour, 14,400 requests/day, 1,000,000 tokens/day
Llama 3.1 8B	30 requests/minute, 60,000 tokens/minute, 900 requests/hour, 1,000,000 tokens/hour, 14,400 requests/day, 1,000,000 tokens/day
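
With limits stacked across minute, hour, and day windows, the per-minute figure is not necessarily the one that constrains sustained use. As a small worked example, the sketch below converts each limit in the table above to an average per-minute rate and picks the tightest one; the helper name `binding_window` is hypothetical.

```python
MINUTES_PER_HOUR = 60
MINUTES_PER_DAY = 24 * 60

# (limit, minutes in the window), copied from the table rows above.
request_limits = {
    "minute": (30, 1),
    "hour": (900, MINUTES_PER_HOUR),
    "day": (14_400, MINUTES_PER_DAY),
}
token_limits = {
    "minute": (60_000, 1),
    "hour": (1_000_000, MINUTES_PER_HOUR),
    "day": (1_000_000, MINUTES_PER_DAY),
}


def binding_window(limits):
    """Return (window name, per-minute rate) of the tightest sustained limit."""
    rates = {name: limit / minutes for name, (limit, minutes) in limits.items()}
    name = min(rates, key=rates.get)
    return name, rates[name]


req_window, req_rate = binding_window(request_limits)
tok_window, tok_rate = binding_window(token_limits)
print(f"requests: {req_window} cap binds at {req_rate:.1f}/minute")
print(f"tokens: {tok_window} cap binds at {tok_rate:.1f}/minute")
```

Over a full day, the daily caps dominate: about 10 requests and roughly 694 tokens per minute on average, far below the burst limits of 30 requests and 60,000 tokens per minute.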

Model Name	Model Limits
Allam 2 7B	7,000 requests/day, 6,000 tokens/minute
...
Limits:
All models share a common monthly quota.

c4ai-aya-expanse-32b
c4ai-aya-vision-32b
command-a-03-2025
command-a-reasoning-08-2025
command-a-translate-08-2025
command-a-vision-07-2025
command-r-08-2024
command-r-plus-08-2024
command-r7b-12-2024
command-r7b-arabic-02-2025

Extremely restrictive input/output token limits.

AI21 Jamba 1.5 Large
Codestral 25.01
Cohere Command A
Cohere Command R 08-2024
Cohere Command R+ 08-2024
DeepSeek-R1
DeepSeek-R1-0528
DeepSeek-V3-0324
Grok 3
Grok 3 Mini
Llama 4 Maverick 17B 128E Instruct FP8
Llama 4 Scout 17B 16E Instruct
Llama-3.2-11B-Vision-Instruct
Llama-3.2-90B-Vision-Instruct
Llama-3.3-70B-Instruct
MAI-DS-R1
Meta-Llama-3.1-405B-Instruct
Meta-Llama-3.1-8B-Instruct
Ministral 3B
Mistral Medium 3 (25.05)
Mistral Small 3.1
OpenAI GPT-4.1
OpenAI GPT-4.1-mini
OpenAI GPT-4.1-nano
OpenAI GPT-4o
OpenAI GPT-4o mini
OpenAI Text Embedding 3 (large)
OpenAI Text Embedding 3 (small)
OpenAI gpt-5
OpenAI gpt-5-chat (preview)
OpenAI gpt-5-mini
OpenAI gpt-5-nano
OpenAI o1
OpenAI o1-mini
OpenAI o1-preview
OpenAI o3
OpenAI o3-mini
OpenAI o4-mini
Phi-4
Phi-4-mini-instruct
Phi-4-mini-reasoning
Phi-4-multimodal-instruct
Phi-4-reasoning

Limits: 10,000 neurons/day

@cf/aisingapore/gemma-sea-lion-v4-27b-it
@cf/google/gemma-4-26b-a4b-it
@cf/ibm-granite/granite-4.0-h-micro
@cf/moonshotai/kimi-k2.5
@cf/moonshotai/kimi-k2.6
@cf/nvidia/nemotron-3-120b-a12b
@cf/openai/gpt-oss-120b
@cf/openai/gpt-oss-20b
@cf/qwen/qwen3-30b-a3b-fp8
@cf/zai-org/glm-4.7-flash
DeepSeek R1 Distill Qwen 32B
Deepseek Coder 6.7B Base (AWQ)
Deepseek Coder 6.7B Instruct (AWQ)
Deepseek Math 7B Instruct
Discolm German 7B v1 (AWQ)
Falcon 7B Instruct
Gemma 2B Instruct (LoRA)
Gemma 3 12B Instruct
Gemma 7B Instruct
Gemma 7B Instruct (LoRA)
Hermes 2 Pro Mistral 7B
Llama 2 13B Chat (AWQ)
Llama 2 7B Chat (FP16)
Llama 2 7B Chat (INT8)
Llama 2 7B Chat (LoRA)
Llama 3 8B Instruct
Llama 3 8B Instruct (AWQ)
Llama 3.1 8B Instruct (AWQ)
Llama 3.1 8B Instruct (FP8)
Llama 3.2 11B Vision Instruct
Llama 3.2 1B Instruct
Llama 3.2 3B Instruct
Llama 3.3 70B Instruct (FP8)
Llama 4 Scout Instruct
Llama Guard 3 8B
Mistral 7B Instruct v0.1
Mistral 7B Instruct v0.1 (AWQ)
Mistral 7B Instruct v0.2
Mistral 7B Instruct v0.2 (LoRA)
Mistral Small 3.1 24B Instruct
Neural Chat 7B v3.1 (AWQ)
OpenChat 3.5 0106
OpenHermes 2.5 Mistral 7B (AWQ)
Phi-2
Qwen 1.5 0.5B Chat
Qwen 1.5 1.8B Chat
Qwen 1.5 14B Chat (AWQ)
Qwen 1.5 7B Chat (AWQ)
Qwen 2.5 Coder 32B Instruct
Qwen QwQ 32B
SQLCoder 7B 2
Starling LM 7B Beta
TinyLlama 1.1B Chat v1.0
Una Cybertron 7B v2 (BF16)
Zephyr 7B Beta (AWQ)
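
A daily allowance such as the 10,000 neurons/day limit for the models listed above can be tracked with a budget that resets when the date changes. The sketch below is illustrative only: `DailyBudget` is a hypothetical helper, the reset at midnight UTC is an assumption, and the per-request cost of 6,000 neurons is made up, since the neuron cost of each model is not given here.

```python
from datetime import datetime, timezone


class DailyBudget:
    """Tracks spend against a budget that resets when the UTC date changes."""

    def __init__(self, limit: int = 10_000):
        self.limit = limit
        self._day = None
        self._spent = 0

    def try_spend(self, cost: int, now: datetime) -> bool:
        """Spend `cost` units if today's budget allows it; `now` must be timezone-aware."""
        today = now.astimezone(timezone.utc).date()
        if today != self._day:  # assumed reset point: a new UTC day
            self._day = today
            self._spent = 0
        if self._spent + cost > self.limit:
            return False
        self._spent += cost
        return True


budget = DailyBudget(limit=10_000)
late_evening = datetime(2025, 1, 1, 23, 0, tzinfo=timezone.utc)
next_morning = datetime(2025, 1, 2, 0, 5, tzinfo=timezone.utc)
outcomes = [
    budget.try_spend(6_000, late_evening),  # fits in the day's budget
    budget.try_spend(6_000, late_evening),  # would exceed 10,000
    budget.try_spend(6_000, next_morning),  # fresh budget on the next day
]
```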

Credits: $1

Models: Various open models

Credits: $30

Models: Any supported model - pay by compute time

Credits: $1

Models: Various open models

Credits: $0.5 for 1 year

Models: Various open models

Credits: $10 for 3 months

Models: Jamba family of models

Credits: $10 for 3 months

Models: Solar Pro/Mini

Credits: $15

Requirements: Phone number verification

Models: Various open models

Credits: 1 million tokens/model

Models: Various open and proprietary Qwen models

Credits: $5/month upon sign up, $30/month with payment method added

Models: Any supported model - pay by compute time

Credits: $1, $25 on responding to email survey

Models: Various open models

Credits: $1

Models:

DeepSeek V3 0324
Llama 3.3 70B Instruct
deepseek-ai/deepseek-r1-0528
qwen/qwen3-coder-480b-a35b-instruct

Credits: $5 for 3 months

Models:

Llama 3.3 70B
Llama-4-Maverick-17B-128E-Instruct
deepseek-ai/DeepSeek-V3.1
deepseek-ai/DeepSeek-V3.2
google/gemma-3-12b-it
minimaxai/minimax-m2.5
openai/gpt-oss-120b

Credits: 1,000,000 free tokens

Models:

BGE-Multilingual-Gemma2
Gemma 3 27B Instruct
Llama 3.3 70B Instruct
Pixtral 12B (2409)
Whisper Large v3
devstral-2-123b-instruct-2512
gpt-oss-120b
holo2-30b-a3b
mistral-small-3.2-24b-instruct-2506
qwen3-235b-a22b-instruct-2507
qwen3-coder-30b-a3b-instruct
qwen3-embedding-8b
qwen3.5-397b-a17b
voxtral-small-24b-2507
