models.github.ai

Item: models.github.ai
Rating: 4.3
Author: FreeAIRouter

免费大模型额度 · 免费模型 · 社区口碑 · 风险评估

网站类型官方站(免费层)

状态掉线

平台openai-compatible

口碑分85 / 100

风险等级低

置信度90%

官网https://github.com/marketplace/models ↗

发现来源for-the-zero/Free-LLM-Collection

网站介绍

GitHub Models 是 GitHub（微软）官方推出的模型试验平台，底层由 Azure AI Inference 托管，提供一个限速的免费层用于原型与试验，可用 OpenAI/Llama/Phi/DeepSeek/Mistral/Cohere/Grok 等多家模型。

免费额度

完全免费但严格限速的「原型/试验层」，按 Copilot 账号分层与模型档位（low/high/embedding）限速。以高档位模型（如 GPT-4o/GPT-5/o1 等）为例：Copilot Free/Pro 约 10 次/分钟、50 次/天、单请求 8000 输入+4000 输出 tokens、2 并发；Copilot Business 约 100 次/天；Copilot Enterprise 约 15 次/分钟、150 次/天、16000 输入 tokens、4 并发。低档位模型上限更高（Free/Pro 150 次/天，Enterprise 450 次/天）。免费层仅供试验，生产需切换到按量付费（Azure 级生产限速）。

免费模型

openai/gpt-4oopenai/gpt-4o-miniopenai/gpt-4.1openai/gpt-5openai/gpt-5-miniopenai/o1openai/o3openai/o4-minideepseek/deepseek-r1deepseek/deepseek-v3-0324meta/llama-3.3-70b-instructmeta/meta-llama-3.1-405b-instructmeta/llama-4-maverick-17b-128e-instruct-fp8microsoft/phi-4microsoft/phi-4-reasoningmistral-ai/mistral-medium-2505cohere/cohere-command-r-plus-08-2024ai21-labs/ai21-jamba-1.5-largexai/grok-3

优点

官方大厂平台（GitHub/微软），底层 Azure AI 托管，跑路与盗用上游 key 风险极低
免费即可试用大量前沿模型：GPT-4o/GPT-5/o1/o3、Llama 3.x/4、DeepSeek R1/V3、Phi-4、Mistral、Cohere、Grok-3 等
OpenAI 兼容接口，仅需 GitHub 账号生成 PAT 即可调用，接入门槛低
官方声明数据不用于训练，合规性与隐私优于多数中转站
可平滑升级到按量付费生产层，或用 BYOK（自带 OpenAI/Azure key）扩容

缺点

免费层限速很严：高档位模型仅约 50 次/天（个人），并发仅 2，不适合真实业务流量
单请求 token 上限低（个人高档位约 8000 输入/4000 输出），不适合长上下文
明确定位为「原型/试验」，免费层不允许/不适合生产用途，生产需付费
限速与模型可用性随 Copilot 订阅层级绑定，免费个人账号最弱
GitHub Copilot 计费政策持续变动（2026 起转向按量计费 / AI Credits），免费层规则未来可能调整

风险点

免费层为「原型/试验」用途，明确不面向生产；超额或商用需切换按量付费
高档位模型免费额度极低（个人约 50 次/天、2 并发），高频调用会被 429 限流
需要 GitHub 账号并生成 PAT；token 泄露可能被滥用，应妥善保管
免费额度与限速绑定 Copilot 订阅层级，且 Copilot 计费政策（2026 起 AI Credits/按量）仍在变动，未来免费规则可能收紧
生产计费转入 Azure AI Foundry，需关注 Azure 侧的配额、地区与合规约束

社区口碑综述

作为 GitHub/微软官方平台，口碑可信、稳定性好，被社区普遍当作「免费试用前沿模型」的可靠入口。社区主要抱怨集中在限速过严（高档位模型每天约 50 次、易触发 429 Too Many Requests）以及仅适合原型而非生产；但没有盗用 key、跑路、数据违规等典型中转站风险。

使用建议

适合开发者用 GitHub 账号免费试验/原型 GPT-4o、GPT-5、Llama、DeepSeek、Phi 等前沿模型，接入简单、合规性好、跑路风险低。但免费层限速严格（高档位约 50 次/天、2 并发、短上下文），不适合作为生产或高频后端依赖；真要上量需开通按量付费或 BYOK。仅作信息整理与风险提示，不构成背书；请遵守 GitHub 服务条款与各模型许可，勿用于违规用途。

社区提及 (4)

GitHub Community Discussion #137298 — How does GitHub Models Rate-Limiting work · neutral

社区开发者讨论限速机制：每个高档位模型有独立配额，免费层每天约 50 次，高频调用会触发限流。 ↗

GitHub Community Discussion #189990 — What is the specific limit for "Too Many Requests" · negative

用户报告频繁遇到 429 Too Many Requests，确认免费试验层限速很严，仅够开发原型。 ↗

2coffee.dev — Using GPT-4 for Free Through GitHub Models (with Limitations) · positive

可以免费用 GitHub 账号调用 GPT-4 等模型，接入简单，但有明显的速率限制，适合学习与原型。 ↗

pecollective — 11 AI Free Tiers Compared (2026) · neutral

GitHub Models 免费可用 GPT-4o、Llama 等前沿模型，但限速比 Gemini Flash 等更严，定位是原型试验。 ↗

参考来源

In English

Summary: GitHub Models is a model experimentation platform launched officially by GitHub (Microsoft), hosted underneath on Azure AI Inference. It offers a rate-limited free tier for prototyping and experimentation, with access to models from multiple vendors including OpenAI/Llama/Phi/DeepSeek/Mistral/Cohere/Grok.

Free quota: A fully free but strictly rate-limited 'prototype/experimentation tier,' with limits tiered by Copilot account level and model class (low/high/embedding). Taking high-class models (e.g. GPT-4o/GPT-5/o1) as an example: Copilot Free/Pro is about 10 req/min, 50 req/day, 8,000 input + 4,000 output tokens per request, 2 concurrent; Copilot Business is about 100 req/day; Copilot Enterprise is about 15 req/min, 150 req/day, 16,000 input tokens, 4 concurrent. Low-class models have higher caps (Free/Pro 150 req/day, Enterprise 450 req/day). The free tier is for experimentation only; production requires switching to pay-as-you-go (Azure-grade production rate limits).