里约热内卢的“本土”大语言模型似乎是由现有模型合并而成的。

里约热内卢的“本土”大语言模型似乎是由现有模型合并而成的。
Rio de Janeiro's "homegrown" LLM appears to be a merge of an existing model

原始链接: https://github.com/nex-agi/Nex-N2/issues/4

prefeitura-rio/Rio-3.5-Open-397B 声称是由 IplanRIO 训练的原创 397B 模型。事实并非如此。其权重是我们模型 Nex 与官方 Qwen3.5-397B-A17B 基座模型的直接逐元素合并（比例约为 0.6 Nex / 0.4 Qwen），我们没有发现任何他们自行训练的证据。我们可以通过两种完全独立的方式证明这一点：在移除 Rio 硬编码的“你是 Rio”系统提示词后，其部署的模型在 79% 的情况下会将自己标识为“来自 Nex-AGI 的 Nex”，而标识为“Rio”的比例为 0%。它甚至逐字背诵了我们机构定制的背景故事。Rio 的每一个权重张量，在所有 60 层和网络的每个组件中，在数千个标准差范围内，都与 Nex 和 Qwen 的 0.6/0.4 混合结果完全相同。其他的微调模型无法用这种插值方式来解释。以下是证据，请自行判断。

Hacker News | 最新 | 往期 | 评论 | 提问 | 展示 | 招聘 | 提交 | 登录里约热内卢的“自研”大语言模型似乎是现有模型的合并 (github.com/nex-agi) 14 积分，作者：unrvl22，52 分钟前 | 隐藏 | 往期 | 收藏 | 2 条评论 | 帮助 elzbardico 1 分钟前 | 下一条 [–] 这就是典型的巴西学术界。回复 unrvl22 52 分钟前 | 上一条 [–] 里约热内卢市政府（通过其 IT 公司 IplanRIO）发布了 Rio-3.5-Open-397B，声称这是自研的 Qwen3.5 微调版本，在基准测试中优于同类开源模型。链接中的议题指出，它实际上是约 60% Nex-N2 Pro 和 40% Qwen3.5-397B-A17B 的加权合并——而 Nex-N2 大约在一周前发布。回复指南 | 常见问题 | 列表 | API | 安全 | 法律 | 加入 YC | 联系方式搜索：

prefeitura-rio/Rio-3.5-Open-397B is presented as an original 397B model trained by IplanRIO. It is not. Its weights are a direct element-wise merge of our model, Nex, with the official Qwen3.5-397B-A17B base — about 0.6 Nex / 0.4 Qwen — and we find no evidence of any training of their own. We can show this two completely independent ways:

With Rio's hard-coded "You are Rio" system prompt removed, its own deployed model identifies itself as "Nex, from Nex-AGI" 79% of the time — and as "Rio" 0% of the time. It even recites our organization's bespoke backstory word-for-word.
Every weight tensor in Rio is, to thousands of standard deviations, the same 0.6/0.4 blend of Nex and Qwen — across all 60 layers and every component of the network. Other finetunes cannot be explained as interpolations.

Below is the evidence. Judge for yourself.

里约热内卢的“本土”大语言模型似乎是由现有模型合并而成的。 Rio de Janeiro's "homegrown" LLM appears to be a merge of an existing model

里约热内卢的“本土”大语言模型似乎是由现有模型合并而成的。
Rio de Janeiro's "homegrown" LLM appears to be a merge of an existing model