Estimated reading time: 1 minute

Rio de Janeiro's "homegrown" LLM appears to be a merge of an existing model

Summary

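Nex-AGI reports that Rio-3.5-Open-397B is in fact a weight merge of its Nex model with Qwen3.5-397B-A17B at a 0.6:0.4 ratio. The evidence: with the preset system prompt removed, the model identifies itself as Nex 79% of the time and recites Nex's backstory; and the weight tensors across all 60 layers match that specific blend ratio, with no sign of any independent training.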

Why read this

Rio 3.5 has been exposed as a 0.6/0.4 weight merge of Nex-N2-Pro and Qwen3.5-397B, with no evidence that IplanRIO ran any training of its own.

Original text

prefeitura-rio/Rio-3.5-Open-397B is presented as an original 397B model trained by IplanRIO. It is not. Its weights are a direct element-wise merge of our model, Nex, with the official Qwen3.5-397B-A17B base — about 0.6 Nex / 0.4 Qwen — and we find no evidence of any training of their own. We can show this two completely independent ways:

With Rio's hard-coded "You are Rio" system prompt removed, its own deployed model identifies itself as "Nex, from Nex-AGI" 79% of the time — and as "Rio" 0% of the time. It even recites our organization's bespoke backstory word-for-word.
Every weight tensor in Rio is, to thousands of standard deviations, the same 0.6/0.4 blend of Nex and Qwen — across all 60 layers and every component of the network. Other finetunes cannot be explained as interpolations.
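
The post does not spell out here how the blend ratio was measured, so the following is only a minimal sketch of one way such a check could work, under stated assumptions. For a single flattened weight tensor, it fits the least-squares coefficient alpha in suspect ≈ alpha * model_a + (1 - alpha) * model_b and reports how much of the suspect tensor the fit leaves unexplained. The function name `estimate_blend` and the synthetic data are hypothetical stand-ins, not Nex-AGI's code or real model weights.

```python
import random

def estimate_blend(suspect, model_a, model_b):
    """Least-squares alpha for suspect ≈ alpha*model_a + (1-alpha)*model_b.

    Returns (alpha, relative_residual), where relative_residual is the norm of
    the unexplained part divided by the norm of (model_a - model_b).
    """
    diff_ab = [a - b for a, b in zip(model_a, model_b)]
    diff_sb = [s - b for s, b in zip(suspect, model_b)]
    denom = sum(d * d for d in diff_ab)
    if denom == 0.0:
        raise ValueError("model_a and model_b are identical; alpha is undefined")
    # Project (suspect - model_b) onto the direction (model_a - model_b).
    alpha = sum(s * d for s, d in zip(diff_sb, diff_ab)) / denom
    residual_sq = sum((s - alpha * d) ** 2 for s, d in zip(diff_sb, diff_ab))
    return alpha, (residual_sq / denom) ** 0.5


# Synthetic stand-ins for one flattened weight tensor (assumption: not real weights).
rng = random.Random(0)
base = [rng.gauss(0.0, 1.0) for _ in range(10_000)]
finetune = [b + rng.gauss(0.0, 0.1) for b in base]
# A pure 0.6/0.4 element-wise merge of the finetune and the base.
merged = [0.6 * f + 0.4 * b for f, b in zip(finetune, base)]
# A separately perturbed copy of the base, standing in for independent training.
independent = [b + rng.gauss(0.0, 0.1) for b in base]

alpha_merged, resid_merged = estimate_blend(merged, finetune, base)
alpha_indep, resid_indep = estimate_blend(independent, finetune, base)
print(f"merged:      alpha={alpha_merged:.3f}, residual={resid_merged:.3g}")
print(f"independent: alpha={alpha_indep:.3f}, residual={resid_indep:.3g}")
```

In this toy setup a true merge yields a fitted alpha at the blend ratio with a residual near floating-point noise, while the independently perturbed tensor leaves a residual comparable in size to the finetune's own delta. That contrast is the kind of signal the quoted claim describes; repeating the fit per tensor across layers would show whether the ratio is consistent.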

Below is the evidence. Judge for yourself.

Hacker News · 132 points · 76 comments
