开源项目面临日益严重的问题:由LLM生成的问题。
LLM policy?

原始链接: https://github.com/opencontainers/runc/issues/4990

该项目正在经历由LLM生成的拉取请求和错误报告数量增加,由此引发了对明确接受准则的需求。一个主要问题是验证LLM生成问题的有效性——由于描述通常包含过多且可能不准确的信息,建议将其视为垃圾信息,并要求提供原始提示。 关于代码贡献,建议提交者必须通过用自己的话回复审查请求来证明他们对所做更改的真正理解。这解决了关于作者身份和满足开发者证书来源(DCO)要求的担忧,尽管后者存在分歧。 最终,团队需要决定一项政策并在`CONTRIBUTING.md`中记录下来,可能效仿Incus的例子,直接禁止LLM生成的贡献。目标是维护代码质量并确保真实的问题报告。

相关文章

原文

We've seen a slight uptick in pull-requests and bug reports which appear to be LLM-generated, so it's probably about time to come to a decision on what we should and should not accept and document this somewhere (presumably in CONTRIBUTING.md).

My personal opinion is we shouldn't accept anything LLM-generated, but this is probably not the common position of most @opencontainers/runc-maintainers, so we should probably consider LLM-generated code and issues separately.

IMHO, we should close all LLM-generated issues as spam, because even if they are describing real issues the entire issue description contains so much unneeded (and probably incorrect) information that it'd be better if they just provided their LLM prompt as an issue instead. More importantly, when triaging bugs we have to assume that what the user has written did actually happen to them, but with LLM-generated issues -- who knows whether the description is actually describing something real? (See #4982 and #4972 as possible examples of LLM-generated bug reports.)

For LLM-generated code, I think the minimum bar should be that the submitter needs to be able to respond to review requests in their own words (i.e., they understand what their patch does and was able to write the code themselves). (#4940 and #4939 was the most recent example of this I can think of, and I'm not convinced the submitter would've cleared this bar.)

(FWIW, my view is that LLM-generated code cannot fulfil the requirements of the DCO and so we shouldn't accept it for the legal reasons alone, but I appreciate this is a minority view.)

For reference, Incus added add a note to their CONTRIBUTING.md earlier this year, banning all LLM usage.

联系我们 contact @ memedata.com