华东师范大学学报 (Journal of East China Normal University, Philosophy and Social Sciences Edition) ›› 2024, Vol. 56 ›› Issue (2): 31-41. doi: 10.16382/j.cnki.1000-5579.2024.02.004

• Restarting the Dialogue between Technology and the Humanities •

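Opensource Knowledge Production in the Age of Large Language Models: Opportunities, Challenges, and the Future

Lihao Gan (甘莅豪)

  • Accepted: 2024-03-05 Online: 2024-03-15 Published: 2024-04-08
  • About the author: Lihao Gan, Professor, School of Communication, East China Normal University (Shanghai 200241)
  • Funding:
    Key Project of the National Social Science Fund of China, "Research on Chinese Rhetoric Theory Centered on 'New Speech Act Analysis'" (Project No. 19AYY002); Major Project of the National Social Science Fund of China, "Research on Language Issues in the Social Governance of Cyberspace" (Project No. 20&ZD299).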

Abstract:

The rise of large language models (LLMs) is transforming open-source knowledge production, bringing benefits and potential challenges together. On the one hand, LLMs markedly improve the efficiency with which open-source knowledge is generated and disseminated: they adapt to Cunningham's Law as it operates in open-source communities, provide round-the-clock knowledge training support for newcomers, and correct systematic bias in knowledge production through domain-specific construction strategies. On the other hand, LLMs bring hallucination, copyright risk, and digital exploitation, and they intensify the "dead internet" trend, all of which seriously threaten the verification, legitimacy, values, and ecosystem of open-source knowledge. Going forward, human cognitive experience should therefore play the core role in guiding the development of LLM technology, and solutions should be explored continually through practice, so that LLM-based knowledge production and open-source knowledge production can coexist harmoniously and advance together.

Key words: large language models, open-source knowledge production, systematic bias, hallucination, digital exploitation, Wikipedia