人工智能价值对齐中的元价值规范探析 对价值对齐两种批评思路的一个回应

张立达

doi:10.14071/j.1008-8105(2026)-3003

人工智能价值对齐中的元价值规范探析对价值对齐两种批评思路的一个回应

张立达

An Analysis of Meta-value Norms in Artificial Intelligence Value AlignmentA Response to Two Criticisms of Value Alignment

ZHANG Li-da

摘要

摘要: 价值共生和理由对齐是两种自下而上的人工智能伦理设计思路，但是它们不能代替自上而下的伦理设计。要对人工智能伦理设计进行整体规约，就要确立价值对齐中的元价值规范。这一元价值规范可以总结为：以人机共生方式的可普遍化为准则，在这个抽象准则之下，任何具体的价值目标和行为方式都是有限的、不完备的，因此都是需要反思和修正的，但如果要修正目标和行为，必须给出尽可能充分的理由。由此，价值共生和理由对齐就可以整合进价值对齐的大框架中，人工智能伦理设计实现了更高层次的综合。

Abstract: Value symbiosis and reason alignment are two bottom-up ethical design ideas for artificial intelligence, but they can’t replace top-down ethical design. To regulate the overall ethical design of AI, it is necessary to establish meta-value norms in value alignment. This meta-value norm can be summarized as follows: taking the universalizability of human-computer symbiosis as the criterion, under this abstract criterion, any specific value goals and behavioral methods are limited and incomplete, and therefore require reflection and modification. However, if the goals and behaviors are to be modified, the fullest possible reasons must be given. From this, value symbiosis and reason alignment can be integrated into the larger framework of value alignment, and AI ethical design achieves a higher level of synthesis.

HTML全文

参考文献(18)

施引文献

资源附件(0)