Powered by GitBook

Debiasing word embeddings

前言

如果AI在分析問題時存在偏見時，我們該如何消除這些偏見。

內容

The problem of bias in word embedding

根據底下的論文，發現AI透過word embedding進行學習時，會不自覺得學到一些偏見，例如男女性別的固定的刻板印象。

Addressing bias in word embeddings

先找出偏見的方向，例如可能對性別有偏見，那就先從embedding vetocor(原本共有300個維度/特徵)找出可能有關於性別含義的那些特徵(he, she, male, female)。

將he, she的embedding vector相減。

將male, female的embedding vector相減。

將這些相減的vector在進行平均。

找出x軸(性別偏見所在bias)。

再來將哪些可存在性別偏見的詞彙(doctor, babysitter)進行bias消除，例如將doctor, babysitter的embedding vector中將有關於性別偏見的維度改成0，也就是將這些詞投射到x軸為0。

在調整一下本別就有性別差異的詞彙組(girl-boy, he-she)的位置。

該學到的

如何消除偏見的方式。

results matching ""

No results matching ""