Web: http://arxiv.org/abs/2205.01366

May 4, 2022, 1:11 a.m. | Jeevesh Juneja, Ritu Agarwal

cs.LG updates on arXiv.org

We analyze the Knowledge Neurons framework for the attribution of factual and
relational knowledge to particular neurons in the transformer network. We use a
12-layer multilingual BERT model for our experiments. Our study reveals
several interesting phenomena. We observe that factual knowledge can mostly be
attributed to the middle and higher layers of the network ($\ge 6$). Further
analysis reveals that the middle layers ($6$-$9$) are mostly responsible for
relational information, which is further refined into actual factual knowledge
or the "correct answer" …
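
The layer-wise claims above amount to a tally of attributed neurons per layer. The following is a minimal sketch of such a tally, not the authors' code: the function names, the `(layer, index)` representation, the 0-based layer numbering, and the example neurons are all assumptions made for illustration, and none of the numbers come from the paper.

```python
from collections import Counter

NUM_LAYERS = 12  # matches the 12-layer BERT mentioned in the abstract


def layer_histogram(neurons, num_layers=NUM_LAYERS):
    """Count attributed neurons per layer.

    `neurons` is an iterable of (layer, index) pairs; layers are assumed
    to be numbered from 0, which the abstract does not specify.
    """
    counts = Counter(layer for layer, _ in neurons)
    return [counts.get(layer, 0) for layer in range(num_layers)]


def share_in_range(hist, lo, hi):
    """Fraction of attributed neurons whose layer lies in [lo, hi], inclusive."""
    total = sum(hist)
    return sum(hist[lo:hi + 1]) / total if total else 0.0


# Hypothetical attribution output, for illustration only (not paper data).
example_neurons = [
    (2, 17), (6, 301), (7, 45), (7, 1200),
    (9, 8), (10, 77), (11, 512), (11, 9),
]

hist = layer_histogram(example_neurons)
print(hist)                         # per-layer counts
print(share_in_range(hist, 6, 11))  # share in layers >= 6
print(share_in_range(hist, 6, 9))   # share in layers 6-9
```

With real attribution output in place of `example_neurons`, the two shares give a quick check of how concentrated the attributed neurons are in layers $\ge 6$ and in layers $6$-$9$.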


