向语言模型中添加知识

Add Knowledge to Language Model

entity linking: map the word occurred in text to the entity in knowledge database.
incorporate pretrained entity embeddings from a different embedding space?

Fusion layer
ERNIE: pretraining tasks MLM, NSP, Knowledge Pretraining Task (on word-entity pairs)
KnowBERT: integrated entity linker. No annotations

Easy to modify (like SQL?) : )

KGLM: build a local knowledge graph when iterating over the sequence. LSTM to predict type of next word: related entity(in local KG), new entity(not in local KG), or not entity? take into account aliases vocabulary to the standard vocabulary
kNN-LM: it's easier to learn similarities between text sequences rather than predicting the next word. lambda: control the ratio of using kNN or traditional LM

mask or corrupt the training data to introduce additional training tasks(like justify the truth value of an argument).

Advantages: No additional memory or computation requirements. No modification of the architecture required.

WKLM: Weakly Supervised Knowledge-Pretrained Language Model. distinguish between true and false knowledge. Use MLM(token-level) loss and entity replacement(entity-level) loss to train.
Learn inductive biases through masking(more clever strategy about masking):
- ERNIE: phrase-level masking and entity-level masking (in Chinese language)
- Salient span masking:

LAMA: Language Model Analysis Probe. exam for the model: Cloze task （No fine-tune）
LAMA-UnHelpful Names: LAMA-UHN, remove examples from LAMA that can be answered without relational knowledge.
better prompts to query knowledge in LMs: model may be sensitive to the phrasing structure of sentence. sometimes slightly re-writing the query helps the model to recall what it has learned.
Knowledge-driven downstream tasks: (requires fine-tune) QA, entity typing, relation extraction

posted @ 2022-08-03 20:15 19376273 阅读(56) 评论(0) 编辑收藏举报

刷新页面返回顶部