« Chain-of-Knowledge » : différence entre les versions


(Page créée avec « ==en construction== == Définition == XXXXXXXXX == Français == ''' XXXXXXXXX ''' == Anglais == ''' Chain-of-Knowledge''' The CHAIN-OF-KNOWLEDGE framework has two main components: dataset construction and model learning. For dataset construction, the authors first mine compositional rules from knowledge graphs. These rules represent patterns of how different facts can be combined to infer new knowledge. They then select knowledge triples from the graph that... »)
 
Aucun résumé des modifications
 
(Une version intermédiaire par le même utilisateur non affichée)
Ligne 10 : Ligne 10 :
''' Chain-of-Knowledge'''
''' Chain-of-Knowledge'''


  The CHAIN-OF-KNOWLEDGE framework has two main components: dataset construction and model learning.
''' Chain-of-Knowledge framework '''
For dataset construction, the authors first mine compositional rules from knowledge graphs. These rules represent patterns of how different facts can be combined to infer new knowledge. They then select knowledge triples from the graph that match these rules. Finally, they use advanced language models to transform the structured knowledge into natural language questions and reasoning steps.
 
   
  The CHAIN-OF-KNOWLEDGE framework has two main components: dataset construction and model learning. For dataset construction, the authors first mine compositional rules from knowledge graphs. These rules represent patterns of how different facts can be combined to infer new knowledge. They then select knowledge triples from the graph that match these rules. Finally, they use advanced language models to transform the structured knowledge into natural language questions and reasoning steps.
For model learning, they initially tried simply fine-tuning LLMs on this data. However, this led to "rule overfitting" where models would apply rules even without supporting facts. To address this, they introduced a trial-and-error mechanism. This simulates how humans explore their internal knowledge when reasoning, by having the model try different rules and backtrack if it lacks key facts.
 
  For model learning, they initially tried simply fine-tuning LLMs on this data. However, this led to "rule overfitting" where models would apply rules even without supporting facts. To address this, they introduced a trial-and-error mechanism. This simulates how humans explore their internal knowledge when reasoning, by having the model try different rules and backtrack if it lacks key facts.
   
   



Dernière version du 9 juillet 2024 à 10:07

en construction

Définition

XXXXXXXXX

Français

XXXXXXXXX

Anglais

Chain-of-Knowledge

Chain-of-Knowledge framework

The CHAIN-OF-KNOWLEDGE framework has two main components: dataset construction and model learning. For dataset construction, the authors first mine compositional rules from knowledge graphs. These rules represent patterns of how different facts can be combined to infer new knowledge. They then select knowledge triples from the graph that match these rules. Finally, they use advanced language models to transform the structured knowledge into natural language questions and reasoning steps.
 
For model learning, they initially tried simply fine-tuning LLMs on this data. However, this led to "rule overfitting" where models would apply rules even without supporting facts. To address this, they introduced a trial-and-error mechanism. This simulates how humans explore their internal knowledge when reasoning, by having the model try different rules and backtrack if it lacks key facts.

Source

Source : huggingface



Contributeurs: wiki