Conference paper, Year: 2022

Technological taxonomies for hypernym and hyponym retrieval in patent texts

Abstract

This paper presents an automatic approach to creating taxonomies of technical terms based on the Cooperative Patent Classification (CPC). The resulting taxonomy contains about 170k nodes in 9 separate technological branches and is freely available. We also show that a Text-to-Text Transfer Transformer (T5) model can be fine-tuned to generate hypernyms and hyponyms with relatively high precision, confirming the manually assessed quality of the resource. The T5 model opens the taxonomy to any new technological terms for which a hypernym can be generated, thus making the resource updateable with new terms, an essential feature for the constantly evolving field of technological terminology.
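To make the hypernym and hyponym relations concrete, here is a minimal sketch of a taxonomy stored as parent links, together with a helper that turns each link into text-to-text training pairs of the kind a T5-style model could be fine-tuned on. It is not the authors' implementation: the `Taxonomy` class, the `to_text_pairs` helper, the prompt prefixes, and the toy terms are all assumptions made for illustration, and the sketch does not parse the CPC.

```python
from collections import defaultdict


class Taxonomy:
    """Toy taxonomy: each term has at most one direct hypernym (parent)."""

    def __init__(self):
        self.parent = {}                    # hyponym -> direct hypernym
        self.children = defaultdict(list)   # hypernym -> direct hyponyms

    def add(self, hyponym, hypernym):
        """Register a single hyponym -> hypernym link."""
        self.parent[hyponym] = hypernym
        self.children[hypernym].append(hyponym)

    def hypernyms(self, term):
        """Return all hypernyms of `term`, nearest first, up to the root."""
        chain = []
        while term in self.parent:
            term = self.parent[term]
            chain.append(term)
        return chain

    def hyponyms(self, term):
        """Return the direct hyponyms of `term` (empty list if none)."""
        return list(self.children.get(term, []))


def to_text_pairs(taxonomy):
    """Build (input, target) string pairs for text-to-text fine-tuning.

    The prompt prefixes are hypothetical, not taken from the paper.
    """
    pairs = []
    for hyponym, hypernym in taxonomy.parent.items():
        pairs.append((f"hypernym of: {hyponym}", hypernym))
        pairs.append((f"hyponym of: {hypernym}", hyponym))
    return pairs


# Illustrative terms only; they are not drawn from the published resource.
tax = Taxonomy()
tax.add("motor", "machine")
tax.add("electric motor", "motor")
tax.add("combustion engine", "motor")

print(tax.hypernyms("electric motor"))
print(tax.hyponyms("motor"))
print(to_text_pairs(tax)[:2])
```

In this setting, a term missing from the taxonomy could be attached by asking a fine-tuned model for its hypernym and calling `add` with the result, which is the kind of update the abstract describes.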
Main file
ToTh2022_ZUO_et_al.pdf (573.31 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-03850399, version 1 (14-11-2022)
hal-03850399, version 2 (11-12-2022)

Identifiers

  • HAL Id: hal-03850399, version 2

Cite

You Zuo, Yixuan Li, Alma Parias García, Kim Gerdes. Technological taxonomies for hypernym and hyponym retrieval in patent texts. ToTh 2022 - Terminology & Ontology: Theories and applications, Jun 2022, Chambéry, France. ⟨hal-03850399v2⟩
