3527 articles – 5253 references  [version française]

halshs-00004541, version 1

Proposals for a normalized representation of Standard Arabic full form lexica

Susanne Salmon-Alt () 1, Amine Akrout 1, Laurent Romary () 2

International Conference on Machine Intelligence (2005)

Abstract: Standardized lexical resources are an important prerequisite for the development of robust and wide coverage natural language processing application. Therefore, we applied the Lexical Markup Framework, a recent ISO initiative towards standards for designing, implementing and representing lexical resources, on a test bed of data for an Arabic full form lexicon. Besides minor structural accommodation that would be needed in order to take into account the traditional root-based organization of Arabic dictionaries, the LMF proposal appeared to be suitable to our purpose, especially because of the separate management of the hierarchical data structure (LMF core model) and elementary linguistic descriptors (data categories)

  • 1:  Analyse et Traitement Informatique de la Langue Française (ATILF)
  • CNRS : UMR7118 – Université Henri Poincaré - Nancy I – Université Nancy II
  • 2:  LANGUE ET DIALOGUE (INRIA Lorraine - LORIA)
  • INRIA – CNRS : UMR7503 – Université Henri Poincaré - Nancy I – Université Nancy II – Institut National Polytechnique de Lorraine (INPL)
  • Domain : Humanities and Social Sciences/Library and information sciences
  • Keywords : standards – lexicon – Arabic language – morphology
 
  • halshs-00004541, version 1
  • oai:halshs.archives-ouvertes.fr:halshs-00004541
  • From: 
  • Submitted on: Friday, 2 September 2005 14:56:14
  • Updated on: Tuesday, 4 July 2006 15:25:01