Skip to Main content Skip to Navigation
New interface
Conference papers

A Hybrid Bi-LSTM-CRF Model for Sequence Labeling Applied to the Sourcing Domain

Hasnaa Daoud 1 Molka Tounsi Dhouib 2, 1 Jerôme Rancati 1 Catherine Faron 2 Andrea Tettamanzi 2 
2 WIMMICS - Web-Instrumented Man-Machine Interactions, Communities and Semantics
CRISAM - Inria Sophia Antipolis - Méditerranée , Laboratoire I3S - SPARKS - Scalable and Pervasive softwARe and Knowledge Systems
Abstract : In a number of areas, companies are often faced with the task of dealing with large amounts of textual customers' requests. Automating information extraction like key phrases from customers' requests can help to accelerate the processing process. Silex France is currently facing this challenge in the context of processing sourcing requests.In this article, we share our sequence labeling results based on a hybrid method Bi-LSTM-CRF, in an industrial context. This work was integrated in the B2B Silex platform for service providers recommendation. Experiments with the B2B Silex platform data show that, with a good choice of features to extract and optimal choice of hyper-parameters, the combination of the Bi-LSTM and CRF helps to achieve good results even in a context of small data. Indeed, the textual content processed is in the form of complete sentences generated by users, and thus is subject to typing errors. To handle this type of data we combine several types of extracted features describing the textual content such as: (i) semantics, (ii) syntax, (iii) word characters, (iv) position of words.
Complete list of metadata

Cited literature [11 references]  Display  Hide  Download
Contributor : Molka Tounsi Dhouib Connect in order to contact the contributor
Submitted on : Monday, September 7, 2020 - 3:07:13 PM
Last modification on : Thursday, August 4, 2022 - 4:55:01 PM
Long-term archiving on: : Wednesday, December 2, 2020 - 9:53:49 PM


Files produced by the author(s)


  • HAL Id : hal-02932095, version 1



Hasnaa Daoud, Molka Tounsi Dhouib, Jerôme Rancati, Catherine Faron, Andrea Tettamanzi. A Hybrid Bi-LSTM-CRF Model for Sequence Labeling Applied to the Sourcing Domain. PFIA-APIA 2020 - 5ème Conférence Nationale sur les Applications Pratiques de l’Intelligence Artificielle, Jun 2020, Angers, France. ⟨hal-02932095⟩



Record views


Files downloads