Skip to Main content Skip to Navigation
New interface
Conference papers

Tamil Paraphrase Detection Using Encoder-Decoder Neural Networks

Abstract : Detecting paraphrases in Indian languages require critical analysis on the lexical, syntactic and semantic features. Since the structure of Indian languages differ from the other languages like English, the usage of lexico-syntactic features vary between the Indian languages and plays a critical role in determining the performance of the system. Instead of using various lexico-syntactic similarity features, we aim to apply a complete end-to-end system using deep learning networks with no lexico-syntactic features. In this paper we exploited the encoder-decoder model of deep neural network to analyze the paraphrase sentences in Tamil language and to classify. In this encoder-decoder model, LSTM, GRU units and gNMT are used as layers along with attention mechanism. Using this end-to-end model, there is an increase in f1-measure by 0.5% for the subtask-1 when compared to the state-of-the-art systems. The system was trained and evaluated on DPIL@FIRE2016 Shared Task dataset. To our knowledge, ours is the first deep learning model which validates the training instances of both the subtask-1 and subtask-2 dataset of DPIL shared task.
Document type :
Conference papers
Complete list of metadata

https://hal.inria.fr/hal-03434784
Contributor : Hal Ifip Connect in order to contact the contributor
Submitted on : Thursday, November 18, 2021 - 2:20:37 PM
Last modification on : Thursday, November 18, 2021 - 2:32:10 PM
Long-term archiving on: : Saturday, February 19, 2022 - 7:11:39 PM

File

 Restricted access
To satisfy the distribution rights of the publisher, the document is embargoed until : 2023-01-01

Please log in to resquest access to the document

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Citation

B. Senthil Kumar, D. Thenmozhi, S. Kayalvizhi. Tamil Paraphrase Detection Using Encoder-Decoder Neural Networks. 3rd International Conference on Computational Intelligence in Data Science (ICCIDS), Feb 2020, Chennai, India. pp.30-42, ⟨10.1007/978-3-030-63467-4_3⟩. ⟨hal-03434784⟩

Share

Metrics

Record views

17