Semi-automatic Domain Ontology Construction from Spoken Corpus in Tunisian Dialect: Railway Request Information

Authors

  • Jihen Karoui, Miracl Laboratory, Automatic Natural Language Processing Research Group, ANLP-RG, University of Sfax, Tunisia
  • Marwa Graja, Miracl Laboratory, Automatic Natural Language Processing Research Group, ANLP-RG, University of Sfax, Tunisia
  • Mohamed Mahdi Boudabous, Miracl Laboratory, Automatic Natural Language Processing Research Group, ANLP-RG, University of Sfax, Tunisia
  • Lamia Hadrich Belguith, Miracl Laboratory, Automatic Natural Language Processing Research Group, ANLP-RG, University of Sfax, Tunisia

DOI:

https://doi.org/10.3991/ijes.v1i1.2925

Abstract


In this paper, we present a hybrid method for the semi-automatic construction of a domain ontology from a spoken dialogue corpus in Tunisian Dialect, covering the railway request information domain. The method combines a statistical approach for term and concept extraction with a linguistic approach for semantic relation extraction. It consists of three phases: corpus construction and processing, ontology construction, and ontology evaluation. The method is implemented in the ABDO system, which generates the RIO ontology containing 14 concepts, 25 semantic relations and 387 concept instances. The generated domain ontology is used to semantically label Tunisian Dialect utterances in spoken dialogue.

Author Biographies

Jihen Karoui, Miracl Laboratory, Automatic Natural Language Processing Research Group, ANLP-RG, University of Sfax, Tunisia

Master Student

Marwa Graja, Miracl Laboratory, Automatic Natural Language Processing Research Group, ANLP-RG, University of Sfax, Tunisia

PhD Student

Mohamed Mahdi Boudabous, Miracl Laboratory, Automatic Natural Language Processing Research Group, ANLP-RG, University of Sfax, Tunisia

PhD Student

Lamia Hadrich Belguith, Miracl Laboratory, Automatic Natural Language Processing Research Group, ANLP-RG, University of Sfax, Tunisia

Professor of Computer Science

Published

2013-07-23

How to Cite

Karoui, J., Graja, M., Boudabous, M. M., & Hadrich Belguith, L. (2013). Semi-automatic Domain Ontology Construction from Spoken Corpus in Tunisian Dialect: Railway Request Information. International Journal of Recent Contributions from Engineering, Science & IT (iJES), 1(1), pp. 35–38. https://doi.org/10.3991/ijes.v1i1.2925

Section

Papers