Semi-automatic Domain Ontology Construction from Spoken Corpus in Tunisian Dialect: Railway Request Information

Jihen Karoui; Marwa Graja; Mohamed Mahdi Boudabous; Lamia Hadrich Belguith

doi:10.3991/ijes.v1i1.2925

Authors

Jihen Karoui, Miracl Laboratory, Automatic Natural Language Processing Research Group, ANLP-RG, University of Sfax, Tunisia
Marwa Graja, Miracl Laboratory, Automatic Natural Language Processing Research Group, ANLP-RG, University of Sfax, Tunisia
Mohamed Mahdi Boudabous, Miracl Laboratory, Automatic Natural Language Processing Research Group, ANLP-RG, University of Sfax, Tunisia
Lamia Hadrich Belguith, Miracl Laboratory, Automatic Natural Language Processing Research Group, ANLP-RG, University of Sfax, Tunisia

DOI:

https://doi.org/10.3991/ijes.v1i1.2925

Abstract

In this paper, we present a hybrid method for the semi-automatic construction of a domain ontology from a spoken dialogue corpus in the Tunisian dialect, targeting the railway information request domain. The method combines a statistical approach for term and concept extraction with a linguistic approach for semantic relation extraction. It consists of three phases: corpus construction and processing, ontology construction, and ontology evaluation. The method is implemented in the ABDO system, which generates the RIO ontology containing 14 concepts, 25 semantic relations and 387 concept instances. The generated domain ontology is used to semantically label Tunisian dialect utterances in spoken dialogue.

Author Biographies

Jihen Karoui, Miracl Laboratory, Automatic Natural Language Processing Research Group, ANLP-RG, University of Sfax, Tunisia

Master's Student

Marwa Graja, Miracl Laboratory, Automatic Natural Language Processing Research Group, ANLP-RG, University of Sfax, Tunisia

PhD Student

Mohamed Mahdi Boudabous, Miracl Laboratory, Automatic Natural Language Processing Research Group, ANLP-RG, University of Sfax, Tunisia

PhD Student

Lamia Hadrich Belguith, Miracl Laboratory, Automatic Natural Language Processing Research Group, ANLP-RG, University of Sfax, Tunisia

Professor of Computer Science

Published

2013-07-23

How to Cite

Karoui, J., Graja, M., Boudabous, M. M., & Hadrich Belguith, L. (2013). Semi-automatic Domain Ontology Construction from Spoken Corpus in Tunisian Dialect: Railway Request Information. International Journal of Recent Contributions from Engineering, Science & IT (iJES), 1(1), pp. 35–38. https://doi.org/10.3991/ijes.v1i1.2925

Issue

Vol. 1 No. 1 (2013)

Section

Papers

License

The submitting author warrants that the submission is original and that she/he is the author of the submission together with the named co-authors; to the extent the submission incorporates text passages, figures, data or other material from the work of others, the submitting author has obtained any necessary permission.
Articles in this journal are published under the Creative Commons Attribution Licence (CC-BY). This gives readers more legal certainty about what they can do with published articles, and thus supports wider dissemination and archiving, which in turn makes publishing with this journal more valuable for the authors.
By submitting an article the author grants to this journal the non-exclusive right to publish it. The author retains the copyright and the publishing rights for the article without any restrictions.
This journal has been awarded the SPARC Europe Seal for Open Access Journals.
