<  Retour au portail Polytechnique Montréal

DiSeg 1.0: the first system for Spanish discourse segmentation

Iria Da Cunha, Eric San Juan, Juan-Manuel Torres-Moreno, Marina Lloberese et Irene Castellóne

Article de revue (2012)

Un lien externe est disponible pour ce document
Afficher le résumé
Cacher le résumé

Abstract

Nowadays discourse parsing is a very prominent research topic. However, there is not a discourse parser for Spanish texts. The first stage in order to develop this tool is discourse segmentation. In this work, we present DiSeg, the first discourse segmenter for Spanish, which uses the framework of Rhetorical Structure Theory and is based on lexical and syntactic rules. We describe the system and we evaluate its performance against a gold standard corpus, divided in a medical and a terminological subcorpus. We obtain promising results, which means that discourse segmentation is possible using shallow parsing.

Mots clés

Matériel d'accompagnement:
Département: Département de génie informatique et génie logiciel
Organismes subventionnaires: National Plan of Scientific Research, PAPIIT-DGAPA, Spanish Science and Innovation Ministry
Numéro de subvention: 82050, IN403108, TIN2009-14715-C04-03, FFI2010-21365-C03-01, FFI2009-12188-C05-01
URL de PolyPublie: https://publications.polymtl.ca/15591/
Titre de la revue: Expert Systems With Applications (vol. 39, no 2)
Maison d'édition: Elsevier
DOI: 10.1016/j.eswa.2011.06.058
URL officielle: https://doi.org/10.1016/j.eswa.2011.06.058
Date du dépôt: 18 avr. 2023 15:10
Dernière modification: 27 mars 2026 11:44
Citer en APA 7: Da Cunha, I., San Juan, E., Torres-Moreno, J.-M., Lloberese, M., & Castellóne, I. (2012). DiSeg 1.0: the first system for Spanish discourse segmentation. Expert Systems With Applications, 39(2), 1671-1678. https://doi.org/10.1016/j.eswa.2011.06.058

Statistiques

Dimensions

Actions réservées au personnel

Afficher document Afficher document