Combining semi-supervised and active learning to recognize minority senses in a new corpus

Ponencia presentada en la 24th International Joint Conference on Artificial Intelligence. Workshop on Replicability and Reproducibility in Natural Language Processing: adaptive methods, resources and software. Buenos Aires, Argentina, del 25 al 31 de julio de 2015.

Bibliographic Details
Main Authors: Cardellino, Cristian Adrián, Teruel, Milagro, Alonso i Alemany, Laura
Format: conferenceObject
Language:eng
Published: 2021
Subjects:
Online Access:http://hdl.handle.net/11086/22132
_version_ 1801216354484748288
author Cardellino, Cristian Adrián
Teruel, Milagro
Alonso i Alemany, Laura
author_facet Cardellino, Cristian Adrián
Teruel, Milagro
Alonso i Alemany, Laura
author_sort Cardellino, Cristian Adrián
collection Repositorio Digital Universitario
description Ponencia presentada en la 24th International Joint Conference on Artificial Intelligence. Workshop on Replicability and Reproducibility in Natural Language Processing: adaptive methods, resources and software. Buenos Aires, Argentina, del 25 al 31 de julio de 2015.
format conferenceObject
id rdu-unc.22132
institution Universidad Nacional de Cordoba
language eng
publishDate 2021
record_format dspace
spelling rdu-unc.221322022-10-13T11:05:55Z Combining semi-supervised and active learning to recognize minority senses in a new corpus Cardellino, Cristian Adrián Teruel, Milagro Alonso i Alemany, Laura Natural language processing Active learning Semi-supervised learning Ponencia presentada en la 24th International Joint Conference on Artificial Intelligence. Workshop on Replicability and Reproducibility in Natural Language Processing: adaptive methods, resources and software. Buenos Aires, Argentina, del 25 al 31 de julio de 2015. Fil: Cardellino, Cristian Adrián. Universidad Nacional de Córdoba. Facultad de Matemática, Astronomía y Física; Argentina. Fil: Teruel, Milagro. Universidad Nacional de Córdoba. Facultad de Matemática, Astronomía y Física; Argentina. Fil: Alonso i Alemany, Laura. Universidad Nacional de Córdoba. Facultad de Matemática, Astronomía y Física; Argentina. In this paper we study the impact of combining active learning with bootstrapping to grow a small annotated corpus from a different, unannotated corpus. The intuition underlying our approach is that bootstrapping includes instances that are closer to the generative centers of the data, while discriminative approaches to active learning include instances that are closer to the decision boundaries of classifiers. We build an initial model from the original annotated corpus, which is then iteratively enlarged by including both manually annotated examples and automatically labelled examples as training examples for the following iteration. Examples to be annotated are selected in each iteration by applying active learning techniques. We show that intertwining an active learning component in a bootstrapping approach helps to overcome an initial bias towards a majority class, thus facilitating adaptation of a starting dataset towards the real distribution of a different, unannotated corpus. Fil: Cardellino, Cristian Adrián. Universidad Nacional de Córdoba. Facultad de Matemática, Astronomía y Física; Argentina. Fil: Teruel, Milagro. Universidad Nacional de Córdoba. Facultad de Matemática, Astronomía y Física; Argentina. Fil: Alonso i Alemany, Laura. Universidad Nacional de Córdoba. Facultad de Matemática, Astronomía y Física; Argentina. Otras Ciencias de la Computación e Información 2021-12-28T14:30:44Z 2021-12-28T14:30:44Z 2015 conferenceObject http://hdl.handle.net/11086/22132 eng Attribution-NonCommercial-NoDerivatives 4.0 International http://creativecommons.org/licenses/by-nc-nd/4.0/ Electrónico y/o Digital
spellingShingle Natural language processing
Active learning
Semi-supervised learning
Cardellino, Cristian Adrián
Teruel, Milagro
Alonso i Alemany, Laura
Combining semi-supervised and active learning to recognize minority senses in a new corpus
title Combining semi-supervised and active learning to recognize minority senses in a new corpus
title_full Combining semi-supervised and active learning to recognize minority senses in a new corpus
title_fullStr Combining semi-supervised and active learning to recognize minority senses in a new corpus
title_full_unstemmed Combining semi-supervised and active learning to recognize minority senses in a new corpus
title_short Combining semi-supervised and active learning to recognize minority senses in a new corpus
title_sort combining semi supervised and active learning to recognize minority senses in a new corpus
topic Natural language processing
Active learning
Semi-supervised learning
url http://hdl.handle.net/11086/22132
work_keys_str_mv AT cardellinocristianadrian combiningsemisupervisedandactivelearningtorecognizeminoritysensesinanewcorpus
AT teruelmilagro combiningsemisupervisedandactivelearningtorecognizeminoritysensesinanewcorpus
AT alonsoialemanylaura combiningsemisupervisedandactivelearningtorecognizeminoritysensesinanewcorpus