NLPContributions: An Annotation Scheme for Machine Reading of Scholarly Contributions in Natural Language Processing Literature

Zur Kurzanzeige

dc.identifier.uri http://dx.doi.org/10.15488/16292
dc.identifier.uri https://www.repo.uni-hannover.de/handle/123456789/16419
dc.contributor.author D'Souza, Jennifer
dc.contributor.author Auer, Sören
dc.contributor.editor Zhang, Chengzhi
dc.contributor.editor Mayr, Philipp
dc.contributor.editor Lu, Wie
dc.contributor.editor Zhang, Yi
dc.date.accessioned 2024-02-13T08:26:16Z
dc.date.available 2024-02-13T08:26:16Z
dc.date.issued 2020
dc.identifier.citation D'Souza, J.; Auer, S.: NLPContributions: An Annotation Scheme for Machine Reading of Scholarly Contributions in Natural Language Processing Literature. In: Zhang, Chengzhi; Mayr, Philipp; Lu, Wie; Zhang, Yi (Eds.): EEKE 2020, 1st Workshop on Extraction and Evaluation of Knowledge Entities from Scientific Documents : proceedings of the 1st Workshop on Extraction and Evaluation of Knowledge Entities from Scientific Documents, co-located with the ACM/IEEE Joint Conference on Digital Libraries in 2020 (JCDL 2020). Aachen, Germany : RWTH Aachen, 2020 (CEUR Workshop Proceedings ; 2658), S. 16-27.
dc.description.abstract We describe an annotation initiative to capture the scholarly contributions in natural language processing (NLP) articles, particularly, for the articles that discuss machine learning (ML) approaches for various information extraction tasks. We develop the annotation task based on a pilot annotation exercise on 50 NLP-ML scholarly articles presenting contributions to five information extraction tasks 1. machine translation, 2. named entity recognition, 3. Question answering, 4. relation classification, and 5. text classification. In this article, we describe the outcomes of this pilot annotation phase. Through the exercise we have obtained an annotation methodology; and found ten core information units that reflect the contribution of the NLP-ML scholarly investigations. The resulting annotation scheme we developed based on these information units is called NLPContributions. The overarching goal of our endeavor is four-fold: 1) to find a systematic set of patterns of subject-predicate-object statements for the semantic structuring of scholarly contributions that are more or less generically applicable for NLP-ML research articles; 2) to apply the discovered patterns in the creation of a larger annotated dataset for training machine readers [18] of research contributions; 3) to ingest the dataset into the Open Research Knowledge Graph (ORKG) infrastructure as a showcase for creating user-friendly state-of-the-art overviews; 4) to integrate the machine readers into the ORKG to assist users in the manual curation of their respective article contributions. We envision that the NLPContributions methodology engenders a wider discussion on the topic toward its further refinement and development. Our pilot annotated dataset of 50 NLP-ML scholarly articles according to the NLPContributions scheme is openly available to the research community at https://doi.org/10.25835/0019761. eng
dc.language.iso eng
dc.publisher Aachen, Germany : RWTH Aachen
dc.relation.ispartof EEKE 2020, 1st Workshop on Extraction and Evaluation of Knowledge Entities from Scientific Documents : proceedings of the 1st Workshop on Extraction and Evaluation of Knowledge Entities from Scientific Documents, co-located with the ACM/IEEE Joint Conference on Digital Libraries in 2020 (JCDL 2020)
dc.relation.ispartofseries CEUR Workshop Proceedings ; 2658
dc.relation.uri https://ceur-ws.org/Vol-2658/paper2.pdf
dc.rights CC BY 4.0 Unported
dc.rights.uri https://creativecommons.org/licenses/by/4.0
dc.subject dataset eng
dc.subject annotation guidelines eng
dc.subject semantic publishing eng
dc.subject digital libraries eng
dc.subject scholarly knowledge graphs eng
dc.subject open science graphs eng
dc.subject.classification Konferenzschrift ger
dc.subject.ddc 004 | Informatik
dc.subject.ddc 020 | Bibliotheks- und Informationswissenschaft
dc.title NLPContributions: An Annotation Scheme for Machine Reading of Scholarly Contributions in Natural Language Processing Literature eng
dc.type BookPart
dc.type Text
dc.relation.essn 1613-0073
dc.bibliographicCitation.volume 2658
dc.bibliographicCitation.firstPage 16
dc.bibliographicCitation.lastPage 27
dc.description.version publishedVersion
tib.accessRights frei zug�nglich


Die Publikation erscheint in Sammlung(en):

Zur Kurzanzeige

 

Suche im Repositorium


Durchblättern

Mein Nutzer/innenkonto

Nutzungsstatistiken