Better Call the Plumber: Orchestrating Dynamic Information Extraction Pipelines

Zur Kurzanzeige

dc.identifier.uri http://dx.doi.org/10.15488/16804
dc.identifier.uri https://www.repo.uni-hannover.de/handle/123456789/16931
dc.contributor.author Jaradeh, Mohamad Yaser
dc.contributor.author Singh, Kuldeep
dc.contributor.author Stocker, Markus
dc.contributor.author Both, Andreas
dc.contributor.author Auer, Sören
dc.contributor.editor Brambilla, M.
dc.contributor.editor Chbeir, R.
dc.contributor.editor Frasincar, F.
dc.contributor.editor Manolescu, I.
dc.date.accessioned 2024-03-26T09:31:15Z
dc.date.available 2024-03-26T09:31:15Z
dc.date.issued 2021
dc.identifier.citation Jaradeh, M.Y.; Singh, K.; Stocker, M.; Both, A.; Auer, S.: Better Call the Plumber: Orchestrating Dynamic Information Extraction Pipelines. In: Brambilla, M.; Chbeir, R.; Frasincar, F.; Manolescu, I. (Eds.): Web Engineering. ICWE 2021. New York, NY : Springer, 2021 (Lecture notes in computer science ; 12706), S. 240-254. DOI: https://doi.org/10.1007/978-3-030-74296-6_19
dc.description.abstract We propose Plumber, the first framework that brings together the research community’s disjoint information extraction (IE) efforts. The Plumber architecture comprises 33 reusable components for various Knowledge Graphs (KG) information extraction subtasks, such as coreference resolution, entity linking, and relation extraction. Using these components, Plumber dynamically generates suitable information extraction pipelines and offers overall 264 distinct pipelines. We study the optimization problem of choosing suitable pipelines based on input sentences. To do so, we train a transformer-based classification model that extracts contextual embeddings from the input and finds an appropriate pipeline. We study the efficacy of Plumber for extracting the KG triples using standard datasets over two KGs: DBpedia, and Open Research Knowledge Graph (ORKG). Our results demonstrate the effectiveness of Plumber in dynamically generating KG information extraction pipelines, outperforming all baselines agnostics of the underlying KG. Furthermore, we provide an analysis of collective failure cases, study the similarities and synergies among integrated components, and discuss their limitations. eng
dc.language.iso eng
dc.publisher New York, NY : Springer
dc.relation.ispartof Web Engineering. ICWE 2021
dc.relation.ispartofseries Lecture notes in computer science ; 12706
dc.rights This document may be downloaded, read, stored and printed for your own use within the limits of § 53 UrhG but it may not be distributed on other websites via the internet or passed on to external parties. eng
dc.rights Dieses Dokument darf im Rahmen von § 53 UrhG zum eigenen Gebrauch kostenfrei heruntergeladen, gelesen, gespeichert und ausgedruckt, aber nicht auf anderen Webseiten im Internet bereitgestellt oder an Außenstehende weitergegeben werden. ger
dc.subject Information extraction eng
dc.subject NLP pipelines eng
dc.subject Semantic search eng
dc.subject Semantic Web eng
dc.subject Software reusability eng
dc.subject.classification Konferenzschrift ger
dc.subject.ddc 620 | Ingenieurwissenschaften und Maschinenbau
dc.title Better Call the Plumber: Orchestrating Dynamic Information Extraction Pipelines eng
dc.type BookPart
dc.type Text
dc.relation.essn 1611-3349
dc.relation.isbn 978-3-030-74296-6
dc.relation.issn 0302-9743
dc.relation.doi https://doi.org/10.1007/978-3-030-74296-6_19
dc.bibliographicCitation.firstPage 240
dc.bibliographicCitation.lastPage 254
dc.description.version acceptedVersion eng
tib.accessRights frei zug�nglich


Die Publikation erscheint in Sammlung(en):

Zur Kurzanzeige

 

Suche im Repositorium


Durchblättern

Mein Nutzer/innenkonto

Nutzungsstatistiken