dc.identifier.uri |
http://dx.doi.org/10.15488/16804 |
|
dc.identifier.uri |
https://www.repo.uni-hannover.de/handle/123456789/16931 |
|
dc.contributor.author |
Jaradeh, Mohamad Yaser
|
|
dc.contributor.author |
Singh, Kuldeep
|
|
dc.contributor.author |
Stocker, Markus
|
|
dc.contributor.author |
Both, Andreas
|
|
dc.contributor.author |
Auer, Sören
|
|
dc.contributor.editor |
Brambilla, M.
|
|
dc.contributor.editor |
Chbeir, R.
|
|
dc.contributor.editor |
Frasincar, F.
|
|
dc.contributor.editor |
Manolescu, I.
|
|
dc.date.accessioned |
2024-03-26T09:31:15Z |
|
dc.date.available |
2024-03-26T09:31:15Z |
|
dc.date.issued |
2021 |
|
dc.identifier.citation |
Jaradeh, M.Y.; Singh, K.; Stocker, M.; Both, A.; Auer, S.: Better Call the Plumber: Orchestrating Dynamic Information Extraction Pipelines. In: Brambilla, M.; Chbeir, R.; Frasincar, F.; Manolescu, I. (Eds.): Web Engineering. ICWE 2021. New York, NY : Springer, 2021 (Lecture notes in computer science ; 12706), S. 240-254. DOI: https://doi.org/10.1007/978-3-030-74296-6_19 |
|
dc.description.abstract |
We propose Plumber, the first framework that brings together the research community’s disjoint information extraction (IE) efforts. The Plumber architecture comprises 33 reusable components for various Knowledge Graphs (KG) information extraction subtasks, such as coreference resolution, entity linking, and relation extraction. Using these components, Plumber dynamically generates suitable information extraction pipelines and offers overall 264 distinct pipelines. We study the optimization problem of choosing suitable pipelines based on input sentences. To do so, we train a transformer-based classification model that extracts contextual embeddings from the input and finds an appropriate pipeline. We study the efficacy of Plumber for extracting the KG triples using standard datasets over two KGs: DBpedia, and Open Research Knowledge Graph (ORKG). Our results demonstrate the effectiveness of Plumber in dynamically generating KG information extraction pipelines, outperforming all baselines agnostics of the underlying KG. Furthermore, we provide an analysis of collective failure cases, study the similarities and synergies among integrated components, and discuss their limitations. |
eng |
dc.language.iso |
eng |
|
dc.publisher |
New York, NY : Springer |
|
dc.relation.ispartof |
Web Engineering. ICWE 2021 |
|
dc.relation.ispartofseries |
Lecture notes in computer science ; 12706 |
|
dc.rights |
This document may be downloaded, read, stored and printed for your own use within the limits of § 53 UrhG but it may not be distributed on other websites via the internet or passed on to external parties. |
eng |
dc.rights |
Dieses Dokument darf im Rahmen von § 53 UrhG zum eigenen Gebrauch kostenfrei heruntergeladen, gelesen, gespeichert und ausgedruckt, aber nicht auf anderen Webseiten im Internet bereitgestellt oder an Außenstehende weitergegeben werden. |
ger |
dc.subject |
Information extraction |
eng |
dc.subject |
NLP pipelines |
eng |
dc.subject |
Semantic search |
eng |
dc.subject |
Semantic Web |
eng |
dc.subject |
Software reusability |
eng |
dc.subject.classification |
Konferenzschrift |
ger |
dc.subject.ddc |
620 | Ingenieurwissenschaften und Maschinenbau
|
|
dc.title |
Better Call the Plumber: Orchestrating Dynamic Information Extraction Pipelines |
eng |
dc.type |
BookPart |
|
dc.type |
Text |
|
dc.relation.essn |
1611-3349 |
|
dc.relation.isbn |
978-3-030-74296-6 |
|
dc.relation.issn |
0302-9743 |
|
dc.relation.doi |
https://doi.org/10.1007/978-3-030-74296-6_19 |
|
dc.bibliographicCitation.firstPage |
240 |
|
dc.bibliographicCitation.lastPage |
254 |
|
dc.description.version |
acceptedVersion |
eng |
tib.accessRights |
frei zug�nglich |
|