Better Call the Plumber: Orchestrating Dynamic Information Extraction Pipelines

Jaradeh, Mohamad Yaser; Singh, Kuldeep; Stocker, Markus; Both, Andreas; Auer, Sören

Startseite
→
Forschungseinrichtungen
→
Forschungszentren
→
Dokumentanzeige

dc.identifier.uri	http://dx.doi.org/10.15488/16804
dc.identifier.uri	https://www.repo.uni-hannover.de/handle/123456789/16931
dc.contributor.author	Jaradeh, Mohamad Yaser
dc.contributor.author	Singh, Kuldeep
dc.contributor.author	Stocker, Markus
dc.contributor.author	Both, Andreas
dc.contributor.author	Auer, Sören
dc.contributor.editor	Brambilla, M.
dc.contributor.editor	Chbeir, R.
dc.contributor.editor	Frasincar, F.
dc.contributor.editor	Manolescu, I.
dc.date.accessioned	2024-03-26T09:31:15Z
dc.date.available	2024-03-26T09:31:15Z
dc.date.issued	2021
dc.identifier.citation	Jaradeh, M.Y.; Singh, K.; Stocker, M.; Both, A.; Auer, S.: Better Call the Plumber: Orchestrating Dynamic Information Extraction Pipelines. In: Brambilla, M.; Chbeir, R.; Frasincar, F.; Manolescu, I. (Eds.): Web Engineering. ICWE 2021. New York, NY : Springer, 2021 (Lecture notes in computer science ; 12706), S. 240-254. DOI: https://doi.org/10.1007/978-3-030-74296-6_19
dc.description.abstract	We propose Plumber, the first framework that brings together the research community’s disjoint information extraction (IE) efforts. The Plumber architecture comprises 33 reusable components for various Knowledge Graphs (KG) information extraction subtasks, such as coreference resolution, entity linking, and relation extraction. Using these components, Plumber dynamically generates suitable information extraction pipelines and offers overall 264 distinct pipelines. We study the optimization problem of choosing suitable pipelines based on input sentences. To do so, we train a transformer-based classification model that extracts contextual embeddings from the input and finds an appropriate pipeline. We study the efficacy of Plumber for extracting the KG triples using standard datasets over two KGs: DBpedia, and Open Research Knowledge Graph (ORKG). Our results demonstrate the effectiveness of Plumber in dynamically generating KG information extraction pipelines, outperforming all baselines agnostics of the underlying KG. Furthermore, we provide an analysis of collective failure cases, study the similarities and synergies among integrated components, and discuss their limitations.	eng
dc.language.iso	eng
dc.publisher	New York, NY : Springer
dc.relation.ispartof	Web Engineering. ICWE 2021
dc.relation.ispartofseries	Lecture notes in computer science ; 12706
dc.rights	This document may be downloaded, read, stored and printed for your own use within the limits of § 53 UrhG but it may not be distributed on other websites via the internet or passed on to external parties.	eng
dc.rights	Dieses Dokument darf im Rahmen von § 53 UrhG zum eigenen Gebrauch kostenfrei heruntergeladen, gelesen, gespeichert und ausgedruckt, aber nicht auf anderen Webseiten im Internet bereitgestellt oder an Außenstehende weitergegeben werden.	ger
dc.subject	Information extraction	eng
dc.subject	NLP pipelines	eng
dc.subject	Semantic search	eng
dc.subject	Semantic Web	eng
dc.subject	Software reusability	eng
dc.subject.classification	Konferenzschrift	ger
dc.subject.ddc	620 \| Ingenieurwissenschaften und Maschinenbau
dc.title	Better Call the Plumber: Orchestrating Dynamic Information Extraction Pipelines	eng
dc.type	BookPart
dc.type	Text
dc.relation.essn	1611-3349
dc.relation.isbn	978-3-030-74296-6
dc.relation.issn	0302-9743
dc.relation.doi	https://doi.org/10.1007/978-3-030-74296-6_19
dc.bibliographicCitation.firstPage	240
dc.bibliographicCitation.lastPage	254
dc.description.version	acceptedVersion	eng
tib.accessRights	frei zug�nglich