Resorting to Context-Aware Background Knowledge for Unveiling Semantically Related Social Media Posts

Download statistics - Document (COUNTER):

Sakor, A.; Singh, K.; Vidal, M.-E.: Resorting to Context-Aware Background Knowledge for Unveiling Semantically Related Social Media Posts. In: IEEE access : practical research, open solutions 10 (2022), S. 115351-115371. DOI: https://doi.org/10.1109/access.2022.3217492

Repository version

To cite the version in the repository, please use this identifier: https://doi.org/10.15488/13678

Selected time period:

year: 
month: 

Sum total of downloads: 66




Thumbnail
Abstract: 
Social media networks have become a prime source for sharing news, opinions, and research accomplishments in various domains, and hundreds of millions of posts are announced daily. Given this wealth of information in social media, finding related announcements has become a relevant task, particularly in trending news (e.g., COVID-19 or lung cancer). To facilitate the search of connected posts, social networks enable users to annotate their posts, e.g., with hashtags in tweets. Albeit effective, an annotation-based search is limited because results will only include the posts that share the same annotations. This paper focuses on retrieving context-related posts based on a specific topic, and presents PINYON, a knowledge-driven framework, that retrieves associated posts effectively. PINYON implements a two-fold pipeline. First, it encodes, in a graph, a CORPUS of posts and an input post; posts are annotated with entities for existing knowledge graphs and connected based on the similarity of their entities. In a decoding phase, the encoded graph is used to discover communities of related posts. We cast this problem into the Vertex Coloring Problem, where communities of similar posts include the posts annotated with entities colored with the same colors. Built on results reported in the graph theory, PINYON implements the decoding phase guided by a heuristic-based method that determines relatedness among posts based on contextual knowledge, and efficiently groups the most similar posts in the same communities. PINYON is empirically evaluated on various datasets and compared with state-of-the-art implementations of the decoding phase. The quality of the generated communities is also analyzed based on multiple metrics. The observed outcomes indicate that PINYON accurately identifies semantically related posts in different contexts. Moreover, the reported results put in perspective the impact of known properties about the optimality of existing heuristics for vertex graph coloring and their implications on PINYON scalability.
License of this version: CC BY 4.0 Unported
Document Type: Article
Publishing status: publishedVersion
Issue Date: 2022
Appears in Collections:Fakultät für Elektrotechnik und Informatik
Zentrale Einrichtungen
Forschungszentren

distribution of downloads over the selected time period:

downloads by country:

pos. country downloads
total perc.
1 image of flag of Germany Germany 25 37.88%
2 image of flag of United States United States 14 21.21%
3 image of flag of Netherlands Netherlands 12 18.18%
4 image of flag of Russian Federation Russian Federation 4 6.06%
5 image of flag of Lebanon Lebanon 2 3.03%
6 image of flag of Iran, Islamic Republic of Iran, Islamic Republic of 2 3.03%
7 image of flag of Canada Canada 2 3.03%
8 image of flag of Zimbabwe Zimbabwe 1 1.52%
9 image of flag of Indonesia Indonesia 1 1.52%
10 image of flag of France France 1 1.52%
    other countries 2 3.03%

Further download figures and rankings:


Hinweis

Zur Erhebung der Downloadstatistiken kommen entsprechend dem „COUNTER Code of Practice for e-Resources“ international anerkannte Regeln und Normen zur Anwendung. COUNTER ist eine internationale Non-Profit-Organisation, in der Bibliotheksverbände, Datenbankanbieter und Verlage gemeinsam an Standards zur Erhebung, Speicherung und Verarbeitung von Nutzungsdaten elektronischer Ressourcen arbeiten, welche so Objektivität und Vergleichbarkeit gewährleisten sollen. Es werden hierbei ausschließlich Zugriffe auf die entsprechenden Volltexte ausgewertet, keine Aufrufe der Website an sich.

Search the repository


Browse