FrameNet Lexical Database: Presenting a Few Frames Within the Risk Domain

Type
Journal article
Version
Published version
Language
English
Creator
Aleksandra Marković, Ranka Stanković, Natalija Tomić, Olivera Kitanović
Source
Infotheca
Publisher
Faculty of Philology, University of Belgrade
Date issued
2021
Abstract
This paper gives a short overview of frame semantics, the theory that underlies the Berkeley FrameNet lexical database. We present the basic concepts of the database and the possibilities for applying it, including its implementation in Serbian. We also examine the lexical analysis used in the FrameNet development project and point out the differences between frame-based lexical analysis and its word-based counterpart. This is followed by an illustration of several related frames evoked by words from the risk domain. We also present NLTK (Natural Language Toolkit), a platform whose Python API gives access to various language resources, FrameNet among them. The final chapter analyzes the noun risk in a mining corpus: its most common collocates, its word sketch, concordances for individual patterns, a thesaurus of its synonyms and related words, collocation frequency graphs, and a word cloud.
Volume
21
Issue
1
Start page
7
End page
33
doi
10.18485/infotheca.2021.21.1.1
issn
1450-9687
Subject
Serbian language, frame semantics, FrameNet, risk scenario, mining corpus, natural language processing
Broader category
M50
Narrower category
M53
Rights
Open access
License
Creative Commons – Attribution-Share Alike 4.0 International
Format
.pdf

Aleksandra Marković, Ranka Stanković, Natalija Tomić, Olivera Kitanović. "FrameNet Lexical Database: Presenting a Few Frames Within the Risk Domain" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.1.1

This item was submitted on 22 November 2021 by [anonymous user] using the form “Рад у часопису” (Journal article) on the site “Радови”: http://dr.rgf.bg.ac.rs/s/repo