Претрага ⚒ Радови ⚒ Др РГФ - Репозиторијум РГФ

Претрага

Per page

Sort by

28 items

Српски језик у дигиталном добу -- The Serbian Language in the Digital Age

Duško Vitas, Ljubomir Popović, Cvetana Krstev, Ivan Obradović, Gordana Pavlović-Lažetić, Mladen Stanojević (2012)

Serbian language

Duško Vitas, Ljubomir Popović, Cvetana Krstev, Ivan Obradović, Gordana Pavlović-Lažetić, Mladen Stanojević. "Српски језик у дигиталном добу -- The Serbian Language in the Digital Age" in META-NET White Paper Series, G. Rehm, H. Uszkoreit (eds.), Springer (2012) M12
WS4LR - a Worksation for Lexical Resources

Cvetana Krstev, Ranka Stanković, Duško Vitas, Ivan Obradović (2006)

Lexical Resources, Wordnet, Serbian

Cvetana Krstev, Ranka Stanković, Duško Vitas, Ivan Obradović. "WS4LR - a Worksation for Lexical Resources" in Proceedings of the Fifth Interantional Conference on Language Resources and Evaluation, Genoa, Italy, May 2006, ELRA - European Language Resources Association (2006) М33
An Italian-Serbian Sentence Aligned Parallel Literary Corpus

Saša Moderc, Ranka Stanković, Aleksandra Tomašević, Mihailo Škorić (2023)

This article presents the construction and relevance of an Italian-Serbian sentence-aligned parallel corpus, delving into the aligned sentences in order to facilitate effective translation between the two languages. The parallel corpus serves as a valuable resource for language experts, researchers, and language enthusiasts, fostering a deeper understanding of linguistic nuances and cultural expressions. By bridging the gap between Serbian and Italian, this corpus opens new avenues for cross-cultural communication and collaboration, and ultimately contributes to the improvement of language-related ...

Aligned corpus, parallel corpus, Serbian, Italian, literature

Saša Moderc, Ranka Stanković, Aleksandra Tomašević, Mihailo Škorić. "An Italian-Serbian Sentence Aligned Parallel Literary Corpus" in Review of the National Center for Digitization, Belgrade : Faculty of Mathematics, University of Belgrade (2023). https://doi.org/10.5281/zenodo.11203388 М53
The Dictionary of the Serbian Academy: from the Text to the Lexical Database

Ranka Stanković, Rada Stijović, Duško Vitas, Cvetana Krstev, Olga Sabo (2018)

In this paper we discuss the project of digitization of the Dictionary of the Serbo-Croatian Standard and Vernacular Language. Scanning and character recognition were a particular challenge, since various non-standard character set encoding was used in the course of the almost 60-year long production of the dictionary. The first aim of the project was to formalize the micro-structure of the dictionary articles in order to parse the digitized text of and transform it into structured data stored in relational lexical database. This approach ...

computer lexicography, lexical database, language resources, dictionary, Serbian language

Ranka Stanković, Rada Stijović, Duško Vitas, Cvetana Krstev, Olga Sabo. "The Dictionary of the Serbian Academy: from the Text to the Lexical Database" in Proceedings of the XVIII EURALEX International Congress: Lexicography in Global Contexts, Ljubljana : Ljubljana University Press, Faculty of Arts (2018) M33
Vebran Web Services for Corpus Query Expansion

Ranka Stanković, Miloš Utvić (2020)

U ovom radu se govori o razvoju veb usluga Vebran i njihovoj primeni u poboljšanju pretraživanja korpusa. Veb-servisi Vebran koriste se za konsultovanje spoljnih leksičkih izvora za srpski jezik (uglavnom elektronski morfološki rečnici i srpski Vordnet) i proširivanje korisničkih upita radi dobijanja relevantnijih rezultata iz srpskih korpusa.

corpus search, web service, Serbian lexical resources, query expansion

Ranka Stanković, Miloš Utvić. "Vebran Web Services for Corpus Query Expansion" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.5 М53
Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian

Ranka Stanković, Branislava Šandrih, Cvetana Krstev, Miloš Utvić, Mihailo Škorić (2020)

The training of new tagger models for Serbian is primarily motivated by the enhancement of the existing tagset with the grammatical category of a gender. The harmonization of resources that were manually annotated within different projects over a long period of time was an important task, enabled by the development of tools that support partial automation. The supporting tools take into account different taggers and tagsets. This paper focuses on TreeTagger and spaCy taggers, and the annotation schema alignment ...

Part-of-Speech tagging, lemmatization, corpus, evaluation, Serbian, morphological dictionary

Ranka Stanković, Branislava Šandrih, Cvetana Krstev, Miloš Utvić, Mihailo Škorić. "Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian" in Proceedings of the 12th Language Resources and Evaluation Conference, May Year: 2020, Marseille, France, European Language Resources Association (2020) М33
The Many Faces of SrpKor

Duško Vitas, Ranka Stanković, Cvetana Krstev (2024.)

Акроним СрпКор означава фамилију електронских корпуса савременог српског језика чија је изградња почела крајем седамдесетих година прошлога века, а која је постала шире видљива заинтересованој истраживачкој заједници објављивањем његове прве верзије на вебу 2002. године. У овом дугом периоду, посебно пре појаве корисних текстуелних ресурса на вебу, развој корпуса се састојао у прикупљању и обради грађе као и у развоју метода обраде корпуса. Наиме, електронски корпус није само колекција текстова у дигиталном облику (како се то, на пример, наводи ...

СрпКор, корпуси, српски, лематизација, Лексимирка

Duško Vitas, Ranka Stanković, Cvetana Krstev. "The Many Faces of SrpKor" in South Slavic Languages in the Digital Environment JuDig Book of Abstracts, University of Belgrade - Faculty of Philology, Serbia, November 21-23, 2024, University of Belgrade - Faculty of Philology (2024.) М64
Automatic construction of a morphological dictionary of multi-word units

Cvetana Krstev, Ranka Stanković, Ivan Obradović, Duško Vitas, Miloš Utvić (2010)

The development of a comprehensive morphological dictionary of multi-word units for Serbian is a very demanding task, due to the complexity of Serbian morphology. Manual production of such a dictionary proved to be extremely time-consuming. In this paper we present a procedure that automatically produces dictionary lemmas for a given list of multi-word units. To accomplish this task the procedure relies on data in e-dictionaries of Serbian simple words, which are already well developed. We also offer an evaluation ...

electronic dictionary, Serbian, morphology, inflection, multiwordn units, noun phrases, query expansion

Cvetana Krstev, Ranka Stanković, Ivan Obradović, Duško Vitas, Miloš Utvić. "Automatic construction of a morphological dictionary of multi-word units" in Lecture Notes in Computer Science 6233, Advances in Natural Language Processing, Proceedings of the 7thInternational Conference on NLP, IceTAL 2010, Reykjavik, Iceland, August 2010, Springer (2010): 226-237. https://doi.org/10.1007/978-3-642-14770-8_26 M14
Frequency and Length of Syllables in Serbian

Marija Radojičić, Biljana Lazić, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović, Ján Mačutek, Lívia Leššová (2019)

Basic analyses of several properties of syllables (the rank-frequency distribution, the distribution of length, and the relation between length and frequency) in Serbian is presented. The syllabification algorithm used combines the maximum onset principle and the sonority hierarchy. Results indicate that syllables behave similarly to words as far as mathematical models are concerned, but values of parameters in models for syllables are quite different from those for words.

frekvencije slogova, dužina slogova, srpski jezik

Marija Radojičić, Biljana Lazić, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović, Ján Mačutek, Lívia Leššová. "Frequency and Length of Syllables in Serbian" in Glottometrics (2019) М24
Production of morphological dictionaries of multi-word units using a multipurpose tool

Ranka Stanković, Ivan Obradović, Cvetana Krstev, Duško Vitas (2011)

The development of a comprehensive morphological dictionary of multi-word units for Serbian is a very demanding task, due to the complexity of Serbian morphology. Manual production of such a dictionary proved to be extremely time-consuming. In this paper we present a procedure that automatically produces dictionary lemmas for a given list of multi-word units. To accomplish this task the procedure relies on data in e-dictionaries of Serbian simple words, which are already well developed. We also offer an evaluation ...

electronic dictionary, Serbian, morphology, inﬂection, multi-word units, noun phrases, query expansion

Ranka Stanković, Ivan Obradović, Cvetana Krstev, Duško Vitas. "Production of morphological dictionaries of multi-word units using a multipurpose tool" in Proceedings of the Computational Linguistics-Applications Conference, October 2011, Jachranka, Poland, Jachranka, Poland : PTI - Polish Information Processing Society (2011) M33
Contrastive Analysis of Syntax Patterns in Comparable Football Corpora in Spanish and Serbian Languages

Jelena Lazarević, Olivera Kitanović (2024.)

Cilj rada je istraživanje kolokabilnosti kao načina na koji se leksičke jedinice povezuju sa rečima iz različitih kategorija, formirajući veće jedinice. Istraživanje semantičkih i sintaksičkih principa ovih kombinacija u španskom i srpskom jeziku fudbala izvedeno je na komparabilnim fudbalskim korpusima SrFudKo i EsFudko, razvijenim u okviru doktorske disertacije Jelene Lazarević pod nazivom: Jezičke odlike diskursa novih medija o fudbalu: kontrastivna analiza na korpusu srpskog i španskog jezika. Korpus fudbala SrFudKo, kreiran na osnovu tekstova o fudbalu sa pet srpskih veb-portala: ...

fudbal, korpusi, terminologija, kolokacije, srpski, španski

Jelena Lazarević, Olivera Kitanović . "Contrastive Analysis of Syntax Patterns in Comparable Football Corpora in Spanish and Serbian Languages" in South Slavic Languages in the Digital Environment JuDig Book of Abstracts, University of Belgrade - Faculty of Philology, Serbia, November 21-23, 2024, University of Belgrade - Faculty of Philology (2024.) М64
SrpELTeC: A Serbian Literary Corpus for Distant Reading

Ranka Stanković, Cvetana Krstev, Duško Vitas (2024)

U članku je predstavljen SrpELTeC, korpus razvijen u okviru akcije COST Distant Reading for European Literary History (CA16204). Svi romani u SrpELTeC-u su odabrani, pripremljeni i obeleženi korišćenjem zajedničkih principa uspostavljenih za sve jezičke zbirke u Evropskoj zbirci književnog teksta (ELTeC). Navedeni su izazovi i rešenja u pripremi SrpELTeC od nule. Svi romani su ručno kodirani u TEI sa bogatim metapodacima i strukturnim napomenama. Automatska anotacija je uključivala POS-označavanje, lematizaciju i imenovane entitete, oslanjajući se na resurse za obradu ...

digital humanities, Serbian literature, text corpora, distant reading , linked data, named entity recognition, text analytics

Ranka Stanković, Cvetana Krstev, Duško Vitas. "SrpELTeC: A Serbian Literary Corpus for Distant Reading" in Primerjalna književnost, Research Centre of the Slovenian Academy of Sciences and Arts (2024). https://doi.org/10.3986/pkn.v47.i2.03 М23
A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian

Danka Jokić, Ranka Stanković, Cvetana Krstev, Branislava Šandrih (2021)

Uvredljivi govor na društvenim medijima, uključujući psovke, pogrdni govor i govor mržnje, dostigao je nivo pandemije. Sistem koji bi bio u stanju da detektuje takve tekstove mogao bi da pomogne da internet i društveni mediji postanu bolji virtuelni prostor sa više poštovanja. Istraživanja i komercijalna primena u ovoj oblasti do sada su bili fokusirani uglavnom na engleski jezik. Ovaj rad predstavlja rad na izgradnji AbCoSER-a, prvog korpusa uvredljivog govora na srpskom jeziku. Korpus se sastoji od 6.436 ručno označenih ...

uvredljivi jezik, govor mržnje, srpski, tviter, leksikon, korpus

Danka Jokić, Ranka Stanković, Cvetana Krstev, Branislava Šandrih. "A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian" in 3rd Conference on Language, Data and Knowledge (LDK 2021), MDPI AG (2021). https://doi.org/10.4230/OASIcs.LDK.2021.13 М33
Development and Evaluation of Three Named Entity Recognition Systems for Serbian - The Case of Personal Names

Branislava Šandrih, Cvetana Krstev, Ranka Stanković (2019)

In this paper we present a rule- and lexicon-based system for the recognition of Named Entities (NE) in Serbian news paper texts that was used to prepare a gold standard annotated with personal names. It was further used to prepare training sets for four different levels of annota tion, which were further used to train two Named Entity Recognition (NER) sys tems: Stanford and spaCy. All obtained models, together with a rule- and lexicon based system were evaluated on ...

NER, Named Entity Recognition Systems, Serbian, Personal Names

Branislava Šandrih, Cvetana Krstev, Ranka Stanković. "Development and Evaluation of Three Named Entity Recognition Systems for Serbian - The Case of Personal Names" in Proceedings - Natural Language Processing in a Deep Learning World, Incoma Ltd., Shoumen, Bulgaria (2019). https://doi.org/10.26615/978-954-452-056-4_122 М33
Повезивање лексема морфолошких речника коришћењем базе Лексимирка

Биљана Рујевић, Ранка Станковић, Михаило Шкорић (2024)

Рад приказује приступ успостављању повезивања лексема у Морфолошким речницима српског језика. Повезивање, тј. успостављање релација не би било могуће без претходне конверзије речника из облика текстуалних датотека у облик лексичке базе података назване Лексимирка. Методологија за успостављање релација почива на 69 појединачних релација заснованих на 388 правила. Правила за повезивање се дефинишу на основу обележја лексичких записа (врсте речи, маркера, граматичких категорија и подниски). Успостављене релације су крајњем кориснику видљиве путем апликације Лексимирка у форми хипервеза и могу се ...

морфолошки речници, повезивање лексема, лексичка база података, српски језик

Биљана Рујевић, Ранка Станковић, Михаило Шкорић. "Повезивање лексема морфолошких речника коришћењем базе Лексимирка" in Модерни речници у функцији просечнога корисника: стари проблеми, савремени правци и нови изазови, Лексикографски сусрети, Београд, 27-29. мај 2024. , Београд : Филолошки факултет (2024). https://doi.org/10.18485/lexicog_meet.2024.1.ch23 М33
Understanding partitioning of deformation in highly arcuate orogenic systems: Inferences from the evolution of the Serbian Carpathians

Nemanja Krstekanić, Liviu Matenco, Marinko Toljić, Oleg Mandić, Uroš Stojadinović, Ernst Willingshofer (2020)

orogeneza, oroklini, višesmerna ekstenzija, transkurentna kretanja, Srpski Karpati

Nemanja Krstekanić, Liviu Matenco, Marinko Toljić, Oleg Mandić, Uroš Stojadinović, Ernst Willingshofer. "Understanding partitioning of deformation in highly arcuate orogenic systems: Inferences from the evolution of the Serbian Carpathians" in Global and Planetary Change, Elsevier BV (2020). https://doi.org/10.1016/j.gloplacha.2020.103361 М21а
Witness of the history: A hundred years old the geological hammer of Jovan Zujovic

Ljupko Rundić (2021)

Током обележавања тридесет година постојања и рада Српског геолошког друштва 10. фебруара 1921. године, у знак великог поштовања према академику Јовану Жујовићу, председнику и оснивачу Српског геолошког друштва и српске геолошке школе, чланови СГД поклонили су му јединствени геолошки чекић са угравираном посветом и својим потписима. Током протеклих стотину година, многе генерације геолога проналазиле су инспирацију гледајући чекић и делећи ову причу с великим пијететом. Данас, када геолози посећују Спомен собу геологије (Рударско-геолошки факултет, ул. Каменичка бр. 6), где се пажљиво ...

Геолошки чекић, Јован Жујовић, Српско геолошко друштво, 1921–2021

Ljupko Rundić. "Witness of the history: A hundred years old the geological hammer of Jovan Zujovic" in Annales Geologiques de la Peninsule Balkanique, National Library of Serbia (2021). https://doi.org/10.2298/GABP210607004R М24
Речници у дигиталном добу - информатичка подршка за српски језик

Биљана Рујевић (2022)

Морфолошки речници српског језика представљају електронски језички ресурс који има значајну историју развоја и коришћења за потребе обраде природних језика. С обзиром на то да су чувани у облику датотека чији је број нарастао па је самим тим управљање речницима постало отежано јавила се потреба за смештањем информација из речника у облик лексикографске базе. Како би се омогућио симултани рад на развоју речника за више корисника јавила се потреба за веб-апликацијом заснованој на лексикографској бази. Како би се размотриле ...

електронски речници, лексикографска база података, лексички ресурси, српски језик

Биљана Рујевић. Речници у дигиталном добу - информатичка подршка за српски језик, Београд : [Б. Рујевић], 2022 M70
Advancing Sentiment Analysis in Serbian Literature: A Zero and Few-Shot Learning Approach Using the Mistral Model

Milica Ikonić Nešić, Saša Petalinkar, Mihailo Škorić, Ranka Stanković, Biljana Rujević (2024)

Ova studija predstavlja analizu sentimenta srpskih starih romana iz perioda 1840-1920, koristeći veliki jezički model (LLM) Mistral za tehniku učenja sa zasnovani na takozvanim "zero" i "few-shot" pokušajima. Glavni pristup uvodi inovacije osmišljavanjem istraživačkih upita (promptova) uključuju tekst sa uputstvom za klasifikaciju bez primera i na osnovu nekoliko primera, omogućavajući jezičkom modelu da klasifikuje osećanja u pozitivne, negativne ili objektivne kategorije. Ova metodologija ima za cilj da pojednostavi analizu osećanja ograničavanjem odgovora, čime se povećava preciznost ...

zero-shot, few-shot, sentiment, Serbian, Mistral model

Milica Ikonić Nešić, Saša Petalinkar, Mihailo Škorić, Ranka Stanković, Biljana Rujević. "Advancing Sentiment Analysis in Serbian Literature: A Zero and Few-Shot Learning Approach Using the Mistral Model" in In Proceedings of the Sixth International Conference on Computational Linguistics in Bulgaria (CLIB 2024), BAS (2024) М33
Social-Emo.Sr: Emotional Multi-Label Categorization of Conversational Messages from Social Networks X and Reddit

Milena Šošić, Ranka Stanković, Jelena Graovac (2024)

U digitalnom okruženju južnoslovenskih jezika, analiza emocija u tekstovima na društvenim mrežama postaje sve važnija za razumevanje javnog mnjenja, kreiranje personalizovanog sadržaja i analizu međusobnih interakcija korisnika. U okviru ovog rada predstavljamo detaljnu metodologiju i rezultate označavanja korpusa na srpskom jeziku prema Plutčikovom modelu kategorizacije, koji prepoznaje osam osnovnih emocionalnih kategorija, kao što su radost, tuga, bes, strah, poverenje, gađenje, iščekivanje i iznenađenje. Cilj istraživanja je da se analizira emocionalni sadržaj tekstova preuzetih sa društvenih mreža X (nekada Twitter) ...

emocije, Plutčikov model, označavanje, korpus, društvene mreže, srpski jezik

Milena Šošić, Ranka Stanković, Jelena Graovac. "Social-Emo.Sr: Emotional Multi-Label Categorization of Conversational Messages from Social Networks X and Reddit" in South Slavic Languages in the Digital Environment JuDig Book of Abstracts, University of Belgrade - Faculty of Philology, Serbia, November 21-23, 2024., University of Belgrade - Faculty of Philology (2024) М64

Претрага

28 items

Српски језик у дигиталном добу -- The Serbian Language in the Digital Age cite

WS4LR - a Worksation for Lexical Resources cite

An Italian-Serbian Sentence Aligned Parallel Literary Corpus cite

The Dictionary of the Serbian Academy: from the Text to the Lexical Database cite

Vebran Web Services for Corpus Query Expansion cite

Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian cite

The Many Faces of SrpKor cite

Automatic construction of a morphological dictionary of multi-word units cite

Frequency and Length of Syllables in Serbian cite

Production of morphological dictionaries of multi-word units using a multipurpose tool cite

Contrastive Analysis of Syntax Patterns in Comparable Football Corpora in Spanish and Serbian Languages cite

SrpELTeC: A Serbian Literary Corpus for Distant Reading cite

A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian cite

Development and Evaluation of Three Named Entity Recognition Systems for Serbian - The Case of Personal Names cite

Повезивање лексема морфолошких речника коришћењем базе Лексимирка cite

Understanding partitioning of deformation in highly arcuate orogenic systems: Inferences from the evolution of the Serbian Carpathians cite

Witness of the history: A hundred years old the geological hammer of Jovan Zujovic cite

Речници у дигиталном добу - информатичка подршка за српски језик cite

Advancing Sentiment Analysis in Serbian Literature: A Zero and Few-Shot Learning Approach Using the Mistral Model cite

Social-Emo.Sr: Emotional Multi-Label Categorization of Conversational Messages from Social Networks X and Reddit cite

Српски језик у дигиталном добу -- The Serbian Language in the Digital Age

WS4LR - a Worksation for Lexical Resources

An Italian-Serbian Sentence Aligned Parallel Literary Corpus

The Dictionary of the Serbian Academy: from the Text to the Lexical Database

Vebran Web Services for Corpus Query Expansion

Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian

The Many Faces of SrpKor

Automatic construction of a morphological dictionary of multi-word units

Frequency and Length of Syllables in Serbian

Production of morphological dictionaries of multi-word units using a multipurpose tool

Contrastive Analysis of Syntax Patterns in Comparable Football Corpora in Spanish and Serbian Languages

SrpELTeC: A Serbian Literary Corpus for Distant Reading

A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian

Development and Evaluation of Three Named Entity Recognition Systems for Serbian - The Case of Personal Names

Повезивање лексема морфолошких речника коришћењем базе Лексимирка

Understanding partitioning of deformation in highly arcuate orogenic systems: Inferences from the evolution of the Serbian Carpathians

Witness of the history: A hundred years old the geological hammer of Jovan Zujovic

Речници у дигиталном добу - информатичка подршка за српски језик

Advancing Sentiment Analysis in Serbian Literature: A Zero and Few-Shot Learning Approach Using the Mistral Model

Social-Emo.Sr: Emotional Multi-Label Categorization of Conversational Messages from Social Networks X and Reddit