Претрага
116 items
-
It-Sr-NER: CLARIN Compatible NER and Geoparsing Web Services for Italian and Serbian Parallel Text
Olja Perišić, Ranka Stanković, Milica Ikonić Nešić, Mihailo Škorić. "It-Sr-NER: CLARIN Compatible NER and Geoparsing Web Services for Italian and Serbian Parallel Text" in Linköping Electronic Conference Proceedings, Linköping University Electronic Press (2023). https://doi.org/10.3384/ecp198010 М33
-
It-Sr-NER: Web Services for Recognizing and Linking Named Entities in Text and Displaying Them on a Web Map
The paper will present the results of the project `“It-Sr-NER: Web services for named entities recognition, linking and mapping,” in which teams from the University of Turin and the Society for Language Resources and Technologies JeRTeh participated, and whose goal was the development of the It-Sr-NER web service for named entity annotations in the text and displaying them on the map. Named entities in these services are names of persons, places, organizations, demonyms (ethnicities), events and works of art.Olja Perišić, Ranka Stanković, Milica Ikonić Nešić, Mihailo Škorić. "It-Sr-NER: Web Services for Recognizing and Linking Named Entities in Text and Displaying Them on a Web Map" in Infotheca, Belgrade : Faculty of Philology, University of Belgrade (2023). https://doi.org/10.18485/infotheca.2023.23.1.3 М53
-
Serbian ELTeC Sub-Collection in Wikidata
This paper presents an example of integration of Wikidata with digital libraries and external systems, as well as some best practices for speeding up the process of data preparation and import to Wikidata, on the use case of SrpELTeC, Serbian subcollection of the ELTeC multilingual collection (European Literary Text Collection). After preliminary work on the manual Wikidata population with SrpELTeC novels, the goal was to automate the process of preparing and importing information, so different solutions were analysed and ...Milica Ikonić Nešić, Ranka Stanković, Biljana Rujević. "Serbian ELTeC Sub-Collection in Wikidata" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.2.4 М53
-
Нове технологије за оживљавање старих текстова
удаљено читање, књижевни корпус, обрада српског језика, анотација врстом речи, лематизација, именовани ентитетиЦветана Крстев, Ранка Станковић, Бранислава Шандрих Тодоровић, Милица Иконић Нешић. "Нове технологије за оживљавање старих текстова" in Зборник радова Међународне научне конференције Дигитална хуманистика и словенско културно наслеђе II, Београд, 28-29 јуни 2021., Београд : Савез славистичких друштава Србије (2023) М14
-
BERT Downstream Task Analysis: Named Entity Recognition in Serbian
This paper compares different architectures and techniques for preparing named entity recognition (NER) models for the Serbian language via integrating BERT with spaCy. Models were trained to recognize seven different named entity types (persons, locations, organisations, professions, events, demonyms, and artworks), and are trained on the dataset containing Serbian novels published between 1840 and 1920, publicly available newspaper articles and sentences generated from the Wikidata knowledge base and Leximirka lexical database. We explore various configurations and several training pipelines ...Milica Ikonić Nešić, Saša Petalinkar, Mihailo Škorić, Ranka Stanković. "BERT Downstream Task Analysis: Named Entity Recognition in Serbian" in Lecture Notes in Networks and Systems, Springer Nature Switzerland (2024). https://doi.org/10.1007/978-3-031-71419-1_29 М33
-
Sentiment Analysis of Serbian Old Novels
In this paper we present first study of Sentiment Analysis (SA) of Serbian novels from the 1840-1920 period. The preparation of sentiment lexicon was based on three existing lexicons: NRC, AFFIN and Bing with additional extensive corrections. The first phase of dataset refinement included filtering the word that are not found in Serbian morphological dictionary and in second automatic POS tagging and lemma were manually corrected. The polarity lexicon was extracted and transformed into ontolex-lemon and published as initial ...Ranka Stanković, Miloš Košprdić, Milica Ikonić Nešić, Tijana Radović. "Sentiment Analysis of Serbian Old Novels" in Proceedings of the 2nd Workshop on Sentiment Analysis and Linguistic Linked Data, June 2022, Marseille, France, European Language Resources Association (2022) М33
-
Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection
Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Duško Vitas, Mihailo Škorić, Milica Ikonić Nešić (2022)In this paper we present the Serbian part of the ELTeC multilingual corpus of novels written in the time period 1840-1920. The corpus is being built in order to test various distant reading methods and tools with the aim of re-thinking the European literary history. We present the various steps that led to the production of the Serbian sub-collection: the novel selection and retrieval, text preparation, structural annotation, POS-tagging, lemmatization and named entity recognition. The Serbian sub-collection was published ...Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Duško Vitas, Mihailo Škorić, Milica Ikonić Nešić. "Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection" in Proceedings of the Language Resources and Evaluation Conference, June 2022, Marseille, France, European Language Resources Association (2022) М33
-
Serbian NER&Beyond: The Archaic and the Modern Intertwinned
U ovom radu predstavljamo srpski književni korpus koji se razvija pod okriljem COST Akcije „Distant Reading for European Literary History” CA16204. Koristeći ovaj korpus romana napisanih pre više od jednog veka, razvili smo i učinili javno dostupnim Sistem za prepoznavanje imenovanih entiteta (NER) obučen da prepozna 7 različitih tipova imenovanih entiteta, sa konvolucionom neuronskom mrežom (CNN), koja ima F1 rezultat od ≈91% na test skupu podataka. Ovaj model je dalje ocenjen na posebnom skupu podataka za evaluaciju. Završavamo poređenje ...Branislava Šandrih Todorović, Cvetana Krstev, Ranka Stanković, Milica Ikonić Nešić. "Serbian NER&Beyond: The Archaic and the Modern Intertwinned" in Proceedings of the Conference Recent Advances in Natural Language Processing - Deep Learning for Natural Language Processing Methods and Applications, INCOMA Ltd. Shoumen, BULGARIA (2021). https://doi.org/10.26615/978-954-452-072-4_141 М33
-
From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)
In this paper we present the wikification of the ELTeC (European Literary Text Collection), developed within the COST Action ``Distant Reading for European Literary History'' (CA16204). ELTeC is a multilingual corpus of novels written in the time period 1840—1920, built to apply distant reading methods and tools to explore the European literary history. We present the pipeline that led to the production of the linked dataset, the novels’ metadata retrieval and named entity recognition, transformation, mapping and Wikidata population, ...Milica Ikonić Nešić, Ranka Stanković, Christof Schöch and Mihailo Škorić. "From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)" in Proceedings of The 8th Workshop on Linked Data in Linguistics within the 13th Language Resources and Evaluation Conference, June 2022, Marseille, France, European Language Resources Association (2022) М33
-
Parallel Stylometric Document Embeddings with Deep Learning Based Language Models in Literary Authorship Attribution
This paper explores the effectiveness of parallel stylometric document embeddings in solving the authorship attribution task by testing a novel approach on literary texts in 7 different languages, totaling in 7051 unique 10,000-token chunks from 700 PoS and lemma annotated documents. We used these documents to produce four document embedding models using Stylo R package (word-based, lemma-based, PoS-trigrams-based, and PoS-mask-based) and one document embedding model using mBERT for each of the seven languages. We created further derivations of these ...Mihailo Škorić, Ranka Stanković, Milica Ikonić Nešić, Joanna Byszuk, Maciej Eder. "Parallel Stylometric Document Embeddings with Deep Learning Based Language Models in Literary Authorship Attribution" in Mathematics, MDPI AG (2022). https://doi.org/10.3390/math10050838 М21а
-
SrpCNNeL: Serbian Model for Named Entity Linking
Ovaj rad predstavlja razvoj modela za prepoznavanje i povezivanje imenovanih entiteta (NEL) sa bazom znanja Vikipodaci za srpski jezik pod nazivom SrpCNNeL. Model je obučen da prepozna i poveže sedam različitih imenovanih tipova entiteta (osobe, lokacije, organizacije, profesije, događaji, demoni i umetnička dela) na skupu podataka koji sadrži rečenice iz romana, pravnih dokumenata, kao i rečenice generisane iz znanja Vikipodataka baza i Leksimirka leksička baza podataka. Dobijeni model je pokazao dobre performanse, postigavši F1 rezultat od 0,8 na test ...Milica Ikonić Nešić, Saša Petalinkar, Ranka Stanković, Miloš Utvić, Olivera Kitanović. "SrpCNNeL: Serbian Model for Named Entity Linking" in Annals of Computer Science and Information Systems, IEEE (2024). https://doi.org/10.15439/2024F8827 М33
-
Topic Modeling of the SrpELTeC Corpus: A Comparison of NMF, LDA, and BERTopic
Modeliranje tema je efikasan način da se dobije uvid u velike količine podataka. Neki od najčešće korišćenih metoda za modeliranje tema su Latentna Dirihleova alokacija (LDA) i faktorizacija nenegativne matrice (NMF). Međutim, sa porastom modela samopažnje i unapred obučenih jezičkih modela, pojavili su se novi načini za ekstrakcju tema. BERTopic predstavlja novi pristup modeliranju tema. U ovom radu smo uporedili performanse LDA, NMF i BERTopic na književnim tekstovima na srpskom, merenjem koherentnosti tema i raznovrsnosti tema, kao i kvalitativnom ...Teodora Mihajlov, Milica Ikonić Nešić, Ranka Stanković, Olivera Kitanović. "Topic Modeling of the SrpELTeC Corpus: A Comparison of NMF, LDA, and BERTopic" in Annals of Computer Science and Information Systems, IEEE (2024). https://doi.org/10.15439/2024F1593 М33
-
Advancing Sentiment Analysis in Serbian Literature: A Zero and Few-Shot Learning Approach Using the Mistral Model
Ova studija predstavlja analizu sentimenta srpskih starih romana iz perioda 1840-1920, koristeći veliki jezički model (LLM) Mistral za tehniku učenja sa zasnovani na takozvanim "zero" i "few-shot" pokušajima. Glavni pristup uvodi inovacije osmišljavanjem istraživačkih upita (promptova) uključuju tekst sa uputstvom za klasifikaciju bez primera i na osnovu nekoliko primera, omogućavajući jezičkom modelu da klasifikuje osećanja u pozitivne, negativne ili objektivne kategorije. Ova metodologija ima za cilj da pojednostavi analizu osećanja ograničavanjem odgovora, čime se povećava preciznost ...Milica Ikonić Nešić, Saša Petalinkar, Mihailo Škorić, Ranka Stanković, Biljana Rujević. "Advancing Sentiment Analysis in Serbian Literature: A Zero and Few-Shot Learning Approach Using the Mistral Model" in In Proceedings of the Sixth International Conference on Computational Linguistics in Bulgaria (CLIB 2024), BAS (2024) М33
-
Towards Semantic Interoperability: Parallel Corpora as Linked Data Incorporating Named Entity Linking
U radu se prikazuju rezultati istraživanja vezanih za pripremu paralelnih korpusa, fokusirajući se na transformaciju u RDF grafove koristeći NLP Interchange Format (NIF) za lingvističku anotaciju. Pružamo pregled paralelnog korpusa koji je korišćen u ovom studijskom slučaju, kao i proces označavanja delova govora, lematizacije i prepoznavanja imenovanih entiteta (NER). Zatim opisujemo povezivanje imenovanih entiteta (NEL), konverziju podataka u RDF, i uključivanje NIF anotacija. Proizvedene NIF datoteke su evaluirane kroz istraživanje triplestore-a korišćenjem SPARQL upita. Na kraju, razmatra se povezivanje Linked ...paralelni korpusi, povezivanje imenovanih entiteta, prepoznavanje imenovanih entiteta, NER, NEL, povezani podaci, NIF, VikipodaciRanka Stanković, Milica Ikonić Nešić, Olja Perisic, Mihailo Škorić, Olivera Kitanović. "Towards Semantic Interoperability: Parallel Corpora as Linked Data Incorporating Named Entity Linking" in Proceedings of the 9th Workshop on Linked Data in Linguistics @ LREC-COLING 2024, Turin, 20-25 May 2024, ELRA and ICCL (2024) М33
-
Preparation, characterization and photocatalytic activity of lanthanum and vanadium co-doped mesoporous TiO2 for azo-dye degradation
Nešić Jelena, Manojlović Dragan D., Anđelković Ivan, Dojčinović Biljana, Vulić Predrag, Krstić Jugoslav, Roglić Goran (2013)Nešić Jelena, Manojlović Dragan D., Anđelković Ivan, Dojčinović Biljana, Vulić Predrag, Krstić Jugoslav, Roglić Goran. "Preparation, characterization and photocatalytic activity of lanthanum and vanadium co-doped mesoporous TiO2 for azo-dye degradation" in Journal of Molecular Catalysis A: Chemical 378, Amsterdam:Elsevier (2013): 67-75. https://doi.org/10.1016/j.molcata.2013.05.018 M22
-
Kompleksno iskorišćenje mineralno-sirovinskog potencijala leđišta uglja Veliki Crljeni, kolubarski ugljonosni basen (mogući scenario)
Vučković Bogoljub, Nešić Duško, Bogdanović Vesna, Draško Zoran, Klemčić Goran. "Kompleksno iskorišćenje mineralno-sirovinskog potencijala leđišta uglja Veliki Crljeni, kolubarski ugljonosni basen (mogući scenario)" in Savremene tehnologije u rudarstvu i zaštiti životne sredine : zbornik radova / I Međunarodni simpozijum RUDARSTVO 2010, Tara 24. - 26. Maj, 2010 = Modern Technologies in Mining and Environmental Protection : proceedings 1, Tara:Privredna komora Srbije (2010): 93-98 M33
-
Fe doped TiO2 prepared by microwave-assisted hydrothermal process for removal of As(III) and As(V) from water
Anđelković Ivan B, Stanković Dalibor M, Nešić Jelena, Krstić Jugoslav B, Vulić Predrag, Manojlović Dragan D, Roglić Goran M (2014)Anđelković Ivan B, Stanković Dalibor M, Nešić Jelena, Krstić Jugoslav B, Vulić Predrag, Manojlović Dragan D, Roglić Goran M. "Fe doped TiO2 prepared by microwave-assisted hydrothermal process for removal of As(III) and As(V) from water" in Industrial and Engineering Chemistry Research 53 no. 27, Washington:American Chemical Society (2014): 10841-10848. https://doi.org/10.1021/ie500849r M21
-
Sadržaj i distribucija Ni i Co u ultrabazičnim stenama i njihovim produktima raspadanja
Milica Simović (1970)Milica Simović. Sadržaj i distribucija Ni i Co u ultrabazičnim stenama i njihovim produktima raspadanja, Beograd:Rudarsko-geološki fakultet, 1970
-
Modeliranje disperzije dimnih gasova iz termoelektrane ,, Nikola Tesla”
Milica Kostić (2024)Termoelektrane na ugalj, iako značajan izvor električne energije, predstavljaju veliki izazov zbog zagađenja koje uzrokuju. Glavni uzrok zagađenja je neumerena ljudska potreba za energijom i resursima, što dovodi do emisije različitih zagađivača u atmosferu, uključujući okside ugljenika, sumpora, azota, kao i lebdeće čestice i toksine. Ove emisije imaju negativan uticaj na kvalitet vazduha, zdravlje ljudi i životnu sredinu. Procena uticaja termoelektrana na ugalj na životnu sredinu je zakonska obaveza i ključna za smanjenje tih negativnih efekata. Ovaj dokument analizira potencijalne ...Milica Kostić. Modeliranje disperzije dimnih gasova iz termoelektrane ,, Nikola Tesla”, 2024
-
Analiza rada električnih centrifugalnih pumpi
Milica Stanojević (2024)Ovaj rad analizira rad električne centrifugalne pumpe (ESP) u naftnoj industriji, sa fokusom na problem taloženja kamenca (karbonatnih naslaga) unutar sistema. Električne centrifugalne pumpe (ESP) predstavljaju srce mnogih naftnih bušotina. Njihova visoka pouzdanost i sposobnost rada u zahtevnim uslovima čine ih neophodnim za stabilnu i kontinuiranu proizvodnju.Senzori igraju ključnu ulogu u praćenju rada ESP sistema i detekciji ranih znakova taloženja kamenca. Problem taloženja kamenca je izuzetno značajan, jer može dovesti do smanjenja efikasnosti pumpe, povećanja troškova održavanja, i dugoročnih zastoja ...Milica Stanojević . Analiza rada električnih centrifugalnih pumpi, 2024