Претрага
528 items
-
Rule-based Automatic Multi-word Term Extraction and Lemmatization
In this paper we present a rule-based method for multi-word term extraction that relies on extensive lexical resources in the form of electronic dictionaries and finite-state transducers for modelling various syntactic structures of multi-word terms. The same technology is used for lemmatization of extracted multi-word terms, which is unavoidable for highly inflected languages in order to pass extracted data to evaluators and subsequently to terminological e-dictionaries and databases. The approach is illustrated on a corpus of Serbian texts from ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac. "Rule-based Automatic Multi-word Term Extraction and Lemmatization" in Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016, Portorož, Slovenia, 23--28 May 2016, European Language Resources Association (2016) M33
-
Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources
Large collections of textual documents represent an example of big data that requires the solution of three basic problems: the representation of documents, the representation of information needs and the matching of the two representations. This paper outlines the introduction of document indexing as a possible solution to document representation. Documents within a large textual database developed for geological projects in the Republic of Serbia for many years were indexed using methods developed within digital humanities: bag-of-words and named ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Olivera Kitanović. "Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources" in Trans. Computational Collective Intelligence - Lecture Notes in Computer Science 26, Springer (2017). https://doi.org/10.1007/978-3-319-59268-8_8 M33
-
The Dictionary of the Serbian Academy: from the Text to the Lexical Database
In this paper we discuss the project of digitization of the Dictionary of the Serbo-Croatian Standard and Vernacular Language. Scanning and character recognition were a particular challenge, since various non-standard character set encoding was used in the course of the almost 60-year long production of the dictionary. The first aim of the project was to formalize the micro-structure of the dictionary articles in order to parse the digitized text of and transform it into structured data stored in relational lexical database. This approach ...Ranka Stanković, Rada Stijović, Duško Vitas, Cvetana Krstev, Olga Sabo. "The Dictionary of the Serbian Academy: from the Text to the Lexical Database" in Proceedings of the XVIII EURALEX International Congress: Lexicography in Global Contexts, Ljubljana : Ljubljana University Press, Faculty of Arts (2018) M33
-
The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines
In this paper we present how resources and tools developed within the Human Language Technology Group at the University of Belgrade can be used for tuning queries before submitting them to a web search engine. We argue that the selection of words chosen for a query, which are of paramount importance for the quality of results obtained by the query, can be substantially improved by using various lexical resources, such as morphological dictionaries and wordnets. These dictionaries enable semantic ...LR web services, MultiWord Expressions & Collocations, Information Extraction, Information RetrievalKrstev Cvetana, Stanković Ranka, Vitas Duško, Obradović Ivan. "The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines" in LREC 2008: Conference on Language Resources and Evaluation, Marrakesh, Morocco, May 2008, European Language Resources Association (ELRA) (2008) М63
-
Serbian NER&Beyond: The Archaic and the Modern Intertwinned
U ovom radu predstavljamo srpski književni korpus koji se razvija pod okriljem COST Akcije „Distant Reading for European Literary History” CA16204. Koristeći ovaj korpus romana napisanih pre više od jednog veka, razvili smo i učinili javno dostupnim Sistem za prepoznavanje imenovanih entiteta (NER) obučen da prepozna 7 različitih tipova imenovanih entiteta, sa konvolucionom neuronskom mrežom (CNN), koja ima F1 rezultat od ≈91% na test skupu podataka. Ovaj model je dalje ocenjen na posebnom skupu podataka za evaluaciju. Završavamo poređenje ...Branislava Šandrih Todorović, Cvetana Krstev, Ranka Stanković, Milica Ikonić Nešić. "Serbian NER&Beyond: The Archaic and the Modern Intertwinned" in Proceedings of the Conference Recent Advances in Natural Language Processing - Deep Learning for Natural Language Processing Methods and Applications, INCOMA Ltd. Shoumen, BULGARIA (2021). https://doi.org/10.26615/978-954-452-072-4_141 М33
-
Part of Speech Tagging for Serbian language using Natural Language Toolkit
Ranka Stanković, Boro Milovanović (2020)Dok se razvijaju složeni algoritmi za NLP (obrada prirodnog jezika), osnovni zadaci kao što je označavanje ostaju veoma važni i još uvek izazovni. NLTK (Natural Language Toolkit) je moćna Python biblioteka za razvoj programa zasnovanih na NLP-u. Pokušavamo da iskoristimo ovu biblioteku za kreiranje PoS (vrsta reči) oznake za savremeni srpski jezik. Jedanaest različitih modela je kreirano korišćenjem NLTK API-ja za označavanje. Najbolji modeli se transformišu sa Brill tagerom da bi se poboljšala tačnost. Obučili smo modele na označenom ...Ranka Stanković, Boro Milovanović. "Part of Speech Tagging for Serbian language using Natural Language Toolkit" in 7th International Conference on Electrical, Electronic and Computing Engineering IcETRAN 2020, Academic Mind, Belgrade (2020) М33
-
Mine ventilation system planning using genetic algorithms
The most common problem in contemporary mining practice related to the planning and analysis of ventilation systems is the optimization of partially regulated air distribution in mine ventilation networks. This paper presents a two step optimization procedure for partly regulated air distribution in mine ventilation networks. The first step is the determination of air distribution in all branches using the well known Hardy-Cross method. In the second step the distribution and parameters of air flow regulators with minimum engaged ...coal, lignite, and peat; ventilation systems; coal mining; underground mining; ventilation; mathematical models; design; air flow; optimization; calculation methodsNikola Lilić, Ranka Stanković, Ivan Obradović. "Mine ventilation system planning using genetic algorithms" in 6. international symposium on mine planning and equipment selection, Ostrava (Czech Republic), 3-6 Sep 1997, A.A. Balkema, Rotterdam (Netherlands) (1997) М33
-
Parallel Stylometric Document Embeddings with Deep Learning Based Language Models in Literary Authorship Attribution
This paper explores the effectiveness of parallel stylometric document embeddings in solving the authorship attribution task by testing a novel approach on literary texts in 7 different languages, totaling in 7051 unique 10,000-token chunks from 700 PoS and lemma annotated documents. We used these documents to produce four document embedding models using Stylo R package (word-based, lemma-based, PoS-trigrams-based, and PoS-mask-based) and one document embedding model using mBERT for each of the seven languages. We created further derivations of these ...Mihailo Škorić, Ranka Stanković, Milica Ikonić Nešić, Joanna Byszuk, Maciej Eder. "Parallel Stylometric Document Embeddings with Deep Learning Based Language Models in Literary Authorship Attribution" in Mathematics, MDPI AG (2022). https://doi.org/10.3390/math10050838 М21а
-
From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)
In this paper we present the wikification of the ELTeC (European Literary Text Collection), developed within the COST Action ``Distant Reading for European Literary History'' (CA16204). ELTeC is a multilingual corpus of novels written in the time period 1840—1920, built to apply distant reading methods and tools to explore the European literary history. We present the pipeline that led to the production of the linked dataset, the novels’ metadata retrieval and named entity recognition, transformation, mapping and Wikidata population, ...Milica Ikonić Nešić, Ranka Stanković, Christof Schöch and Mihailo Škorić. "From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)" in Proceedings of The 8th Workshop on Linked Data in Linguistics within the 13th Language Resources and Evaluation Conference, June 2022, Marseille, France, European Language Resources Association (2022) М33
-
Towards ELTeC-LLOD: European Literary Text Collection Linguistic Linked Open Data
Овај рад описује студију случаја о генерисању повезаних података креираних на основу обечежених текстуалних корпуса коришћењем формата размене података у обради природних језика (NIF). Као основа за ово истраживање послужио је подскуп корпуса ELTeC, који се састоји од 900 романа из периода 1840-1920 за 9 европских језика. Верзија романа са коментарима, у такозваном TEI level-2 формату, трансформисана је у NIF, формат заснован на RDF/OWL који има за циљ постизање интероперабилности између алата за обраду природних језика, језичких ресурса и ...Ranka Stanković, Christian Chiarcos, Miloš Utvić, Olivera Kitanović. "Towards ELTeC-LLOD: European Literary Text Collection Linguistic Linked Open Data" in LDK 2023 – 4th Conference on Language, Data and Knowledge, 12-15 September in Vienna, Austria, Lisabon : NOVA FCSH - CLUNL (2023). https://doi.org/10.34619/srmk-injj М33
-
SrpCNNeL: Serbian Model for Named Entity Linking
Ovaj rad predstavlja razvoj modela za prepoznavanje i povezivanje imenovanih entiteta (NEL) sa bazom znanja Vikipodaci za srpski jezik pod nazivom SrpCNNeL. Model je obučen da prepozna i poveže sedam različitih imenovanih tipova entiteta (osobe, lokacije, organizacije, profesije, događaji, demoni i umetnička dela) na skupu podataka koji sadrži rečenice iz romana, pravnih dokumenata, kao i rečenice generisane iz znanja Vikipodataka baza i Leksimirka leksička baza podataka. Dobijeni model je pokazao dobre performanse, postigavši F1 rezultat od 0,8 na test ...Milica Ikonić Nešić, Saša Petalinkar, Ranka Stanković, Miloš Utvić, Olivera Kitanović. "SrpCNNeL: Serbian Model for Named Entity Linking" in Annals of Computer Science and Information Systems, IEEE (2024). https://doi.org/10.15439/2024F8827 М33
-
Modeliranje parametara kompenzacionih uređaja u sistemu transporta uglja u podzemnoj eksploataciji
Dragan M. Medenica (2005)Dragan M. Medenica. Modeliranje parametara kompenzacionih uređaja u sistemu transporta uglja u podzemnoj eksploataciji, Beograd:Rudarsko Geološki Fakultet, 2005
-
Geološko-geofizički model dela Timočkog magmatskog kompleksa
Snežana M. IGNJATOVIĆ (2014)Timočki magmatski kompleks; aeromagnetska istraživanja, gravimetrijska istraživanja, 2D geološko-geofizički model, Valja StržSnežana M. IGNJATOVIĆ. Geološko-geofizički model dela Timočkog magmatskog kompleksa, Beograd:Rudarsko Geološki Fakultet, 2014
-
Model inteligentnog sistema adaptivnog upravljanja procesom prerade rude
Ivana M. Jovanović (2016)adaptivno upravljanje, ruda, prerada rude, flotacijska koncentracija, priprema mineralnih sirovina, modelovanjeIvana M. Jovanović. Model inteligentnog sistema adaptivnog upravljanja procesom prerade rude, Beograd:Rudarsko Geološki Fakultet, 2016
-
Prirodno prečišćavanje i stimulisana bioremedijacija podzemnih voda zagađenih naftnim ugljovodonicima
Nenad M. MARIĆ (2016)Nenad M. MARIĆ. Prirodno prečišćavanje i stimulisana bioremedijacija podzemnih voda zagađenih naftnim ugljovodonicima, Beograd:Rudarsko Geološki Fakultet, 2016
-
Mineragenija i potencijalnost karbonatnih sirovina rudnog reona Bjelopavlića (Crna Gora)
Darko M. BOŽOVIĆ (2016)Darko M. BOŽOVIĆ. Mineragenija i potencijalnost karbonatnih sirovina rudnog reona Bjelopavlića (Crna Gora), Beograd:Rudarsko Geološki Fakultet, 2016
-
Inženjerskogeološki kriterijumi kao deo višekriterijumske analize izbora lokacija deponija
Sonja M. ĐOKANOVIĆ (2014)Sonja M. ĐOKANOVIĆ. Inženjerskogeološki kriterijumi kao deo višekriterijumske analize izbora lokacija deponija, Beograd:Rudarsko-geološki fakultet, 2014
-
Model konstrukcije podzemnog proizvodnog sistema za primenu dizel opreme u ležištima uglja Srbiji
Zoran M. Gligorić (2004)Zoran M. Gligorić. Model konstrukcije podzemnog proizvodnog sistema za primenu dizel opreme u ležištima uglja Srbiji, Beograd:Rudarsko Geološki Fakultet, 2004
-
Fuzzy stohastički model izbora sistema otvaranja podzemnog rudnika
Saša M. Jovanović (2016)otkopavanje, otvaranje, mreže i grafovi, fuzzy skupovi, stohastički procesi, indeks konveksnosti, sistemi podrške odlučivannjuSaša M. Jovanović. Fuzzy stohastički model izbora sistema otvaranja podzemnog rudnika, Beograd:Rudarsko Geološki Fakultet, 2016
-
Prognoza operativne efikasnosti aktivnog podzemnog rudnika zasnovana na teoriji sivih sistema
Svetlana M. ŠTRBAC-Savić (2016)Svetlana M. ŠTRBAC-Savić. Prognoza operativne efikasnosti aktivnog podzemnog rudnika zasnovana na teoriji sivih sistema, Beograd:Rudarsko Geološki Fakultet, 2016