Претрага
50 items
-
Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian
The training of new tagger models for Serbian is primarily motivated by the enhancement of the existing tagset with the grammatical category of a gender. The harmonization of resources that were manually annotated within different projects over a long period of time was an important task, enabled by the development of tools that support partial automation. The supporting tools take into account different taggers and tagsets. This paper focuses on TreeTagger and spaCy taggers, and the annotation schema alignment ...... supporting tools take into account different taggers and tagsets. This paper focuses on TreeTagger and spaCy taggers, and the annotation schema alignment between Serbian morphological dictionaries, MULTEXT-East and Universal Part-of-Speech tagset. The trained models will be used to publish the new ...
... prepara- tion of training sets to be used for different taggers and tagsets in the future. The research was focused on anno- tation schemata alignment between Serbian morphological dictionaries tagset (presented briefly in Subsection 2.1.), MULTEXT-East tagset (Erjavec, 2012), and the Universal ...
... tex- tual narratives that contain certain information about the subject, object and motive) into tokens. Since the result- ing tokens contain the term itself, its PoS-tag and relation- ships with other tokens, they are subsequently used to in- fer concepts and relationships contained in user-stories ...Ranka Stanković, Branislava Šandrih, Cvetana Krstev, Miloš Utvić, Mihailo Škorić. "Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian" in Proceedings of the 12th Language Resources and Evaluation Conference, May Year: 2020, Marseille, France, European Language Resources Association (2020)
-
On the compatibility of lexical resources for NooJ
Lexical resources for many languages are provided for the NooJ linguistic development environment. Meta-data descriptions of morphosyntactic and semantic properties of these languages and their resources are a mandatory part of each language module. In this paper we analyze how well the meta-data actually describe resources for a chosen subset of languages and to what extent are they compatible across languages to support multilingual processing. We show that there is place for improvement in both directions.... were in XML format in compliance with TEI, and their alignment was performed at the sentence level using the ACIDE system (Obradović et al 2008), which can handle aligned texts in various formats (TEI, TMX, html, Vanilla). During the alignment process, the texts were segmented in such a way as ...Ranka Stanković, Miloš Utvić, Duško Vitas, Cvetana Krstev, Ivan Obradović. "On the compatibility of lexical resources for NooJ" in Automatic Processing of Various Levels of Linguistic Phenomena: Selected Papers from the 2011 International Nooj Conference, Cambridge Scholars Publishing (2012): 96-108
-
The Nooj System as Module within an Integrated Language Processing Environment
... and its relations. This adds additional functionality for information retrieval, indexing, machine aided translation, machine translation, and alignment of multilingual texts. WS4LR handles aligned texts as well. A pair of semantically equivalent texts in different languages, such as an original ...
... Parallel Text Management The WS4LR module for management of aligned parallel texts uses texts which have previously been aligned using Xalign as an alignment tool (Bonhomme 2001). Parallel texts which usually originate from a text in one language and its translation in another, are often aligned at ...Ranka Stanković, Duško Vitas, Cvetana Krstev. "The Nooj System as Module within an Integrated Language Processing Environment" in Proceedings of the 2007 International Nooj Conference, Cambridge Scholars Publishing (2008)
-
Wordnet Development Using a Multifunctional Tool
Ivan Obradović, Ranka Stanković (2007)In this paper we present a multifunctional tool for manipulating heterogeneous language resources. The tool handles electronic dictionaries, wordnets and aligned texts, and provides for their synchronous use in various tasks. We focus here on the description of the possibilities this tool offers in the development of wordnets. Besides the wordnet module which enables parallel handling of two wordnets, other modules, such as the module for morphological dictionaries and the module for aligned texts, as well as available finite ...... in cases of aligned multilingual wordnets, such as EuroWordNet and BalkaNet, since a common conceptual network substantially alleviated the alignment. However, within the BalkaNet project the following questions have often been raised: are concepts linguistically independent or not, are the ...
... the next section. The WS4LR module for management of aligned parallel texts uses texts which have previously been aligned using Xalign as an alignment tool [3]. The module converts these texts to the Translation Memory eXchange (TMX) format, which is becoming the standard format for aligned ...Ivan Obradović, Ranka Stanković. "Wordnet Development Using a Multifunctional Tool" in Proceedings of the International Workshop Computer Aided Language Processing (CALP) '2007, Borovets, Bulgaria, September 2007, - (2007)
-
Vebran Web Services for Corpus Query Expansion
Ranka Stanković, Miloš Utvić (2020)U ovom radu se govori o razvoju veb usluga Vebran i njihovoj primeni u poboljšanju pretraživanja korpusa. Veb-servisi Vebran koriste se za konsultovanje spoljnih leksičkih izvora za srpski jezik (uglavnom elektronski morfološki rečnici i srpski Vordnet) i proširivanje korisničkih upita radi dobijanja relevantnijih rezultata iz srpskih korpusa.... hierarchical display of the vocabulary terms is available for each domain. Besides its name, each term has its synonyms, abbreviations, description and bibliography. In case that the description of a term contains a LATEX fragment, the fragment will be interpreted, which helps in the presentation of ...
... an original query with word forms related to term X is based on the use of semantic and terminological resources to find other terms such that there exists a given semantic relation (synonymy, antonymy, hyperonymy, meronymy) between those terms and the term X. Web service function sinonimi/post16 receives ...
... than 700 users, mostly Slavists. 2.2 RudKor Systematic collection and preparation of texts from the mining domain started with English-Serbian alignment of articles in a bilingual journal “Podzemni radovi”, followed by mining projects, law regulations, PhD theses and textbooks from the mining domain ...Ranka Stanković, Miloš Utvić. "Vebran Web Services for Corpus Query Expansion" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.5
-
An Integrated Environment for Management and Exploitation of Linguistic Resources
Ranka Stanković, Ivan Obradović (2009)... first step parallel texts were segmented into sentences, and in the sec- ond step the sentences were aligned by means of one of the available alignment methods. The tool used for the majority of alignments was XAlign, developed within LORIA, Labo- ratoire Lorrain de Recherche en Informatique et ...
... HTML or another format, depending on the type of vi- sualization required. The module operates on specific file structures that result from alignment with XAlign, but it can also accept as input other files that are already in TMX format. IV. WS4QE WEB APPLICATION FOR QUERY EXPANSION ...
... query. To that end one of the basic functions of WS4QE has been developed, namely query expansion. It should be noted that a more adequate term might be query “refinement”. Namely, besides query expansion, WS4QE allows the user to choose, among the strings offered, the ones to be includ- ...Ranka Stanković, Ivan Obradović. "An Integrated Environment for Management and Exploitation of Linguistic Resources" in Proceedings of the International Multiconference on Computer Science and Information Technology, Computational Linguistics – Applications Workshop (CLA09), Mrągowo, Poland, October 2009, Piscataway : IEEE (2009)
-
An Italian-Serbian Sentence Aligned Parallel Literary Corpus
This article presents the construction and relevance of an Italian-Serbian sentence-aligned parallel corpus, delving into the aligned sentences in order to facilitate effective translation between the two languages. The parallel corpus serves as a valuable resource for language experts, researchers, and language enthusiasts, fostering a deeper understanding of linguistic nuances and cultural expressions. By bridging the gap between Serbian and Italian, this corpus opens new avenues for cross-cultural communication and collaboration, and ultimately contributes to the improvement of language-related ...Saša Moderc, Ranka Stanković, Aleksandra Tomašević, Mihailo Škorić. "An Italian-Serbian Sentence Aligned Parallel Literary Corpus" in Review of the National Center for Digitization, Belgrade : Faculty of Mathematics, University of Belgrade (2023). https://doi.org/10.5281/zenodo.11203388
-
LRMI markup of OER content within the BAEKTEL project
... targetUrl, timeRequired, typicalAgeRange, useRightsUrl. The educational role describes the target audience of the content, while educational alignment points out to an established educational framework (or other educational scheme). The educational use is the educational purpose of the resource ...
... acquisition, use and reuse of learning objects. 3. SEMANTIC ANNOTATION The next generation of the Web is denoted as Web 3.0, which is an umbrella term for customization, semantic contents, and more sophisticated web applications toward artificial intelligence, including computer-generated contents ...
... resources that meet specific individual learning requirements. [6 Nelson again] Another type of additional data about courses are paradata. The term paradata itself is relatively new and it is used to refer to a particular kind of metadata about ...Ranka Stanković, Daniela Carlucci, Olivera Kitanović, Nikola Vulović, Bojan Zlatić. "LRMI markup of OER content within the BAEKTEL project" in The Sixth International Conference on e-Learning (eLearning-2015), September 2015, Belgrade, Serbia, Belgrade : Belgrade Metropolitan Univesity (2015)
-
Infotheca (Q25460443) in Wikidata
Ranka Stanković, Lazar Davidović (2021)Vikipodaci su baza znanja Zadužbine Vikimedija koja predstavlja zajednički izvor različitih vrsta podataka koje koriste ne samo drugi Vikipedijini projekti, već sve više i brojne aplikacije semantičkog veba. U ovom radu ćemo prezentovati primer integracije Vikipodataka sa digitalnim bibliotekama i eksternim sistemima, kao i mogućnost ubrzanja pripreme i unosa podataka na primeru radova iz časopisa za digitalnu humanistiku Infoteka.... the digital age. References Andonovski, Jelena, Branislava Šandrih, and Olivera Kitanović. 2019. “Bilin- gual lexical extraction based on word alignment for improving corpus search.” The Electronic Library. Krstev, Cvetana, Jelena Jaćimović, Branislava Šandrih, and Ranka Stanković. 2019. “Analysis ...Ranka Stanković, Lazar Davidović. "Infotheca (Q25460443) in Wikidata" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.1.5
-
Primena ubrzane konsolidacije kod izgradnje nasipa na deonici priključka Novi Beograd – Surčin autoputa ,,Miloš Veliki"
Aleksa Tomičić (2024)Problem koji se javlja prilikom izgradnje objekata na ne konsolidovanom stišljivom tlu jeste pre svega vezan za prevelika sleganja tla usled opterećenja od objekata, najčešće su to objekti putne infrastrukture (nasipi), ili stambeno poslovni objekti. U tu svrhu se,primenjuju različiti vidovi ubrzanja konsolidacije tla. Stoga za potrebe izrade završnog rada Master akademskih studija studijskog programa geotehnika, na Rudarsko-geološkom fakultetu, izvršena je analiza dva najčešće korišćena postupka ubrzanja konsolidacije: predopterećenjem i ugradnjom peščanih drenova. Glavni cilj ovih postupaka je ...Aleksa Tomičić . Primena ubrzane konsolidacije kod izgradnje nasipa na deonici priključka Novi Beograd – Surčin autoputa ,,Miloš Veliki", 2024
-
A Mathematical Learning Environment Based on Serbian Language Resources
In recent years, in line with ever growing usage of Information technology, the learning environments are changing. The amount of available learning materials in various forms has increased. These new environments demand comprehensive learning systems, which enable management of the learning corpus with special attention paid to relevant lexical resources. In this paper we present the concept of a Mathematical Learning Environment in Serbian (MLES), which is based on a corpus of mathematical materials and various lexical resources, enabling ...... only in the English, or simultaneously in both languages. Term modification implies changes of the very properties of the term (name, abbreviation, synonyms and description) as well as modification of external connections of the term with the existing bibliography. Two more options are available ...
... Results of the third component are annotated and linked texts, where every mathematical term in the text is linked to the appropriate dictionary entry or relevant corpus content related to that term. This system component also extracts mathematical concepts from problems related to engineering ...
... the page a hierarchical display of the vocabulary terms is available. Besides its name, each term has its synonyms, abbreviations, description and bibliography. In case that the description of a term contains a Latex fragment, the fragment will be interpreted, which helps in the presentation ...Radojičić Marija, Obradović Ivan, Stanković Ranka, Utvić Miloć, Kaplar Sebastijan. "A Mathematical Learning Environment Based on Serbian Language Resources" in Proceedings of the 7th International Scientific Conference Technics and Informatics in Education, Faculty of Technical Sciences, Čačak (2018)
-
Mine surveying works for the purpose of excavating the remaining reserves of bauxite in the deposit of “Podbracan”
... that the total station be previously tested and recti�ed), and that while carrying out of those operations, one must not strictly look at the alignment of instrument and signal, due to the fact that the lengths of sides are short, but one must also look at the quality of measurement of the instru- ...Aleksandar Milutinović, Aleksandar Ganić, Thamer Rayes Diyab, Rade Tokalić, Meri Ganić. "Mine surveying works for the purpose of excavating the remaining reserves of bauxite in the deposit of “Podbracan”" in Revista Escola de Minas, School of Mines Magazine, Sao paulo, Brasil : SciELO (2015)
-
Дефинисање геотехничких параметара еолских седимената земуснког платоа на основу лабораторијских и теренских SPT испитивања
Ђорђе Крзман (2024)Циљ овог рада је био да се одреде геотехничке караткеристике предметног терена што представља полазну основу за рационално пројектовање и градњу. Истражним бушењем утврђено је да предметни терен изграђују четри лесна хоризонта и три слоја погребене земље до дубине од 22-23 m. Лабораторијским испитивањем добијени су ефективни параметри смичуће чврстоће, где се кохезија најчешће креће између c’=14 и 18 kPa, а угао унутрашњег трења f'= 25 - 30 °. Теренска SPT испитивањем уз коришћење прихваћених корелација у геотехничкој пракси ...... |moderately to strongly compressible, moderately compacted, brittle, low permeabiliy, with the appearance (96,931 (96,97) Exploratory borehole for alignment |of carbonale concretions and Fe hydroxide, dark yellow to brown in color. Marsh clays appear in the- t a y periormec interlayers, more compressible ...
... transgressive Geological border P p g VicrpaxHa 6yuioTuHa 3a Tpacy ca KkOTOM TepeHa - HOBOM3BeneHa (9&,92) (98,92) Exploratory borehole for alignment with ground surface elevation - recently performed VicrpaxHa GyuioTMHa 3a MeTpo cTaHMLy BW2s-2} ca KOTOM TepeHa - HOBOM3Be/leHa a (98,2&) (98 ...
... Teonouika rpannua. - . MI NN | onpaona Gyuyomina aa Tpacy oBW2f-1 | BWh-„ | ca koroM Yepena - HooWBenea (96,92) O }, Exploratory borehole for alignment d with ground surface elevation - recentiy performed | onrpaaona Gyuyomaia aa Merpo crauy– oBW2s-2 | BWDMs-2 | a Korow Tepeta - HoBow3BoneNar ...Ђорђе Крзман. Дефинисање геотехничких параметара еолских седимената земуснког платоа на основу лабораторијских и теренских SPT испитивања, 2024
-
Indexing of textual databases based on lexical resources: A case study for Serbian
In this paper we describe an approach to improvement of information retrieval results for large textual databases by pre-indexing documents using bag-of-words and Named Entity Recognition. The approach was applied on a database of geological projects financed by the Republic of Serbia in the last half century. Each document within this database is described by metadata, consisting of several fields such as title, domain, keywords, abstract, geographical location and the like. A bag of words was produced from these ...... Determination of term weights is a complex process and there are numerous models, the most used being: idf based on the term frequency in the document, probabilistic, which includes in addition relevance weights, tf idf which takes into account the number of documents in which the term appears, tfc tfc ...
... relative frequency tfij for each term Tij in a text Di as nij/li where li is the length of the text in the number of simple words; 6. Calculating document frequency dfj as the number of documents in the collection in which the term Tj appears, and the acceptable indicator of term value as a document discriminator ...
... especially visible when short words that could be parts of other words are used in a search. As explained before, the old system does not require alignment with whole words precisely in order to recognize at least some inflectional forms. This problem is generated by query keywords such as the chemical ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Olivera Kitanović. "Indexing of textual databases based on lexical resources: A case study for Serbian" in Semantic Keyword-based Search on Structured Data Sources : First COST Action IC1302 International KEYSTONE Conference, IKC 2015, Coimbra, Portugal, September 8-9, 2015. Revised Selected Papers, Springer (2015). https://doi.org/10.1007/978-3-319-27932-9_15
-
Extensive vibrations of the belt conveyer drive electromotor of a bucket wheel excavator as a result of intesified wear-and-tear of its mount support
Vesna Damnjanović, Predrag Jovančić, Snežana Aleksandrović. "Extensive vibrations of the belt conveyer drive electromotor of a bucket wheel excavator as a result of intesified wear-and-tear of its mount support" in Journal of Vibroengineering (2017). https://doi.org/10.21595/jve.2016.17321
-
Mesozoic carbonate rocks in Serbia used as dimension stone
Vesna Matović, Tijana Vojnović Ćalić (2016)The building industry in Serbia uses, to a great extent, imported natural stone for architectural purposes. The significance of local deposits, particularly limestones, is not adequately perceived despite the country’s abundance of these valuable resources. Therefore, this study focuses on Serbia’s Mesozoic carbonate rocks, specifically on the deposits of four selected quarries: Klisura, Skrzut, Struganik, and Tisnica. The quality and prospects of the application of these limestones has not yet been the subject of a detailed, comprehensive investigation. Therefore, ...... indicators of their usability. For the appropriate use of natural stone products, it is advisable to consult suitable technical requirements. The alignment of Serbian national standards with the standards of the European Union is in progress. The requirements concerning natural stone slabs for cladding ...Vesna Matović, Tijana Vojnović Ćalić. "Mesozoic carbonate rocks in Serbia used as dimension stone" in Bulletin of Engineering Geology and the Environment, Springer Science and Business Media LLC (2016). https://doi.org/10.1007/s10064-015-0722-0
-
Praktikum za vežbe iz Informatike 1
Ranka Stanković, Ivan Obradović, Olivera Kitanović, Mirjana Banković. Praktikum za vežbe iz Informatike 1, Beograd : Univerzitet u Beogradu, Rudarsko-geološki fakultet, 2014
-
Tеслапедиа као централна база библиографских података о Николи Тесли
Циљ овог рада је да прикаже идеју и процес израде најкомплетније електронске библиографске базе о Николи Тесли, симболично назване Теслапедиа. Oвaквa бaзa би укључивaлa Teслину библиoгрaфиjу, кao и библиoгрaфиjу других o њeму. Сaм кoнцeпт рaзвoja Teслaпeдиe je видљив крoз двe глaвнe идeje. Првa идeja je зaснoвaнa нa пoстaвљaњу свих пoстojeћих библиoгрaфских јединица кoje Mузej пoсeдуje нa oдрeђeну вeб-лoкaциjу a другa, пoдjeднaкo вaжнa, oглeдa сe у мoгућнoстимa дaљeг рaзвoja и нaдгрaдњe сaмe структурe базе придруживaњeм нoвих библиoгрaфских пoдaтaкa. Oвa бaзa ...... bibliography of related works by other authors. The concept of the development of Teslapedia is visible in two main ideas. The fi rst of them implies alignment of all the existing bibliographical units, located in the Museum as its property, on a specifi c web site, and the other, equally important, involves ...Ивана Ћирић, Сузана Топаловић, Биљана Лазић. "Tеслапедиа као централна база библиографских података о Николи Тесли" in Читалиште (2013)
-
A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian
Uvredljivi govor na društvenim medijima, uključujući psovke, pogrdni govor i govor mržnje, dostigao je nivo pandemije. Sistem koji bi bio u stanju da detektuje takve tekstove mogao bi da pomogne da internet i društveni mediji postanu bolji virtuelni prostor sa više poštovanja. Istraživanja i komercijalna primena u ovoj oblasti do sada su bili fokusirani uglavnom na engleski jezik. Ovaj rad predstavlja rad na izgradnji AbCoSER-a, prvog korpusa uvredljivog govora na srpskom jeziku. Korpus se sastoji od 6.436 ručno označenih ...... of the phenomenon, 3) Vague or incomplete annotation instructions, 4) Overlapping of abusive speech sub-categories. In general, our results are in alignment with the findings of other researchers who reported low inter-annotator agreement scores ([21, 33, 23]) As Ross et al. [36] noted, hate speech is ...
... inappropriate content and incitement to violence have gained importance. The concept of abusive speech, in the context of this paper, is an umbrella term for phenomena such as profanities, derogatory, and hate speech. One of the most cited definitions of hate speech comes from John T. Nockleby [44, 4] ...
... automatic abusive speech detection systems for the Serbian language. In the course of this work, we leveraged existing annotation schemes and abusive term definitions as much as possible with the aim of creating a general data set convenient for the detection of a broad range of abusive topics. We already ...Danka Jokić, Ranka Stanković, Cvetana Krstev, Branislava Šandrih. "A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian" in 3rd Conference on Language, Data and Knowledge (LDK 2021), MDPI AG (2021). https://doi.org/10.4230/OASIcs.LDK.2021.13
-
Structural dissymmetrization of optically anisotropic Grs64±1Adr36±1Sps2 grandite from Meka Presedla (Kopaonik Mt., Serbia)
In this paper, grandite core with Grs64±1Adr36±1Sps2composition was crystallographically studied. This core represents zone A of the macroscopically visiblefive A–E zones of the optically anisotropic Grs58–64 Adr36–42Sps2 grandite. The applied procedure includes the detailed analysis of the powder diffraction patterns, and the Rietveld refinements of the crystal structures in a series of 18 space groups and two mixtures, which were followed by the comparative analysis of the R-values, site occupancy factors, and the bond lengths and angles. Synthesis of all ...Pavle Tančić, Aleksandar Kremenović, Predrag Vulić. "Structural dissymmetrization of optically anisotropic Grs64±1Adr36±1Sps2 grandite from Meka Presedla (Kopaonik Mt., Serbia)" in Powder Diffraction, Cambridge University Press (CUP) (2019). https://doi.org/10.1017/S0885715619000897