Претрага
2379 items
-
Development of Open Educational Resources (OER) for Natural Language Processing
In this paper we present the development of an online course at the edX BAEKTEL platform named “Lexical Recognition in the Natural Language Processing (NLP)”. It is based on the course of the same name for PhD studies at the University of Belgrade, Faculty of Philology. There are not many courses in Computational Linguistics (CL) on OER platforms, and there is none in Serbian either for CL or NLP. We have developed this course in order to improve this ...... morphology, and syntax. Some of the functions are: developing and applying electronic dictionaries of simple words and multi-word units; pattern matching with queries in form of regular expressions and graphs; text transformations; processing of monolingual and bilingual texts ...
... but also in written form as parallel (multilingual) corpora of lessons and texts, supported by electronic terminological resources[10], services, and functionalities for searching and browsing of terminological resources and using them for text annotation. The project consortium 10 consists ...
... es and software that enables their effective use was developed at LADL (Laboratoire d' Automatique Documentaire et Linguistique), under the direction of its director, Maurice Gross. [12] With Unitex, user can develop electronic resources such as electronic dictionaries and grammars and apply ...Cvetana Krstev, Biljana Lazić, Ranka Stanković, Giovanni Schiuma, Miladin Kotorčević. "Development of Open Educational Resources (OER) for Natural Language Processing" in The Sixth International Conference on e-Learning (eLearning-2015), September 2015, Belgrade, Serbia, Belgrade : Belgrade Metropolitan Univesity (2015)
-
Речници у дигиталном добу - информатичка подршка за српски језик
Биљана Рујевић (2022)Морфолошки речници српског језика представљају електронски језички ресурс који има значајну историју развоја и коришћења за потребе обраде природних језика. С обзиром на то да су чувани у облику датотека чији је број нарастао па је самим тим управљање речницима постало отежано јавила се потреба за смештањем информација из речника у облик лексикографске базе. Како би се омогућио симултани рад на развоју речника за више корисника јавила се потреба за веб-апликацијом заснованој на лексикографској бази. Како би се размотриле ...Биљана Рујевић. Речници у дигиталном добу - информатичка подршка за српски језик, Београд : [Б. Рујевић], 2022
-
Multi-word Expressions for Abusive Speech Detection in Serbian
Ovaj rad predstavlja istraživanja na usavršavanju i unapređenju srpske verzije rečnika Hurtlex, višejezičnog leksikona uvredljivih reči. Posebnu pažnju posvećujemo dodavanju izraza sa više reči (polileksemskih jedinica) koji se mogu smatrati uvredljivim, jer su takvi leksički zapisi veoma važni za postizanje dobrih rezultata u mnoštvu zadataka otkrivanja uvredljivog jezika. Srpski morfološki rečnici se koriste kao osnova za čišćenje podataka i stvaranje rečnika. Istaknuta je veza sa drugim leksičkim i semantičkim resursima na srpskom jeziku i predviđena je izgradnja sistema za ...... Texts and Electronic Dictionaries. University of Belgrade, Faculty of Philology, Belgrade. Irene Kwok and Yuzhou Wang. 2013. Locate the hate: Detecting tweets against blacks. In Proceedings of the twenty-seventh AAAI conference on artificial intelligence, pages 1621–1622. Biljana Lazić and Mihailo ...
... Resources and Evaluation (LREC’06), Genoa, Italy, May. European Language Resources Association (ELRA). Njagi Dennis Gitari, Zhang Zuping, Hanyurwimfura Damien, and Jun Long. 2015. A lexicon-based approach for hate speech detection. International Journal of Multimedia and Ubiquitous Engineering, 10(4):215–230 ...
... University of Belgrade Faculty of Mining and Geology archives faculty publications available in open access, as well as the employees' publications. - The Repository is available at: www.dr.rgf.bg.ac.rs Joint Workshop on Multiword Expressions and Electronic Lexicons, pages 74–84 Barcelona, Spain (Online) ...Ranka Stanković, Jelena Mitrović, Danka Jokić, Cvetana Krstev. "Multi-word Expressions for Abusive Speech Detection in Serbian" in Proceedings of the Joint Workshop on Multiword Expressions and Electronic Lexicons, Association for Computational Linguistics (2020)
-
Terminology Acquisition and Description Using Lexical Resources and Local Grammars
Acquisition of new terminology from specific domains and its adequate description within terminological dictionaries is a complex task, especially for languages that are morphologically complex such as Serbian. In this paper we present an approach to solving this task semi-automatically on basis of lexical resources and local grammars developed for Serbian. Special attention is given to automatic inflectional class prediction for simple adjectives and nouns and the use of syntactic graphs for extraction of Multi-Word Unit (MWU) candidates for ...... of mining engineering and mine safety terminology (Stankovic et al., 2012). The third termbase used is the Dictionary of Library and Information Sciences (RNBS), 4 developed by the National Library of Serbia. It contains terminology in Serbian, English and German, related to theory and practice ...
... Krstev, C. (2008). Processing of Serbian. Automata, Texts and Electronic Dictionaries: Faculty of Phi- lology of the University of Belgrade. Nakagawa, H., & Mori, T. (2003). Automatic term recognition based on statistics of compound nouns and their components. Terminology, 9(2), 201-219. Przepiórkowski ...
... Natural Language Processing and Information Systems (Vol. 6177, pp. 248-255): Springer Berlin Heidelberg. Justeson, J. S., & Katz, S. M. (1995). Technical ter- minology: some linguistic properties and an algo- rithm for identification in text. Natural Language Engineering, 1 (01): 9-27. doi:10.10 ...Cvetana Krstev, Ranka Stanković, Ivan Obradović, Biljana Lazić. "Terminology Acquisition and Description Using Lexical Resources and Local Grammars" in Proceedings of the 11th Conference on Terminology and Artificial Intelligence, Granada, Spain, 2015, Granada : LexiCon (Universidad de Granada) (2015)
-
Two approaches to compilation of bilingual multi-word terminology lists from lexical resources
In this paper, we present two approaches and the implemented system for bilingual terminology extraction that rely on an aligned bilingual domain corpus, a terminology extractor for a target language, and a tool for chunk alignment. The two approaches differ in the way terminology for the source language is obtained: the first relies on an existing domain terminology lexicon, while the second one uses a term extraction tool. For both approaches, four experiments were performed with two parameters being ...Branislava Šandrih, Cvetana Krstev, Ranka Stanković. "Two approaches to compilation of bilingual multi-word terminology lists from lexical resources" in Natural Language Engineering, Cambridge University Press (CUP) (2020). https://doi.org/10.1017/S1351324919000615
-
Correlation Between Dynamic CBR and Compaction and Bearing Capacity of Pavement Foundation Layers
This paper presents a dynamic California Bearing Ratio (CBRd) device that can be used on a prepared pavement foundation layer to obtain a certain parameter of the tested material in the laboratory and in situ in a direct and fast way. For the selected natural material (gravel), it is shown that the laboratory CBRd can be used to estimate the compaction and bearing capacity, because it correlates well with the parameters of the standard and modifed Proctor compaction tests. ...korelacija parametara, donji slojevi saobraćajnica, laboratorijska i terenska ispitivanja, dinamički kalifornijski indeks nosivosti CBRd... recorded using an electronic device, which prevented poten- tial measurement errors. The LWD tests were performed in accordance with TP BF-StB Teil B 8.3 [16], and approximately five minutes were required for one test. Research Results To establish correlations between the compaction and bearing capacity ...
... of Belgrade Faculty of Mining and Geology archives faculty publications available in open access, as well as the employees' publications. - The Repository is available at: www.dr.rgf.bg.ac.rs DO!I 10.1007/511204-024-09939-x Soil Mechanics and Foundation Engineering, Vol. 61, No. 1, March, 2024 (Russian ...
... Capacity of Pavement Foundation Layers | Milan Bogdanović, Snežana Bogdanović, Dragoslav Rakić, Irena Basarić Ikodinović | Soil Mechanics and Foundation Engineering | 2024 | | 10.1007/s11204-024-09939-x http://dr.rgf.bg.ac.rs/s/repo/item/0008686 Дигитални репозиторијум Рударско-геолошког факултета ...Milan Bogdanović, Snežana Bogdanović, Dragoslav Rakić, Irena Basarić Ikodinović. "Correlation Between Dynamic CBR and Compaction and Bearing Capacity of Pavement Foundation Layers" in Soil Mechanics and Foundation Engineering, Springer Science and Business Media LLC (2024). https://doi.org/10.1007/s11204-024-09939-x
-
Possibilities of retro-digitalized German-Serbian Mining Dictionary
U radu će biti prikazan opis procesa retrodigitalizacije dvojezičnog Nemačko-srpskog rudarskog rečnika iz 1923. godine čiji je autor rudarski inženjer Dragutin Stepanović (Степановић, 1923). Ovaj rečnik je zasnovan na skoro 4 000 leksičkih zapisa koji su prevodilački ekvivalenti ili uputnice. Umesto predgovora autor daje uvid u svoje pismo upućeno “Ministru šuma i rudnika” u kome piše o nameri da zabeleži reči koje se koriste u narodu kako bi izbegao upotrebu nemačkih reči. Iako broj odrednica nije toliko veliki, rečnik ...Biljana Lazić, Olivera Kitanović, Ivan Obradović. "Possibilities of retro-digitalized German-Serbian Mining Dictionary" in E-dictionaries and E-lexicography, Zagreb, 10-11 May 2019, Zagreb : Institut za hrvatski jezik i jezikoslovlje (2019)
-
The Importance of Geotechnical Investigations for the Construction of Transfer Station „Prelići“ on Landfill in Cacak – Serbia
For the purpose of the facilities construction within the transfer station „Prelici“ in Cacak, specific geotechnical investigations were carried out based on which the technological project with the facilities schedule was developed. Significant deviations in relation to the presented results were determined during earthworks, so the additional geotechnical investigation works were carried out. Since the new investigation works have determined a different cross section of the terrain, it was proposed to move the planned facilities which led to the ...... until further transport to the regional landfill „Duboko“. Part of the area is designed for temporary storage of specific waste (waste from electrical and electronic equipment, waste tiers, bulky household waste etc.). The transfer station complex itself contains technological units which are defined ...
... (roads, water supply, sewerage, electrical installations) at locations outside urban areas, so the costs of preparing the terrain often exceed the costs of establishing a recycling yard, i.e. transfer station, • increasing investment and exploitation costs for equipment and labor. 3. PRESENTATION OF ...
... on of individual materials and by their size), insufficient compaction and consolidation. Due to this specific porous structure, the physical and mechanical characteristics of the waste are extremely variable at a relatively small distance. Therefore, any engineering intervention on the deposited ...Dragoslav Rakić, Irena Basarić, Jovana Janković, Tina Đurić. "The Importance of Geotechnical Investigations for the Construction of Transfer Station „Prelići“ on Landfill in Cacak – Serbia" in 17th International waste management and landfill symposium, Proceedings SARDINIA 2019, CISA Publisher (2019)
-
From DELA Based Dictionary to Leximirka Lexical Database
Biljana Lazić, Mihailo Škorić (2020)In this paper, we will present an approach in transforming Serbian language Morphological dictionaries from a DELA text format to a lexical database dubbed Leximirka. Considering the benefits of storing data within a database when compared to storing them in textual documents, we will outline some of the functionality that the database has made possible. We will also show how hand-made rules that use category labels lexical entries are marked with can be used to link lexical entries. ...... from the morphological dictionaries and to make them in- teroperable and reusable. Three standards for lexical information have been considered: Guidelines for Electronic Text Encoding and Interchange, Text Encoding Initiative (TEI)3, Lexical Markup Framework (LMF)4 and the Lemon model5. Although Chapter ...
... Automata, Texts and Electronic Dic- tionaries. Faculty of Philology of the University of Belgrade, 2008 Krstev, Cvetana, Ranka Stanković, Duško Vitas and Ivan Obradović. “WS4LR - a Worksation for Lexical Resources”. In Proceedings of the Fifth Interantional Conference on Language Resources and Evaluation ...
... in Section 5. 2 Electronic dictionaries 2.1 The DELA text format Serbian morphological dictionaries are electronic dictionaries primarily intended for machine use. This type of dictionary was first developed for the French language under the influence of linguist Maurice Gross and it is one of the ...Biljana Lazić, Mihailo Škorić. "From DELA Based Dictionary to Leximirka Lexical Database" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.4
-
The Nooj System as Module within an Integrated Language Processing Environment
... main functions (Figure 1): - management of a system of electronic dictionaries which consists of morphological dictionaries of lemmas for simple and compound words but also of bilingual and multilingual dictionaries - development and refinement of wordnets, with simultaneous usage of wordnets ...
... among these resources, and to enable the merging of large numbers of different individual electronic resources to form large global electronic resources, so conversion of NooJ resources to LMF format (Lexical markup framework) (ISO LMF 2006) is also included in this environment. 3. Lexical resources ...
... C. (ed.) “WordNet: An Electronic Lexical Database”, The MIT Press (1998) ISO LMF, Language resource management - Lexical markup framework (LMF), (2006), ISO/TC 37/SC 4 N130 Rev.9, ISO CD 24613:2006. Krstev C., Pavlović-Lažetić G., Vitas D., Obradović I.: Using Textual and Lexical Resources in ...Ranka Stanković, Duško Vitas, Cvetana Krstev. "The Nooj System as Module within an Integrated Language Processing Environment" in Proceedings of the 2007 International Nooj Conference, Cambridge Scholars Publishing (2008)
-
Using Query Expansion for Cross-Lingual Mathematical Terminology Extraction
Velislava Stoykova, Ranka Stanković (2018)Velislava Stoykova, Ranka Stanković. "Using Query Expansion for Cross-Lingual Mathematical Terminology Extraction" in Advances in Intelligent Systems and Computing, Springer International Publishing (2018). https://doi.org/10.1007/978-3-319-91189-2_16
-
Wordnet Development Using a Multifunctional Tool
Ivan Obradović, Ranka Stanković (2007)In this paper we present a multifunctional tool for manipulating heterogeneous language resources. The tool handles electronic dictionaries, wordnets and aligned texts, and provides for their synchronous use in various tasks. We focus here on the description of the possibilities this tool offers in the development of wordnets. Besides the wordnet module which enables parallel handling of two wordnets, other modules, such as the module for morphological dictionaries and the module for aligned texts, as well as available finite ...... Faculty of Mining and Geology Đušina 7, 11000 Belgrade, Serbia ranka@rgf.bg.ac.yu Abstract In this paper we present a multifunctional tool for manipulating heterogeneous language resources. The tool handles electronic dictionaries, wordnets and aligned texts, and provides for their s ...
... development and refinement of wordnets, where both work with a monolingual wordnet and simultaneous usage of two wordnets for different languages is supported - management of a system of electronic dictionaries which consist of morphological dictionaries of lemmas for simple and compound words ...
... to provide a gloss, and preferably usage examples. As synset elements appear as a word-sense pairs the lexicographer has to assign senses to all chosen words. It goes without saying that other linguistic resources, such as electronic dictionaries, bilingual word lists and corpora can be of ...Ivan Obradović, Ranka Stanković. "Wordnet Development Using a Multifunctional Tool" in Proceedings of the International Workshop Computer Aided Language Processing (CALP) '2007, Borovets, Bulgaria, September 2007, - (2007)
-
Towards Semantic Interoperability: Parallel Corpora as Linked Data Incorporating Named Entity Linking
U radu se prikazuju rezultati istraživanja vezanih za pripremu paralelnih korpusa, fokusirajući se na transformaciju u RDF grafove koristeći NLP Interchange Format (NIF) za lingvističku anotaciju. Pružamo pregled paralelnog korpusa koji je korišćen u ovom studijskom slučaju, kao i proces označavanja delova govora, lematizacije i prepoznavanja imenovanih entiteta (NER). Zatim opisujemo povezivanje imenovanih entiteta (NEL), konverziju podataka u RDF, i uključivanje NIF anotacija. Proizvedene NIF datoteke su evaluirane kroz istraživanje triplestore-a korišćenjem SPARQL upita. Na kraju, razmatra se povezivanje Linked ...paralelni korpusi, povezivanje imenovanih entiteta, prepoznavanje imenovanih entiteta, NER, NEL, povezani podaci, NIF, VikipodaciRanka Stanković, Milica Ikonić Nešić, Olja Perisic, Mihailo Škorić, Olivera Kitanović. "Towards Semantic Interoperability: Parallel Corpora as Linked Data Incorporating Named Entity Linking" in Proceedings of the 9th Workshop on Linked Data in Linguistics @ LREC-COLING 2024, Turin, 20-25 May 2024, ELRA and ICCL (2024)
-
Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources
Large collections of textual documents represent an example of big data that requires the solution of three basic problems: the representation of documents, the representation of information needs and the matching of the two representations. This paper outlines the introduction of document indexing as a possible solution to document representation. Documents within a large textual database developed for geological projects in the Republic of Serbia for many years were indexed using methods developed within digital humanities: bag-of-words and named ...... hydrogeology and engineering geology, as a modern and efficient information basis for planning, design and decision- making in the geological domain. A u t h o r P r o o f 4 R. Stanković et al. Within the GeolISS project a web portal1 was developed with the aim to pro- vide quicker and easier access ...
... ‘Institute for civil engineering Subotica’. The Serbian NER system is a handcrafted rule-based system that relies on comprehensive lexical resources for Serbian. For recognition of some types of named entities, e.g. personal names and locations, e-dictionaries and information within them is crucial; ...
... indexing we used only three top level types: personal names, locations and organizations and their distribution is presented in Table 2. Nested NEs were also used for indexing, e.g. toponym Subotica in “Institute for civil engineering Subotica”. 3.2 The Architecture of the New System Indexing of documents ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Olivera Kitanović. "Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources" in Trans. Computational Collective Intelligence - Lecture Notes in Computer Science 26, Springer (2017). https://doi.org/10.1007/978-3-319-59268-8_8
-
Simulation of Migration of Nitrate Contamination in Epikarst Groundwater: A Column Laboratory Method
Branislav Petrović (2021)Branislav Petrović. "Simulation of Migration of Nitrate Contamination in Epikarst Groundwater: A Column Laboratory Method" in Karst: From Top to Bottom, Belgrade, 6.6.2021., Belgrade : Centre for Karst Hydrogeology (2021)
-
Модел миграције нитрата у епикарсту: лабораторијски експеримент
Бранислав Петровић (2022)Епикарст је део карстног изданског система који се налази у зони аерације и представља сложену област додира и мешања неконсолидованог материјала са површине терена, остатака карбонатних стена измењених корозивном водом, флоре и фауне и њихових остатака, који је делимично и повремено засићен подземном водом. Подручје истраживања примењених мултидисциплинарних истраживања био је карстни масив Суве планине (ЈИ Србија), односно подручје око мале пећине Печ, настале у епикарсту. Између осталих истраживања, опит трасирања (На-флуоресцеин) у пећини Печ помогао је у одређивању ...Бранислав Петровић. "Модел миграције нитрата у епикарсту: лабораторијски експеримент" in XVI Српски симпозијум о хидрогеологији, са међународним учешћем, Универзитет у Београду, Рударско-геолошки факултет (2022)
-
DNA/BSA interactions and cytotoxic studies of tetradentate N,N,O,O Schiff base copper(II) complexes
Aleksandar Mijatović, Angelina Caković, Aleksandar Lolić, Snezana Sretenović, Marko Živanović, Dragana Seklić, Jovana Bogojeski, Biljana Petrović (2023)Three Schiff base Cu(II) complexes, (N,N’-bis(acetylacetone)propylenediimine)copper(II) complex, [Cu(acac2pn)] (1), (N,N'-bis-(benzoylacetone)propylenediimine)copper(II) complex, [Cu(phacac2pn)] (2) and (N,N’-bis-(trifluoroacetylacetone)propylenediimine)copper(II) complex, [Cu(tfacac2pn)] (3), were used to investigate the interactions with calf thymus DNA (ct-DNA) and bovine serum albumin (BSA) using the electronic absorption and spectroscopic fluorescence methods. UV-Vis absorption studies showed that studied complexes interact with DNA molecule and exhibit moderate binding affinity. Fluorescence studies of complexes 1-3 also showed a possibility for DNA intercalation as well as a relatively high binding ability ...Aleksandar Mijatović, Angelina Caković, Aleksandar Lolić, Snezana Sretenović, Marko Živanović, Dragana Seklić, Jovana Bogojeski, Biljana Petrović. "DNA/BSA interactions and cytotoxic studies of tetradentate N,N,O,O Schiff base copper(II) complexes" in Journal of the Serbian Chemical Society (2023). https://doi.org/10.2298/JSC230614063M
-
An aproach to Implementation of blended learning in a university setting
... all forms of learning and teaching supported by electronic devices, the opportunities offered by Internet and Web are now prevailing. Hence the term Web-based learning or Web- based education is now also frequently encountered. This concept encompasses all aspects and processes of education ...
... In order to help students in reading scientific articles and texts in English, and finding appropriate translations in Serbian for specific terms, we have developed an electronic dictionary of basic GIS terms. Besides the English and the Serbian term, each dictionary entry contains a short ...
... multitutor ontology-based learning environments, International Journal of Continuing Engineering Education and Lifelong Learning, Vol.14, No.3, pp.229–245, 2004. View publication statsView publication stats http://geoliss.ekoplan.gov.rs/term� https://www.researchgate.net/publication/284680651 ...Ivan Obradović, Ranka Stanković, Olivera Kitanović, Jelena Prodanović . "An aproach to Implementation of blended learning in a university setting" in Proceedings of the Second International Conference on e-Learning, eLearning 2011, September 2011, Belgrade, Serbia, Belgrade : Belgrade Metropolitan University (2011)
-
An Italian-Serbian Sentence Aligned Parallel Literary Corpus
This article presents the construction and relevance of an Italian-Serbian sentence-aligned parallel corpus, delving into the aligned sentences in order to facilitate effective translation between the two languages. The parallel corpus serves as a valuable resource for language experts, researchers, and language enthusiasts, fostering a deeper understanding of linguistic nuances and cultural expressions. By bridging the gap between Serbian and Italian, this corpus opens new avenues for cross-cultural communication and collaboration, and ultimately contributes to the improvement of language-related ...Saša Moderc, Ranka Stanković, Aleksandra Tomašević, Mihailo Škorić. "An Italian-Serbian Sentence Aligned Parallel Literary Corpus" in Review of the National Center for Digitization, Belgrade : Faculty of Mathematics, University of Belgrade (2023). https://doi.org/10.5281/zenodo.11203388
-
Old or New, We Repair, Adjust and Alter (Texts)
Cvetana Krstev, Ranka Stanković (2020)U ovom radu predstavljamo kako se e-rečnici i kaskade transduktora konačnih stanja implementirani u alatu Unitex mogu koristiti za rešavanje tri problema transformacije teksta: ispravljanje tekstova nakon OCR-a, vraćanje dijakritičkih znakova i prebacivanje između različitih jezičkih varijanti.ispravka teksta, OCR greške, restauracija dijakritika , jezičke varijante, elektronski rečnik, transduktori konačnih stanja... Publishers Inc., 2002 Krstev, Cvetana. Processing of Serbian – Automata, Texts and Electronic dictionaries. Faculty of Philology, University of Belgrade, 2008 Krstev, Cvetana, Ranka Stanković and Duško Vitas. “Knowledge and Rule- Based Diacritic Restoration in Serbian”. In Proceedings of the Third Inter- ...
... presented in sections 2–4 use similar re- sources and are built using similar solutions: – Electronic dictionaries are used to detect words that are candidates for change and to offer possible corrections. They are also used to build spe- cial dictionaries for solving some concrete problems (diacritic restoration ...
... second cascade as strict, relaxed or somewhere in between. A similar approach that also relies on electronic dictionaries and FSTs implemented in Unitex was used for vowel restoration in Arabic (Neme and Infotheca Vol. 19, No. 2, December 2019 77 Krstev C., Stanković R., “Old or new, we repair . ...Cvetana Krstev, Ranka Stanković. "Old or New, We Repair, Adjust and Alter (Texts)" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.3