Претрага
108 items
-
Digital Library From A Domain Of Criminalistics As A Foundation For A Forensic Text Analysis
U ovom radu predstavljen je model koji omogućava prikupljanje, pripremu, opis metapodataka, upravljanje i eksploataciju, uključujući pretragu punog teksta dokumenata iz domena kriminalistike napisanih na srpskom jeziku. Predloženi pristup primenjuje se na veb portalu koji sakuplja različite tekstove nastale iz časopisa Akademije za kriminalistiku i policijske studije, Krivičnog zakona Srbije, konferencija „Tara“ i „Reiss“, kao i iz nekih doktorskih disertacija vezanih za ovu oblast istraživanje. Nakon obrade teksta, korpus koji sadrži preko 5500 stranica običnog teksta, kreiran je i ...... semantic text search expansion. The paper outlines possibilities for further use and analysis on a digital library as a corpus, annotation, tagging, document classification and clustering, as well as sentiment analysis with first results in that direction. Keywords: Omeka, WordNet, full text search ...
... this paper is available at http://master-kpa.rgf.rs/ for search and browse public use and editing management authorized use. The digital library document collection is accessible through user friendly application, that is organised in several categories: Journal for criminalistics and right, Archibald ...
... keywords, subject headings, abstract etc.; structural – describe types, versions and links between digital objects (e.g. connect the original document and all its versions, whereby include information about versions and the information about latest change in them etc.); administrative – contain ...Dalibor Vorkapić, Aleksandra Tomašević, Miljana Mladenović, Ranka Stanković, Nikola Vulović. "Digital Library From A Domain Of Criminalistics As A Foundation For A Forensic Text Analysis" in International Scientific Conference “Archibald Reiss Days” Thematic Conference Proceedings Of International Significance, Belgrade, 7-9 November 2017, Academy Of Criminalistic And Police Studies Belgrade (2017)
-
Keyword Extraction from Parallel Abstracts of Scientific Publications
... SBKE method · Parallel abstracts 1 Introduction The task of keyword extraction is to automatically identify a set of terms that best describes the document [1,2]. Keyword extraction can be a demanding task, especially when the aim is keyword extraction from bilingual or multilingual tex- tual sources ...
... collection was treated as a single text, while for the research presented in this paper, the text process- ing and analysis is performed per each text document in the collection. SBKE method does not include calculation of C-Value, T-Score, LLR, and Keyness, it follows the procedure described in Subsect. ...
... statistical information incor- porated into the network structure the expected outcome is to achieve better performance on larger texts and on the whole document collection. In this case, SBKE proved correctly also on shorter texts. This outcome requires deeper fur- ther investigation which we plan to address ...Slobodan Beliga, Olivera Kitanović, Ranka Stanković, Sanda Martinčić-Ipšić . "Keyword Extraction from Parallel Abstracts of Scientific Publications" in Sematic Keyword-Based Search on Structured Data Sources - Third International KEYSTONE Conference, IKC 2017 Gdańsk, Poland, September 11–12, 2017 Revised Selected Papers and COST Action IC1302 Reports, Springer (2017)
-
Towards Sustainable Management of Transboundary Hungarian-Serbian Aquifer
Zoran Stevanović, Peter Kozák, Milojko Lazić, Janos Szanyi, Dušan Polomčić, Balazs Kovács, Jozsef Török, Saša Milanović, Bojan Hajdin, Petar Papić (2011)... Pallas, P. (2001) Internationally Shared (Tiansboundary) Aquifer Resources Management: Their Significance and Sustainable Management. A Framework Document, in IHP-VI, IHP Series on Groundwater No. L, UNESCO, Paris. Available at http: //unesdoc.unesco. org/images/00 1 2 / 001243 I t243s6e.Pd{ 3 Vadasz ...
... Josipovid, J.,Zogovi{D.' I(omatina, M., Stevanovi{ 2., Diokt{ I. and Luk16, V. (2005) ICPDR Roof Report {or 2004,lnstitute WM "iarosiav Cerni", Document of MinistrY of Agrlculture Forest and Wateq Belgrade. 7 Soro, A., Dimki6, M. and |osiPovi6, ]' (1997) Hydrogeological investigations related to ...Zoran Stevanović, Peter Kozák, Milojko Lazić, Janos Szanyi, Dušan Polomčić, Balazs Kovács, Jozsef Török, Saša Milanović, Bojan Hajdin, Petar Papić. "Towards Sustainable Management of Transboundary Hungarian-Serbian Aquifer" in Transboundary Water Resources Management - A Multidisciplinary Approach, Weinheim, Germany : Wiley-VCH (2011): 143-149
-
Development of terminological resources for expert knowledge: a case study in mining
Ljiljana Kolonja, Ranka Stanković, Ivan Obradović, Olivera Kitanović, Aleksandar Cvjetić. "Development of terminological resources for expert knowledge: a case study in mining" in Knowledge Management Research & Practice, Palgrave Macmillan (2015). https://doi.org/10.1057/kmrp.2015.10
-
Using Metadata For Content Indexing Within An OER Network
Ranka Stanković, Olivera Kitanović, Ivan Obradović, Roberto Linzalone, Giovanni Schiuma, Daniela Carlucci (2014)... layout of the resource in terms of how the information contained in the resource is organized. It indicates whetherit is an electronic document, paper only document, slide(s), website, cd-rom/dvd, audio, or video. Educational data, taken from the LOM standard, suggest the auditorium the resource ...
... indicates the degree to which the learner can influence the aspect or behaviour of the resource. Value for this field can be "very low", fora document intended for printing; "low",a video clip with play and pause controls; "medium",a hypertext; "high", a lesson with multiple-choice exercises providing ...
... object. The attributesin the Rightscategory are publisher, rights and cost.Publisher is the individual, group, or organization named in the document as being responsible for that document’s publication, distribution, issuing, or release. Rights includes information about various property ...Ranka Stanković, Olivera Kitanović, Ivan Obradović, Roberto Linzalone, Giovanni Schiuma, Daniela Carlucci. "Using Metadata For Content Indexing Within An OER Network" in Proceedings of the Fifth International Conference on e-Learning, eLearning 2014, September 2014, Belgrade, Serbia, Belgrade : Belgrade Metropolitan University (2014)
-
Bilingual lexical extraction based on word alignment for improving corpus search
Jelena Andonovski, Branislava Šandrih, Olivera Kitanović. "Bilingual lexical extraction based on word alignment for improving corpus search" in The Electronic Library, Emerald (2019). https://doi.org/10.1108/EL-03-2019-0056
-
An Integrated Environment for Management and Exploitation of Linguistic Resources
Ranka Stanković, Ivan Obradović (2009)... XML, TXT and HTML, can subse- quently be generated from the TMX document filtered in this way. Fig. 7 Aligned segments with highlighted forms of words corresponding to a bilingual query Fig. 7 depicts a HTML document in WS4QE with seg- ments containing word forms from the query for ...
... query can lead to an increase of irrelevant documents in the query result, thus reducing precision, which represents the ratio of relevant document obtained to the total number of documents obtained. In view of this trade-off between recall and precision, the words or strings that are used ...
... were included in query. B. Aligned text search When a bilingual query is applied to an aligned text, WS4QE generates a filtered aligned document in TMX for- mat. Namely, based on the expansion of the query, which can be morphological and/or semantic, segments that con- tain one of the ...Ranka Stanković, Ivan Obradović. "An Integrated Environment for Management and Exploitation of Linguistic Resources" in Proceedings of the International Multiconference on Computer Science and Information Technology, Computational Linguistics – Applications Workshop (CLA09), Mrągowo, Poland, October 2009, Piscataway : IEEE (2009)
-
Advantages and challenges in presenting mathematical content using EDX platform
... mathematic formula (Image 5). MathJax presents a cross-browser JavaScript library that displays mathematical notation in web browsers using LaTex document preparation system. Image 5: Example of tasks editing According to [5] edX-BAEKTEL platform presents a very good environment for ...
... exist. A prototype can be WikiMir - mathematics information retrieval system, which is based on keyword, structure and importance of formulae in a document [10]. For such search adequate resources as mathematical term bases are needed. According to [11] there is a great difference between natural ...Marija Radojičić, Ivan Obradović, Ranka Stanković, Olivera Kitanović, Roberto Linzalone. "Advantages and challenges in presenting mathematical content using EDX platform" in The Seventh International Conference on e-Learning (eLearning-2016), Belgrade : Metropolitan University (2016)
-
An Italian-Serbian Sentence Aligned Parallel Literary Corpus
This article presents the construction and relevance of an Italian-Serbian sentence-aligned parallel corpus, delving into the aligned sentences in order to facilitate effective translation between the two languages. The parallel corpus serves as a valuable resource for language experts, researchers, and language enthusiasts, fostering a deeper understanding of linguistic nuances and cultural expressions. By bridging the gap between Serbian and Italian, this corpus opens new avenues for cross-cultural communication and collaboration, and ultimately contributes to the improvement of language-related ...Saša Moderc, Ranka Stanković, Aleksandra Tomašević, Mihailo Škorić. "An Italian-Serbian Sentence Aligned Parallel Literary Corpus" in Review of the National Center for Digitization, Belgrade : Faculty of Mathematics, University of Belgrade (2023). https://doi.org/10.5281/zenodo.11203388
-
Softverski alati za korišćenje resursa za srpski jezik
Ivan Obradović, Ranka Stanković (2008)... contained in the expanded query was found. From a TMX document filtered in this way, out- put documents can be further generated in differ- ent formats, such as XML, TXT and HTML, as we have already mentioned. Figure 13 depicts a HTML document in WS4QE with selected segments where one of the forms ...
... the query can lead to an increase of irrelevant documents in the query result, thus reducing precision, which rep- resents the ration of relevant document obtained to the total number of documents obtained. In vies of this trade-off between recall and preci- sion, the words or strings that are used ...
... obtained from Google contain only pages in Cyrillic. When such a bilingual query is applied to an aligned text, WS4QE generates a filtered aligned document in TMX format. Based on the expan- sion of the bilingual query, which can be mor- phological and/or semantic, segments are ex- tracted from aligned ...Ivan Obradović, Ranka Stanković. "Softverski alati za korišćenje resursa za srpski jezik" in INFOteka: časopis za informatiku i bibliotekarstvo, Belgrade, Serbia : Zajednica biblioteka univerziteta u Srbiji (2008)
-
Automatic construction of a morphological dictionary of multi-word units
The development of a comprehensive morphological dictionary of multi-word units for Serbian is a very demanding task, due to the complexity of Serbian morphology. Manual production of such a dictionary proved to be extremely time-consuming. In this paper we present a procedure that automatically produces dictionary lemmas for a given list of multi-word units. To accomplish this task the procedure relies on data in e-dictionaries of Serbian simple words, which are already well developed. We also offer an evaluation ...electronic dictionary, Serbian, morphology, inflection, multiwordn units, noun phrases, query expansion... these documents is presented in Figure 1. Our strategy consists of two XML documents, one for MWU nouns, and the other for MWU adjectives. Each XML document consists of a sequence of rules that are grouped by the number of components in MWUs. Each rule states the conditions that a MWU and its components ...
... conditions, where specific conditions are simply additions to general conditions. One rule will illustrate this: Fig. 1. The XML Schema for a strategy document
... 59 5 9 3 25 85 8 8 4 14 50 5 5 5 6 54 2 2 6 4 29 7 1 9 Total 76 286 20 24 The rules are applied in the sequence in which they appear in the XML document, which means that if more than one rule can apply for a particular MWU, then more MWU candidate lemmas will be offered in the order of the application ...Cvetana Krstev, Ranka Stanković, Ivan Obradović, Duško Vitas, Miloš Utvić. "Automatic construction of a morphological dictionary of multi-word units" in Lecture Notes in Computer Science 6233, Advances in Natural Language Processing, Proceedings of the 7thInternational Conference on NLP, IceTAL 2010, Reykjavik, Iceland, August 2010, Springer (2010): 226-237. https://doi.org/10.1007/978-3-642-14770-8_26
-
Classification of mining waste landfills according to legislation in Serbia
... uri=CELEX%3A32006L0021 https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=CELEX%3A32006L0021 https://fdocuments.in/document/icold-small-dams-sept-2011.html https://fdocuments.in/document/icold-small-dams-sept-2011.html https://nzsold.org.nz/wpcontent/uploads/2019-/10/nzsold_dam_safety_guidelines-may-2015-1 ...
... Large Dams (ICOLD). Small dams design, surveillance, and rehabilitation – Bulletin No 143, ICOLD, Paris, 2011, dostupno na: https://fdocuments.in/document/icold-small-dams- sept-2011.html, pristupljeno 2021-05-28. [14] NZSOLD: New Zealand Dam Safety Guidelines – Objectives and Principles. Institution ...Dragana Nišić, Uroš Pantelić, Nikoleta Aleksić, Neda Nišić. "Classification of mining waste landfills according to legislation in Serbia" in Tehnika, Centre for Evaluation in Education and Science (CEON/CEES) (2021). https://doi.org/10.5937/tehnika2105575N
-
Legal framework for recultivation of degraded areas caused by mining exploitation
Radmila Gaćina (2023)Prilikom površinske i podzemne eksploatacije uglja eksploatacioni sistemi oštećuju veće ili manje površine zemljišta. Iskustva pokazuju da su oštećenja zemljišta znatno veća kod površinske eksploatacije, gde je prostora degradiran ne samo u konturi rudnika i okoline, već se promene takođe dešavaju u prirodnom okruženju. U članku su predstavljeni rezultati studije zakonskih okvira rekultivacije u nekim zemljama sveta i u Srbiji. Praktično iskustvo rekultivacije pokazalo je da iskorišćenje i devastacija tokom rudarske aktivnosti predstavljaju opasnost ne samo za pogođena ...... 2006/21/EC, the Best Available Techniques Reference Document for Management of Waste from Extractive Industries offers more than 700 pages of examples for good practice mining operations and best available techniques. Moreover, the Reference Document on BAT is not legally binding and the recommendation ...Radmila Gaćina. "Legal framework for recultivation of degraded areas caused by mining exploitation" in Underground mining engineering, Belgrade : University of Belgrade - Faculty of Mining and Geology (2023). https://doi.org/10.5937/podrad2342027G
-
Two approaches to compilation of bilingual multi-word terminology lists from lexical resources
In this paper, we present two approaches and the implemented system for bilingual terminology extraction that rely on an aligned bilingual domain corpus, a terminology extractor for a target language, and a tool for chunk alignment. The two approaches differ in the way terminology for the source language is obtained: the first relies on an existing domain terminology lexicon, while the second one uses a term extraction tool. For both approaches, four experiments were performed with two parameters being ...Branislava Šandrih, Cvetana Krstev, Ranka Stanković. "Two approaches to compilation of bilingual multi-word terminology lists from lexical resources" in Natural Language Engineering, Cambridge University Press (CUP) (2020). https://doi.org/10.1017/S1351324919000615
-
A Mathematical Learning Environment Based on Serbian Language Resources
In recent years, in line with ever growing usage of Information technology, the learning environments are changing. The amount of available learning materials in various forms has increased. These new environments demand comprehensive learning systems, which enable management of the learning corpus with special attention paid to relevant lexical resources. In this paper we present the concept of a Mathematical Learning Environment in Serbian (MLES), which is based on a corpus of mathematical materials and various lexical resources, enabling ...... related tasks including query expansion, they need further improvement for management, named entity recognition, terminology extraction, and document indexing of mathematical content. In the next section we give and overview of the MLES system, followed by a section outlining the main issues ...
... the languages in which terms are entered, and the possibility that, depending on user needs, the term description is interpreted as a Latex document. On this page, akin to Browse page, on the left side of the screen a hierarchical view of the terms is available. However, unlike the browse ...Radojičić Marija, Obradović Ivan, Stanković Ranka, Utvić Miloć, Kaplar Sebastijan. "A Mathematical Learning Environment Based on Serbian Language Resources" in Proceedings of the 7th International Scientific Conference Technics and Informatics in Education, Faculty of Technical Sciences, Čačak (2018)
-
An Approach to Efficient Processing of Multi-Word Units
Efficient processing of Multi-Word Units in the course of development of morphological MWU dictionaries is not easy to achieve, especially when languages with complex morphological structures are concerned, such as Serbian. Manual development of this type of dictionaries is a tedious and extremely slow process. To alleviate this problem we turned to our multipurpose software tool, dubbed LeXimir, in the production of lemmas for e-dictionaries of multi-word units. In addition to that, we developed a procedure aimed at making ...... singular and the plural form in all possible combi- nations, e.g. analiza dokumenta/analiza dokumenata/analize dokumenta/analize dokumenata ‘document(s) analysis/document(s) analyses’. Only the MWUs belonging to the first listed group belong to the super-class N2X and they require inflectional information ...
... MWUs with 5 components, and 5 rules to MWUs with 6 and more components. 4.3 Software implementation To manipulate the strategy in the form of a XML document our tool LeXimir relies on W3C standard languages Xquery and XSLT supported by .Net. The user interface for automatic production of DELAC lemmas ...
... in order to apply this functionality to a new language it would be necessary to develop a new language- dependent strategy, that is, a new XML document. It is also worth mentioning that the system can be easily modified to work with formats of simple words dictionar- ies other than those supported ...Cvetana Krstev, Ivan Obradović, Ranka Stanković, Duško Vitas. "An Approach to Efficient Processing of Multi-Word Units" in Computational Linguistics - Applications, Studies in Computational Intelligence 458 no. 458, Berlin Heidelberg : Springer-Verlag (2013): 109-129. https://doi.org/10.1007/978-3-642-34399-5_6
-
A survey of greenhouse gases production in central European lignites
Anna Pytlak, Anna Szafranek-Nakonieczna, Weronika Goraj, Izabela Śnieżyńska, Aleksandra Krążała, Artur Banach, Ivica Ristović, Mirosław Słowakiewicz, Zofia Stępniewska (2021)... Guidelines for National Green- house Gas Inventories. Volume 2. EIA, 2020. Trends and expectations surrounding the outlook for energy markets [WWW document]. URL. www.eia.gov (accessed 12.9.20). Fabianska, M., 2007. Organic Geochemistry of Brown Coals From the Selected Polish Basin (in Polish). University ...
... Science and Technology. Springer, New York, pp. 2173–2194 https://doi.org/10.1007/978-1-4419-0851-3_161. IEA, 2020. World energy outlook 2020 [WWW Document]. URL. https://www.iea.org/re- ports/world-energy-outlook-2020. International Committee for Coal Petrology, 1993. International Handbook of Coal ...
... declining coal production. J. Clean. Prod. 256, 120489. https://doi.org/10.1016/j.jclepro.2020. 120489. Knoema, 2020. Production of lignite coal [WWW Document]. World data atlas. URL. https://knoema.com/atlas/topics/Energy/Coal/Production-of-lignite-coal (accessed 3.23.21). Lian, Y., Yang, Y., Guo, J., ...Anna Pytlak, Anna Szafranek-Nakonieczna, Weronika Goraj, Izabela Śnieżyńska, Aleksandra Krążała, Artur Banach, Ivica Ristović, Mirosław Słowakiewicz, Zofia Stępniewska. "A survey of greenhouse gases production in central European lignites" in Science of The Total Environment, Elsevier (2021). https://doi.org/10.1016/j.scitotenv.2021.149551
-
Towards Semantic Interoperability: Parallel Corpora as Linked Data Incorporating Named Entity Linking
U radu se prikazuju rezultati istraživanja vezanih za pripremu paralelnih korpusa, fokusirajući se na transformaciju u RDF grafove koristeći NLP Interchange Format (NIF) za lingvističku anotaciju. Pružamo pregled paralelnog korpusa koji je korišćen u ovom studijskom slučaju, kao i proces označavanja delova govora, lematizacije i prepoznavanja imenovanih entiteta (NER). Zatim opisujemo povezivanje imenovanih entiteta (NEL), konverziju podataka u RDF, i uključivanje NIF anotacija. Proizvedene NIF datoteke su evaluirane kroz istraživanje triplestore-a korišćenjem SPARQL upita. Na kraju, razmatra se povezivanje Linked ...paralelni korpusi, povezivanje imenovanih entiteta, prepoznavanje imenovanih entiteta, NER, NEL, povezani podaci, NIF, VikipodaciRanka Stanković, Milica Ikonić Nešić, Olja Perisic, Mihailo Škorić, Olivera Kitanović. "Towards Semantic Interoperability: Parallel Corpora as Linked Data Incorporating Named Entity Linking" in Proceedings of the 9th Workshop on Linked Data in Linguistics @ LREC-COLING 2024, Turin, 20-25 May 2024, ELRA and ICCL (2024)
-
Building Terminological Resources in an e-Learning Environment
... Finally, there is a Multimedija class used for implementing illustrations: pictures, formulas in the form pictures, or any other relevant multimedia content. Multimedia documents proper are not entered into the resource database. Instead, they are represented by their locations on the sever (URIs) or ...
... format is growing with the rapidly expanding availability of various texts on the web. First and foremost, they are indispensable in information an document retrieval systems. In addition to monolingual resources, machine translation systems and cross- language information retrieval emphasize the need ...
... structure through more elaborate semantic relations such as holonymy/meronymy (part of) and the like, thesauruses are primarily aimed at facilitating document retrieval and achieving consistency in indexing documents stored in a database. Hence, they provide assistance to persons who associate terms or ...Ranka Stanković, Ivan Obradović, Olivera Kitanović, Ljiljana Kolonja. "Building Terminological Resources in an e-Learning Environment" in Proceedings of the Third International Conference on e-Learning, eLearning-2012, September 2012, Belgrade, Serbia, Belgrade : Belgrade Metropolitan University (2012)
-
The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines
In this paper we present how resources and tools developed within the Human Language Technology Group at the University of Belgrade can be used for tuning queries before submitting them to a web search engine. We argue that the selection of words chosen for a query, which are of paramount importance for the quality of results obtained by the query, can be substantially improved by using various lexical resources, such as morphological dictionaries and wordnets. These dictionaries enable semantic ...LR web services, MultiWord Expressions & Collocations, Information Extraction, Information Retrieval... When searching with the two constituent keywords beli ‘white’ AND luk 219 ‘onion’ the search engine would typically return an irrelevant document based on the following content: Sastojci za 10 porcija: 3 glavice crnog luka, 1 šoljica ulja, 1/2 čaša belog vina, 1 čaša soka od paradajza ...
... luk” then inflected forms of this multi-word term are not taken into account, and this reduces recall. In this case the aforementioned irrelevant document would be omitted, but so would be many relevant results, for instance Gambori u maslacu sa belim lukom (Shrimps on butter with garlic (in ...Krstev Cvetana, Stanković Ranka, Vitas Duško, Obradović Ivan. "The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines" in LREC 2008: Conference on Language Resources and Evaluation, Marrakesh, Morocco, May 2008, European Language Resources Association (ELRA) (2008)