Претрага
1036 items
-
Proširivanje upita zasnovano na leksičkim resursima
U radu je opisano kako se leksički resursi za srpski jezik i softverski alati, razvijeni u okviru Grupe za jezičke tehnologije Univerziteta u Beogradu, mogu koristiti za unapređenje postavljanja upita. Rezultati pretrage mogu biti značajno unapređeni korišćenjem različitih leksičkih resursa, kakvi su morfološki rečnici i semantičke mreže. Izloženi pristup može se iskoristiti i u Sistemu naučnih, tehnoloških i poslovnih informacija, jer je efikasno pretraživanje ovog dragocenog resursa, imajući u vidu njegovu heterogenost i obim, kao i preovladavajući tekstualni sadržaj, ...Ranka Stanković, Ivan Obradović, Cvetana Krstev. "Proširivanje upita zasnovano na leksičkim resursima" in SNTPI 09 - Naučno-stručni skup Sistem naučnih, tehnoloških i poslovnih informacija, Beograd 19. i 20. jun 2009, Beograd : Fakultet informacionih tehnologija (2009)
-
Indexing of textual databases based on lexical resources: A case study for Serbian
In this paper we describe an approach to improvement of information retrieval results for large textual databases by pre-indexing documents using bag-of-words and Named Entity Recognition. The approach was applied on a database of geological projects financed by the Republic of Serbia in the last half century. Each document within this database is described by metadata, consisting of several fields such as title, domain, keywords, abstract, geographical location and the like. A bag of words was produced from these ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Olivera Kitanović. "Indexing of textual databases based on lexical resources: A case study for Serbian" in Semantic Keyword-based Search on Structured Data Sources : First COST Action IC1302 International KEYSTONE Conference, IKC 2015, Coimbra, Portugal, September 8-9, 2015. Revised Selected Papers, Springer (2015). https://doi.org/10.1007/978-3-319-27932-9_15
-
Using English Baits to Catch Serbian Multi-Word Terminology
In this paper we present the first results in bilingual terminology extraction. The hypothesis of our approach is that if for a source language domain terminology exists as well as a domain aligned corpus for a source and a target language, then it is possible to extract the terminology for a target language. Our approach relies on several resources and tools: aligned domain texts, domain terminology for a source language, a terminology extractor for a target language, and a ...aligned texts, word alignment, terminology extraction, electronic dictionaries, morphological inflection... of inflected forms as in the real experiment setting and BLEU score increased to 26.21. 12Statistical Machine Translation toolkit can be found at https://github.com/moses-smt/giza-pp 13How phrase translation probabilities are determined can be read in more details at http://www.statmt.org/moses/ ? ...
... learners and delivers improved prediction accuracy. The instances pre- 16The whole set of extracted features and the classifier it- self are available on https://github.com/Branislava/ domain_terminology_extraction. dicted correctly are given a lower weight and the ones miss- classified are weighted higher ...Cvetana Krstev, Branislava Šandrih, Ranka Stanković. "Using English Baits to Catch Serbian Multi-Word Terminology" in Proceedings of the 11th International Conference on Language Resources and Evaluation, LREC 2018, Miyazaki, Japan, May 7-12, 2018, European Language Resources Association (ELRA) (2018)
-
Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++
Branislava Šandrih, Ranka Stanković (2020)U nauci, industriji i mnogim istraživačkim oblastima, terminologija se brzo razvija. Najčešće, jezik koji je „lingua franca“ za većinu ovih oblasti je engleski. Kao posledica toga, za mnoga polja termini domena su koncipirani na engleskom, a kasnije se prevode na druge jezike. U ovom radu predstavljamo pristup za automatsko izdvajanje dvojezične terminologije za englesko-srpski jezički par koji se oslanja na usaglašeni dvojezični korpus domena, ekstraktor terminologije za ciljni jezik i alat za usklađivanje delova. Ispitujemo performanse metode na domenu ...... 57–69 Krstev, Cvetana. Processing of Serbian. Automata, Texts and Electronic Dictionaries. Faculty of Philology of the University of Belgrade, 2008. https://hal.archives-ouvertes.fr/hal-01011806 Krstev, Cvetana, Branislava Šandrih, Ranka Stanković and Miljana Mlade- nović. “Using English Baits to Catch ...
... Alun Preece, Nick Francis and Glyn Elwyn. “FlexiTerm: a Flexible Term Recognition Method”. Journal of Biomed- ical Semantics Vol. 4, no. 1 (2013): 27. https://doi.org/10.1186/ 2041-1480-4-27 Stanković, Ranka, Cvetana Krstev, Nikola Vulović and Biljana Lazić. “Bib- lisha”, 2014 Stanković, Ranka, Cvetana ...Branislava Šandrih, Ranka Stanković. "Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.6
-
SrpELTeC: A Serbian Literary Corpus for Distant Reading
U članku je predstavljen SrpELTeC, korpus razvijen u okviru akcije COST Distant Reading for European Literary History (CA16204). Svi romani u SrpELTeC-u su odabrani, pripremljeni i obeleženi korišćenjem zajedničkih principa uspostavljenih za sve jezičke zbirke u Evropskoj zbirci književnog teksta (ELTeC). Navedeni su izazovi i rešenja u pripremi SrpELTeC od nule. Svi romani su ručno kodirani u TEI sa bogatim metapodacima i strukturnim napomenama. Automatska anotacija je uključivala POS-označavanje, lematizaciju i imenovane entitete, oslanjajući se na resurse za obradu ...digital humanities, Serbian literature, text corpora, distant reading , linked data, named entity recognition, text analyticsRanka Stanković, Cvetana Krstev, Duško Vitas. "SrpELTeC: A Serbian Literary Corpus for Distant Reading" in Primerjalna književnost, Research Centre of the Slovenian Academy of Sciences and Arts (2024). https://doi.org/10.3986/pkn.v47.i2.03
-
Investigation of quantitative indicators of anisotropy based on longitudinal elastic waves propagation velocities with uncertainty
Miloš Gligorić, Jelena Majstorović, Suzana Lutovac. "Investigation of quantitative indicators of anisotropy based on longitudinal elastic waves propagation velocities with uncertainty" in Underground Mining Engineering, Belgrade : University of Belgrade - Faculty of Mining and Geology (2018). https://doi.org/10.5937/PodRad1832001G
-
Production of morphological dictionaries of multi-word units using a multipurpose tool
The development of a comprehensive morphological dictionary of multi-word units for Serbian is a very demanding task, due to the complexity of Serbian morphology. Manual production of such a dictionary proved to be extremely time-consuming. In this paper we present a procedure that automatically produces dictionary lemmas for a given list of multi-word units. To accomplish this task the procedure relies on data in e-dictionaries of Serbian simple words, which are already well developed. We also offer an evaluation ...electronic dictionary, Serbian, morphology, inflection, multi-word units, noun phrases, query expansionRanka Stanković, Ivan Obradović, Cvetana Krstev, Duško Vitas. "Production of morphological dictionaries of multi-word units using a multipurpose tool" in Proceedings of the Computational Linguistics-Applications Conference, October 2011, Jachranka, Poland, Jachranka, Poland : PTI - Polish Information Processing Society (2011)
-
A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian
Uvredljivi govor na društvenim medijima, uključujući psovke, pogrdni govor i govor mržnje, dostigao je nivo pandemije. Sistem koji bi bio u stanju da detektuje takve tekstove mogao bi da pomogne da internet i društveni mediji postanu bolji virtuelni prostor sa više poštovanja. Istraživanja i komercijalna primena u ovoj oblasti do sada su bili fokusirani uglavnom na engleski jezik. Ovaj rad predstavlja rad na izgradnji AbCoSER-a, prvog korpusa uvredljivog govora na srpskom jeziku. Korpus se sastoji od 6.436 ručno označenih ...... society in 5 https://hatebase.org/ The world’s largest structured repository of regionalized, multilingual hate speech 6 https://www.w3.org/2019/09/lexicog/ 7 https://www.w3.org/2016/05/ontolex/ 8 https://nexuslinguarum.eu/ LDK 2021 https://hatebase.org/ https://www.w3.org/2019/09/lexicog/ https://www.w3 ...
... 33–36, 2018. https://rm.coe.int/hf25-hate-speech-serbian-media-eng/1680a2278e https://rm.coe.int/hf25-hate-speech-serbian-media-eng/1680a2278e https://doi.org/10.1109/NITC48475.2019.9114528 https://doi.org/10.1109/NITC48475.2019.9114528 https://doi.org/10.18485/infotheca.2019.19.2.4 https://doi.org/10 ...
... ac.rs https://orcid.org/0000-0002-7621-5847 mailto:ranka.stankovic@rgf.bg.ac.rs https://orcid.org/0000-0001-5123-6273 mailto:cvetana@matf.bg.ac.rs http://poincare.matf.bg.ac.rs/~cvetana https://orcid.org/0000-0003-3328-9392 mailto:branislava.sandrih@fil.bg.ac.rs https://branislava.github.io/ https://orcid ...Danka Jokić, Ranka Stanković, Cvetana Krstev, Branislava Šandrih. "A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian" in 3rd Conference on Language, Data and Knowledge (LDK 2021), MDPI AG (2021). https://doi.org/10.4230/OASIcs.LDK.2021.13
-
An aproach to Implementation of blended learning in a university setting
... Education and Lifelong Learning, Vol.14, No.3, pp.229–245, 2004. View publication statsView publication stats http://geoliss.ekoplan.gov.rs/term� https://www.researchgate.net/publication/284680651 ...Ivan Obradović, Ranka Stanković, Olivera Kitanović, Jelena Prodanović . "An aproach to Implementation of blended learning in a university setting" in Proceedings of the Second International Conference on e-Learning, eLearning 2011, September 2011, Belgrade, Serbia, Belgrade : Belgrade Metropolitan University (2011)
-
The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines
In this paper we present how resources and tools developed within the Human Language Technology Group at the University of Belgrade can be used for tuning queries before submitting them to a web search engine. We argue that the selection of words chosen for a query, which are of paramount importance for the quality of results obtained by the query, can be substantially improved by using various lexical resources, such as morphological dictionaries and wordnets. These dictionaries enable semantic ...LR web services, MultiWord Expressions & Collocations, Information Extraction, Information RetrievalKrstev Cvetana, Stanković Ranka, Vitas Duško, Obradović Ivan. "The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines" in LREC 2008: Conference on Language Resources and Evaluation, Marrakesh, Morocco, May 2008, European Language Resources Association (ELRA) (2008)
-
Towards translation of educational resources using GIZA++
... stats https://www.class-central.com/ http://tramooc.eu/ https://iversity.org/ http://www.statmt.org/moses/ https://blog.coursera.org/post/50452652317/coursera-partnering https://blog.coursera.org/post/50452652317/coursera-partnering https://www.coursera.org/ https://www.transifex.com/ https://elear ...
... course-translation https://elearningindustry.com/10-easy-steps-for-successful-elearning-course-translation https://elearningindustry.com/elearning-course-translation-workflow https://elearningindustry.com/elearning-course-translation-workflow http://www.metatexis.com/cat.htm https://www.researchgate ...
... from Top Universities, https://www.coursera.org/ , accessed June, 10th 2016 [8] Localization Platform for Translating Digital Content https://www.transifex.com/, accessed June, 10th 2016 [9]. eLearning Industry, 10 Easy Steps For Successful eLearning Course Translation https://elearningindustry. ...Ivan Obradović, Dalibor Vorkapić, Ranka Stanković, Nikola Vulović, Miladin Kotorčević. "Towards translation of educational resources using GIZA++" in The Seventh International Conference on e-Learning (eLearning-2016), September 2016, Belgrade : Metropolitan Univesity (2016)
-
Antioxidant and antimicrobial activity of some tetradentate Schiff bases and their Cu (II) complexes
Aleksandar Mijatović, Milan Nikolić, Snežana Spasić, Aleksandra N. Žerađanin, Kristina Joksimović, Aleksandar Lolić, Rada Baošić (2020)Šifove baze i njihovi Cu(II) kompleksi su poznati po svojoj biološkoj aktivnosti. U ovom radu proučavana je antibakterijska aktivnost protiv gram-negativnih sojeva Escherichia coli, Pseudomonas aeruginosa i Staphilococcus piogenes, kao i Gram-pozitivnih Staphilococcus piogenes i Pseudomonas aeruginosa, zajedno sa antifungalnim dejstvom protiv Musperg strainilusa, A. Takođe, tehnički jednostavni i brzi testovi kao što su ABTS, HORAC i ORAC korišćeni su za ispitivanje antioksidativne aktivnosti kako bi se uporedili dobijeni rezultati sa različitim tipovima testova. Uprkos tome što je princip ...Aleksandar Mijatović, Milan Nikolić, Snežana Spasić, Aleksandra N. Žerađanin, Kristina Joksimović, Aleksandar Lolić, Rada Baošić. "Antioxidant and antimicrobial activity of some tetradentate Schiff bases and their Cu (II) complexes" in XIII Conference of Chemists, Technologists and Environmentalists of Republic of Srpska, University in Banjaluka, Faculty of Technology (2020)
-
Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources
Large collections of textual documents represent an example of big data that requires the solution of three basic problems: the representation of documents, the representation of information needs and the matching of the two representations. This paper outlines the introduction of document indexing as a possible solution to document representation. Documents within a large textual database developed for geological projects in the Republic of Serbia for many years were indexed using methods developed within digital humanities: bag-of-words and named ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Olivera Kitanović. "Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources" in Trans. Computational Collective Intelligence - Lecture Notes in Computer Science 26, Springer (2017). https://doi.org/10.1007/978-3-319-59268-8_8
-
Two approaches to compilation of bilingual multi-word terminology lists from lexical resources
In this paper, we present two approaches and the implemented system for bilingual terminology extraction that rely on an aligned bilingual domain corpus, a terminology extractor for a target language, and a tool for chunk alignment. The two approaches differ in the way terminology for the source language is obtained: the first relies on an existing domain terminology lexicon, while the second one uses a term extraction tool. For both approaches, four experiments were performed with two parameters being ...Branislava Šandrih, Cvetana Krstev, Ranka Stanković. "Two approaches to compilation of bilingual multi-word terminology lists from lexical resources" in Natural Language Engineering, Cambridge University Press (CUP) (2020). https://doi.org/10.1017/S1351324919000615
-
An Approach to Efficient Processing of Multi-Word Units
Efficient processing of Multi-Word Units in the course of development of morphological MWU dictionaries is not easy to achieve, especially when languages with complex morphological structures are concerned, such as Serbian. Manual development of this type of dictionaries is a tedious and extremely slow process. To alleviate this problem we turned to our multipurpose software tool, dubbed LeXimir, in the production of lemmas for e-dictionaries of multi-word units. In addition to that, we developed a procedure aimed at making ...Cvetana Krstev, Ivan Obradović, Ranka Stanković, Duško Vitas. "An Approach to Efficient Processing of Multi-Word Units" in Computational Linguistics - Applications, Studies in Computational Intelligence 458 no. 458, Berlin Heidelberg : Springer-Verlag (2013): 109-129. https://doi.org/10.1007/978-3-642-34399-5_6
-
Optimisation of underground mine decline development system using genetic algorithm
Zoran Gligorić, Aleksandar Ganić, Rade Tokalić, Aleksandar Milutinović. "Optimisation of underground mine decline development system using genetic algorithm" in Underground mining engineering, Beograd : Univerzitet u Beogradu – Rudarsko–geološki fakultet (2014)
-
A Tool for Enhanced Search of Multilingual Digital Libraries of E-journals
This paper outlines the main features of Bibliša, a tool that offers various possibilities of enhancing queries submitted to large collections of TMX documents generated from aligned parallel articles residing in multilingual digital libraries of e-journals. The queries initiated by a simple or multiword keyword, in Serbian or English, can be expanded by Bibliša, both semantically and morphologically, using different supporting monolingual and multilingual resources, such as wordnets and electronic dictionaries. The tool operates within a complex system composed ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Aleksandra Trtovac, Miloš Utvić. "A Tool for Enhanced Search of Multilingual Digital Libraries of E-journals" in Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012, May 2012, Istanbul, Turkey, Istanbul, Turkey : European Language Resources Association (2012)
-
Rule-based Automatic Multi-word Term Extraction and Lemmatization
In this paper we present a rule-based method for multi-word term extraction that relies on extensive lexical resources in the form of electronic dictionaries and finite-state transducers for modelling various syntactic structures of multi-word terms. The same technology is used for lemmatization of extracted multi-word terms, which is unavoidable for highly inflected languages in order to pass extracted data to evaluators and subsequently to terminological e-dictionaries and databases. The approach is illustrated on a corpus of Serbian texts from ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac. "Rule-based Automatic Multi-word Term Extraction and Lemmatization" in Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016, Portorož, Slovenia, 23--28 May 2016, European Language Resources Association (2016)
-
Развој геолошког информационог система Републике Србије
Геолошки информациони систем Србије (ГеолИСС) је пројектован, првенствено, са намером ефикасног дигиталног архивирања геолошких и њима сродних података. У овом раду је приказана структура базе података као основа за развој геолошки конципираног ГИС-а. Нови, објектно орјентисани (О-О) начин моделирања омогућио је дефинисање самосталних типова објеката, хијерархијски повезаних кроз тополошке и друге релације, чиме је обезбеђена њихова медјусобна интеракција. Објектно оријентисано моделирање извршено је коришћењем унифицираног језика моделирања (UML) и CASE алата, кроз концептуални и логички ниво. Физички модел ће ...Бранислав Благојевић, Бранислав Тривић, Ненад Бањац, Ранка Станковић, Велизар Николић. "Развој геолошког информационог система Републике Србије" in 14. конгрес геолога Србије и Црне Горе са међународним учешћем, Нови Сад, 18-20. октобар 2005, Cpпско геолошко друштво и Caвeз геолошких друштава Србије и Црне Горе (2005)
-
Electronic Dictionaries - from File System to lemon Based Lexical Database
In this paper we discuss some well-known morphological descriptions used in various projects and applications (most notably MULTEXT-East and Unitex) and illustrate the encountered problems on Serbian. We have spotted four groups of problems: the lack of a value for an existing category, the lack of a category, the interdependence of values and categories lacking some description, and the lack of a support for some types of categories. At the same time, various descriptions often describe exactly the same ...Ranka Stanković, Cvetana Krstev, Biljana Lazić, Mihailo Škorić. "Electronic Dictionaries - from File System to lemon Based Lexical Database" in Proceedings of the 11th International Conference on Language Resources and Evaluation - W23 6th Workshop on Linked Data in Linguistics : Towards Linguistic Data Science (LDL-2018), LREC 2018, Miyazaki, Japan, May 7-12, 2018, European Language Resources Association (ELRA) (2018)