Претрага
99 items
-
The Dictionary of the Serbian Academy: from the Text to the Lexical Database
In this paper we discuss the project of digitization of the Dictionary of the Serbo-Croatian Standard and Vernacular Language. Scanning and character recognition were a particular challenge, since various non-standard character set encoding was used in the course of the almost 60-year long production of the dictionary. The first aim of the project was to formalize the micro-structure of the dictionary articles in order to parse the digitized text of and transform it into structured data stored in relational lexical database. This approach ...... испореди ‘compare’; 4. definitions are descriptive or referential (в. for види ‘see’), in rare cases synonyms; 5. definitions are supplemented by the lists of: 1. synonyms (after abbreviation син. for синоним ‘synonym’); 2. antonyms (after abbreviation супр. for супротан ‘antonym’); 3. related words; ...Ranka Stanković, Rada Stijović, Duško Vitas, Cvetana Krstev, Olga Sabo. "The Dictionary of the Serbian Academy: from the Text to the Lexical Database" in Proceedings of the XVIII EURALEX International Congress: Lexicography in Global Contexts, Ljubljana : Ljubljana University Press, Faculty of Arts (2018)
-
Medical Domain Document Classification via Extraction of Taxonomy Concepts from MeSH Ontology
Mihailo Škorić, Mauro Dragoni (2019)This paper is a result of a task that was presented to attendants of Keyword Search in Big Linked Data summer school, that was organized by Vienna University of Technology, under the Keystone COST action in the summer of 2017. It presents a specific approach to the classification via creation of minimal document surrogates based on the US National medical library’s MeSH ontology, which is derived from the Medical Subject Headings thesaurus. In a series of previously classified medically ...... in the form [A-Z][0-9][0-9]([.][0-9][0-9][0-9])*.8 A simple SPARQL query was used, with one ontology concept (mesh2016: D049916) inputted, and it lists all predicates and objects of the MeSH 2016 ontology triple, whose concept is a part of (Figure 3). The query is illustrated on the concept mesh2016:D049916 ...Mihailo Škorić, Mauro Dragoni. "Medical Domain Document Classification via Extraction of Taxonomy Concepts from MeSH Ontology" in Infotheca, Faculty of Philology, University of Belgrade (2019). https://doi.org/10.18485/infotheca.2019.19.1.3
-
The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines
In this paper we present how resources and tools developed within the Human Language Technology Group at the University of Belgrade can be used for tuning queries before submitting them to a web search engine. We argue that the selection of words chosen for a query, which are of paramount importance for the quality of results obtained by the query, can be substantially improved by using various lexical resources, such as morphological dictionaries and wordnets. These dictionaries enable semantic ...LR web services, MultiWord Expressions & Collocations, Information Extraction, Information Retrieval... The used log file thus gives a good insight in users’ queries. Many of the multi word queries are of no interest since they represent simple lists of key words, for instance Beograd, Gradska čistoća, privatizacija ‘Belgrade, City Waste Disposal, privatization’. It is not expected that the user ...Krstev Cvetana, Stanković Ranka, Vitas Duško, Obradović Ivan. "The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines" in LREC 2008: Conference on Language Resources and Evaluation, Marrakesh, Morocco, May 2008, European Language Resources Association (ELRA) (2008)
-
A necessary and sufficient condition for an algebraic integer to be a Salem number
Dragan Stankov (2019)We present a necessary and sufficient condition for a root greater than unity of a monic reciprocal polynomial of an even degree at least four, with integer coefficients, to be a Salem number. This condition requires that the minimal polynomial of some power of the algebraic integer has a linear coefficient that is relatively large. We also determine the probability that an arbitrary power of a Salem number, of certain small degrees, satisfies this condition.Algebraic integer, the house of algebraic integer, maximal modulus, reciprocal polynomial, primitive polynomial, Schinzel-Zassenhaus conjecture, Mahler measure, method of least squares, cyclotomic polynomialsDragan Stankov. "A necessary and sufficient condition for an algebraic integer to be a Salem number" in Journal de theorie des nombres de Bordeaux (2019). https://doi.org/10.5802/jtnb.1076
-
OntoLex Publication Made Easy: A Dataset of Verbal Aspectual Pairs for Bosnian, Croatian and Serbian
Ovaj rad predstavlja novi jezički resurs za pretraživanje i istraživanje verbalnih aspektnih parova u BCS (bosanskom, hrvatskom i srpskom), kreiran korišćenjem principa Lingvističkih Povezanih Otvorenih Podataka (LLOD). Pošto ne postoji resurs koji bi pomogao učenicima bosanskog, hrvatskog i srpskog kao stranih jezika da prepoznaju aspekt glagola ili njegove parove, kreirali smo novi resurs koji će korisnicima pružiti informacije o aspektu, kao i link ka aspektnim parovima glagola. Ovaj resurs takođe sadrži spoljne linkove ka monolingvalnim rečnicima, Wordnetu i BabelNetu. ...Ranka Stanković, Maxim Ionov, Medina Bajtarević, Lorena Ninčević. "OntoLex Publication Made Easy: A Dataset of Verbal Aspectual Pairs for Bosnian, Croatian and Serbian" in Proceedings of the 9th Workshop on Linked Data in Linguistics @ LREC-COLING 2024, Turin, 20-25 May 2024, ELRA and ICCL (2024)
-
NMR kinetic studies of the interactions between [Ru(terpy)(bipy)(H2O)]2+ and some sulfur-donor ligands
Aleksandar Mijatović, Biljana Šmit, Ana Rilak, Biljana Petrović, Dragan Čanović, Živadin D. Bugarčić (2012)Reakcije supstitucije monofunkcionalnog [Ru(terpi)(bipi)(H2O)]2+ kompleksa, gde je terpi = 2,2′:6′,2″-terpiridin i bipi = 2,2′-bipiridin, sa biološki relevantnim sumporo donorskim ligandima, kao što su tiourea, l-metionin, l-cistein i glutation, proučavane su u vodenim rastvorima tehnikom 1H i 13C NMR spektroskopije. Sve reakcije su proučavajne na pH 7,4, u prisustvu fosfatnog pufera, da bi se oponašalo fiziološko okruženje tokom procesa supstitucije. Reakcije se izvode na 295 K pod uslovima reakcija drugog reda. Uočeni rezultati pokazuju da brzina supstitucije snažno zavisi ...Aleksandar Mijatović, Biljana Šmit, Ana Rilak, Biljana Petrović, Dragan Čanović, Živadin D. Bugarčić. "NMR kinetic studies of the interactions between [Ru(terpy)(bipy)(H2O)]2+ and some sulfur-donor ligands" in Inorganica Chimica Acta, Elsevier BV (2012). https://doi.org/10.1016/j.ica.2012.09.016
-
The Nooj System as Module within an Integrated Language Processing Environment
... lemma cyelo 4. Textual resources management 4.1. Parallel Text Management The WS4LR module for management of aligned parallel texts uses texts which have previously been aligned using Xalign as an alignment tool (Bonhomme 2001). Parallel texts which usually originate from a text in one language ...
... the same principles. Figure 3. Form for viewing a NooJ dictionary entry As we have already mentioned, WS4LR handles besides bilingual word lists also multilingual dictionaries, such as Prolex, the multilingual dictionary of proper names based on an ontology built around the conceptual proper ...
... wordnet windows, thus offering to the user the possibility to work with one or two wordnets. If the user decides to work with two wordnets in parallel, he/she can always synchronize them via the ILI. The equivalent synsets in different languages are linked to the same Inter-Lingual Index (ILI) ...Ranka Stanković, Duško Vitas, Cvetana Krstev. "The Nooj System as Module within an Integrated Language Processing Environment" in Proceedings of the 2007 International Nooj Conference, Cambridge Scholars Publishing (2008)
-
Frequency and Length of Syllables in Serbian
Marija Radojičić, Biljana Lazić, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović, Ján Mačutek, Lívia Leššová (2019)Basic analyses of several properties of syllables (the rank-frequency distribution, the distribution of length, and the relation between length and frequency) in Serbian is presented. The syllabification algorithm used combines the maximum onset principle and the sonority hierarchy. Results indicate that syllables behave similarly to words as far as mathematical models are concerned, but values of parameters in models for syllables are quite different from those for words.... Karl-Heinz Best at http://wwwuser.gwdg.de/~kbest/litlist.htm and compare the number of entries for syllables and for words. 10 Needless to say, the lists of works mentioned here as examples is by no means exhaustive. Marija Radojičić, Biljana Lazić, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović ...
... Russian socialist realist novel “Kak zakalyalas’ stal’” (How the Steel Was Tempered) by N. Ostrovsky. The choice is motivated by the fact that a parallel corpus consisting of the first ten chapters of the novel and their translations to all standard Slavic languages (except for Lower Sorbian) is available ...
... were so far performed on one language only. In future, other Slavic languages and other aspects of syllables will be investigated. As there is a parallel corpus of Slavic languages available, properties of syllables can be used to construct a data-based typology of Slavic languages and to compare ...Marija Radojičić, Biljana Lazić, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović, Ján Mačutek, Lívia Leššová. "Frequency and Length of Syllables in Serbian" in Glottometrics (2019)
-
Application of VIКOR method in the selection of an optimal splution of excavation “Borska Reka” ore deposit
Sanja Bajić, Dragoljub Bajić, Branko Glušćević, Radmila Gaćina. "Application of VIКOR method in the selection of an optimal splution of excavation “Borska Reka” ore deposit" in 9th International Conference Mining and Environmental protection MEP, 24 – 27 May 2023, Sokobanja, Serbia , Belgrade : University of Belgrade-Faculty of mining and geology (2023)
-
Keyword Extraction from Parallel Abstracts of Scientific Publications
... descriptive statistics for 50 parallel abstracts in the Ser- bian and English language including the average value, the minimal and maximal number of words in rows for each category presented by columns. The first column is related to the numbers of words in the text, the KW count lists the number of keywords ...
... 03:24:53 Keyword Extraction from Parallel Abstracts of Scientific Publications Slobodan Beliga, Olivera Kitanović, Ranka Stanković, Sanda Martinčić-Ipšić Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Keyword Extraction from Parallel Abstracts of Scientific Publications ...
... from parallel abstracts of scientific publication in the Serbian and English languages. The keywords are extracted by a selectivity-based keyword extraction method. The method is based on the structural and statistical properties of text represented as a complex network. The constructed parallel corpus ...Slobodan Beliga, Olivera Kitanović, Ranka Stanković, Sanda Martinčić-Ipšić . "Keyword Extraction from Parallel Abstracts of Scientific Publications" in Sematic Keyword-Based Search on Structured Data Sources - Third International KEYSTONE Conference, IKC 2017 Gdańsk, Poland, September 11–12, 2017 Revised Selected Papers and COST Action IC1302 Reports, Springer (2017)
-
An intelligent hybrid system for surface coal mine safety analysis
Nikola Lilić, Ivan Obradović, Aleksandar Cvjetić. "An intelligent hybrid system for surface coal mine safety analysis" in Engineering Applications of Artificial Intelligence (2010)
-
Serbian NER&Beyond: The Archaic and the Modern Intertwinned
U ovom radu predstavljamo srpski književni korpus koji se razvija pod okriljem COST Akcije „Distant Reading for European Literary History” CA16204. Koristeći ovaj korpus romana napisanih pre više od jednog veka, razvili smo i učinili javno dostupnim Sistem za prepoznavanje imenovanih entiteta (NER) obučen da prepozna 7 različitih tipova imenovanih entiteta, sa konvolucionom neuronskom mrežom (CNN), koja ima F1 rezultat od ≈91% na test skupu podataka. Ovaj model je dalje ocenjen na posebnom skupu podataka za evaluaciju. Završavamo poređenje ...... event, semantic products and objects) with about 20 subcategories at three levels, disambiguated by CG-rules: known lexical entries and gazetteer lists, pattern-based name type prediction and context-based name type inference for unkno- 4Materials for the NER Training School, https://github.com/di ...Branislava Šandrih Todorović, Cvetana Krstev, Ranka Stanković, Milica Ikonić Nešić. "Serbian NER&Beyond: The Archaic and the Modern Intertwinned" in Proceedings of the Conference Recent Advances in Natural Language Processing - Deep Learning for Natural Language Processing Methods and Applications, INCOMA Ltd. Shoumen, BULGARIA (2021). https://doi.org/10.26615/978-954-452-072-4_141
-
Mechanosynthesis and structural characterization of nanocrystalline Ce1–Y O2– (x=0.1–0.35) solid solutions
Martin Fabián, Bratislav Antić, Vladimír Girman, Milica Vučinić-Vasić, Aleksandar Kremenović, Shigeru Suzuki, Horst Hahn, Vladimír Šepelák (2015)A series of nanostructuredfluorite-type Ce1–xYxO2–δ (0rxr0.35) solid solutions, prepared via highenergy milling of the CeO2/Y2O3mixtures, are investigated by XRD, HR-TEM, EDS and Raman spectroscopy. For thefirst time, complementary information on both the long-range and short-range structural features of mechanosynthesized Ce1–xYxO2–δ, obtained by Rietveld analysis of XRD data and Raman spectroscopy, is provided. The lattice parameters of the as-prepared solid solutions decrease with increasing yttrium content. Rietveld refinements of the XRD data reveal increase in microstrains in the host ceria ...Martin Fabián, Bratislav Antić, Vladimír Girman, Milica Vučinić-Vasić, Aleksandar Kremenović, Shigeru Suzuki, Horst Hahn, Vladimír Šepelák. "Mechanosynthesis and structural characterization of nanocrystalline Ce1–Y O2– (x=0.1–0.35) solid solutions" in Journal of Solid State Chemistry, Elsevier BV (2015). https://doi.org/10.1016/j.jssc.2015.06.027
-
The interaction of organoselenium trans-palladium(II) complexes toward small-biomolecules and CT-DNA
Serija organoselenijum trans-paladijum(II) kompleksa 1 (bis(2-(fenilselanilmetil)oksolan)dihloropaladijum(II)), 2 (bis(2-(fenilselanilmetil)oksan)dihloropaladijum(II)) i 3 (bis(2) ,2-dimetil-3-(fenilselanil)oksan)dihloropaladijum(II)) su korišćeni za ispitivanje reaktivnosti ovog specifičnog tipa kompleksa prema različitim bio-molekulima. Ovaj sistem je od posebnog interesa jer se jako malo zna o supstitucionim reakcijam organoselenijum paladijum(II) kompleksa sa trans konfiguracijom. Zamena koordinovanog hlorida sa serijom malih bio-molekula (l-Met, l-His, l-Cis, GSH i 5′-GMP) proučavana je pod uslovima pseudo-prvog reda kao funkcija koncentracije nukleofila i temperature korišćenjem tehnika zaustavljenog toka. Rezultati za proučavane komplekse ukazuju da ...Vera M. Divac, Aleksandar Mijatović, Marina D. Kostić, Jovana Bogojeski. "The interaction of organoselenium trans-palladium(II) complexes toward small-biomolecules and CT-DNA" in Inorganica Chimica Acta, Elsevier BV (2017). https://doi.org/10.1016/j.ica.2017.07.012
-
An Integrated Environment for Management and Exploitation of Linguistic Resources
Ranka Stanković, Ivan Obradović (2009)... existing synset in another language (for example English). In order to support this feature, the module provides access to bilingual, parallel word lists, which can help in translating synset literals from one lan- guage to another. This module also offers different options for data consis- ...
... connection with the Princeton wordnet, the first English wordnet, which is publicly available [3]. C. Parallel and aligned texts Although monolingual parallel texts exist, parallel texts are as a rule bilingual, composed of one original text and its translation into another language. Thus ...
... types of dictionaries, the Group is engaged in developing other resources, such as the e-corpus of Serbian, as well as parallel multilingual corpora, with the majority of parallel texts aligned. The resources have been developed during several decades in the framework of various projects ...Ranka Stanković, Ivan Obradović. "An Integrated Environment for Management and Exploitation of Linguistic Resources" in Proceedings of the International Multiconference on Computer Science and Information Technology, Computational Linguistics – Applications Workshop (CLA09), Mrągowo, Poland, October 2009, Piscataway : IEEE (2009)
-
Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian
The training of new tagger models for Serbian is primarily motivated by the enhancement of the existing tagset with the grammatical category of a gender. The harmonization of resources that were manually annotated within different projects over a long period of time was an important task, enabled by the development of tools that support partial automation. The supporting tools take into account different taggers and tagsets. This paper focuses on TreeTagger and spaCy taggers, and the annotation schema alignment ...... tailored to be ap- plicable for PoS-tagging in general is the Universal Part- of-Speech (UPoS) tagset (Petrov et al., 2012) (used by spaCy), and it lists the following 17 categories: adjective (ADJ), adposition (ADP), adverb (ADV), auxiliary (AUX), coordinating conjunction (CCONJ), determiner (DET) ...Ranka Stanković, Branislava Šandrih, Cvetana Krstev, Miloš Utvić, Mihailo Škorić. "Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian" in Proceedings of the 12th Language Resources and Evaluation Conference, May Year: 2020, Marseille, France, European Language Resources Association (2020)
-
On the distribution modulo 1 of the sum of powers of a Salem number
Dragan Stankov (2016)It is well known that the sequence of powers of a Salem number θ, modulo 1, is dense in the unit interval, but is not uniformly distributed. Generalizing a result of Dupain, we determine, explicitly, the repartition function of the sequence , where P is a polynomial with integer coefficients and θ is quartic. Also, we consider some examples to illustrate the method of determination.Algebraic integer, the house of algebraic integer, maximal modulus, reciprocal polynomial, primitive polynomial, Schinzel-Zassenhaus conjecture, Mahler measure, method of least squares, cyclotomic polynomialsDragan Stankov. "On the distribution modulo 1 of the sum of powers of a Salem number" in Comptes rendus Mathematique (2016). https://doi.org/10.1016/j.crma.2016.03.012
-
Multi-word Expressions for Abusive Speech Detection in Serbian
Ovaj rad predstavlja istraživanja na usavršavanju i unapređenju srpske verzije rečnika Hurtlex, višejezičnog leksikona uvredljivih reči. Posebnu pažnju posvećujemo dodavanju izraza sa više reči (polileksemskih jedinica) koji se mogu smatrati uvredljivim, jer su takvi leksički zapisi veoma važni za postizanje dobrih rezultata u mnoštvu zadataka otkrivanja uvredljivog jezika. Srpski morfološki rečnici se koriste kao osnova za čišćenje podataka i stvaranje rečnika. Istaknuta je veza sa drugim leksičkim i semantičkim resursima na srpskom jeziku i predviđena je izgradnja sistema za ...... and disgrace’ is mandatory to assure correct classification of tweets. In the next phases of the abusive words lexicon development, we plan to use: lists of slurs, abusive expressions, and courses built by conducting surveys and crowdsourcing (Mitrović et al., 2015), slang and dictionaries of synonyms ...Ranka Stanković, Jelena Mitrović, Danka Jokić, Cvetana Krstev. "Multi-word Expressions for Abusive Speech Detection in Serbian" in Proceedings of the Joint Workshop on Multiword Expressions and Electronic Lexicons, Association for Computational Linguistics (2020)
-
Towards Automatic Definition Extraction for Serbian
U radu su prikazani preliminarni rezultati automatske ekstrakcije kandidata za definicije rečnika iz nestrukturiranih tekstova na srpskom jeziku u cilju ubrzanja razvoja rečnika. Definicije u rečniku Srpske akademije nauka i umetnosti (SANU) korišćene su za modelovanje različitih tipova definicija (opisnih, gramatičkih, referentnih i sinonimskih) koje imaju različite sintaksičke i leksičke karakteristike. Korpus istraživanja sastoji se od 61.213 definicija imenica, koje su analizirane korišćenjem morfoloških e-rečnika i lokalnih gramatika implementiranih kao pretvarači konačnih stanja u paketu za obradu korpusa otvorenog ...... (Gortan-Premk 2014: 131–132). Combined definitions are used in large descriptive dictionaries - the descriptive part identifies the term directly and lists the elements of meaning, and the synonym refers to the semantic content indirectly (Gortan-Premk 1980: 111–112; Gortan-Premk 1983). The problem of ...Ranka Stanković, Cvetana Krstev, Rada Stijović, Mirjana Gočanin, Mihailo Škorić. "Towards Automatic Definition Extraction for Serbian" in Proceedings of the XIX EURALEX Congress of the European Assocition for Lexicography: Lexicography for Inclusion (Volume 2). 7-9 September (virtual), Democritus University of Thrace (2021)
-
Low-temperature phase transition and magnetic properties of K3YbSi2O7
Predrag Dabić, Volker Kahlenberg, Biljana Krüger, Marko Rodić, Sabina Kovač, Jovan Blanuša, Zvonko Jagličić, Ljiljana Karanović, Václav Petříček, Aleksandar Kremenović (2021)alkalni silikati elemenata retkih zemalja, fazni prelazi, magnetne karakteristike, razdvajanje kristalnog polja, silikati lantanica... hexagonal pyramid with a triple vertex, or as a truncated hexagonal pyramid with a regular hexagonal base upon which a parallel triangular face is situated [Figs. 1(a) and 2(a)]. This parallel triangular face (small equilateral triangle in the oxygen sheet with the 3.6.3.6 meshes) is a common triangular ...
... hexagonal pyramid with a triple vertex, or as a truncated hexagonal pyramid with a regular hexagonal base upon which a parallel triangular face is situated [Figs. 1(a) and 2(a)]. This parallel triangular face (small equilateral triangle in the oxygen sheet with the 3.6.3.6 meshes) is a common triangular face ...
... polymorph, the coordination polyhedron of K1 in �0-K3YbSi2O7 can also be described as a truncated hexagonal pyramid containing two parallel faces: a hexagonal base and a parallel triangular face at the top. K1 is located in the centre of a regular O6 hexagon formed by four O1 and two O2 atoms. The six K1—O ...Predrag Dabić, Volker Kahlenberg, Biljana Krüger, Marko Rodić, Sabina Kovač, Jovan Blanuša, Zvonko Jagličić, Ljiljana Karanović, Václav Petříček, Aleksandar Kremenović. "Low-temperature phase transition and magnetic properties of K3YbSi2O7" in Acta Crystallographica Section B Structural Science, Crystal Engineering and Materials, International Union of Crystallography (IUCr) (2021). https://doi.org/10.1107/S2052520621006077