Претрага
2357 items
-
Stratigraphy and microfossils (radiolarians and planktonic foraminifers) of the Upper Cretaceous (upper Santonian–lower Campanian) Struganik limestone (Western Serbia)
Liubov G. Bragina, Nikita Yu. Bragin, Ludmila F. Kopayevich, Nevenka Đerić, Nataša Gerzina Spajić (2019)Liubov G. Bragina, Nikita Yu. Bragin, Ludmila F. Kopayevich, Nevenka Đerić, Nataša Gerzina Spajić. "Stratigraphy and microfossils (radiolarians and planktonic foraminifers) of the Upper Cretaceous (upper Santonian–lower Campanian) Struganik limestone (Western Serbia)" in Palaeoworld, Elsevier BV (2019). https://doi.org/10.1016/j.palwor.2019.05.001
-
A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian
Uvredljivi govor na društvenim medijima, uključujući psovke, pogrdni govor i govor mržnje, dostigao je nivo pandemije. Sistem koji bi bio u stanju da detektuje takve tekstove mogao bi da pomogne da internet i društveni mediji postanu bolji virtuelni prostor sa više poštovanja. Istraživanja i komercijalna primena u ovoj oblasti do sada su bili fokusirani uglavnom na engleski jezik. Ovaj rad predstavlja rad na izgradnji AbCoSER-a, prvog korpusa uvredljivog govora na srpskom jeziku. Korpus se sastoji od 6.436 ručno označenih ...... their time and efforts to help us build AbCoSER v1.0 corpus of abusive speech in Serbian. 1 Introduction 1.1 Motivation and research background With the development of the Internet and the increasing use of online mass media and social networks, detection of inappropriate content and incitement to ...
... al. [50] and Fisher et al. [15] emphasize the type and the target of offensive speech. The OLID data set and the scheme proposed by Zampieri et al. [50], used also at the SemEval2019 and SemEval 2020 competitions, gained popularity among researchers leading to the production of Turkish and Danish data ...
... which leads to serious depression, and even suicide [12]. As far as Serbian law is concerned, any discrimination, endangering security, persecution, insults, and harassment on social networks are punishable [26, 4, 30]. Hate speech and flames are present in Serbian media and public discourse especially towards ...Danka Jokić, Ranka Stanković, Cvetana Krstev, Branislava Šandrih. "A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian" in 3rd Conference on Language, Data and Knowledge (LDK 2021), MDPI AG (2021). https://doi.org/10.4230/OASIcs.LDK.2021.13
-
Resource-based WordNet Augmentation and Enrichment
In this paper we present an approach to support production of synsets for SerbianWordNet(SerWN)byadjustingPrincetonWordNet(PWN)synsetsusing several bilingual English-Serbian resources. PWN synset definitions were automatically translated and post-edited, if needed, while candidate literals for Serbian synsets were obtained automatically from a list of translational equivalents compiled form bilingual resources. Preliminary results obtained from a setof1248selectedPWNsynsetsshowthattheproducedSerbiansynsetscontain 4024 literals, out of which 2278 were offered by the system we present in this paper, whereas experts added the remaining 1746. Approximately one half of ...... Cristea, D., and Stamou, S. (2004). Balkanet: Aims, methods, results and perspectives. a general overview. Romanian Journal of Information science and technology, 7(1-2):9–43. Vintar, Š. and Fišer, D. (2017). Enriching Slovene wordnet with domain-specific terms. Annotation, exploitation and evaluation ...
... Faculty of Mining and Geology archives faculty publications available in open access, as well as the employees' publications. - The Repository is available at: www.dr.rgf.bg.ac.rs Resource based WordNet augmentation and enrichment Ranka Stanković and Ivan Obradović Faculty of Mining and Geology University ...
... Different methods and resources can be used for alignment. One of the common approaches is to take PWN as the source for alignment, and a bilingual dictionary of English and the target language. There are, however, several other approaches. In (Chugur et al., 2001) a monolingual and a bilingual Sp ...Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev. "Resource-based WordNet Augmentation and Enrichment" in Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria, Sofia : The Institute for Bulgarian Language Prof. Lyubomir Andreychin, Bulgarian Academy of Sciences (2018)
-
Bilingual lexical extraction based on word alignment for improving corpus search
Jelena Andonovski, Branislava Šandrih, Olivera Kitanović. "Bilingual lexical extraction based on word alignment for improving corpus search" in The Electronic Library, Emerald (2019). https://doi.org/10.1108/EL-03-2019-0056
-
A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment
Sina Ahmadi, John P McCrae, Sanni Nimb, Fahad Khan, Monica Monachini, Bolette S Pedersen, Thierry Declerck, Tanja Wissik, Andrea Bellandi, Irene Pisani, [...] Ranka Stanković and others (2020)Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography. In this paper, we describe our efforts in manually aligning monolingual dictionaries. The alignment is carried out at sense-level for various resources in 15 languages. Moreover, senses are annotated with possible semantic relationships such as broadness, narrowness, relatedness, and equivalence. In comparison to previous datasets for this task, this dataset covers a wide range of languages ...... 2005; Ponzetto and Navigli, 2010; Niemann and Gurevych, 2011; Mc- Crae, 2018)), with the Longman Dictionary of Contempo- rary English and with Roget’s thesaurus (Kwong, 1998), with Wiktionary3 (Meyer and Gurevych, 2011) or with the Oxford Dictionary of English (Navigli, 2006). Meyer and Gurevych (2011) ...
... sense representations and decrease sense granularity (Miller, 2016). Miller and Gurevych (2014) describe a technique for constructing an n-way alignment of LSRs and applied it to the produc- tion of a three-way alignment of the English WordNet, Wikipedia and Wiktionary. Niemann and Gurevych (2011) propose ...
... References Ahmadi, S., Arcan, M., and McCrae, J. (2018). On lex- icographical networks. In Workshop on eLexicography: Between Digital Humanities and Artificial Intelligence. Burgun, A. and Bodenreider, O. (2001). Comparing terms, concepts and semantic classes in WordNet and the Uni- fied Medical Language ...Sina Ahmadi, John P McCrae, Sanni Nimb, Fahad Khan, Monica Monachini, Bolette S Pedersen, Thierry Declerck, Tanja Wissik, Andrea Bellandi, Irene Pisani, [...] Ranka Stanković and others . "A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment" in Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), Marseille, European Language Resources Association (ELRA) (2020)
-
Keyword-Based Search on Bilingual Digital Libraries
This paper outlines the main features of Biblisha, a tool that offers various possibilities of enhancing queries submitted to large collections of aligned parallel text residing in bilingual digital library. Biblishsa supports keyword queries as an intuitive way of specifying information needs. The keyword queries initiated, in Serbian or English, can be expanded, both semantically, morphologically and in other language, using different supporting monolingual and bilingual resources. Terminological and lexical resources are of various types, such as wordnets, electronic ...Ranka Stanković, Cvetana Krstev, Duško Vitas, Nikola Vulović, Olivera Kitanović. "Keyword-Based Search on Bilingual Digital Libraries" in Semantic Keyword-Based Search on Structured Data Sources - Second COST Action IC1302 International KEYSTONE Conference, IKC 2016, Springer (2017). https://doi.org/10.1007/978-3-319-53640-8_10
-
Molecular and Isotope Composition of Biomarkers in Immature Oil Shale and its Liquid Pyrolysis Products (Open and Closed System)
Gordana Gajica, Aleksandra Šajnovic, Jan Schwarzbauer, Aleksandar Kostić, Branimir Jovančićević, Ksenija Stojanović (2021)The molecular and isotopic composition of biomarkers in initial bitumen isolated from raw immature oil shale samples from the Lower Miocene Aleksinac Basin (Serbia) and liquid products (LPs) obtained by pyrolysis in open (OS) and closed systems (CS) are studied. The influence of pyrolysis type and variations of kerogen type on biomarkers composition and their isotopic signatures in LPs is determined. The molecular composition of the LPs from the OS pyrolysis is very similar to those in initial bitumen, independently ...... from the OS and catagentic stage for LPs from the CS (the calculated vitrinite reflectance is 0.76 and 0.92 for samples D13 and D16, respectively). The isotope δ13C values of individual n-alkanes range between –30.2 and –33.8 ‰ with average value of –31.9 ‰ in the sample D13 and from –28.7 to ...
... bitumen isolated from raw immature oil shale samples and liquid products (LPs) obtained by pyrolysis in open (OS) and closed systems (CS) are studied. The influence of pyrolysis type and variations of kerogen type on biomarkers composition and their isotopic signatures in LPs is determined. Pyrolysis ...
... displayed notably more mature sterane and hopane distributions, which are characterized by presence of thermodynamically more stable ααα(S), αββ(S) and αββ(R) steranes, βα- and αβ-diasteranes, neohopanes (C27Ts, C29Ts), the prevalence of αβ- over βα- hopane isomers, and dominance of 22S- relative to 22R ...Gordana Gajica, Aleksandra Šajnovic, Jan Schwarzbauer, Aleksandar Kostić, Branimir Jovančićević, Ksenija Stojanović. "Molecular and Isotope Composition of Biomarkers in Immature Oil Shale and its Liquid Pyrolysis Products (Open and Closed System)" in 30th International Meeting on Organic Geochemistry (IMOG 2021), European Association of Geoscientists & Engineers (2021). https://doi.org/10.3997/2214-4609.202134040
-
Development Of The Serbian Geological Resources Portal
... with keeping track of exploration and exploitation permits and works in the mineral resources field, and represents a basis for archiv- ing and efficient handling of vector, raster and related thematic alphanumeric content in one place, as well as efficient management and usage of mineral resources.8 Google ...
... archiving, query, retrieving, analysis and visualization of geologi- cal data. The development and implementation of GeolISS is managed by a team from the Faculty of Mining and Geology at the University of Belgrade (FMG) and funded by the Ministry of the Environment, Mining and Spatial Planning (MEMSP). The main ...
... archiving, query, retrieving, analysis and visualization of geological data. The development and implementa- tion of GeolISS is managed by a team from the Faculty of Mining and Geology at the University of Belgrade (FMG). Following the development of a geodatabase in ArcSDE and an ArcMap extension for data manage- ...Ranka Stanković, Jelena Prodanović, Olivera Kitanović, Velizar Nikolić. "Development Of The Serbian Geological Resources Portal" in Proceedings of the 17th Meeting of the Association of European Geological Societies, Belgrade, Serbia : The Serbian Geological Society (2011)
-
Availability as a dimension of energy security in the Republic of Serbia
Boban Pavlović, Dejan Ivezić (2016)There is a range of modern approaches and models in literature for the evaluation and determination of energy security, which are based on different parameters and indicators. For most of them, a common characteristic is emphasizing the avail ability of energy, as an important dimension for ensuring energy security. In this paper, concise overview of Serbian energy sectors is given and appropriate energy indicators are defined and determined. Selected indicators provide insight into the main components that characterize availability ...... OF SERBIA by Boban S. PAVLOVIĆ* and Dejan D. IVEZIĆ Faculty of Mining and Geology, University of Belgrade, Belgrade, Serbia Original scientific paper DOI:10.2298/TSCI160923303P There is a range of modern approaches and models in literature for the evaluation and determination of energy security ...
... diversification and energy supplier diversifica- tion, – the accessibility to resources, in terms of the availability of related energy infrastructure and energy transportation infrastructure, and – geopolitical relations. In order to provide a framework for identifying, measuring and managing v ...
... multi-dimensional concept of energy security, and from the selection of critical dimensions for functioning of the energy system and its safety. In APERC report on energy security in Asia [7], availability and affordability are merged with acceptability and accessibility, as four main dimensions of energy ...Boban Pavlović, Dejan Ivezić. "Availability as a dimension of energy security in the Republic of Serbia" in Thermal Science, National Library of Serbia (2016). https://doi.org/10.2298/TSCI160923303P
-
Asbestos-Based Pottery from Corsica: The First Fiber-Reinforced Ceramic Matrix Composite
Asbestos-containing pottery shards collected in the northeast of Corsica (Cap Corse) and dating from the 19th century, or earlier, have been analyzed by SEM-EDS, XRPD, FTIR and Raman microspectroscopy. Blue (crocidolite) and white (chrysotile) asbestos fiber bundles are observed in cross-sections. Most of the asbestos is partly or totally dehydroxylated, and some transformation to forsterite is observed to occur, indicative of a firing above 800 C. Examination of freshly fractured pieces shows a nonbrittle fracture with fiber pull-out, consistent with ...... ine are found in samples a, b and d, while albite is found only in samples a and b. Crocidolite is found in samples b and d, while tremolite is found in samples b and c. Diopside and orthopyroxene are found in samples b and c, while talc is found in samples c and d. Minor abundant mineral phases ...
... by the turning: movement and pressure of the hands. The diameters of the individual fibers range between about 100 nm and 1 hm, and the length can reach a few centimeters. The fibers form heaps and clumps, and big grains are visible (see, e.g., Figure l1a, spots 3, 4,8 and 6). The red color of the ...
... 1080) and 620 cm-! (Figure 3), are characteristic of asbestos and similar compounds (amphiboles and pyroxenes) [14-21]. From this first analysis, it can be seen that the samples b and d on the one hand and the samples a and c on the other hand are rather similar, but sample d appears to be more h ...Philippe Colomban, Aleksandar Kremenović. "Asbestos-Based Pottery from Corsica: The First Fiber-Reinforced Ceramic Matrix Composite" in Materials, MDPI AG (2020). https://doi.org/10.3390/ma13163597
-
A Tool for Enhanced Search of Multilingual Digital Libraries of E-journals
This paper outlines the main features of Bibliša, a tool that offers various possibilities of enhancing queries submitted to large collections of TMX documents generated from aligned parallel articles residing in multilingual digital libraries of e-journals. The queries initiated by a simple or multiword keyword, in Serbian or English, can be expanded by Bibliša, both semantically and morphologically, using different supporting monolingual and multilingual resources, such as wordnets and electronic dictionaries. The tool operates within a complex system composed ...... Serbian and English version of journal articles were manually preprocessed and then automatically aligned, and the alignments manually corrected using ACIDE. Table 1 shows the total, minimum, maximum and average length of articles (in words and sentences), given separately for Serbian and English ...
... finite automata and transducers, these dictionaries represent the basis for morphological expansion of queries. As for semantic and bilingual expansion, the system relies on wordnets (Serbian and English at present) and a bilingual English/Serbian dictionary of Library and Information Science ...
... online, and the Dictionary of librarianship is selected as the resource for semantic and bilingual expansion, on-line will be added as another English keyword, and onlajn and u mreži as Serbian keywords. A concurrent search with these two lists will reveal that both English terms: online and on-line ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Aleksandra Trtovac, Miloš Utvić. "A Tool for Enhanced Search of Multilingual Digital Libraries of E-journals" in Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012, May 2012, Istanbul, Turkey, Istanbul, Turkey : European Language Resources Association (2012)
-
Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian
The training of new tagger models for Serbian is primarily motivated by the enhancement of the existing tagset with the grammatical category of a gender. The harmonization of resources that were manually annotated within different projects over a long period of time was an important task, enabled by the development of tools that support partial automation. The supporting tools take into account different taggers and tagsets. This paper focuses on TreeTagger and spaCy taggers, and the annotation schema alignment ...... rule could be formulated and used. Our aim to include gram- matical categories of comparative degree (for adjectives) and gender (for nouns, adjectives, some forms of verbs and some types of pronouns and numbers) into tagger models required an update of the training corpus and addition of re- spective ...
... automation. The supporting tools take into account different taggers and tagsets. This paper focuses on TreeTagger and spaCy taggers, and the annotation schema alignment between Serbian morphological dictionaries, MULTEXT-East and Universal Part-of-Speech tagset. The trained models will be used to publish ...
... (morphological, semantic and syntactic) rules, machine learning methods (Giménez and Màrquez, 2004; Denis and Sagot, 2009; Manning et al., 2014) or state-of-the-art Deep Neural Networks (DNNs) (Huang et al., 2015; Choi, 2016; Akbik et al., 2018). The first operational tagger and lemmatization model ...Ranka Stanković, Branislava Šandrih, Cvetana Krstev, Miloš Utvić, Mihailo Škorić. "Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian" in Proceedings of the 12th Language Resources and Evaluation Conference, May Year: 2020, Marseille, France, European Language Resources Association (2020)
-
Results of Recent Monitoring Activities on Landslide Umka, Belgrade, Serbia—IPL 181
Biljana Abolmasov, Uroš Đurić, Jovan Popović, Marko Pejić, Mileva Samardžić Petrović, Nenad Brodić (2021)... such names are exempt from the relevant protective laws and regulations and therefore free for general use. The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher ...
... measure- ments, geodetic benchmark survey monitoring, UAV imaging, processing and analysis, and PSInSAR data processing and analysis. Results of all monitoring activ- ities were analysed and used for cross-correlation and for verification of monitoring results obtained from different techniques. ...
... photogrammetric processing and analysis, and PSInSAR data processing and analysis. The main goals for introducing new monitoring techniques were: (1) to increase the number of surface monitoring points, (2) to test accuracy of existing and newly introduced monitoring techniques and (3) to compare monitoring ...Biljana Abolmasov, Uroš Đurić, Jovan Popović, Marko Pejić, Mileva Samardžić Petrović, Nenad Brodić. "Results of Recent Monitoring Activities on Landslide Umka, Belgrade, Serbia—IPL 181" in Understanding and Reducing Landslide Disaster Risk. WLF 2020. ICL Contribution to Landslide Disaster Risk Reduction, Springer, Cham (2021). https://doi.org/10.1007/978-3-030-60196-6_14
-
Microstructural and magnetic properties of electrospun hematite/cuprospinel composites
Phase composition, microstructural and magnetic properties of electrospun hematite/cuprospinel composites were investigated. Samples were synthesized starting with 0 to 10 mol% of copper relative to iron. The round shape of reference electrospun fbres was preserved upon their heating up to 600 °C in air, whereas at 700 °C hollow substructure was additionally formed. In these reference samples the presence of hematite phase was detected by XRPD. A small amount (traces) of Fe3O4 /γ-Fe2O3 was also found, due to the ...Electrical and Electronic Engineering, Condensed Matter Physics, Atomic and Molecular Physics and Optics, Electronic, Optical and Magnetic MaterialsMira Ristić, Aleksandar Kremenović, Michael Reissner, Željka Petrović, Svetozar Musić. "Microstructural and magnetic properties of electrospun hematite/cuprospinel composites" in Journal of Materials Science: Materials in Electronics, Springer Science and Business Media LLC (2020). https://doi.org/10.1007/s10854-020-03526-0
-
Simple 2D gravity–density inversion for the modeling of the basin basement: example from the Banat area, Serbia
We have developed a technique to calculate lateral density distribution of the sedimentary basin basement by combining linear gravity–density inversion and 2D forward modeling. The procedure requires gravity anomaly data, depth-to-basement data and density data for the sediments (density–depth distribution). Gravity efect of density variations in the basement was extracted from the total gravity anomaly by removing the joint efect of the sediments with vertical density variations and homogeneous basement of average density contrast (calculated by 2D modeling). Gravity ...Ivana Vasiljević, Snežana Ignjatović, Dragana Đurić. "Simple 2D gravity–density inversion for the modeling of the basin basement: example from the Banat area, Serbia" in Acta Geophysica, Springer (2019). https://doi.org/10.1007/s11600-019-00328-9
-
Multi-word Expressions for Abusive Speech Detection in Serbian
Ovaj rad predstavlja istraživanja na usavršavanju i unapređenju srpske verzije rečnika Hurtlex, višejezičnog leksikona uvredljivih reči. Posebnu pažnju posvećujemo dodavanju izraza sa više reči (polileksemskih jedinica) koji se mogu smatrati uvredljivim, jer su takvi leksički zapisi veoma važni za postizanje dobrih rezultata u mnoštvu zadataka otkrivanja uvredljivog jezika. Srpski morfološki rečnici se koriste kao osnova za čišćenje podataka i stvaranje rečnika. Istaknuta je veza sa drugim leksičkim i semantičkim resursima na srpskom jeziku i predviđena je izgradnja sistema za ...... racial, national, and religious hate speech detection adopted by Gitari et al. (2015) was based solely on the usage of lexicon and rules. They used semantics and subjectivity features – polarity, intensity, and subjectivity level of words, using the domain corpus of hateful content and Subjectivity lexicon ...
... sentimental words and expressions, and SentiWordNet (Gitari et al., 2015), where it is assumed that abusive language consists of words indicating negative polarity of feelings, (3) list of offensive words and expressions (Bassignana et al., 2018) and (Hatebase.org), whether made by experts and/or obtained ...
... extensive use of online media and the Internet in gen- eral. The concept of abusive speech, as an umbrella term for phenomena such as offensive and hate speech, its content and forms of expression are analysed, trying to define its vocabulary, collocations, colloquial expressions, and context. Starting from ...Ranka Stanković, Jelena Mitrović, Danka Jokić, Cvetana Krstev. "Multi-word Expressions for Abusive Speech Detection in Serbian" in Proceedings of the Joint Workshop on Multiword Expressions and Electronic Lexicons, Association for Computational Linguistics (2020)
-
Antioxidant and antimicrobial activity of some tetradentate Schiff bases and their Cu (II) complexes
Aleksandar Mijatović, Milan Nikolić, Snežana Spasić, Aleksandra N. Žerađanin, Kristina Joksimović, Aleksandar Lolić, Rada Baošić (2020)Šifove baze i njihovi Cu(II) kompleksi su poznati po svojoj biološkoj aktivnosti. U ovom radu proučavana je antibakterijska aktivnost protiv gram-negativnih sojeva Escherichia coli, Pseudomonas aeruginosa i Staphilococcus piogenes, kao i Gram-pozitivnih Staphilococcus piogenes i Pseudomonas aeruginosa, zajedno sa antifungalnim dejstvom protiv Musperg strainilusa, A. Takođe, tehnički jednostavni i brzi testovi kao što su ABTS, HORAC i ORAC korišćeni su za ispitivanje antioksidativne aktivnosti kako bi se uporedili dobijeni rezultati sa različitim tipovima testova. Uprkos tome što je princip ...Aleksandar Mijatović, Milan Nikolić, Snežana Spasić, Aleksandra N. Žerađanin, Kristina Joksimović, Aleksandar Lolić, Rada Baošić. "Antioxidant and antimicrobial activity of some tetradentate Schiff bases and their Cu (II) complexes" in XIII Conference of Chemists, Technologists and Environmentalists of Republic of Srpska, University in Banjaluka, Faculty of Technology (2020)
-
Bridging Computational Lexicography and Corpus Linguistics: A Query Extension for OntoLex-FrAC
OntoLex, dominantni standard zajednice za mašinski čitljive leksičke resurse u kontekstu RDF-a, Linked Data i tehnologija Semantičkog veba, trenutno se proširuje sa posebnim modulom za Frekvencije, Primere i Informacije zasnovane na Korpusu (OntoLex-FrAC). Predlažemo novi komponent za OntoLex-FrAC, koji se bavi inkorporacijom korpusnih upita za (a) povezivanje rečnika sa korpusnim mašinama, (b) omogućavanje RDF baziranih web servisa da dinamički razmenjuju korpusne upite i podatke odgovora, i (c) korišćenje konvencionalnih upitačkih jezika za formalizaciju unutrašnje strukture kolokacija, skica reči i ...standardizacija, digitalna leksikografija, OntoLex, upiti korpusa, povezani podaci, Lingvistički povezani otvoreni podaciChristian Chiarcos, Ranka Stanković, Maxim Ionov, Gilles Sérasset. "Bridging Computational Lexicography and Corpus Linguistics: A Query Extension for OntoLex-FrAC" in Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Turin, 20-25 May 2024, LREC (2024)
-
A Lexical Approach to Acronyms and their Definitions
In this paper we present a comprehensive approach to acronyms for Natural-Language Processing (NLP) of Serbian texts. The proposed procedure includes extraction of acronyms and their definitions that are usual Multi-Word Units (MWUs), shallow parsing of MWUs that enables MWU lemmatization and production of entries in morphological electronic dictionaries, both for MWU and acronyms, that are provided with grammatical, syntactic, semantic and domain information. This approach enables representation that reflects complex relations between acronyms and their definitions.... texts (57% daily, 8% weekly and 5% monthly newspapers) and 6% of monographs and textbooks (Krstev and Vitas, 2005), which are types of texts that tend to use acronyms and pro- vide definitions. Besides that we used two more samples of newspaper texts (having 600 thousand and 1.200 thousand simple word ...
... Electronic Communication and Postal Services’. Moreover, in many cases a relation between an entity and its name and acronym is not one-to-one. The name can change in time and some shortened variants can be in use, translated names can exhibit serious variations in used lexica and syntactic forms, original ...
... of Education and Science under grants 47003 and 178003. 6. References Jacobs, K., A. Itai, and S. Wintner, 2014. Acronym Dic- tionary Construction and Disambiguation (abstract). 3rd Parseme General Meeting – Poster Session. Krstev, C., R. Stanković, I. Obradović, D. Vitas, and M. Utvić, 2010 ...Cvetana Krstev, Duško Vitas, Ranka Stanković. "A Lexical Approach to Acronyms and their Definitions" in Proceedings of the 7th Language & Technology Conference, November 27-29, 2015, Poznań, Poland, Springer (2015)
-
Terminology Acquisition and Description Using Lexical Resources and Local Grammars
Acquisition of new terminology from specific domains and its adequate description within terminological dictionaries is a complex task, especially for languages that are morphologically complex such as Serbian. In this paper we present an approach to solving this task semi-automatically on basis of lexical resources and local grammars developed for Serbian. Special attention is given to automatic inflectional class prediction for simple adjectives and nouns and the use of syntactic graphs for extraction of Multi-Word Unit (MWU) candidates for ...... optics, medicine, physics and mathematics, psy- chology) showed that 97% of multi-words in these sources consist of nouns and adjectives only, and more than 99% consist only of nouns, adjectives, and a preposition. (Justeson & Katz, 1995) Identifying the adjectives and the preposi- tional phrase ...
... Acquisition and Description Using Lexical Resources and Local Grammars Cvetana Krstev, Ranka Stanković, Ivan Obradović, Biljana Lazić Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Terminology Acquisition and Description Using Lexical Resources and Local Grammars ...
... dictionary of prefixes and the remainder of the lemma is a word in DELAS, then the lemma is the inflectional class of the corresponding DELAS word is assigned to the lemma. 4. For thresholds 80 and less steps 1 and 2 only are repeated. From a sample of domain texts and dictionar- ies we manually ...Cvetana Krstev, Ranka Stanković, Ivan Obradović, Biljana Lazić. "Terminology Acquisition and Description Using Lexical Resources and Local Grammars" in Proceedings of the 11th Conference on Terminology and Artificial Intelligence, Granada, Spain, 2015, Granada : LexiCon (Universidad de Granada) (2015)