Претрага
1491 items
-
An Approach to Efficient Processing of Multi-Word Units
Efficient processing of Multi-Word Units in the course of development of morphological MWU dictionaries is not easy to achieve, especially when languages with complex morphological structures are concerned, such as Serbian. Manual development of this type of dictionaries is a tedious and extremely slow process. To alleviate this problem we turned to our multipurpose software tool, dubbed LeXimir, in the production of lemmas for e-dictionaries of multi-word units. In addition to that, we developed a procedure aimed at making ...... the form of a XML document our tool LeXimir relies on W3C standard languages Xquery and XSLT supported by .Net. The user interface for automatic production of DELAC lemmas is very straightforward and easy to use. A user can choose a file with a prepared list of MWUs and a file with a strategy, and ...
... star’ and Farenhajtov stepen ‘Fahrenheit degree’ consist of an adjective followed by a noun, which in Serbian is the natural order of an adjective and a noun in a MWU. However, these MWUs, together with a few more allow a reverse order as well — zvezda petokraka and stepen Farenhajtov. Both MWUs can be ...
... analogy with entries in a dictionary of simple word lemmas, an entry in a DELAC dictionary consists of a MWU lemma to which a name of an inflectional transducer (similar to the one represented in Figure 1) is assigned. Similarity ends here, because simple word constituents of a MWU lemma also have to ...Cvetana Krstev, Ivan Obradović, Ranka Stanković, Duško Vitas. "An Approach to Efficient Processing of Multi-Word Units" in Computational Linguistics - Applications, Studies in Computational Intelligence 458 no. 458, Berlin Heidelberg : Springer-Verlag (2013): 109-129. https://doi.org/10.1007/978-3-642-34399-5_6
-
Wordnet Development Using a Multifunctional Tool
Ivan Obradović, Ranka Stanković (2007)In this paper we present a multifunctional tool for manipulating heterogeneous language resources. The tool handles electronic dictionaries, wordnets and aligned texts, and provides for their synchronous use in various tasks. We focus here on the description of the possibilities this tool offers in the development of wordnets. Besides the wordnet module which enables parallel handling of two wordnets, other modules, such as the module for morphological dictionaries and the module for aligned texts, as well as available finite ...... and the top-ontology accepted in EuroWordNet, and aligned by using ILI. From a lexicographer’s point of view, the development of a wordnet, perceived as a specific form of dictionary and hierarchical thesaurus for a particular language, opens two critical issues. The first pertains to the or ...
... synset in the other language”. This option creates a copy of the PWN synset in the Serbian wordnet, and if necessary, it also creates copies of all its missing hypernyms, to prevent the new synset of becoming a “dangling” synset. Once a copy of a synset is created the user has to make the necessary ...
... WS4LR this can be realized by creating a regular expression or FST graph from one or more words, and using it to search a text in the target language (Figure 9). In the case when the user is working on a particular set of synsets, e.g. on adding concepts to a certain domain, he/she might find ...Ivan Obradović, Ranka Stanković. "Wordnet Development Using a Multifunctional Tool" in Proceedings of the International Workshop Computer Aided Language Processing (CALP) '2007, Borovets, Bulgaria, September 2007, - (2007)
-
Using Lexical Resources for Irony and Sarcasm Classification
The paper presents a language dependent model for classification of statements into ironic and non-ironic. The model uses various language resources: morphological dictionaries, sentiment lexicon, lexicon of markers and a WordNet based ontology. This approach uses various features: antonymous pairs obtained using the reasoning rules over the Serbian WordNet ontology (R), antonymous pairs in which one member has positive sentiment polarity (PPR), polarity of positive sentiment words (PSP), ordered sequence of sentiment tags (OSA), Part-of-Speech tags of words (POS) ...... "[rule1: (?a swn:synonym ?b) (?b near_antonym: ?c) -> (?a swn:irony ?c)]" "[rule2: (?a swn:similar_to ?b) (?b near_antonym: ?c) -> (?a swn:irony ?c)]" "[rule3: (?a swn:also_see ?b) (?b near_antonym: ?c) -> (?a swn:irony ?c)]" "[rule4: (?a swn:verb_group ?b) (?b near_antonym: ?c) -> (?a swn:irony ...
... "[rule21: (?a swn:synonym ?b)(?b swn:similar_to ?c) (?c near_antonym:?d)->(?a swn:irony ?d)]" "[rule31: (?a swn:synonym ?b)(?b swn:also_see ?c) (?c near_antonym:?d)->(?a swn:irony ?d)]" "[rule41: (?a swn:synonym ?b)(?b swn:verb_group ?c) (?c near_antonym:?d)->(?a swn:irony ?d)]" "[rule51: (?a swn:synonym ...
... c near_antonym:?d)->(?a swn:irony ?d)]" "[rule61: (?a swn:synonym ?b)(?b swn:hyponym ?c) (?c near_antonym:?d)->(?a swn:irony ?d)]" In the same way the rules can be expanded with a mutual combi- nation of all given relations, where the number of relations in a rule goes up to a maximum of all six relations ...Miljana Mladenović, Cvetana Krstev, Jelena Mitrović, Ranka Stanković. "Using Lexical Resources for Irony and Sarcasm Classification" in Proceedings of the 8th Balkan Conference in Informatics (BCI '17), New York, NY, USA, : ACM (2017). https://doi.org/
-
Low-temperature phase transition and magnetic properties of K3YbSi2O7
Predrag Dabić, Volker Kahlenberg, Biljana Krüger, Marko Rodić, Sabina Kovač, Jovan Blanuša, Zvonko Jagličić, Ljiljana Karanović, Václav Petříček, Aleksandar Kremenović (2021)alkalni silikati elemenata retkih zemalja, fazni prelazi, magnetne karakteristike, razdvajanje kristalnog polja, silikati lantanica... about the potassium cations K1+ can be described as a non-distorted hexagonal pyramid with a triple vertex, or as a truncated hexagonal pyramid with a regular hexagonal base upon which a parallel triangular face is situated [Figs. 1(a) and 2(a)]. This parallel triangular face (small equilateral ...
... polyhedron about the potassium cations K1* can be described as a non-distorted hexagonal pyramid with a triple vertex, or as a truncated hexagonal pyramid with a regular hexagonal base upon which a parallel triangular face is situated [Figs. 1(a) and 2(a)]. This parallel triangular face (small equilateral triangle ...
... violet, O: red. electronic reprint research papers close to the value of 2.268 A (0.868 A+14 A) calculated from the ionic radii (Shannon, 1976). The shortest distance between two Yb sites (~5.7 A = a) is close to the sum 5.6 A of van der Waals radii (Alvarez, 2013). The YbO, octahedra are connected by K10¢ ...Predrag Dabić, Volker Kahlenberg, Biljana Krüger, Marko Rodić, Sabina Kovač, Jovan Blanuša, Zvonko Jagličić, Ljiljana Karanović, Václav Petříček, Aleksandar Kremenović. "Low-temperature phase transition and magnetic properties of K3YbSi2O7" in Acta Crystallographica Section B Structural Science, Crystal Engineering and Materials, International Union of Crystallography (IUCr) (2021). https://doi.org/10.1107/S2052520621006077
-
The Reciprocal Algebraic Integers Having Small House
Dragan Stankov (2021)Algebraic integer, the house of algebraic integer, maximal modulus, reciprocal polynomial, primitive polynomial, Schinzel-Zassenhaus conjecture, Mahler measure, method of least squares, cyclotomic polynomialsDragan Stankov. "The Reciprocal Algebraic Integers Having Small House" in Experimental Mathematics (2021). https://doi.org/ 10.1080/10586458.2021.1982425
-
The Nooj System as Module within an Integrated Language Processing Environment
... we mean the techniques in which a query, serving as input to a document retriever, is transformed in some way in order to improve the performance of document retrieval. Namely, a search by a concept instead of a search by a single word form is recognized as a very important new direction in ...
... format, but also in textual, XML, tabular or TMX format. Figure 7. The part of a TMX document 4.2. Query Expansion Query expansion is a name given to a class of techniques in which a query serving as input to a document retriever is evolved in some way with the intent to improve the document ...
... function. It has also been shown how WS4LR can be applied to query expansion. Without such a tool the results of a query specified by a user in the search of a specific concept would be reduced to only a fraction of the results obtained by query expansion. Although WS4LR has been used mainly for ...Ranka Stanković, Duško Vitas, Cvetana Krstev. "The Nooj System as Module within an Integrated Language Processing Environment" in Proceedings of the 2007 International Nooj Conference, Cambridge Scholars Publishing (2008)
-
Terminology Acquisition and Description Using Lexical Resources and Local Grammars
Acquisition of new terminology from specific domains and its adequate description within terminological dictionaries is a complex task, especially for languages that are morphologically complex such as Serbian. In this paper we present an approach to solving this task semi-automatically on basis of lexical resources and local grammars developed for Serbian. Special attention is given to automatic inflectional class prediction for simple adjectives and nouns and the use of syntactic graphs for extraction of Multi-Word Unit (MWU) candidates for ...... an adjective (A) fol- lowed by a noun (N), where the two components agree in gender, number, case and animateness. In class names X stands for a component that does not inflect when a MWU inflects or for a component separator. In the case of AXN, X stands for the separator, usually a space. Some- ...
... AXN – an adjective followed by a noun; the adjective and the noun have to agree in all four grammatical categories; e.g. zemni gas ‘natural gas’. 2. 2XN – a noun preceded by a word that does not inflect in the MWU. Usually it is a word used only in one or a few MWUs, a prefix or an adverb derived from ...
... possible: a. NNgi - A noun followed by two adjec- tives/nouns in the genitive case or in the in- strumental case; e.g. otkopavanje širokim čelom ‘broad forehead excavation’. b. NprepNp - A noun followed by a preposi- tional phrase; e.g. lanac sa grabuljama ‘chain with a rake’. 5. AXN2X – a noun preceded ...Cvetana Krstev, Ranka Stanković, Ivan Obradović, Biljana Lazić. "Terminology Acquisition and Description Using Lexical Resources and Local Grammars" in Proceedings of the 11th Conference on Terminology and Artificial Intelligence, Granada, Spain, 2015, Granada : LexiCon (Universidad de Granada) (2015)
-
Synthesis and characterization of high-pressure and high-temperature sphene (CaTiSiO5)
Jelena Pantić, Vladimir Urbanovich, Vesna Poharc-Logar, Bojan Jokić, Marija Stojmenović, Aleksandar Kremenović, Branko Matović (2014)Sphene (CaTiSiO5), a calcium titanosilicate ceramic has been prepared from a powder mixture of CaCO3, TiO2 and SiO2 using vibro-milling for homogenization and activation of precursors. During the highpressure and high-temperature synthesis (HPS) process at 4 GPa and 1,200 °C, sphene undergoes into phase transition, from room-temperature phase P21/a to high-temperature phase A2/a. Evidence of that structural phase transition is given in this paper using infrared, Raman spectroscopy and X-ray powder diffraction. Rietveld refinement was employed to get the ...Jelena Pantić, Vladimir Urbanovich, Vesna Poharc-Logar, Bojan Jokić, Marija Stojmenović, Aleksandar Kremenović, Branko Matović. "Synthesis and characterization of high-pressure and high-temperature sphene (CaTiSiO5)" in Physics and Chemistry of Minerals, Springer Science and Business Media LLC (2014). https://doi.org/10.1007/s00269-014-0693-x
-
Towards a Mining Equipment Ontology
... relation, which provides for a hierarchy among concepts as a hypernym is a more general term than its hyponym(s) and hence on a higher hierarchical level. As we have already mentioned, such hierarchical classification of concepts are taxonomies, usually represented by a tree structure. However, many ...
... generated for the FMG e-learning platform Moodle [11]. A RudOnto ontology could thus ultimately serve as a tool for generating a mining equipment ontology. In this paper we explain the initial steps towards such an ontology. In the next section we give a brief outline of terminological resources in general ...
... Ontology A formal representation of knowledge which includes the vocabulary containing a set of concepts, semantic relationships between those concepts and simple reasoning about a certain domain RudOnto Strong semantic description Weak semantic description Thesaurus A taxonomy ...Ranka Stanković, Ivan Obradović, Olivera Kitanović, Ljiljana Kolonja. "Towards a Mining Equipment Ontology" in Proceedings of the 12th International Conference Research and Development in Mechanical Industry, RaDMI 2012, September 2012, Vrnjačka Banja, Serbia no. 1, Vrnjačka Banja, Serbia : SaTCIP (Scientific and Technical Center for Intellectual Property) Ltd. (2012)
-
Building Terminological Resources in an e-Learning Environment
... these concepts, and a simple reasoning mechanism related to a specific domain. Ontology A formal representation of knowledge which includes the vocabulary containing a set of concepts, semantic relationships between those concepts and simple reasoning about a certain domain RudOnto ...
... some other authors, a terminological resource qualifies for an ontology only if new knowledge can be derived from segments of knowledge already existing in the resource. According to this view. an ontology is a formal representation of knowledge, which includes a vocabulary with a set of concepts, ...
... semantic description Weak semantic description Thesaurus A taxonomy expanded to include additional semantic relationships Index a list of terms specific to a certain field Glossary an index containing definitions of terms Taxonomy A hierarchical organization of terms RudOnto Figure ...Ranka Stanković, Ivan Obradović, Olivera Kitanović, Ljiljana Kolonja. "Building Terminological Resources in an e-Learning Environment" in Proceedings of the Third International Conference on e-Learning, eLearning-2012, September 2012, Belgrade, Serbia, Belgrade : Belgrade Metropolitan University (2012)
-
Advantages and Disadvantages of a Parallel and Zigzag Method of Acquisition in Walking Mode in Magnetometric Archeological Research
магнетометријска испитивања, цик-цак и паралелна аквизиција у ходајућем моду, линеарне аномалије, археологија... gradiometer, with and without the Global Positioning System. To obtain a regular data grid, the sam- pling was conducted at a 1s interval, and with a distance between the traverses of 1m. The traverses for this process were oriented in a north-south and east-west direction. The best results or, more accurately ...
... Advantages and Disadvantages of a Parallel...(93-110) Fig. 3 Contour m ap of a vertical gradient of TM I in N -S orientation. K ey as for Fig 1. 97 Archaeology and Science 10 (2014)Petković et al - Advantages and Disadvantages of a Parallel...(93-110) Fig. 3a Contour map of a vertical gradient of TMI ...
... and Disadvantages of a Parallel...(93-110) Fig. 5 Contour m ap of a vertical gradient of TM I in E-W orientation. K ey as for Fig 1. Archaeology and Science 10 (2014) 100 Petković et al - Advantages and Disadvantages of a Parallel...(93-110) Fig. 5a Contour map of a vertical gradient of TMI ...Mirko Petković, Vesna Cvetkov, Branislav Sretenović. "Advantages and Disadvantages of a Parallel and Zigzag Method of Acquisition in Walking Mode in Magnetometric Archeological Research" in Arheologija i prirodne nauke (2014)
-
OntoLex Publication Made Easy: A Dataset of Verbal Aspectual Pairs for Bosnian, Croatian and Serbian
Ovaj rad predstavlja novi jezički resurs za pretraživanje i istraživanje verbalnih aspektnih parova u BCS (bosanskom, hrvatskom i srpskom), kreiran korišćenjem principa Lingvističkih Povezanih Otvorenih Podataka (LLOD). Pošto ne postoji resurs koji bi pomogao učenicima bosanskog, hrvatskog i srpskog kao stranih jezika da prepoznaju aspekt glagola ili njegove parove, kreirali smo novi resurs koji će korisnicima pružiti informacije o aspektu, kao i link ka aspektnim parovima glagola. Ovaj resurs takođe sadrži spoljne linkove ka monolingvalnim rečnicima, Wordnetu i BabelNetu. ...Ranka Stanković, Maxim Ionov, Medina Bajtarević, Lorena Ninčević. "OntoLex Publication Made Easy: A Dataset of Verbal Aspectual Pairs for Bosnian, Croatian and Serbian" in Proceedings of the 9th Workshop on Linked Data in Linguistics @ LREC-COLING 2024, Turin, 20-25 May 2024, ELRA and ICCL (2024)
-
Fuzzy Model for Risk Assessment of Machinery Failures
The main goal of this research was the development of an algorithm for the implementation of negative risk parameters in a synthesis model for a risk level assessment for a specific machine used in the mining industry. Fuzzy sets and fuzzy logic theory, in combination with statistical methods, were applied to analyze the time picture state of the observed machine. Fuzzy logic is presented through fuzzy proposition and a fuzzy composition module. Using these tools, the symmetric position of the ...... the following: • A maintainability function as a measurement for downtime; • Records of workplace injuries as a measurement of a failure’s impact on employees; • Measured levels of pollution as a measurement of a failure’s impact on the environment. The reliability function is a measurement of the ...
... following: • A risk assessment of partial indicators of element failure; • A risk level synthesis assessment of element failure; • A hierarchical synthesis of risk assessments for the technical system. A risk level assessment of subsystem failure is a synthesis assessment based on a risk assessment ...
... variables mentioned, is a triangle in this model (Figure 2). According to Ivezić et al. [26], membership functions should have a symmetric, triangular shape for more precise linguistic terms. A fuzzy model uses symmetric fuzzy sets, a symmetric layout, and symmetric inference. A symmetric approach is ...Dejan V. Petrović, Miloš Tanasijević, Saša Stojadinović, Jelena Ivaz, Pavle Stojković. "Fuzzy Model for Risk Assessment of Machinery Failures" in Symmetry, MDPI AG (2020). https://doi.org/10.3390/sym12040525
-
Classification of Terms on a Positive-Negative Feelings Polarity Scale Based on Emoticons
Mihailo Škorić (2017)The goal of this paper is to draw attention to the possibility of using emoticon-riddled text on the web in language-neutral sentiment analysis. It introduces several innovations in the existing framework of research and tests their effectiveness. It also presents a software tool especially made for that purpose, explains how it builds a database with sentimental value of terms and offers the user manual. Finally, it presents a software tool that tests the new database and gives some examples ...... Hyperlink beginnings http:// and https:// are replaced with a neu- tral character string yy, so that they wiil not be mistaken for :/ emoticon. Finally, regular expressions [a|h|A|H][a|h|A|H][h|H][a|A][h|H][a|h|A|H] and [h|H][a|A][h|H][a|A] are used to find as many different examples of expres- sion ...
... Serbian can use it independently, create a new database or use it for a new research. 3 A total of 143 users actually suggested a determiner, and the necessary number of proposers of a new determiner was 48, or more than one-third. If enough users suggested a determiner its value was calculated as the ...
... and that the machine can decide between multiple choices using a single parameter, task is reduced to deter- mination of what is positive and what is negative. With a goal of making a more cost-effective intelligent system, it would not be a good idea to ignore any available resources and that could po- ...Mihailo Škorić. "Classification of Terms on a Positive-Negative Feelings Polarity Scale Based on Emoticons" in Infotheca, Faculty of Philology, University of Belgrade (2017). https://doi.org/10.18485/infotheca.2017.17.1.4
-
Formative evaluation of e-learning projects with the logical framework approach
... Learning is the project. The project is a temporary endeavor undertaken to create a unique product or service [6]. From a managerial perspective it is a unique set of activities designed to produce a definite result, with a clear start and end date, and a clear allocation of resources [7] (Bowen ...
... graphical-textual model that takes the form of a Matrix (Logical Framework Matrix). According to the Logical Framework a program or a project is seen as a causal sequence of events. Actions to implement it are, in sequence: a. identification of project objectives; b. identification of causal r ...
... Evaluation of E-learning projects is a topic of great interest and growing importance. The evaluation of a project is the construction of the overall judgement, based on a quali-quantitative determination of the benefits and costs associated, with scientific criterion, of a project (evaluand). The purposes ...Roberto Linzalone, Giovani Schiuma, Ivan Obradović, Ranka Stanković. "Formative evaluation of e-learning projects with the logical framework approach" in The Sixth International Conference on e-Learning (eLearning-2015), September 2015, Belgrade, Serbia, Belgrade Metropolitan Univesity (2015)
-
A Knowledge-Based Approach to Mine Ventilation Planning in Yugoslav Mining Practice
Ventilation system analysis is a complex process based on the calculation and analysis of numerous parameters. These problems can be successfully solved by the SimVent numerical package, but a full understanding and use of the obtained results require the involvement of an experienced specialist in the ventilation field. The solution was found in the creation of a hybrid system INVENTS, whose knowledge base represents a formalization of the expert knowledge in the mine ventilation field. In this paper, we ...... either a negative value of resistance (requiring the introduction of a fan) in a branch with predetermined air flow or a recirculation through a loss branch is detected after the calculation of air flow distribution in the ventilation network, the fitness of that particular solution gets a very high ...
... / s 3 - ex p lo it at io n w o rk p la ce - in ta ke a ir - re tu rn a ir - st o p p in g w it h d o o r - m ai n f an F ig . 1 4 . T h e a ir d is tr ib u ti o n o b ta in ed b y st a n d a rd m et h o d . M in . R es . E n g . 2 0 0 2 .1 1 :3 6 1 -3 ...
... rl d sc ie n ti fi c. co m b y U N IV E R S IT Y O F A L B E R T A o n 0 4 /2 6 /1 5 . F o r p er so n al u se o n ly . A Knowledge-Based Approach to Mine Ventilation Planning 363 introduction of a knowledge base, thus upgrading existing mathematical models with ...Nikola Lilić, Ivan Obradović, Ranka Stanković. "A Knowledge-Based Approach to Mine Ventilation Planning in Yugoslav Mining Practice" in Mineral Resources Engineering 11 no. 4, Imperial College Press (2002): 361-382. https://doi.org/10.1142/S0950609802001014
-
Rule-based Automatic Multi-word Term Extraction and Lemmatization
In this paper we present a rule-based method for multi-word term extraction that relies on extensive lexical resources in the form of electronic dictionaries and finite-state transducers for modelling various syntactic structures of multi-word terms. The same technology is used for lemmatization of extracted multi-word terms, which is unavoidable for highly inflected languages in order to pass extracted data to evaluators and subsequently to terminological e-dictionaries and databases. The approach is illustrated on a corpus of Serbian texts from ...... evaluation of results is to follow and if the approved MWT is to be entered into some kind of a dictionary or a terminological data-base. In this case we need a lemmatized MWT, that is, a MWT in the form of a dictionary head-word. The problem of lemmatization of special kind of MWUs, 507 ...
... number, case and animateness, belong to the AXN class. X stands for a component that does not inflect when the MWU inflects or a separator, usually a space or a hyphen. Nominal MWUs in Serbian belong to one of several tens of different general classes, but 14 of these classes account for ...
... lemmas. For instance, in the case of 2-component MWUs the order of precedence of graphs is: AXN, 2XN (a noun preceded by a word that does not inflect in the MWU), N2X (a noun followed by a word that does not inflect in the MWU), NXN. Thus, two MWU forms mašine taložnice and korita trake ‘belt ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac. "Rule-based Automatic Multi-word Term Extraction and Lemmatization" in Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016, Portorož, Slovenia, 23--28 May 2016, European Language Resources Association (2016)
-
Old or New, We Repair, Adjust and Alter (Texts)
Cvetana Krstev, Ranka Stanković (2020)U ovom radu predstavljamo kako se e-rečnici i kaskade transduktora konačnih stanja implementirani u alatu Unitex mogu koristiti za rešavanje tri problema transformacije teksta: ispravljanje tekstova nakon OCR-a, vraćanje dijakritičkih znakova i prebacivanje između različitih jezičkih varijanti.ispravka teksta, OCR greške, restauracija dijakritika , jezičke varijante, elektronski rečnik, transduktori konačnih stanja... ‘beer’ and ниво ‘level’ would be accepted. A few examples of the application of this procedure are given in Table 1. Special attention is paid to hyphenated words. A hyphen in a Serbian OCR text can signify a word hyphenated at the end of the line or a hyphen in a multi-word. Our procedure first eliminates ...
... performed. Table 2. A FST that initiates a correction for a specific replacement pair (top). A generic FST that performs a certain type of replacement (bottom). two words: the original, input word ($1$Y) and the corrected, output word ($1$X).4 The application of these FSTs always leaves in a processed texts ...
... multiple candidates for sto (sto ‘table/hundred’ and što, a functional word that can be an adverb, a pronoun or a conjunc- tion); – A list of 50 most frequent bigrams obtained from the SrpKor in which at least one word would contain letters c, s, z or a digraph dj if diacritics were removed; for instance ...Cvetana Krstev, Ranka Stanković. "Old or New, We Repair, Adjust and Alter (Texts)" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.3
-
Using English Baits to Catch Serbian Multi-Word Terminology
In this paper we present the first results in bilingual terminology extraction. The hypothesis of our approach is that if for a source language domain terminology exists as well as a domain aligned corpus for a source and a target language, then it is possible to extract the terminology for a target language. Our approach relies on several resources and tools: aligned domain texts, domain terminology for a source language, a terminology extractor for a target language, and a ...aligned texts, word alignment, terminology extraction, electronic dictionaries, morphological inflection... language, a terminology extractor for a target language, and a tool for word and chunk alignment. In this first experiment a source language is English, a target language is Serbian, a domain is Library and Information Science for which a bilingual terminological dictionary exists. Our term extractor ...
... that if for a source language domain terminology exists as well as a domain aligned corpus for a source and a target language, then it is possible to extract the terminology for a target language. Our approach relies on several resources and tools: aligned domain texts, domain terminology for a source language ...
... which a source part of a chunk matches a term from a list of domain terms in a source language: S(align.chunk) ∼ S(term.list), where symbol ∼ denotes the relation “match” (that is for our ex- periment defined in Subsection 4.5.); • Filtering once more previously filtered chunks to those in which a target ...Cvetana Krstev, Branislava Šandrih, Ranka Stanković. "Using English Baits to Catch Serbian Multi-Word Terminology" in Proceedings of the 11th International Conference on Language Resources and Evaluation, LREC 2018, Miyazaki, Japan, May 7-12, 2018, European Language Resources Association (ELRA) (2018)
-
SASA Dictionary as the Gold Standard for Good Dictionary Examples for Serbian
Ranka Stanković, Branislava Šandrih, Rada Stijović, Cvetana Krstev, Duško Vitas, Aleksandra Marković (2019)У овом раду представљамо модел за избор добрих примера за речник српског језика и развој иницијалних компоненти модела. Метода која се користи заснива се на детаљној анализи различитих лексичких и синтактичких карактеристика у корпусу састављених од примера из пет дигитализованих свезака речника САНУ. Почетни скуп функција био је инспирисан сличним приступом и за друге језике. Дистрибуција карактеристика примера из овог корпуса упоређује се са карактеристиком дистрибуције узорака реченица ексцерпираних из корпуса који садрже различите текстове. Анализа је показала да ...Српски, добри примери из речника, аутоматизација израде речника, издвајање својстава, Машинско учење... example. If a noun has a complement that affects its meaning, the complement should be represented in the example: Tada se javila u njega velika ljubav i velika podobnost za slikarstvo (paraphrase: ‘In that moment he felt a great affection and a great talent for painting’)3. If a keyword is a verb that ...
... only in paper form. At the same time, a formal description of dictionary entry was produced, and a lexical database model was developed (Stanković et al., 2018). The conversion of the SASA dictionary from unstructured text into a lexical database consisted of a thorough analysis of formatting conventions ...
... syntactic whole – with a subject and a predicate. It is even possible to add a missing sentence constituent, but it has to be in square brackets, as a mark of this kind of editorial intervention. (Though the excerpts are in the form of full sentences, the context they provide is sometimes insufficient, and ...Ranka Stanković, Branislava Šandrih, Rada Stijović, Cvetana Krstev, Duško Vitas, Aleksandra Marković. "SASA Dictionary as the Gold Standard for Good Dictionary Examples for Serbian" in Electronic lexicography in the 21st century. Proceedings of the eLex 2019 conference , Lexical Computing CZ, s.r.o. (2019)