Претрага
97 items
-
WS4LR - a Worksation for Lexical Resources
... matches on the ticket for each pair using symbols defined by the rules of the game. XML Output from XAlign linking" id="l3" /> Aligned Text in TMX FormatFiles: 01Sreca-EN-id 01Sreca-SR-id
... categories of the word form, each category represented by a single character code. Morphological dictionaries of compound lemmas and word forms, named DELAC and DELACF, follow a similar format, except that both lemma and form can contain non-alphabetic characters – blank, hyphen, apostrophe, and ...Cvetana Krstev, Ranka Stanković, Duško Vitas, Ivan Obradović. "WS4LR - a Worksation for Lexical Resources" in Proceedings of the Fifth Interantional Conference on Language Resources and Evaluation, Genoa, Italy, May 2006, ELRA - European Language Resources Association (2006)
-
Integrisano okruženje za pripremu paralelizovanog korpusa
Razvoj paralelizovanih korpusa zahteva pripremu paralelnih tekstova za njihovu integraciju u paralelizovani korpus. Reč je o jednom kompleksnom zadatku koji se može rešiti na različite načine, i koji mora da se odvija u nekoliko koraka. U ovom radu najpre je iznet postupak pripreme paralelnih tekstova za paralelizovani korpus koji se koristi u Grupi za jezičke tehnologije Univerziteta u Beogradu. Potom je dat kratak pregled programa (XAlign, Concordancier, WS4LR), odnosno softverskih alata koji se pri tome koriste. Nedostatak udobnog okruženja ...... symbols defined by the rules of the game. XML kodirana informacija o uparivanju (treći izlazni XML dokument): linking" id="l3" /> y Kada je u pitanju XML kodirana informacija o uparivanju, treba imati u vidu da se u toku paralelizacije ...Ivan Obradović, Ranka Stanković, Miloš Utvić. "Integrisano okruženje za pripremu paralelizovanog korpusa" in Zbornik radova međunarodnog simpozijuma Razlike između bosanskog/bošnjačkog, hrvatskog i srpskog jezika, Graz, Austria, April 2007, - (2007)
-
Нове технологије за оживљавање старих текстова
удаљено читање, књижевни корпус, обрада српског језика, анотација врстом речи, лематизација, именовани ентитетиЦветана Крстев, Ранка Станковић, Бранислава Шандрих Тодоровић, Милица Иконић Нешић. "Нове технологије за оживљавање старих текстова" in Зборник радова Међународне научне конференције Дигитална хуманистика и словенско културно наслеђе II, Београд, 28-29 јуни 2021., Београд : Савез славистичких друштава Србије (2023)
-
E-Connecting Balkan Languages
In this paper we present a versatile language processing tool that can be successfully used for many Balkan languages. This tool relies for its work on several sophisticated textual and lexical resources that were developed for most of Balkan languages. These resources are based on several de facto standards in natural language processing.... including Balkan languages. Regarding its content, it represents a suitable text for different types of analysis, especially in the domain of named entity recognition (geographical concepts and different measures). Besides that, it was already used for some interesting research, e.g. multi-word ...Cvetana Krstev, Ranka Stanković, Duško Vitas, Svetla Koeva. "E-Connecting Balkan Languages" in Proceedings of the Workshop Workshop on Multilingual resources, technologies and evaluation for Central and Eastern European Languages, 17 September 2009, eds. C. Vertan, S. Piperidis, E. Paskaleva and Milena Slavcheva, Borovets, Bulgaria : Association for Computational Linguistics Stroudsburg, PA, USA (2009)
-
Репозиторијум дигиталних идентификатора објеката – доиФил: изградња, стање и перспективе
Милош Утвић, Михаило Шкорић (2017)Филолошки факултет Универзитета у Београду је крајем јула 2015. године склопио уговор са агенцијом Crossref и тиме стекао право да својим публикацијама додељује дигитални идентификатор објекта (DOI), као и публикацијама других издавача које заступа. Истовремено, у склопу уговора, Филолошки факултет је развио репозиторијум доиФил који за све регистроване публикације чува њихове библиографске метаподатке, хипервезу ка комплетном тексту и сам DOI. После припремних радњи и разраде процеса на којима је заснован рад репозиторијума, у новембру 2016. године је репозиторијум доиФил ...... : THE CONSTRUCTION, STATE AND PROSPECTS Summary University of Belgrade, Faculty of Philology, became a member of The Publishers International Linking Association, Inc. (“PILA”) and The DOI Registration Agency Crossref in July 2015. As a member of Crossref community, Faculty of Philology can register ...Милош Утвић, Михаило Шкорић. "Репозиторијум дигиталних идентификатора објеката – доиФил: изградња, стање и перспективе" in Научна конференција Библиоинфо — 55 година од покретања наставе библиотекарства на високошколском нивоу, Београд 18. мај 2017., Филолошки факултет Универзитета у Београду (2017). https://doi.org/10.18485/biblioinfo.2017.ch7
-
OntoLex Publication Made Easy: A Dataset of Verbal Aspectual Pairs for Bosnian, Croatian and Serbian
Ovaj rad predstavlja novi jezički resurs za pretraživanje i istraživanje verbalnih aspektnih parova u BCS (bosanskom, hrvatskom i srpskom), kreiran korišćenjem principa Lingvističkih Povezanih Otvorenih Podataka (LLOD). Pošto ne postoji resurs koji bi pomogao učenicima bosanskog, hrvatskog i srpskog kao stranih jezika da prepoznaju aspekt glagola ili njegove parove, kreirali smo novi resurs koji će korisnicima pružiti informacije o aspektu, kao i link ka aspektnim parovima glagola. Ovaj resurs takođe sadrži spoljne linkove ka monolingvalnim rečnicima, Wordnetu i BabelNetu. ...Ranka Stanković, Maxim Ionov, Medina Bajtarević, Lorena Ninčević. "OntoLex Publication Made Easy: A Dataset of Verbal Aspectual Pairs for Bosnian, Croatian and Serbian" in Proceedings of the 9th Workshop on Linked Data in Linguistics @ LREC-COLING 2024, Turin, 20-25 May 2024, ELRA and ICCL (2024)
-
Projecting short to medium-term effects of carbon emissions trading on Serbia's GDP
Boban Pavlović, Nikola Trtica (2024)This paper presents an approach to assess the short to medium-term effects of the Emissions Trading System (ETS), an economic and climate policy instrument aimed at reducing greenhouse gas emissions, on GDP from the expenditure side. The analysis focuses on Serbia’s electricity sector, which relies heavily on lignite-based pro duction dominated by state-owned enterprises. The obtained projections suggest that in the short term, the introduction of an ETS may lead to slower GDP growth compared to official forecasts. However, ...... Change Res. 13 (1), 82–96. https://doi. org/10.1016/j.accre.2021.11.002. Li, M., Weng, Y., Duan, M., 2019. Emissions, energy and economic impacts of linking China's national ETS with the EU ETS. Appl. Energy 235, 1235–1244. https://doi. org/10.1016/j.apenergy.2018.11.047. Maćić, L.J., 2023. Introducing ...
... October 2024). Oliveira, T.D., Gurgel, A.C., Tonry, S., 2021. Potential trading partners of a brazilian emissions trading scheme: the effects of linking with a developed region (Europe) and two developing regions (Latin America and China). Technol. Forecast. Soc. Change 171, 120947. https://doi.org/10 ...Boban Pavlović, Nikola Trtica. "Projecting short to medium-term effects of carbon emissions trading on Serbia's GDP" in Utilities Policy, Elsevier BV (2024). https://doi.org/10.1016/j.jup.2024.101855
-
A bilingual digital library for academic and entrepreneurial knowledge management
A generic knowledge management process of organization, storage and retrieval of knowledge can suitably be fitted in a digital library. In the digital and knowledge age digital libraries can be used in knowledge management to handle intellectual assets and support knowledge creation. A multilingual digital library either stores content in more than one language or provides multilingual query access to monolingual content. In Serbia 18 of 308 scientific journals regularly published are bi-lingual, with papers simultaneously being in English ...... to the development of Serbian wordnet, aligned multilingual corpus, and many other language resources. She also developed the first Serbian Named Entity Recognition system. She participated in a number of international and national language and TEL related projects. Biljana Lazić is librarian ...
... results of international projects is also an increasingly frequent practice. In this paper we present a publicly available multilingual digital library named Bibliša, developed for management, search and the browsing of aligned bilingual text collections. Design/methodology/approach – The approach ...
... made an effort to develop a software tool and bi-lingual resource that supports terminology research. This paper presents a developed solution, named Bibliša1, which is free for use and publicly available. In the second section of this paper are presented the aspects of a multilingual digital ...Ranka Stanković, Cvetana Krstev, Biljana Lazić, Dalibor Vorkapić. "A bilingual digital library for academic and entrepreneurial knowledge management" in Proceeding of 10th International Forum on Knowledge Asset Dynamics — IFKAD 2015: Culture, Innovation and Entrepreneurship: connecting the knowledge dots, Bari, Italy, 10-12 June 2015, Bari : IFKAD (2015)
-
Resource-based WordNet Augmentation and Enrichment
In this paper we present an approach to support production of synsets for SerbianWordNet(SerWN)byadjustingPrincetonWordNet(PWN)synsetsusing several bilingual English-Serbian resources. PWN synset definitions were automatically translated and post-edited, if needed, while candidate literals for Serbian synsets were obtained automatically from a list of translational equivalents compiled form bilingual resources. Preliminary results obtained from a setof1248selectedPWNsynsetsshowthattheproducedSerbiansynsetscontain 4024 literals, out of which 2278 were offered by the system we present in this paper, whereas experts added the remaining 1746. Approximately one half of ...... g a wordnet: Procedures and tools. In Proceedings of the GWC 2014, pages 55–62. Proceedings of CLIB 2018 113 Niles, I. and Pease, A. (2003). Linking Lexicons and Ontologies: Mapping WordNet to the Suggested Upper Merged Ontology. Proceedings of the IEEE International Conference on Information and ...Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev. "Resource-based WordNet Augmentation and Enrichment" in Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria, Sofia : The Institute for Bulgarian Language Prof. Lyubomir Andreychin, Bulgarian Academy of Sciences (2018)
-
From DELA Based Dictionary to Leximirka Lexical Database
Biljana Lazić, Mihailo Škorić (2020)In this paper, we will present an approach in transforming Serbian language Morphological dictionaries from a DELA text format to a lexical database dubbed Leximirka. Considering the benefits of storing data within a database when compared to storing them in textual documents, we will outline some of the functionality that the database has made possible. We will also show how hand-made rules that use category labels lexical entries are marked with can be used to link lexical entries. ...... encoding traditional dictionaries intended for human use. This does not mean that the situation is not about to change, because there is an interest in linking to the Simple Knowledge Organiza- tion System (SKOS) ontologies (Declerck et al., 2010) and the Lemon model within the community that uses TEI. At ...
... textual files. The need for an online application based on lexicographic database emerged. In response to these needs, a new application symbolically named Leximirka was developed. Leximirka is not a language dependent application and there is no obstacle for it to be applied in purpose of maintaining DELA ...
... started with the implementation of the Workstation for Lexical Resources WS4LR (Krstev et al., 2006). This single user desktop application later re- named Leximir has various useful functions. It is possible to distribute vo- cabularies in multiple files, extract subsets of lemmas according to various ...Biljana Lazić, Mihailo Škorić. "From DELA Based Dictionary to Leximirka Lexical Database" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.4
-
Two approaches to compilation of bilingual multi-word terminology lists from lexical resources
In this paper, we present two approaches and the implemented system for bilingual terminology extraction that rely on an aligned bilingual domain corpus, a terminology extractor for a target language, and a tool for chunk alignment. The two approaches differ in the way terminology for the source language is obtained: the first relies on an existing domain terminology lexicon, while the second one uses a term extraction tool. For both approaches, four experiments were performed with two parameters being ...Branislava Šandrih, Cvetana Krstev, Ranka Stanković. "Two approaches to compilation of bilingual multi-word terminology lists from lexical resources" in Natural Language Engineering, Cambridge University Press (CUP) (2020). https://doi.org/10.1017/S1351324919000615
-
Чији је пример? Анализа лексичких обележја на примерима Речника САНУ
У овом раду поставља се питање: да ли се може утврдити ко је аутор неког текста уколико се анализирају искључиво његова лексичка обележја? Како бисмо покушали да добијемо одговор на ово питање, посматрали смо примере у оквиру речничког чланка појединачне лексеме Речника САНУ, који су забележени у пет томова (и то: I, II, XVIII, XIX и XX). Сваки пример је преузет из неког извора на шта упућују скраћенице, наведене у заградама. Од преко 5.000 понуђених извора, определили смо се ...... обележавања текста према врсти речи (енгл. Part-of-Speech Tagging), лематизације (енгл. Lemmatization), препозна- вања именованих ентитета (енгл. Named Entity Recognition), класификације текста (енгл. Text Classification) и многих других. Под класификовањем текста подразумева се обучавање математичког ...Бранислава Б. Шандрих, Ранка М. Станковић, Мирјана С. Гочанин. "Чији је пример? Анализа лексичких обележја на примерима Речника САНУ" in Српски језик и његови ресурси, Међународни славистички центар, Филолошки факултет, Универзитет у Београду (2019). https://doi.org/10.18485/msc.2019.48.3.ch13
-
Definition of Circulation Conditions and Groundwater Genesis of the Complex Krupaja Hydrogeological Karst System (Eastern Serbia)
Management, Monitoring, Policy and Law,Renewable Energy, Sustainability and the Environment, Geography, Planning and Development, Building and Construction... karst massif. In Proceedingss of the International Conference and Field Seminar, Karst without Boundaries (DIKTAS 2014), Trebinje, Republika Srpska Entity of Bosnia and Herzegovina, 11–15 June 2014; COBISS.RSID 4308248; pp. 73-78, ISBN 978-99938-52-58-2. Milanović, S. Physical Modeling, of Karst Environment ...Ljiljana Vasić, Saša Milanović, Tina Dašić. "Definition of Circulation Conditions and Groundwater Genesis of the Complex Krupaja Hydrogeological Karst System (Eastern Serbia)" in Sustainability, MDPI AG (2023). https://doi.org/10.3390/su151411146
-
Old or New, We Repair, Adjust and Alter (Texts)
Cvetana Krstev, Ranka Stanković (2020)U ovom radu predstavljamo kako se e-rečnici i kaskade transduktora konačnih stanja implementirani u alatu Unitex mogu koristiti za rešavanje tri problema transformacije teksta: ispravljanje tekstova nakon OCR-a, vraćanje dijakritičkih znakova i prebacivanje između različitih jezičkih varijanti.ispravka teksta, OCR greške, restauracija dijakritika , jezičke varijante, elektronski rečnik, transduktori konačnih stanja... reflection of jat do not have a variant marker, as illustrated in Table 6. As explained in (Lazić and Škorić, 2019), specific rules were developed for linking corresponding Ekavian and Ijekavian entries. This enabled the production of two specific dictionaries: Ijk2Ek for transforming an Ekavian text to Ijekavian ...
... ACM, 1959, 225–232. http://doi.acm.org/10. 1145/1460299.1460326 Friburger, Nathalie and Denis Maurel. “Finite-state transducer cascades to extract named entities in texts”. Theoretical Computer Science Vol. 313, no. 1 (2004): 93–104 Henton, Caroline G. “Automated transformation from American English ...Cvetana Krstev, Ranka Stanković. "Old or New, We Repair, Adjust and Alter (Texts)" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.3
-
Using Lexical Resources for Irony and Sarcasm Classification
The paper presents a language dependent model for classification of statements into ironic and non-ironic. The model uses various language resources: morphological dictionaries, sentiment lexicon, lexicon of markers and a WordNet based ontology. This approach uses various features: antonymous pairs obtained using the reasoning rules over the Serbian WordNet ontology (R), antonymous pairs in which one member has positive sentiment polarity (PPR), polarity of positive sentiment words (PSP), ordered sequence of sentiment tags (OSA), Part-of-Speech tags of words (POS) ...... level of this set of word forms. For that purpose, we used a hybrid system for Serbian that combines three NLP tasks: PoS tagging, compound and named-entity recognition [10] (step 5 in Fig. 1) that was trained on various annotated texts – literary, newspaper and textbooks. Tagging results are represented ...Miljana Mladenović, Cvetana Krstev, Jelena Mitrović, Ranka Stanković. "Using Lexical Resources for Irony and Sarcasm Classification" in Proceedings of the 8th Balkan Conference in Informatics (BCI '17), New York, NY, USA, : ACM (2017). https://doi.org/
-
Development of terminological resources for expert knowledge: a case study in mining
Ljiljana Kolonja, Ranka Stanković, Ivan Obradović, Olivera Kitanović, Aleksandar Cvjetić. "Development of terminological resources for expert knowledge: a case study in mining" in Knowledge Management Research & Practice, Palgrave Macmillan (2015). https://doi.org/10.1057/kmrp.2015.10
-
Valorization of non-balanced coal reserves in Serbia for underground coal gasification
David Petrović, Lazar Kričak, Milanka Negovanović, Stefan Milanović, Jovan Marković, Nikola Simić, Ljubisav Stamenić (2019)In the name of a better and safer energy future, it is our responsibility to focus our knowledge and activities to save on imported liquid and gas fossil fuels, as well as coal on which energy security of Serbia is based. The rationalization in the use of available energy resources certainly positively affects economy and the environment of a country. This paper indicates motivations for the application of the underground coal gasification process, as well as surface gasification for ...... Lawrence Livermore National Laborato- ry, Livermore, Cal., USA, 2007 7} Blinderman, M. S., et. al., Exergy Optimisation of Reserve Combustion Linking in UCG, Journa! of the 'Energy Institute, 81 (2008), 1, pp. 7-13 &] Bowen,B. H., Irwin, M. W., Underground Coal Gasification (UCG), Indiana Center ...David Petrović, Lazar Kričak, Milanka Negovanović, Stefan Milanović, Jovan Marković, Nikola Simić, Ljubisav Stamenić. "Valorization of non-balanced coal reserves in Serbia for underground coal gasification" in Thermal Science (2019). https://doi.org/10.2298/TSCI190725390P
-
Assessments of effects of implementation of strategic plans for development of Belgrade District heating system
The district heating system (DHS) is one of the key elements of the Belgrade energy system. Overview of the current state and basic characteristics of the DHS are presented in this paper. The focus is on the planned development of DHS and on the analyses of effects of realization of measures and projects defined by strategic documents related to DHS which are adopted by city authorities (Development Strategy of DHS and Strategy of Belgrade Development). Two exploratory scenarios were ...Dejan Ivezić, Marija Živković, Aleksandar Madžarević, Miodrag Grujić. "Assessments of effects of implementation of strategic plans for development of Belgrade District heating system" in Sustainable Cities and Society, Elsevier BV (2020). https://doi.org/10.1016/j.scs.2020.102304
-
Advancing Sentiment Analysis in Serbian Literature: A Zero and Few-Shot Learning Approach Using the Mistral Model
Ova studija predstavlja analizu sentimenta srpskih starih romana iz perioda 1840-1920, koristeći veliki jezički model (LLM) Mistral za tehniku učenja sa zasnovani na takozvanim "zero" i "few-shot" pokušajima. Glavni pristup uvodi inovacije osmišljavanjem istraživačkih upita (promptova) uključuju tekst sa uputstvom za klasifikaciju bez primera i na osnovu nekoliko primera, omogućavajući jezičkom modelu da klasifikuje osećanja u pozitivne, negativne ili objektivne kategorije. Ova metodologija ima za cilj da pojednostavi analizu osećanja ograničavanjem odgovora, čime se povećava preciznost ...Milica Ikonić Nešić, Saša Petalinkar, Mihailo Škorić, Ranka Stanković, Biljana Rujević. "Advancing Sentiment Analysis in Serbian Literature: A Zero and Few-Shot Learning Approach Using the Mistral Model" in Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, Sofia, Bulgaria, 9-10 September 2024, LREC | COLING (2024)
-
A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian
Uvredljivi govor na društvenim medijima, uključujući psovke, pogrdni govor i govor mržnje, dostigao je nivo pandemije. Sistem koji bi bio u stanju da detektuje takve tekstove mogao bi da pomogne da internet i društveni mediji postanu bolji virtuelni prostor sa više poštovanja. Istraživanja i komercijalna primena u ovoj oblasti do sada su bili fokusirani uglavnom na engleski jezik. Ovaj rad predstavlja rad na izgradnji AbCoSER-a, prvog korpusa uvredljivog govora na srpskom jeziku. Korpus se sastoji od 6.436 ručno označenih ...... and expressions, rhetorical figures, etc. To expand the dictionary, synsets from the Serbian WordNet and the dictionary of synonyms will be used for linking with Twitter examples. Regarding the categorization of terms in the lexicon, the Hatebase scheme5 was used as a guideline because it is already a ...Danka Jokić, Ranka Stanković, Cvetana Krstev, Branislava Šandrih. "A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian" in 3rd Conference on Language, Data and Knowledge (LDK 2021), MDPI AG (2021). https://doi.org/10.4230/OASIcs.LDK.2021.13