Tanja Samardzic
2020 – today
- 2024
- [c29] Tanja Samardzic, Ximena Gutierrez, Christian Bentz, Steven Moran, Olga Pelloni:
A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data Sets. NAACL-HLT (Findings) 2024: 3367-3382
- [c28] Vani Kanjirangat, Tanja Samardzic, Ljiljana Dolamic, Fabio Rinaldi:
NLP_DI at NADI 2024 shared task: Multi-label Arabic Dialect Classifications with an Unsupervised Cross-Encoder. ArabicNLP 2024: 742-747
- [i4] Tanja Samardzic, Ximena Gutierrez-Vasques, Christian Bentz, Steven Moran, Olga Pelloni:
A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data Sets. CoRR abs/2403.03909 (2024)
- 2023
- [j5] Ximena Gutierrez-Vasques, Christian Bentz, Tanja Samardzic:
Languages Through the Looking Glass of BPE Compression. Comput. Linguistics 49(4): 943-1001 (2023)
- [c27] Michel Plüss, Jan Deriu, Yanick Schraner, Claudio Paonessa, Julia Hartmann, Larissa Schmidt, Christian Scheller, Manuela Hürlimann, Tanja Samardzic, Manfred Vogel, Mark Cieliebak:
STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions. ACL (2) 2023: 1763-1772
- [c26] Vani Kanjirangat, Tanja Samardzic, Ljiljana Dolamic, Fabio Rinaldi:
Optimizing the Size of Subword Vocabularies in Dialect Classification. VarDial@EACL 2023: 14-30
- [i3] Michel Plüss, Jan Deriu, Yanick Schraner, Claudio Paonessa, Julia Hartmann, Larissa Schmidt, Christian Scheller, Manuela Hürlimann, Tanja Samardzic, Manfred Vogel, Mark Cieliebak:
STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions. CoRR abs/2305.18855 (2023)
- 2022
- [c25] Tanja Samardzic, Ximena Gutierrez-Vasques, Rob van der Goot, Max Müller-Eberstein, Olga Pelloni, Barbara Plank:
On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers. CoNLL 2022: 266-281
- [c24] Vani Kanjirangat, Tanja Samardzic, Fabio Rinaldi, Ljiljana Dolamic:
Early Guessing for Dialect Identification. EMNLP (Findings) 2022: 6417-6426
- [c23] Olga Pelloni, Anastassia Shaitarova, Tanja Samardzic:
Subword Evenness (SuE) as a Predictor of Cross-lingual Transfer to Low-resource Languages. EMNLP 2022: 7428-7445
- [c22] Steven Moran, Christian Bentz, Ximena Gutierrez-Vasques, Olga Pelloni, Tanja Samardzic:
TeDDi Sample: Text Data Diversity Sample for Language Comparison and Multilingual NLP. LREC 2022: 1150-1158
- [c21] Vani Kanjirangat, Tanja Samardzic, Ljiljana Dolamic, Fabio Rinaldi:
NLP DI at NADI Shared Task Subtask-1: Sub-word Level Convolutional Neural Models and Pre-trained Binary Classifiers for Dialect Identification. WANLP@EMNLP 2022: 468-473
- 2021
- [c20] Tatyana Ruzsics, Olga Sozinova, Ximena Gutierrez-Vasques, Tanja Samardzic:
Interpretability for Morphological Inflection: from Character-level Predictions to Subword-level Rules. EACL 2021: 3189-3201
- [c19] Ximena Gutierrez-Vasques, Christian Bentz, Olga Sozinova, Tanja Samardzic:
From characters to words: the turning point of BPE merges. EACL 2021: 3454-3468
- 2020
- [c18] Larissa Schmidt, Lucy Linder, Sandra Djambazovska, Alexandros Lazaridis, Tanja Samardzic, Claudiu Musat:
A Swiss German Dictionary: Variation in Speech and Writing. LREC 2020: 2720-2725
- [c17] Tannon Kew, Iuliia Nigmatulina, Lorenz Nagele, Tanja Samardzic:
UZH TILT: A Kaldi recipe for Swiss German Speech to Standard German Text. SwissText/KONVENS 2020
- [c16] Iuliia Nigmatulina, Tannon Kew, Tanja Samardzic:
ASR for Non-standardised Languages with Dialectal Variation: the case of Swiss German. VarDial@COLING 2020: 15-24
- [i2] Larissa Schmidt, Lucy Linder, Sandra Djambazovska, Alexandros Lazaridis, Tanja Samardzic, Claudiu Musat:
A Swiss German Dictionary: Variation in Speech and Writing. CoRR abs/2004.00139 (2020)
2010 – 2019
- 2019
- [j4] Yves Scherrer, Tanja Samardzic, Elvira Glaser:
Digitising Swiss German: how to process and study a polycentric spoken language. Lang. Resour. Evaluation 53(4): 735-769 (2019)
- [j3] Tatyana Ruzsics, Massimo Lusetti, Anne Göhring, Tanja Samardzic, Elisabeth Stark:
Neural text normalization with adapted decoding and POS features. Nat. Lang. Eng. 25(5): 585-605 (2019)
- [i1] Tatyana Ruzsics, Tanja Samardzic:
Multilevel Text Normalization with Sequence-to-Sequence Networks and Multisource Learning. CoRR abs/1903.11340 (2019)
- 2018
- [j2] Curdin Derungs, Tanja Samardzic:
Are prominent mountains frequently mentioned in text? Exploring the spatial expressiveness of text frequency. Int. J. Geogr. Inf. Sci. 32(5): 856-873 (2018)
- [c15] Tanja Samardzic, Mark Cieliebak, Jan Milan Deriu:
Future Actions for Swiss German - Workshop Results at SwissText 2018. SwissText 2018: 95-99
- [c14] Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Ahmed Ali, Suwon Shon, James R. Glass, Yves Scherrer, Tanja Samardzic, Nikola Ljubesic, Jörg Tiedemann, Chris van der Lee, Stefan Grondelaers, Nelleke Oostdijk, Dirk Speelman, Antal van den Bosch, Ritesh Kumar, Bornini Lahiri, Mayank Jain:
Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign. VarDial@COLING 2018: 1-17
- [c13] Massimo Lusetti, Tatyana Ruzsics, Anne Göhring, Tanja Samardzic, Elisabeth Stark:
Encoder-Decoder Methods for Text Normalization. VarDial@COLING 2018: 18-28
- 2017
- [j1] Christian Bentz, Dimitrios Alikaniotis, Tanja Samardzic, Paula Buttery:
Variation in Word Frequency Distributions: Definitions, Measures and Implications for a Corpus-Based Language Typology. J. Quant. Linguistics 24(2-3): 128-162 (2017)
- [c12] Tanja Samardzic, Mirjana Starovic, Zeljko Agic, Nikola Ljubesic:
Universal Dependencies for Serbian in Comparison with Croatian and Other Slavic Languages. BSNLP@EACL 2017: 39-44
- [c11] Tatyana Ruzsics, Tanja Samardzic:
Neural Sequence-to-sequence Learning of Internal Word Structure. CoNLL 2017: 184-194
- 2016
- [c10] Christian Bentz, Tatyana Ruzsics, Alexander Koplenig, Tanja Samardzic:
A Comparison Between Morphological Complexity Measures: Typological Data vs. Language Corpora. CL4LC@COLING 2016: 142-153
- [c9] Nikola Ljubesic, Tanja Samardzic, Curdin Derungs:
TweetGeo - A Tool for Collecting, Processing and Analysing Geo-encoded Linguistic Data. COLING 2016: 3412-3421
- [c8] Tanja Samardzic, Maja Milicevic:
A Framework for Automatic Acquisition of Croatian and Serbian Verb Aspect from Corpora. LREC 2016
- [c7] Tanja Samardzic, Yves Scherrer, Elvira Glaser:
ArchiMob - A Corpus of Spoken Swiss German. LREC 2016
- 2015
- [c6] Tanja Samardzic, Nikola Ljubesic, Maja Milicevic:
Regional Linguistic Data Initiative (ReLDI). BSNLP@RANLP 2015: 40-42
- [c5] Tanja Samardzic, Robert Schikowski, Sabine Stoll:
Automatic interlinear glossing as two-level sequence classification. LaTeCH@ACL 2015: 68-72
- 2014
- [c4] Noëmi Aepli, Ruprecht von Waldenfels, Tanja Samardzic:
Part-of-Speech Tag Disambiguation by Cross-Linguistic Majority Vote. VarDial@COLING 2014: 76-84
- 2012
- [c3] Andrea Gesmundo, Tanja Samardzic:
Lemmatisation as a Tagging Task. ACL (2) 2012: 368-372
- [c2] Andrea Gesmundo, Tanja Samardzic:
Lemmatising Serbian as Category Tagging with Bidirectional Sequence Classification. LREC 2012: 2103-2106
- 2010
- [c1] Lonneke van der Plas, Tanja Samardzic, Paola Merlo:
Cross-Lingual Validity of PropBank in the Manual Annotation of French. Linguistic Annotation Workshop 2010: 113-117
last updated on 2024-10-08 20:33 CEST by the dblp team
all metadata released as open data under CC0 1.0 license