Publications

Articles

    2018

  • Silfverberg, M.; Hulden, M. (2018). An Encoder-Decoder Approach to the Paradigm Cell Filling Problem. In Proceedings of EMNLP. pdf code slides
  • Chen, H.; Hulden, M. (2018). The Computational Complexity of Distinctive Feature Minimization in Phonology. In Proceedings of NAACL-HLT. pdf code poster
  • Silfverberg, M.; Liu, L.; Hulden, M. (2018). A Computational Model for the Linguistic Notion of Morphological Paradigm. In Proceedings of COLING 2018. pdf code
  • Silfverberg, M.; Mao. L. J.; Hulden, M. (2018). Sound Analogies with Phoneme Embeddings. In Society for Computation in Linguistics (SCiL). pdf data&code poster
  • Cotterell, R.; Kirov, C.; Sylak-Glassman, J.; Walther, G.; Vylomova, E.; McCarthy, A. D.; Kann, K.; Mielke, S.; Nicolai, G.; Silfverberg, M.; Yarowsky, D.; Eisner, J.; Hulden, M. (2018). The CoNLL-SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection. In Proceedings of the CoNLL-SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection. pdf www github slides
  • Cotterell, R.; Kirov, C.; Hulden, M.; Eisner, J. (2018). Quantifying the Trade-off Between Two Types of Morphological Complexity. In Society for Computation in Linguistics (SCiL). pdf slides
  • Wiemerslage, A.; Silfverberg, M.; Hulden, M. (2018). Phonological Features for Morphological Inflection. In Proceedings of SIGMORPHON. pdf
  • McCarthy, A. D.; Silfverberg, M.; Cotterell, R.; Hulden, M.; Yarowsky, D. (2018). Marrying Universal Dependencies and Universal Morphology. In Proceedings of the Second Workshop on Universal Dependencies (UDW 2018). pdf github
  • Cotterell, R.; Kirov, C.; Hulden, M.; Eisner, J. (2018). On the Complexity and Typology of Inflectional Morphological Systems. In Transactions of the ACL (TACL). slides
  • Cotterell, R.; Kirov, C.; Hulden, M.; Eisner, J. (2018). On the Diachronic Stability of Irregularity in Inflectional Morphology. In NAACL. arXiv
  • Kirov, C.; Cotterell, R.; Sylak-Glassman, J.; Walther, G.; Vylomova, E.; Xia, P.; Faruqui, M.; Mielke, S. J.; McCarthy, A.; Kubler, S.; Yarowsky, D.; Eisner, J.; Hulden, M. (2018). UniMorph 2.0: Universal Morphology. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC-2018). pdf data
  • Lovick, O.; Cox, C.; Silfverberg, M.; Arppe, A.; Hulden, M. (2018). A Computational Architecture for the Morphology of Upper Tanana. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC-2018). pdf
  • Silfverberg, M.; Hulden, M. (2018). Initial Experiments in Data-Driven Morphological Analysis for Finnish. In Proceedings of IWCLUL. pdf
  • Moeller, S.; Hulden, M. (2018). Automatic Glossing in a Low-Resource Setting for Language Documentation. In Proceedings of the Workshop on Computational Modeling of Polysynthetic Languages. pdf
  • Moeller, S.; Kazeminejad, G.; Cowell, A; Hulden, M. (2018). A Neural Morphological Analyzer for Arapaho Verbs Learned from a Finite State Transducer. In Proceedings of the Workshop on Computational Modeling of Polysynthetic Languages. pdf
  • Hulden, M. (2018, in print). Formal Verification in Optimality Theory. Essays in honor of Lauri Karttunen. CSLI: Stanford
  • Hulden, M. (2018). Finite-State Technology. In Ruslan Mitkov (ed.) The Oxford Handbook of Computational Linguistics, 2nd Ed. Oxford University Press. link
  • 2017

  • Hulden, M. (2017). Formal and Computational Verification of Phonological Analyses. Phonology 34(2), pages 407–435. doi code
  • Cotterell, R.; Kirov, C.; Sylak-Glassman, J.; Walther, G.; Vylomova, E.; Xia, P.; Faruqui, M.; Kübler, S.; Yarowsky, D.; Eisner, J.; Hulden, M. (2017). CoNLL-SIGMORPHON 2017 Shared Task: Universal Morphological Reinflection in 52 Languages. In Proceedings of the CoNLL SIGMORPHON 2017 Shared Task: Universal Morphological Reinflection. pdf www github
  • Hulden, M. (2017). A Phoneme Clustering Algorithm Based on the Obligatory Contour Principle. In Proceedings of CoNLL. pdf code poster
  • Hulden, M. (2017). Rewrite Rule Grammars with Multitape Automata. Journal of Language Modelling 5(1):107–130. pdf code
  • Agirrezabal, M.; Alegria, M.; Hulden, M. (2017). A Comparison of Feature-Based and Neural Scansion of Poetry. Recent Advances in Natural Language Processing (RANLP). pdf
  • Silfverberg, M.; Hulden, M. (2017). Weakly Supervised Learning of Allomorphy. In Proceedings of the First Workshop on Subword and Character Level Models in NLP (SCLeM). pdf code poster
  • Silfverberg, M.; Hulden, M. (2017). Automatic Morpheme Segmentation and Labeling in Universal Dependencies resources. In Proceedings of the NoDaLiDa 2017 Workshop on Universal Dependencies (UDW 2017). pdf code slides
  • Liu, L.; Hulden, M. (2017). Evaluation of Finite State Morphological Analyzers Based on Paradigm Extraction from Wiktionary. In Proceedings of FSMNLP. pdf slides
  • Arppe, A.; Cox, C.; Hulden, M.; Lachler, J.; Moshagen, S. N.; Silfverberg, M.; Trosterud, T. (2017). Computational modeling of the verb in Dene languages. The case of Tsuut'ina. In Working Papers in Athabascan Linguistics ("Red Book" series), Alaska Native Language Center.
  • Kazeminejad, G.; Cowell, A.; Hulden, M. (2017). Creating lexical resources for polysynthetic languages—the case of Arapaho. Proceedings of the 2nd Workshop on on the Use of Computational Methods in the Study of Endangered Languages (ComputEL). Association for Computational Linguistics. pdf
  • 2016

  • Mao, L. J.; Hulden, M. (2016). How Regular is Japanese Loanword Adaptation? A Computational Study. In Proceedings of COLING 2016. pdf
  • Agirrezabal, M.; Algeria, I.; Hulden, M. (2016). Machine Learning for Metrical Analysis of English Poetry. In Proceedings of COLING 2016. pdf
  • Cotterell, R.; Kirov, C.; Sylak-Glassman, J.; Yarowsky, D.; Eisner, J.; Hulden, M. (2016). The SIGMORPHON 2016 Shared Task—Morphological Reinflection. In Proceedings of SIGMORPHON. Association for Computational Linguistics. pdf www slides
  • Forsberg, M.; Hulden, M. (2016). Learning Transducer Models for Morphological Analysis from Example Inflections. In Proceedings of StatFSM. Association for Computational Linguistics. pdf slides code
  • Etxeberria, I.; Alegria, I.; Uria, L.; Hulden, M. (2016). Combining Phonology and Morphology for the Normalization of Historical Texts. In Proceedings of LaTeCH. Association for Computational Linguistics. pdf
  • Agirrezabal, M.; Astigarraga, A.; Arrieta, B.; Hulden, M. (2016). ZeuScansion: A tool for scansion of English poetry. Journal of Language Modelling, Vol 4. No. 1, pp. 3-28. pdf
  • Forsberg, M.; Hulden, M. (2016). Deriving Morphological Analyzers from Example Inflections. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC-2016). pdf
  • Smith, D. E.; Hulden, M. (2016). Morphological Analysis of Sahidic Coptic for Automatic Glossing. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC-2016). pdf
  • Etxeberria, I.; Alegria, I.; Uria, L.; Hulden, M. (2016). Evaluating the Noisy Channel Model for the Normalization of Historical Texts: Basque, Spanish, and Slovene. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC-2016). pdf
  • Francom, J.; Hulden, M. (2016). Spanish Diacritic Error Correction and Restoration—A Survey. Lecture Notes in Artificial Intelligence 9561:290–303. Special Issue: Human Language Technology. Challenges for Computer Science and Linguistics. link
  • 2015

  • Hulden, M. (2015). From two-way to one-way finite automata—three regular expression based methods. In CIAA 2015. pdf code slides
  • Hulden, M. (2015). Grammar design with multitape automata and composition. In FSMNLP 2015. pdf code slides
  • Ahlberg, M.; Forsberg, M.; Hulden, M. (2015). Paradigm classification in supervised learning of morphology. In Proceedings of NAACL-HLT 2015. pdf code slides
  • Hulden, M.; Silfverberg, M.; Francom, J. (2015). Kernel density estimation for text-based geolocation. In Proceedings of AAAI 2015. pdf code poster
  • 2014

  • Hulden, M. (2014). Finite State Languages. In Mark Aronoff (ed). Oxford Bibliographies in Linguistics. New York: Oxford University Press. link
  • Hulden, M.; Forsberg, M.; Ahlberg, M. (2014). Semi-supervised learning of morphological paradigms and lexicons. In EACL 2014. pdf code
  • Adesam, Y.; Ahlberg, M.; Andersson, P.; Bouma, G.; Forsberg, M.; Hulden, M. (2014). Computer-aided morphology expansion for Old Swedish. In Proceedings of LREC 2014. pdf
  • Hulden, M. (2014). Generalizing inflection tables into paradigms with finite state operations. In Proceedings of the 2014 Joint Meeting of SIGMORPHON and SIGFSM, ACL, 29–36. pdf code
  • Nemeskey, D. M.; Tyers, F. M.; Hulden, M. (2014). Why implementation matters: Evaluation of an open-source constraint grammar parser. In Proceedings of COLING 2014. pdf
  • Agirrezabal, M.; Heinz, J.; Hulden, M.; Arrieta, B.; (2014). Assigning stress to out-of-vocabulary words: three approaches. Proceedings of the International Conference on Artificial Intelligence 2014. pdf
  • Hulden, M.; Silfverberg, M. (2014). Finite-state subset approximation of phrase structure. In Proceedings of the International Symposium on Artificial Intelligence and Mathematics (ISAIM 2014). pdf
  • Francom, J.; Hulden, M.; Ussishkin, A. (2014). ACTIV-ES: a comparable, cross-dialect corpus of 'everyday' Spanish from Argentina, Mexico, and Spain. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC-2014).pdf
  • Etxeberria, I.; Alegria, I.; Hulden, M.; Uria, L. (2014). Learning to map variation-standard forms in Basque using a limited parallel corpus and the standard morphology. In Procesamiento del Lenguaje Natural 52. pp. 13–20.
  • 2013

  • Hulden, M.; Francom, J. (2013). Weighted and unweighted transducers for tweet normalization. In Proceedings of the Tweet Normalization Workshop co-located with 29th Conference of the Spanish Society for Natural Language Processing (SEPLN 2013), 69–72. pdf
  • Francom, J.; Hulden, M. (2013). Diacritic error detection and restoration via part-of-speech tags. In Proceedings of LTC 2013.
  • Hulden, M.; Silfverberg, M.; Francom, J. (2013). Finite state applications with Javascript. In Proceedings of the 19th Nordic Conference of Computational Linguistics (NODALIDA 2013), 441–445. pdf
  • Agirrezabal M.; Arrieta B.; Hulden, M.; and Astigarraga A.; (2013). POS-tag based poetry generation with WordNet. In Proceedings of the 14th European Workshop on Natural Language Generation, ACL 2013, Sofia. pdf
  • Agirrezabal, M.; Arrieta, B.; Astigarraga, A.; Hulden, M. (2013). ZeuScansion: a tool for scansion of English poetry. In Proceedings of FSMNLP 2013. pdf
  • 2012

  • Gerdemann, D.; Hulden, M. (2012). Practical finite state optimality theory. In Proceedings of FSMNLP 2012. pdf
  • Hulden, M. (2012). Treba: efficient numerically stable EM for PFA. Journal of Machine Learning Research—Proceedings Track, 21, 249–253. pdf code
  • Agirrezabal M.; Alegria I.; Hulden, M. (2012). Using foma for language-based games. In Proceedings of the First Workshop on Games and NLP, JapTAL 2012, Kanazawa. pdf
  • Hulden, M.; Samih, Y. (2012). Conversion of procedural morphologies to finite-state morphologies: a case study of Arabic. In Proceedings of FSMNLP 2012. pdf code
  • Mayor, A.; Hulden, M.; and Labaka, G. (2012). Developing an open-source FST grammar for verb chain transfer in a Spanish-Basque MT System. In Proceedings of FSMNLP 2012. pdf
  • Agirrezabal, M.; Alegria, I.; Arrieta, B.; Hulden, M. (2012). Finite-state technology in a verse-making tool. In Proceedings of FSMNLP 2012. pdf
  • Agirrezabal, M.; Alegria, I.; Arrieta, B.; Hulden M. (2012). BAD: An assistant tool for making verses in Basque. In Proceedings of the 6th Workshop on Language Technology for Cultural Heritage, Social Sciences and Humanities, EACL 2012, Avignon. pdf
  • Hulden, M.; Francom, J. (2012). Boosting statistical tagger accuracy with simple rule-based grammars. In Proceedings of LREC 2012. pdf
  • 2011

  • Hulden, M. (2011). Constraint Grammar parsing with left and right sequential finite transducers. In Proceedings of FSMNLP 2011. pdf
  • Hulden, M. Alegria, I.; Etxeberria, I.; Maritxalar, M. (2011). Learning word-level dialectal variation as phonological replacement rules using a limited parallel corpus. First Workshop on Algorithms and Resources for Modelling of Dialects and Language Varieties, EMNLP 2011. pdf
  • Uria, L.; Hulden, M.; Etxeberria, I.; and Alegria, I. (2011). Recursos y métodos de sustitución léxica en las variantes dialectales en Euskera. SEPLN workshop: Workshop on Iberian Cross-Language NLP tasks.
  • 2010

  • Hulden, M. (2010). Parsing CFGs and PCFGs with a Chomsky-Schützenberger representation. Lecture Notes in Artificial Intelligence 6562:151-160. Special Issue: Human Language Technology. Challenges for Computer Science and Linguistics. Springer. pdf
  • Alegria, I.; Etxeberria, I.; Hulden, M.; Maritxalar, M. (2010). Porting Basque morphological grammars to foma, an open-source tool. Lecture Notes in Artificial Intelligence 6062:105-113. Springer.
  • 2009

  • Hulden, M. (2009). Regular expressions and predicate logic in finite-state language processing. Frontiers in Artificial Intelligence and Applications 191:82–97. pdf
  • Hulden, M.; Bischoff, S. T. (2009). A simple formalism for capturing reduplication in finite-state morphology. Frontiers in Artificial Intelligence and Applications 191: 207–214.
  • Hulden, M. (2009). Fast approximate string matching with finite automata. Procesamiento del Lenguaje Natural 43: 57–64. pdf poster
  • Hulden, M. (2009). Foma: a finite-state toolkit and library. Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics: 29–32. pdf www
  • Hulden, M. (2009). Revisiting multi-tape automata for Semitic morphological analysis and generation. Proceedings of the EACL 2009 Workshop on Computational Approaches to Semitic Languages: 19–26. pdf
  • 2008

  • Hulden, M.; Francom, J. (2008). Parallel multi-theory annotations of syntactic structure. Proceedings of the Sixth International Language Resources and Evaluation (LREC'08): 2339–2343. pdf
  • Hulden, M.; Bischoff, S. T. (2008). An Experiment in Computational Parsing of the Navajo Verb. Coyote Papers 16: 101–118. pdf
  • 2007

  • Hulden, M.; Bischoff, S. T. (2007). A simple formalism for capturing order and co-occurrence in computational morphology. Procesamiento del Lenguaje Natural 39: 21–26. pdf
  • 2006

  • Hulden, M. (2006). Finite-state syllabification. In Anssi Yli-Jyrä, Lauri Karttunen, and Juhani Karhumäki (eds). Finite-state methods and natural language processing: 5th international workshop, FSMNLP 2005; Lecture Notes in Artificial Intelligence 4002: 86–96. pdf code

Theses

  • Hulden, M. (2009) Finite-State Machine Construction Methods and Algorithms for Phonology and Morphology. PhD Thesis, University of Arizona. pdf
  • Hulden, M. (2004). Linguistic Complexity in Two Major American Newspapers and the Associated Press Newswire 1900–2000. Masters Thesis, Åbo Akademi University.

Invited Talks

  • Cognitively Plausible Models of Natural Language Morphology. Department of Computer Science, Chalmers University of Technology, Jun 2016, Sweden.
  • Large-Scale Learning of Natural Language Morphology. Department of Computer Science, University of the Basque Country, Nov 2015, San Sebastián.
  • Large-Scale Supervised Learning of Natural Language Morphology. Institute of Cognitive Science, University of Colorado, Sep 2015, Boulder, CO.
  • Finite-state machines for morphological analysis (and other tasks). Grammatical Framework Summer School, Jul 2015, Gozo, Malta.
  • Learning FSMs for morphology and phonology (invited tutorial). FSMNLP 2015, Jun 2015, Düsseldorf.
  • Navajo parsing and resources. CWIL 2015, University of Alberta, Jun 2015, Edmonton.
  • Supervised and semi-supervised learning of morphology. Department of Computer Science, University of Alberta, Nov 2014, Edmonton.
  • Techniques for formal verification in phonology and morphology. Department of Linguistics, University of Alberta, Nov 2014, Edmonton.
  • Formal verification in phonology. University of Gothenburg, May 2014, Gothenburg.
  • Grammatical inference in Computational Linguistics. University of Gothenburg, Nov 2013, Gothenburg.
  • Advanced finite-state techniques (invited tutorial). University of Gothenburg, Nov 2013, Gothenburg.
  • Finite state morphology and phonology (invited tutorial). Department of Linguistics, University of Delaware, Dec 2013, Newark, DE.
  • Combining Statistical and Finite-State Methods in NLP. University of Düsseldorf, Apr 2012, Düsseldorf.
  • Machine Learning of Grammatical Structure. Department of Modern Languages, University of Helsinki, Apr 2012, Helsinki.
  • Creating language resources and applications using finite-state morphological grammars (tutorial with Iñaki Alegria). Language Rescources and Evaluation Conference (LREC), May 2010, Valletta, Malta.
  • Foma: a finite-state compiler and library. (invited talk and tutorial). Department of Computer Science, University of the Basque Country, Apr 2009, Donostia-San Sebastián.