Publications

Articles

    2017

  • Hulden, M. (2017). Formal and Computational Verification of Phonological Analyses. Phonology 34(2), pages 407–435. doi code
  • Hulden, M. (2017, in print). Formal Verification in Optimality Theory. Essays in honor of Lauri Karttunen. CSLI: Stanford.
  • Hulden, M. (2017, in print). Finite-State Technology. In Ruslan Mitkov (ed.) The Oxford Handbook of Computational Linguistics, 2nd Ed. Oxford University Press.
  • Cotterell, R.; Kirov, C.; Sylak-Glassman, J.; Walther, G.; Vylomova, E.; Xia, P.; Faruqui, M.; Kübler, S.; Yarowsky, D.; Eisner, J.; Hulden, M. (2017). CoNLL-SIGMORPHON 2017 Shared Task: Universal Morphological Reinflection in 52 Languages. In Proceedings of the CoNLL SIGMORPHON 2017 Shared Task: Universal Morphological Reinflection. pdf www github
  • Hulden, M. (2017). A Phoneme Clustering Algorithm Based on the Obligatory Contour Principle. In Proceedings of CoNLL. pdf code poster
  • Hulden, M. (2017). Rewrite Rule Grammars with Multitape Automata. Journal of Language Modelling 5(1):107–130. pdf code
  • Agirrezabal, M.; Alegria, I.; Hulden, M. (2017). A Comparison of Feature-Based and Neural Scansion of Poetry. Recent Advances in Natural Language Processing (RANLP). pdf
  • Silfverberg, M.; Hulden, M. (2017). Weakly Supervised Learning of Allomorphy. In Proceedings of the First Workshop on Subword and Character Level Models in NLP (SCLeM). pdf code
  • Silfverberg, M.; Hulden, M. (2017). Automatic Morpheme Segmentation and Labeling in Universal Dependencies resources. In Proceedings of the NoDaLiDa 2017 Workshop on Universal Dependencies (UDW 2017). pdf code slides
  • Liu, L.; Hulden, M. (2017). Evaluation of Finite State Morphological Analyzers Based on Paradigm Extraction from Wiktionary. In Proceedings of FSMNLP. pdf
  • Kazeminejad, G.; Cowell, A.; Hulden, M. (2017). Creating lexical resources for polysynthetic languages—the case of Arapaho. Proceedings of the 2nd Workshop on the Use of Computational Methods in the Study of Endangered Languages (ComputEL). Association for Computational Linguistics. pdf
    2016

  • Mao, L. J.; Hulden, M. (2016). How Regular is Japanese Loanword Adaptation? A Computational Study. In Proceedings of COLING 2016. pdf
  • Agirrezabal, M.; Alegria, I.; Hulden, M. (2016). Machine Learning for Metrical Analysis of English Poetry. In Proceedings of COLING 2016. pdf
  • Cotterell, Ryan; Kirov, Christo; Sylak-Glassman, John; Yarowsky, David; Eisner, Jason; Hulden, Mans. (2016). The SIGMORPHON 2016 Shared Task—Morphological Reinflection. In Proceedings of SIGMORPHON. Association for Computational Linguistics. pdf www slides
  • Forsberg, M.; Hulden, M. (2016). Learning Transducer Models for Morphological Analysis from Example Inflections. In Proceedings of StatFSM. Association for Computational Linguistics. pdf slides
  • Etxeberria, I.; Alegria, I.; Uria, L.; Hulden, M. (2016). Combining Phonology and Morphology for the Normalization of Historical Texts. In Proceedings of LaTeCH. Association for Computational Linguistics. pdf
  • Agirrezabal, M.; Astigarraga, A.; Arrieta, B.; Hulden, M. (2016). ZeuScansion: A tool for scansion of English poetry. Journal of Language Modelling, Vol 4. No. 1, pp. 3-28. pdf
  • Forsberg, M.; Hulden, M. (2016). Deriving Morphological Analyzers from Example Inflections. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC-2016). pdf
  • Smith, D. E.; Hulden, M. (2016). Morphological Analysis of Sahidic Coptic for Automatic Glossing. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC-2016). pdf
  • Etxeberria, I.; Alegria, I.; Uria, L.; Hulden, M. (2016). Evaluating the Noisy Channel Model for the Normalization of Historical Texts: Basque, Spanish, and Slovene. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC-2016). pdf
  • Francom, J.; Hulden, M. (2016). Spanish Diacritic Error Correction and Restoration—A Survey. Lecture Notes in Artificial Intelligence 9561:290–303. Special Issue: Human Language Technology. Challenges for Computer Science and Linguistics. link
    2015

  • Hulden, M. (2015). From two-way to one-way finite automata—three regular expression based methods. In CIAA 2015. pdf code slides
  • Hulden, M. (2015). Grammar design with multitape automata and composition. In FSMNLP 2015. pdf code slides
  • Ahlberg, M.; Forsberg, M.; Hulden, M. (2015). Paradigm classification in supervised learning of morphology. In Proceedings of NAACL-HLT 2015. pdf code slides
  • Hulden, M.; Silfverberg, M.; Francom, J. (2015). Kernel density estimation for text-based geolocation. In Proceedings of AAAI 2015. pdf code
    2014

  • Hulden, M. (2014). Finite State Languages. In Mark Aronoff (ed). Oxford Bibliographies in Linguistics. New York: Oxford University Press. link
  • Hulden, M.; Forsberg, M.; Ahlberg, M. (2014). Semi-supervised learning of morphological paradigms and lexicons. In EACL 2014. pdf code
  • Adesam, Y.; Ahlberg, M.; Andersson, P.; Bouma, G.; Forsberg, M.; Hulden, M. (2014). Computer-aided morphology expansion for Old Swedish. In Proceedings of LREC 2014. pdf
  • Hulden, M. (2014). Generalizing inflection tables into paradigms with finite state operations. In Proceedings of the 2014 Joint Meeting of SIGMORPHON and SIGFSM, ACL, 29–36. pdf code
  • Nemeskey, D. M.; Tyers, F. M.; Hulden, M. (2014). Why implementation matters: Evaluation of an open-source constraint grammar parser. In Proceedings of COLING 2014. pdf
  • Agirrezabal, M.; Heinz, J.; Hulden, M.; Arrieta, B. (2014). Assigning stress to out-of-vocabulary words: three approaches. Proceedings of the International Conference on Artificial Intelligence 2014. pdf
  • Hulden, M.; Silfverberg, M. (2014). Finite-state subset approximation of phrase structure. In Proceedings of the International Symposium on Artificial Intelligence and Mathematics (ISAIM 2014). pdf
  • Francom, J.; Hulden, M.; Ussishkin, A. (2014). ACTIV-ES: a comparable, cross-dialect corpus of 'everyday' Spanish from Argentina, Mexico, and Spain. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC-2014). pdf
  • Etxeberria, I.; Alegria, I.; Hulden, M.; Uria, L. (2014). Learning to map variation-standard forms in Basque using a limited parallel corpus and the standard morphology. In Procesamiento del Lenguaje Natural 52. pp. 13–20.
    2013

  • Hulden, M.; Francom, J. (2013). Weighted and unweighted transducers for tweet normalization. In Proceedings of the Tweet Normalization Workshop co-located with 29th Conference of the Spanish Society for Natural Language Processing (SEPLN 2013), 69–72. pdf
  • Francom, J.; Hulden, M. (2013). Diacritic error detection and restoration via part-of-speech tags. In Proceedings of LTC 2013.
  • Hulden, M.; Silfverberg, M.; Francom, J. (2013). Finite state applications with Javascript. In Proceedings of the 19th Nordic Conference of Computational Linguistics (NODALIDA 2013), 441–445. pdf
  • Agirrezabal, M.; Arrieta, B.; Hulden, M.; Astigarraga, A. (2013). POS-tag based poetry generation with WordNet. In Proceedings of the 14th European Workshop on Natural Language Generation, ACL 2013, Sofia. pdf
  • Agirrezabal, M.; Arrieta, B.; Astigarraga, A.; Hulden, M. (2013). ZeuScansion: a tool for scansion of English poetry. In Proceedings of FSMNLP 2013. pdf
    2012

  • Gerdemann, D.; Hulden, M. (2012). Practical finite state optimality theory. In Proceedings of FSMNLP 2012. pdf
  • Hulden, M. (2012). Treba: efficient numerically stable EM for PFA. Journal of Machine Learning Research—Proceedings Track, 21, 249–253. pdf code
  • Agirrezabal, M.; Alegria, I.; Hulden, M. (2012). Using foma for language-based games. In Proceedings of the First Workshop on Games and NLP, JapTAL 2012, Kanazawa. pdf
  • Hulden, M.; Samih, Y. (2012). Conversion of procedural morphologies to finite-state morphologies: a case study of Arabic. In Proceedings of FSMNLP 2012. pdf code
  • Mayor, A.; Hulden, M.; Labaka, G. (2012). Developing an open-source FST grammar for verb chain transfer in a Spanish-Basque MT System. In Proceedings of FSMNLP 2012. pdf
  • Agirrezabal, M.; Alegria, I.; Arrieta, B.; Hulden, M. (2012). Finite-state technology in a verse-making tool. In Proceedings of FSMNLP 2012. pdf
  • Agirrezabal, M.; Alegria, I.; Arrieta, B.; Hulden, M. (2012). BAD: An assistant tool for making verses in Basque. In Proceedings of the 6th Workshop on Language Technology for Cultural Heritage, Social Sciences and Humanities, EACL 2012, Avignon. pdf
  • Hulden, M.; Francom, J. (2012). Boosting statistical tagger accuracy with simple rule-based grammars. In Proceedings of LREC 2012. pdf
    2011

  • Hulden, M. (2011). Constraint Grammar parsing with left and right sequential finite transducers. In Proceedings of FSMNLP 2011. pdf
  • Hulden, M.; Alegria, I.; Etxeberria, I.; Maritxalar, M. (2011). Learning word-level dialectal variation as phonological replacement rules using a limited parallel corpus. First Workshop on Algorithms and Resources for Modelling of Dialects and Language Varieties, EMNLP 2011. pdf
  • Uria, L.; Hulden, M.; Etxeberria, I.; Alegria, I. (2011). Recursos y métodos de sustitución léxica en las variantes dialectales en Euskera. SEPLN workshop: Workshop on Iberian Cross-Language NLP tasks.
    2010

  • Hulden, M. (2010). Parsing CFGs and PCFGs with a Chomsky-Schützenberger representation. Lecture Notes in Artificial Intelligence 6562:151-160. Special Issue: Human Language Technology. Challenges for Computer Science and Linguistics. Springer. pdf
  • Alegria, I.; Etxeberria, I.; Hulden, M.; Maritxalar, M. (2010). Porting Basque morphological grammars to foma, an open-source tool. Lecture Notes in Artificial Intelligence 6062:105-113. Springer.
    2009

  • Hulden, M. (2009). Regular expressions and predicate logic in finite-state language processing. Frontiers in Artificial Intelligence and Applications 191:82–97. pdf
  • Hulden, M.; Bischoff, S. T. (2009). A simple formalism for capturing reduplication in finite-state morphology. Frontiers in Artificial Intelligence and Applications 191: 207–214.
  • Hulden, M. (2009). Fast approximate string matching with finite automata. Procesamiento del Lenguaje Natural 43: 57–64. pdf
  • Hulden, M. (2009). Foma: a finite-state toolkit and library. Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics: 29–32. pdf www
  • Hulden, M. (2009). Revisiting multi-tape automata for Semitic morphological analysis and generation. Proceedings of the EACL 2009 Workshop on Computational Approaches to Semitic Languages: 19–26. pdf
    2008

  • Hulden, M.; Francom, J. (2008). Parallel multi-theory annotations of syntactic structure. Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08): 2339–2343. pdf
  • Hulden, M.; Bischoff, S. T. (2008). An Experiment in Computational Parsing of the Navajo Verb. Coyote Papers 16: 101–118. pdf
    2007

  • Hulden, M.; Bischoff, S. T. (2007). A simple formalism for capturing order and co-occurrence in computational morphology. Procesamiento del Lenguaje Natural 39: 21–26. pdf
    2006

  • Hulden, M. (2006). Finite-state syllabification. In Anssi Yli-Jyrä, Lauri Karttunen, and Juhani Karhumäki (eds). Finite-state methods and natural language processing: 5th international workshop, FSMNLP 2005; Lecture Notes in Artificial Intelligence 4002: 86–96. pdf code

Theses

  • Hulden, M. (2009). Finite-State Machine Construction Methods and Algorithms for Phonology and Morphology. PhD Thesis, University of Arizona. pdf
  • Hulden, M. (2004). Linguistic Complexity in Two Major American Newspapers and the Associated Press Newswire 1900–2000. Master's Thesis, Åbo Akademi University.

Invited Talks

  • Cognitively Plausible Models of Natural Language Morphology. Department of Computer Science, Chalmers University of Technology, Jun 2016, Sweden.
  • Large-Scale Learning of Natural Language Morphology. Department of Computer Science, University of the Basque Country, Nov 2015, San Sebastián.
  • Large-Scale Supervised Learning of Natural Language Morphology. Institute of Cognitive Science, University of Colorado, Sep 2015, Boulder, CO.
  • Finite-state machines for morphological analysis (and other tasks). Grammatical Framework Summer School, Jul 2015, Gozo, Malta.
  • Learning FSMs for morphology and phonology (invited tutorial). FSMNLP 2015, Jun 2015, Düsseldorf.
  • Navajo parsing and resources. CWIL 2015, University of Alberta, Jun 2015, Edmonton.
  • Supervised and semi-supervised learning of morphology. Department of Computer Science, University of Alberta, Nov 2014, Edmonton.
  • Techniques for formal verification in phonology and morphology. Department of Linguistics, University of Alberta, Nov 2014, Edmonton.
  • Formal verification in phonology. University of Gothenburg, May 2014, Gothenburg.
  • Grammatical inference in Computational Linguistics. University of Gothenburg, Nov 2013, Gothenburg.
  • Advanced finite-state techniques (invited tutorial). University of Gothenburg, Nov 2013, Gothenburg.
  • Finite state morphology and phonology (invited tutorial). Department of Linguistics, University of Delaware, Dec 2013, Newark, DE.
  • Combining Statistical and Finite-State Methods in NLP. University of Düsseldorf, Apr 2012, Düsseldorf.
  • Machine Learning of Grammatical Structure. Department of Modern Languages, University of Helsinki, Apr 2012, Helsinki.
  • Creating language resources and applications using finite-state morphological grammars (tutorial with Iñaki Alegria). Language Resources and Evaluation Conference (LREC), May 2010, Valletta, Malta.
  • Foma: a finite-state compiler and library (invited talk and tutorial). Department of Computer Science, University of the Basque Country, Apr 2009, Donostia-San Sebastián.