Publications of Mathias Creutz

2009

Mathias Creutz, Sami Virpioja, and Anna Kovaleva (2009).
Web augmentation of language models for continuous speech recognition of SMS text messages. In Proc. EACL 2009, 30 March - 3 April, Athens, Greece, pages 157-165.
Publisher's site ]

Mikko Kurimo, Mathias Creutz, and Ville Turunen (2009).
Morpho Challenge evaluation by information retrieval. In Advances in Multilingual and MultiModal Information Retrieval, 9th Workshop of the Cross-Language Evaluation Forum, CLEF 2008, Aarhus, Denmark, September 17-19, 2008, Revised Selected Papers, Lecture Notes in Computer Science, pages 991-998. Springer.

2008

Mikko Kurimo, Mathias Creutz, and Matti Varjokallio (2008).
Morpho Challenge evaluation using a linguistic Gold Standard. In Advances in Multilingual and MultiModal Information Retrieval, 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers, Lecture Notes in Computer Science , Vol. 5152, pages 864-873. Springer.

David Ellis, Mathias Creutz, Timo Honkela, and Mikko Kurimo (2008).
Speech to speech machine translation: Biblical chatter from Finnish to English. In Proceedings of the IJCNLP-08 Workshop on NLP for Less Privileged Languages, pages 123-130, Hyderabad, India, January 2008. Asian Federation of Natural Language Processing.

2007

Mathias Creutz, Teemu Hirsimäki, Mikko Kurimo, Antti Puurula, Janne Pylkkönen, Vesa Siivola, Matti Varjokallio, Ebru Arisoy, Murat Saraclar, and Andreas Stolcke.
Morph-Based Speech Recognition and Modeling of Out-of-Vocabulary Words Across Languages. ACM Transactions on Speech and Language Processing, Volume 5, Issue 1, Article No. 3, December 2007.
Publisher's site ]

Mikko Kurimo, Mathias Creutz, Ville Turunen (2007).
Overview of Morpho Challenge in CLEF 2007. In Working Notes of the CLEF 2007 Workshop. Edited by Alessandro Nardi and Carol Peters. 19-21 September, Budapest, Hungary.
PDF ]

Mikko Kurimo, Mathias Creutz, Matti Varjokallio (2007).
Unsupervised Morpheme Analysis Evaluation by a Comparison to a Linguistic Gold Standard @ Morpho Challenge 2007. In Working Notes of the CLEF 2007 Workshop. Edited by Alessandro Nardi and Carol Peters. 19-21 September, Budapest, Hungary.
PDF ]

Mikko Kurimo, Mathias Creutz, and Ville Turunen (2007).
Unsupervised Morpheme Analysis Evaluation by IR experiments @ Morpho Challenge 2007. In Working Notes of the CLEF 2007 Workshop. Edited by Alessandro Nardi and Carol Peters. 19-21 September, Budapest, Hungary.
PDF ]

Sami Virpioja, Jaakko J. Väyrynen, Mathias Creutz, and Markus Sadeniemi (2007).
Morphology-Aware Statistical Machine Translation Based on Morphs Induced in an Unsupervised Manner. In Proceedings of Machine Translation Summit XI, Copenhagen, Denmark, 10 - 14 September, pages 491-498.
PDF ]

Vesa Siivola, Mathias Creutz and Mikko Kurimo (2007).
Morfessor and VariKN machine learning tools for speech and language technology. In Interspeech 2007, August.
PDF ]

Mathias Creutz, Teemu Hirsimäki, Mikko Kurimo, Antti Puurula, Janne Pylkkönen, Vesa Siivola, Matti Varjokallio, Ebru Arisoy, Murat Saraclar, and Andreas Stolcke (2007).
Analysis of Morph-Based Speech Recognition and the Modeling of Out-of-Vocabulary Words Across Languages. In Proceedings of Human Language Technologies / The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2007), Rochester, NY, USA, 23-25 April, pages 380-387.
PDF ]

Mathias Creutz and Krista Lagus (2007).
Unsupervised Models for Morpheme Segmentation and Morphology Learning. ACM Transactions on Speech and Language Processing, Volume 4, Issue 1, January 2007.
Publisher's site ]

2006

Teemu Hirsimäki, Mathias Creutz, Vesa Siivola, Mikko Kurimo, Sami Virpioja, and Janne Pylkkönen (2006).
Unlimited Vocabulary Speech Recognition with Morph Language Models Applied to Finnish. Computer Speech and Language, Volume 20, Issue 4, October 2006, pages 515-541.
Publisher's site ] [ PDF (manuscript) ]

Mathias Creutz, Krista Lagus, and Sami Virpioja (2006).
Unsupervised Morphology Induction Using Morfessor. In Finite-State Methods and Natural Language Processing, Lecture Notes in Computer Science, Volume 4002, pages 300-301, Springer Berlin / Heidelberg.
Publisher's site ]

Mikko Kurimo, Mathias Creutz, Matti Varjokallio, Ebru Arisoy, and Murat Saraclar (2006).
Unsupervised segmentation of words into morphemes - Morpho Challenge 2005: Application to Automatic Speech Recognition. In the Proceedings of the International Conference on Spoken Language Processing - Interspeech 2006 ICSLP. Pittsburgh, Pennsylvania, USA, September 17-21.
PDF ]

Mathias Creutz (2006).
Induction of the Morphology of Natural Language: Unsupervised Morpheme Segmentation with Application to Automatic Speech Recognition. Doctoral thesis, Dissertations in Computer and Information Science, Report D13, Helsinki University of Technology, Espoo, Finland.
Electronic archive at the TKK library ]

Mathias Creutz and Krista Lagus (2006).
Morfessor in the Morpho Challenge. In the Proceedings of the PASCAL Challenge Workshop on Unsupervised segmentation of words into morphemes, Venice, Italy, April 12.
PDF ]

Mikko Kurimo, Mathias Creutz, Matti Varjokallio, Ebru Arisoy, and Murat Saraclar (2006).
Unsupervised segmentation of words into morphemes - Challenge 2005, An Introduction and Evaluation Report. In the Proceedings of the PASCAL Challenge Workshop on Unsupervised segmentation of words into morphemes, Venice, Italy, April 12.
PDF ]

2005

Mathias Creutz and Krista Lagus (2005).
Inducing the Morphological Lexicon of a Natural Language from Unannotated Text. In Proceedings of the International and Interdisciplinary Conference on Adaptive Knowledge Representation and Reasoning (AKRR'05), pages 106-113, Espoo, Finland, June.
PDF ]

Teemu Hirsimäki, Mathias Creutz, Vesa Siivola and Mikko Kurimo (2005).
Morphologically Motivated Language Models in Speech Recognition. In Proceedings of the International and Interdisciplinary Conference on Adaptive Knowledge Representation and Reasoning (AKRR'05), pages 121-126, Espoo, Finland, June.
PDF ]

Mathias Creutz, Krista Lagus, Krister Lindén, and Sami Virpioja (2005).
Morfessor and Hutmegs: Unsupervised Morpheme Segmentation for Highly-Inflecting and Compounding Languages. In Proceedings of the Second Baltic Conference on Human Language Technologies, pages 107-112, Tallinn, Estonia, 4 - 5 April.
PDF ] [ PS ]

Mathias Creutz and Krista Lagus (2005).
Unsupervised Morpheme Segmentation and Morphology Induction from Text Corpora Using Morfessor 1.0. Publications in Computer and Information Science, Report A81, Helsinki University of Technology, March.
PDF ] [ PS ]

2004

Mathias Creutz and Krister Lindén (2004).
Morpheme Segmentation Gold Standards for Finnish and English. Publications in Computer and Information Science, Report A77, Helsinki University of Technology, October.
PDF ] [ PS ]

Krista Lagus, Mathias Creutz, and Sami Virpioja (2004).
Latent Linguistic Codes for Morphemes using Independent Component Analysis. Ninth Neural Computation and Psychology Workshop: Modelling Language, Cognition and Action, Plymouth, England, September 8-10, New Jersey etc. 2005, World Scientific.

Mathias Creutz and Krista Lagus (2004).
Induction of a Simple Morphology for Highly-Inflecting Languages. In Proceedings of the 7th Meeting of the ACL Special Interest Group in Computational Phonology (SIGPHON), pages 43-51, Barcelona, Spain, 26 July.
PDF ] [ PS ]

2003

Vesa Siivola, Teemu Hirsimäki, Mathias Creutz, and Mikko Kurimo (2003).
Unlimited vocabulary speech recognition based on morphs discovered in an unsupervised manner. In Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech), pages 2293-2296, Geneva, Switzerland, 1-4 September.
PDF ] [ PS ]

Kadri Hacioglu, Bryan Pellom, Tolga Ciloglu, Ozlem Ozturk, Mikko Kurimo, and Mathias Creutz (2003).
On lexicon creation for Turkish LVCSR. In Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech), pages 1165-1168, Geneva, Switzerland, 1-4 September.
PDF ]

Kadri Hacioglu, Bryan Pellom, Tolga Ciloglu, Ozlem Ozturk, Mikko Kurimo, and Mathias Creutz (2003).
Word splitting for Turkish LVCSR. In Proceedings of the Turkish Signal Processing Conference (SIU 2003), Istanbul, Turkey.

Mathias Creutz (2003).
Unsupervised segmentation of words using prior distributions of morph length and frequency. In Proceedings of ACL-03, the 41st Annual Meeting of the Association of Computational Linguistics, pages 280-287, Sapporo, Japan, 7-12 July.
PDF ] [ PS ]

2002

Krista Lagus, Anu Airola, and Mathias Creutz (2002).
Data analysis of conceptual similarities of Finnish verbs. In Proceedings of CogSci 2002, the 24th annual meeting of the Cognitive Science Society, Fairfax, Virginia, USA, August 7-10.
PDF ] [ PS ]

Mathias Creutz, and Krista Lagus (2002).
Unsupervised discovery of morphemes. In Proceedings of the Workshop on Morphological and Phonological Learning of ACL-02, pages 21-30, Philadelphia, Pennsylvania, USA, July 11.
PDF ] [ PS ]

Page last updated: 24 March 2012