Publications related to morpheme discovery and its applications
- Grönroos, S.-A., Hiovain, K., Smit, P., Rauhala, I., Jokinen, K., Kurimo, M., and Virpioja, S. (2016).
Low-Resource Active Learning of Morphological Segmentation. Northern European Journal of Language Technology, Volume 4, 2016, pp. 47-72.
- Ruokolainen, T., Kohonen, O., Sirts, K., Grönroos, S.-A.,
Kurimo, M., and Virpioja, S. (2016).
A Comparative Study of Minimally Supervised Morphological Segmentation. Computational Linguistics, Volume 42, Issue 1, March 2016, pp. 91-120.
- Grönroos, S.-A., Virpioja, S., Smit, P., and Kurimo, M. (2014).
Morfessor FlatCat: An HMM-based method for unsupervised and semi-supervised learning of morphology.
In Proceedings of the 25th International
Conference on Computational Linguistics. Pages 1177-1185, Dublin, Ireland,
August 2014, Association for Computational Linguistics.
- Virpioja, S., Smit, P., Grönroos, S.-A., and Kurimo, M. (2013).
Morfessor 2.0: Python Implementation and Extensions for Morfessor Baseline.
Aalto University publication series SCIENCE + TECHNOLOGY, 25/2013.
Aalto University, Helsinki, 2013. ISBN 978-952-60-5501-5.
- Virpioja, S., Kohonen, O., and Lagus, K. (2011).
Evaluating the effect of word frequencies in a probabilistic
generative model of morphology. In Proceedings of the 18th
Nordic Conference of Computational Linguistics (NODALIDA 2011),
volume 11 of NEALT Proceedings Series, pages 230-237. Northern
European Association for Language Technology, Riga, Latvia.
- Kohonen, O., Virpioja, S., and Lagus, K. (2010).
Semi-supervised learning of concatenative morphology.
In Proceedings of the 11th Meeting of the ACL Special Interest
Group on Computational Morphology and Phonology, pages 78-86,
Uppsala, Sweden, July. Association for Computational Linguistics.
- Virpioja, S., Kohonen, O., and Lagus, K. (2010).
Unsupervised morpheme analysis with Allomorfessor.
In Multilingual Information Access Evaluation I. Text Retrieval
Experiments: 10th Workshop of the Cross-Language Evaluation Forum,
CLEF 2009, Corfu, Greece, September 30 - October 2, 2009, Revised
Selected Papers, volume 6241 of Lecture Notes in Computer
Science, pages 578-597. Springer.
- Kohonen, O., Virpioja, S., and Klami,
M. (2009).
Allomorfessor: Towards unsupervised morpheme
analysis. In Evaluating Systems for Multilingual and
Multimodal Information Access: 9th Workshop of the Cross-Language
Evaluation Forum, CLEF 2008, Aarhus, Denmark, September 17-19, 2008,
Revised Selected Papers, volume 5706 of Lecture Notes in Computer
Science, pages 975-982. Springer Berlin / Heidelberg.
- Creutz, M., and Lagus, K. (2007).
Unsupervised
Models for Morpheme Segmentation and Morphology Learning. ACM
Transactions on Speech and Language Processing, Volume 4, Issue
1, Article 3, January 2007.
- Creutz, M., Lagus, K., and Virpioja,
S. (2006).
Unsupervised
Morphology Induction Using Morfessor. In Finite-State Methods
and Natural Language Processing, Lecture Notes in Computer
Science, Volume 4002, pages 300-301, Springer Berlin /
Heidelberg.
- Creutz, M., and Lagus, K. (2006).
Morfessor
in the Morpho Challenge. In Proceedings of the PASCAL
Challenge Workshop on Unsupervised segmentation of words into
morphemes, Venice, Italy, April 12.
- Creutz, M., and Lagus, K. (2005).
Inducing
the Morphological Lexicon of a Natural Language from Unannotated
Text. In Proceedings of the International and
Interdisciplinary Conference on Adaptive Knowledge Representation and
Reasoning (AKRR'05), Espoo, Finland, June 15-17.
- Creutz, M. and Lagus, K. (2005).
Unsupervised
Morpheme Segmentation and Morphology Induction from Text Corpora Using
Morfessor 1.0. Publications in Computer and Information Science,
Report A81, Helsinki University of Technology, March.
- Creutz, M. and Lagus, K. (2004).
Induction
of a Simple Morphology for Highly-Inflecting Languages. In
Proceedings of the 7th Meeting of the ACL Special Interest Group
in Computational Phonology (SIGPHON), pages 43-51, Barcelona, 26
July.
- Creutz, M. (2003).
Unsupervised
segmentation of words using prior distributions
of morph length and frequency. In Proceedings of ACL-03,
the 41st Annual Meeting of the Association for Computational
Linguistics, pages 280-287, Sapporo, Japan, 7-12 July.
- Creutz, M. and Lagus, K. (2002).
Unsupervised discovery of
morphemes. In Proceedings of the Workshop on Morphological and
Phonological Learning of ACL-02, pages 21-30, Philadelphia,
Pennsylvania, 11 July.
- Virpioja, S., Turunen, V. T., Spiegler, S., Kohonen, O., and
Kurimo, M. (2011).
Empirical comparison of evaluation methods for
unsupervised learning of morphology. Traitement Automatique des
Langues, 52(2):45-90, 2011.
- Kurimo, M., Virpioja, S., Turunen, V., and Lagus, K. (2010).
Morpho Challenge 2005-2010: Evaluations and results.
In Proceedings of the 11th Meeting of the ACL Special Interest
Group on Computational Morphology and Phonology, pages 87-95,
Uppsala, Sweden, July. Association for Computational Linguistics.
- Creutz, M., Lagus, K., Lindén, K. and Virpioja, S. (2005).
Morfessor
and Hutmegs: Unsupervised Morpheme Segmentation for Highly-Inflecting
and Compounding Languages. In Proceedings of the Second Baltic
Conference on Human Language Technologies, pages 107-112,
Tallinn, 4-5 April.
- Creutz, M. and Lindén, K. (2004).
Morpheme
Segmentation Gold Standards for Finnish and English. Publications
in Computer and Information Science, Report A77, Helsinki University
of Technology, October.
- Creutz, M., Hirsimäki, T., Kurimo, M., Puurula, A.,
Pylkkönen, J., Siivola, V., Varjokallio, M., Arisoy, E.,
Saraçlar, M., and Stolcke, A. (2007).
Morph-based speech recognition
and modeling of out-of-vocabulary words across languages. ACM
Transactions on Speech and Language Processing, Volume 5, Issue
1, Dec 2007.
- Hirsimäki, T., Creutz, M., Siivola, V., Kurimo,
M., Virpioja, S., and Pylkkönen, J. (2006).
Unlimited
Vocabulary Speech Recognition with Morph Language Models Applied to
Finnish. Computer Speech and Language, Volume 20, Issue
4, October 2006, pp. 515-541.
- Kurimo, M., Puurula, A., Arisoy, E., Siivola, V., Hirsimäki, T.,
Pylkkönen, J., Alumäe, T., and Saraçlar, M. (2006).
Unlimited
vocabulary speech recognition for agglutinative languages.
In Human Language Technology, Conference of the North American
Chapter of the Association for Computational Linguistics,
HLT-NAACL 2006. New York, USA, June 5-7.
- Hirsimäki, T., Creutz, M., Siivola, V., and Kurimo,
M. (2005). Morphologically
Motivated Language Models in Speech Recognition. In
Proceedings of the International and Interdisciplinary Conference on
Adaptive Knowledge Representation and Reasoning (AKRR'05), Espoo,
Finland, June 15-17.
- Siivola, V., Hirsimäki, T.,
Creutz, M., and Kurimo, M. (2003).
Unlimited
vocabulary speech recognition based on morphs discovered in an
unsupervised manner. In Proceedings of the 8th European
Conference on Speech Communication and Technology (Eurospeech),
pages 2293-2296, Geneva, Switzerland, 1-4 September.
- Grönroos, S.-A., Virpioja, S., and Kurimo, M. (2016).
Hybrid morphological segmentation for phrase-based machine translation.
In Proceedings of the First Conference on Machine Translation,
pages 289-295, Berlin, Germany, August 2016.
Association for Computational Linguistics.
- Grönroos, S.-A., Virpioja, S., and Kurimo, M. (2015).
Tuning phrase-based segmented translation for a morphologically complex target language.
In Proceedings of the Tenth Workshop on Statistical Machine Translation, pages 105-111, Lisbon, Portugal, September 2015. Association for Computational Linguistics.
- De Gispert, A., Virpioja, S., Kurimo, M., and Byrne, W. (2009).
Minimum
Bayes Risk Combination of Translation Hypotheses from Alternative
Morphological Decompositions. In Proceedings of Human Language
Technologies: The 2009 Annual Conference of the North American Chapter
of the Association for Computational Linguistics, Companion Volume:
Short Papers, pages 73-76, Boulder, CO, USA, June 2009.
- Virpioja, S., Väyrynen, J. J., Creutz, M., and Sadeniemi,
M. (2007).
Morphology-Aware
Statistical Machine Translation Based on Morphs Induced in an
Unsupervised Manner.
In Proceedings of Machine Translation Summit XI, Copenhagen,
Denmark, 10-14 September, 2007, pp. 491-498.