Tagging Icelandic text: An experiment with integrations and combinations of taggers

Hrafn Loftsson*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

7 Citations (Scopus)

Abstract

We use integrations and combinations of taggers to improve the tagging accuracy of Icelandic text. The accuracy of the best performing integrated tagger, which consists of our linguistic rule-based tagger for initial disambiguation and a trigram tagger for full disambiguation, is 91.80%. Combining five different taggers, using simple voting, results in 93.34% accuracy. By adding two linguistically motivated rules to the combined tagger, we obtain an accuracy of 93.48%. This method reduces the error rate by 20.5%, with respect to the best performing tagger in the combination pool.

Original languageEnglish
Pages (from-to)175-181
Number of pages7
JournalLanguage Resources and Evaluation
Volume40
Issue number2
DOIs
Publication statusPublished - May 2006

Other keywords

  • Combination of taggers
  • Integration of taggers
  • Linguistically motivated rules
  • Simple voting
  • Tagging accuracy

Fingerprint

Dive into the research topics of 'Tagging Icelandic text: An experiment with integrations and combinations of taggers'. Together they form a unique fingerprint.

Cite this