Abstract
We use integration and combination of taggers to improve the tagging accuracy of Icelandic text. The best-performing integrated tagger, which uses our linguistic rule-based tagger for initial disambiguation and a trigram tagger for full disambiguation, achieves an accuracy of 91.80%. Combining five different taggers by simple voting yields 93.34% accuracy. Adding two linguistically motivated rules to the combined tagger raises the accuracy to 93.48%. This method reduces the error rate by 20.5% relative to the best-performing tagger in the combination pool.
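As an illustration of the two quantitative ideas in the abstract, the following is a minimal sketch of simple (majority) voting over tagger outputs and of the relative error-rate reduction. It is not the authors' implementation. The function names, the placeholder tags, and the tie-breaking rule (prefer the tag proposed by the earliest tagger in the list, on the assumption that taggers are ordered from most to least accurate) are assumptions made for this example. The reduction is computed against a 91.80% baseline, which is consistent with the reported 20.5%.

```python
from collections import Counter


def simple_vote(proposals):
    """Return the tag proposed by the most taggers for one token.

    `proposals` holds one tag per tagger. Assumption for this sketch:
    taggers are ordered from most to least accurate, and a tie is
    resolved in favour of the earliest tagger in that order.
    """
    counts = Counter(proposals)
    best = max(counts.values())
    for tag in proposals:
        if counts[tag] == best:
            return tag


def combine(tagger_outputs):
    """Combine per-tagger tag sequences token by token.

    `tagger_outputs` is a list of equal-length tag sequences,
    one sequence per tagger.
    """
    return [simple_vote(list(tags)) for tags in zip(*tagger_outputs)]


def error_rate_reduction(baseline_acc, new_acc):
    """Relative error-rate reduction in percent, given accuracies in percent."""
    baseline_err = 100.0 - baseline_acc
    new_err = 100.0 - new_acc
    return 100.0 * (baseline_err - new_err) / baseline_err


if __name__ == "__main__":
    # Hypothetical outputs of three taggers for a two-token sentence.
    outputs = [["N", "V"], ["N", "A"], ["V", "V"]]
    print(combine(outputs))
    # Accuracies taken from the abstract: 91.80% baseline, 93.48% combined.
    print(round(error_rate_reduction(91.80, 93.48), 1))
```

The error rate falls from 8.20% to 6.52%, and (8.20 - 6.52) / 8.20 is approximately 0.205, which matches the reduction stated in the abstract.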
Field | Value |
---|---|
Original language | English |
Pages (from-to) | 175-181 |
Number of pages | 7 |
Journal | Language Resources and Evaluation |
Volume | 40 |
Issue number | 2 |
Publication status | Published - May 2006 |
Other keywords
- Combination of taggers
- Integration of taggers
- Linguistically motivated rules
- Simple voting
- Tagging accuracy