MoSTNER: Morphology-aware Split-Tag German NER with Factorie
- MoSTNER is a German NER system based on machine learning with log-linear models and morphology-aware features. We use morphological analysis with Morphisto to generate features. In addition, we use German Wikipedia as a gazetteer and perform punctuation-aware and morphology-aware page title matching. We use four types of factor graphs in which NER labels are either single variables or split into prefix (BILOU) and type (PER, LOC, etc.) variables. Our system supports nested NER (two levels). For training we use SampleRank, and for prediction we use Iterated Conditional Modes. The implementation is based on Python and Factorie.
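
The split-tag idea in the abstract can be illustrated with a minimal sketch. It only shows how a joint BILOU label such as `B-PER` decomposes into a prefix part and a type part; it is not the MoSTNER implementation and does not model the factor graphs. The function names, the example sentence, and the entity spans are hypothetical and chosen purely for illustration.

```python
from typing import List, Tuple

# A span is (start, end_exclusive, entity_type); all names here are illustrative.
Span = Tuple[int, int, str]


def encode_bilou(n_tokens: int, spans: List[Span]) -> List[str]:
    """Encode non-overlapping entity spans as joint BILOU labels."""
    labels = ["O"] * n_tokens
    for start, end, entity_type in spans:
        if end - start == 1:
            # Single-token entity: Unit
            labels[start] = f"U-{entity_type}"
        else:
            # Multi-token entity: Begin, Inside..., Last
            labels[start] = f"B-{entity_type}"
            for i in range(start + 1, end - 1):
                labels[i] = f"I-{entity_type}"
            labels[end - 1] = f"L-{entity_type}"
    return labels


def split_tag(label: str) -> Tuple[str, str]:
    """Split a joint label into (prefix, type); 'O' has an empty type."""
    if label == "O":
        return ("O", "")
    prefix, _, entity_type = label.partition("-")
    return (prefix, entity_type)


def join_tag(prefix: str, entity_type: str) -> str:
    """Inverse of split_tag."""
    return "O" if prefix == "O" else f"{prefix}-{entity_type}"


# Hypothetical example sentence with two entities.
tokens = ["Angela", "Merkel", "besucht", "Berlin"]
spans = [(0, 2, "PER"), (3, 4, "LOC")]

joint = encode_bilou(len(tokens), spans)
split = [split_tag(label) for label in joint]

for token, label, (prefix, entity_type) in zip(tokens, joint, split):
    print(f"{token:10s} {label:6s} prefix={prefix} type={entity_type or '-'}")
```

In a split representation, the prefix and the type become two separate variables per token, so a model can score them with separate factors instead of treating each prefix and type combination as one atomic label.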
Author: | Peter Schüller |
---|---|
URN: | https://nbn-resolving.org/urn:nbn:de:gbv:hil2-opus-3030 |
Parent Title (German): | Workshop proceedings of the 12th edition of the KONVENS conference |
Document Type: | Conference Proceeding |
Language: | English |
Date of Publication (online): | 2014/11/25 |
Release Date: | 2014/11/25 |
Tag: | NER; Named entity recognition |
GND Keyword: | Computational linguistics |
First Page: | 121 |
Last Page: | 124 |
PPN: | Link to catalogue |
Institutes: | Fachbereich III / Informationswissenschaft und Sprachtechnologie |
DDC classes: | 400 Language / 400 Language, Linguistics |
Collections: | KONVENS 2014 / Workshop Proceedings of the 12th KONVENS 2014 |