Constructing linguistically motivated structures from statistical grammars

Publikation: Bidrag til tidsskriftKonferenceartikelForskningfagfællebedømt

This paper discusses two Hidden Markov Models (HMM) for linking linguistically motivated XTAG grammar and the automatically extracted LTAG used by MICA parser. The former grammar is a detailed LTAG enriched with feature structures. And the latter one is a huge size LTAG that due to its statistical nature is well suited to be used in statistical approaches. Lack of an efficient parser and sparseness in the supertags set are the main obstacles in using XTAG and MICA grammars respectively. The models were trained by the standard HMM training algorithm, Baum-Welch. To converge the training algorithm to a better local optimum, the initial state of the models also were estimated using two semi-supervised EM-based algorithms. The resulting accuracy of the model (about 91%) shows that the models can provide a satisfactory way for linking these grammars to share their capabilities together.

OriginalsprogEngelsk
TidsskriftInternational Conference Recent Advances in Natural Language Processing, RANLP
Sider (fra-til)63-69
Antal sider7
ISSN1313-8502
StatusUdgivet - 2011
Begivenhed8th International Conference on Recent Advances in Natural Language Processing, RANLP 2011 - Hissar, Bulgarien
Varighed: 12 sep. 201114 sep. 2011

Konference

Konference8th International Conference on Recent Advances in Natural Language Processing, RANLP 2011
LandBulgarien
ByHissar
Periode12/09/201114/09/2011

ID: 366047680