Constructing linguistically motivated structures from statistical grammars

Publikation: Bidrag til tidsskrift › Konferenceartikel › Forskning › fagfællebedømt

Basirat, Ali
Heshaam Faili

This paper discusses two Hidden Markov Models (HMM) for linking linguistically motivated XTAG grammar and the automatically extracted LTAG used by MICA parser. The former grammar is a detailed LTAG enriched with feature structures. And the latter one is a huge size LTAG that due to its statistical nature is well suited to be used in statistical approaches. Lack of an efficient parser and sparseness in the supertags set are the main obstacles in using XTAG and MICA grammars respectively. The models were trained by the standard HMM training algorithm, Baum-Welch. To converge the training algorithm to a better local optimum, the initial state of the models also were estimated using two semi-supervised EM-based algorithms. The resulting accuracy of the model (about 91%) shows that the models can provide a satisfactory way for linking these grammars to share their capabilities together.

Originalsprog	Engelsk
Tidsskrift	International Conference Recent Advances in Natural Language Processing, RANLP
Sider (fra-til)	63-69
Antal sider	7
ISSN	1313-8502
Status	Udgivet - 2011
Begivenhed	8th International Conference on Recent Advances in Natural Language Processing, RANLP 2011 - Hissar, Bulgarien Varighed: 12 sep. 2011 → 14 sep. 2011

Konference

Konference	8th International Conference on Recent Advances in Natural Language Processing, RANLP 2011
Land	Bulgarien
By	Hissar
Periode	12/09/2011 → 14/09/2011

ID: 366047680