What I think when I think about treebanks
Publikation: Bidrag til bog/antologi/rapport › Konferencebidrag i proceedings › Forskning › fagfællebedømt
Standard
What I think when I think about treebanks. / Søgaard, Anders.
Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories (TLT16),. Association for Computational Linguistics, 2018. s. 161-166.Publikation: Bidrag til bog/antologi/rapport › Konferencebidrag i proceedings › Forskning › fagfællebedømt
Harvard
APA
Vancouver
Author
Bibtex
}
RIS
TY - GEN
T1 - What I think when I think about treebanks
AU - Søgaard, Anders
PY - 2018
Y1 - 2018
N2 - In this opinion piece, I present four somewhat controversial suggestions for the design of futuretreebanks: a) Treebanks should be based on adversarial samples, rather than pseudorepresentativesamples. b) Treebanks should include multiple splits of the data, rather than justa single split, as in most treebanks today. c) They should include multiple annotations of eachsentence, whenever possible, instead of adjudicated annotations. d) There is no real motivationfor adhering to a notion of well-formedness, since we now have parsers based on deep learningthat generalize easily and perform well on any type of graphs, and treebanks therefore do not haveto limit themselves to trees or directed acyclic graphs.
AB - In this opinion piece, I present four somewhat controversial suggestions for the design of futuretreebanks: a) Treebanks should be based on adversarial samples, rather than pseudorepresentativesamples. b) Treebanks should include multiple splits of the data, rather than justa single split, as in most treebanks today. c) They should include multiple annotations of eachsentence, whenever possible, instead of adjudicated annotations. d) There is no real motivationfor adhering to a notion of well-formedness, since we now have parsers based on deep learningthat generalize easily and perform well on any type of graphs, and treebanks therefore do not haveto limit themselves to trees or directed acyclic graphs.
M3 - Article in proceedings
SP - 161
EP - 166
BT - Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories (TLT16),
PB - Association for Computational Linguistics
Y2 - 23 January 2018 through 24 January 2018
ER -
ID: 214752172