Frequent Structure Discovery in Treebanks : An efficient, practical, actually usable approach

Martens, Scott

Frequent Structure Discovery in Treebanks : An efficient, practical, actually usable approach

DSpace/Manakin Repository

Frequent Structure Discovery in Treebanks : An efficient, practical, actually usable approach

Martens, Scott

(2009) LOT Occasional Series, volume 14, pp. 99 - 114

(Part of book or chapter of book)

Abstract

Discovering frequent structures within large natural language corpora is one of the core problems of corpus linguistics, but it is difficult to do for richly structured data. This paper describes a practical algorithm to extract frequent structures from treebanks or annotated corpora that can be represented as a tree structures. It extracts the ... read more

Download/Full Text

Open Access version via Utrecht University Repository

ISSN: 1572-199X

Publisher: LOT, Netherlands Graduate School of Linguistics

See more statistics about this item