Abstract
Discovering frequent structures within large natural language corpora is one of the core
problems of corpus linguistics, but it is difficult to do for richly structured data. This paper
describes a practical algorithm to extract frequent structures from treebanks or annotated
corpora that can be represented as a tree structures. It extracts the
... read more