Difference between revisions of "Frequently Asked Questions"

Revision as of 12:33, 9 March 2010

What sections are used for training, development, and testing on the Penn Treebank?

This information comes courtesy of Nianwen Bert Xue:

 The standard development set is Section 01 and the standard test set is Section 23. Most people don't use Sections 24 and 25. Mitch's explanation, which I think is a plausible one, is that Section 23 is more "mature" annotation, since it was done after the annotators had been well trained, versus Section 00, where the annotators had just started learning to annotate.

Training: sections 02-22
Testing: section 23
Development: section 01
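
The split listed above can be written down as a small lookup. The following is a minimal sketch, not part of the original answer: the names `split_for_section` and `split_for_path` are hypothetical, and the path handling assumes that each file sits in a directory named after its two-digit section (for example `wsj/02/wsj_0201.mrg`), which may not match your local copy of the corpus.

```python
from pathlib import PurePosixPath
from typing import Optional

# Section-to-split mapping exactly as listed in the answer above.
TRAIN_SECTIONS = range(2, 23)  # sections 02-22, inclusive
DEV_SECTIONS = (1,)            # section 01
TEST_SECTIONS = (23,)          # section 23


def split_for_section(section: int) -> Optional[str]:
    """Return 'train', 'dev', or 'test' for a section number, else None."""
    if section in TRAIN_SECTIONS:
        return "train"
    if section in DEV_SECTIONS:
        return "dev"
    if section in TEST_SECTIONS:
        return "test"
    # Any other section (e.g. 00 or 24) is not part of this split.
    return None


def split_for_path(path: str) -> Optional[str]:
    """Map a file path to its split.

    Assumption (hypothetical layout): the parent directory name is the
    two-digit section number, as in 'wsj/02/wsj_0201.mrg'.
    """
    section_dir = PurePosixPath(path).parent.name
    if not section_dir.isdigit():
        return None
    return split_for_section(int(section_dir))
```

For example, `split_for_path("wsj/23/wsj_2300.mrg")` falls into the test split under the assumed layout, while a file from section 00 is assigned to no split at all.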

