This page documents necessary steps to reproduce results of Chen & Manning (2014)[1] for English (including re-implementation) and makes explicit decisions that aren't covered in the paper.
- Obtain data: WSJ part of PENN Treebank. Section 02-21 for training, 22 for development, 23 for testing.
- I used this revised version: LDC2015T13
- Constituent-to-dependency conversion:
- LTH Constituent-to-Dependency Conversion Tool
- Downloaded pennconverter
- Command:
java -jar pennconverter.jar -format=conllx -rightBranching=false -verbosity 2 -stopOnError
- Stanford Basic Dependencies
- TODO
- LTH Constituent-to-Dependency Conversion Tool
- Assign POS tags using Stanford POS tagger with ten-way jackknifing of the training data
References
- ↑ Chen, D., & Manning, C. (2014). A Fast and Accurate Dependency Parser using Neural Networks. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) (pp. 740–750). Doha, Qatar: Association for Computational Linguistics.