- Abstract:
Reading comprehension tests are receiving increased attention within the NLP community as a controlled test-bed for developing, evaluating and comparing robust natural language question answering (NLQA) methods.
To support this, we have enriched the MITRE CBC4Kids corpus with multiple XML annotation layers recording the output of various tokenizers, lemmatizers, a stemmer, a semantic tagger, POS taggers and syntactic parsers. To demonstrate its use, we have built a baseline NLQA system for word-overlap based answer retrieval, NLQA evaluation and corpus browsing.
- Copyright:
- 2004 by The University of Edinburgh. All Rights Reserved
- Bibtex format
- @Misc{EDI-INF-RR-0204,
  author = {Tiphaine Dalmas and Jochen Leidner and Bonnie Webber and Claire Grover and Johan Bos},
  title = {Annotating CBC4Kids: A Corpus for Reading Comprehension and Question Answering Evaluation},
  year = 2004,
  month = {Mar},
}