Narrative cloze is a task proposed by Chambers and Jurafsky (2008)[1]. It is widely used to evaluate models of script knowledge (Pichotta & Mooney, 2016a[2]; Pichotta & Mooney, 2016b[3]; Jans et al., 2012[4]; Rudinger et al., 2015a[5]; Rudinger et al., 2015b[6]).

From Pichotta & Mooney (2016b): "The exact definition of the Narrative Cloze evaluation depends on the formulation of events used in a script system. For example, Chambers and Jurafsky (2008), Jans et al. (2012), and Rudinger et al. (2015) evaluate inference of held-out (verb, dependency) pairs from documents; Pichotta and Mooney (2014) evaluate inference of verbs with coreference information about multiple arguments; and Pichotta and Mooney (2016) evaluate inference of verbs with noun information about multiple arguments. In order to gather human judgments of inference quality, the latter also learn an encoder-decoder LSTM network for transforming verbs and noun arguments into English text to present to annotators for evaluation."
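The held-out (verb, dependency) formulation mentioned in the quote can be illustrated with a minimal sketch. It assumes each document has already been reduced to a chain of (verb, dependency) pairs, and it scores candidates by raw co-occurrence counts, which is a simplification chosen for brevity and not the scoring function of any of the cited papers. All function names (`train_cooccurrence`, `rank_candidates`, `narrative_cloze_recall_at_k`) and the choice of recall@k as the metric are illustrative assumptions.

```python
from collections import Counter
from itertools import combinations

# An event is a (verb, dependency) pair, e.g. ("arrest", "obj").
Event = tuple[str, str]


def train_cooccurrence(chains):
    """Count how often two distinct events occur in the same chain."""
    pair_counts = Counter()
    for chain in chains:
        # Sorting gives every unordered pair one canonical key.
        for a, b in combinations(sorted(set(chain)), 2):
            pair_counts[(a, b)] += 1
    return pair_counts


def score(candidate, context, pair_counts):
    """Sum the co-occurrence counts of a candidate with each context event."""
    return sum(pair_counts[tuple(sorted((candidate, e)))] for e in context)


def rank_candidates(context, vocabulary, pair_counts):
    """Rank events not already in the context, best first (ties alphabetical)."""
    candidates = [c for c in vocabulary if c not in context]
    return sorted(candidates, key=lambda c: (-score(c, context, pair_counts), c))


def narrative_cloze_recall_at_k(test_chains, vocabulary, pair_counts, k):
    """Hold out each event in turn and check whether it is ranked in the top k."""
    hits = total = 0
    for chain in test_chains:
        for i, held_out in enumerate(chain):
            context = chain[:i] + chain[i + 1:]
            ranked = rank_candidates(context, vocabulary, pair_counts)
            hits += held_out in ranked[:k]
            total += 1
    return hits / total if total else 0.0


# Toy data, invented for illustration only.
train_chains = [
    [("arrest", "obj"), ("charge", "obj"), ("convict", "obj")],
    [("arrest", "obj"), ("charge", "obj"), ("sentence", "obj")],
    [("eat", "subj"), ("pay", "subj")],
]
vocabulary = sorted({event for chain in train_chains for event in chain})
pair_counts = train_cooccurrence(train_chains)
```

Calling `narrative_cloze_recall_at_k(test_chains, vocabulary, pair_counts, k)` on a list of held-out chains then gives the fraction of removed events that the model recovers within its top `k` guesses. Richer event formulations, such as verbs with several arguments, would change the `Event` type and the scoring function but not the hold-one-out loop.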

