pith. sign in

arxiv: 1908.11443 · v2 · submitted 2019-08-29 · 💻 cs.CL

NarrativeTime: Dense Temporal Annotation on a Timeline

Pith reviewed 2026-05-24 15:44 UTC · model grok-4.3

classification 💻 cs.CL
keywords temporal annotationtimelineTLinksdense annotationTimeBankevent relationsnarrative structureNLP
0
0 comments X

The pith

NarrativeTime is a timeline-based method that annotates every possible temporal link between events in a text.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

For the past decade temporal annotation in text has labeled only a fraction of event pairs. NarrativeTime places events on a timeline so that every pair receives a temporal relation label, producing full coverage instead of sparse selections. Re-annotating the TimeBankDense corpus with this approach yields comparable inter-annotator agreement but far higher density of labels. The work releases the resulting TimeBankNT corpus, annotation guidelines, conversion tools, and baseline results.

Core claim

The paper presents NarrativeTime as the first timeline-based annotation framework that records temporal relations for every event pair in a narrative, thereby achieving complete TLink coverage where earlier methods annotated only a small portion of pairs. Full re-annotation of TimeBankDense shows agreement levels similar to prior dense efforts while delivering a significant increase in the number of annotated relations, and the authors contribute the resulting TimeBankNT corpus annotated by two experts per text along with supporting guidelines and tools.

What carries the argument

A timeline that positions every event chronologically so annotators can determine and record a temporal relation for each pair.

If this is right

  • Produces training data with complete temporal coverage for models that reason about event order.
  • Enables quantitative comparison of annotation density across different corpora and methods.
  • Supplies open tools that convert the timeline annotations into standard TimeML format.
  • Supports qualitative study of how annotators handle ambiguous or overlapping temporal relations.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Dense timeline data could expose recurring patterns in narrative time that sparse annotations have hidden.
  • Models trained on full-coverage labels might improve performance on downstream tasks such as timeline summarization or temporal question answering.
  • The method could be tested on longer documents to check whether cognitive load remains manageable at scale.

Load-bearing premise

Annotators can reliably determine and record temporal relations for every event pair on a timeline without introducing systematic inconsistencies or excessive cognitive load.

What would settle it

A fresh annotation pass on the same texts that yields either no density increase or markedly lower agreement scores would falsify the claim of reliable full coverage.

Figures

Figures reproduced from arXiv: 1908.11443 by Ankita Gupta, Anna Rogers, Anna Rumshisky, Gregory Smelkov, Marzena Karpinska, Vladislav Lialin.

Figure 1
Figure 1. Figure 1: Bounded events Creating a [B] type event requires indicating its position on the timeline with respect to other events, which is input simply as a number starting at 1. It is possible for a [B] event to span other [B] events, as in case of e4 in [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Unbounded events For example, consider the sentence Mary went 2We are hoping that the linguist reader will excuse our re￾defining the term “boundedness”, as it is an established term in verb aspect literature. to the coffee shop and found John there. He was working. She left.. The event of John’s working (e4) started at an underspecified point, possibly be￾fore Mary’s even deciding to go to the coffee shop… view at source ↗
Figure 4
Figure 4. Figure 4: Coarse temporal relations in NarrativeTime [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: NarrativeTime represenation of Two travellers fable (excerpt) wise annotation of TLinks with adjacent events would require 34 TLinks. NarrativeTime com￾bination of clusters of roughly-simultaneous and consecutive events constructs a full temporal rep￾resentation of all TLinks between 18 events in only 11 annotations, with 1-2 actions per annotation. The last two events are kept separate to enable markup of… view at source ↗
Figure 6
Figure 6. Figure 6: Branching timelines in NarrativeTime 4.6 Temporal expressions NarrativeTime follows previous work (Puste￾jovsky et al., 2005) in defining temporal expres￾sions, we make no contribution in this area, and pre-mark temporal expressions before the timeline annotation. What NarrativeTime does improve is [PITH_FULL_IMAGE:figures/full_fig_p006_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: NarrativeTime annotation interface their linking with events: annotators only need to include any temporal expressions in the span of the event clusters which they anchor, so the spans function as temporal containers (Pustejovsky and Stubbs, 2011). For example, if [John met Mary on Monday] is chosen as event span, then the meeting event would be linked to Monday. This approach echoes treating temporal expr… view at source ↗
read the original abstract

For the past decade, temporal annotation has been sparse: only a small portion of event pairs in a text was annotated. We present NarrativeTime, the first timeline-based annotation framework that achieves full coverage of all possible TLinks. To compare with the previous SOTA in dense temporal annotation, we perform full re-annotation of TimeBankDense corpus, which shows comparable agreement with a significant increase in density. We contribute TimeBankNT corpus (with each text fully annotated by two expert annotators), extensive annotation guidelines, open-source tools for annotation and conversion to TimeML format, baseline results, as well as quantitative and qualitative analysis of inter-annotator agreement.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 2 minor

Summary. The paper introduces NarrativeTime, a timeline-based annotation framework claimed to be the first to achieve full coverage of all possible TLinks between events in a text. It demonstrates the approach via full re-annotation of the TimeBankDense corpus, reporting comparable inter-annotator agreement alongside a significant increase in annotation density. Contributions include the resulting TimeBankNT corpus (double-annotated by experts), extensive guidelines, open-source annotation and TimeML conversion tools, baseline results, and quantitative/qualitative IAA analysis.

Significance. If the central claims hold, the work is significant for temporal information extraction: it supplies denser, fully-covered annotation data and practical tools that address the sparsity limitation of prior schemes like TimeBank. Explicit strengths include the release of a new double-annotated corpus, open-source tools, and baseline results, all of which lower barriers for follow-on modeling work. The re-annotation experiment provides direct evidence of feasibility via maintained IAA and density gains.

major comments (1)
  1. [Evaluation / re-annotation experiment] Evaluation / re-annotation experiment: IAA is reported only at the pairwise level (with comparable agreement to prior dense annotation) and density is shown to increase, but no quantitative check for global consistency—such as transitivity violations, before/after cycle detection, or other timeline-level inconsistencies—is presented. This is load-bearing for the claim of reliable full TLink coverage, because pairwise agreement alone does not rule out systematic shortcuts or inconsistencies that could arise specifically from forcing annotations on every event pair.
minor comments (2)
  1. [Abstract / Introduction] The abstract and introduction could more explicitly define 'full coverage' (e.g., whether it includes all event pairs or only those on the constructed timeline) to avoid ambiguity for readers.
  2. [Figures and tables] Figure captions and table headers should consistently use the same terminology for TLinks and timelines as the main text to improve readability.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback and positive assessment of the work's significance. We address the single major comment below.

read point-by-point responses
  1. Referee: [Evaluation / re-annotation experiment] Evaluation / re-annotation experiment: IAA is reported only at the pairwise level (with comparable agreement to prior dense annotation) and density is shown to increase, but no quantitative check for global consistency—such as transitivity violations, before/after cycle detection, or other timeline-level inconsistencies—is presented. This is load-bearing for the claim of reliable full TLink coverage, because pairwise agreement alone does not rule out systematic shortcuts or inconsistencies that could arise specifically from forcing annotations on every event pair.

    Authors: We agree that pairwise IAA alone does not fully rule out potential global inconsistencies arising from the full-coverage requirement. The current manuscript follows the standard evaluation protocol from prior dense annotation work (pairwise agreement plus density), but does not include explicit timeline-level metrics such as transitivity violation counts or cycle detection. We will add a quantitative global consistency analysis to the revised evaluation section, reporting these metrics on the TimeBankNT annotations and comparing them to the original TimeBankDense annotations where possible. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical annotation contribution is self-contained

full rationale

The paper introduces a timeline-based annotation framework and re-annotates an existing corpus to demonstrate full TLink coverage with comparable IAA and higher density. No equations, fitted parameters, predictions, or derivations are present that could reduce to inputs by construction. Self-citations to prior temporal annotation schemes (e.g., TimeML, TimeBank) are standard background and not load-bearing for the novelty claim, which rests on new guidelines, tools, and the re-annotation results themselves. The framework is externally falsifiable via the contributed corpus and IAA metrics.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

No free parameters or invented entities are introduced. The work relies on standard annotation practices and the existing TimeBank corpus.

axioms (1)
  • domain assumption Standard inter-annotator agreement metrics (e.g., kappa or similar) are appropriate for comparing annotation quality across sparse and dense regimes.
    Invoked when claiming comparable agreement with prior SOTA despite the density increase.

pith-pipeline@v0.9.0 · 5647 in / 1137 out tokens · 19208 ms · 2026-05-24T15:44:42.363500+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

62 extracted references · 62 canonical work pages · 1 internal anchor

  1. [1]

    https://tac.nist.gov/2016/KBP/guidelines/summary_rich_ere_v4.2.pdf Rich ERE Annotation Guidelines Overview V4 .2

    2016. https://tac.nist.gov/2016/KBP/guidelines/summary_rich_ere_v4.2.pdf Rich ERE Annotation Guidelines Overview V4 .2 . Technical report, Linguistic Data Consortium

  2. [2]

    James F Allen. 1984. Towards a general theory of action and time. Artificial intelligence, 23(2):123--154

  3. [3]

    Jun Araki, Lamana Mulaffer, Arun Pandian, Yukari Yamakawa, Kemal Oflazer, and Teruko Mitamura. 2018. http://aclweb.org/anthology/W18-4702 Interoperable Annotation of Events and Event Relations across Domains . In Proceedings 14th Joint ACL - ISO Workshop on Interoperable Semantic Annotation , pages 10--20. Association for Computational Linguistics

  4. [4]

    Bethard, J

    S. Bethard, J. H. Martin, and S. Klingenstein. 2007. https://doi.org/10.1109/ICSC.2007.77 Timelines from Text : Identification of Syntactic Temporal Relations . In International Conference on Semantic Computing ( ICSC 2007) , pages 11--18

  5. [5]

    Steven Bethard, Oleksandr Kolomiyets, and Marie-Francine Moens. 2012. Annotating Story Timelines as Temporal Dependency Structures . In Language Resources and Evaluation Conference , pages 2721--2726

  6. [6]

    Philip Bramsen, Pawan Deshpande, Yoong Keok Lee, and Regina Barzilay. 2006. http://dl.acm.org/citation.cfm?id=1610075.1610105 Inducing Temporal Graphs . In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing , EMNLP '06, pages 189--198, Stroudsburg, PA, USA. Association for Computational Linguistics

  7. [7]

    Tommaso Caselli and Piek Vossen. 2016. https://doi.org/10.18653/v1/W16-5708 The Storyline Annotation and Representation Scheme ( StaR ): A Proposal . In Proceedings of the 2nd Workshop on Computing News Storylines ( CNS 2016) , pages 67--72. Association for Computational Linguistics

  8. [8]

    Tommaso Caselli and Piek Vossen. 2017. https://doi.org/10.18653/v1/W17-2711 The Event StoryLine Corpus : A New Benchmark for Causal and Temporal Relation Extraction . In Proceedings of the Events and Stories in the News Workshop , pages 77--86. Association for Computational Linguistics

  9. [10]

    Taylor Cassidy, Bill McDowell, Nathanael Chambers, and Steven Bethard. 2014 b . https://doi.org/10.3115/v1/P14-2082 An Annotation Framework for Dense Event Ordering . In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics ( Volume 2: Short Papers ) , pages 501--506. Association for Computational Linguistics

  10. [11]

    Susan E. F. Chipman, Martha Palmer, Claire Bonial, and Jena Hwang. 2017. http://www.oxfordhandbooks.com/view/10.1093/oxfordhb/9780199842193.001.0001/oxfordhb-9780199842193-e-15 VerbNet : Capturing English verb behavior, meaning, and usage . In Susan E. F. Chipman, editor, The Oxford Handbook of Cognitive Science . Oxford University Press

  11. [12]

    Kiel Christianson. 2016. https://doi.org/10.1080/17470218.2015.1134603 When language comprehension goes wrong for the right reasons: Good -enough, underspecified, or shallow language processing . The Quarterly Journal of Experimental Psychology, 69(5):817--828

  12. [13]

    Berry Claus. 2012. Processing Narrative Texts : Melting Frozen Time ? Constraints in Discourse 3: Representing and Inferring Discourse Structure, 223:17

  13. [14]

    Marta Coll-Florit and Silvia P. Gennari. 2011. https://doi.org/10.1016/j.cogpsych.2010.09.002 Time in language: Event duration in language comprehension . Cognitive Psychology, 62(1):41--79

  14. [15]

    William Croft. 2012. Verbs: Aspect and Causal Structure. Oxford Linguistics. Oxford University Press , Oxford [England]; New York

  15. [16]

    Agata Cybulska and Piek Vossen. 2014. http://www.lrec-conf.org/proceedings/lrec2014/pdf/840_Paper.pdf Using a sledgehammer to crack a nut? Lexical diversity and event coreference resolution . In Proceedings of the Ninth International Conference on Language Resources and Evaluation ( LREC -2014) . European Language Resources Association (ELRA)

  16. [17]

    Agata Cybulska and Piek Vossen. 2015. http://www.newsreader-project.eu/files/2013/01/NWR-2014-1.pdf Guidelines for ECB + Annotation of Events and their Coreference . Technical Report NWR-2014-1, VU Amsterdam

  17. [18]

    Quang Xuan Do, Wei Lu, and Dan Roth. 2012. http://dl.acm.org/citation.cfm?id=2390948.2391023 Joint Inference for Event Timeline Construction . In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning , EMNLP - CoNLL '12, pages 677--687, Stroudsburg, PA, USA. Association for...

  18. [19]

    David R. Dowty. 1986. http://www.springerlink.com/index/t53r44x56137104m.pdf The effects of aspectual class on the temporal structure of discourse: Semantics or pragmatics? Linguistics and philosophy, 9(1):37--61

  19. [20]

    Christine Farag, Vanessa Troiani, Michael Bonner, Chivon Powers, Brian Avants, James Gee, and Murray Grossman. 2010. https://doi.org/10.1093/cercor/bhp313 Hierarchical Organization of Scripts : Converging Evidence from fMRI and Frontotemporal Degeneration . Cerebral Cortex, 20(10):2453--2463

  20. [21]

    Fernanda Ferreira, Paul E Engelhardt, Paul Engelhardt, and Manon W Jones. 2009. https://escholarship.org/uc/item/5vr5976s Good Enough Language Processing : A Satisficing Approach . Proceedings of the Annual Meeting of the Cognitive Science Society, 31:413--418

  21. [22]

    Frase and Barry J

    Lawrence T. Frase and Barry J. Schwartz. 1979. https://doi.org/10.1037/0022-0663.71.2.197 Typographical cues that facilitate comprehension. Journal of Educational Psychology, 71(2):197--206

  22. [23]

    Rei Ikuta, Will Styler, Mariah Hamang, Tim O'Gorman, and Martha Palmer. 2014. Challenges of adding causation to richer event descriptions. In Proceedings of the Second Workshop on EVENTS : Definition , Detection , Coreference , and Representation , pages 12--20

  23. [24]

    Oleksandr Kolomiyets, Steven Bethard, and Marie-Francine Moens. 2012. http://aclweb.org/anthology/P12-1010 Extracting Narrative Timelines as Temporal Dependency Structures . In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics ( Volume 1: Long Papers ) , pages 88--97. Association for Computational Linguistics

  24. [25]

    Beth Levin. 1993. English Verb Classes and Alternations: A Preliminary Investigation. University of Chicago Press , Chicago

  25. [26]

    Anne-Lyse Minard, Manuela Speranza, Ruben Urizar, Marieke van Erp , Anneleen Schoen, and Chantal van Son . 2016. MEANTIME , the NewsReader Multilingual Event and Time Corpus . In Language Resources and Evaluation Conference , pages 4417--4422

  26. [27]

    Misyak, Morten H

    Jennifer B. Misyak, Morten H. Christiansen, and J. Bruce Tomblin. 2010. https://doi.org/10.3389/fpsyg.2010.00031 On- Line Individual Differences in Statistical Learning Predict Language Processing . Frontiers in Psychology, 1

  27. [28]

    Marc Moens and Mark Steedman. 1988. http://dl.acm.org/citation.cfm?id=55058 Temporal ontology and temporal reference . Computational linguistics, 14(2):15--28

  28. [29]

    Nasrin Mostafazadeh, Alyson Grealish, Nathanael Chambers, James Allen, and Lucy Vanderwende. 2016. https://doi.org/10.18653/v1/W16-1007 CaTeRS : Causal and Temporal Relation Scheme for Semantic Annotation of Event Structures . In Proceedings of the Fourth Workshop on Events , pages 51--61. Association for Computational Linguistics

  29. [30]

    Qiang Ning, Hao Wu, and Dan Roth. 2018. http://aclweb.org/anthology/P18-1122 A Multi - Axis Annotation Scheme for Event Temporal Relations . In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics ( Volume 1: Long Papers ) , pages 1318--1328, Melbourne, Australia. Association for Computational Linguistics

  30. [31]

    Tim O'Gorman. 2017. https://github.com/timjogorman/RicherEventDescription RicherEventDescription : A living document with the guidelines for RED annotation, and any scripts useful for working with RED documents

  31. [32]

    Tim O'Gorman, Kristin Wright-Bettner , and Martha Palmer. 2016. Richer Event Description : Integrating event coreference with temporal, causal and bridging annotation. In Proceedings of the 2nd Workshop on Computing News Storylines ( CNS 2016) , pages 47--56

  32. [33]

    O'Shea and Paul T

    Lawrence J. O'Shea and Paul T. Sindelar. 1983. https://doi.org/10.2307/747380 The Effects of Segmenting Written Discourse on the Reading Comprehension of Low - and High - Performance Readers . Reading Research Quarterly, 18(4):458--465

  33. [34]

    James Pustejovsky. 1991. http://www.sciencedirect.com/science/article/pii/001002779190032Y The syntax of event structure . Cognition, 41(1):47--81

  34. [35]

    Casta\ no, Robert Ingria, Roser Saur\'i, Robert J

    James Pustejovsky, Jos\'e M. Casta\ no, Robert Ingria, Roser Saur\'i, Robert J. Gaizauskas, Andrea Setzer, Graham Katz, and Dragomir R. Radev. 2003 a . http://www.timeml.org/publications/timeMLpubs/IWCS-v4.pdf TimeML : Robust Specification of Event and Temporal Expressions in Text . In New Directions in Question Answering

  35. [36]

    James Pustejovsky, Patrick Hanks, Roser Saur\'i, Andrew See, Robert Gaizauskas, Andrea Setzer, Dragomir Radev, Beth Sundheim, David Day, Lisa Ferro, and Marcia Lazo. 2003 b . The TIMEBANK Corpus . In Proceedings of Corpus Linguistics , pages 647--656

  36. [37]

    James Pustejovsky, Bob Ingria, Roser Sauri, Jose Castano, Jessica Littman, Rob Gaizauskas, Andrea Setzer, Graham Katz, and Inderjeet Mani. 2005. http://www.academia.edu/download/31025523/PusIngSau04.pdf The specification language TimeML . The language of time: A reader, pages 545--557

  37. [38]

    James Pustejovsky, Kiyong Lee, Harry Bunt, and Laurent Romary. 2010. http://www.lrec-conf.org/proceedings/lrec2010/pdf/55_Paper.pdf ISO - TimeML : An International Standard for Semantic Annotation . In Proceedings of the Seventh Conference on International Language Resources and Evaluation ( LREC '10) . European Languages Resources Association (ELRA)

  38. [39]

    James Pustejovsky and Amber Stubbs. 2011. http://aclweb.org/anthology/W11-0419 Increasing Informativeness in Temporal Annotation . In Proceedings of the 5th Linguistic Annotation Workshop , pages 152--160. Association for Computational Linguistics

  39. [40]

    Quinn, Richard K

    Jamie M. Quinn, Richard K. Wagner, Yaacov Petscher, and Danielle Lopez. 2015. https://doi.org/10.1111/cdev.12292 Developmental Relations Between Vocabulary Knowledge and Reading Comprehension : A Latent Change Score Modeling Study . Child Development, 86(1):159--175

  40. [41]

    Rajendran, Andrew T

    Dhevi J. Rajendran, Andrew T. Duchowski, Pilar Orero, Juan Mart\'inez, and Pablo Romero-Fresco . 2013. https://doi.org/10.1080/0907676X.2012.722651 Effects of text chunking on subtitling: A quantitative and qualitative examination . Perspectives, 21(1):5--21

  41. [42]

    Keith Rayner and Erik D. Reichle. 2010. https://doi.org/10.1002/wcs.68 Models of the Reading Process . Wiley interdisciplinary reviews. Cognitive science, 1(6):787--799

  42. [43]

    Nils Reimers, Nazanin Dehghani, and Iryna Gurevych. 2016. https://doi.org/10.18653/v1/P16-1207 Temporal Anchoring of Events for the TimeBank Corpus . In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics ( Volume 1: Long Papers ) , pages 2195--2204. Association for Computational Linguistics

  43. [44]

    Schotter, Randy Tran, and Keith Rayner

    Elizabeth R. Schotter, Randy Tran, and Keith Rayner. 2014. https://doi.org/10.1177/0956797614531148 Don't Believe What You Read ( Only Once ): Comprehension Is Supported by Regressions During Reading . Psychological Science, 25(6):1218--1226

  44. [45]

    Oakhill, and Nicola M

    Alix Seigneuric, Marie-France Ehrlich, Jane V. Oakhill, and Nicola M. Yuill. 2000. https://doi.org/10.1023/A:1008088230941 Working memory resources and children's reading comprehension . Reading and Writing, 13(1):81--103

  45. [46]

    Shebilske and L

    Wayne L. Shebilske and L. Starling Reid. 1979. https://doi.org/10.1007/978-1-4684-0994-9_7 Reading Eye Movements , Macro -structure and Comprehension Processes . In Paul A. Kolers, Merald E. Wrolstad, and Herman Bouma, editors, Processing of Visible Language , Nato Conference Series , pages 97--110. Springer US , Boston, MA

  46. [47]

    Carlota S. Smith. 1997. The Parameter of Aspect, 2. ed edition. Number 43 in Studies in Linguistics and Philosophy. Kluwer Academic Publ , Dordrecht

  47. [48]

    Zhiyi Song, Ann Bies, Stephanie Strassel, Joe Ellis, Teruko Mitamura, Hoa Trang Dang, Yukari Yamakawa, and Sue Holm. 2016. https://doi.org/10.18653/v1/W16-1005 Event Nugget and Event Coreference Annotation . In Proceedings of the Fourth Workshop on Events , pages 37--45. Association for Computational Linguistics

  48. [49]

    Zhiyi Song, Ann Bies, Stephanie Strassel, Tom Riese, Justin Mott, Joe Ellis, Jonathan Wright, Seth Kulick, Neville Ryant, and Xiaoyi Ma. 2015. https://doi.org/10.3115/v1/W15-0812 From Light to Rich ERE : Annotation of Entities , Relations , and Events . In Proceedings of the 3rd Workshop on EVENTS at the NAACL-HLT 2015, pages 89--98. Association for Compu...

  49. [50]

    Styler, Steven Bethard, Sean Finan, Martha Palmer, Sameer Pradhan, Piet C de Groen , Brad Erickson, Timothy Miller, Chen Lin, Guergana Savova, and James Pustejovsky

    William F. Styler, Steven Bethard, Sean Finan, Martha Palmer, Sameer Pradhan, Piet C de Groen , Brad Erickson, Timothy Miller, Chen Lin, Guergana Savova, and James Pustejovsky. 2014. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5657277/ Temporal Annotation in the Clinical Domain . Transactions of the Association for Computational Linguistics, 2:143--154

  50. [51]

    Sara Tonelli, Rachele Sprugnoli, Manuela Speranza, and Anne-Lyse Minard. 2014. http://www.newsreader-project.eu/files/2014/12/NWR-2014-2-2.pdf NewsReader Guidelines for Annotation at Document Level . Technical Report NWR-2014-2-2, ICT 316404

  51. [52]

    Naushad UzZaman, Hector Llorens, James Allen, Leon Derczynski, Marc Verhagen, and James Pustejovsky. 2012. http://arxiv.org/abs/1206.5333 Tempeval-3: Evaluating events, time expressions, and temporal relations . arXiv preprint arXiv:1206.5333

  52. [53]

    Elke van der Meer , Reinhard Beyer, Bertram Heinze, and Isolde Badel. 2002. Temporal order relations in language comprehension. Journal of Experimental Psychology. Learning, Memory, and Cognition, 28(4):770--779

  53. [54]

    Marieke van Erp , Piek Vossen, Rodrigo Agerri, Anne-Lyse Minard, Manuela Speranza, Ruben Urizar, Egoitz Laparra, Itziar Aldabe, and German Rigau. 2015. http://www.newsreader-project.eu/files/2012/12/NWR-D3-3-2.pdf Annotated Data , version 2 . Technical Report D3-3-2, VU Amsterdam

  54. [55]

    Zeno Vendler. 1957. https://doi.org/10.2307/2182371 Verbs and Times . The Philosophical Review, 66(2):143

  55. [56]

    Verhagen, R

    M. Verhagen, R. Knippen, I. Mani, and J. Pustejovsky. 2006. http://www.lrec-conf.org/proceedings/lrec2006/pdf/638_pdf.pdf Annotation of Temporal Relations with Tango . In Proceedings of the Fifth International Conference on Language Resources and Evaluation ( LREC '06) . European Language Resources Association (ELRA)

  56. [57]

    Marc Verhagen, Robert Gaizauskas, Frank Schilder, Mark Hepple, Graham Katz, and James Pustejovsky. 2007. http://aclweb.org/anthology/S07-1014 SemEval -2007 Task 15: TempEval Temporal Relation Identification . In Proceedings of the Fourth International Workshop on Semantic Evaluations ( SemEval -2007) , pages 75--80. Association for Computational Linguistics

  57. [58]

    Marc Verhagen, Robert Gaizauskas, Frank Schilder, Mark Hepple, Jessica Moszkowicz, and James Pustejovsky. 2009. https://doi.org/10.1007/s10579-009-9086-z The TempEval challenge: Identifying temporal relations in text . Language Resources and Evaluation, 43(2):161--179

  58. [59]

    Marc Verhagen, Roser Sauri, Tommaso Caselli, and James Pustejovsky. 2010. http://www.aclweb.org/anthology/S10-1010 SemEval -2010 Task 13: TempEval -2 . In Proceedings of the 5th International Workshop on Semantic Evaluation , pages 57--62, Uppsala, Sweden, 15-16 July 2010. ACL

  59. [60]

    Christopher Walker, Stephanie Strassel, Julie Medero, and Kazuaki Maeda. 2006. https://catalog.ldc.upenn.edu/LDC2006T06 ACE 2005 Multilingual Training Corpus

  60. [61]

    Rolf A. Zwaan. 2016. https://doi.org/10.3758/s13423-015-0864-x Situation models, mental simulations, and abstract concepts in discourse comprehension . Psychonomic Bulletin & Review, 23(4):1028--1034

  61. [62]

    URL: " 'urlintro :=

    ENTRY address author booktitle chapter edition editor howpublished institution journal key month note number organization pages publisher school series title type volume year eprint doi pubmed url lastchecked label extra.label sort.label short.list INTEGERS output.state before.all mid.sentence after.sentence after.block STRINGS urlintro eprinturl eprintpr...

  62. [63]

    write newline

    " write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION word.in bbl.in capitalize " " * FUNCT...