Index Thomisticus Treebank

dependency tree structure

The Index Thomisticus Treebank contains Latin texts of Thomas Aquinas (Medieval Latin) enhanced with complex and interlinked morphological, syntactic (around 450,000 nodes and more than 26,000 sentences) and semantic/pragmatic annotation (around 28,000 nodes and 2,000 sentences). The texts of Thomas Aquinas are taken from the Index Thomisticus corpus. Built by father Roberto Busa SJ, the Index Thomisticus is considered to be a pathfinder resource in humanities computing and computational linguistics.

The text are excerpted from Summa contra Gentiles (entirely annotated), Scriptum super Sententiis Magistri Petri Lombardi and Summa Theologiae.

The annotation guidelines for the syntactic and the semantic/pragmatic levels of annotation resembles those for the so called "analytical" and "tectogrammatical" layers of the Prague Dependency Treebank respectively. The theoretical framework that motivates the annotation style is Functional Generative Description (P. Sgall, E. Hajicová, and J. Panevová. 1986. The Meaning of the Sentence in its Semantic and Pragmatic Aspects, D. Reidel, Dordrecht, NL). The Index Thomisticus Treebank can be browsed here through the PML-TQ web interface.