Index Thomisticus Treebank

The Index Thomisticus Treebank contains Latin texts by Thomas Aquinas (Medieval Latin) enriched with complex, interlinked morphological, syntactic (around 354,000 nodes and 21,000 sentences), and semantic/pragmatic (around 28,000 nodes and 2,000 sentences) annotation. The texts are taken from the Index Thomisticus corpus. Built by Father Roberto Busa SJ, the Index Thomisticus is considered a pioneering resource in humanities computing and computational linguistics.

The texts are excerpted from Summa contra Gentiles (full syntactic annotation of books 1, 2 and 3), Scriptum super Sententiis Magistri Petri Lombardi and Summa Theologiae.

The annotation guidelines for the syntactic and the semantic/pragmatic levels of annotation resemble those for the so-called "analytical" and "tectogrammatical" layers of the Prague Dependency Treebank, respectively. The theoretical framework that motivates the annotation style is Functional Generative Description (P. Sgall, E. Hajičová, and J. Panevová. 1986. The Meaning of the Sentence in its Semantic and Pragmatic Aspects, D. Reidel, Dordrecht, NL). The Index Thomisticus Treebank can be browsed through the PML-TQ web interface.