Besides the raw text (the so-called 'word layer': w), the annotation style of the Index Thomisticus Treebank features a three-layer structure, ordered as follows:
- a morphological layer (m);
- an analytical layer (a), annotating surface syntax;
- a tectogrammatical layer (t), annotating underlying syntax.
The development of each layer requires the availability of the previous one(s). The analytical and the tectogrammatical layers are represented
by dependency trees, named 'analytical tree structures' (ATS) and 'tectogrammatical tree structures' (TGTS), respectively.
The main features of ATS and TGTS are the following:
- In ATS, every word and punctuation mark of the sentence is represented as a node of a rooted dependency tree.
The edges of the tree correspond to (surface) syntactic dependency relations, such as Subject, Object, etc.;
- TGTS describe the underlying structure of the sentence, conceived as the semantically relevant counterpart of the grammatical means of expression.
The nodes of TGTS represent autosemantic words only (function words are excluded) and are labeled with semantic role labels (called 'functors'), divided
into 'inner participants' (i.e. arguments) and 'free modifications' (i.e. adjuncts). The two dimensions of TGTS represent the syntactic structure of
the sentence (the vertical dimension) and its information structure ('topic-focus articulation', TFA), based on the underlying word order (the horizontal dimension).
In Functional Generative Description (FGD), TFA deals with the opposition between contextual boundness (the 'given' information, on the left) and contextual unboundness (the 'new' information, on the right).
Ellipsis resolution, anaphora resolution and coreferential relations are also annotated.
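The difference between the two tree types can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the class and field names (`ATSNode`, `TGTSNode`, `kind`, `contextually_bound`) are hypothetical, the sentence is invented rather than taken from the treebank, the edge labels are plain-English placeholders rather than the treebank's tagset, and the functor and contextual-boundness values are a plausible reading, not an official annotation. Attaching the full stop to the predicate is also a simplification.

```python
from dataclasses import dataclass, field
from typing import Iterator, List


@dataclass
class ATSNode:
    """Analytical-layer node: one word or punctuation mark of the sentence."""
    form: str
    relation: str  # label of the edge to the parent (placeholder names)
    children: List["ATSNode"] = field(default_factory=list)


@dataclass
class TGTSNode:
    """Tectogrammatical-layer node: one autosemantic word."""
    lemma: str
    functor: str              # semantic role label
    kind: str                 # "root", "inner participant" or "free modification"
    contextually_bound: bool  # TFA value; the values used below are invented
    children: List["TGTSNode"] = field(default_factory=list)


def walk(node) -> Iterator:
    """Yield a node and all of its descendants, depth first."""
    yield node
    for child in node.children:
        yield from walk(child)


# Invented sentence: "Deus mundum in principio creavit ."
# ATS: all six tokens, including the preposition and the full stop, are nodes.
ats = ATSNode("creavit", "Predicate", [
    ATSNode("Deus", "Subject"),
    ATSNode("mundum", "Object"),
    ATSNode("in", "Preposition", [ATSNode("principio", "Adverbial")]),
    ATSNode(".", "Punctuation"),
])

# TGTS: only the four autosemantic words remain; "in" and "." get no node.
# Children are listed left to right, contextually bound nodes first.
tgts = TGTSNode("creare", "PRED", "root", False, [
    TGTSNode("Deus", "ACT", "inner participant", True),
    TGTSNode("mundus", "PAT", "inner participant", False),
    TGTSNode("principium", "TWHEN", "free modification", False),
])

ats_forms = [n.form for n in walk(ats)]
tgts_lemmas = [n.lemma for n in walk(tgts)]
arguments = [n.lemma for n in walk(tgts) if n.kind == "inner participant"]
adjuncts = [n.lemma for n in walk(tgts) if n.kind == "free modification"]
given = [n.lemma for n in walk(tgts) if n.contextually_bound]
new = [n.lemma for n in walk(tgts) if not n.contextually_bound]

print(len(ats_forms), "ATS nodes;", len(tgts_lemmas), "TGTS nodes")
print("arguments:", arguments, "| adjuncts:", adjuncts)
print("given:", given, "| new:", new)
```

The sketch keeps the two layers as separate classes because they answer different questions: the ATS mirrors the tokens of the sentence one to one, while the TGTS drops function words and punctuation and records a functor and a TFA value on each remaining node.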
Please find below the documentation for each of the annotation layers of the Index Thomisticus Treebank (plus the valency lexicon for Latin, VALLEX):