Talkbank Schema version 1.1.2
Notation and color conventions of this schema representation:
a choice: [x y z] ; a sequence: (x y z) ; (* x,y) is the same as the dtd-like notation (x,y)*
Colours used:
{type declarations};
element names attribute names ;
restriction/extension facets;
{links to type definitions};
comments;
an extension of a type is {<<type>>} ; and a restriction to it : {>>type<<}
The current version defines two types of documents: the global elements below...
The global types are available for reusing through schema type extension/restriction.
The most up to date document definition is CHAT, it is also the richest in structure. Ideally, each group should develop a schema module defining the structure of their specific (class of) annotations, this schema should be an assembly of their definitions.
Developed by Romeo Anghelache, from the CHAT specifications,
released under the GNU Public License, 2001
Imported schemata:
Global Elements in main module
-
<CHAT>
-
(
-
<Participants>
@Participants; a structure enumerating the beings participating
-
[+
-
<tcu>
a group of utterances having something in common, usually the speaker
]
)
-
Version :
-
>>
{xs:string}<<
pattern:1.1.[0-9]{1,}
-
Date :
{xs:date}
31 March 1999 is formatted as 1999-03-31
-
Duration :
{xs:duration}
this work might be done in an extended interval of time; a duration of 1 year, 2 months, 3 days, 10 hours, and 30 minutes, one would write: P1Y2M3DT10H30M
-
Script :
{xs:language}
e.g. when you write russian words using english characters, then Lang="ru" and Script is "en"
Global Types in main module
-
annotation :
{free text}
-
flavor :
{xs:string}
allows semi structured extensions to the current set of annotations:
-
type :
these are the (legacy) dependent tiers, %mor line is, now, <morphemics> element
-
>>
{xs:string}<<
SALT
%def; on the main line, not recommended
actions
%act
addressee
%add
alternative
%alt
cohesion
%coh; cohesion tier
comments
%com;[% text]; comments by investigator
completion
[0 text]; an omitted word
english translation
%eng
errcoding
%err; error coding
excluded material
[%exc ...]
explanation
%exp; [= text]
extension
%cod; general purpose coding
extension
arbitrary annotations, CHAT postcodes, intended as an extension mechanism
facial
%fac
flow
%flo
gesture
%gpx
intonation
%int
language
%lan
paralinguistics
%par:
phonetic
%pho:
phonomodel
%mod:
replacement
[: text]
situation
%sit
speech act
%spa
target gloss
%gls
time stamp
%tim
-
comment :
-
<<
{xs:string}
>>
-
type :
-
>>
{xs:string}<<
Activities
Background
Coder
Coding
Date
Education
Generic
Language
Location
New Episode
Room Layout
Script
Situation
Stimulus
Tape Location
Time Duration
Time Start
Transcriber
Warning
-
group :
structure used to let annotations to belong to more than one word, can be recursive, although unnecessary: one can attach more than one annotations to a word, group of words, or whole utterances
-
(
-
[+
-
<wn> {wordnet}
a construct formed by words linked through clitic or compound e.g. once+and+for+all
-
<e> {event}
a reference to a point/portion of a mute/action signal, e.g. 0
]
)
-
morphemics :
the unit of a %mor line corresponding to a word (this element belongs to a word element, but, if the precise correspondence is not yet established, these elements will be present at the utterance level (contained in an utterance);
-
prosody :
-
type :
-
>>
{xs:string}<<
accented nucleus
//
blocking
^
contrastive stress
///
creaky
*text* in CA
drawl
:
pause
::
stress
/
-
tone :
-
type :
-
>>
{xs:string}<<
fall rise
-,.
falling final
-.
falling nonfinal
-_
final exclamation
-!
lengthened
-:
level nonfinal
-,
low level
-
rise fall
-'.
rising final
-?
rising nonfinal
-'
tag question
,,
-
utterance :
-
(
-
[+
-
<wn> {wordnet}
a construct formed by words linked through clitic or compound e.g. once+and+for+all
-
<e> {event}
a reference to a point/portion of a mute/action signal, e.g. 0
]
)
-
word :
{free text}
-
[*
-
<f>* {form}
equivalent of CHAT symbol @;
]
-
wordnet_marker :
-
type :
-
>>
{xs:string}<<
cli
clitic, CHAT ~
cmp
compound, CHAT +
Useful groups in main module
-
morphemic_unit :
-
(
-
<pfx>?
the whole morpheme is actually a prefix, CHAT equivalent is ~#
-
<pos>
-
(
-
<s>*{xs:string}
subcategory
)
)
Schema viewer developed by Romeo Anghelache, copyleft GNU GPL, 2001, version 1.01