
This document is the second part of the overview of the conceptual model underlying the general purpose lexicons constructed in the course of the LE-PAROLE project.
The LE-PAROLE model is based on the GENELEX model with contributions from the MLAP/LE EAGLES project.
This report describes the PAROLE syntactic model:
Please note that the document contains lots of examples and SGML encodings. Most of them were extracted from PAROLE lexicons, but some of them were created as a matter of illustration and do not correspond to encoded data.
2.DESCRIPTION OF SYNTACTIC UNITS
The syntactic level is where all the information about syntactic behaviour(s) of lexical unit is described, especially what cannot be predicted from just knowing its morphosyntactic category and subcategory. As for morphology, complex and structured objects are defined in order to support explicitly the syntactic properties of each lexicon unit.
The " syntax " layer of the PAROLE conceptual level deals with:
These descriptive objects of the syntactic level consist mostly of:
The basic descriptive element of a syntactic behaviour is a Description.
2.1 Articulation with the Morphological Layer
The morphological layer of the model describes lexical units from a morphological viewpoint.
The description of lexical units also requires the representation of other types of information, for example concerning the particular syntactic behaviour of the entry. The objective here is to record as accurately as necessary the specific characteristics that distinguish it from the general behaviour associated with its grammatical category assigned in the morphological layer. For instance, we know that verbs have one subject and 0 to n complements. We will thus have to specify the number of complements and their nature. The syntactic layer of PAROLE is dedicated to the recording of this type of information.
A morphological unit (Mu) having one and only one syntactic category at the level of the morphological layer (grammatical category) may have one or several "syntactic behaviours".
Examples:
| L'homme arrive
à Paris. (The man arrives in Paris) | movement verb |
| L'homme arrive à comprendre. (The man succeeds in understanding) | modal verb |
| L'homme vole une pomme. (The man steals an apple) | transitive verb |
| L'oiseau vole rapidement. (The bird flies quickly) | intransitive verb |
The PAROLE model provides several ways for dealing with surface variation depending on how we use and combine the different syntactic objects: viz. syntactic unit, description, self, construction, position and syntagma. Since the model is not restrictive in this respect, one of the central tasks of syntactic encoding is to decide how the different surface realizations are modelled into subcategorization frames and how the resulting set of frames belonging to a particular lexical entry are organized into syntactic units.
The PAROLE model presented concerns both simple and compound lexical units. Besides, the formalization of simple words obviously applies to the representation of the external syntax of compound words.
Each morphological unit of type MuC (compound) and MuS (simple) may be in relation with at least one syntactic unit. If a Mu has several syntactic behaviours, then it may be in relation with several SynUs, it may be associated with only one SynU containing more than one description, or the construction associated to it may have alternating categories in its positions.In our approach, we consider that MuAff (affix) and MuCont (concatenate) do not have distinct syntactic behaviours that need to be represented in a lexicon.
A Syntactic Unit is equivalent to a syntactic entry or reading, and has one base description and possibly several derived descriptions.
If the entry belongs to a major category, then the SynU gives a minimum description of its complementation pattern (complementation of the Verb, the Noun, the Adjective, the Adverb) through a base Description (attribute ‘description'). Transformed descriptions may be specified in the attribute ‘descriptionl’.
If the entry does not belong to a major category, the SynU may express the context in which the entry inserts.
All SynUs may bear a CombUF, a combination of four usage features: level of language (‘style’), frequency (‘frequency’), geographical variant (‘vargeog’) and datation (‘dating’).
A SynU can be simple or compound. A compound syntactic unit is lexicalized with respect to phrase structure. It bears a Composition that contains the list of its components and the Self of its base Description has an internal structure (for a complete description of compound SynU, please refer to §2.12).
The element TransfSynU is used to put in relation two specific SynUs; it is especially useful for derived words (e.g. cook - cooker).
The Descriptions inside a SynU may be related by means of a FrameSet or not. The element FrameSet bears an identifier and thus it may be shared by several objects. It is used to express systematic relations between descriptions (e.g. pronominalisation).
A SynU is defined by a Base description and 0 to n related descriptions.
Descriptions are shared objects defined by a Self and a Construction. The Self describes the morphosyntactic and semantic characteristics of the head of the SynU and the Construction specifies the complementation pattern of the head.
A description is not intrinsically basic or transformed, but it fulfils these roles for a given SynU. The same description may play both the role of a base description for a SynU and that of a related description for another.
A Description may not bear a Construction if one wants to describe the behaviour of the Mu without specifying its context of occurrence. But a Description consists at least of a Self.
A Description can belong to one or more FrameSets.
The Self carries the information of the lexical item when inserted in the context given by the Construction.
For compound syntactic units, Self also allows to describe the internal structure of the compound unit through ‘syntagmats’ and ‘syntagmatsl’ features.
Given a syntactic construction, Self allows to express all the characteristics of the entry for this construction as a caller through Intervconst:
Example:
abricot
(apricot)
Self : IntervConst : A
Example:
Il est intéressant de
remarquer cela
(It is interesting to note that)
bc : P0 P1 P2
P0 : PRO[LEX:il][MORPHSUBCAT:IMPERSONAL]
P1 : V[SYNSUBCAT:COPULA]
P2[Function:SUBJPRED]:AP
AP : (P0) SELF P1
Self : IntervConst[Function:HEAD]:A
P0 : ADVP
P1 : PP[PREP:de]
Example of conjugation auxiliary:
tomber
(to fall)
Self : IntervConst : V[AuxFeature:ETRE]
Example of a morphological restriction:
lustres
Self : IntervConst : N[TNUMBER:PLURAL]
Example of preverbal particle for uses of French "true pronominal verbs"
s'en aller
(to go away)
Self : IntervConst : V[NPRONOMINAL:SEEN]
A Construction describes the syntactic context required and/or restricted by the described entry. In other words, for verbs, a construction describes what is usually called a "complementation pattern".
A construction is defined by:
The list of non-terminal syntagmatic label is given in §2.7.3.
A list of available syntactic features is given in §2.8.
As there is the possibility to rewrite some Syntagmas and insert the Self (together with its immediate context) in a wider context (a tree, as deep as necessary), Self is not always inserted at the top level of the construction. So, the attribute ‘selfinsertion’ is used to specify the insertion level for the entry. It has to be noted that this rewriting possibility for wider context is generally not useful for verb description, as the verb usually does not need to be inserted in wider context. But this is very useful for other categories.
One may also wish to express the insertion point of Self in the construction (or the phrase in which Self occurs). To do so, the attribute ‘selfinsertion’ that takes the value i is used. It means that Self is inserted before the position Pi. If Self comes after all the positions of the construction, the value of i will be the value of the last position + 1. If one does not want to record the insertion point, the attribute will not be documented.
Example of the French Construction NT162:
This Construction has 2 PositionCs and -its Self inserts before the second position. It is used to encode verbal heads which are preceded by a subject and followed by a direct object.
Examples:
Paul lit un livre
Paul reads
a book
Sgml encoding:
<Construction
id="NT162"
syntlabel="Clause"
selfinsertion="1">
<InstantiatedPositionC
range="0"
optional="RATHERNOO"
positionc="P0SynSN">
<InstantiatedPositionC
range="1"
optional="RATHERNOO"
positionc="P1SynSN"></Construction>
A Position is an element entering into the definition of a construction or a non-terminal phrase.
A Position is associated to a set of three elements:
The attribute ‘repetable’ serves to encode if a position can be repeated several times in a Construction.
Positions may be shared by different constructions although their rank (P0, P1, ...) in these constructions may vary.
2.6.2 List of available functions
Here follows the final list of functions used in the PAROLE project. Not all functions are used by all Partners, please refer to P-WP1.1-MEMO-ERLI-5 V2: " Annex to TA: Encoding features and values for the morphological layer in the lexicon merged tags " for more information on this point.
HEAD, SUBJECT, OBJECT, INDIRECTOBJECT, OBLIQUE, SUBJPRED, OBJPRED, NCOMP, NSUBJ, NOFCOMP, NPREPCOMP, NAPPOSITION, NADJUNCT, NCLAUSCOMP, NDETERMINATIVE, NATTRIBUTIVE, NMODIFIER, ACOMP, APREPCOMP, ACLAUSCOMP, AADJUNCT, AMODIFIER, ADVCOMP, ADVPREPCOMP, ADVMODIFIER, DETMODIFIER, PREPDEPENDENT, CONJDEPENDENT, PREPOBJ, ADVERBIAL, COMPL, CLAUSCOMP, NGENATTRIBUTIVE, NLEFTATTRIBUTIVE, NPOSTPCOMP, NRIGHTATTRIBUTIVE, REALSUBJ.
2.6.3 Link between Positions, SyntagmaNTC and Constructions
Contrary to the GENELEX model, Positions are not directly linked to Constructions or SyntagmaNTC. This link is ensured through InstantiatedPositionC objects.
If a SyntagmaNTC or a Construction consists of 2 Positions, then 2 InstantiatedPositionCs are linked to the SyntagmaNTC or the Construction. InstantiatedPositionC objects encode:
Example of the French Construction NT162 that has 2 PositionCs and 2 InstantiatedPositionC (please refer to §2.5 for an explanation of the Construction):
Sgml encoding:
<Construction
id="NT162"
syntlabel="Clause"
selfinsertion="1">
<InstantiatedPositionC
range="0"
optional="RATHERNOO"
positionc="P0SynSN">
<InstantiatedPositionC
range="1"
optional="RATHERNOO"
positionc="P1SynSN"></Construction>
<PositionC
id="P0SynSN"
comment="Fonction attribuee"
function="SUBJECT"
syntagmacl="SynSN">
<PositionC
id="P1SynSN"
comment="Fonction attribuee"
function="OBJECT"
syntagmacl="SynSN">
A position may be filled either by one or several terminal phrases (SyntagmaT) or by one or several non-terminal phrases (SyntagmaNTC).
A phrase occupying a position is formally described by a terminal or non-terminal syntagmatic label with which a set of constraints is associated if any.
SyntagmaNTCs and Constructions share the same definition, they have the same attributes and they are connected to the same objects. Please refer to §2.5 for information concerning the structure of SyntagmaNTC.
SyntagmaT are defined by:
2.7.2 Syntagmatic labels for SyntagmaT
Here follows the final list of terminal syntagmatic labels used in the PAROLE project. Not all labels are used by all Partners, please refer to P-WP1.1-MEMO-ERLI-5 V2: " Annex to TA: Encoding Features and values for the morphological layer in the lexicon merged tags " for more information on this point.
Available values:
V, N, A, PRO, ADV, CONJ, ADP, DET, ART, NUM, RES, UNIQUE, INTER, ADADJ, POSTADV, E.
2.7.3 Syntagmatic labels for SyntagmaNTC
Here follows the final list of non-terminal syntagmatic labels used in the PAROLE project. Not all labels are used by all Partners, please refer to P-WP1.1-MEMO-ERLI-5 V2: " Annex to TA: Encoding Features and values for the morphological layer in the lexicon merged tags " for more information on this point.
Available values:
NP, VP, PP, AP, ADVP, Clause, NG, DETP, PSP, WITHOUTE.
2.7.4 Alternatives of realization
A Position can be filled by one or more Syntagma types. This is useful to encode alternatives of distribution, that is to say distribution paradigms.
For instance, if a lexicographer wishes to encode that a transitive verb can either take a clausal direct object or a nominal direct object, he can do so by linking 2 SyntagmaNTCs to the same position.
Example of the Catalan verb sentir which can either be followed by a VP or a VP introduced by a as a complement:
Examples:
He sentit a dir que apujaran els
preus.
I have heard that there will be a rise in the prices
He sentit dir que apujaran els preus.
Construction of the verb sentir (hear):
<Construction
id="CSnOinfA"
selfinsertion="1"
syntlabel="Clause">
<InstantiatedPositionC
range="0"
optional="NOO"
positionc="Snp">
<InstantiatedPositionC
range="1"
optional="NOO"
positionc="OvpinfA">
Position that encodes the alternatives of realizations:
<PositionC
id="OvpinfA"
function="OBJECT"
syntagmacl="VPinf VPinfA">
In most cases, the label referring to the phrase is sufficient to describe it as a position filler, and no constraint on its structure needs to be expressed for the described entry.
However one may need to express constraints on the structure in which a lexical entry inserts.
To do so, it is possible to use:
2.7.5.1 Tree-structured rewriting
Positions can recursively be rewritten because the SyntagmaNTC and the Construction have both the same definition:

AP will be rewritten in French as follows:

The optionality of ADVP pre-modifier is indicated using parenthesis.
Examples:
rouge
(red)
très beau
(very nice)

In these schemata, the star (*) encodes the fact that the position can be repeated.
Example:
le très grand lit
(the very large bed)

Example:
le lit rouge
(the red bed)
Example:
le lit est rouge
(the bed is red)
2.7.5.2 Partial rewriting of Phrases
For some entries we may want to express partial restrictions on a phrase without having to rewrite it completely, because it is not always possible; for instance, in the case of a verbal phrase, we do not know how many positions there are if we do not know the head.
In that case, the structure of a "prototypic" phrase (list of positions) is maintained, but certain position fillers are restricted both at the level of their list (removal of fillers) and at the level of constraints (addition of restricting features) on each.
The solution is then to give only the list of positions that is restricted or the type of fillers, and to use the attribute ‘positionl‘ whose values (OPEN, CLOSED) allow to specify whether the list of rewriting positions specifies entirely (CLOSED) or partially (OPEN) the rewriting of the phrase.
Features are restrictions adding to the syntagmatic label in the specification of a Phrase.
The different usable types and sub-types of features, their attributes and the values they can take are going to be presented.
Since time has past since the specification of the features and the encoding, some features were finally not used by partners. In this case, only available values are indicated for the feature, no explanation or sgml encoding is given.
Lexical features (LexFeature) allow to constrain all or part of the lexicalization of a phrase.
To do so, one has to specify:
There are two types of lexical features: introducers and the LEX feature itself.
2.8.1.1 Introducers (INTROD, PREP, CONJ, RELPRO, INTPRO and POSTP)
These features allow to specify the lexicalization of phrase introducers without having to rewrite them. They are not ambiguous and only apply to non-terminal phrases (SyntagmaNTC).
6 types of features are distinguished:
1.- PREP is used for the preposition introducing a non-terminal phrase, typically a Prepositional Phrase
Example of the encoding of the adjunct introduced by about in English:
People joke about politics
Sgml encoding:
<PositionC
id="positionc-v018"
function="VADJUNCT"
syntagmacl="syntagmantc-v-prep-about">
<SyntagmaNTC
id="syntagmantc-v-prep-about"
syntlabel="PP"
featurel="prep-about" ></SyntagmaNTC>
<LexFeature
id="prep-about"
featurename="PREP"
value="about"
mu="PREP12">
2.- CONJ is used for the conjunction introducing a sentence typically in a that-clause.
Example of the encoding of the que that introduces that-clauses in French:
Paul pense que Pierre est un menteur
Paul thinks that Pierre is a liar
Sgml encoding:
<SyntagmaNTC
id="QUEP"
syntlabel="Clause"
featurel="T_CONJ_UM39774" ></SyntagmaNTC>
<LexFeature
id="T_CONJ_UM39774"
featurename="CONJ"
value="que"
mu="UM39774"
saturesynt="YESSA">
3.- RELPRO is used for the relative pronoun introducing a sentence
4.- INTPRO is used for interrogative pronouns that can be specified by verbs requiring interrogative clauses (mainly verbs of speech).
Example of si (wether) that introduces interrogative clauses of verbs like demander (ask)):
Paul demande si Pierre vient
Paul asks wether Pierre come
Sgml encoding:
<SyntagmaNTC
id="SIP"
syntlabel="Clause"
featurel="T_CONJ_SI" ></SyntagmaNTC>
<LexFeature
id="T_CONJ_SI"
featurename="CONJ"
value="si">
5.- INTROD is used for any non-terminal phrase introducer (particle, "recategorizer", etc.) not belonging to any of the four other types.
In the Danish Lexicon, the value at for INTROD feature (corresponding to english that) encodes the presence of the introducer at in subordinate clauses.
Example:
Jeg håbede på
at han ville komme
I hoped that he would come
Sgml encoding:
<LexFeature
id="Trait_Lex8"
featurename="INTROD"
value="at"
mu="UM057173"
saturesynt="YESSA">
<SyntagmaNTC
id="S_NT_Cics"
naming="infinitive with introducer subject control "
syntlabel="Clause"
featurel="Trait_Lex8">
<SyntFeatureClosed
featurename="MOOD"
value="INFINITIVE">
<SyntFeatureClosed
featurename="SYNSUBCAT"
value="SSINFINITIVE">
<SyntFeatureClosed
featurename="COREF"
value="COI">
<SyntFeatureClosed
featurename="CONTROLT"
value="SUBJECTCONTROL"></SyntagmaNTC>
6.- POSTP
The LEX feature allows to specify the lexicalization of:
N for NP
P for PP
V for VP or Clause
A for AP
Adv for ADVP
A particular case of restriction is when the phrase is saturated by its head, i.e. the noun phrase is limited to the noun. To express this property, we use the attribute ‘saturesynt.’
When it has the value YESSA, the head of the phrase is the only leaf with the Introducer, if any. This applies to phrases entering in the definition of simple and compound SynUs.
The field saturesynt must always have the value YESSA when the feature applies to a terminal category: by definition, the leaf is saturated by the lexicalizing element.
In the rewriting alternative, we will use the LEX feature (see next paragraph) on terminal phrases.
Example of the French verb accepter (accept) that sub-categorize for a VP complement introduced by de:
Example:
Il a accepté de venir
He agreed to come
Sgml encoding:
<LexFeature
id="T0"
featurename="LEX"
value="de"
mu="UM13551">
2.8.1.3 Cooccurrence of lexical features
A particular phrase can only bear one and only one LEX feature that will lexicalize the terminal phrase in the one case, and the head, and only the head of the non-terminal phrase, in the other case.
Since each non-terminal phrase has only one introducer, then a phrase can only bear one feature specifying its introducer.
On the contrary, for non-terminal phrases, introducers may be combined to the LEX feature.
Ex: PP[PREP:de][LEX:concert]
These remarks are also valid by reflection for the corresponding RefLex features which are described in the chapter on compound syntactic units (cf. §2.12).
They allow to express a restriction on the value of a morphological feature (MOOD(mood of the verb), TENSE (tense of the verb), PERSON (person), GENDER (gender), TNUMBER (number), and POSSESSOR (number of the possessor), GENDERPOSS (gender of the possessor)) of the phrase or of one of its components.
Ex: NP[TNUMBER:PLURAL]
MOOD, TENSE, PERSON, GENDER, TNUMBER and POSSESSOR are features that can combine with one another to form a specific combination of morphological features.
Ex: [MOOD:SUBJUNCTIVE]
[TENSE:PRESENT]
[PERSON:3]
[TNUMBER:SINGULAR]
Here follows the list of available values for morphological features as they are specified in the document P-WP1.1-MEMO-ERLI-7(V3): " List of feature names and feature values used in the PAROLE lexicon DTD ".
MOOD:
Available values:
INDICATIVE, SUBJUNCTIVE, IMPERATIVE, GERUND, CONDITIONAL, INFINITIVE, PARTICIPLE, SUPINO, PRESPART, PASTPART, INFLECINF, CONJUNCTIVE.
Example of Italian verbs that sub-categorize for a that-clause whose Mood is indicative:
Example:
Egli afferma la sua innocenza / di
essere innocente / che è innocente
He claims his innocence / to be innocent / that he is innocent
Sgml encoding:
<SyntagmaNTC
id="S_NT_thind"
syntlabel="Clause">
<SyntFeatureClosed
featurename="SYNSUBCAT"
value="THATCL">
<SyntFeatureClosed
featurename="MOOD"
value="INDICATIVE"></SyntagmaNTC>
TENSE:
Available values:
PRESENT, IMPERFECT, FUTURE, PAST, PLUSQUEPARFAIT.
In the Danish Lexicon, the value PAST of the TENSE feature is used for verbs to mark constructions with past participle.
Example:
jeg mindes ikke brevet sendt
I do not remember that the letter has been posted
Sgml encoding:
<SyntagmaT
id="S_Tpastp"
naming="Past participle"
syntlabel="V">
<SyntFeatureClosed
featurename="MOOD"
value="PARTICIPLE">
<SyntFeatureClosed
featurename="TENSE"
value="PAST"></SyntagmaT>
PERSON:
Available values:
1, 2, 3, 4, NEGATIVE.
In the Spanish lexicon, PERSON is used to distinguish 'terciopersonal' verbs; in this case, the value is "3". These verbs do not inflect in person and always occur in third person singular form, these include:
(a) 'metereological' verbs
Example:
llover
rain
(b) verbs (exclussively) taking sentential subjects
Example:
me consta que le gustas
I know that he likes you
(c) and impersonal verbs (no subject) such as bastar
Example:
me basta con eso
to me suffices with that
Sgml encoding:
<Description
id="DescVaEmpty"
representativemu="llover"
naming="Impersonal intransitive"
example="Ha llovido toda la mañana"
self="SELFVa3"
construction="ConsEmpty" >
<Self
id="SELFVa3"
comment="para verbos terciopersonales"
intervconst="ICV8">
<IntervConst
id="ICV8"
function="HEAD"
syntagmatl="Va3">
<SyntagmaT
id="Va3"
syntlabel="V"
comment="verbos terciopersonales">
<SyntFeatureClosed
featurename="PERSON"
value="3">
<SyntFeatureClosed
featurename="VOICE"
value="ACTIVE">
GENDER:
Available values:
MASCULINE, FEMININE, NEUTER, GCOMMON, MF, CONT, INDISCRIMINATE, OO, INANIMATE, NONMASCULINE, NONNEUTER, .
GENDER is used for the description of impersonal adjectives in the Greek Lexicon. In Greek, besides impersonal verbs, there are two kinds of impersonal expressions, both introduced with the verb "eimai" (be) the third singular person, and either a noun or an adjective in a specific form: both the noun and the adjective must be in the singular nominative case, and, furthermore, the adjective appears only in the neuter gender. Impersonal expressions subcategorise for clausal complements.
Sgml encoding:
<SyntagmaT id="synadjimp"
syntlabel="A">
<SyntFeatureClosed
featurename="TNUMBER"
value="SINGULAR">
<SyntFeatureClosed
featurename="CASE"
value="NOMINATIVE">
<SyntFeatureClosed
featurename="GENDER"
value= "NEUTER"></SyntagmaT>
Example:
tha itan kalo na pigainame dialkopes
tora
it would be nice to go on holydays now
POSSESSOR:
Available values:
SING, PLUR, SG1, SG2, PL1, PL2, SGPL3.
In the Finnish lexicon, the feature POSSESSOR is used in the Syntactic layer in the 'participial construction'. The participial construction can function as the object of certain verbs of communication or certain verbs indicating perception or some other cognitive processes (e.g. 'sanoa'-'say'; 'n‰hd‰-'see'; 'kuulla'- 'hear'; 'ajatella'-'think'). The participial construction corresponds to an affirmative 'ett‰'-clause (that-clause) in an object position. The subject of the participial construction is expressed by a possessive suffix (encoded through a POSSESSIVE feature) if the subject of the participle is the same as that of the main clause.
Example:
That-clause
Leena huomasi, [ett‰ h‰n itki]
Leena noticed [that she was crying]
Participial construction
Leena huomasi [itkev‰ns‰]
Leena noticed [that she was crying]
Sgml encoding:
<Construction
id="Syntagme_NT_2DPcp_1"
naming="verb + participial construction
as direct object"
example="Leena huomasi itkev‰ns‰
(PCP1 ACT GEN SG3)"
synt="Clause"
positionl="OPEN"
<InstantiatedPositionC
range="0"
optional="YESO"
positioncl="PositionC_V1SCtr"
</InstantiatedPositionC>
<InstantiatedPositionC
range="1"
optional="NOO"
positioncl="PositionC_V3ScpCOREF_1"
</InstantiatedPositionC>
</Construction>
<PositionC
id="PositionC_V1SCtr"
naming="NP in nominative"
function="SUBJECT"
syntagmacl="Syntagme_NT_1PSCtr"
</PositionC>
<PositionC
id="PositionC_V3ScpCOREF_1"
naming="participial construction
as direct object"
example="Leena huomasi itkev‰ns‰"
function="OBJECT"
syntagmacl="Syntagme_NT_3PaScpCOREF"
</PositionC>
<SyntagmaNTC
id="Syntagme_NT_1PSCtr"
naming="NP in nominative"
syntlabel="NP"
<SyntFeatureClosed
featurename="CASE"
featurevalue="NOMINATIVE"
</SyntFeatureClosed>
<SyntFeatureClosed
featurename="COREF"
featurevalue="COI"
</SyntFeatureClosed>
</SyntagmaNTC>
<SyntagmaNTC
id="Syntagme_NT_3PaScpCOREF"
comment="The possessive suffix
is congruent with the subject"
example="Leena huomasi itkev‰ns‰"
naming="participle"
syntlabel="VP"
<SyntFeatureClosed
featurename="NONFINITE"
featurevalue="PARTICIPLEONE"
</SyntFeatureClosed>
<SyntFeatureClosed
featurename="VOICE"
featurevalue="ACTIVE"
</SyntFeatureClosed>
<SyntFeatureClosed
featurename="CASE"
featurevalue="GENITIVE"
</SyntFeatureClosed>
<SyntFeatureClosed
featurename="POSSESSOR"
featurevalue="SGPL3"
</SyntFeatureClosed>
<SyntFeatureClosed
featurename="COREF"
featurevalue="COI"
</SyntFeatureClosed>
</SyntagmaNTC>
Structure of Finnish non-finite verb-forms:
STEM+NUMBER+CASE+POSSESSIVE SUFFIX+CLITIC PARTICLE
The structure of Finnish nominals: Example of a paradigm of the noun ‘cat’:
STEM + NUMBER + CASE + POSSESSIVE SUFFIX
cat+ PL + ADESSIVE
(minu-n) kisso i lla ni (SG1)
(Minun) kisso-i-lla-ni on n‰lk‰
My cats are hungry
GENDERPOSS:
Available values:
GPCOMMON, GPNEUTER.
2.8.3 Morpho-syntactic features
This family of features includes MORPHSUBCAT, Aux (AuxFeature), NPRONOMINAL and AGREEM. All the constraints expressed by these features may add to one other.
Ex: V[AuxFeature:ETRE][NPRONOMINAL:SE]
MORPHSUBCAT:
This feature takes as possible values the morphological subcategories (refer to the P-WP1.1-MEMO-ERLI-32: " PAROLE report on the Morphological Layer ").
They may apply to terminal and non-terminal phrases.
The list of values is finite and corresponds to the list of PAROLE grammatical subcategories defined in morphology.
Available values:
PROPER, COMMON, MAIN, AUX, MODAL, COPULA, QUALI, POSSESSIVE, DEMONSTRATIVE, INTERROGATIVE, RELATIVE, RECIPROCAL, EXCLAMATIVE, REFLEXIVE, PERSONAL, SSCPARTITIVE, INDEFINITE, DEFINITE, GENERAL, PARTICLE, PRONOMINAL, CIRCUMPOSITION, POSTPOSITION, PREPOSITION, COORDINATIVE, CARDINAL, ORDINAL, VPART, FRACT, FOREIGN, FORMULA, IMPERSONAL, RELINDEF, SUBJUNCTION, NEGATION, MULTIPLICATIVE, ANALOG, SYMBOL, COLLECTIVE, MEDIAL, SSCCOMPARATIVE, ANSWER, INFINM, COMPM, NEGM, EN, HI, HO, ABBREVIATION, MEDIOPASSIVE, ACRONYM, PERSONALSUBJECT, PERSONALOBJECT, CLASSIFYING, NORMAL, NONINFLECTING, SSCSTRONG, SSCWEAK, POSITION, INFMARK, SSCNUMERAL, NONMODAL, SSCFUTURE, FORMALSUBJECT, OTHER.
Example of Dutch noun that must be interpreted as proper in some syntactic contexts:
Example:
Natuurreservaat Friescheveen
nature reserve Friescheveen
Sgml encoding:
<SyntagmaT
id="SyntagmeT4"
syntlabel="N"
featurel="TraitLex4 ">
<SyntFeatureClosed
featurename="MORPHSUBCAT"
value="PROPER"></SyntagmaT>
Aux (AuxFeature):
AuxFeature is a complex feature whose value corresponds to the conjugation auxiliary of the verb.
Example of Italian verbs that conjugates with avere (have) auxiliary:
Abbagliare qlcu con false promesse
to mislead someone with false promises
Sgml encoding:
<AuxFeature
id="T_AUX_avere"
value="avere">
NPRONOMINAL:
This feature allows to specify for a given verb the non-referential preverbal particle that is associated with that verb in such-and-such syntactic turn. This is the way to indicate the behaviours of "true pronominal" verbs. This feature may be borne by the Syntagma linked to the Self of a Description.
Available values:
SE, LE, LA, LES, Y, NPEN, SELE, SELA, SELES, SEY, SEEN, NPHI.
Example of Catalan pronominal verbs:
Es tracta que siguis puntual
It is a question of your being in time
Sgml encoding:
<SyntagmaT
id="Vpronse"
syntlabel="V">
<SyntFeatureClosed
featurename="NPRONOMINAL"
value="SE">
AGREEM:
When applying to several terminal or non-terminal phrases, this feature allows to express agreement constraints determined by a lexical unit.
Therefore, this feature allows to explicitely record difficult agreement rules. It goes without saying that, in the absence of this feature, general agreement rules apply.
Available features for AGREEM:
Values used to indicate the elements in agreement:
I, J, K, L.
Values indicating no agreement:
NOTI, NOTJ, NOTK, NOTL
Example of Italian verb addestrare (to train):
Addestrarsi nell'uso delle armi; -all'uso delle armi; - a usare le armi
to be trained in the use of the arms
Sgml encoding:
<SyntagmaT
id="S_T_pro[[s]]"
syntlabel="PRO">
<SyntFeatureClosed
featurename="CLITICPARTICLE"
value="CPYES">
<SyntFeatureClosed
featurename="AGREEM"
value="I"></SyntagmaT>
<SyntagmaNTC
id="S_NT_np11s22"
syntlabel="NP">
<SyntFeatureClosed
featurename="AGREEM"
value="I"></SyntagmaNTC>
SUBORDT
Available values:
PLUSINFVE, COMPAR, PLUSFIN
TENSEMOOD
Available values:
TMPRESENT, TMPAST, TMCONDITIONAL, TMIMPERATIVE, TMPOTENTIAL
WHTYPE
Available values:
WHINT, REL, WHNO
In the Spanish lexicon, 3 types of modifiers adverbs (adverbs that do not subcategorize complements) can be distinguished: general adverbs (e.g. llegamos tarde / we arrived late), degree adverbs (e.g. my bueno / very good) and interrogative adverbs (e.g. dónde estas? / where are you). In the Spanish Lexicon, they are distinguished by means of the MORPHSUBCAT, SYNSUBCAT and WHTYPE attributes:
Sgml encoding:
<!-- Selfs objects for adverbs-->
<Self
id="SELFAdvVMODIFIER"
comment="Adverbs which are Verb modifiers"
example=" e.g., 'llegamos tarde' (we arrived late)"
intervconst="ICADV1">
<Self
id="SELFAdvAMODIFIER"
comment="Adverbs which are Adjective modifiers"
example="e.g., 'muy/francamente buena' (very/really good)"
intervconst="ICADV2">
<Self
id="SELFAdvINT"
comment="Interrogative Adverbs"
example="e.g., '¿dónde estas?' (Where are you?)"
intervconst="ICADV4">
...
<!-- Adverbs -->
<IntervConst
id="ICADV1"
function="VMODIFIER"
syntagmatl="Adv">
<IntervConst
id="ICADV2"
function="AMODIFIER"
syntagmatl="Advdeg Adv">
<IntervConst
id="ICADV4"
function="ADVMODIFIER"
syntagmatl="AdvintSelf">
...
<!-- SyntagmaT for Adverbs-->
<SyntagmaT
id="Adv"
syntlabel="ADV"
comment="general adverbs">
<SyntFeatureClosed
featurename="MORPHSUBCAT"
value="GENERAL">
<SyntFeatureClosed
featurename="WHTYPE"
value="WHNO">
<SyntFeatureClosed
featurename="NDEGREE"
value="POSITIVE">
<SyntagmaT
id="Advdeg"
syntlabel="ADV"
comment="degree adverbs">
<SyntFeatureClosed
featurename="MORPHSUBCAT"
value="GENERAL">
<SyntFeatureClosed
featurename="SYNSUBCAT"
value="DEGREE">
<SyntFeatureClosed
featurename="WHTYPE"
value="WHNO">
<SyntFeatureClosed
featurename="NDEGREE"
value="POSITIVE">
<SyntagmaT
id="AdvintSelf"
syntlabel="ADV"
comment="interrogative adverb"
example="dónde">
<SyntFeatureClosed
featurename="MORPHSUBCAT"
value="INTERROGATIVE">
<SyntFeatureClosed
featurename="WHTYPE"
value="WHINT">
<SyntFeatureClosed
featurename="NDEGREE"
value="POSITIVE">
ZUINCL
Available values:
NORM, ZU
All the constraints expressed by these features may add to one another.
Ex: V[PASSIVIZABLE:PASYES][NEGATIVE:NO]
SYNSUBCAT:
This feature takes as possible values the syntactic subcategories that label subsets of syntactic behaviours for a given category.
They can apply to terminal or non-terminal phrases.
Available values:
SSRELATIVE, INTERROGATIVED, INTERROGATIVEIND, SSINFINITIVE, SSINTERROGATIVE, THATCL, TIME, PLACE, MANNER, DEGREE, QUANTITY, WITHOUTDET, DETERMINED, COORDINATE, COPULATIVE, NACLAUSE, SUBORDINATE, SSCOMPARATIVE, SSSUPERLATIVE, DETERMINATIVE, LOCATION, DIRECTION, DIRECTIONAWAYFROM, DIRECTIONTOWARDS.
Example:
Egli afferma la sua innocenza / di essere innocente / che Ë innocente
He claims his innocence / to be innocent / that he is innocent
Sgml encoding:
<SyntagmaNTC
id="S_NT_thind"
syntlabel="Clause">
<SyntFeatureClosed
featurename="SYNSUBCAT"
value="THATCL">
<SyntFeatureClosed
featurename="MOOD"
value="INDICATIVE"></SyntagmaNTC>
TOURNURE:
The presence of a TOURNURE feature allows to indicate whether a sentence is obligatorily exclamative or interrogative.
Available values:
TOINTERROGATIVE, TOEXCLAMATIVE.
NEGATIVE:
Available values:
YES, NO.
NEGATIVE is a boolean feature which is used in the Catalan lexicon to mark negative VPs. Some adverbs in Cataln can occur only in a negated (or interogative) context: gaire (much), gens (at all), mai (ever), enlloc (anywhere).
Sgml encoding:
<SyntagmaNTC
id="VPneg"
syntlabel="VP">
<SyntFeatureClosed
featurename="NEGATIVE"
featurevalue="YES">
Example:
No em canso gaire
I don't get tired very easily
PASSIVIZABLE:
The presence of a PASSIVIZABLE encodes if the verb may (value: PASYES) or may not (value: PASNO) be passivized.
Note: the absence of the PASSIVIZABLE feature involves an active form, which prevents from recording the feature [PASSIVIZABLE:PASNO] for each entry.
Example:
Catalan verbs like tenir (to have):
Els arbres tenen fulles
Trees have leaves
Sgml encoding:
<SyntagmaT
id="Vnopas"
syntlabel="V">
<SyntFeatureClosed
featurename="PASSIVIZABLE"
value="PASNO">
<Description
id="DVnpasSnOn"
representativemu="tenir"
example ="Els arbres tenen fulles"
comment ="Verbs transitius no passivitzables"
self="SELFVnpas"
construction="CSnOn">
MVF:
Available values:
TRS, INT, IMP.
VOICE
Available values:
ACTIVE, PASSIVE.
VOICE is used in the Greek lexicon for verbs which present a particular syntactic behaviour in the passive voice.
Example:
O Giannis fortothike sti Maria na ton voithisei
John pestered Maria to help him
In the previous example, the verb fortono normally means load, and the meaning given above occurs only in the passive voice and in the specific construction.
Sgml encoding:
<SyntagmaT
id="synverbmedctl"
syntlabel="V">
<SyntFeatureClosed featurename="VOICE" value="PASSIVE"> <SyntFeatureClosed
featurename="CONTROLT"
"featurevalue="INDIRECTOBJECTCONTROL">
FUNCT
Available values:
MOD,SPE, ATTRIBUTIVE, PREDICATIVE ,ADVERBIAL, ADJECTIVAL, NOMINAL, FUPRONOMINAL, DETERMIN, VERBAL, NONATTR.
FUNCT is used in the Italian lexicon in order to differentiate adjectives that can be used predicaticatively, PREDICATIVE value, (e.g. bianco / white) from those that can only be used attibutively, ATTRIBUTIVE value (e.g. nucleare / nuclear).
Example:
Il muro è bianco
The wall is white
Il muro bianco
The white wall
energia nucleare
nuclear energy
Sgml encoding:
<SyntagmaT
id="S_T_A_x_npred_post_ng"
syntlabel="A">
<SyntFeatureClosed
featurename="FUNCT"
value="ATTRIBUTIVE">
<SyntFeatureOpen
featurename="POSITION"
value="POSTPOSITION">
<SyntFeatureOpen
featurename="GRADABLE"
value="NO"></SyntagmaT>
CASE
Available values:
NOMINATIVE, GENITIVE, DATIVE, ACCUSATIVE, VOCATIVE, OBLIQUE, PREPOSOBJ, REFL, UNMARKED, PARTITIVE, INESSIVE, ELATIVE, ILLATIVE, ADESSIVE, ABLATIVE, ALLATIVE, ESSIVE, TRANSLATIVE, INSTRUCTIVE, ABESSIVE, COMITATIVE, ORDINARY, OBJECT.
In the Sweedish Lexicon, the value OBLIQUE is used for pronouns used in the direct object, indirect objetc position or when the pronoun is preceded by a preposition.
Example:
Jag såg henne
I saw her
Jag gav henne en bok
I gave her a book
Sgml encoding:
<SyntagmaT
id="PROoblPX"
syntlabel="PRO">
<SyntFeatureClosed
featurename="CASE"
value="OBLIQUE">
<SyntFeatureOpen
featurename="SSEM"
value="INANIMATUM"></SyntagmaT>
CLITICPARTICLE
Available values:
CPYES.
Example of the Italian verb (to subscribe):
Abbonarsi ad una rivista
to subscribe a magazine
Sgml encoding:
<SyntagmaT
id="S_T_pro[[s]]"
syntlabel="PRO">
<SyntFeatureClosed
featurename="CLITICPARTICLE"
value="CPYES">
<SyntFeatureClosed
featurename="AGREEM"
value="I"></SyntagmaT>
2.8.5 Syntactico-semantic features
COREF:
The COREF feature allows to express the coreference indices used in distributional grammars to give account of the pronominal reference, the control of infinitive clauses, etc.
This feature can take the following values:
Values encoding coreference:
COI, COJ, COK, COL.
Values encoding no coreference:
CONOTI, CONOTJ, CONOTK, CONOTL.
Conventionally, when the feature COREF applies to an infinitival clause or VP, it refers to the empty subject of that Clause or VP.
Coreference must be resolvable: if a feature with a value COI is present on a phrase of a position in a given construction, there is at least one feature COI or CONOTI that corresponds to it on a phrase of another position in the same construction. COREF features do not impose the co-realization of the phrases that bear them. If we want to impose this co-realization, we will do it as usual by means of conditions applying to position realizations.
CONTROLT
No special distinction has been made at syntactic level between equi and raising constructions, but the feature CONTROLT at Construction level can be used to specify the kind of control.
Available values:
SUBJECTCONTROL, OBJECTCONTROL, RAISING, INDIRECTOBJECTCONTROL.
Example of Catalan subject control verb "voler" (want)
<Description
id="DVSnOinfcs"
representativemu="voler"
comment="Infinitival clausal complement controlled by the subject"
example="Els nois no volen venir">
<Construction
id="CSnOinfcs"
selfinsertion="1"
syntlabel="Clause">
<InstantiatedPositionC
range="0"
optional="NOO"
positionc="Snpcont">
<InstantiatedPositionC
range="1"
optional="NOO"
positionc="Ovpinfcont">
<SyntFeatureClosed
featurename="CONTROLT"
value="SUBJECTCONTROL">
<PositionC
id="Snpcont"
function="SUBJECT"
syntagmacl="NPcontrol">
<PositionC
id="Ovpinfcont"
function="OBJECT"
syntagmacl="VPinfcont">
<SyntagmaNTC
id="NPcontrol"
syntlabel="NP">
<SyntFeatureClosed
featurename="COREF"
value="COI">
<SyntFeatureOpen
featurename="ANIM"
value="ANYES">
<SyntagmaNTC
id="VPinfcont"
syntlabel="VP">
<SyntFeatureClosed
featurename="MOOD"
value="INFINITIVE">
<SyntFeatureClosed
featurename="COREF"
value="COI">
COORDT
Available values:
SIMPLE, INITIAL, NOTINITIAL, CORREL.
DECLIN
Available values:
DADJECTIVAL, DNOMINAL.
DEFIN
Available values:
DEF, INDEF.
DEFIN is a binary feature used in the Catalan lexicon to mark the definiteness of the NP and, in particular, to specify that the Direct Object of an Object Attributive verb must be a definite NP.
Sgml encoding:
<SyntagmaNTC
id="NPdef"
syntlabel="NP">
<SyntFeatureClosed
featurename="DEFIN"
value="DEF">
Example:
En Pol troba molt pesats els nens
* En Pol troba molt pesats nens
(Pol thinks kids are a pain)
FINITENESS
Available values:
FINITE, NOFINITE.
FORMATION
Available values:
FSIMPLE, FUSED.
FWFORM
Available values:
ORIGINAL, TRANSLITERATED.
INFLECT
Available values:
WEAK, STRONG, MIXED, INFL, NOINFL.
INFLECTION
Available values:
IWEAK, ISTRONG, BASIC, IMIXED, INFLECTED, NOINFLECTED, DOUBLE.
INFLECTION is used in the Greek lexicon to encode the fact that some adverbs subcategorize for an object realized either as a prepositional phrase or as a weak pronoun.
Sgml encoding:
<SyntagmaT id="SynProGen" syntlabel="PRO"> <SyntFeatureClosed value="PERSONAL"> <SyntFeatureClosed featurename="INFLECTION" value="IWEAK">
Example:
konta se mena
near to me
konta mou
near me
NDEGREE
Available values:
POSITIVE, COMPARATIVE, SUPERLATIVE, ABSOLUTESUPERLATIVE, APPRECIATIVE, INTENSIVE.
NDEGREE is used in the Catalan lexicon in order to identify comparative adverbs whose behaviour may be different from other adverbs: més (more), tan (so).
Sgml encoding:
<SyntagmaT
id="ADVcompar"
naming="ADVcompar"
syntlabel="ADV">
<SyntFeatureClosed
featurename="MORPHSUBCAT"
value="GENERAL">
<SyntFeatureClosed
featurename="NDEGREE"
value="COMPARATIVE">
Example:
Hem de procedir més sistemàticament
We have to procede in a more systematic way
* Jo hi he arribat més abans que tu
*I was here more before you
NREFLEXIVE
Available values:
RREFL, NOREFL.
For those who have chosen to explicitely use semantic properties right from the syntax level, it is possible to record these properties in the form of features.
MASPECT
Certain verbs appearing in the context of the entry are only used with a given aspect.
Ex: V[MASPECT:PROCESSIVE]
Available values:
PROCESSIVE, STATIVE, RESULTATIVE.
ASPECT
Certain verbs appearing in the context of the entry are only used with such or such aspect.
Ex: V[ASPECT:PERFECTIVE]
Available values:
PERFECTIVE, IMPERFECTIVE.
POLITENESS
Available value:
POLITE.
REGISTER
Available values:
FORMAL, OBSOLETE, RPOLITE.
SEX
Available values:
SEFEMININE, SEMASCULINE.
Free features are defined by a name and a value, they can be used when the lexicographer wants to encode features which does not correspond to any category described previously.
In the Catalan lexicon, the Free feature whose name is ANIM (animate) and whose value may either be ANYES and ANNO serves to express constraints on a position.
Example of relative pronoun qui, which is head of an animated NP.
Qui ha dit aixÚ menteix
Whoever said that, was lying
Sgml encoding:
<Description
id="DPronrelQUInoant"
representativemu="qui"
example="(El) qui ha dit aixÚ menteix"
comment="Relative pronoun with no antecedent. Head of an animated NP"
self="SELFPROrel"
construction="CNArtdefanim">
<Construction
id="CNArtdefanim"
naming="CNArtdefanim"
syntlabel="NP"
selfinsertion="1">
<InstantiatedPositionC
range="0"
optional="NOO"
positionc="NDetDef" >
<SyntFeatureOpen
featurename="ANIM"
value="ANYES">
Framesets were introduced by EAGLES to the original GENELEX model. They relate Descriptions (and Positions inside them) and are used to express deep syntactic relations like decausativisation, passivisation, etc. The PAROLE model leaves open the criteria used to group Descriptions in a FrameSet. This object is used to capture some generalizations on a set of regular alternations and to represent something like " deep-syntax structure ".
A FrameSet can consist of 2 or more Descriptions. The Positions and/or Syntagmas of these Descriptions can be linked to one another using Related objects.
Related objets serve to link 2 Positions and/or Syntagmas through RelElement1 and RelElement2 objects. One has to encode as many Related objects as the number of binary correspondances he wants to specify.
RelElement objects must specify:
Embedded Positions are reached by using WayToPosition / WayToSyntagma mechanism.
Example of entry for the Catalan verb 'albergar' (to lodge)
Examples:
(1) La dona alberg‡ el pobre a casa seva.
The woman lodged the beggar in her house
(2) La dona alberg‡ el pobre.
The woman lodged the beggar
(3) El pelegrÌ alberg‡ en un hostal.
The pilgrims lodged in a hostel
These sentences are described by means of the following Descriptions:
(1) Snp+V+Onp+ADVloc
(2) Snp+V+Onp
(3) Snp+V+ADVloc
The three Descriptions listed are collected in one Syntactic Unit and are related via Framesets.
Sgml encoding:
<SynU
id="USALBERGAR"
description="DVSnOnADVloc" <- Base Description
descriptionl="DVSnOn DVSnADVloc <- Derived Descriptions
framesetl="FSOp18 FSErg02"> <- List of FrameSets
FSOp18 is a FrameSet expressesing optionality of one complement. It relates Description #2 (DVSnOn) with the Base Description (DVSnOnADVloc)
<FrameSet
id="FSOp18"
example="col∑locar"
descriptionl="DVSnOnADVloc DVSnOn">
FSErg02 expresses an ergative relation between the Base Description and Description #3 (DVSnADVloc).
The second Position (P1) in DVSnOnADVloc is related with the first Position (P0) in DVSnADVloc. This accounts for the fact that these Positions impose the same kind of restrictions on the NP.
<FrameSet
id="FSErg02"
example="albergar"
descriptionl="DVSnOnADVloc DVSnADVloc">
<Related>
<RelElement1
description="DVSnOnADVloc">
<WayToPosition
targetposition="1">
<RelElement2
description="DVSnADVloc">
<WayToPosition
targetposition="0">
If one needs to go deeper, the solution is to specify through a WayToSyntagma the Syntagma from which the deeper Position must be reached and to use this mechanism recursively till the target Position and/or Syntagma is reached.
Two SynUs drawn from the same Mu can be connected by a named relational link that makes it possible to connect two distinct turns right from the syntactic level. This relational link is encoded through TransfSynU objects.
The link between Positions and/or Syntagmas of the connected SynUs is expressed using Related objects (refer to §2.9.2 for a description of the mechanism).
Two SynUs drawn from different Mus can be connected by a relational link. This type of relation corresponds to the coding of a syntactic derivation, typically nominalizations.
Example:
coller (Verbe)
le militant colle des affiches (Um01 SynU01)
the militant is sticking up posters
collage (Nom)
le militant fait du collage d'affiches (Um2 SynU02)
(nominalization)
SynU01 --------------------> SynU02
Since Constructions and SyntagmaNTCs are linked to one or several Positions, it is possible to express linear order constraints between their Positions or between Syntagmas that fill their position. This can be done through OrderConstraints objects.
OrderConstraint objects resemble closely Related objects. They specify linear order constraints between 2 Syntagmas and/or 2 Positions through Before and After objects.
Before and After objects must specify:
Embedded Positions are reached by using WayToPosition / WayToSyntagma mechanism (cf. §2.9.2).
An example of the Catalan Construction CAdvVPmod shows the case where the element OrderConstraint is used to impose a relative ordering between the verb and the ADVP. Remember that the list of positions (P0, P1,..) reflects only a conventionally established canonical linearity and not a surface ordering.
Example:
Tot es va decidir molt ràpidament
Everything was decided very quickly
Sgml encoding:
<Construction
id="CAdvVPmod"
naming="CAdvVPmod"
syntlabel="VP">
<OrderConstraint>
<Before>
<WayToPosition
targetposition="0">
<After>
<WayToPosition
targetposition="1">
<InstantiatedPositionC
range="0"
optional="NOO"
positionc="Hvp">
<InstantiatedPositionC
range="1"
optional="NOO"
positionc="Vmod">
As the GENELEX report states:, syntactic compounds have the following characteristics:
We will then consider that a "syntactic compounds" is a well-constructed phrase partially or fully constrained. While its inner structure is described by a non-terminal category phrase ("internal category"), on the contrary, its outer behaviour may be compared to that of a terminal phrase ("external category").
The simple or compound nature of an entry does not affect the recording of a Construction specifying its complementation or its context. Compound syntactic units are therefore described just like simple syntactic units as regards their external behaviour.
What is specific for them is the fact that the described entry (Self) is not atomic but composite (multi-lexical).
Lists of components (Composition and RCompose) manage the multi-lexical aspect.
SyntagmaNTS manage the encoding of internal structure of the compound (example: Noun Adjective compound).
What formally distinguishes compound syntactic units from simple syntactic units is only:
• the presence of lists of components on the SynU,
• the presence of an internal structure on the Self of the SynU.
On the other hand, they are described by the same formalism as simple syntactic units (Description).
The articulation between the external syntactic behaviour of the compound and the description of its internal behaviour is drawn using the entity Self that is present both on simple and compound units.
Components that lexicalize the internal structure of Compound Syntactic Units are explicitely listed on the SynU. Components are either of the SynU or Mu type. A component is of the SynU type when we want it to inherit, totally or partially, from its syntactic behaviour. Otherwise the component is of the Mu type.
To record variants of lexicalization, we use two levels of lists.
Ex: ( (avoir admiration) (to have, to feel... admiration)
(éprouver admiration)
(ressentir admiration)
(vouer admiration) )
The first level of the list is that of lexicalization alternatives (Composition).
The second level of the list is that of lexicalized components for each alternative (RComposMu/RComposeSynU).
Example:
<SynU
id="SYNU1"
description="D1">
<Composition>
<RComposeMu
mu="eprouver1">
<RComposeMu
mu="admiration1"><\Composition>
<Composition>
<RComposeMu
mu="avoir1">
<RComposeMu
mu="admiration1"><\Composition><\SynU>
Referring to the component is made by double indexing: [RefLex: Nth alternative . Nth component].
Ex: [RefLex:1.1]--->avoir
Index 0 on the alternatives axis means that all of them are selected.
Ex: [RefLex:0.1]--->avoir,éprouver,
ressentir,vouer
Note: In the appended examples, we have decided to explicitly indicate lexicalization to ensure a certain level of legibility: [RefLex:mot], rather than using the notation: [RefLex : Nième alternative . Nième composant]. This comment applies to the whole family of RefLex and includes RefPrep, RefConj, etc.
2.12.2 Internal structure of compounds
The internal category of the compound is borne by SyntagmaNTS that describes the internal structure of Self. The external category of the compound is borne by the Phrase(s) of its IntervConst.
Ex : boîte aux lettres (internal category Nbarre, external category N)
mail box
In the PAROLE model, the internal base structure of compounds is described in Self by a "non-terminal structure phrase" (SyntagmaNTS). This phrase is characterized by its label, the features that are associated with it, and it can be rewritten by a list of positions, in the same way as all non-terminal phrases.
Ex : la plupart internal structure NP
(most)
tarte à la crème internal structure Nbar
(slapstick)
pierre qui roule n'amasse pas mousse internal structure Clause
(a rolling stone gathers no moss)
mettre en œuvre internal structure VP
(to implement)
en connaissance de cause internal structure PP
(with full knowledge of the facts)
sain et sauf internal structure AP
(safe and sound)
plus ou moins internal structure ADVP
(more or less)
These phrases are perfectly identical to non-terminal Phrases that appear in Positions describing simple units, except the following:
For the remaining part (tree rewriting, labelling of nodes, coordination, etc.), refer to the paragraph on Phrases in the section on simple syntactic units.
Example:
<SynU
id="SYNU1"
description="D1">
<Composition>
<RComposeMu
mu="eprouver1">
<RComposeMu
mu="admiration1"><\Composition>
<Composition>
<RComposeMu
mu="avoir1">
<RComposeMu
mu="admiration1"><\Composition><\SynU>
<Description
id=D1
self=S1
construction=C1>
<Self
id=S1
syntagmants=SNTS1>
<SyntagmaNTS
id=SNTS1>
<InstantiatedPositionS
range=0
positions=PS0>
<InstantiatedPositionS
range=1
positions=PS1>
<PositionS
id=PS0
function=HEAD
syntagmasl=ST0>
<PositionS
id=PS1
function=OBJECT
syntagmasl=SNTS1>
<SyntagmaT
id=ST0
syntlabel=V>
<SyntagmaNTS
id=SNTS1
syntlabel=VP>
Re-entering phenomena are represented by Insertions of Positions of the base Construction into the internal structure of the compound.
Ex : mettre NP en marche
(to start up)
prendre NP en compte
(to take NP into account)
mettre NP en œuvre
(to settle)
Insertions of Positions of the external Construction are recorded in SyntagmaNTS. An Insertion is therefore both a pointer to a Position of the external Construction and an insertion point mark. They may have several Insertions. Just as Positions, Insertions are numbered from 0 to n.
Pointing to a Position of the external Construction is done through the WayToPosition mechanism.
The insertion point is indicated through a number corresponding to the internal position after which the Insertion takes place.
The Insertion may be mandatory or not, the attribute ‘mandatory’ serves to encode this information.
Example:
<Description
id=D2
self=S2
construction=C2>
<Self
id=S2
syntagmants=SNTS2>
<Construction
id=C2>
<InstantiatedPositionC
range=0
positionc=PSN0>
<InstantiatedPositionC
range=1
positionc=PSN1><\Construction>
<SyntagmaNTS
id=SNTS2>
<InstantiatedPositionS
range=0
positions=Pmettre0>
<InstantiatedPositionS
range=1
positions=Pen_marche1>
<Insertion
suitposition=0
mandatory=NO
<WayToPosition
tagetposition=1><\SyntagmaNTS>
Elements that may be interdependent, i.e. they form an indivisible sequence prohibiting any interpolation, apposition or floating quantifier, are Position, Self, Insertion.
This indivisible sequence is represented in a field ‘solidarity’ within the compound by a graphical string with a dash between interdependent elements. It is therefore possible to have a solidarity field with the value:
P0 I0-I2-P1
All restricting features defined by simple units are available for compound units. Some of them, however, need to be better explained.
The lexicalization of terminal or non-terminal syntagmas entering in the description of the syntactic compound is expressed by lexical features borne by those syntagmas. It is possible to re-use all the features defined for simple units, i.e. Lex, Prep, Conj, ProRel, ProIntrog, Introd. Moreover, the features RefLex, RefPrep, RefConj, RefProRel, RefProIntrog and RefIntrod are available in ‘reflection’. These "Ref" features have exactly the same semantics as their correspondents but they point at Morphological Units (Mus) or Syntactic Units (SynUs) not directly, but indirectly by rank numbers in lists of components (refer to the paragraph on the lists of components). This indirection has been introduced so as to be able to share the internal structures (N à N (N to N), N de N (N of N), etc.) independently from their actual lexicalization. RefLex features are only borne by Phrases of the internal structure and are excluded from the external construction.
Ex : voie lactée
carte bleue
cerveau malade
N[RefLex:1.1] ADJ[RefLex:1.2]
Lexicalizing elements need not be dealt with in the list of components. It is possible to specify the prepositions and determiners (including partitive determiners) in the internal structure of the compound rather than in its list of components. In that case, the specifications are expressed like for simple units using the features of the "Lex" family (e.g. PP[prep:à]) or using subcategory features (e.g. DET[SsCatMorph:PARTITIVE]).
As a general rule, compounds (whether they are morphological or syntactic compounds) are units with a complex lexical realization. As units, one has to describe their way of functioning by specifying their complementation pattern or their occurrence context. From that point of view, they behave exactly just as simple units and all the mechanisms that have been previously defined for these simple units also apply to compounds.
In the Construction, Self is used to refer to the compound. In other words, the process is the same as for simple units, except that in the composition, Self also gives the Inner Structure of the unit considered as a whole in the base Construction.
3.1 Syntactic Unit, Description and Self

3.4 SyntagmaNTS, PositionS and Insertion
<!DOCTYPE Parole [
<!ELEMENT Parole - O
( CombUF* , ParoleMorpho? , ParoleSyntaxe? , ParoleSemant?)>
<!ATTLIST Parole
lexiconname CDATA #REQUIRED
language CDATA #REQUIRED
version CDATA #IMPLIED
creationdate1 CDATA #IMPLIED
creationdateglx CDATA #IMPLIED
modificationdate CDATA #IMPLIED
property CDATA #IMPLIED
copyright CDATA #IMPLIED
integrity (YES|NO|WITHOUTB) WITHOUTB
certification CDATA #IMPLIED>
<!-- ***********************************************************
As a general rule, in the whole file:
- naming : allows to name the object in a comprehensible
and, if possible, univocal way
- example : allows to illustrate the use (quotation, example
taken from a dictionary or a linguist)
- comment : free field for the user
*********************************************************** -->
<!ELEMENT CombUF - O EMPTY>
<!ATTLIST CombUF
id ID #REQUIRED
dating (WITHOUTD|ARCHAIC|
OLD|MODERN) WITHOUTD
style (WITHOUTNL|FAMILIAR|
VULGAR|SLANG|POPULAR|
LITERARY|SCHOLARLY|
STANDARD|ORAL|
KATHAREVOUSSA) WITHOUTNL
frequency (WITHOUTF|RARE|USUAL) WITHOUTF
vargeog CDATA #IMPLIED>
<!-- CombUF serves to express usage features. They are linked to:
morphological objects:
MuS, MuCont, MuC, MuAff, Gmu, Pmu and CombMFCif
syntactic objetcs:
semantic objects:
-->
<!-- ********************************************************* -->
<!-- ***** MORPHOLOGICAL INFORMATION ***** -->
<!-- ***** ParoleMorpho ***** -->
<!-- ********************************************************* -->
<!ELEMENT ParoleMorpho - O
((MuS|MuC|MuCont|MuAff)* &
Etymon* &
GInP* &
PInP* &
CombMF* &
InPc* &
CombComb*)>
<!-- *************************************************** -->
<!-- ******* DEFINITION OF MORPHOLOGICAL UNITS ****** -->
<!-- *************************************************** -->
<!ELEMENT MuS - O ((Gmu|Pmu)+
& Derivation* & ShortForm*)>
<!ATTLIST MuS
id ID #REQUIRED
naming CDATA #IMPLIED
attestation CDATA #IMPLIED
combuf IDREF #IMPLIED
etymonlist IDREFS #IMPLIED
gramcat (WITHOUTC|NOUN|VERB|
ADJECTIVE|PRONOUN|
ADVERB|ADPOSITION|
CONJUNCTION|NUMERAL|
DETERMINER|ARTICLE|
INTERJECTION|RESIDUAL|
UNIQUE|ADADJECTIVE|
POSTADVERB) WITHOUTC
gramsubcat (WITHOUTSC|PROPER|
COMMON|MAIN|AUX|MODAL|
COPULA|QUALI|
POSSESSIVE|DEMONSTRATIVE|
INTERROGATIVE|RELATIVE|
RECIPROCAL|EXCLAMATIVE|
REFLEXIVE|PERSONAL|
SSCPARTITIVE|INDEFINITE|
DEFINITE|GENERAL|PARTICLE|
PRONOMINAL|CIRCUMPOSITION|
POSTPOSITION|PREPOSITION|
COORDINATIVE|SUBORDINATIVE|
CARDINAL|ORDINAL|VPART|
FRACT|FOREIGN|FORMULA|
IMPERSONAL|RELINDEF|
SUBJUNCTION|NEGATION|
MULTIPLICATIVE|ANALOG|
SYMBOL|COLLECTIVE|MEDIAL|
SSCCOMPARATIVE|ANSWER|
INFINM|COMPM|NEGM|EN|HI|HO|
ABBREVIATION|MEDIOPASSIVE|
ACRONYM|PERSONALSUBJECT|
PERSONALOBJECT|CLASSIFYING|
NORMAL|NONINFLECTING|
SSCSTRONG|SSCWEAK|POSITION|
INFMARK|SSCNUMERAL|
NONMODAL|SSCFUTURE|
FORMALSUBJECT|OTHER) WITHOUTSC
autonomy (WITHOUTB|YES|NO) WITHOUTB
synulist IDREFS #IMPLIED>
<!-- The content token, (Gmu|Pmu)+,
indicates that a Simple
Morphological Unit must have either
a Graphical Unit or a
Phonemic Unit.
The content token, Derivation, indicates
the possible
derivations that are associated with
a MuS.
The content token, ShortForm indicates
those
relations that a MuS may have with
other Units that are
abbreviated forms. -->
<!ELEMENT MuC - O (RCompos+ & ShortForm*)>
<!ATTLIST MuC
id ID #REQUIRED
naming CDATA #IMPLIED
attestation CDATA #IMPLIED
combuf IDREF #IMPLIED
etymonlist IDREFS #IMPLIED
gramcat (WITHOUTC|NOUN|VERB|
ADJECTIVE|PRONOUN|
ADVERB|ADPOSITION|
CONJUNCTION|NUMERAL|
DETERMINER|ARTICLE|
INTERJECTION|RESIDUAL|
UNIQUE|ADADJECTIVE|
POSTADVERB) WITHOUTC
gramsubcat (WITHOUTSC|PROPER|
COMMON|MAIN|AUX|MODAL|
COPULA|QUALI|
POSSESSIVE|DEMONSTRATIVE|
INTERROGATIVE|RELATIVE|
RECIPROCAL|EXCLAMATIVE|
REFLEXIVE|PERSONAL|
SSCPARTITIVE|INDEFINITE|
DEFINITE|GENERAL|PARTICLE|
PRONOMINAL|CIRCUMPOSITION|
POSTPOSITION|PREPOSITION|
COORDINATIVE|SUBORDINATIVE|
CARDINAL|ORDINAL|VPART|
FRACT|FOREIGN|FORMULA|
IMPERSONAL|RELINDEF|
SUBJUNCTION|NEGATION|
MULTIPLICATIVE|ANALOG|
SYMBOL|COLLECTIVE|MEDIAL|
SSCCOMPARATIVE|ANSWER|
INFINM|COMPM|NEGM|EN|HI|HO|
ABBREVIATION|MEDIOPASSIVE|
ACRONYM|PERSONALSUBJECT|
PERSONALOBJECT|CLASSIFYING|
NORMAL|NONINFLECTING|
SSCSTRONG|SSCWEAK|POSITION|
INFMARK|SSCNUMERAL|
NONMODAL|SSCFUTURE|
FORMALSUBJECT|OTHER) WITHOUTSC
synulist IDREFS #IMPLIED>
<!-- A Compound Morphological
Unit has no Gmu or Pmu of its own:
these graphic and phonemic forms
are deduced from the Units
which make up the Compound Unit.
Each Component that participates
in the MuC is indicated
by an RCompos relationship.
A MuC consists at least of 2 Rcompos
(which the DTD does not show) -->
<!ELEMENT MuCont - O ((Gmu|Pmu)+
& RCompos+)>
<!ATTLIST MuCont
id ID #REQUIRED
naming CDATA #IMPLIED
attestation CDATA #IMPLIED
combuf IDREF #IMPLIED
etymonlist IDREFS #IMPLIED
obligatory (WITHOUTB|YES|NO) WITHOUTB>
<!-- A Contracted Morphological Unit is associated with the
elements that have been incorporated in the contraction by
way of the R_Compose relationship.
A MuCont has a Gmu or/and a Pmu which express(es) the resulting form of
the contraction.
The attribute, obligatory indicates whether
the use of the contraction, as opposed to the corresponding
expanded form, is mandatory or optional. -->
<!ELEMENT MuAff - O ((Gmu|Pmu)+ & SelectGramCat* & ResultGramCat* &
ResultGender* & MorphFeature*)>
<!ATTLIST MuAff
id ID #REQUIRED
naming CDATA #IMPLIED
attestation CDATA #IMPLIED
combuf IDREF #IMPLIED
etymonlist IDREFS #IMPLIED
typaff (WITHOUTS|PREFIX|
SUFFIX|INFIX|BASE) WITHOUTS
mustbeattachedto (WITHOUTMAT|STEM|VOICE|
TENSEMOOD|DEGREE|
MATNUMBER|NONFINITE|
CASE|STEMORNONFINITE|
NUMBERORNONFINITE) WITHOUTMAT
mustbefollowedby (WITHOUTMFB|MFBNUMBER|CASE2|
TENSEMOOD2|NONFINITE2|
PERSON|TENSEMOODORNONFINITE) WITHOUTMFB
maybeattachedto (WITHOUTMBT|CASE3|POSS|
PERSON2|IMPERATIVE|
CLITICPARTICLE|CASEORPOSSORPERSONORIMPERATIVE|
CASEORPOSSORPERSONORIMPERATIVEORCLITICPARTICLE)
WITHOUTMBT
affsemulist IDREFS #IMPLIED>
<!-- The attribute, typaff records the type of
a Morphological Affix Unit; in the case in which an affix
may be typed only within its derivation context, this
attribute will have the value, WITHOUTS.
The context tokens, Result/SelectGramCat and ResultGender indicate for a given Morphological Affix Unit
possible restrictions concerning the grammatical category
and the gender of the Units which result from the
derivation.
If MuAff are linked to GInP, the corresponding combinaison of features
may be indicated using MorphFeature object. In this case, one can use
mustbefollowedby, maybeattachedto, mustbeattachedto attributes to encode the using constraints which affects the MuAff.
In the case the MuAff is used to encode inflection paradigms, (it is the
case, for instance, in the Finnish morphology), the MorphFeature objects
serve to express the morphological features corresponding to the MuAff. The attributes mustbeattachedto, mustbefollowedby,
maybeattachedto serves to encode the conditions that allow the concatenation of MuAff -->
<!ELEMENT ResultGramCat - O EMPTY>
<!ATTLIST ResultGramCat
gramcat (WITHOUTC|NOUN|VERB|
ADJECTIVE|PRONOUN|
ADVERB|ADPOSITION|
CONJUNCTION|NUMERAL|
DETERMINER|ARTICLE|
INTERJECTION|RESIDUAL|
UNIQUE|ADADJECTIVE|
POSTADVERB) WITHOUTC>
<!ELEMENT SelectGramCat - O EMPTY>
<!ATTLIST SelectGramCat
gramcat (WITHOUTC|NOUN|VERB|
ADJECTIVE|PRONOUN|
ADVERB|ADPOSITION|
CONJUNCTION|NUMERAL|
DETERMINER|ARTICLE|
INTERJECTION|RESIDUAL|
UNIQUE|ADADJECTIVE|
POSTADVERB) WITHOUTC>
<!ELEMENT ResultGender - O EMPTY>
<!ATTLIST ResultGender
gender (WITHOUTG|MASCULINE|FEMININE|
NEUTER|GCOMMON|MF|
CONT|INDISCRIMINATE|
OO|INANIMATE|NONMASCULINE|NONNEUTER) WITHOUTG>
<!-- ************************************************** -->
<!-- ********* GRAPHICAL FORM / PHONIC FORM ******* -->
<!-- ************************************************** -->
<!ELEMENT Gmu - O (Spelling & GStem*)>
<!ATTLIST Gmu
range NUMBER #IMPLIED
reference (WITHOUTB|YES|NO) WITHOUTB
naming CDATA #IMPLIED
attestation CDATA #IMPLIED
combuf IDREF #IMPLIED
inp IDREF #REQUIRED
corresplist NUMBERS #IMPLIED>
<!ELEMENT Pmu - O (Spelling & PStem*)>
<!ATTLIST Pmu
range NUMBER #IMPLIED
reference (WITHOUTB|YES|NO) WITHOUTB
naming CDATA #IMPLIED
attestation CDATA #IMPLIED
combuf IDREF #IMPLIED
inp IDREF #REQUIRED
corresplist NUMBERS #IMPLIED>
<!-- In the case a Unit has Graphic and/or Phonic
variants, that is either several Gmu and or Pmu, these
Gmu and Pmu will have an attribute identifying the rank
of the variant (range). The relationship between a Gmu and a
Pmu is established by a list of integers, corresplist.
If there is a preferred form among the variants, this can be
recorded in the attribute, reference.
The field, inp, indicates the system of inflection. In the case of
those Units that are not inflected (prepositions, ...)
the mf field contains an empty value: such as "mf_empty".
The different stems of the gmu can be encoded using P/GStem objetcs. -->
<!ELEMENT GStem - O (Spelling)>
<!ELEMENT PStem - O (Spelling)>
<!ATTLIST GStem
range NUMBER #REQUIRED
contextvar CDATA #IMPLIED
back (WITHOUTB|NO|YES) WITHOUTB
stemtype (WITHOUTST|BASE|SSG|
WSG|SPL|WPL|CON) WITHOUTST
dstem (WITHOUTDS|PVE|CVE|SVE) WITHOUTDS
vv (WITHOUTVV|NOVV|YESVV) WITHOUTVV
endingv (WITHOUTEV|NOEV|YESEV) WITHOUTEV
basei (WITHOUTBI|NOBI|YESBI) WITHOUTBI
syll1p3 (WITHOUTS1p3|NOSY1|YESSY1) WITHOUTS1p3
syll2p3 (WITHOUTS2p3|NOSY2|YESSY2) WITHOUTS2p3
vq (WITHOUTVQ|VA|VO|VU|VE|
VI|VAUML|VOUML|VY) WITHOUTVQ
stemc (WITHOUTSC|NOST|YESST) WITHOUTSC
ending2v (WITHOUT2V|NOEN|YESEN) WITHOUT2V
px (YESPX|NOPX|CAN|WITHOUTPX) WITHOUTPX
px3 (WITHOUTPX3|NOP|YESP) WITHOUTPX3
vqending (WITHOUTVQE|VQA|VQAUML|VQE) WITHOUTVQE
vstem (WITHOUTVS|SSGVST|WSGVST|SSGPAST|
WSGPAST|SSGCOND|PASS|CONVST|
CONPOTN|SSGINF) WITHOUTVS
has2v (WITHOUT2|NO2V|YES2V) WITHOUT2
cq (WITHOUTCQ|CQH|CQL|CQN|
CQR|CQS|CQT) WITHOUTCQ
stempotn (WITHOUTSP|NOPO|YESPO) WITHOUTSP
steminf (WITHOUTSI|NOIN|YESIN) WITHOUTSI
gradation (WITHOUTGR|WEAK|
STRONG|STRONGPAST) WITHOUTGR>
<!ATTLIST PStem
range NUMBER #REQUIRED
contextvar CDATA #IMPLIED
back (WITHOUTB|NO|YES) WITHOUTB
stemtype (WITHOUTST|BASE|SSG|
WSG|SPL|WPL|CON) WITHOUTST
dstem (WITHOUTDS|PVE|CVE|SVE) WITHOUTDS
vv (WITHOUTVV|NOVV|YESVV) WITHOUTVV
endingv (WITHOUTEV|NOEV|YESEV) WITHOUTEV
basei (WITHOUTBI|NOBI|YESBI) WITHOUTBI
syll1p3 (WITHOUTS1p3|NOSY1|YESSY1) WITHOUTS1p3
syll2p3 (WITHOUTS2p3|NOSY2|YESSY2) WITHOUTS2p3
vq (WITHOUTVQ|VA|VO|VU|VE|
VI|VAUML|VOUML|VY) WITHOUTVQ
stemc (WITHOUTSC|NOST|YESST) WITHOUTSC
ending2v (WITHOUT2V|NOEN|YESEN) WITHOUT2V
px (YESPX|NOPX|CAN|WITHOUTPX) WITHOUTPX
px3 (WITHOUTPX3|NOP|YESP) WITHOUTPX3
vqending (WITHOUTVQE|VQA|VQAUML|VQE) WITHOUTVQE
vstem (WITHOUTVS|SSGVST|WSGVST|SSGPAST|
WSGPAST|SSGCOND|PASS|CONVST|
CONPOTN|SSGINF) WITHOUTVS
has2v (WITHOUT2|NO2V|YES2V) WITHOUT2
cq (WITHOUTCQ|CQH|CQL|CQN|
CQR|CQS|CQT) WITHOUTCQ
stempotn (WITHOUTSP|NOPO|YESPO) WITHOUTSP
steminf (WITHOUTSI|NOIN|YESIN) WITHOUTSI
gradation (WITHOUTGR|WEAK|
STRONG|STRONGPAST) WITHOUTGR>
<!-- the radical has two functions:
- it is used by the GinP.PinP (graphic/phonic system of
inflection) to calculate the inflected forms.
A radical that is the same as the label, Spelling, of the
G/Pmu is not recorded as a radical element, but
simply as the label; one can however, refer to it as
the 0th radical. The attributes 'back', 'stemtype', ..., 'gradation' are used to encode Finnish system of inflection
- it is used in the derivation process
In the case a G/Pmu has several stems, the attribute 'range' serves to identify the rank of the stems.
The attribute contextvar can be used to encode the context in which the stem will be used. -->
<!-- ************************************************** -->
<!-- *********** ETYMOLOGY ************** -->
<!-- ************************************************** -->
<!ELEMENT Etymon - O (Spelling?)>
<!ATTLIST Etymon
id ID #REQUIRED
language CDATA #IMPLIED
meaning CDATA #IMPLIED
date CDATA #IMPLIED
naming CDATA #IMPLIED>
<!-- ************************************************** -->
<!-- ***** GRAPHIC AND PHONIC SYSTEM OF INFLECTION *** -->
<!-- ************************************************** -->
<!ELEMENT GInP - O (CombMFCif+)>
<!ATTLIST GInP
id ID #REQUIRED
comment CDATA #IMPLIED
example CDATA #IMPLIED
naming CDATA #IMPLIED
affmorphfeaturel IDREFS #IMPLIED>
<!ELEMENT PInP - O (CombMFCif+)>
<!ATTLIST PInP
id ID #REQUIRED
comment CDATA #IMPLIED
example CDATA #IMPLIED
naming CDATA #IMPLIED
affmorphfeaturel IDREFS #IMPLIED>
<!-- The attribute affmorphfeaturel refers to a list of MuAff which corresponds to the set of
relevant affixes for morphological features, pertaining to the concerned GInP.
A G/PinP contains at least one CombMfCif -->
<!ELEMENT CombMFCif - O (Cif+)>
<!ATTLIST CombMFCif
combmf IDREF #REQUIRED
combuf IDREF #IMPLIED>
<!-- A CombMFCif refers to a CombTM (Combination of Morphological Features) via the
'combmf' feature and to one or more Cif (calculation of inflected form).
A CombUF (Combination of Usage Features) can indicate the context in which the form is
available (attribute 'combuf') -->
<!ELEMENT Cif - O (Removal, AddedBefore, AddedAfter) >
<!ATTLIST Cif
range NUMBER #IMPLIED
stemind NUMBER #IMPLIED
contextvar CDATA #IMPLIED
corresplist NUMBERS #IMPLIED>
<!-- The attributes 'range' and 'corresplist'
are used to associate possible
variations in the inflected forms.
Ex: calculation of je peux/je puis (two forms of "I can")
The attribute 'stemind' indicates which
radical, the range attribute of G/PStem, is used to form the
inflected form.
A value of 0 refers to the attribute 'lib' of the G/Pmu. -->
<!ELEMENT MorphFeature - O EMPTY>
<!ATTLIST MorphFeature
featurename CDATA #REQUIRED
featurevalue CDATA WITHOUTFV>
<!-- The element MorphFeature refers to a morphological feature attribute name and value -->
<!ELEMENT Spelling O O (#PCDATA)>
<!ELEMENT AddedBefore O O (#PCDATA)>
<!ELEMENT AddedAfter O O (#PCDATA)>
<!ELEMENT Removal O O (#PCDATA)>
<!ELEMENT CombMF - O EMPTY>
<!ATTLIST CombMF
id ID #REQUIRED
gender (WITHOUTG|MASCULINE|FEMININE|
NEUTER|GCOMMON|MF|
CONT|INDISCRIMINATE|
OO|INANIMATE|NONMASCULINE|NONNEUTER) WITHOUTG
number (WITHOUTN|SINGULAR|PLURAL|
INVARIANT) WITHOUTN
case (WITHOUTC|NOMINATIVE|GENITIVE|
DATIVE|ACCUSATIVE|VOCATIVE|
OBLIQUE|PREPOSOBJ|REFL|
UNMARKED|PARTITIVE|INESSIVE|
ELATIVE|ILLATIVE|ADESSIVE|
ABLATIVE|ALLATIVE|ESSIVE|
TRANSLATIVE|INSTRUCTIVE|
ABESSIVE|COMITATIVE|ORDINARY|
OBJECT) WITHOUTC
defin (WITHOUTD|DEF|INDEF) WITHOUTD
inflect (WITHOUTI|WEAK|STRONG|
MIXED|INFL|NOINFL) WITHOUTI
finiteness (WITHOUTF|FINITE|NOFINITE) WITHOUTF
mood (WITHOUTM|INDICATIVE|
SUBJUNCTIVE|IMPERATIVE|
CONDITIONAL|INFINITIVE|
PARTICIPLE|GERUND|SUPINO|
PRESPART|PASTPART|
INFLECINF|CONJUNCTIVE) WITHOUTM
tense (WITHOUTT|PRESENT|IMPERFECT|
FUTURE|PAST|PLUSQUEPARFAIT) WITHOUTT
mvf (WITHOUTMV|TRS|INT|IMP) WITHOUTMV
person (WITHOUTP|1|2|3|4|NEG) WITHOUTP
aspect (WITHOUTA|PERFECTIVE|
IMPERFECTIVE) WITHOUTA
voice (WITHOUTV|ACTIVE|PASSIVE) WITHOUTV
reflexive (WITHOUTR|RREFL|NOREFL) WITHOUTR
degree (WITHOUTDEG|POSITIVE|
COMPARATIVE|SUPERLATIVE|
ABSOLUTESUPERLATIVE|
APPRECIATIVE|INTENSIVE) WITHOUTDEG
possessor (WITHOUTPOS|SING|PLUR|
SG1|SG2|PL1|PL2|SGPL3) WITHOUTPOS
politeness (WITHOUTPL|POLITE) WITHOUTPL
funct (WITHOUTFU|ATTRIBUTIVE|
PREDICATIVE|ADVERBIAL|
ADJECTIVAL|NOMINAL|FUPRONOMINAL|
DETERMIN|VERBAL|NONATTR) WITHOUTFU
subordt (WITHOUTSU|PLUSINFVE|
COMPAR|PLUSFIN) WITHOUTSU
coordt (WITHOUTCO|SIMPLE|INITIAL|
NOTINITIAL|CORREL) WITHOUTCO
formation (WITHOUTFO|FSIMPLE|FUSED) WITHOUTFO
whtype (WITHOUTW|WHINT|REL|WHNO) WITHOUTW
inflection (WITHOUTIN|IWEAK|ISTRONG|BASIC|
IMIXED|INFLECTED|NOINFLECTED|DOUBLE) WITHOUTIN
zuincl (WITHOUTZ|NORM|ZU) WITHOUTZ
tensemood (WITHOUTTM|TMPRESENT|TMPAST|
TMCONDITIONAL|TMIMPERATIVE|
TMPOTENTIAL) WITHOUTTM
nonfinite (WITHOUTNF|INFINITIVEONE|INFINITIVETWO|
INFINITIVETHREE|PARTICIPLEONE|PARTICIPLETWO|
AGENTPARTICIPLE) WITHOUTNF
declin (WITHOUTDE|DADJECTIVAL|DNOMINAL) WITHOUTDE
genderposs (WITHOUTGP|GPCOMMON|GPNEUTER) WITHOUTGP
fwform (WITHOUTFW|ORIGINAL|TRANSLITERATED) WITHOUTFW
register (WITHOUTRE|FORMAL|OBSOLETE|
RPOLITE) WITHOUTRE
cliticparticle (WITHOUTCP|CPYES) WITHOUTCP
sex (WITHOUTSE|SEFEMININE|SEMASCULINE) WITHOUTSE
transcat (WITHOUTTR|TRNOMINAL|
TRADVERBIAL|TRADJECTIVAL) WITHOUTTR>
<!-- ************************************************ -->
<!-- ******* MORPHOLOGICAL DERIVATION *********** -->
<!-- ************************************************ -->
<!ELEMENT Derivation - O (RestrictMu* & RDeriv+)>
<!ATTLIST Derivation
naming CDATA #IMPLIED
comment CDATA #IMPLIED>
<!-- The content token 'RDeriv' is used to record the
different components of a derivation. Concurrent
derivations are indicated by recording several Derivation
elements on one derived Unit.
The content token 'RestrictMu' refers to the
derived unit.-->
<!ELEMENT RDeriv - O (RestrictMu*)>
<!ATTLIST RDeriv
linearorder NUMBER #IMPLIED
status (WITHOUTS|PREFIX|
SUFFIX|INFIX|BASE) WITHOUTS
gremovalend CDATA #IMPLIED
withdrawnpbeb CDATA #IMPLIED
mu IDREF #REQUIRED>
<!-- The field 'mu' indicates the component of the derivation.
'RestrictMu' applies here to that component.
The attribute 'linearorder' indicates the range of the Mu in the
derivation; the attributes 'gremovalend' and 'withdrawnpbeb' indicate
the modifications that apply to the g/pmu during the derivation
process -->
<!ELEMENT RestrictMu - O EMPTY>
<!ATTLIST RestrictMu
gmuindex NUMBER #IMPLIED
gstemindex NUMBER #IMPLIED
pmuindex NUMBER #IMPLIED
pstemindex NUMBER #IMPLIED>
<!-- In the context of a Morphological Unit, this element
expresses a restriction on that unit while allowing the
selection of a graphic and/or phonemic variant
(or a radical). -->
<!-- *********************************************** -->
<!-- *************** SHORT FORM *************** -->
<!-- *********************************************** -->
<!ELEMENT ShortForm - O EMPTY>
<!ATTLIST ShortForm
typeshort (WITHOUTTB|ABBREVIATION|
INITIALS|ACRONYM) WITHOUTTB
mu IDREF #REQUIRED>
<!-- The attribute 'mu' ("morphological unit"), indicates the Mu
which is the short form of that Mu which has the
relationship ShortForm -->
<!-- ********************************************** -->
<!-- ******* MORPHOLOGICAL COMPOSITION ********** -->
<!-- ********************************************** -->
<!ELEMENT RCompos - O (RestrictMu*)>
<!ATTLIST RCompos
linearorder NUMBER #REQUIRED
gsepar (ATTAQUEG|HYPHEN|
APOSTROPHE|SPACE|
JOIN|HYPHENSPACE|
HYPHENJOIN|HYPHENAPOSTROPHE|
HYPHENSPACEJOIN|
APOSTROPHEJOIN|
SPACEJOIN) ATTAQUEG
psepar (ATTAQUEP|LIAISONT|
LIAISONZ|LIAISONK|
LIAISONN|LIAISONR|
WORDFRONTIER) ATTAQUEP
mu IDREF #REQUIRED
inpc IDREF #REQUIRED>
<!-- The attribute 'mu' indicates a MuS/Cont/Agg/C (Um sub-
classes) component which participates in the composition.
The attribute 'linearorder' specifies
the position of the component in the composition.
The attributs 'g/psepar' ("graphic/phonemic seperators"),
gives the list of possible separators which may appear
before the component. -->
<!-- ********************************************** -->
<!-- **** systems of inflection *** -->
<!-- **** for composed morphological units *** -->
<!-- ********************************************** -->
<!ELEMENT InPc - O EMPTY>
<!ATTLIST InPc
id ID #REQUIRED
comment CDATA #IMPLIED
example CDATA #IMPLIED
naming CDATA #IMPLIED
combcomblist IDREFS #REQUIRED>
<!ELEMENT CombComb - O EMPTY>
<!ATTLIST CombComb
id ID #REQUIRED
contextvar CDATA #IMPLIED
combcpound IDREF #REQUIRED
combcponentlist IDREFS #REQUIRED>
<!-- The element, CombComb, establishes a relation between :
- a combination of inflectional features for a Compound
Morphological Unit, and
- one (or more in the case in which a compound allows
variations of inflection) combination of inflectional
features of a component.
The attribute, contextvar ("context of variant"), labels
the inflection variations for the componds.
Ex : des pare-soleil(s)
The plural of the compound is formed from
either the singular (old spelling) or the
plural (new spelling) of the component, soleil.
Indications such as "old spelling" or "new spelling" are
noted in the attribute, contextvar.
One must allow for a separator between the zones of this
CDATA type,the order of the zones must correspond to the
order of the IDREFS in the attribute, combcponentlist
("compound component list"):
"old spelling | new spelling" -->
<!-- ********************************************************* -->
<!-- ***** SYNTACTIC INFORMATION ***** -->
<!-- ***** PAROLESYNTAXE ***** -->
<!-- ********************************************************* -->
<!ELEMENT ParoleSyntaxe - O (
(LexFeature* &
AuxFeature* &
RefLexFeature*)
,
(FrameSet* &
SynU+ &
Description+ &
Self+ &
IntervConst+ &
Construction* &
PositionC* &
PositionS* &
SyntagmaT* &
SyntagmaNTC* &
SyntagmaNTS*)
)>
<!ELEMENT FrameSet - O (Related*)>
<!ATTLIST FrameSet
id ID #REQUIRED
comment CDATA #IMPLIED
example CDATA #IMPLIED
naming CDATA #IMPLIED
descriptionl IDREFS #REQUIRED>
<!-- FrameSets serves to encode regular syntactic alternations.
They gather a set of Descriptions (attribute 'descriptionl').
Positions and/or Syntagma of these Descriptions can be linked
to one another through Related objects.
One has to encode as many Related objects as couple of Position
and/or Syntagma he wants to specify -->
<!ELEMENT Related - O (RelElement1, RelElement2)>
<!ELEMENT RelElement1 - O (WayToPosition)>
<!ATTLIST RelElement1
description IDREF #REQUIRED
access (INTERNE|EXTERNE) EXTERNE
syntagma IDREF #IMPLIED>
<!ELEMENT RelElement2 - O (WayToPosition)>
<!ATTLIST RelElement2
description IDREF #REQUIRED
access (INTERNE|EXTERNE) EXTERNE
syntagma IDREF #IMPLIED>
<!-- Related serves to link 2 Position and/or Syntagma through
RelElement objects.
RelElement serves to encode:
- the Description that encodes the Position and/or Syntagma
(attribute 'description'). This implies that, for a same Related,
the attribute 'description' of RelElement objects must not be filled by
the same value,
- the path to follow to reach the Position/Syntagma (WayToPosition object).
For compound SynU, the Position/Syntagma can be embedded in the
complex Self. In this case, the value of 'access' attribute is INTERNE.
- the Syntagma to take into account. -->
<!ELEMENT OrderConstraint - O (Before, After)>
<!-- OrderConstraint serves to encode linear order constraints between 2
Position/Syntagma.
The Position/Syntagma encoded in the Before object is interpreted as
following the Position/Syntagma encoded in the After object. -->
<!ELEMENT Before - O (WayToPosition?)>
<!ATTLIST Before
access (INTERNE|EXTERNE|SELF) EXTERNE
syntagma IDREF #IMPLIED>
<!ELEMENT After - O (WayToPosition?)>
<!ATTLIST After
access (INTERNE|EXTERNE|SELF) EXTERNE
syntagma IDREF #IMPLIED>
<!-- Before and After objects encode:
- the Syntagma to be reached,
- the path to reached the Position/Syntagma (WayToPosition objects).
If the position/Syntagma is (or is contained in) the Self, the value of 'access' attribute is SELF,
in case of compound SynU, this value can be INTERNE if the Position/Syntagma is embedded in the Self,
in other cases, the value is EXTERNE. -->
<!ELEMENT TransfSynU - O (Related*)>
<!ATTLIST TransfSynU
comment CDATA #IMPLIED
example CDATA #IMPLIED
naming CDATA #IMPLIED
sourcesynu IDREF #REQUIRED
targetsynu IDREF #REQUIRED>
<!-- TransSynUs gather 2 SynU connected by a transformation.
The correspondences between Position/Syntagma can be encoded
through Related objects. -->
<!-- ********************************************************* -->
<!-- ***** SYNTACTIC UNIT, DESCRIPTION ***** -->
<!-- ***** Correspondence with SEMANTIC ***** -->
<!-- ********************************************************* -->
<!ELEMENT SynU - O (Composition?, TransfSynU*, CorrespSynUSemU*)>
<!ATTLIST SynU
id ID #REQUIRED
comment CDATA #IMPLIED
example CDATA #IMPLIED
naming CDATA #IMPLIED
attestation CDATA #IMPLIED
combuf IDREF #IMPLIED
description IDREF #REQUIRED
descriptionl IDREFS #IMPLIED
framesetl IDREFS #IMPLIED>
<!-- The field 'attestation' allows to specify the source of the use
encountered (name or title of the dictionary, author text,
or linguistics article) -->
<!-- SynU describes one syntactic behaviour of a Mu.
One has to encode as many SynU as syntactic behaviours for a same Mu.
- The attribute 'description' records the base description,
- the list 'descriptionl' record the transformed descriptions,
- the list 'framesetl' records the related FrameSets,
- TransfSynU encodes the corresponding transformed SynU,
- for compound SynU, Composition encodes the components of the compound,
- CorrespSynUSemU encodes the correspondence with the semantic level. -->
<!ELEMENT CorrespSynUSemU - O EMPTY>
<!ATTLIST CorrespSynUSemU
targetsemu IDREF #REQUIRED
correspondence IDREF #IMPLIED>
<!-- The field 'targetsemu' records the id of the corresponding SemU -->
<!ELEMENT Description - O EMPTY>
<!ATTLIST Description
id ID #REQUIRED
comment CDATA #IMPLIED
example CDATA #IMPLIED
naming CDATA #IMPLIED
representativemu CDATA #IMPLIED
self IDREF #REQUIRED
construction IDREF #IMPLIED>
<!-- The attribute 'representativemu' records the id of MU,
the attribute 'self' records the id of the Self,
the attribute 'construction' records the id of the Construction -->
<!-- ********************************************************* -->
<!-- ***** SELF ***** -->
<!-- ********************************************************* -->
<!ELEMENT Self - O EMPTY>
<!ATTLIST Self
id ID #REQUIRED
comment CDATA #IMPLIED
example CDATA #IMPLIED
naming CDATA #IMPLIED
syntagmants IDREF #IMPLIED
syntagmantsl IDREFS #IMPLIED
intervconst IDREF #REQUIRED>
<!-- The field 'intervconst' gives the realizations of Self intervening
in the external construction:
- as filler of the construction if this latter describes
a context of occurrence into which Self inserts,
- as predicate associated with a construction describing a
complementation pattern
The field 'syntagmants' is only instantiated for compound SynUs,
it expresses their internal structure, if necessary reduced to
the syntagmatic label, by a 'SyntagmaNTS' with or without rewriting.
In the same way, the list 'syntagmantsl' only concern compound Usyns
and is used to record possible transformations on the internal
structure. -->
<!ELEMENT IntervConst - O EMPTY>
<!ATTLIST IntervConst
id ID #REQUIRED
function (HEAD|SUBJECT|OBJECT|
INDIRECTOBJECT|OBLIQUE|
SUBJPRED|OBJPRED|
NCOMP|NSUBJ|NOFCOMP|
NPREPCOMP|NAPPOSITION|
NADJUNCT|NCLAUSCOMP|
NDETERMINATIVE|NATTRIBUTIVE|
NMODIFIER|ACOMP|APREPCOMP|
ACLAUSCOMP|AADJUNCT|
AMODIFIER|ADVCOMP|
ADVPREPCOMP|ADVMODIFIER|
DETMODIFIER|PREPDEPENDENT|
CONJDEPENDENT|
PREPOBJ|
ADVERBIAL|COMPL|
CLAUSCOMP|
NGENATTRIBUTIVE|NLEFTATTRIBUTIVE|
NPOSTPCOMP|NRIGHTATTRIBUTIVE|
REALSUBJ) HEAD
throle CDATA #IMPLIED
syntagmatl IDREFS #REQUIRED>
<!-- The element 'IntervConst' is a 3-upple containing:
a function: attribute 'function'
a th-role: attribute 'throle'
one/several sytagmat: list syntagmatl
It is possible to express variations of realizations of Self:
Ex: N + [Number:PLURAL]
Ex: V
V + [PRONOMINAL:SE]
V + [PRONOMINAL:SEEN] -->
<!-- ********************************************************* -->
<!-- ***** CONSTRUCTION ***** -->
<!-- ********************************************************* -->
<!ELEMENT Construction - O (OrderConstraint*, InstantiatedPositionC*,
SyntFeatureClosed*, SyntFeatureOpen*)>
<!ATTLIST Construction
id ID #REQUIRED
comment CDATA #IMPLIED
example CDATA #IMPLIED
naming CDATA #IMPLIED
syntlabel (WITHOUTE|NP|VP|PP|
AP|ADVP|Clause|NG|DETP|
PSP) WITHOUTE
solidarity CDATA #IMPLIED
positionl (OPEN|CLOSED) CLOSED
selfinsertion NUMBER #IMPLIED
featurel IDREFS #IMPLIED>
<!-- A Construction describes the context or syntactic frame specific
to the entry described.
It is basically defined by a list of InstantiatedPositionC.
The attribute 'positionl' indicates whether the list of rewritten
positions is fully (CLOSED) or partially (OPEN) specified.
The attribute 'selfinsertion' indicates, when it is documented, the
insertion point of Self in the list of positions: before the rank
position corresponding to its value.
The attribute 'solidarity' indicates with dashes the pairs
of interdependent positions. -->
<!-- ********************************************************* -->
<!-- ***** POSITION AND INSTANTIATEDPOSITION ***** -->
<!-- ********************************************************* -->
<!ELEMENT InstantiatedPositionC - O EMPTY>
<!ATTLIST InstantiatedPositionC
range NUMBER #REQUIRED
optional (YESO|NOO|
RATHERNOO) YESO
positionc IDREF #REQUIRED>
<!-- InstantiatedPositionC are interfaces between SyntagmaNTC/Construction and
PositionC.
- The attribute 'positionc' encodes the PositionC
- The attribute 'range' encodes the range of the PositionC in the
Construction/SyntagmaNTC - the range of the first one is 0 -
- The attribute 'optional' specifies if the position is/is not optional -->
<!ELEMENT InstantiatedPositionS - O EMPTY>
<!ATTLIST InstantiatedPositionS
range NUMBER #REQUIRED
optional (YESO|NOO|
RATHERNOO) YESO
positions IDREF #REQUIRED>
<!-- InstantiatedPositionS are interfaces between SyntagmaNTS and PositionS.
- The attribute 'positions' encodes the PositionS
- The attribute 'range' encodes the range of the PositionS in the SyntagmaNTS - the range
of the first one is 0 -
- The attribute 'optional' specifies if the position is/is not optional -->
<!ELEMENT PositionC - O EMPTY>
<!ATTLIST PositionC
id ID #REQUIRED
comment CDATA #IMPLIED
example CDATA #IMPLIED
naming CDATA #IMPLIED
repetable (WITHOUTB|YES|NO) WITHOUTB
function (HEAD|SUBJECT|OBJECT|
INDIRECTOBJECT|OBLIQUE|
SUBJPRED|OBJPRED|
NCOMP|NSUBJ|NOFCOMP|
NPREPCOMP|NAPPOSITION|
NADJUNCT|NCLAUSCOMP|
NDETERMINATIVE|NATTRIBUTIVE|
NMODIFIER|ACOMP|APREPCOMP|
ACLAUSCOMP|AADJUNCT|
AMODIFIER|ADVCOMP|
ADVPREPCOMP|ADVMODIFIER|
DETMODIFIER|PREPDEPENDENT|
CONJDEPENDENT|
PREPOBJ|
ADVERBIAL|COMPL|
CLAUSCOMP|
NGENATTRIBUTIVE|NLEFTATTRIBUTIVE|
NPOSTPCOMP|NRIGHTATTRIBUTIVE|
REALSUBJ) HEAD
throle CDATA #IMPLIED
syntagmacl IDREFS #REQUIRED>
<!-- The attribute 'syntagmacl' refers the possible fillers of
'PositionC':
- terminal phrase (SyntagmaT')
- non-terminal phrase, the rewriting of which is or not
described ('SyntagmaNTC')
The attribute 'repetable' indicates if a position can be realized
several times
The attributes 'throle' and 'function' indicate the th-role and
the function of the Position in the Construction/SyntagmaNTC -->
<!ELEMENT PositionS - O EMPTY>
<!ATTLIST PositionS
id ID #REQUIRED
comment CDATA #IMPLIED
example CDATA #IMPLIED
naming CDATA #IMPLIED
repetable (WITHOUTB|YES|NO) WITHOUTB
function (HEAD|SUBJECT|OBJECT|
INDIRECTOBJECT|OBLIQUE|
SUBJPRED|OBJPRED|
NCOMP|NSUBJ|NOFCOMP|
NPREPCOMP|NAPPOSITION|
NADJUNCT|NCLAUSCOMP|
NDETERMINATIVE|NATTRIBUTIVE|
NMODIFIER|ACOMP|APREPCOMP|
ACLAUSCOMP|AADJUNCT|
AMODIFIER|ADVCOMP|
ADVPREPCOMP|ADVMODIFIER|
DETMODIFIER|PREPDEPENDENT|
CONJDEPENDENT|
PREPOBJ|
ADVERBIAL|COMPL|
CLAUSCOMP|
NGENATTRIBUTIVE|NLEFTATTRIBUTIVE|
NPOSTPCOMP|NRIGHTATTRIBUTIVE|
REALSUBJ) HEAD
throle CDATA #IMPLIED
syntagmasl IDREFS #REQUIRED>
<!-- The attribute 'syntagmasl' refers the possible fillers of
'PositionS':
- terminal phrase (SyntagmaT')
- non-terminal phrase, the rewriting of which is or not
described ('SyntagmaNTS')
The attribute 'repetable' indicates if a position can be realized
several times
The attributes 'throle' and 'function' indicate the th-role and
the function of the Position in the SyntagmaNTS -->
<!ELEMENT Insertion - O (WayToPosition)>
<!ATTLIST Insertion
suitposition NUMBER 0
mandatory (WITHOUTO|YES|NO) WITHOUTO
takeawaysyntagmal IDREFS #IMPLIED >
<!-- 'Insertion' in a structural phrase is only used to represent the
case of an insertion that refers to a position
described in the external syntactic construction:
the element 'WayToPosition' gives access to this position.
The value of attribute 'suitposition' indicates the rank of the internal
position after which the external position must be inserted.
The attribute 'takeawaysyntagma' indicate the removal of phrases
that can be applied to the referenced position:
Ex: the compound "mettre en oeuvre" has in its external
construction a position of direct object containing for instance
a noun phrase and a personal pronoun:
"mettre en oeuvre un processus", "le mettre en oeuvre"
(to implement a process, to implement it)
insertion is only possible for the noun phrase:
"mettre un processus en oeuvre"
The attribute 'mandatory' indicates, whether, during a realisation
of the referenced external position, the insertion phenomenon is
compulsory or optional -->
<!ELEMENT WayToSyntagma - O (WayToSyntagma?)>
<!ATTLIST WayToSyntagma
positionrange NUMBER 0
syntagma IDREF #REQUIRED>
<!-- This element allows to select a particular phrase.
Recursivity is used to go down in a possible rewriting. The result
of the element is always a phrase.
The positions are referred by 'positionrange' that indicates
their rank (= the value of 'range' attribute of the corresponding
InstantitedPositionC/S) in the list in which they occurs.
In the case only one Syntagma of the Position has to be selected,
the attribute 'syntagma' can be filled. -->
<!ELEMENT WayToPosition - O (WayToSyntagma?)>
<!ATTLIST WayToPosition
targetposition NUMBER 0>
<!-- For a given construction or a phrase, this element allows
to select one of its positions.
The element 'targetposition' indicates the selected position;
if this latter occurs in a phrase rewriting, the element
'WayToSyntagma' is used to reach that phrase -->
<!-- *********************************************************** -->
<!-- ***** TERMINAL AND NON-TERMINAL SYNTAGMAS ****** -->
<!-- *********************************************************** -->
<!-- 'SyntagmaT' is a terminal position filler,
'SyntagmaNTC/S' is a non-terminal position filler,
one will use its list of positions if one wants to
specify the way it should be rewritten.
Besides, 'SyntagmaNTS' is used to describe the internal structure
of a compound unit (field 'syntagmants' of the Self).
The SyntFeatureClosed/Open refer to restricting features and thus it
allows to specify on a phrase a set of constraints:
lexical, morphological, morpho-syntactical, syntactico-semantic,
and even semantic.
The field 'naming' allows to record the usual name of the phrase.
Ex : SyntagmaNTC
syntlabel = "Clause"
Feature = [MOOD:INFINITIVE]
naming = "phrase infinitive"
Category "e" allows adherents to the generative grammar to record
traces and to consider them as 'phantom' phrases
with which one can associate the necessary restrictions.
Ex : SyntagmaT
syntlabel = "e"
Feature = [Personne:3][Nombre:SINGULIER]
naming = "elt vide" -->
<!ELEMENT SyntagmaT - O (SyntFeatureClosed*, SyntFeatureOpen*)>
<!ATTLIST SyntagmaT
id ID #REQUIRED
comment CDATA #IMPLIED
example CDATA #IMPLIED
naming CDATA #IMPLIED
syntlabel (V|N|A|PRO|ADV|CONJ|
ADP|DET|ART|NUM|RES|
UNIQUE|INTER|ADADJ|
POSTADV|E) #REQUIRED
featurel IDREFS #IMPLIED>
<!ELEMENT SyntagmaNTC - O (OrderConstraint*, InstantiatedPositionC*,
SyntFeatureClosed*, SyntFeatureOpen*)>
<!ATTLIST SyntagmaNTC
id ID #REQUIRED
comment CDATA #IMPLIED
example CDATA #IMPLIED
naming CDATA #IMPLIED
syntlabel (WITHOUTE|NP|VP|PP|
AP|ADVP|Clause|NG|DETP|
PSP) WITHOUTE
solidarity CDATA #IMPLIED
positionl (OPEN|CLOSED) CLOSED
selfinsertion NUMBER #IMPLIED
featurel IDREFS #IMPLIED>
<!-- SyntagmaNTC can be rewritten using as many InstantiatedPositionC as necessary
The attribute 'positionl' indicates whether the list of rewritten positions is
fully (CLOSED) or partially (OPEN) specified
The attribute 'solidarity' indicates with dashes the pairs
of interdependent positions. -->
<!ELEMENT SyntagmaNTS - O (OrderConstraint*, InstantiatedPositionS*,
Insertion*, SyntFeatureClosed*, SyntFeatureOpen*)>
<!ATTLIST SyntagmaNTS
id ID #REQUIRED
comment CDATA #IMPLIED
example CDATA #IMPLIED
naming CDATA #IMPLIED
syntlabel (WITHOUTE|NP|VP|PP|
AP|ADVP|Clause|NG|DETP|
PSP) WITHOUTE
solidarity CDATA #IMPLIED
positionl (OPEN|CLOSED) CLOSED
featurel IDREFS #IMPLIED>
<!-- Insertions indicate the external positions that can be inserted in the SyntagmaNTS.
InstantiatedPositionsS describes the rewritten positions of the SyntagmaNTS.
The attribute 'solidarity' may either refer to Insertion and to InstantiatedPositionS.
The attribute 'positionl' indicates whether the list of rewritten positions is fully
(CLOSED) or partially (OPEN) specified -->
<!-- ********************************************************* -->
<!-- ***** COMPOSITION ***** -->
<!-- ********************************************************* -->
<!ELEMENT Composition - O (RComposeMu|RComposeSynU)+>
<!-- The elements 'Composition' borne by a compound syntactic unit
record the alternative lists of lexicalizations:
Ex : ("avoir admiration pour") (to have admiration)
("eprouver admiration pour") (to feel admiration)
("eprouver admiration envers") (...)
("porter admiration a") -->
<!-- The list of 'R_ComposeUm' and 'R_ComposeUsyn' gives the list of
components for a given composition alternative. -->
<!-- The components will be referred by RefLex features bearing
two index values:
- the composition index in the list of compositions
- the component index in the list of components
Ex. : [RefLex:1,2]:
This mechanism will be used in the internal structural phrase of Self. -->
<!ELEMENT RComposeMu - O (RestrictMu*)>
<!ATTLIST RComposeMu
mu IDREF #REQUIRED>
<!ELEMENT RComposeSynU - O EMPTY>
<!ATTLIST RComposeSynU
synu IDREF #REQUIRED >
<!-- ********************************************************* -->
<!-- ***** FEATURES ***** -->
<!-- ********************************************************* -->
<!ELEMENT SyntFeatureClosed - O EMPTY>
<!ATTLIST SyntFeatureClosed
featurename (MORPHSUBCAT|MASPECT|MOOD|
TENSE|MVF|PERSON|GENDER|ASPECT|
TNUMBER|POSSESSOR|SYNSUBCAT|
NPRONOMINAL|NEGATIVE|AGREEM|
TOURNURE|COREF|VOICE|FUNCT|
PASSIVIZABLE|CONTROLT|CASE|
CLITICPARTICLE|COORDT|DECLIN|
DEFIN|FINITENESS|FORMATION|
FWFORM|GENDERPOSS|INFLECT|
INFLECTION|NDEGREE|NREFLEXIVE|
POLITENESS|REGISTER|SEX|SUBORDT|
TENSEMOOD|WHTYPE|ZUINCL|NONFINITE) MORPHSUBCAT
value (PROPER|COMMON|MAIN|
AUX|MODAL|COPULA|QUALI|
POSSESSIVE|DEMONSTRATIVE|INTERROGATIVE|RELATIVE|
RECIPROCAL|EXCLAMATIVE|REFLEXIVE|PERSONAL|
SSCPARTITIVE|INDEFINITE|DEFINITE|GENERAL|
PARTICLE|PRONOMINAL|CIRCUMPOSITION|
POSTPOSITION|PREPOSITION|COORDINATIVE|SUBORDINATIVE|
CARDINAL|ORDINAL|VPART|FRACT|
FOREIGN|FORMULA|IMPERSONAL|RELINDEF|
SUBJUNCTION|NEGATION|MULTIPLICATIVE|
ANALOG|SYMBOL|COLLECTIVE|MEDIAL|
SSCCOMPARATIVE|ANSWER|INFINM|COMPM|
NEGM|EN|HI|HO|
ABBREVIATION|MEDIOPASSIVE|ACRONYM|PERSONALSUBJECT|
PERSONALOBJECT|CLASSIFYING|NORMAL|
NONINFLECTING|SSCSTRONG|SSCWEAK|
POSITION|INFMARK|SSCNUMERAL|NONMODAL|
SSCFUTURE|FORMALSUBJECT|OTHER|
PROCESSIVE|RESULTATIVE|STATIVE|
PERFECTIVE|IMPERFECTIVE|
INDICATIVE|SUBJUNCTIVE|IMPERATIVE|GERUND|
CONDITIONAL|INFINITIVE|PARTICIPLE|
SUPINO|PRESPART|PASTPART|INFLECINF|
CONJUNCTIVE|
PRESENT|IMPERFECT|FUTURE|PAST|
PLUSQUEPARFAIT|
TRS|INT|IMP|
1|2|3|4|
NEG|
MASCULINE|FEMININE|NEUTER|
GCOMMON|MF|CONT|INDISCRIMINATE|
OO|INANIMATE|NONMASCULINE|NONNEUTER|
SINGULAR|PLURAL|INVARIANT|
SING|PLUR|SG1|
SG2|PL1|PL2|SGPL3|
THATCL|INTERROGATIVED|INTERROGATIVEIND|TIME|
PLACE|MANNER|DEGREE|QUANTITY|
WITHOUTDET|DETERMINED|COORDINATE|COPULATIVE|
NACLAUSE|SSINFINITIVE|SUBORDINATE|SSCOMPARATIVE|
SSSUPERLATIVE|DETERMINATIVE|SSINTERROGATIVE|LOCATION|
DIRECTION|DIRECTIONAWAYFROM|DIRECTIONTOWARDS|SSRELATIVE|
SE|LE|LA|LES|
Y|NPEN|SELE|SELA|
SELES|SEY|SEEN|NPHI|
YES|NO|
I|J|K|L|NOTI|NOTJ|NOTK|NOTL|
TOINTERROGATIVE|TOEXCLAMATIVE|
COI|COJ|COK|COL|CONOTI|CONOTJ|CONOTK|CONOTL|
ACTIVE|PASSIVE|
MOD|SPE|ATTRIBUTIVE|PREDICATIVE|
ADVERBIAL|ADJECTIVAL|NOMINAL|FUPRONOMINAL|
DETERMIN|VERBAL|NONATTR|
PASYES|PASNO|
SUBJECTCONTROL|OBJECTCONTROL|RAISING|INDIRECTOBJECTCONTROL|
NOMINATIVE|GENITIVE|DATIVE|
ACCUSATIVE|VOCATIVE|OBLIQUE|PREPOSOBJ|
REFL|UNMARKED|PARTITIVE|INESSIVE|
ELATIVE|ILLATIVE|ADESSIVE|ABLATIVE|
ALLATIVE|ESSIVE|TRANSLATIVE|INSTRUCTIVE|
ABESSIVE|COMITATIVE|ORDINARY|OBJECT|
CPYES|
SIMPLE|INITIAL|NOTINITIAL|CORREL|
DADJECTIVAL|DNOMINAL|
DEF|INDEF|
FINITE|NOFINITE|
FSIMPLE|FUSED|
ORIGINAL|TRANSLITERATED|
GPCOMMON|GPNEUTER|
WEAK|STRONG|MIXED|INFL|
NOINFL|
IWEAK|ISTRONG|BASIC|
IMIXED|INFLECTED|NOINFLECTED|DOUBLE|
POSITIVE|COMPARATIVE|SUPERLATIVE|
ABSOLUTESUPERLATIVE|APPRECIATIVE|INTENSIVE|
RREFL|NOREFL|
POLITE|
FORMAL|OBSOLETE|RPOLITE|
SEFEMININE|SEMASCULINE|
PLUSINFVE|COMPAR|PLUSFIN|
TMPRESENT|TMPAST|
TMCONDITIONAL|TMIMPERATIVE|TMPOTENTIAL|
WHINT|REL|WHNO|
NORM|ZU|
INFINITIVEONE|INFINITIVETWO|INFINITIVETHREE|
PARTICIPLEONE|PARTICIPLETWO|AGENTPARTICIPLE) #REQUIRED >
<!ELEMENT SyntFeatureOpen - O EMPTY>
<!ATTLIST SyntFeatureOpen
featurename CDATA #REQUIRED
value CDATA #REQUIRED>
<!ELEMENT LexFeature - O EMPTY>
<!ATTLIST LexFeature
id ID #REQUIRED
featurename (INTROD|PREP|CONJ|
RELPRO|INTPRO|POSTP|
LEX) #REQUIRED
mu IDREF #REQUIRED
value CDATA #REQUIRED
saturesynt (YESSA|
WITHOUTSA) WITHOUTSA>
<!-- LexFeature allows one to specify the lexicalization of a syntactic
leaf or of the head of a phrase.
The lexicographer is free to refer to :
- a morphological unit (the attribute 'mu' encodes the identifier of the selected Mu),
- a string, borne by the 'value' attribute, corresponding to the graphical form of the lexical unit.
The attribute 'saturesynt' indicates wether the phrase is saturated or not by the selected Mu.
Thus, it will always have YESSA value when the feature applies to a SyntagmaT. -->
<!ELEMENT AuxFeature - O EMPTY>
<!ATTLIST AuxFeature
id ID #REQUIRED
value CDATA #REQUIRED
tense CDATA #IMPLIED
mood CDATA #IMPLIED >
<!-- AuxFeature encodes the auxiliary used to conjugate the verb (attribute 'value').
The attribute 'tense' and 'mood' serves to add restriction concerning tense and mood. -->
<!ELEMENT RefLexFeature - O EMPTY>
<!ATTLIST RefLexFeature
id ID #REQUIRED
featurename (RINTROD|RPREP|RCONJ|
RRELPRO|RINTPRO|RPOSTP|
RLEX) #REQUIRED
cponentrange NUMBER #REQUIRED
saturesynt (YESSA|
WITHOUTSA) WITHOUTSA>
<!-- RefLexFeature are similar to LexFeature except that they are used in compound SynU.
Specify a component is done using the mechanism described with Composition object. -->
4.2 List of features names and feature values used in the PAROLE lexicon DTD
This document is the third version of P-WP1.1-MEMO-ERLI-7. The differences between the tables of this document and the tables of the previous version are indicated by a shaded background.
This document is an inventory of the features actually implemented in AlethGD tool V2.1.0 standard filler, mapper and DTD (ETC-2.1.2 version) following the decisions taken by partners and by PAROLE Syntax Workshop. On no account, this set of values is closed; new features and values may be added to this list.
The following table contains the set of morphological and syntactic features, which are not specified in the document P-WP1.1-MEMO-ERLI-5: "Annex to the Addendum to the contract", as it has been implemented in the PAROLE lexicon DTD.
The first column contains the name of the corresponding Ozon object from which features can be searched for with AlthGD Browser.
The second column contains the feature name used in the alethkem.fpd file.
The third column contains the DTD feature and, written within angular brackets, the name of the object containing this feature.
If the feature names and/or the feature values you use in your database or SGML data do not conform to the following list, you will not be able to load and/or unload data via the PAROLE standard Mapper and Filler. In this case, please contact ERLI who will add the missed informations.
| Ozon name | FPD feature declaration | DTD feature name and location | DTD feature value |
| Statut | Statut | status | |
| <Rderiv> | WITHOUTS | ||
| typaff | PREFIX | ||
| <MuAff> | SUFFIX | ||
| INFIX | |||
| BASE | |||
| Separg | Separg | gsepar | |
| <RCompos> | ATTAQUEG | ||
| HYPHEN | |||
| APOSTROPHE | |||
| SPACE | |||
| JOIN | |||
| HYPHENSPACE | |||
| HYPHENJOIN | |||
| HYPHENAPOSTROPHE | |||
| HYPHENSPACEJOIN | |||
| APOSTROPHEJOIN | |||
| Separp | Separp | psepar | |
| <RCompos> | ATTAQUEP | ||
| LIAISONT | |||
| LIAISONZ | |||
| LIAISONK | |||
| LIAISONN | |||
| LIAISONR | |||
| WORDFRONTIER | |||
| TypeBref | TypeBref | typeshort | |
| <ShortForm> | WITHOUTTB | ||
| ABBREVIATION | |||
| INITIALS | |||
| ACRONYM | |||
| ValeurE | ValeurE.DATATION | dating | |
| <CombUF> | WITHOUTD | ||
| ARCHAIC | |||
| OLD | |||
| MODERN | |||
| ValeurE | ValeurE.NIVEAULGUE | style | |
| <CombUF> | WITHOUTNL | ||
| FAMILIAR | |||
| VULGAR | |||
| SLANG | |||
| POPULAR | |||
| LITERARY | |||
| SCHOLARLY | |||
| STANDARD | |||
| ORAL | |||
| KATHAREVOUSSA | |||
| ValeurE | ValeurE.FREQUENCE | frequency | |
| <CombUF> | WITHOUTF | ||
| RARE | |||
| USUAL | |||
| Trait_F | Trait_F.MUSTBEATTACHEDTO | mustbeattachedto | |
| <MuAff> | WITHOUTMAT | ||
| STEM | |||
| VOICE | |||
| TENSEMOOD | |||
| DEGREE | |||
| MATNUMBER | |||
| NONFINITE | |||
| CASE | |||
| STEMORNONFINITE | |||
| NUMBERORNONFINITE | |||
| Trait_F | Trait_F.MUSTBEFOLLOWEDBY | mustbefollowedby | |
| <MuAff> | WITHOUTMFB | ||
| MFBNUMBER | |||
| CASE2 | |||
| TENSEMOOD2 | |||
| NONFINITE2 | |||
| PERSON | |||
| TENSEORMOODORNONFINITE | |||
| Trait_F | Trait_F.MAYBEATTACHEDTO | maybeattachedto | |
| <MuAff> | WITHOUTMBT | ||
| CASE3 | |||
| POSS | |||
| PERSON2 | |||
| IMPERATIVE | |||
| CLITICPARTICLE | |||
| CASEORPOSSORPERSONORIMPERATIVE | |||
| CASEORPOSSORPERSONORIMPERATIVEORCLITICPARTICLE | |||
| Trait_F | Trait_F.BACK | back | |
| <GStem> | WITHOUTB | ||
| <PStem> | NO | ||
| YES | |||
| Trait_F | Trait_F.STEMTYPE | stemtype | |
| <GStem> | WITHOUTST | ||
| <PStem> | BASE | ||
| SSG | |||
| WSG | |||
| SPL | |||
| WPL | |||
| CON | |||
| Trait_F | Trait_F.DSTEM | dstem | |
| <GStem> | WITHOUTDS | ||
| <PStem> | PVE | ||
| CVE | |||
| SVE | |||
| Trait_F | Trait_F.VV | vv | |
| <GStem> | WITHOUTVV | ||
| <PStem> | NOVV | ||
| YESVV | |||
| Trait_F | Trait_F.ENDINGV | endingv | |
| <GStem> | WITHOUTEV | ||
| <PStem> | NOEV | ||
| YESEV | |||
| Trait_F | Trait_F.BASEI | basei | |
| <GStem> | WITHOUTBI | ||
| <PStem> | NOBI | ||
| YESBI | |||
| Trait_F | Trait_F.SYLL1P3 | syll1p3 | |
| <GStem> | WITHOUTS1p3 | ||
| <PStem> | NOSY1 | ||
| YESSY1 | |||
| Trait_F | Trait_F.SYLL2P3 | syll2p3 | |
| <GStem> | WITHOUTS2p3 | ||
| <PStem> | NOSY2 | ||
| YESSY2 | |||
| Trait_F | Trait_F.VQ | vq | |
| <GStem> | WITHOUTVQ | ||
| <PStem> | VA | ||
| VO | |||
| VU | |||
| VE | |||
| VI | |||
| VAUML | |||
| VOUML | |||
| VY | |||
| Trait_F | Trait_F.STEMC | stemc | |
| <GStem> | WITHOUTSC | ||
| <PStem> | NOST | ||
| YESST | |||
| Trait_F | Trait_F.ENDING2V | ending2v | |
| <GStem> | WITHOUT2V | ||
| <PStem> | NOEN | ||
| YESEN | |||
| Trait_F | Trait_F.PX | px | |
| <GStem> | YESPX | ||
| <PStem> | NOPX | ||
| CAN | |||
| WITHOUTPX | |||
| Trait_F | Trait_F.PX3 | px3 | |
| <GStem> | WITHOUTPX3 | ||
| <PStem> | NOP | ||
| YESP | |||
| Trait_F | Trait_F.VQENDING | vqending | |
| <GStem> | WITHOUTVQE | ||
| <PStem> | VQA | ||
| VQAUML | |||
| VQE | |||
| Trait_F | Trait_F.VSTEM | vstem | |
| <GStem> | WITHOUTVS | ||
| <PStem> | SSGVST | ||
| WSGVST | |||
| SSGPAST | |||
| WSGPAST | |||
| SSGCOND | |||
| PASS | |||
| CONVST | |||
| CONPOTN | |||
| SSGINF | |||
| Trait_F | Trait_F.HAS2V | has2v | |
| <GStem> | WITHOUT2 | ||
| <PStem> | NO2V | ||
| YES2V | |||
| Trait_F | Trait_F.CQ | cq | |
| <GStem> | WITHOUTCQ | ||
| <PStem> | CQH | ||
| CQL | |||
| CQN | |||
| CQR | |||
| CQS | |||
| CQT | |||
| Trait_F | Trait_F.STEMPOTN | stempotn | |
| <GStem> | WITHOUTSP | ||
| <PStem> | NOPO | ||
| YESPO | |||
| Trait_F | Trait_F.STEMINF | steminf | |
| <GStem> | WITHOUTSI | ||
| <PStem> | NOIN | ||
| YESIN | |||
| Trait_F | Trait_F.GRADATION | gradation | |
| <GStem> | WITHOUTGR | ||
| <PStem> | WEAK | ||
| STRONG | |||
| STRONGPAST |
| Ozon name | FPD feature declaration | DTD feature name and location | DTD feature value | |
| Fonction | Fonction | function | ||
| <Intervconst> | HEAD | |||
| <PositionC> | SUBJECT | |||
| <PositionS> | OBJECT | |||
| INDIRECTOBJECT | ||||
| OBLIQUE | ||||
| SUBJPRED | ||||
| OBJPRED | ||||
| NCOMP | ||||
| NSUBJ | ||||
| NOFCOMP | ||||
| NPREPCOMP | ||||
| NAPPOSITION | ||||
| NADJUNCT | ||||
| NCLAUSCOMP | ||||
| NDETERMINATIVE | ||||
| NATTRIBUTIVE | ||||
| NMODIFIER | ||||
| ACOMP | ||||
| APREPCOMP | ||||
| ACLAUSCOMP | ||||
| AADJUNCT | ||||
| AMODIFIER | ||||
| ADVCOMP | ||||
| ADVPREPCOMP | ||||
| ADVMODIFIER | ||||
| DETMODIFIER | ||||
| PREPDEPENDENT | ||||
| CONJDEPENDENT | ||||
| PREPOBJ | ||||
| ADVERBIAL | ||||
| COMPL | ||||
| CLAUSCOMP | ||||
| NGENATTRIBUTIVE | ||||
| NLEFTATTRIBUTIVE | ||||
| NPOSTPCOMP | ||||
| NRIGHTATTRIBUTIVE | ||||
| REALSUBJ | ||||
| EtiquetteSynt_T | EtiquetteSynt_T | syntlabel | ||
| <SyntagmaT> | V | |||
| N | ||||
| A | ||||
| PRO | ||||
| ADV | ||||
| CONJ | ||||
| ADP | ||||
| DET | ||||
| ART | ||||
| NUM | ||||
| RES | ||||
| UNIQUE | ||||
| INTER | ||||
| ADADJ | ||||
| POSTADV | ||||
| E | ||||
| EtiquetteSynt_NT | EtiquetteSynt_NT | syntlabel | ||
| <SyntagmaNTC> | WITHOUTE | |||
| <SyntagmaNTS> | NP | |||
| <Construction> | VP | |||
| PP | ||||
| AP | ||||
| ADVP | ||||
| Clause | ||||
| NG | ||||
| DETP | ||||
| PSP | ||||
| featurename | ||||
| Trait_S | Trait_S_Fermes.MORPHSUBCAT | <SyntFeatureClosed> | MORPHSUBCAT | |
| Trait_S_Fermes.MASPECT | MASPECT | |||
| Trait_S_Fermes.ASPECT | ASPECT | |||
| Trait_S_Fermes.MOOD | MOOD | |||
| Trait_S_Fermes.TENSE | TENSE | |||
| Trait_S_Fermes.MVF | MVF | |||
| Trait_S_Fermes.PERSON | PERSON | |||
| Trait_S_Fermes.GENDER | GENDER | |||
| Trait_S_Fermes.TNUMBER | TNUMBER | |||
| Trait_S_Fermes.POSSESSOR |
|
POSSESSOR | ||
| Trait_S_Fermes.SYNSUBCAT | SYNSUBCAT | |||
| Trait_S_Fermes.NPRONOMINAL | NPRONOMINAL | |||
| Trait_S_Fermes.NEGATIVE | NEGATIVE | |||
| Trait_S_Fermes.AGREEM | AGREEM | |||
| Trait_S_Fermes.TOURNURE | TOURNURE | |||
| Trait_S_Fermes.COREF | COREF | |||
| Trait_S_Fermes.VOICE | VOICE | |||
| Trait_S_Fermes.FUNCT | FUNCT | |||
| Trait_S_Fermes.PASSIVIZABLE | PASSIVIZABLE | |||
| Trait_S_Fermes.CONTROLT | CONTROLT | |||
| Trait_S_Fermes.CASE | CASE | |||
| Trait_S_Fermes.CLITICPARTICLE | CLITICPARTICLE | |||
| Trait_S_Fermes.COORDT | COORDT | |||
| Trait_S_Fermes.DECLIN | DECLIN | |||
| Trait_S_Fermes.DEFIN | DEFIN | |||
| Trait_S_Fermes.FINITENESS | FINITENESS | |||
| Trait_S_Fermes.FORMATION | FORMATION | |||
| Trait_S_Fermes.FWFORM | FWFORM | |||
| Trait_S_Fermes.GENDERPOSS | GENDERPOSS | |||
| Trait_S_Fermes.INFLECT | INFLECT | |||
| Trait_S_Fermes.INFLECTION | INFLECTION | |||
| Trait_S_Fermes.NDEGREE | NDEGREE | |||
| Trait_S_Fermes.NREFLEXIVE | NREFLEXIVE | |||
| Trait_S_Fermes.POLITENESS | POLITENESS | |||
| Trait_S_Fermes.REGISTER | REGISTER | |||
| Trait_S_Fermes.SEX | SEX | |||
| Trait_S_Fermes.SUBORDT | SUBORDT | |||
| Trait_S_Fermes.TENSEMOOD | TENSEMOOD | |||
| Trait_S_Fermes.WHTYPE | WHTYPE | |||
| Trait_S_Fermes.ZUINCL | ZUINCL | |||
| Trait_S_Fermes.NONFINITE | NONFINITE | |||
| value | ||||
| Trait_S | Trait_S_Fermes.MORPHSUBCAT | <SyntFeatureClosed> | PROPER | |
| COMMON | ||||
| MAIN | ||||
| AUX | ||||
| MODAL | ||||
| COPULA | ||||
| QUALI | ||||
| POSSESSIVE | ||||
| DEMONSTRATIVE | ||||
| INTERROGATIVE | ||||
| RELATIVE | ||||
| RECIPROCAL | ||||
| EXCLAMATIVE | ||||
| REFLEXIVE | ||||
| PERSONAL | ||||
| SSCPARTITIVE | ||||
| INDEFINITE | ||||
| DEFINITE | ||||
| GENERAL | ||||
| PARTICLE | ||||
| PRONOMINAL | ||||
| CIRCUMPOSITION | ||||
| POSTPOSITION | ||||
| PREPOSITION | ||||
| COORDINATIVE | ||||
| CARDINAL | ||||
| ORDINAL | ||||
| VPART | ||||
| FRACT | ||||
| FOREIGN | ||||
| FORMULA | ||||
| IMPERSONAL | ||||
| RELINDEF | ||||
| SUBJUNCTION | ||||
| NEGATION | ||||
| MULTIPLICATIVE | ||||
| ANALOG | ||||
| SYMBOL | ||||
| COLLECTIVE | ||||
| MEDIAL | ||||
| SSCCOMPARATIVE | ||||
| ANSWER | ||||
| INFINM | ||||
| COMPM | ||||
| NEGM | ||||
| EN | ||||
| HI | ||||
| HO | ||||
| ABBREVIATION | ||||
| MEDIOPASSIVE | ||||
| ACRONYM | ||||
| PERSONALSUBJECT | ||||
| PERSONALOBJECT | ||||
| CLASSIFYING | ||||
| NORMAL | ||||
| NONINFLECTING | ||||
| SSCSTRONG | ||||
| SSCWEAK | ||||
| POSITION | ||||
| INFMARK | ||||
| SSCNUMERAL | ||||
| NONMODAL | ||||
| SSCFUTURE | ||||
| FORMALSUBJECT | ||||
| OTHER | ||||
| Trait_S | Trait_S_Fermes.MASPECT | PROCESSIVE | ||
| RESULTATIVE | ||||
| STATIVE | ||||
| Trait_S | Trait_S_Fermes.ASPECT | PERFECTIVE | ||
| IMPERFECTIVE | ||||
| Trait_S | Trait_S_Fermes.MOOD | INDICATIVE | ||
| SUBJUNCTIVE | ||||
| IMPERATIVE | ||||
| GERUND | ||||
| CONDITIONAL | ||||
| INFINITIVE | ||||
| PARTICIPLE | ||||
| SUPINO | ||||
| PRESPART | ||||
| PASTPART | ||||
| INFLECINF | ||||
| CONJUNCTIVE | ||||
| Trait_S | Trait_S_Fermes.TENSE | PRESENT | ||
| IMPERFECT | ||||
| FUTURE | ||||
| PAST | ||||
| PLUSQUEPARFAIT | ||||
| Trait_S | Trait_S_Fermes.MVF | TRS | ||
| INT | ||||
| IMP | ||||
| Trait_S | Trait_S_Fermes.PERSON | 1 | ||
| 2 | ||||
| 3 | ||||
| 4 | ||||
| NEG | ||||
| Trait_S | Trait_S_Fermes.GENDER | MASCULINE | ||
| FEMININE | ||||
| NEUTER | ||||
| GCOMMON | ||||
| MF | ||||
| CONT | ||||
| INDISCRIMINATE | ||||
| OO | ||||
| INANIMATE | ||||
| NONMASCULINE | ||||
| NONNEUTER | ||||
| Trait_S | Trait_S_Fermes.TNUMBER | SINGULAR | ||
| PLURAL | ||||
| INVARIANT | ||||
| Trait_S | Trait_S_Fermes.POSSESSOR | SING | ||
| PLUR | ||||
| SG1 | ||||
| SG2 | ||||
| PL1 | ||||
| PL2 | ||||
| SGPL3 | ||||
| Trait_S | Trait_S_Fermes.SYNSUBCAT | |||
| SSRELATIVE | ||||
| INTERROGATIVED | ||||
| INTERROGATIVEIND | ||||
| SSINFINITIVE | ||||
| SSINTERROGATIVE | ||||
| THATCL | ||||
| TIME | ||||
| PLACE | ||||
| MANNER | ||||
| DEGREE | ||||
| QUANTITY | ||||
| WITHOUTDET | ||||
| DETERMINED | ||||
| COORDINATE | ||||
| COPULATIVE | ||||
| NACLAUSE | ||||
| SUBORDINATE | ||||
| SSCOMPARATIVE | ||||
| SSSUPERLATIVE | ||||
| DETERMINATIVE | ||||
| LOCATION | ||||
| DIRECTION | ||||
| DIRECTIONAWAYFROM | ||||
| DIRECTIONTOWARDS | ||||
| Trait_S | Trait_S_Fermes.NPRONOMINAL | SE | ||
| LE | ||||
| LA | ||||
| LES | ||||
| Y | ||||
| NPEN | ||||
| SELE | ||||
| SELA | ||||
| SELES | ||||
| SEY | ||||
| SEEN | ||||
| NPHI | ||||
| Trait_S | Trait_S_Fermes.NEGATIVE | YES | ||
| NO | ||||
| Trait_S | Trait_S_Fermes.AGREEM | I | ||
| J | ||||
| K | ||||
| L | ||||
| NOTI | ||||
| NOTJ | ||||
| NOTK | ||||
| NOTL | ||||
| Trait_S | Trait_S_Fermes.TOURNURE | TOINTERROGATIVE | ||
| TOEXCLAMATIVE | ||||
| Trait_S | Trait_S_Fermes.COREF | COI | ||
| COJ | ||||
| COK | ||||
| COL | ||||
| CONOTI | ||||
| CONOTJ | ||||
| CONOTK | ||||
| CONOTL | ||||
| Trait_S | Trait_S_Fermes.VOICE | ACTIVE | ||
| PASSIVE | ||||
| Trait_S | Trait_S_Fermes.FUNCT | MOD | ||
| SPE | ||||
| ATTRIBUTIVE | ||||
| PREDICATIVE | ||||
| ADVERBIAL | ||||
| ADJECTIVAL | ||||
| NOMINAL | ||||
| FUPRONOMINAL | ||||
| DETERMIN | ||||
| VERBAL | ||||
| NONATTR | ||||
| Trait_S | Trait_S_Fermes.PASSIVIZABLE | PASYES | ||
| PASNO | ||||
| Trait_S | Trait_S_Fermes.CONTROLT | SUBJECTCONTROL | ||
| OBJECTCONTROL | ||||
| RAISING | ||||
| INDIRECTOBJECTCONTROL | ||||
| Trait_S | Trait_S_Fermes.CASE | NOMINATIVE | ||
| GENITIVE | ||||
| DATIVE | ||||
| ACCUSATIVE | ||||
| VOCATIVE | ||||
| OBLIQUE | ||||
| PREPOSOBJ | ||||
| REFL | ||||
| UNMARKED | ||||
| PARTITIVE | ||||
| INESSIVE | ||||
| ELATIVE | ||||
| ILLATIVE | ||||
| ADESSIVE | ||||
| ABLATIVE | ||||
| ALLATIVE | ||||
| ESSIVE | ||||
| TRANSLATIVE | ||||
| INSTRUCTIVE | ||||
| ABESSIVE | ||||
| COMITATIVE | ||||
| ORDINARY | ||||
| OBJECT | ||||
| Trait_S | Trait_S_Fermes.CLITICPARTICLE | CPYES | ||
| Trait_S | Trait_S_Fermes.COORDT | SIMPLE | ||
| INITIAL | ||||
| NOTINITIAL | ||||
| CORREL | ||||
| Trait_S | Trait_S_Fermes.DECLIN | DADJECTIVAL | ||
| DNOMINAL | ||||
| Trait_S | Trait_S_Fermes.DEFIN | DEF | ||
| INDEF | ||||
| Trait_S | Trait_S_Fermes.FINITENESS | FINITE | ||
| NOFINITE | ||||
| Trait_S | Trait_S_Fermes.FORMATION | FSIMPLE | ||
| FUSED | ||||
| Trait_S | Trait_S_Fermes.FWFORM | ORIGINAL | ||
| TRANSLITERATED | ||||
| Trait_S | Trait_S_Fermes.GENDERPOSS | GPCOMMON | ||
| GPNEUTER | ||||
| Trait_S | Trait_S_Fermes.INFLECT | WEAK | ||
| STRONG | ||||
| MIXED | ||||
| INFL | ||||
| NOINFL | ||||
| Trait_S | Trait_S_Fermes.INFLECTION | IWEAK | ||
| ISTRONG | ||||
| BASIC | ||||
| IMIXED | ||||
| INFLECTED | ||||
| NOINFLECTED | ||||
| DOUBLE | ||||
| Trait_S | Trait_S_Fermes.NDEGREE | POSITIVE | ||
| COMPARATIVE | ||||
| SUPERLATIVE | ||||
| ABSOLUTESUPERLATIVE | ||||
| APPRECIATIVE | ||||
| INTENSIVE | ||||
| Trait_S | Trait_S_Fermes.NREFLEXIVE | RREFL | ||
| NOREFL | ||||
| Trait_S | Trait_S_Fermes.POLITENESS | POLITE | ||
| Trait_S | Trait_S_Fermes.REGISTER | FORMAL | ||
| OBSOLETE | ||||
| RPOLITE | ||||
| Trait_S | Trait_S_Fermes.SEX | SEFEMININE | ||
| SEMASCULINE | ||||
| Trait_S | Trait_S_Fermes.SUBORDT | PLUSINFVE | ||
| COMPAR | ||||
| PLUSFIN | ||||
| Trait_S | Trait_S_Fermes.TENSEMOOD | TMPRESENT | ||
| TMPAST | ||||
| TMCONDITIONAL | ||||
| TMIMPERATIVE | ||||
| TMPOTENTIAL | ||||
| Trait_S | Trait_S_Fermes.WHTYPE | WHINT | ||
| REL | ||||
| WHNO | ||||
| Trait_S | Trait_S_Fermes.ZUINCL | NORM | ||
| ZU | ||||
| Trait_S | Trait_S_Fermes.NONFINITE | INFINITIVEONE | ||
| INFINITIVETWO | ||||
| INFINITIVETHREE | ||||
| PARTICIPLEONE | ||||
| PARTICIPLETWO | ||||
| AGENTPARTICIPLE | ||||
| Trait_Lex | featurename | |||
| <LexFeature> | INTROD | |||
| PREP | ||||
| CONJ | ||||
| RELPRO | ||||
| INTPRO | ||||
| POSTP | ||||
| LEX | ||||
| Trait_RefLex | featurename | |||
| <RefLexFeature> | RINTROD | |||
| RPREP | ||||
| RCONJ | ||||
| RRELPRO | ||||
| RINTPRO | ||||
| RPOSTP | ||||
| RLEX | ||||