SIMPLE LE4-8346

WP01
 
 
 
 
 
 

SIMPLE - LEXICON DOCUMENTATION

GERMAN - GRAZ GROUP
 
 

***

Document first version date
01/05/2000    
Document date
01/05/2000
Document ID
Deliverable D3.9.2 , WP03.9
Version
01    
Doc. type
     
Document status
to be validated    
Validation type
     
Comments
 
       
  Name Organisation Purpose
       
From
Rudolf Muhr GRA Documentation
       
       
       
       
       
To
TM   Documentation
       
       

 

Lexicon Documentation for German - Data provided by GRAZ






1. General design information
 

The SIMPLE Semantic Lexicon for German has been developed in the framework of the SIMPLE project, which started in April 1998. It aimed at adding semantic information on top of the German PAROLE morphological and syntactic lexicon. In assigning the relevant semantic information to the Parole units we followed the common model, linguistic specifications, DTD and other formats as laid down in the Linguistic specifications.

1.1. Lexicon population

The German part of SIMPLE consists of two parts - the data which were provided by MANNHEIM (MAN) (4/5th) and those of GRAZ (GRA) (1/5th). All initial data for encoding were extracted from the German - PAROLE lexicon by MAN and then made available to Graz. The frequency of the lexical items was taken into consideration and items with high frequency preferred. Due to the linking with the PAROLE-data Graz was dependant on the data provided by Mannheim which supplied us with a list of 327 noun lemmas, 153 verb lemmas and 637 adjective lemmas. As Mannheim was not doing any adjective encoding, this part of the encoding was left to Graz which explains why the amount of the adjective lemmas was considerably higher than the numbers of the other POS.

The Base Concepts have been selected for encoding when equivalents were present in the German PAROLE lexicon. However, no previsions were taken to chose mainly "words" which were particularly representative for the Eurowordnet Basic concepts as the basic concepts (BC) turned out to be biased by semantic divisions only representative for English and on the other hand not covering important semantic features. Some readings could not be associated with BCs and were therefore marked with the feature BC0.

The choice of the lemmas selected was primarily driven by two criteria:

presence of the item in German Parole lexicon (having a syntactic link) and providing a large range of synonyms.

1.2 Background resources and lexicographical tools

The tool for achieving the encoding-tasks was the MS-Access database developed by the Catalan partners of the Institut d'Estudis Catalans (IEC) in Barcelona. We would like to express our gratitude to our colleagues for the generosity and the support we received in the course of the project. In particular we would like to thank Teresa Sadurni for the development and the maintenance of the database and her help we received on many occasions.
 

We have also used background resources and extracted semantic groups from CD dictionaries and printed synonym dictionaries. One of the main sources was the LEXIROM, the Duden Synonym dictionary as well as Mackensen’s synonym dictionary. Most definitions have been extracted from the electronic versions of the Lexirom and other dictionary sources. All encoded words in our lexicon include a definition which were mainly based on the dictionary sources or - if for some reason inappropriate - we developed one of our own. For the encoding of predicates we used standard valence dictionaries like Helbig/Schenkel and other sources (see the bibliography).
 

1.3 Coverage and completeness

The encoding task has been organised in such a way that encoders first chose the lemmas which according to the dictionaries provided large groups of closely related entries. This strategy proved to be quite useful for the consistency of the encoding and the numbers of Semus.

It is important to point out that due to the structure of the German Parole lexicon, our list of basic lemmas consisted mainly of abstract nouns and verbs with cognitive content. The encoding of these items proofed to be quite challenging as there were only few template types and semantic features available in the given ontology. To cope with this would have meant to introduce a number of additional templates and possibly  requiring a  modification of the framework set out by the Specification Group. We therefore refrained from taking this step and restricted the encoding mainly to the mandatory and required features.

As most abstract nouns at our disposition were mainly derivational and deverbal by origin one of the effects was that a number of Dummy Usems had to be encoded as they were missing in the PAROLE lexicon.

The dummy elements are of two types:

For each predicate we have encoded the most common semantic readings, this includes the MASTER entry which eventually is linked to the equivalent nouns when they represented AgentNominalization or VerbNominalizations.

1.4. Validation

In order to check the consistency of our encoded date we used the utilities and checking routines provided by the MS-Access database provided by BAR. All checkings were run and the reported errors as well as inconsistencies in the encoding removed.

1.5. Lexicon Contents

Factual data about the lexicon related to the Parole lexicons.

Table 1: Number of fully encoded Usem's per CG (2000 required)

Total of encoded SEMUS 
2031 SemUs 
Total for ADJECTIVE: 
593 SemUs 
Total for NOUN: 
1036 SemUs 
Total for VERB: 
420 SemUs 

Table 2: TOTAL number of Dummys, Predicates, Synus addressed in SIMPLE
 

Total of Dummy SemUs
1045 SemUs
Total of Predicates
412 SemUs
SynUs adressed in Simple:
791 SemUs

Table 5: Total number of Usem's per Template type


Temp1 Entity:29SemUs
Temp2 Telic:129SemUs
Temp3 Agentive:24SemUs
Temp4 Constitutive:11SemUs
Temp41 Part:17SemUs
Temp411 Body_part:3SemUs
Temp421 Human_group:23SemUs
Temp43 Amount:19SemUs
Temp5 Concrete_entity:26SemUs
Temp51 Location:4SemUs
Temp511 3_D_location:2SemUs
Temp512 Geopolitical_location:13SemUs
Temp513 Area:15SemUs
Temp514 Opening:3SemUs
Temp515 Building:18SemUs
Temp516 Artifactual_area:1SemUs
Temp52 Material:1SemUs
Temp53 Artifact:3SemUs
Temp531 Furniture:1SemUs
Temp532 Clothing:2SemUs
Temp534 Artwork:21SemUs
Temp536 Money:7SemUs
Temp537 Vehicle:3SemUs
Temp538 Semiotic_artifact:17SemUs
Temp55 Physical_object:4SemUs
Temp57 Living_entity:3SemUs
Temp572 Human:27SemUs
Temp5722 Role:15SemUs
Temp57222 Kinship:1SemUs
Temp57223 Social_status:38SemUs
Temp5723 Profession:23SemUs
Temp5724 Agent_of_temporary_activity:43SemUs
Temp5725 Agent_of_persistent_activity:1SemUs
Temp581 Natural_substance:2SemUs
Temp591 Artifactual_drink:2SemUs
Temp6 Property:4SemUs
Temp61 Quality:2SemUs
Temp62 Psych_property:6SemUs
Temp63 Physical_property:2SemUs
Temp631 Physical_power:2SemUs
Temp632 Color:1SemUs
Temp633 Shape:4SemUs
Temp64 Social_Property:6SemUs
Temp7 Abstract_entity:166SemUs
Temp71 Domain:2SemUs
Temp72 Time:20SemUs
Temp73 Moral_standard:4SemUs
Temp74 Cognitive_Fact:36SemUs
Temp75 Movement_of_thought:108SemUs
Temp76 Institution:6SemUs
Temp77 Convention:12SemUs
Temp81 Language:25SemUs
Temp82 Sign:1SemUs
Temp83 Information:58SemUs
Temp85 Unit_of_measurement:3SemUs
Temp9 Event:101SemUs
Temp921 Cause_Aspectual:1SemUs
Temp93 State:2SemUs
Temp931 Exist:4SemUs
Temp9321 Identificational_State:3SemUs
Temp9324 Stative_Possession:3SemUs
Temp94 Act:37SemUs
Temp941 Non_Relational_Act:1SemUs
Temp9421 Cooperative_Activity:22SemUs
Temp9422 Purpose_Act:24SemUs
Temp943 Move:16SemUs
Temp9431 Cause_Motion:9SemUs
Temp944 Cause_Act:5SemUs
Temp9441 Speech_Act:1SemUs
Temp94411 Cooperative_Speech_Act:6SemUs
Temp94413 Commissive_Speech_Act:1SemUs
Temp94415 Expressive_Speech_Act:11SemUs
Temp94416 Declarative_Speech_Act:8SemUs
Temp95 Psychological_Event:4SemUs
Temp951 Cognitive_Event:18SemUs
Temp9511 Judgement:9SemUs
Temp952 Experience_Event:8SemUs
Temp953 Perception:1SemUs
Temp954 Modal_Event:1SemUs
Temp96 Change:2SemUs
Temp9612 Change_of_State:27SemUs
Temp9613 Change_of_Value:4SemUs
Temp962 Change_of_Possession:8SemUs
Temp9621 Transaction:8SemUs
Temp963 Change_of_Location:10SemUs
Temp965 Acquire_Knowledge:4SemUs
Temp97 Cause_Change:1SemUs
Temp9711 Cause_Constitutive_Change:1SemUs
Temp9712 Cause_Change_of_State:34SemUs
Temp972 Cause_Change_Location:7SemUs
Temp974 Creation:1SemUs
Temp9742 Mental_Creation:2SemUs
Temp975 Give_Knowledge:12SemUs
TempA1 Intensional:3SemUs
TempA11 Modal:49SemUs
TempA12 Temporal:20SemUs
TempA13 Emotive:32SemUs
TempA14 Manner:137SemUs
TempA15 Emphasizer:13SemUs
TempA16 ObjectRelated:123SemUs
TempA2 Extensional:1SemUs
TempA21 PsychologicalProperty:56SemUs
TempA22 SocialProperty:54SemUs
TempA23 PhysicalProperty:29SemUs
TempA24 TemporalProperty:8SemUs
TempA25 IntensifyingProperty:35SemUs

Table 6: TOTAL number of Usem's per Domain


AGRICULTURE-FISHING-FORESTRY_TS_domaine 4 SemUs
ADVERTISING_TS_domaine 3 SemUs
AGRICULTURE_TS_domaine 1 SemUs
AIR_TRANSPORT_TS_domaine 1 SemUs
ANATOMY_TS_domaine 1 SemUs
ARCHITECTURE_TS_domaine 5 SemUs
ARMY_TS_domaine 1 SemUs
ARTS_TS_domaine 27 SemUs
ASTRONOMY_TS_domaine 2 SemUs
AUDIOVISUAL_TS_domaine 1 SemUs
BALLET_TS_domaine 1 SemUs
BANKING_TS_domaine 5 SemUs
BOOKBINDING_TS_domaine 18 SemUs
BOTANY_TS_domaine 2 SemUs
BUILDING_CRAFTS_TS_domaine 2 SemUs
BUILDING_TS_domaine 14 SemUs
BUSINESS_TS_domaine 31 SemUs
CAR_TRANSPORT_TS_domaine 3 SemUs
CHEMISTRY_TS_domaine 9 SemUs
CHRISTIANITY_TS_domaine 3 SemUs
CHURCH_OF_ENGLAND_TS_domaine 1 SemUs
CIRCUS_TS_domaine 1 SemUs
CITY_PLANNING_TS_domaine 2 SemUs
CIVIL_LAW_TS_domaine 1 SemUs
CLEANING_TS_domaine 1 SemUs
CLOTHING_INDUSTRY_TS_domaine 8 SemUs
COMMERCE_TS_domaine 2 SemUs
CONSTRUCTION_TS_domaine 4 SemUs
CRAFT_INDUSTRY_TS_domaine 19 SemUs
CREATIVE_WRITING_TS_domaine 1 SemUs
CRIMINAL_LAW_TS_domaine 1 SemUs
CUISINE_TS_domaine 1 SemUs
DANCE_TS_domaine 1 SemUs
DEATH_TS_domaine 3 SemUs
DIRECTION_TS_classificateur_de_nom_C 1 SemUs
DRINK_TS_domaine 3 SemUs
DRUGS_TS_domaine 1 SemUs
DYEING_TS_domaine 3 SemUs
EARTH_SCIENCES_TS_domaine 1 SemUs
ECONOMICS_TS_domaine 10 SemUs
EDUCATION_TS_domaine 25 SemUs
ELECTRONIC_ENGINEERING_TS_domaine 2 SemUs
EMPLOYMENT_TS_domaine 29 SemUs
EPISCOPAL_CHURCH_TS_domaine 1 SemUs
ETHNOLOGY_TS_domaine 4 SemUs
FAMILY_PLANNING_TS_domaine 4 SemUs
FASHION_TS_domaine 5 SemUs
FILM_TS_domaine 6 SemUs
FINANCE_TS_domaine 14 SemUs
FIRE_TS_domaine 1 SemUs
FISHING_TS_domaine 1 SemUs
FOOD_TS_domaine 6 SemUs
FURNITURE_TS_domaine 4 SemUs
GAMES_TS_domaine 2 SemUs
GEOGRAPHY_TS_domaine 7 SemUs
GEOMETRY_TS_domaine 6 SemUs
GEOPOLITICS_TS_domaine 3 SemUs
GOVERNMENT-ADMINISTRATION_TS_domaine 5 SemUs
GRAPHIC_ARTS_TS_domaine 7 SemUs
HAIR_TS_domaine 2 SemUs
HEALTH_AND_MEDICINE_TS_domaine 11 SemUs
HEALTH_TS_domaine 9 SemUs
HIGHER_EDUCATION_TS_domaine 1 SemUs
HISTORY_TS_domaine 7 SemUs
HOME_AND_GARDEN_TS_domaine 1 SemUs
HOME_LAUNDRY_TS_domaine 1 SemUs
HOME_LOANS_TS_domaine 2 SemUs
HORSE_RACING_TS_domaine 4 SemUs
HUMAN_SCIENCES_TS_domaine 36 SemUs
HUNTING_AND_SHOOTING_TS_domaine 1 SemUs
INSURANCE_TS_domaine 1 SemUs
INTELLIGENCE_TS_domaine 10 SemUs
INTERNATIONAL_AFFAIRS_TS_domaine 2 SemUs
ISLAM_TS_domaine 1 SemUs
JEWELRY_TS_domaine 1 SemUs
JUDAISM_TS_domaine 1 SemUs
KITCHEN_EQUIMENT_TS_domaine 3 SemUs
LAW_ENFORCEMENT_TS_domaine 3 SemUs
LAW_TS_domaine 20 SemUs
LEISURE_TS_domaine 6 SemUs
LIFE_SCIENCES_TS_domaine 22 SemUs
LINGUISTICS_TS_domaine 23 SemUs
MAIL_TS_domaine 6 SemUs
MANAGEMENT_TS_domaine 7 SemUs
MANUFACTURING_INDUSTRY_TS_domaine 21 SemUs
MARKETING_TS_domaine 9 SemUs
MARRIAGE_TS_domaine 5 SemUs
MATHEMATICS_TS_domaine 8 SemUs
MEDIA_TS_domaine 26 SemUs
MEDICINE_TS_domaine 4 SemUs
MEETING_TS_domaine 1 SemUs
MILITARY_TS_domaine 20 SemUs
MINING-GENERAL_TS_domaine 1 SemUs
MONARCHY_TS_domaine 5 SemUs
MORMONISM_TS_domaine 1 SemUs
MUSIC_TS_domaine 3 SemUs
NEWSPAPER_PUBLISHING_TS_domaine 4 SemUs
OPERA_TS_domaine 2 SemUs
OPTICS_TS_domaine 4 SemUs
ORTODOX_CHURCH_TS_domaine 1 SemUs
PACKAGING_TS_domaine 1 SemUs
PENAL_SYSTEM_TS_domaine 2 SemUs
PETS_TS_domaine 1 SemUs
PHILOSOPHY_TS_domaine 37 SemUs
PHOTOGRAPHY_TS_domaine 1 SemUs
PHYSICAL_SCIENCES_TS_domaine 1 SemUs
PHYSICS_TS_domaine 1 SemUs
PLUS_TS_ADMINISTRATIVE_T 60 SemUs
PLUS_TS_ANIMAL_T 2 SemUs
PLUS_TS_CHEMICAL_T 11 SemUs
PLUS_TS_COLLECTIVE_T 143 SemUs
PLUS_TS_COMBUSTIBLE_T 3 SemUs
PLUS_TS_EDIBLE_T 1 SemUs
PLUS_TS_ELABORATE_T 63 SemUs
PLUS_TS_FICTIVE_T 35 SemUs
PLUS_TS_GEOMETRIC_T 15 SemUs
PLUS_TS_HUMAN_T 600 SemUs
PLUS_TS_LIQUID_T 2 SemUs
PLUS_TS_LOCATIVE_T 61 SemUs
PLUS_TS_MEASURABLE_T 93 SemUs
PLUS_TS_MENTAL_T 51 SemUs
PLUS_TS_MIMETIC_T 3 SemUs
PLUS_TS_MINERAL_T 6 SemUs
PLUS_TS_MORAL_T 25 SemUs
PLUS_TS_NATURAL_T 21 SemUs
PLUS_TS_PART_T 48 SemUs
PLUS_TS_PATHOLOGICAL_T 1 SemUs
PLUS_TS_PLANT_T 1 SemUs
PLUS_TS_PROFESSIONAL_T 53 SemUs
PLUS_TS_SYMBOLIC_T 43 SemUs
PLUS_TS_SYMPTOMATIC_T 2 SemUs
POETICS_TS_domaine 1 SemUs
POLITICS_AND_GOVERNMENT_TS_domaine 30 SemUs
POLITICS_TS_domaine 12 SemUs
PRIMARY_AND_SECONDARY_EDUCATION_TS_domaine 1 SemUs
PRINTING_TS_domaine 4 SemUs
PROTESTANTISM_TS_domaine 1 SemUs
PSYCHOLOGY_TS_domaine 57 SemUs
PUBLISHING_TS_domaine 8 SemUs
RADIO-TELEVISION_TS_domaine 2 SemUs
RAIL_TRANSPORT_TS_domaine 2 SemUs
RELIGION_TS_domaine 8 SemUs
RESTAURATION_TS_domaine 2 SemUs
RHETORIC_TS_domaine 23 SemUs
ROAD_TRANSPORT_TS_domaine 1 SemUs
ROMAN_CATHOLICISM_TS_domaine 1 SemUs
SCIENCES_TS_domaine 83 SemUs
SERVICE_INDUSTRY_TS_domaine 7 SemUs
SEX_TS_domaine 1 SemUs
SHOWS_TS_domaine 4 SemUs
SITU_TS_classificateur_de_nom_C 7 SemUs
SKIING_TS_domaine 1 SemUs
SMOKING_TS_domaine 2 SemUs
SOCCER_TS_domaine 3 SemUs
SOCIAL_ACTION_TS_domaine 237 SemUs
SOCIAL_SECURITY_TS_domaine 9 SemUs
SOCIOLOGY_TS_domaine 49 SemUs
SPORT_TS_domaine 7 SemUs
SPORTS_AND_LEISURE_TS_domaine 5 SemUs
STATISTICS_TS_domaine 3 SemUs
SWIMMING_TS_domaine 1 SemUs
THEATER_TS_domaine 2 SemUs
THEMES_TS_domaine 1 SemUs
TOWN_AND_COUNTRY_PLANNING_TS_domaine 8 SemUs
TRANSPORT_TS_domaine 7 SemUs
ZOOLOGY_TS_domaine 1 SemUs

Table 7: TOTAL number of Usem's per Semantic Class for Nouns


ATTRIBUTE_TS_classificateur_de_nom 537 SemUs
ABSTRACT_TS_classificateur_de_nom 234 SemUs
ACTIVITY_TS_classificateur_de_nom 224 SemUs
HUMAN_TS_classificateur_de_nom 101 SemUs
EVENT_TS_classificateur_de_nom 87 SemUs
NOTION_TS_classificateur_de_nom 85 SemUs
OCCUPATION_AGENT_TS_classificateur_de_nom 68 SemUs
COGNITIVE_FACT_TS_classificateur_de_nom 55 SemUs
ARTIFACT_TS_classificateur_de_nom 49 SemUs
AFFECTION_TS_classificateur_de_nom 45 SemUs
SYSTEM_OF_THOUGHT_TS_classificateur_de_nom 40 SemUs
LOCATION_TS_classificateur_de_nom 30 SemUs
AMOUNT_TS_classificateur_de_nom 30 SemUs
PROCESS_TS_classificateur_de_nom 26 SemUs
TIME_PERIOD_TS_classificateur_de_nom 19 SemUs
PSYCHOLOGICAL_FEATURE_TS_classificateur_de_nom 17 SemUs
BUILDING_TS_classificateur_de_nom 17 SemUs
GEOGRAPHY_TS_classificateur_de_nom 14 SemUs
ENTITY_TS_classificateur_de_nom 14 SemUs
FUNCTIONAL_SPACE_TS_classificateur_de_nom 11 SemUs
CURRENCY_TS_classificateur_de_nom 9 SemUs
STATE_TS_classificateur_de_nom 8 SemUs
OBJECT_TS_classificateur_de_nom 7 SemUs
CONCRETE_TS_classificateur_de_nom 7 SemUs
PHENOMENON_TS_classificateur_de_nom 4 SemUs
PERIOD_TS_classificateur_de_nom 4 SemUs
LETTER_TS_classificateur_de_nom 4 SemUs
GROUP_NAMES_TS_classificateur_de_nom 4 SemUs
OCCUPATION_TS_classificateur_de_nom 3 SemUs
MEASURE_UNIT_TS_classificateur_de_nom 3 SemUs
LIVING_BEING_TS_classificateur_de_nom 3 SemUs
GARMENT_TS_classificateur_de_nom 3 SemUs
FACULTY_TS_classificateur_de_nom 3 SemUs
BODY_PART_TS_classificateur_de_nom 3 SemUs
ALTERATION_TS_classificateur_de_nom 3 SemUs
VEHICLE_TS_classificateur_de_nom 2 SemUs
SUBSTANCE_TS_classificateur_de_nom 2 SemUs
OPERATION_TS_classificateur_de_nom 2 SemUs
NON_LIVING_TS_classificateur_de_nom 2 SemUs
INSTRUMENT_TS_classificateur_de_nom 2 SemUs
FORM_TS_classificateur_de_nom 2 SemUs
MONTH_TS_classificateur_de_nom 1 SemUs
MATTER_TS_classificateur_de_nom 1 SemUs
FURNITURE_TS_classificateur_de_nom 1 SemUs
DAY_TS_classificateur_de_nom 1 SemUs
BIO_TS_classificateur_de_nom 1 SemUs
APPARATUS_TS_classificateur_de_nom 1 SemUs

Table 8: Number of Usem's per Semantic Class for Verb


CHANGE_TS_classificateur_de_verbe 71 SemUs
COGNITION_TS_classificateur_de_verbe 50 SemUs
MOTION_TS_classificateur_de_verbe 35 SemUs
COMMUNICATION_TS_classificateur_de_verbe 29 SemUs
EMOTION_TS_classificateur_de_verbe 14 SemUs
PERCEPTION_TS_classificateur_de_verbe 11 SemUs
SOCIAL_TS_classificateur_de_verbe 11 SemUs
STATIVE_TS_classificateur_de_verbe 9 SemUs
CONTACT_TS_classificateur_de_verbe 8 SemUs
POSSESSION_TS_classificateur_de_verbe 8 SemUs
CREATION_TS_classificateur_de_verbe 3 SemUs
WEATHER_TS_classificateur_de_verbe 1 SemUs

2. Semantic encoding

In the German SIMPLE lexicon the encoding of entries was performed using the encoding tool supplied by the Catalan site. The tool is an interface which allows to encode/check/browse the Simple data, stored in a relational database (MS-Access). It has been created by Marta Villegas and Teresa Sadurní and developed by Teresa Sadurní at the Institut d'Estudis Catalans (IEC), in Barcelona.

This tool has provided most valuable help for the encoding process. Its great number of very useful options for inserting entries, copying and maintaining existing data has allowed to perform an accurate and quick coding of entries and a continuous consistency checking of data.

2.1. Criteria for Syntax-Semantic linking

The syntactic-semantic linking is established on a one-to-one relation between SynUs and SemUs at the level of correspondence objects. As we were using the MS-Access database the choices for the linking were automatically proposed by the db and had to be chosen according to the particular features of the SEMU.

In the German PAROLE lexicon, entries may have one or more syntactic units. As a rule, all meanings of one lemma corresponding to the most general syntactic description were first encoded and the others left aside for eventual later treatment. Consequently, this one syntactic unit is linked to all the subsenses (semantic units) of the word in question.
 

2.1.1 One to many:
 

<SynU
id="USADJ1235"
example=""
naming=".A1"
description="DADJ17">
<CorrespSynUSemU
targetsemu="USem1948"
correspondence="ISObivalent">
<CorrespSynUSemU
targetsemu="USem1949"
correspondence="ISObivalent">
<CorrespSynUSemU
targetsemu="USem2262">
<CorrespSynUSemU
targetsemu="USem2263">
</SynU>



<SemU
id = "USem1948"
naming = "inhuman" [inhuman]
example = "unmenschlich" [cruel]
formaldefinition = "grausam zu anderen Menschen" - "Sein Verhalten zu den Gefangenen ist unmenschlich."


<SemU
id = "USem1949"
naming = "inhuman"
example = "unbarmherzig" [merciless]
formaldefinition = "ohne Mitleid zu empfinden" - "Der Gouverneur war bei der Verstreckung des Urteils unbarmherzig"


<SemU
id = "USem2262"
naming = "inhuman"
example = "gnadenlos" [merciless]
formaldefinition = "Ohne die geringste Rücksicht" - Die Truppen gingen gnadenlos gegen den Widerstand vor.



<SemU
id = "USem2263"
naming = "inhuman"
example = "brutal" [brutal]
formaldefinition = "gewalttätig" - "Der Bankräuber war äußerst brutal."

 2.1.2 One to one:

<SynU
id="USN7132"
example=""
naming=".N1"
description="DN2">
<CorrespSynUSemU
targetsemu="USem406"
correspondence="ISOmonovalent">



<SemU
id = "USem406"
naming = "Perspektive"
example = "Darstellung"
comment = " "
freedefinition = "BC0"
formaldefinition = "Versuch, eine bildliche oder räumliche Vorstellung zu vermitteln"

2.1. 1.Types of Correspondence

Subtypes of Correspondence used:
 
Correspondence 
Description 
Example 
Comment 
ISOmonovalent Isomorphic mapping for unary predicates Autor [author]
gehen [to go]
Nouns
Monovalent verbs
ISObivalent Isomorphic mapping for bivalent predicates ändern[to change] Bivalent verbs
ISOtrivalent Isomorphic mapping for trivalent predicates überzeugen [to convince] Trivalent verbs
RED2to1 reduced mapping for predicates with two arguments where Arg1 changes to Arg0 Aufdruck [impresssion]

Ausbildung [education]

deverbal nouns derived from verbs with a direct object or prepositional object
RED3to2 reduced mapping for predicates with three arguments where Arg2 goes to Arg0 and Arg2 to Arg1 Rennen [run]

Auseinandersetzung [conflict, quarrel, discussion]

deverbal nouns with two prepositional objects or two objects

 2.2. Criteria for assigning Domain Features

As recommended in Linguistic Specifications, we tried to assign to each Usem a specific semantic class and template type available in the Linguistic Specifications as provided by the Access database. Due to the internal structure of the database, it was not possible to assign more than one domain.

Assignment of the Domain categories has followed in most cases the recommendations given in the Linguistic Specifications D2.1. The main criteria for the assignment of the Domain values were the following:

2.3. Criteria for assigning Semantic Class and Template Type

Linguistic specifications by the specification group were the main criteria for assigning template type and semantic class information. The dictionaries of Helbig/Schenkel and Ballmer/Brennenstuhl were used for the encoding of verbs, in adjective and noun encoding valuable help was found in the dictionaries of Sommerfeldt/Schreiber.
 

2.3.1. Language specific typing

This gives an overview over a demonstrative selection of most common assingement types for template types:

  1. General process verbs like"verändern" [change] "verarbeiten"[process] or"abservieren" [clear the table]which are either monovalent or bivalent with a direct object were assigned the template type "Change_state" or "Cause_Change_of_State".
  2. Event verbs like "verschwinden" [vanish] or "fortbringen" [remove] which imply a change of location were assigned thetemplate type "Cause_Change_Location".
  3. Process verbs like "austeilen/verteilen"[distribute] which imply a change of ownership were assigned thetemplate type "Change_of_Possession
  4. Causative verbs like "veranlassen" [cause, bring to action] were assigned the template type "Activity".
  5. Factitive verbs like "tangieren" [concern, effect] which imply an obligatory patient role were assigned the template type "Cause_Motion".
  6. Modality verbs like "verbessern" [improve] which imply a modal adjective complement were assigned the template type "Change_of_Value"
  7. Cognitive verbs like "erinnern" [remember] and "überlegen" [reflect, think about] (which are also event verbs) were assigned the template type "Cognitive_Event"
  8. Nouns denoting cognitive states like"Geistlosigkeit" [mindlessness] or regular (stative) processes like "Einfluss" [influence] were assigned the template type "Cognitive_Fact".
  9. Nouns denoting social events like "Feier" [celebration] were assigned the template type "Convention"
  10. Cooperative verbs describing general social activities like "treffen" [meet] or "helfen" [help] which require cooperative action were assigned the template type "Cooperative_Activity".
  11. A considerable number of nouns (mostly of deverbal origin) like "Informierung" [briefing], Engagement [commitment] which imply some intentional process or activity are assigned the template "telic".
With very few concrete nouns available most of the relevant template types e.g. [Animal].[Material], [Food], .[Plant], [Flower].[Colour] etc. were not or almost not used.

2.3.3. Criteria for encoding Semantic Relations

In the SIMPLE model semantic relations between SemU’s are intended to increase the descriptive power of the model. The encoding of semantic relations between different entries was done on the basis on choosing the linguistically relevant distinctive semantic relations, even if these point to dummy SemU's.

The dummy Usems collected in the course of the encoding arose from the expansion of lexicon population mainly by virtue of the many abstract derivational nouns we had to deal with. The other source was the already mentioned wish to cover the relevant parts of the semantics of the words encoded in a comprehensive way.

Examples to illustrate the assignment of dummys are
 

Another point is that the semantic relations specify in some sense the collocations the word typically has: 2.4. Classes derived from encoding
  • Polysemic relations have not been implemented which is again due to the fact that we mainly had to deal with abstract word senses.
  • Synonimy was only encoded with some adjectives.
  • Hyponimy was not encoded.
  • 2.5. Representation of Predicative information.

    Each predicative entry contains information in respect to:

    Each predicative entry is linked to a lexical predicate, carrying the maximal possible number of arguments.

    The same predicate is used

    The type of link depends on the lemma and the relation it has with the predicate : Predicative information is encoded for the following categories of words : 3. Statistics

    Table 8: Number of Usem's per Semantic Relations for Verbs

    SRActivity 15 SemUs
    SRAgentive 204 SemUs
    SRAgentiveCause 44 SemUs
    SRAgentiveExperience 4 SemUs
    SRAgentiveprog 1 SemUs
    SRAgentverb 5 SemUs
    SRAntonym 262 SemUs
    SRAntonymComp 179 SemUs
    SRAntonymGrad 48 SemUs
    SRCausedby 145 SemUs
    SRCauses 112 SemUs
    SRConcerns 19 SemUs
    SRConstitutive 7 SemUs
    SRConstitutiveactivity 31 SemUs
    SRContains 68 SemUs
    SRCreatedby 341 SemUs
    SRDenominalNounVerb 2 SemUs
    SRDerivational 6 SemUs
    SRDerivedfrom 33 SemUs
    SRDirecttelic 92 SemUs
    SREntail 4 SemUs
    SREventverb 10 SemUs
    SRFormal 5 SemUs
    SRHasAsEffect 158 SemUs
    SRHasasmember 65 SemUs
    SRHasaspart 101 SemUs
    SRHasasproperty 15 SemUs
    SRIndirecttelic 8 SemUs
    SRInstrument 31 SemUs
    SRIsa 51 SemUs
    SRIsafollowerof 1 SemUs
    SRIsamemberof 108 SemUs
    SRIsapartof 160 SemUs
    SRIsin 9 SemUs
    SRIstheabilityof 18 SemUs
    SRIstheactivityof 54 SemUs
    SRIsthehabitof 26 SemUs
    SRKinship 2 SemUs
    SRLocation 12 SemUs
    SRMadeof 16 SemUs
    SRMeasuredby 52 SemUs
    SRMeasures 1 SemUs
    SRMetaphor 16 SemUs
    SRNominalization 98 SemUs
    SRNounadjective 79 SemUs
    SRNounPropernoun 1 SemUs
    SRObjectoftheactivity 15 SemUs
    SRPolysemy 20 SemUs
    SRPolysemyAgentofpersistentactivity-Profession 1 SemUs
    SRPolysemyBuilding-Institution 4 SemUs
    SRPolysemyCauseact-Nonrelationalact 1 SemUs
    SRPolysemyChangeofstate-Causechangeofstate 10 SemUs
    SRPolysemyCognitiveevent-Experienceevent 12 SemUs
    SRPolysemyConstitutivechange-Causeconstitutivechange 1 SemUs
    SRPolysemyDomain-Activity 1 SemUs
    SRPolysemyHumanGroup-Building 2 SemUs
    SRPolysemyHumanGroup-GeopoliticalLocation 8 SemUs
    SRPolysemyHumanGroup-Institution 14 SemUs
    SRPolysemyLocation-HumanGroup 19 SemUs
    SRPolysemyMove-Causemotion 26 SemUs
    SRPolysemyOpening-Artifact 7 SemUs
    SRPolysemyPlant-Substance 1 SemUs
    SRPolysemyPurposeact-Domain 3 SemUs
    SRPolysemySemioticartifact-Information 2 SemUs
    SRProcessverb 10 SemUs
    SRProducedby 250 SemUs
    SRProduces 279 SemUs
    SRProperty 2 SemUs
    SRPropertyof 24 SemUs
    SRPurpose 42 SemUs
    SRQuantifies 8 SemUs
    SRRelatedto 18 SemUs
    SRRelates 4 SemUs
    SRResult 4 SemUs
    SRResultingState 48 SemUs
    SRResultof 333 SemUs
    SRSource 13 SemUs
    SRSuccessorof