SIMPLE LE4-8346
WP01
SIMPLE - LEXICON DOCUMENTATION
GERMAN - GRAZ GROUP
***
|
Document
first version date
|
01/05/2000 | |||
|
Document
date
|
01/05/2000 | |||
|
Document
ID
|
Deliverable D3.9.2 , WP03.9 | |||
|
Version
|
01 | |||
|
Doc.
type
|
||||
|
Document
status
|
to be validated | |||
|
Validation
type
|
||||
|
Comments
|
||||
| Name | Organisation | Purpose | ||
|
From
|
Rudolf Muhr | GRA | Documentation | |
|
To
|
TM | Documentation | ||
Lexicon Documentation for German - Data provided by GRAZ
1. General design information
The SIMPLE Semantic Lexicon for German has been developed in the framework of the SIMPLE project, which started in April 1998. It aimed at adding semantic information on top of the German PAROLE morphological and syntactic lexicon. In assigning the relevant semantic information to the Parole units we followed the common model, linguistic specifications, DTD and other formats as laid down in the Linguistic specifications.
1.1. Lexicon population
The German part of SIMPLE consists of two parts - the data which were provided by MANNHEIM (MAN) (4/5th) and those of GRAZ (GRA) (1/5th). All initial data for encoding were extracted from the German - PAROLE lexicon by MAN and then made available to Graz. The frequency of the lexical items was taken into consideration and items with high frequency preferred. Due to the linking with the PAROLE-data Graz was dependant on the data provided by Mannheim which supplied us with a list of 327 noun lemmas, 153 verb lemmas and 637 adjective lemmas. As Mannheim was not doing any adjective encoding, this part of the encoding was left to Graz which explains why the amount of the adjective lemmas was considerably higher than the numbers of the other POS.
The Base Concepts have been selected for encoding when equivalents were present in the German PAROLE lexicon. However, no previsions were taken to chose mainly "words" which were particularly representative for the Eurowordnet Basic concepts as the basic concepts (BC) turned out to be biased by semantic divisions only representative for English and on the other hand not covering important semantic features. Some readings could not be associated with BCs and were therefore marked with the feature BC0.
The choice of the lemmas selected was primarily driven by two criteria:
presence of the item in German Parole lexicon (having a syntactic link) and providing a large range of synonyms.
1.2 Background resources and lexicographical tools
The tool for achieving the encoding-tasks was the MS-Access database
developed by the Catalan partners of the Institut d'Estudis Catalans (IEC)
in Barcelona. We would like to express our gratitude to our colleagues
for the generosity and the support we received in the course of the project.
In particular we would like to thank Teresa Sadurni for the development
and the maintenance of the database and her help we received on many occasions.
We have also used background resources and extracted semantic groups
from CD dictionaries and printed synonym dictionaries. One of the main
sources was the LEXIROM, the Duden Synonym dictionary as well as Mackensen’s
synonym dictionary. Most definitions have been extracted from the electronic
versions of the Lexirom and other dictionary sources. All encoded words
in our lexicon include a definition which were mainly based on the dictionary
sources or - if for some reason inappropriate - we developed one of our
own. For the encoding of predicates we used standard valence dictionaries
like Helbig/Schenkel and other sources (see the bibliography).
1.3 Coverage and completeness
The encoding task has been organised in such a way that encoders first chose the lemmas which according to the dictionaries provided large groups of closely related entries. This strategy proved to be quite useful for the consistency of the encoding and the numbers of Semus.
It is important to point out that due to the structure of the German Parole lexicon, our list of basic lemmas consisted mainly of abstract nouns and verbs with cognitive content. The encoding of these items proofed to be quite challenging as there were only few template types and semantic features available in the given ontology. To cope with this would have meant to introduce a number of additional templates and possibly requiring a modification of the framework set out by the Specification Group. We therefore refrained from taking this step and restricted the encoding mainly to the mandatory and required features.
As most abstract nouns at our disposition were mainly derivational and deverbal by origin one of the effects was that a number of Dummy Usems had to be encoded as they were missing in the PAROLE lexicon.
The dummy elements are of two types:
1.4. Validation
In order to check the consistency of our encoded date we used the utilities and checking routines provided by the MS-Access database provided by BAR. All checkings were run and the reported errors as well as inconsistencies in the encoding removed.
1.5. Lexicon Contents
Factual data about the lexicon related to the Parole lexicons.
Table 1: Number of fully encoded Usem's per CG (2000 required)
Total of encoded SEMUS |
2031 SemUs |
Total for ADJECTIVE: |
593 SemUs |
Total for NOUN: |
1036 SemUs |
Total for VERB: |
420 SemUs |
Table 2: TOTAL number of Dummys, Predicates, Synus addressed in SIMPLE
| Total of Dummy SemUs |
|
| Total of Predicates |
|
| SynUs adressed in Simple: |
|
Table 5: Total number of Usem's per Template type
| Temp1 | Entity:29SemUs |
| Temp2 | Telic:129SemUs |
| Temp3 | Agentive:24SemUs |
| Temp4 | Constitutive:11SemUs |
| Temp41 | Part:17SemUs |
| Temp411 | Body_part:3SemUs |
| Temp421 | Human_group:23SemUs |
| Temp43 | Amount:19SemUs |
| Temp5 | Concrete_entity:26SemUs |
| Temp51 | Location:4SemUs |
| Temp511 | 3_D_location:2SemUs |
| Temp512 | Geopolitical_location:13SemUs |
| Temp513 | Area:15SemUs |
| Temp514 | Opening:3SemUs |
| Temp515 | Building:18SemUs |
| Temp516 | Artifactual_area:1SemUs |
| Temp52 | Material:1SemUs |
| Temp53 | Artifact:3SemUs |
| Temp531 | Furniture:1SemUs |
| Temp532 | Clothing:2SemUs |
| Temp534 | Artwork:21SemUs |
| Temp536 | Money:7SemUs |
| Temp537 | Vehicle:3SemUs |
| Temp538 | Semiotic_artifact:17SemUs |
| Temp55 | Physical_object:4SemUs |
| Temp57 | Living_entity:3SemUs |
| Temp572 | Human:27SemUs |
| Temp5722 | Role:15SemUs |
| Temp57222 | Kinship:1SemUs |
| Temp57223 | Social_status:38SemUs |
| Temp5723 | Profession:23SemUs |
| Temp5724 | Agent_of_temporary_activity:43SemUs |
| Temp5725 | Agent_of_persistent_activity:1SemUs |
| Temp581 | Natural_substance:2SemUs |
| Temp591 | Artifactual_drink:2SemUs |
| Temp6 | Property:4SemUs |
| Temp61 | Quality:2SemUs |
| Temp62 | Psych_property:6SemUs |
| Temp63 | Physical_property:2SemUs |
| Temp631 | Physical_power:2SemUs |
| Temp632 | Color:1SemUs |
| Temp633 | Shape:4SemUs |
| Temp64 | Social_Property:6SemUs |
| Temp7 | Abstract_entity:166SemUs |
| Temp71 | Domain:2SemUs |
| Temp72 | Time:20SemUs |
| Temp73 | Moral_standard:4SemUs |
| Temp74 | Cognitive_Fact:36SemUs |
| Temp75 | Movement_of_thought:108SemUs |
| Temp76 | Institution:6SemUs |
| Temp77 | Convention:12SemUs |
| Temp81 | Language:25SemUs |
| Temp82 | Sign:1SemUs |
| Temp83 | Information:58SemUs |
| Temp85 | Unit_of_measurement:3SemUs |
| Temp9 | Event:101SemUs |
| Temp921 | Cause_Aspectual:1SemUs |
| Temp93 | State:2SemUs |
| Temp931 | Exist:4SemUs |
| Temp9321 | Identificational_State:3SemUs |
| Temp9324 | Stative_Possession:3SemUs |
| Temp94 | Act:37SemUs |
| Temp941 | Non_Relational_Act:1SemUs |
| Temp9421 | Cooperative_Activity:22SemUs |
| Temp9422 | Purpose_Act:24SemUs |
| Temp943 | Move:16SemUs |
| Temp9431 | Cause_Motion:9SemUs |
| Temp944 | Cause_Act:5SemUs |
| Temp9441 | Speech_Act:1SemUs |
| Temp94411 | Cooperative_Speech_Act:6SemUs |
| Temp94413 | Commissive_Speech_Act:1SemUs |
| Temp94415 | Expressive_Speech_Act:11SemUs |
| Temp94416 | Declarative_Speech_Act:8SemUs |
| Temp95 | Psychological_Event:4SemUs |
| Temp951 | Cognitive_Event:18SemUs |
| Temp9511 | Judgement:9SemUs |
| Temp952 | Experience_Event:8SemUs |
| Temp953 | Perception:1SemUs |
| Temp954 | Modal_Event:1SemUs |
| Temp96 | Change:2SemUs |
| Temp9612 | Change_of_State:27SemUs |
| Temp9613 | Change_of_Value:4SemUs |
| Temp962 | Change_of_Possession:8SemUs |
| Temp9621 | Transaction:8SemUs |
| Temp963 | Change_of_Location:10SemUs |
| Temp965 | Acquire_Knowledge:4SemUs |
| Temp97 | Cause_Change:1SemUs |
| Temp9711 | Cause_Constitutive_Change:1SemUs |
| Temp9712 | Cause_Change_of_State:34SemUs |
| Temp972 | Cause_Change_Location:7SemUs |
| Temp974 | Creation:1SemUs |
| Temp9742 | Mental_Creation:2SemUs |
| Temp975 | Give_Knowledge:12SemUs |
| TempA1 | Intensional:3SemUs |
| TempA11 | Modal:49SemUs |
| TempA12 | Temporal:20SemUs |
| TempA13 | Emotive:32SemUs |
| TempA14 | Manner:137SemUs |
| TempA15 | Emphasizer:13SemUs |
| TempA16 | ObjectRelated:123SemUs |
| TempA2 | Extensional:1SemUs |
| TempA21 | PsychologicalProperty:56SemUs |
| TempA22 | SocialProperty:54SemUs |
| TempA23 | PhysicalProperty:29SemUs |
| TempA24 | TemporalProperty:8SemUs |
| TempA25 | IntensifyingProperty:35SemUs |
Table 6: TOTAL number of Usem's per Domain
| AGRICULTURE-FISHING-FORESTRY_TS_domaine | 4 SemUs |
| ADVERTISING_TS_domaine | 3 SemUs |
| AGRICULTURE_TS_domaine | 1 SemUs |
| AIR_TRANSPORT_TS_domaine | 1 SemUs |
| ANATOMY_TS_domaine | 1 SemUs |
| ARCHITECTURE_TS_domaine | 5 SemUs |
| ARMY_TS_domaine | 1 SemUs |
| ARTS_TS_domaine | 27 SemUs |
| ASTRONOMY_TS_domaine | 2 SemUs |
| AUDIOVISUAL_TS_domaine | 1 SemUs |
| BALLET_TS_domaine | 1 SemUs |
| BANKING_TS_domaine | 5 SemUs |
| BOOKBINDING_TS_domaine | 18 SemUs |
| BOTANY_TS_domaine | 2 SemUs |
| BUILDING_CRAFTS_TS_domaine | 2 SemUs |
| BUILDING_TS_domaine | 14 SemUs |
| BUSINESS_TS_domaine | 31 SemUs |
| CAR_TRANSPORT_TS_domaine | 3 SemUs |
| CHEMISTRY_TS_domaine | 9 SemUs |
| CHRISTIANITY_TS_domaine | 3 SemUs |
| CHURCH_OF_ENGLAND_TS_domaine | 1 SemUs |
| CIRCUS_TS_domaine | 1 SemUs |
| CITY_PLANNING_TS_domaine | 2 SemUs |
| CIVIL_LAW_TS_domaine | 1 SemUs |
| CLEANING_TS_domaine | 1 SemUs |
| CLOTHING_INDUSTRY_TS_domaine | 8 SemUs |
| COMMERCE_TS_domaine | 2 SemUs |
| CONSTRUCTION_TS_domaine | 4 SemUs |
| CRAFT_INDUSTRY_TS_domaine | 19 SemUs |
| CREATIVE_WRITING_TS_domaine | 1 SemUs |
| CRIMINAL_LAW_TS_domaine | 1 SemUs |
| CUISINE_TS_domaine | 1 SemUs |
| DANCE_TS_domaine | 1 SemUs |
| DEATH_TS_domaine | 3 SemUs |
| DIRECTION_TS_classificateur_de_nom_C | 1 SemUs |
| DRINK_TS_domaine | 3 SemUs |
| DRUGS_TS_domaine | 1 SemUs |
| DYEING_TS_domaine | 3 SemUs |
| EARTH_SCIENCES_TS_domaine | 1 SemUs |
| ECONOMICS_TS_domaine | 10 SemUs |
| EDUCATION_TS_domaine | 25 SemUs |
| ELECTRONIC_ENGINEERING_TS_domaine | 2 SemUs |
| EMPLOYMENT_TS_domaine | 29 SemUs |
| EPISCOPAL_CHURCH_TS_domaine | 1 SemUs |
| ETHNOLOGY_TS_domaine | 4 SemUs |
| FAMILY_PLANNING_TS_domaine | 4 SemUs |
| FASHION_TS_domaine | 5 SemUs |
| FILM_TS_domaine | 6 SemUs |
| FINANCE_TS_domaine | 14 SemUs |
| FIRE_TS_domaine | 1 SemUs |
| FISHING_TS_domaine | 1 SemUs |
| FOOD_TS_domaine | 6 SemUs |
| FURNITURE_TS_domaine | 4 SemUs |
| GAMES_TS_domaine | 2 SemUs |
| GEOGRAPHY_TS_domaine | 7 SemUs |
| GEOMETRY_TS_domaine | 6 SemUs |
| GEOPOLITICS_TS_domaine | 3 SemUs |
| GOVERNMENT-ADMINISTRATION_TS_domaine | 5 SemUs |
| GRAPHIC_ARTS_TS_domaine | 7 SemUs |
| HAIR_TS_domaine | 2 SemUs |
| HEALTH_AND_MEDICINE_TS_domaine | 11 SemUs |
| HEALTH_TS_domaine | 9 SemUs |
| HIGHER_EDUCATION_TS_domaine | 1 SemUs |
| HISTORY_TS_domaine | 7 SemUs |
| HOME_AND_GARDEN_TS_domaine | 1 SemUs |
| HOME_LAUNDRY_TS_domaine | 1 SemUs |
| HOME_LOANS_TS_domaine | 2 SemUs |
| HORSE_RACING_TS_domaine | 4 SemUs |
| HUMAN_SCIENCES_TS_domaine | 36 SemUs |
| HUNTING_AND_SHOOTING_TS_domaine | 1 SemUs |
| INSURANCE_TS_domaine | 1 SemUs |
| INTELLIGENCE_TS_domaine | 10 SemUs |
| INTERNATIONAL_AFFAIRS_TS_domaine | 2 SemUs |
| ISLAM_TS_domaine | 1 SemUs |
| JEWELRY_TS_domaine | 1 SemUs |
| JUDAISM_TS_domaine | 1 SemUs |
| KITCHEN_EQUIMENT_TS_domaine | 3 SemUs |
| LAW_ENFORCEMENT_TS_domaine | 3 SemUs |
| LAW_TS_domaine | 20 SemUs |
| LEISURE_TS_domaine | 6 SemUs |
| LIFE_SCIENCES_TS_domaine | 22 SemUs |
| LINGUISTICS_TS_domaine | 23 SemUs |
| MAIL_TS_domaine | 6 SemUs |
| MANAGEMENT_TS_domaine | 7 SemUs |
| MANUFACTURING_INDUSTRY_TS_domaine | 21 SemUs |
| MARKETING_TS_domaine | 9 SemUs |
| MARRIAGE_TS_domaine | 5 SemUs |
| MATHEMATICS_TS_domaine | 8 SemUs |
| MEDIA_TS_domaine | 26 SemUs |
| MEDICINE_TS_domaine | 4 SemUs |
| MEETING_TS_domaine | 1 SemUs |
| MILITARY_TS_domaine | 20 SemUs |
| MINING-GENERAL_TS_domaine | 1 SemUs |
| MONARCHY_TS_domaine | 5 SemUs |
| MORMONISM_TS_domaine | 1 SemUs |
| MUSIC_TS_domaine | 3 SemUs |
| NEWSPAPER_PUBLISHING_TS_domaine | 4 SemUs |
| OPERA_TS_domaine | 2 SemUs |
| OPTICS_TS_domaine | 4 SemUs |
| ORTODOX_CHURCH_TS_domaine | 1 SemUs |
| PACKAGING_TS_domaine | 1 SemUs |
| PENAL_SYSTEM_TS_domaine | 2 SemUs |
| PETS_TS_domaine | 1 SemUs |
| PHILOSOPHY_TS_domaine | 37 SemUs |
| PHOTOGRAPHY_TS_domaine | 1 SemUs |
| PHYSICAL_SCIENCES_TS_domaine | 1 SemUs |
| PHYSICS_TS_domaine | 1 SemUs |
| PLUS_TS_ADMINISTRATIVE_T | 60 SemUs |
| PLUS_TS_ANIMAL_T | 2 SemUs |
| PLUS_TS_CHEMICAL_T | 11 SemUs |
| PLUS_TS_COLLECTIVE_T | 143 SemUs |
| PLUS_TS_COMBUSTIBLE_T | 3 SemUs |
| PLUS_TS_EDIBLE_T | 1 SemUs |
| PLUS_TS_ELABORATE_T | 63 SemUs |
| PLUS_TS_FICTIVE_T | 35 SemUs |
| PLUS_TS_GEOMETRIC_T | 15 SemUs |
| PLUS_TS_HUMAN_T | 600 SemUs |
| PLUS_TS_LIQUID_T | 2 SemUs |
| PLUS_TS_LOCATIVE_T | 61 SemUs |
| PLUS_TS_MEASURABLE_T | 93 SemUs |
| PLUS_TS_MENTAL_T | 51 SemUs |
| PLUS_TS_MIMETIC_T | 3 SemUs |
| PLUS_TS_MINERAL_T | 6 SemUs |
| PLUS_TS_MORAL_T | 25 SemUs |
| PLUS_TS_NATURAL_T | 21 SemUs |
| PLUS_TS_PART_T | 48 SemUs |
| PLUS_TS_PATHOLOGICAL_T | 1 SemUs |
| PLUS_TS_PLANT_T | 1 SemUs |
| PLUS_TS_PROFESSIONAL_T | 53 SemUs |
| PLUS_TS_SYMBOLIC_T | 43 SemUs |
| PLUS_TS_SYMPTOMATIC_T | 2 SemUs |
| POETICS_TS_domaine | 1 SemUs |
| POLITICS_AND_GOVERNMENT_TS_domaine | 30 SemUs |
| POLITICS_TS_domaine | 12 SemUs |
| PRIMARY_AND_SECONDARY_EDUCATION_TS_domaine | 1 SemUs |
| PRINTING_TS_domaine | 4 SemUs |
| PROTESTANTISM_TS_domaine | 1 SemUs |
| PSYCHOLOGY_TS_domaine | 57 SemUs |
| PUBLISHING_TS_domaine | 8 SemUs |
| RADIO-TELEVISION_TS_domaine | 2 SemUs |
| RAIL_TRANSPORT_TS_domaine | 2 SemUs |
| RELIGION_TS_domaine | 8 SemUs |
| RESTAURATION_TS_domaine | 2 SemUs |
| RHETORIC_TS_domaine | 23 SemUs |
| ROAD_TRANSPORT_TS_domaine | 1 SemUs |
| ROMAN_CATHOLICISM_TS_domaine | 1 SemUs |
| SCIENCES_TS_domaine | 83 SemUs |
| SERVICE_INDUSTRY_TS_domaine | 7 SemUs |
| SEX_TS_domaine | 1 SemUs |
| SHOWS_TS_domaine | 4 SemUs |
| SITU_TS_classificateur_de_nom_C | 7 SemUs |
| SKIING_TS_domaine | 1 SemUs |
| SMOKING_TS_domaine | 2 SemUs |
| SOCCER_TS_domaine | 3 SemUs |
| SOCIAL_ACTION_TS_domaine | 237 SemUs |
| SOCIAL_SECURITY_TS_domaine | 9 SemUs |
| SOCIOLOGY_TS_domaine | 49 SemUs |
| SPORT_TS_domaine | 7 SemUs |
| SPORTS_AND_LEISURE_TS_domaine | 5 SemUs |
| STATISTICS_TS_domaine | 3 SemUs |
| SWIMMING_TS_domaine | 1 SemUs |
| THEATER_TS_domaine | 2 SemUs |
| THEMES_TS_domaine | 1 SemUs |
| TOWN_AND_COUNTRY_PLANNING_TS_domaine | 8 SemUs |
| TRANSPORT_TS_domaine | 7 SemUs |
| ZOOLOGY_TS_domaine | 1 SemUs |
Table 7: TOTAL number of Usem's per Semantic Class for Nouns
| ATTRIBUTE_TS_classificateur_de_nom | 537 SemUs |
| ABSTRACT_TS_classificateur_de_nom | 234 SemUs |
| ACTIVITY_TS_classificateur_de_nom | 224 SemUs |
| HUMAN_TS_classificateur_de_nom | 101 SemUs |
| EVENT_TS_classificateur_de_nom | 87 SemUs |
| NOTION_TS_classificateur_de_nom | 85 SemUs |
| OCCUPATION_AGENT_TS_classificateur_de_nom | 68 SemUs |
| COGNITIVE_FACT_TS_classificateur_de_nom | 55 SemUs |
| ARTIFACT_TS_classificateur_de_nom | 49 SemUs |
| AFFECTION_TS_classificateur_de_nom | 45 SemUs |
| SYSTEM_OF_THOUGHT_TS_classificateur_de_nom | 40 SemUs |
| LOCATION_TS_classificateur_de_nom | 30 SemUs |
| AMOUNT_TS_classificateur_de_nom | 30 SemUs |
| PROCESS_TS_classificateur_de_nom | 26 SemUs |
| TIME_PERIOD_TS_classificateur_de_nom | 19 SemUs |
| PSYCHOLOGICAL_FEATURE_TS_classificateur_de_nom | 17 SemUs |
| BUILDING_TS_classificateur_de_nom | 17 SemUs |
| GEOGRAPHY_TS_classificateur_de_nom | 14 SemUs |
| ENTITY_TS_classificateur_de_nom | 14 SemUs |
| FUNCTIONAL_SPACE_TS_classificateur_de_nom | 11 SemUs |
| CURRENCY_TS_classificateur_de_nom | 9 SemUs |
| STATE_TS_classificateur_de_nom | 8 SemUs |
| OBJECT_TS_classificateur_de_nom | 7 SemUs |
| CONCRETE_TS_classificateur_de_nom | 7 SemUs |
| PHENOMENON_TS_classificateur_de_nom | 4 SemUs |
| PERIOD_TS_classificateur_de_nom | 4 SemUs |
| LETTER_TS_classificateur_de_nom | 4 SemUs |
| GROUP_NAMES_TS_classificateur_de_nom | 4 SemUs |
| OCCUPATION_TS_classificateur_de_nom | 3 SemUs |
| MEASURE_UNIT_TS_classificateur_de_nom | 3 SemUs |
| LIVING_BEING_TS_classificateur_de_nom | 3 SemUs |
| GARMENT_TS_classificateur_de_nom | 3 SemUs |
| FACULTY_TS_classificateur_de_nom | 3 SemUs |
| BODY_PART_TS_classificateur_de_nom | 3 SemUs |
| ALTERATION_TS_classificateur_de_nom | 3 SemUs |
| VEHICLE_TS_classificateur_de_nom | 2 SemUs |
| SUBSTANCE_TS_classificateur_de_nom | 2 SemUs |
| OPERATION_TS_classificateur_de_nom | 2 SemUs |
| NON_LIVING_TS_classificateur_de_nom | 2 SemUs |
| INSTRUMENT_TS_classificateur_de_nom | 2 SemUs |
| FORM_TS_classificateur_de_nom | 2 SemUs |
| MONTH_TS_classificateur_de_nom | 1 SemUs |
| MATTER_TS_classificateur_de_nom | 1 SemUs |
| FURNITURE_TS_classificateur_de_nom | 1 SemUs |
| DAY_TS_classificateur_de_nom | 1 SemUs |
| BIO_TS_classificateur_de_nom | 1 SemUs |
| APPARATUS_TS_classificateur_de_nom | 1 SemUs |
Table 8: Number of Usem's per Semantic Class for Verb
| CHANGE_TS_classificateur_de_verbe | 71 SemUs |
| COGNITION_TS_classificateur_de_verbe | 50 SemUs |
| MOTION_TS_classificateur_de_verbe | 35 SemUs |
| COMMUNICATION_TS_classificateur_de_verbe | 29 SemUs |
| EMOTION_TS_classificateur_de_verbe | 14 SemUs |
| PERCEPTION_TS_classificateur_de_verbe | 11 SemUs |
| SOCIAL_TS_classificateur_de_verbe | 11 SemUs |
| STATIVE_TS_classificateur_de_verbe | 9 SemUs |
| CONTACT_TS_classificateur_de_verbe | 8 SemUs |
| POSSESSION_TS_classificateur_de_verbe | 8 SemUs |
| CREATION_TS_classificateur_de_verbe | 3 SemUs |
| WEATHER_TS_classificateur_de_verbe | 1 SemUs |
2. Semantic encoding
In the German SIMPLE lexicon the encoding of entries was performed using the encoding tool supplied by the Catalan site. The tool is an interface which allows to encode/check/browse the Simple data, stored in a relational database (MS-Access). It has been created by Marta Villegas and Teresa Sadurní and developed by Teresa Sadurní at the Institut d'Estudis Catalans (IEC), in Barcelona.
This tool has provided most valuable help for the encoding process. Its great number of very useful options for inserting entries, copying and maintaining existing data has allowed to perform an accurate and quick coding of entries and a continuous consistency checking of data.
2.1. Criteria for Syntax-Semantic linking
The syntactic-semantic linking is established on a one-to-one relation between SynUs and SemUs at the level of correspondence objects. As we were using the MS-Access database the choices for the linking were automatically proposed by the db and had to be chosen according to the particular features of the SEMU.
In the German PAROLE lexicon, entries may have one or more syntactic
units. As a rule, all meanings of one lemma corresponding to the most general
syntactic description were first encoded and the others left aside for
eventual later treatment. Consequently, this one syntactic unit is linked
to all the subsenses (semantic units) of the word in question.
2.1.1 One to many:
<SynU
id="USADJ1235"
example=""
naming=".A1"
description="DADJ17">
<CorrespSynUSemU
targetsemu="USem1948"
correspondence="ISObivalent">
<CorrespSynUSemU
targetsemu="USem1949"
correspondence="ISObivalent">
<CorrespSynUSemU
targetsemu="USem2262">
<CorrespSynUSemU
targetsemu="USem2263">
</SynU>
2.1.2 One to one:
<SynU
id="USN7132"
example=""
naming=".N1"
description="DN2">
<CorrespSynUSemU
targetsemu="USem406"
correspondence="ISOmonovalent">
2.1. 1.Types of Correspondence
Subtypes of Correspondence used:
|
|
|
|
|
| ISOmonovalent | Isomorphic mapping for unary predicates | Autor [author]
gehen [to go] |
Nouns
Monovalent verbs |
| ISObivalent | Isomorphic mapping for bivalent predicates | ändern[to change] | Bivalent verbs |
| ISOtrivalent | Isomorphic mapping for trivalent predicates | überzeugen [to convince] | Trivalent verbs |
| RED2to1 | reduced mapping for predicates with two arguments where Arg1 changes to Arg0 | Aufdruck [impresssion]
Ausbildung [education] |
deverbal nouns derived from verbs with a direct object or prepositional object |
| RED3to2 | reduced mapping for predicates with three arguments where Arg2 goes to Arg0 and Arg2 to Arg1 | Rennen [run]
Auseinandersetzung [conflict, quarrel, discussion] |
deverbal nouns with two prepositional objects or two objects |
2.2. Criteria for assigning Domain Features
As recommended in Linguistic Specifications, we tried to assign to each Usem a specific semantic class and template type available in the Linguistic Specifications as provided by the Access database. Due to the internal structure of the database, it was not possible to assign more than one domain.
Assignment of the Domain categories has followed in most cases the recommendations given in the Linguistic Specifications D2.1. The main criteria for the assignment of the Domain values were the following:
id = "USem413"
naming = "Polizei"
example = "Sicherheitsbehörde"
In a considerable number of cases we encountered problems with abstract
deverbal nouns like "Abbruch" [demolition]
which
has a relatively "concrete" meaning in connection with the demolition of
house in the field of construction works. In this connection it can be
assigned the domain TSVP_CONSTRUCTION_TS_domaine_D.The
word has a even more general sense denoting any kind of stopping or finishing
of an action which cannot be linked to any specific field. In this case
the domain field was left unspecified which is equivalent to the domain
class "General". Another illustrative case is the word "Anlass"
[occasion,
cause, reason] which- like many other expressions denoting
cognitive processes or relations - does not allow the assignment of any
specific domain.
Linguistic specifications by the specification group were the main criteria
for assigning template type and semantic class information. The dictionaries
of Helbig/Schenkel and Ballmer/Brennenstuhl were used for the encoding
of verbs, in adjective and noun encoding valuable help was found in the
dictionaries of Sommerfeldt/Schreiber.
2.3.1. Language specific typing
This gives an overview over a demonstrative selection of most common assingement types for template types:
2.3.3. Criteria for encoding Semantic Relations
In the SIMPLE model semantic relations between SemU’s are intended to increase the descriptive power of the model. The encoding of semantic relations between different entries was done on the basis on choosing the linguistically relevant distinctive semantic relations, even if these point to dummy SemU's.
The dummy Usems collected in the course of the encoding arose from the expansion of lexicon population mainly by virtue of the many abstract derivational nouns we had to deal with. The other source was the already mentioned wish to cover the relevant parts of the semantics of the words encoded in a comprehensive way.
Examples to illustrate the assignment of dummys are
2.5. Representation of Predicative information.Polysemic relations have not been implemented which is again due to the fact that we mainly had to deal with abstract word senses. Synonimy was only encoded with some adjectives. Hyponimy was not encoded.
Each predicative entry contains information in respect to:
The same predicate is used
Table 8: Number of Usem's per Semantic Relations for Verbs
| SRActivity | 15 SemUs |
| SRAgentive | 204 SemUs |
| SRAgentiveCause | 44 SemUs |
| SRAgentiveExperience | 4 SemUs |
| SRAgentiveprog | 1 SemUs |
| SRAgentverb | 5 SemUs |
| SRAntonym | 262 SemUs |
| SRAntonymComp | 179 SemUs |
| SRAntonymGrad | 48 SemUs |
| SRCausedby | 145 SemUs |
| SRCauses | 112 SemUs |
| SRConcerns | 19 SemUs |
| SRConstitutive | 7 SemUs |
| SRConstitutiveactivity | 31 SemUs |
| SRContains | 68 SemUs |
| SRCreatedby | 341 SemUs |
| SRDenominalNounVerb | 2 SemUs |
| SRDerivational | 6 SemUs |
| SRDerivedfrom | 33 SemUs |
| SRDirecttelic | 92 SemUs |
| SREntail | 4 SemUs |
| SREventverb | 10 SemUs |
| SRFormal | 5 SemUs |
| SRHasAsEffect | 158 SemUs |
| SRHasasmember | 65 SemUs |
| SRHasaspart | 101 SemUs |
| SRHasasproperty | 15 SemUs |
| SRIndirecttelic | 8 SemUs |
| SRInstrument | 31 SemUs |
| SRIsa | 51 SemUs |
| SRIsafollowerof | 1 SemUs |
| SRIsamemberof | 108 SemUs |
| SRIsapartof | 160 SemUs |
| SRIsin | 9 SemUs |
| SRIstheabilityof | 18 SemUs |
| SRIstheactivityof | 54 SemUs |
| SRIsthehabitof | 26 SemUs |
| SRKinship | 2 SemUs |
| SRLocation | 12 SemUs |
| SRMadeof | 16 SemUs |
| SRMeasuredby | 52 SemUs |
| SRMeasures | 1 SemUs |
| SRMetaphor | 16 SemUs |
| SRNominalization | 98 SemUs |
| SRNounadjective | 79 SemUs |
| SRNounPropernoun | 1 SemUs |
| SRObjectoftheactivity | 15 SemUs |
| SRPolysemy | 20 SemUs |
| SRPolysemyAgentofpersistentactivity-Profession | 1 SemUs |
| SRPolysemyBuilding-Institution | 4 SemUs |
| SRPolysemyCauseact-Nonrelationalact | 1 SemUs |
| SRPolysemyChangeofstate-Causechangeofstate | 10 SemUs |
| SRPolysemyCognitiveevent-Experienceevent | 12 SemUs |
| SRPolysemyConstitutivechange-Causeconstitutivechange | 1 SemUs |
| SRPolysemyDomain-Activity | 1 SemUs |
| SRPolysemyHumanGroup-Building | 2 SemUs |
| SRPolysemyHumanGroup-GeopoliticalLocation | 8 SemUs |
| SRPolysemyHumanGroup-Institution | 14 SemUs |
| SRPolysemyLocation-HumanGroup | 19 SemUs |
| SRPolysemyMove-Causemotion | 26 SemUs |
| SRPolysemyOpening-Artifact | 7 SemUs |
| SRPolysemyPlant-Substance | 1 SemUs |
| SRPolysemyPurposeact-Domain | 3 SemUs |
| SRPolysemySemioticartifact-Information | 2 SemUs |
| SRProcessverb | 10 SemUs |
| SRProducedby | 250 SemUs |
| SRProduces | 279 SemUs |
| SRProperty | 2 SemUs |
| SRPropertyof | 24 SemUs |
| SRPurpose | 42 SemUs |
| SRQuantifies | 8 SemUs |
| SRRelatedto | 18 SemUs |
| SRRelates | 4 SemUs |
| SRResult | 4 SemUs |
| SRResultingState | 48 SemUs |
| SRResultof | 333 SemUs |
| SRSource | 13 SemUs |
| SRSuccessorof | 1 SemUs |
| SRSynonym | 310 SemUs |
| SRTelic | 45 SemUs |
| SRTypicalof | 18 SemUs |
| SRUsedagainst | 7 SemUs |
| SRUsedas | 46 SemUs |
| SRUsedby | 14 SemUs |
| SRUsedfor | 107 SemUs |
Table 9: Number of Usem's per Semantic Relations for Nouns, Verbs, Adjectives
| TemplateADJSuperTypeIntensional | 373 SemUs |
| SyntacticTypeAttrPROT | 311 SemUs |
| SyntacticTypeAttrPredPROT | 200 SemUs |
| TemplateADJSuperTypeExtensional | 200 SemUs |
| TemplateSuperTypeEntityPROT | 144 SemUs |
| EventTypeProcessPROT | 117 SemUs |
| AspectINCHOATIVEESS | 106 SemUs |
| ScalarPlusPROT | 103 SemUs |
| DurationUnderspecifiedPROT | 89 SemUs |
| ScalarMinusPROT | 83 SemUs |
| ConnotationNegativePROT | 76 SemUs |
| EventTypeTransitionPROT | 75 SemUs |
| ConnotationPositivePROT | 73 SemUs |
| AspectDURATIVEESS | 71 SemUs |
| ConnotationPositiveESS | 69 SemUs |
| ConnotationUnderspecifiedESS | 59 SemUs |
| ConnotationNegativeESS | 57 SemUs |
| RealityConcrete | 57 SemUs |
| AudienceUnderspecifiedESS | 54 SemUs |
| AffectednessPositiveESS | 50 SemUs |
| TemplateSuperTypeEventPROT | 45 SemUs |
| UnificationPathHuman-AgentivePROT | 43 SemUs |
| IntentionalityYesESS | 41 SemUs |
| IterativePlusESS | 39 SemUs |
| TemplateSuperTypeRolePROT | 39 SemUs |
| DurationTEMPORARYPROT | 38 SemUs |
| Intersective | 38 SemUs |
| AgeUnderspecifiedESS | 35 SemUs |
| TemplateSuperTypeCauseRelationalChangePROT | 34 SemUs |
| TemplateSuperTypeTopPROT | 34 SemUs |
| MeaningCompBehaviourPositive | 32 SemUs |
| MeaningCompBehaviourNegative | 31 SemUs |
| PunctualYesESS | 31 SemUs |
| UnificationPathRelationalChange-AgentivePROT | 31 SemUs |
| AspectTERMINATIVEESS | 30 SemUs |
| UnificationPathConcreteentity-ArtifactAgentive-TelicPROT | 30 SemUs |
| TemplateSuperTypeLocationPROT | 29 SemUs |
| DurationTEMPORARYESS | 28 SemUs |
| MeaningCompAttitudeEvaluationPositive | 28 SemUs |
| RealityAbstract | 28 SemUs |
| EventTypeStatePROT | 27 SemUs |
| TemplateSuperTypeLivingentityPROT | 26 SemUs |
| MeaningCompSocietyPositive | 25 SemUs |
| DurationPERSISTENTPROT | 24 SemUs |
| MeaningCompPowerPositive | 24 SemUs |
| UnificationPathHuman-TelicPROT | 24 SemUs |
| UnificationPathRelationalAct-TelicPROT | 24 SemUs |
| TemplateSuperTypeGroupPROT | 23 SemUs |
| DurationUnderspecifiedESS | 22 SemUs |
| MeaningCompPsychologicalPropertyPositive | 22 SemUs |
| UnificationPathChange-AgentivePROT | 22 SemUs |
| UnificationPathRelationalAct-AgentivePROT | 22 SemUs |
| AttitudeFORESS | 21 SemUs |
| Dimension3PROT | 21 SemUs |
| UnificationPathConcreteentity-ArtifactAgentivePROT | 21 SemUs |
| MeaningCompActivenessPositive | 20 SemUs |
| StateSolidESS | 20 SemUs |
| TemplateSuperTypeCauseChangePROT | 20 SemUs |
| UnificationPathSpeechAct-TelicPROT | 20 SemUs |
| TemplateSuperTypePsychologicalEventPROT | 19 SemUs |
| AttitudeUnderspecifiedESS | 18 SemUs |
| ModalityFeatureExtrinsic | 18 SemUs |
| UnificationPathLocation-ArtifactAgentive-TelicPROT | 18 SemUs |
| MeaningCompAttitudeEvaluationNegative | 17 SemUs |
| Subsective | 17 SemUs |
| TemplateSuperTypeActPROT | 17 SemUs |
| Dimension2PROT | 16 SemUs |
| DirectionFORWARDESS | 16 SemUs |
| AgeAdultPROT | 15 SemUs |
| LegalYesESS | 14 SemUs |
| TemplateSuperTypeHumanPROT | 14 SemUs |
| MeaningCompSocialPropertyPositive | 13 SemUs |
| AgeAdultESS | 12 SemUs |
| AspectDURATIVEPROT | 12 SemUs |
| ModalityTypePossible | 12 SemUs |
| AffectednessNegativeESS | 11 SemUs |
| Dimension3ESS | 11 SemUs |
| DirectionUnderspecifiedESS | 11 SemUs |
| UnificationPathAbstractEntity-ResultAgentivePROT | 11 SemUs |
| Dimension2ESS | 10 SemUs |
| PrivativeYesESS | 10 SemUs |
| ScalarPlusESS | 10 SemUs |
| TemplateSuperTypeConcreteentityPROT | 10 SemUs |
| MeaningCompMoralPositive | 9 SemUs |
| MeaningCompNormalNeutral | 9 SemUs |
| MeaningCompPowerNegative | 9 SemUs |
| SexMalePROT | 9 SemUs |
| UnificationPathCognitiveEvent-TelicPROT | 9 SemUs |
| UnificationPathMove-CauseAgentivePROT | 9 SemUs |
| AudiencePUBLICESS | 8 SemUs |
| DirectionINWARDESS | 8 SemUs |
| LegalUnderspecifiedESS | 8 SemUs |
| SexFemalePROT | 8 SemUs |
| TemplateSuperTypeChangeofPossessionPROT | 8 SemUs |
| UnificationPathPsychologicalEvent-AgentivePROT | 8 SemUs |
| AgeOldPROT | 7 SemUs |
| MannerYesPROT | 7 SemUs |
| MeaningCompPsychStatePositive | 7 SemUs |
| MeaningCompSocialPropertyNeutral | 7 SemUs |
| ContactYesESS | 6 SemUs |
| HabitatEarthESS | 6 SemUs |
| MeaningCompBodilySensationPositive | 6 SemUs |
| MeaningCompBodyPositive | 6 SemUs |
| MeaningCompEffortPositive | 6 SemUs |
| MeaningCompNormalcyNegative | 6 SemUs |
| MeaningCompRelationalPropertyPositive | 6 SemUs |
| MeaningCompSocietyNeutral | 6 SemUs |
| MeaningCompTerminativePositive | 6 SemUs |
| PunctualNoESS | 6 SemUs |
| TemplateSuperTypeRelationalStatePROT | 6 SemUs |
| TemplateSuperTypeSpeechActPROT | 6 SemUs |
| AudiencePRIVATEESS | 5 SemUs |
| DirectionFORWARDPROT | 5 SemUs |
| DirectionUnderspecifiedPROT | 5 SemUs |
| MeaningCompActivenessNegative | 5 SemUs |
| MeaningCompEconomyPositive | 5 SemUs |
| MeaningCompLawPositive | 5 SemUs |
| MeaningCompSocialPropertyNegative | 5 SemUs |
| MeaningCompSocietyNegative | 5 SemUs |
| ReciprocalYesESS | 5 SemUs |
| SexMaleESS | 5 SemUs |
| TemporalityTypePresent | 5 SemUs |
| UnificationPathAct-CauseAgentivePROT | 5 SemUs |
| AgeYoungPROT | 4 SemUs |
| FormalityUnderspecifiedESS | 4 SemUs |
| MeaningCompActivenessNeutral | 4 SemUs |
| MeaningCompConsistencyPositive | 4 SemUs |
| MeaningCompInchoativePositive | 4 SemUs |
| MeaningCompIntensifyingPropertyNeutral | 4 SemUs |
| MeaningCompMaterialPositive | 4 SemUs |
| MeaningCompMilitaryNeutral | 4 SemUs |
| MeaningCompMilitaryPositive | 4 SemUs |
| MeaningCompPsychologicalPropertyNeutral | 4 SemUs |
| MeaningCompStabilityPositive | 4 SemUs |
| MeaningCompVisualPositive | 4 SemUs |
| PresuppositionNEWPROT | 4 SemUs |
| SexFemaleESS | 4 SemUs |
| TemplateSuperTypeStatePROT | 4 SemUs |
| AspectTERMINATIVEPROT | 3 SemUs |
| IntentionalityYesPROT | 3 SemUs |
| IterativeMinusPROT | 3 SemUs |
| MeaningCompComparisonNegative | 3 SemUs |
| MeaningCompConstitutionPositive | 3 SemUs |
| MeaningCompEconomyNegative | 3 SemUs |
| MeaningCompIllnessNegative | 3 SemUs |
| MeaningCompIterativePositive | 3 SemUs |
| MeaningCompMilitaryNegative | 3 SemUs |
| MeaningCompPhysicalPropertyNegative | 3 SemUs |
| MeaningCompPlaceNeutral | 3 SemUs |
| MeaningCompPunctualNeutral | 3 SemUs |
| MeaningCompTemporalPositive | 3 SemUs |
| ModalityFeatureIntrinsic | 3 SemUs |
| ModalityTypeEpistemic | 3 SemUs |
| PartitiveYesESS | 3 SemUs |
| PolarityNegativeESS | 3 SemUs |
| TemplateSuperTypePartPROT | 3 SemUs |
| UnificationPathConcreteentity-Agentive-TelicPROT | 3 SemUs |
| UnificationPathLocation-AgentivePROT | 3 SemUs |
| AffectednessNegativePROT | 2 SemUs |
| AgeYoungESS | 2 SemUs |
| Dimension1ESS | 2 SemUs |
| DurationPERSISTENTESS | 2 SemUs |
| MeaningCompDurativePositive | 2 SemUs |
| MeaningCompEconomyNeutral | 2 SemUs |
| MeaningCompHearingNegative | 2 SemUs |
| MeaningCompHearingPositive | 2 SemUs |
| MeaningCompPerceptionPositive | 2 SemUs |
| MeaningCompPoliticalNeutral | 2 SemUs |
| MeaningCompPoliticalPositive | 2 SemUs |
| PartitiveYesPROT | 2 SemUs |
| PolarityPositiveESS | 2 SemUs |
| TemplateSuperTypeCreationPROT | 2 SemUs |
| TemplateSuperTypePropertyPROT | 2 SemUs |
| TemplateSuperTypeSubstancePROT | 2 SemUs |
| TemporalityTypeFuture | 2 SemUs |
| AgeOldESS | 1 SemUs |
| ExplicitnessExplicitESS | 1 SemUs |
| FormalityFORMALESS | 1 SemUs |
| IntensityHighPROT | 1 SemUs |
| IterativeMinusESS | 1 SemUs |
| MannerYesESS | 1 SemUs |
| MeaningCompAttitudeEvaluationNeutral | 1 SemUs |
| MeaningCompAttitudeSaliencePositive | 1 SemUs |
| MeaningCompBodyNegative | 1 SemUs |
| MeaningCompComparisonPositive | 1 SemUs |
| MeaningCompConsistencyNeutral | 1 SemUs |
| MeaningCompEffortNeutral | 1 SemUs |
| MeaningCompEstheticPositive | 1 SemUs |
| MeaningCompIntelligenceNeutral | 1 SemUs |
| MeaningCompLawNegative | 1 SemUs |
| MeaningCompLawNeutral | 1 SemUs |
| MeaningCompMaterialNegative | 1 SemUs |
| MeaningCompMoralNegative | 1 SemUs |
| MeaningCompMovementNegative | 1 SemUs |
| MeaningCompPrecisionPositive | 1 SemUs |
| MeaningCompPsychologicalPropertyNegative | 1 SemUs |
| MeaningCompPsychStateNegative | 1 SemUs |
| MeaningCompQuantityPositive | 1 SemUs |
| MeaningCompReligionPositive | 1 SemUs |
| MeaningCompSetMembershipNeutral | 1 SemUs |
| MeaningCompSpaceDistributionPositive | 1 SemUs |
| MeaningCompSpacePositive | 1 SemUs |
| MeaningCompTemporalNegative | 1 SemUs |
| MeaningCompTemporalNeutral | 1 SemUs |
| MeaningCompTouchNegative | 1 SemUs |
| MeaningCompVisualNeutral | 1 SemUs |
| MeaningCompWeightPositive | 1 SemUs |
| PunctualUnderspecifiedESS | 1 SemUs |
| StateGasPROT | 1 SemUs |
| StateSolidPROT | 1 SemUs |
| TemplateSuperTypeAbstractEntityPROT | 1 SemUs |
| UnificationPathAspectual-CauseAgentivePROT | 1 SemUs |
| UnificationPathConcreteentity-TelicPROT | 1 SemUs |
| UnificationPathPsychologicalEvent-TelicPROT | 1 SemUs |
4. Bibliography