% % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % File: nation.dtr % % Purpose: analysis of nationality adjectives in English % % Author: Geoffrey K. Pullum, October 15, 1994 % % Email: gkp@ling.ucsc.edu % % Address: Stevenson College, UCSC, Santa Cruz, CA 95064, USA % % % % Copyright (c) UC Santa Cruz, 1994. All rights reserved. % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % Phonological representations: % ae cardinal 4+ vowel (ash) % ay cardinal 4-1 diphthong % au cardinal 6 vowel % ch voiceless palato-alveolar affricate % j voiced palato-alveolar affricate % & silent front vowel causing velar softening ([k] --> [s]) % e cardinal 2 % ey cardinal 3-1 diphthong % @r rhotacized central unrounded vowel % ii long (tense) cardinal 1 vowel % ng voiced velar nasal % sh voiceless palato-alveolar fricative % y voiced palatal central approximant # vars $v: a aa e ee i ii o oo u uu ae au . % The Null node defines a message that is shown when a form does not exist: Null: <> == '-- nonexistent --'. % The Morphonology node is a function doing some crude morphophonemics: Morphonology: <> == == <$seg> == $seg <> == l <> == s <> == j <> == ng g <> == aur <> == a <> <$v i @> == $v y @ <> == i y @ <> == sh $v <>. % Nation is the main set of defaults for nation-state-name lexemes. % A nation-state noun like `Russia' is typically a nationality noun stem plus % a suffix, typically -a; a nationality noun stem like `Russi-' is typically % a root plus a stem augment; a nationality adjective like `Russian' is a stem % plus an adjectival suffix; an inhabitant noun (as in `a Russian') is usually % like the nationality adjective. By default, there is no generic plural % adjective, hence no `*The Russian are coming'. Nation: == "" "" == "" "" == "" "" == "" == "" == Null == == @ == @ n == Null == Null == Null == Null == Morphonology:<"" !>. % The Land node deals with nationality words typically having nation-state % noun in -land and nationality adjective in -ish, e.g. England, English, % and these have a generic plural adjective as in `The English are coming': Land: <> == Nation == "" == "" == i sh == l @ n d == "" "". % The Ese node deals with nationality words with nationality adjective % in -ese, e.g. China, Chinese. These have a generic plural adjective % as in `The Chinese are coming', but in the author's dialect they lack % the inhabitant noun form (*a Chinese, *several Chinese) -- though this % is known from other dialects and can be accounted for through a simple % modification: Ese: <> == Nation == i z == "" == Null. % For those dialects accepting `a Chinese', include the following: % == "" % The Stan node is for country lexemes with adjective in -i, e.g. Iraqi: Stan: <> == Nation == i ==. Islands: <> == Nation == @ z. % The Xman node is for the properties of the (vaguely archaic or demeaning) % alternative inhabitant noun forms in -man: Xman: == "" == "" m @ n. % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % The countries of the world Afghanistan: <> == Stan == 'Republic of Afghanistan' == ae f g ae n i s t aa n. Albania: <> == Nation == 'Republic of Albania' == ae l b ey n == i. Algeria: <> == Nation == 'Democratic and Popular Republic of Afghanistan' == ae l j ii r == i. America: <> == Nation == 'United States of America' == @ m e r i k == 'USA'. Andorra: <> == Nation == 'Principality of Andorra' == ae n d o r. Angola: <> == Nation == 'Republic of Angola' == ae n g o l. Antigua: <> == Nation == 'Antigua and Barbuda' == ae n t ii g w. Arabia: <> == Nation == 'Kingdom of Saudi Arabia' == @ r ey b == i. Argentina: <> == Nation == 'Argentine Republic' == aa r j @ n t ii n == aa r j @ n t ay n == i. Armenia: <> == Nation == 'Republic of Armenia' == aa r m ii n == i. Australia: <> == Nation == 'Commonwealth of Australia' == a u s t r ey l == i. Austria: <> == Nation == 'Republic of Austria' == a u s t r == i. Azerbaijan: <> == Stan == 'Azerbaijani Republic' == ae z @r b ay j aa n. Bahamas: <> == Islands == 'The Bahamas' == b @ h aa m == b @ h ey m i. Bahrain: <> == Stan == 'State of Bahrain' == b aa r ey n. Bangladesh: <> == Stan == 'Peoples Republic of Bangladesh' == b ae n g l @ d ey sh. Barbados: <> == Nation == 'Barbados' == b aa r b ey d == o s == i. Barbuda: <> == Nation == 'Antigua and Barbuda' == b aa r b u d. Belarus: <> == Nation == 'Republic of Belarus' == b ey l @ r uu s == == b ey l @ r @ sh. Belgium: <> == Nation == 'Kingdom of Belgium' == b e l j == @ m. Belize: <> == Nation == 'Belize' == b e l i z == b e l i z == i. Benin: <> == Nation == 'Republic of Benin' == b e n i n == b e n i n == i. Bhutan: <> == Ese == 'Kingdom of Bhutan' == b uu t aa n == b uu t @ n. Bolivia: <> == Nation == 'Republic of Bolivia' == b o l i v == i. Bosnia: <> == Nation == 'Bosnia and Herzegovina' == b o z n == i. Botswana: <> == Nation == 'Republic of Botswana' == b o t s w aa n. Brazil: <> == Nation == 'Federative Republic of Brazil' == b r @ z i l == i. Britain: <> == Land == 'United Kingdom of Great Britain and Northern Ireland' == b r i t == @ n. Brunei: <> == Nation == 'Brunei Darussalam' == b r uu n e i. Bulgaria: <> == Nation == 'Republic of Bulgaria' == b @ l g e r == i. % Burkina Faso: ? % <> == Nation. Burma: <> == Ese == 'Union of Myanmar' b @r m. Burundi: <> == Nation == 'Republic of Burundi' == i. Cambodia: <> == Nation == 'State of Cambodia' == k ae m b o d == i. Cameroon: <> == Nation == 'Republic of Cameroon' == k ae m @r uu n == i. Canada: <> == Nation == 'Canada' == k a n a d == i == a. Chad: <> == Nation == 'Republic of Chad' == ch ae d == i. Chile: <> == Nation == 'Republic of Chile' == ch i l e == . China: <> == Ese == 'Peoples Republic of China' == ch ay n == == Xman. Colombia: <> == Nation == 'Republic of Colombia' == k @ l @ m b. Comoros: <> == Nation == 'Federal Islamic Republic of the Comoros' == k o m @ r o. Congo: <> == Ese == 'Republic of Congo' == == k a n g o == l. CostaRica: <> == Nation == 'Republic of Costa Rica' == k o s t a r ii k. Croatia: <> == Nation == 'Republic of Croatia' == k r o ey t == i. Cuba: <> == Nation == 'Republic of Cuba' == k y uu b. Cyprus: <> == Nation == 'Republic of Cyprus' == s ay p r == @ s == s i p r i == @ t. Czechoslovakia: <> == Nation == 'Czech and Slovak Federal Republic' == ch e k o s l o v ae k == Null == i. Denmark: <> == Land == 'Kingdom of Denmark' == d e n m a r k == d ey n. Djibouti: <> == Nation == 'Republic of Djibouti' == j i b uu t == i ==. Dominica: <> == Nation == 'Commonwealth of Dominica' == d a m i n ii k. Ecuador: <> == Nation == 'Republic of Ecuador' == e k w @ d o r == i. Egypt: <> == Nation == 'Arab Republic of Egypt' == ii j i p t == i. ElSalvador: <> == Nation == 'Republic of El Salvador' == s ae l v @ d o r == i. Eritrea: <> == Nation == 'State of Eritrea' == e r i t r ey. Estonia: <> == Nation == 'Republic of Estonia' == e s t o n == i. England: <> == Land == 'England' == i n g l == Null == Xman. Ethiopia: <> == Nation == 'Ethiopia' == ii th i o p == i. Fiji: <> == Nation == 'Republic of Fiji' == f i j i. Finland: <> == Land == 'Republic of Finland' == f i n. France: <> == Nation == 'French Republic' == f r ae n k == f r e n ch == Null == Xman == &. Gabon: <> == Ese == 'Gabonese Republic' == g ae b a n. Gambia: <> == Nation == 'Republic of the Gambia' == g ae m b == i. Georgia: <> == Nation == 'Republic of Georgia' == j o r j. Germany: <> == Nation == 'Federal Republic of Germany' == j @r m == y. Ghana: <> == Nation == 'Republic of Ghana' == g aa n == g aa n ey. Greece: <> == Nation == 'Hellenic Republic' == g r ii k == &. Greenland: <> == Land == 'Kalaallit Nunaat' == g r ii n == l ae n d i k == @r. Grenada: <> == Nation == 'Grenada' == g r e n ey d == i. Guatemala: <> == Nation == 'Republic of Guatemala' == g w a t @ m aa l. Guinea: <> == Nation == 'Republic of Guinea' == g i n i == . Guyana: <> == Nation == 'Co-operative Republic of Guyana' == g ay aa n. Haiti: <> == Nation == 'Republic of Haiti' == h ey t i. Holland: <> == Land == h a l == n e dh @r l @ n d z == d @ ch == Xman == Null. Hungary: <> == Nation == h @ n g e r == i. Iceland: <> == Land == ay s == l ae n d i k == @r. India: <> == Nation == i n d == i. Ireland: <> == Land == ay r == e@r @ == Xman. Iraq: <> == Stan == i r ae k. Israel: <> == Stan == i z r ey l. Italy: <> == Nation == i t ae l == i == y. Liechtenstein: <> == Nation == l i k t @ n s t ay n == == i. Luxembourg: <> == Nation == l @ k s @ m b @r g == i == . Mexico: <> == Nation == m e k s i k == o. Nauru: <> == Nation == n a u r u. Norway: <> == Nation == n o r w == ey == == ii j. Pakistan: <> == Stan == p aa k i s t aa n. Poland: <> == Land == p o l == . Portugal: <> == Ese == p o r t j u g == i z == @ l. Scotland: <> == Land == s k a t == s == . Slovakia: <> == Nation == s l o v ae k == == i. Spain: <> == Nation == s p ae n == s p ey n == Land == y a r d. Sweden: <> == Nation == s w ii d == en == Land. Switzerland: <> == Land == s == s w i == t s @r == Null. Turkey: <> == Land == t @r k == i. Wales: <> == Land == w ey l z == w e l sh == Null == Xman. % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % Some example theorems: % % Wales: % = w ey l z % = w ey l z % = -- nonexistent -- % = -- nonexistent -- % = w e l sh m @ n % = w e l sh m @ n % = w e l sh % = w e l sh % = w e l sh % = w e l sh % = -- nonexistent -- % = -- nonexistent --. % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % # hide Ese Land Morphonology Nation Null Stan Xman Islands. # show . % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % The next line is the Revision Control System Archive Id: do not delete it. % $Id: archive.dtr,v 1.1 1997/04/09 20:40:33 root Exp $