% % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % File: numerals.dtr % % Purpose: map number-language pairs into numerals % % Authors: Lynne Cahill & Gerald Gazdar, 1st January 1996 % % Email: lynneca@cogs.sussex.ac.uk, geraldg@cogs.sussex.ac.uk % % Address: COGS, Sussex University, Brighton BN1 9QH, UK % % Documentation: see paper noted below % % Related files: numerale.dtr % % Version: 2.01 % % % % Copyright (c) University of Sussex 1996. All rights reserved. % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % This fragment is that discussed in Lynne Cahill & Gerald Gazdar (1996) % "A lexical analysis of numeral expressions in three related languages", % unpublished manuscript, University of Sussex, available by anonymous % FTP from ftp.cogs.sussex.ac.uk in /pub/nlp/POLYLEX/papers/numerals.ps. # atom A D E G I M V 'u:' 'i:' '3:' 'O:' T 'y:' O 'e:' 'A:' U '|:'. # vars $v1: 1 2 3 4 5 6 7 8 9. # vars $v2: 2 3 4 5 6 7 8 9. # vars $v3: 3 4 5 6 7 8 9. % Variable $L ranges over the language identification atoms: # vars $L: D E G. % Set node L to the desired language identification atom here: L:<> == G. % or D or E % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % The Numeral node finds a value for the path by packaging up the % language identification atom with a linear representation of the meaning % of the numeral in a path and handing it over to the Syntax node for % evaluation: Numeral: == 0 == "Syntax:" c "" x "" i "">". % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % The Syntax node examines the initial attribute to ascertain the identity % of the language. If the language is Dutch, then the attribute is % simply removed and the semantic representation is processed by the % Syntax node itself. If the language is English, then the semantic % representation is passed over to Syntax/E for processing; if the language % is German, then the semantic representation is passed over to Syntax/G. % Syntax, Syntax/G and Syntax/E recursively transduce the semantic % representation into a syntactic representation that consists of a sequence % of roots. Each such root is then mapped to a phonological representation % via the Morphology node (or Morphology/E). Syntax: == <> == "Syntax/G:<>" == "Syntax/E:<>" == "<>" == "<>" Morphology: Morphology == Morphology == "<>" == Morphology "<>" == Morphology == "<>" == Morphology "<>" == Morphology Morphology: "<>" == Morphology == Morphology: <> == Morphology. % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % For the most part, the Morphology node simply maps the individual components % of the syntactic representation to the corresponding multilingual phonology % nodes which determine the phonology of the roots from the identity of the % language. Morphology: == "PhonCnj:" == "PhonNil:" == == "Phon001:" == "Phon002:" == "Phon003:" == "Phon004:" == "Phon005:" == "Phon006:" == "Phon007:" == "Phon008:" == "Phon009:" == "Phon010:" == "Phon011:" == "Phon012:" % The next two equations have the effect of selecting the variant roots % required in examples such as "thirteen" (3 + 10 ==> "thir" ^ "teen") % and "siebzig" (7 * 10 ==> "sieb" ^ "zig"): == "PhonA10:" == "PhonB10:" % The final set of equations deals with the realization of the hundreds and % thousands components: == "Phon100:" == "Phon100:" == "Phon000:". % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % Numbers like 23 should be realized as "twenty three" in English, not as % "three and twenty" as they would be if English followed exactly the pattern % of Dutch and German. And numbers like 302 should be realized as "three % hundred and two", not as "three hundred two" as they are in Dutch and % German. Syntax/E: <> == Syntax == Morphology <> == Morphology/E Morphology: <> == Morphology/E. % The number 100 should be realized as "one hundred" in English, not as % "hundred" as it would be if English followed exactly the pattern of % Dutch and German. Morphology/E: <> == Morphology == Morphology. % The number 30 in German is `dreissig', /draisIx/ and not `dreizig', % /draitsIx/. Syntax/G: <> == Syntax == == "<>" Morphology: Morphology/G == Morphology/G. Morphology/G: <> == Morphology == "PhonC10:". % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % The phonology nodes have been kept simple. No distinction is drawn % between roots and words: the only distinctions are those between % Syllable, Disyllable and Trisyllable. And there is no treatment of % stress. Null: <> == . Syllable: <> == Null <$L> == "" == "" == "" "" == "" "" == "" "". % Note that initial and final syllables within roots are not given any % internal analysis at present, though they really need to be (see the % redundant specification of final syllables given in the Phon100 and % Phon000 nodes). Disyllable: <> == Syllable == "" "". Trisyllable: <> == Syllable == "" "" "". % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % The multilingual root phonology nodes given below are all defined at the % level of the segment. But they could be much more effectively stated at % a featural level -- especially the vowels. PhonCnj: <> == Syllable == n == E == '{' == d == U == t. PhonNil: <> == Syllable == n == U == l == I. Phon001: <> == Syllable == n == 'e:' == w == V == a i == s. Phon002: <> == Syllable == n == t w == 'e:' == I == t == t w == 'u:' == E == t s v == a i == a. Phon003: <> == Syllable == d r == 'i:' == 'i:' == T r == T == '3:' == a i. Phon004: <> == Syllable == f == 'i:' == r == v == 'O:'. Phon005: <> == Syllable == f == f == v == E I == I == a I == f == v == 'y:' == n. Phon006: <> == Syllable == z == E == k == s == Null == s == I. Phon007: <> == Disyllable == z == v == @ n == 'e:' == @ == s == E == 'i:' == b == Null. Phon008: <> == Syllable == x == t == A == e I == Null == 'A:'. Phon009: <> == Syllable == n == n == Disyllable: == % This line cannot be omitted! == 'e:' == g @ == a I == O y. Phon010: <> == Syllable == t == n == 'i:' == E == t s == 'e:'. PhonA10: <> == Phon010 == Phon010:. PhonB10: <> == Syllable == Phon010 == x == @ == 'i:' == Null == Phon010 == I. % Deviant form of -zig suffix for German `dreissig'. PhonC10: <> == PhonB10 == s. Phon011: <> == Syllable == E == l == f == Trisyllable: == @ == l == % This line cannot be omitted! == v @ n. Phon012: <> == Syllable == t w == l == f == 'a:' == E == v == Phon002 == '|:'. Phon100: <> == Disyllable == h == n == d @ r t == O == V == d r E d == U. Phon000: <> == Disyllable == z @ n t == d == / I == T == a U == z @ n d == t == a u. % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % Test nodes: Word_1: <> == Numeral == 1. Word_2: <> == Numeral == 2. Word_3: <> == Numeral == 3. Word_4: <> == Numeral == 4. Word_5: <> == Numeral == 5. Word_6: <> == Numeral == 6. Word_7: <> == Numeral == 7. Word_8: <> == Numeral == 8. Word_9: <> == Numeral == 9. Word_10: <> == Numeral == 1. Word_11: <> == Numeral == 1 == 1. Word_12: <> == Numeral == 2 == 1. Word_13: <> == Numeral == 3 == 1. Word_14: <> == Numeral == 4 == 1. Word_15: <> == Numeral == 5 == 1. Word_16: <> == Numeral == 6 == 1. Word_17: <> == Numeral == 7 == 1. Word_18: <> == Numeral == 8 == 1. Word_19: <> == Numeral == 9 == 1. Word_20: <> == Numeral == 2. Word_21: <> == Numeral == 1 == 2. Word_31: <> == Numeral == 1 == 3. Word_41: <> == Numeral == 1 == 4. Word_51: <> == Numeral == 1 == 5. Word_61: <> == Numeral == 1 == 6. Word_71: <> == Numeral == 1 == 7. Word_81: <> == Numeral == 1 == 8. Word_91: <> == Numeral == 1 == 9. Word_92: <> == Numeral == 2 == 9. Word_93: <> == Numeral == 3 == 9. Word_94: <> == Numeral == 4 == 9. Word_95: <> == Numeral == 5 == 9. Word_96: <> == Numeral == 6 == 9. Word_97: <> == Numeral == 7 == 9. Word_98: <> == Numeral == 8 == 9. Word_99: <> == Numeral == 9 == 9. Word_100: <> == Numeral == 1. Word_101: <> == Numeral == 1 == 1. Word_121: <> == Numeral == 1 == 2 == 1. Word_221: <> == Numeral == 2 == 2 == 1. Word_521: <> == Numeral == 5 == 2 == 1. Word_5031: <> == Numeral == 5 == 3 == 1. Word_2345: <> == Numeral == 2 == 3 == 4 == 5. % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % # hide Numeral Syntax Morphology Phon001 Phon002 Phon003 Phon004 Phon005 Phon006 Phon007 Phon008 Phon009 Phon010 PhonA10 PhonB10 Phon011 Phon012 Phon100 Phon000 PhonCnj PhnNil Null Syllable Disyllable Trisyllable L PhonC10 Syntax/E Morphology/E. # show . % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % % The next line is the Revision Control System Id: do not delete it. % $Id: archive.dtr,v 1.1 1997/04/09 20:40:33 root Exp $