CCL Centre for Computational Linguistics K.U.Leuven

CLIN 17 - Program

Conditional entropy as a measure of linguistic remoteness between related languages

Jens Moberg, Charlotte Gooskens, John Nerbonne

University of Groningen

The Scandinavian languages are so alike that their speakers often communicate with each using their own language, a practice Haugen dubbed 'semi-communication'.

The success of semi-communication depends on the languages involved and is, moreover, asymmetric: a Dane understands Swedish more easily than a Swede understands Danish. We model the success of semi-communication through the conditional entropy of the phoneme mapping in corresponding words.

Semantically corresponding words were taken from frequency lists and aligned, and the conditional entropy of the phoneme mapping in the aligned word pairs was calculated. This measures how difficult it is to predict a phoneme in the native language given the corresponding phoneme in the foreign language. We also examine the conditional entropy of selected word classes, such as function versus content words and native versus loan words.
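
As an illustration of the measure described above, the following is a minimal Python sketch of conditional entropy over aligned phoneme pairs. It is not the authors' implementation: the abstract does not specify the alignment procedure or the logarithm base, so the sketch assumes the phonemes are already aligned one-to-one and uses base 2 (bits). The function name `conditional_entropy` and the toy symbols are hypothetical and do not come from any Scandinavian data.

```python
from collections import Counter
from math import log2


def conditional_entropy(pairs):
    """Return H(native | foreign) in bits.

    `pairs` is an iterable of (foreign_phoneme, native_phoneme) tuples,
    assumed to come from already aligned word pairs.
    """
    pairs = list(pairs)
    total = len(pairs)
    joint = Counter(pairs)                    # counts of (foreign, native)
    foreign = Counter(f for f, _ in pairs)    # counts of each foreign phoneme
    entropy = 0.0
    for (f, _n), count in joint.items():
        p_joint = count / total               # p(foreign, native)
        p_cond = count / foreign[f]           # p(native | foreign)
        entropy -= p_joint * log2(p_cond)
    return entropy


# Invented toy alignment (not real data): foreign "a" maps to native
# "a" or "e" equally often, while foreign "o" always maps to native "o".
toy_pairs = [("a", "a"), ("a", "a"), ("a", "e"), ("a", "e"),
             ("o", "o"), ("o", "o"), ("o", "o"), ("o", "o")]

# Predicting the native phoneme from the foreign one.
forward = conditional_entropy(toy_pairs)
# Swapping the roles of the two languages gives the other direction.
backward = conditional_entropy((n, f) for f, n in toy_pairs)

print(forward, backward)
```

In this toy case the two directions differ because one foreign phoneme splits into two native phonemes while every native phoneme has a single foreign counterpart. That is the kind of asymmetry the measure is meant to capture, though real results depend entirely on the word lists and alignments used.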

  

   
K.U.Leuven - CWIS | Copyright © Katholieke Universiteit Leuven | Comments on the content: Vincent Vandeghinste
Produced by: Vincent Vandeghinste | Last modified: 21 November 2006
URL: http://www.ccl.kuleuven.be/CLIN17/pos14.php