Change log entry 75116 | |
---|---|
Processed by: | richwarm (2022-07-01 11:23:04 UTC) |
Comment: |
<< review queue entry 71036 - submitted by 'cws' >> If 米高 as 'Michael (name)' will be accepted, might as well accept this too: 米高 米高 [mi3 gao1] /meters high/meters/tall/ Otherwise, if [Mi3 gao1] only will be accepted but not [mi3 gao1], ambiguous pinyinizations will occur when a software that relies on dictionary terms have only a proper noun definition for two or more consecutive characters but don't have the another definition for the typical use of that same two or more consecutive characters https://user-images.githubusercontent.com/91402471/176124127-77115b8f-1ec3-41aa-9899-8bbff2ea49e0.png In fact, there are many words like 米高 that have two or more meanings. 都会 comes to mind. It means metropolis, but it also means 'all will', so for this kind of phrase, the app can just split 都会 for its typical use, so the language learners can interpret '都 会' as 'all will', but the software should still allow the language learners to know that the split ‘都 会’ can also mean metropolis It's hard to pin down the exact meaning of non-metropolis use of 都会 though; so splitting is a good option, instead of providing an unclear definition for non-metropolis use of ‘都 会' For [mi3 gao1] though, it has a well-defined meaning for non-Michael definition of 米高, that is 'meters high/meters tall'. So I guess, even if 米高 is segmented together for Michael, the software could still provide language learners a 'meters high/meters tall' definition for 米高 if the dictionary has a 'meters high/meters tall' definition for that. Otherwise, if [Mi3 gao1] will be accepted but not [mi3 gao1], the software should also allow the users to read 米高 in its split form ‘米 高’, so users can still read its typical use, that is 'meters high/meters tall' I raise this issue on another dictionary too: https://github.com/ssb22/CedPane/issues/6 ------------------------------------ Editor: 1) I'd say 两米高衣柜 should be parsed as (两米)高(衣柜) rather than 两(米高)(衣柜). 2) Most dictionaries (including CC-CEDICT) only provide the metropolis sense of 都会 because 都会, when it means "can all" or "will all" etc., is a sequence of two words (都 and 会) and those two words are defined elsewhere in the dictionary. It seems like CC-CEDICT is not ideal for the purpose for which you want to use it. |
Diff: |
# 米高 米高 [Mi3 gao1] /Michael (name)/ - 米高 米高 [Mi3 gao1] /Michael/ + 米高 米高 [Mi3 gao1] /Michael (name)/ |