Login

View change log entry

Back

Navigation:  ◀ 75116  75118 ▶ 

Change log entry 75117
Processed by: richwarm (2022-07-01 11:26:00 GMT)
Comment: << review queue entry 71037 - submitted by 'cws' >>
TL;DR If a two (or more) syllable proper noun can be read in other way(s), provide a definition for those other way(s)

I'm not advocating to provide definitions of all the possible combinations of measurement uses as was assumed here:
"Rather than add a long list of terms like
- meter high
- foot long
- centimeter deep
- meter wide
- meter deep
- inches high
... a dictionary will expect you to learn the pattern mentioned above.
(and so on)"

Only when a proper noun has an ambiguity with a usual phrase, then might as well disambiguate the proper noun and the phrase by providing a definition for the phrase

If 米高 as 'Michael (name)' will be accepted, might as well accept this too:

米高 米高 [mi3 gao1] /meters high/meters/tall/

Otherwise, if [Mi3 gao1] only will be accepted but not [mi3 gao1], ambiguous pinyinizations will occur when a software that relies on dictionary terms have only a proper noun definition for two or more consecutive characters but don't have the another definition for the typical use of that same two or more consecutive characters

https://user-images.githubusercontent.com/91402471/176124127-77115b8f-1ec3-41aa-9899-8bbff2ea49e0.png

In fact, there are many words like 米高 that have two or more meanings. 都会 comes to mind. It means metropolis, but it also means 'all will', so for this kind of phrase, the app can just split 都会 for its typical use, so the language learners of Chinese can interpret '都 会' as 'all will', but the software should still allow the language learners to know that the split ‘都 会’ can also mean metropolis

It's hard to pin down the exact meaning of non-metropolis use of 都会 though; so splitting is a good option, instead of providing an unclear definition for non-metropolis use of ‘都 会'

For [mi3 gao1] though, it has a well-defined meaning for non-Michael definition of 米高, that is 'meters high/meters tall'. So I guess, even if 米高 is combined together for Michael, the software could still provide language learners a 'meters high/meters tall' definition for that combined 米高 if the dictionary has a 'meters high/meters tall' definition for that. Otherwise, if [Mi3 gao1] will be accepted but not [mi3 gao1], the software should allow the users to read 米高 in its split form ‘米 高’, so users can still read its typical use, that is 'meters high/meters tall'

I raised this issue on another dictionary too: https://github.com/ssb22/CedPane/issues/6
---------------------------------

Editor:
> "if [Mi3 gao1] only will be accepted but not [mi3 gao1], ambiguous pinyinizations will occur ..."

两米高衣柜 not really ambiguous if you apply common sense, though. The alternatives include
a) "two Michael wardrobe"
b) "two meter high wardrobe"
c) "two rice high wardrobe"
etc.
Clearly, (b) is the most likely to be correct.

There are limitations to what one can achieve with CC-CEDICT unless one uses human intelligence OR one's software incorporates sophisticated AI.
Diff:
# - 米高 米高 [Mi3 gao1] /Michael/
# + 米高 米高 [Mi3 gao1] /Michael (name)/
By MDBG 2024
Privacy and cookies
Help wanted: the CC-CEDICT project is looking for new volunteer editors!