| Change log entry 94938 | |
|---|---|
| Processed by: | kbaiko (2026-05-03 13:39:16 UTC) |
| Comment: |
<< review queue entry 87068 - submitted by 'kbaiko' >>
<< follow-up of change log entry 94707 >>
<< review queue entry 86759 >>

[[{...}]] messes up the preprocessor that splits the pinyin for the output data set. Use [[..]] or [...] instead. These were the only characters in the current output data whose pinyin did not have spaces. (And they subsequently broke something I was using.)

Note 1: Additionally, I tried searching this site for '[[{' but got no results. I'm not sure whether there is a way to search for literals or regexes, or to get the current full export, to check for other similar issues.

Note 2: It doesn't help me to search the final data set, because most of the extra characters in the pinyin get removed before the final CSV output. It feels like a data standardizer script should run through the data here, force a convention for some things, and flag the rest (probably for all sorts of things), with the scripts then archived for reuse.

----------

Editor: Following up on this submission. I don't have a way of contacting you, as you submitted anonymously, so I'm going to put this on the change log and just hope you see it. Feel free to reach out with comments or questions.

First, my previous comments didn't explain why we needed {}. I've updated our wiki here https://cc-cedict.org/wiki/syntax_v2#non-chinese_characters with an explanation and guidance on how to handle some special entries. I hope it clears up the confusion, as this was previously undocumented.

Second, the parsing logic has been updated, and {} are actually no longer necessary for many entries. In particular, they were removed from the "coser" entry and from a handful of entries containing numbers in the headword. However, they are still necessary for 兡 and similar characters.

We probably would not have discussed this without your submission, so thank you for inspiring us to do so :) |
| Diff: |
# - 兡 兡 [[{bai3ke4}]] /(old) contracted variant of 百克[bai3ke4]/
# + 兡 兡 [[bai3ke4]] /(old) contracted variant of 百克[bai3ke4]/
# - 粨 粨 [[{bai3mi3}]] /(old) contracted variant of 百米[bai3mi3]/
# + 粨 粨 [[bai3mi3]] /(old) contracted variant of 百米[bai3mi3]/
# - 瓸 瓸 [[{bai3wa3}]] /(old) contracted variant of 百瓦[bai3wa3]/
# + 瓸 瓸 [[bai3wa3]] /(old) contracted variant of 百瓦[bai3wa3]/
# - 兝 兝 [[{fen1ke4}]] /(old) contracted variant of 分克[fen1ke4]/
# + 兝 兝 [[fen1ke4]] /(old) contracted variant of 分克[fen1ke4]/
# - 瓰 瓰 [[{fen1wa3}]] /(old) contracted variant of 分瓦[fen1wa3]/
# + 瓰 瓰 [[fen1wa3]] /(old) contracted variant of 分瓦[fen1wa3]/
# - 兞 兞 [[{hao2ke4}]] /(old) contracted variant of 毫克[hao2ke4]/
# + 兞 兞 [[hao2ke4]] /(old) contracted variant of 毫克[hao2ke4]/
# - 瓱 瓱 [[{hao2wa3}]] /(old) contracted variant of 毫瓦[hao2wa3]/
# + 瓱 瓱 [[hao2wa3]] /(old) contracted variant of 毫瓦[hao2wa3]/
# - 兣 兣 [[{li2ke4}]] /(old) contracted variant of 釐克|厘克[li2ke4]/
# + 兣 兣 [[li2ke4]] /(old) contracted variant of 釐克|厘克[li2ke4]/
# - 兛 兛 [[{qian1ke4}]] /(old) contracted variant of 千克[qian1ke4]/
# + 兛 兛 [[qian1ke4]] /(old) contracted variant of 千克[qian1ke4]/
# - 瓩 瓩 [[{qian1wa3}]] /(old) contracted variant of 千瓦[qian1wa3]/
# + 瓩 瓩 [[qian1wa3]] /(old) contracted variant of 千瓦[qian1wa3]/
# - 兙 兙 [[{shi2ke4}]] /(old) contracted variant of 十克[shi2ke4]/
# + 兙 兙 [[shi2ke4]] /(old) contracted variant of 十克[shi2ke4]/
# - 瓧 瓧 [[{shi2wa3}]] /(old) contracted variant of 十瓦[shi2wa3]/
# + 瓧 瓧 [[shi2wa3]] /(old) contracted variant of 十瓦[shi2wa3]/ |
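
The submitter's idea of a script that flags unusual pinyin fields could be sketched roughly as below. This is only an illustration, not a tool the project provides: the function name `flag_braced_pinyin` is hypothetical, and the line layout it expects (`TRADITIONAL SIMPLIFIED [[pinyin]] /definition/`, with either single or double brackets) is an assumption based on the entries in the diff above. It only reports entries whose pinyin contains `{` or `}`; it does not rewrite them, since the editor notes that braces are still required for some entries.

```python
import re

# Assumed entry layout (taken from the diff lines in this change log entry):
#   TRADITIONAL SIMPLIFIED [[pinyin]] /definition/.../
# Single brackets ([pinyin]) are accepted as well.
ENTRY_RE = re.compile(r"^(\S+) (\S+) (\[+)([^\]]*)(\]+) (/.*/)$")


def flag_braced_pinyin(lines):
    """Return (line_number, traditional_headword, pinyin) for every entry
    whose pinyin field contains a curly brace. Hypothetical helper."""
    flagged = []
    for number, line in enumerate(lines, start=1):
        line = line.strip()
        # Skip blank lines and comment lines.
        if not line or line.startswith("#"):
            continue
        match = ENTRY_RE.match(line)
        if match is None:
            # Line does not fit the assumed layout; ignored in this sketch.
            continue
        pinyin = match.group(4)
        if "{" in pinyin or "}" in pinyin:
            flagged.append((number, match.group(1), pinyin))
    return flagged


if __name__ == "__main__":
    sample = [
        "兡 兡 [[{bai3ke4}]] /(old) contracted variant of 百克[bai3ke4]/",
        "兡 兡 [[bai3ke4]] /(old) contracted variant of 百克[bai3ke4]/",
    ]
    print(flag_braced_pinyin(sample))  # [(1, '兡', '{bai3ke4}')]
```

Reporting rather than stripping keeps the decision with the editors, who can check each flagged entry against the wiki's rules for non-Chinese characters.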