Some guidelines
- English hyphenation is removed.
- Use `'` as the apostrophe.
- Text will be in an Asciidoc file first, then an HTML file later.
- The dictionary part will be well-structured YAML.
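- Han characters that exist in Unicode are kept as in the original. Where a character is not in Unicode, a variant that is in Unicode is used instead. Characters I am unsure about are replaced with ? for now.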
- Syntactic misspellings (for example, `n.n.` is misspelled once as `n..n` on page 18) are corrected without extra markings.
- Minor typos are corrected and marked with a comment.
- Errata corrections are applied and marked with a comment.
- Misspelled words are preserved with another `<…>-orig` key.
- Some entries map a single POJ to multiple Han character words. There are two possible ways to deal with this:
  - write them as two entries but use a `-orig` key, as with a misspelling
  - write them as two entries but use a comment marker

  I think I'll go with the second option. Use this marker: “Was listed as […]”
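
A minimal sketch of the second option, assuming entries are held as plain dictionaries before being written out. The field names `poj`, `han` and `comment`, the `、` separator, the rendered layout, and the way the marker is filled in with the original combined listing are all assumptions made for illustration, not the actual structure of the data files.

```python
def split_multi_han(poj, han, sep="、"):
    """Return one entry per Han spelling listed for a single POJ headword."""
    variants = [v.strip() for v in han.split(sep) if v.strip()]
    if len(variants) < 2:
        # Nothing to split: keep the entry unmarked.
        return [{"poj": poj, "han": han}]
    # Assumption: the marker quotes the original combined listing.
    marker = f"Was listed as {han}"
    return [{"poj": poj, "han": v, "comment": marker} for v in variants]


def to_yaml_lines(entry):
    """Render one entry as YAML text, with the marker as a YAML comment."""
    lines = []
    if "comment" in entry:
        lines.append(f"# {entry['comment']}")
    lines.append(f"- poj: {entry['poj']}")
    lines.append(f"  han: {entry['han']}")
    return lines


entries = split_multi_han("Chhiⁿ-poàn-hā", "生半夏、青半夏")
for entry in entries:
    print("\n".join(to_yaml_lines(entry)))
```

Keeping the marker in a comment rather than in a `-orig` key means the data itself stays free of anything that looks like a typo correction.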
Markers
- DIFFERENCE
  - Disagrees with Chhoetaigi. Not well applied.
- Was listed as […]
  - Some entries have multiple Han character versions of one single POJ, for instance `Chhiⁿ-poàn-hā 生半夏、青半夏`. This becomes two mappings in plants.yaml, accompanied by this marker.
- original typo
  - Typos from the original text, either judged by me or from the errata. The corrected version is on the key, while the original version is in the `<key>-orig` key.
- UNCLEAR
  - Unclear entries.
- Missing Han character
  - There is a character that isn't in Unicode. The marker will be followed by an Ideographic Description Sequence describing the character. The main field will use a ? in place of the character; a `han-ids` key will be present which uses the sequence instead of the question mark. Han characters that are in Unicode but rare enough to appear as tofu in my Emacs are marked with their parts using Ideographic Description Characters, with a GlyphWiki link afterwards.
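
The relationship between the main field and the `han-ids` key can be checked mechanically. The sketch below is one possible way to do that, not part of the project: it walks a `han-ids` value, collapses each Ideographic Description Sequence into a single `?`, and so reproduces what the main field should contain. It only handles the twelve original description characters (U+2FF0 to U+2FFB), and the sample value is made up for illustration rather than taken from the dictionary.

```python
# The two three-operand description characters; the rest of U+2FF0..U+2FFB take two.
TERNARY = {"\u2ff2", "\u2ff3"}
BINARY = {chr(c) for c in range(0x2FF0, 0x2FFC)} - TERNARY


def ids_end(text, start):
    """Return the index just past the IDS (or plain component) at text[start]."""
    if start >= len(text):
        raise ValueError("truncated Ideographic Description Sequence")
    ch = text[start]
    if ch in BINARY:
        arity = 2
    elif ch in TERNARY:
        arity = 3
    else:
        return start + 1  # an ordinary character is a complete component
    pos = start + 1
    for _ in range(arity):
        pos = ids_end(text, pos)  # operands may themselves be nested sequences
    return pos


def mask_ids(han_ids, placeholder="?"):
    """Replace every IDS in a han-ids value with the placeholder."""
    out = []
    i = 0
    while i < len(han_ids):
        if han_ids[i] in BINARY or han_ids[i] in TERNARY:
            i = ids_end(han_ids, i)
            out.append(placeholder)
        else:
            out.append(han_ids[i])
            i += 1
    return "".join(out)


# Hypothetical han-ids value: one undescribed character followed by 子.
print(mask_ids("⿰木患子"))  # -> ?子
```

Comparing `mask_ids(entry["han-ids"])` with the main field would flag entries where the two have drifted apart, assuming entries are loaded as dictionaries.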