資料綱要
在這裡以 YAML 格式說明各欄位與其意思。欄位名稱寫 ?
的時候代表這個欄位不一定會出現。
plants.yaml
- title: "學名" # 像是 Engelhardtia formosana
by: "人名" # 像是 Hayata
# 主學名之後的斜體學名。凡例中沒有提到,但可以判斷是別名。
# 在第 255 頁、第 264 頁等處使用。
aliases?:
- title: "學名別名"
by: "人名"
# 俗名列表。
names?:
- romaji: "Romaji" # 日語羅馬字
romaji-orig?: "" # 原文有錯字的時候原文在此
kana: "ロマジ" # 日語片假名(舊假名用法)
kana-orig?: "" # 原文有錯字的時候原文在此
n.n.?: true # 是否有 "n.n." 標記
- poj: "Chhiū-m̂" # 傳統白話字
poj-orig?: "" # 原文有錯字的時候原文在此
han: "樹梅" # 漢字
han-orig?: "白話字" # 原文有錯字的時候原文在此
hakka?: true # 是否為客語,true 代表是,false 或不存在代表否(是臺語)
- native: "..." # 原住民語以日語相近發音再轉寫成羅馬字
group: "..." # 哪個原住民族
- note: "..." # 見第 247 頁 "Citrus maxima" 條目
where?: "全島"
page: 10
family: "Polypodiaceae"
# 是否為原生種,true 為是,否代表 "cultivated or introduced"
indigenous: true
categories.yaml
關於這本書所使用的植物分類系統,作者在自序中提到:
About four years ago I had worked through the whole system and through all the works relating to our flora. Having finished this I thought it desirable to change from Benthan and Hooker’s Natural System to that of Engler and Prantl, as laid down in the Engler und Gilg’s “Syllabus der Pflanzenfamilien” (1924). The latter system is followed in this list, with the exception, however, of the Conifers and Taxads, which are arranged according to Pilger’s work on the Coniferae in Engler und Prantl’s “Natürliche Pflanzenfamilien”, 2. Aufl. (1926).
- latin: "Embryophyta Asiphonogama"
han: "無管精有胚植物(高等隱花植物)"
children:
…
- …
各科的 schema 則是:
- latin: "Marattiaceae"
han: "觀音座蓮科"
romaji: "Ryûbintai-kwa" # 依文內所寫
indigenous: true
hanIndex.yaml
- title: "手藥"
refpages: [140]
page: 543
stroke_count: 4
- …
漢字索引,會在完成 plants.yaml 後大幅完善。
資料內標記
-
DIFFERENCE :: disagrees with Chhoetaigi. Not well applied.
-
Was listed as […] :: Some entries have multiple Han character versions of one single POJ, for instance =Chhiⁿ-poàn-hā 生半夏、青半夏=. This becomes two mappings in plants.yaml, accompanied by this marker.
-
original typo :: typos from the original text, either judged by me or from the errata. The corrected version is on the key, while the original version is in the =
-orig= key. -
UNCLEAR :: unclear entries
-
Missing Han character :: there is a character that isn't in Unicode. The marker will be followed by a Ideographic Description Sequence describing the character. The main field will use a ? in place of the character; a =han-ids= key will be present which uses the sequence instead of a question mark.
Han characters that are in Unicode but which are rare enough to appear as a tofu in my Emacs are marked with its parts using IDEOGRAPH DESCRIPTION characters, with a glyphwiki link afterwards.