# Corpus **Repository Path**: darkyu/Corpus ## Basic Information - **Project Name**: Corpus - **Description**: No description available - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2020-11-04 - **Last Updated**: 2020-12-19 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # Corpus - parallel\_corpus_dict: 平行语料相关字典 + 'simple_jp_zh_proper_noun.txt': 专有词表, 用于平行句判断。 + 'kanji_hanzi_list.txt': 中日汉字对照表 - zh\_dict: + strokes.txt: (unihan对应的)笔画数, 只有按unicode顺序的笔画数 + unihan\_strokes.txt: unicode [space] stroke