ZHAO Mengling,JIN Xiaofeng.Korean ancient books character recognition method based on unified Chinese and Korean characters ideographic description sequences coding[J].Journal of Yanbian University,2024,(02):101-106.
基于中朝统一IDS编码的朝鲜语古籍文字识别方法
- Title:
- Korean ancient books character recognition method based on unified Chinese and Korean characters ideographic description sequences coding
- 文章编号:
- 1004-4353(2024)02-00101-06
- Keywords:
- Korean ancient books; zero-shot; character recognition; character coding; ideographic description sequences
- 分类号:
- TP391.1
- 文献标志码:
- A
- 摘要:
- 为解决朝鲜语古籍中的中文和朝鲜文字混排的识别难题,提出一种中朝文字的表意文字描述序列(IDS)统一编码方案,旨在通过利用偏旁分解字符识别模型(CCR-CLIP)识别朝鲜语古籍文字.首先,根据中朝文字结构的相似性,对文字中出现的汉字偏旁、朝鲜文字字母和12种基本结构进行了统一编码;其次,通过加入朝鲜文字的IDS序列扩充了CCR-CLIP原模型中提供的汉字的IDS序列文件;最后,通过在训练阶段使用印刷体文字训练的方式解决了朝鲜语古籍样本少的问题.
- Abstract:
- In order to solve the problem of recognition of mixed Chinese and Korean characters in ancient Korean books,this paper proposes a unified ideographic description sequence (IDS) encoding scheme for Chinese and Korean characters,which aims to recognize ancient Korean books by using a side decomposition chinese character recognition-contrastive language–image pre-training) (CCR-CLIP). Firstly,according to the similarity of Chinese and Korean characters,the Chinese characters’ side edges,Korean characters’ letters and 12 kinds of basic structures are uniformly coded. Secondly,the IDS sequence file of Chinese characters provided in the original model of CCR-CLIP is extended by adding IDS sequence of Korean characters. Finally,the problem of few samples of Korean ancient books was solved by using printed characters in the training stage. The results show that compared with the CCR-SLD method,the character recognition accuracy of this method is improved by 13.8% in the experiment of Korean ancient books. In the printed text experiment,the accuracy of character recognition improved by 5.38%. The established method is better than other methods in solving the problem of Korean ancient text recognition,and can provide reference for solving the problem of Korean ancient text recognition.
参考文献/References:
[1] HUANG G,LUO X,WANG S,et al. Hippocampus-heuristic character recognition network for zero-shot learning in Chinese character recognition[J]. Pattern Recognition,2022,130:108818.
[2] CHEN Z,YANG W,LI X. Stroke-Based Autoencoders:Self-Supervised Learners for Efficient Zero-Shot Chinese Character Recognition[J]. arXiv preprint arXiv2207.08191 2022. https://arxiv.org/abs/2207.08191.
[3] GAN J,CHEN Y,HU B,et al. Characters as graphs:Interpretable handwritten Chinese character recognition via Pyramid Graph Transformer[J]. Pattern Recognition,2023,137:109317.
[4] YU H,WANG X,LI B,et al. Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. Paris:IEEE/CVF 2023 11943-11952.
[5] LUO G F,WANG D H,DU X,et al. Self-information of radicals:A new clue for zero-shot Chinese character recognition[J]. Pattern Recognition,2023,140:109598.
[6] CHEN J,LI B,XUE X. Zero-shot Chinese character recognition with stroke-level decomposition[J]. arXiv preprint arXiv:2106.11613 2021. https://arxiv.org/abs/2106.11613.
[7] ZU X,YU H,LI B,et al. Chinese Character Recognition with Augmented Character Profile Matching[C]//Proceedings of the 30th ACM International Conference on Multimedia. Lisbon ACM 2022 6094-6102.
[8] LI M,YU Y,YANG Y,et al. Stroke extraction of chinese character based on deep structure deformable image registration[C]//Proceedings of the AAAI Conference on Artificial Intelligence. 2023,37(1):1360-1367.
[9] KIM G,SON J,LEE K,et al. Character decomposition to resolve class imbalance problem in Hangul OCR[J]. arXiv preprint arXiv:2208.06079 2022. https://arxiv.org/abs/2208.06079.
[10] ???,???,???,等. ?? ???? ??? ?? ? ?????? ?? ?? ?? ??? ??[J]. Journal of Korea Multimedia Society,2023:26(6) 795-803.
[11] 刘晓童. 基于笔画分解的朝鲜语古籍字符识别方法研究与应用[D]. 延吉:延边大学,2023.
备注/Memo
收稿日期:2023-12-19
基金项目:吉林省教育厅人文社科基础研究项目(JJKH20230608SK)
第一作者:赵梦玲(1998—)女,硕士研究生,研究方向为文字识别.
通信作者:金小峰(1970—)男,教授,研究方向为语音信息处理、计算机视觉.