WO2016197265A1 - Method for inputting rarely-used characters - Google Patents
Method for inputting rarely-used characters Download PDFInfo
- Publication number
- WO2016197265A1 WO2016197265A1 PCT/CN2015/000407 CN2015000407W WO2016197265A1 WO 2016197265 A1 WO2016197265 A1 WO 2016197265A1 CN 2015000407 W CN2015000407 W CN 2015000407W WO 2016197265 A1 WO2016197265 A1 WO 2016197265A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- uncommon
- word
- words
- characters
- input
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/02—Input arrangements using manually operated switches, e.g. using keyboards or dials
- G06F3/023—Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
Definitions
- the invention particularly relates to a method for inputting uncommon words, and belongs to the technical field of computer information processing.
- some dictionaries also received more words, such as the Kangxi Dictionary of the Qing Dynasty, which received 47,035 words; Japan's “Dahan and Dictionary” received 48,902 words, and another appendix of 1062; Taiwan's “Chinese Dictionary” received 49,905 words.
- the "Chinese Dictionary” received 54,678 words.
- the most frequently published words in the 20th century were "Zhonghua Zihai", which received 85,000 words.
- the Unicode Chinese, Japanese and Korean unified ideogram basic character set contains 20,902 simplified Chinese characters, traditional Chinese characters, and Korean and Korean characters. There are two extension areas, nearly 70,000 words. In fact, the number of Chinese characters is far more than 70,000 words.
- the Chinese character database of Beijing Guoan Information Equipment Co., Ltd. contains 91,251 Chinese characters; while the Japanese and Japanese text mirrors contain nearly 15 Chinese characters.
- Input Method refers to the input coding method rather than the software for inputting text; for example, the Chinese Pinyin scheme widely used in Chinese input method and the use of phonetic symbols in Taiwan can be used as the encoding method of Chinese character input method, thus forming Enter the pinyin input method or phonetic input of Chinese characters.
- Pinyin input method has a natural advantage over other input methods, because every Chinese who is educated in modern times uses a lot of time to learn Chinese pinyin or phonetic symbols before they can learn Chinese characters.
- the original pinyin for Chinese characters can be easily
- the ground is used as the input code for Chinese characters; another advantage of the Pinyin input method is that it is close to spoken language, so the Pinyin input method can be adapted in a short time.
- the Pinyin input method has a fatal weakness.
- the single-word repetition rate is quite high, even if the phrase repetition rate is very high.
- the input efficiency is very low, but most users are dissatisfied.
- the Input Method Editor is a program that allows users to enter thousands of characters in Asian languages using the "104-key" standard keyboard; IMEs are used to store system input method files including input method programs, Dictionary/thesaurus (for the composition of ideograms), coding scheme. When the user enters a keystroke, the IME engine attempts to determine which character(s) should be converted into a keystroke.
- the present invention is difficult to grasp for inputting uncommon words in the prior art, simplifying input rules and generating a large number of uncommon words, requiring multiple page flipping to find the problem of uncommon words to be input, and providing a method for inputting uncommon words.
- the method includes the following steps:
- Step 1 User input
- the user activates the uncommon word input method system, and inputs the spelling of the full spelling or non-wording part of a character part to be input into the uncommon word.
- the system of the uncommon word input method all the Chinese characters of the same Chinese pinyin are all spelled or not.
- the composition of the characters is composed of uncommon words composed of pinyin A collection of uncommon words.
- the uncommon words of the full-spelling or non-word-forming parts of the word-making parts form a subset of the uncommon words;
- Step 2 Retrieve the collection of uncommon words and display
- the input method of the uncommon word input method system retrieves the corresponding uncommon word set according to the Chinese pinyin input in step 1, and numbers and displays all the uncommon word subsets in the uncommon word set in the input interface;
- Step 3 The user selects the uncommon word on the screen
- the user selects the corresponding subset of the rare words according to the basic words and presses the number key corresponding to the number, inputs all the uncommon words of the selected subset of the rare words in the input interface, and numbers each of the uncommon words, and the user selects the uncommonly selected ones.
- the number of the word corresponds to the number key, and the uncommon word to be selected is screened.
- the subset of the characters are vertically arranged and numbered sequentially.
- the uncommon characters are vertically arranged and numbered sequentially, and each of the uncommon words is marked with the pronunciation information such as the pronunciation and the corresponding fake characters, and the annotation information is only used for labeling, and does not follow the screen of the uncommon words.
- the invention has the beneficial effects that the vast majority of the uncommon words are composed of two or more word-making parts or non-word-making parts, and the user only needs to know the Chinese pinyin of one of the word-making parts or the non-word-making parts.
- the method of the invention can be used to select a subset of uncommon words and then select uncommon words, thereby avoiding a large number of page turning search work, and the input of the uncommon words is accurate and convenient, and the speed is fast, and the method can also be used for common Chinese character input.
- the common input method generally includes a common Chinese character font, and there are not many options corresponding to each Chinese pinyin. Therefore, the present invention is more suitable for the input of uncommon words, and is particularly suitable for use in computer terminals for organizing ancient documents, and can also be used for intelligence.
- the mobile terminal meets the needs of some users who like to use uncommon words.
- Figure 1 is a schematic diagram of the input "vba" display
- FIG. 2 is a schematic view of the button "3"
- Figure 3 is a program flow chart.
- a method for inputting a rare word includes the following steps:
- Step 1 User input
- the user activates the uncommon word input method system, and inputs the pinyin "vba" through the keyboard, and the all-word or non-word of all the Chinese characters with the pinyin of "ba" in the uncommon word input method system.
- the uncommon characters composed of the parts of the phonetic alphabet form a collection of uncommon words.
- the uncommon words of the same parts of the word parts or the non-word parts are composed of uncommon characters. a subset of words, including "stop, eight, bar, tyrant, stop” and other word-making components or non-word-making components;
- Step 2 Retrieve the collection of uncommon words and display
- the input method of the uncommon word input method system retrieves the corresponding uncommon word set according to the Chinese pinyin "ba" input in step 1, and numbers all the uncommon word subsets in the uncommon word set, and displays them in the input interface, and displays The result is shown in Figure 2.
- the subset of uncommon words is arranged vertically and numbered sequentially, and this sorting method makes the user more convenient to find.
- Step 3 The user selects the uncommon word on the screen
- the user selects the corresponding subset of the rare words according to the basic words and presses the number key "3" corresponding to the number, inputs all the uncommon words of the selected subset of the rare words in the input interface, and numbers each of the uncommon words, and displays the result map. 3 is shown.
- the uncommon words are arranged vertically and numbered sequentially, and each uncommon word is marked with the pronunciation information such as the pronunciation and the corresponding fake characters.
- the annotation information is only used for labeling, and does not follow the screen of the uncommon word. The user can obtain the pronunciation information of the uncommon word and the corresponding fake word while inputting the uncommon word.
Landscapes
- Engineering & Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Document Processing Apparatus (AREA)
Abstract
The present invention relates to the technical field of computer information processing. Disclosed is a method for inputting rarely-used characters. The present invention is to resolve the problems in the prior art of difficulty in mastering the input of rarely-used characters, generation of a huge number of rarely-used character options when an input rule is simplified, and failure to find a rarely-used character to be inputted without flicking through multiple pages. The method comprises the following steps: step 1: a user performs input; step 2: a rarely-used character set is invoked and displayed; and step 3: the user selects a rarely-used character and allows the rarely-used character to be displayed on a screen. The method is suitable for inputting rarely-used characters on a computer terminal or a smart phone terminal.
Description
本发明具体涉及一种输入生僻字的方法,属于计算机信息处理技术领域。The invention particularly relates to a method for inputting uncommon words, and belongs to the technical field of computer information processing.
汉字数量的首次统计是汉朝许慎在《说文解字》中进行的,共收录9353字。其后,南朝时顾野王所撰的《玉篇》据记载共收16917字,在此基础上修订的《大广益会玉篇》则据说有22726字。此后收字较多的是宋朝官修的《类篇》,收字31319个;另一部宋朝官修的《集韵》中收字53525个,曾经是收字最多的一部书。另外有些字典收字也较多,如清朝的《康熙字典》收字47035个;日本的《大汉和字典》收字48902个,另有附录1062个;台湾的《中文大字典》收字49905个;《汉语大字典》收字54678个。20世纪已出版的字数最多的是《中华字海》,收字85000个。在汉字计算机编码标准中,Unicode的中日韩统一表意文字基本字集则收录汉字简体、繁体及日语、韩语汉字共20902个,另有两个扩展区,近七万字。事实上,汉字数量要远远超过7万字,北京国安咨讯设备公司汉字字库收录了91251个汉字;而日本的《今昔文字镜》收录了近15个汉字。The first statistics of the number of Chinese characters was carried out by Xu Shen in the "Speaking of the Words" in the Han Dynasty. A total of 9353 words were included. Later, the "Jade Article" written by Gu Yewang in the Southern Dynasties was recorded as a total of 16917 words. On the basis of this revision, the "Da Guangyi Hui Jade Article" is said to have 22,726 words. After that, more songs were collected in the "Classes" of the Song Dynasty, which received 31,319 characters; another Song Dynasty officially edited "Ji Yun", which received 53525 words, and was once the most accepted book. In addition, some dictionaries also received more words, such as the Kangxi Dictionary of the Qing Dynasty, which received 47,035 words; Japan's "Dahan and Dictionary" received 48,902 words, and another appendix of 1062; Taiwan's "Chinese Dictionary" received 49,905 words. The "Chinese Dictionary" received 54,678 words. The most frequently published words in the 20th century were "Zhonghua Zihai", which received 85,000 words. In the Chinese character computer coding standard, the Unicode Chinese, Japanese and Korean unified ideogram basic character set contains 20,902 simplified Chinese characters, traditional Chinese characters, and Korean and Korean characters. There are two extension areas, nearly 70,000 words. In fact, the number of Chinese characters is far more than 70,000 words. The Chinese character database of Beijing Guoan Information Equipment Co., Ltd. contains 91,251 Chinese characters; while the Japanese and Japanese text mirrors contain nearly 15 Chinese characters.
输入法(Input Method)指的是输入编码方式而不是实现文字输入的软件;如中文输入法中广泛使用的汉语拼音方案以及台湾使用注音符号,都能够作为汉字输入法的编码方式,从而形成能够录入汉字的拼音输入法或注音输入。拼音输入法相比其他输入法有着天然的优势,因为,现代每个接受教育的中国人在还没学习汉字前,就先用大量时间学习汉语拼音或注音符号,原本标记汉字的拼音也就可以轻松地作为汉字的输入编码;拼音输入法另一个优势就是接近口语,所以拼音输入法可以在短时间内适应。但是拼音输入法有着致命的弱点,就是汉字输入法编码时,单字重码率相当高,即使词组重码率也是非常高。在输入汉字过程中,常常要翻很多页,才能找到需要的汉字,输入效率非常低下导致但大部分用户的不满。Input Method refers to the input coding method rather than the software for inputting text; for example, the Chinese Pinyin scheme widely used in Chinese input method and the use of phonetic symbols in Taiwan can be used as the encoding method of Chinese character input method, thus forming Enter the pinyin input method or phonetic input of Chinese characters. Pinyin input method has a natural advantage over other input methods, because every Chinese who is educated in modern times uses a lot of time to learn Chinese pinyin or phonetic symbols before they can learn Chinese characters. The original pinyin for Chinese characters can be easily The ground is used as the input code for Chinese characters; another advantage of the Pinyin input method is that it is close to spoken language, so the Pinyin input method can be adapted in a short time. However, the Pinyin input method has a fatal weakness. When encoding Chinese characters, the single-word repetition rate is quite high, even if the phrase repetition rate is very high. In the process of inputting Chinese characters, it is often necessary to turn over a lot of pages in order to find the required Chinese characters. The input efficiency is very low, but most users are dissatisfied.
输入法编辑器(Input Method Editor)是一种程序,能使用户用“104键”的标准键盘,输入亚洲语言中数以千计的字符;IME用来存放系统输入法文件包括输入法程序、字典/词库(用于表意字的组成)、编码方案。当用户输入键击时,IME引擎会尝试确定应将键击转换成哪个(哪些)字符。The Input Method Editor is a program that allows users to enter thousands of characters in Asian languages using the "104-key" standard keyboard; IMEs are used to store system input method files including input method programs, Dictionary/thesaurus (for the composition of ideograms), coding scheme. When the user enters a keystroke, the IME engine attempts to determine which character(s) should be converted into a keystroke.
多数中国人认识的汉字数量在4000个左右,含国标一级3776个和部分国标二级汉字;因此4千字之外的几乎都是生僻字,所谓生僻字,就是多数人不知道这些字正确的读音及释义,因此很难用拼音输入法在计算机/智能手机上输入它们,这对于古籍文献的整理,以及汉字的信息化是一个巨大瓶颈。Most Chinese people know about 4,000 Chinese characters, including 3,776 national standard and some national standard Chinese characters; therefore, almost all of the four thousand words are uncommon words. The so-called uncommon words are that most people don’t know that these words are correct. The pronunciation and interpretation, it is difficult to input them on the computer / smart phone using the Pinyin input method, which is a huge bottleneck for the collation of ancient books and the informationization of Chinese characters.
根据GB18030-2000,合体字有以下结构:
According to GB18030-2000, the combined characters have the following structure:
1.左右结构、左中右结构;1. Left and right structure, left middle right structure;
2.上下结构、上中下结构;2. Upper and lower structure, upper middle and lower structure;
3.全包围结构;3. Fully enclosed structure;
4.向上包围结构、向下包围结构;4. The upward surrounding structure and the downward surrounding structure;
5.向右包围结构、向右上包围结构、向右下包围结构;5. Enclose the structure to the right, surround the structure to the upper right, and surround the structure to the lower right;
6.向左下包围结构;6. Enclose the structure to the left;
7.嵌套结构;7. Nested structure;
在中文拼音输入法状态下,要输入生僻字是非常困难的,如果要查字典,这样会中断输入,降低效率;如果凭猜测,有可能读音错误,如“埭”字,多数人可能会误读为“隶(li)”,但事实上,该字的正确读音是“dài”,这样的现象屡见不鲜。申请号为200710065842.5的中国专利《一种汉语拼音输入法》,针对“左右结构、上下结构、左中右结构、上中下结构”等合体字,提供了输入方法比如“玺(尔+玉)”的,采用“eryu?”输入;对于“汉字+部首/部件”结构的生僻字,通过输入“成字部件的全拼加上非成字部件拼音的第一个音节”来实现,如果非成字部件拼音多于一个,则选取任一个简单的非成字部件拼音的第一个音节,非成字部件拼音的读音规则根据国家标准;例如“菝(bá)”字由“拔”字和“草字头(cao)”,输入“bac?”,找到“菝”字。In the Chinese Pinyin input method state, it is very difficult to input a rare word. If you want to look up the dictionary, this will interrupt the input and reduce the efficiency. If you guess by mistake, there may be a pronunciation error, such as "埭", most people may mistake Read as "Li (li)", but in fact, the correct pronunciation of the word is "dài", such a phenomenon is not uncommon. The Chinese patent "Chinese Pinyin Input Method" with the application number of 200710065842.5 provides input methods such as "玺(尔+玉) for the combination of "left and right structure, upper and lower structure, left middle right structure, upper middle and lower structure" ", use "eryu?" input; for the "Chinese characters + radicals / parts" structure of the uncommon words, by inputting "the full spelling of the word parts plus the first syllable of the non-word parts", if If there are more than one pinyin of non-word parts, then the first syllable of any simple non-word-making part is selected. The pronunciation rules of pinyin of non-word parts are according to national standards; for example, the word “菝 (bá)” is “pull” Word and "cao", enter "bac?" and find the word "菝".
在实际使用中,有些问题,因为有些人不知道国家标准的非成字部件的正确读法,也就无法知道其拼音编码,如“(dī)”,“氐”字上的部件“罒”,估计多数人不太清楚如何读;而如果不输入非成字部件的读音编码,就会产生大量的重码生字;为了视觉友好性,目前输入法的输入窗平均显示5-7个候选字,通常需要不断的翻屏才能找到目标生僻字,如果是10万字,其重码生僻字的数量可想而知。In actual use, there are some problems, because some people do not know the correct reading of the non-word-making parts of the national standard, and they cannot know the pinyin coding, such as " (dī)", the part "氐" on the word "氐", it is estimated that most people are not very clear how to read; and if you do not input the pronunciation code of non-word parts, a large number of re-coded words will be generated; for visual friendliness, At present, the input window of the input method displays 5-7 candidate words on average. Usually, it is necessary to constantly scroll through the screen to find the target uncommon word. If it is 100,000 words, the number of repetitive characters can be imagined.
由于人们可能不知道所有组成生僻字的成字部件的全拼或非成字部件的汉语拼音,仍然很难输入生僻字,如果在输入法系统中简化输入规则,直接按组成生僻字的成字部件的全拼或非成字部件拼音的一个部分选择生僻字,将会产生大量的生僻字选项,需要多次的翻页才能找到要输入的生僻字。这种方法,对于拼音组合比较少,且生僻字选项较少的情况比较适用Since people may not know the Chinese Pinyin of all the spelled or non-worded parts that make up the uncommon word, it is still difficult to input the uncommon words. If the input rules are simplified in the input method system, the words that make up the uncommon words are directly Selecting a part of the spelling of a full-spelling or non-word-making part of a part will result in a large number of uncommon words, requiring multiple page flips to find the uncommon word to be entered. This method is more suitable for the case where the combination of pinyin is less and the number of uncommon words is less.
发明内容Summary of the invention
因此,本发明针对现有技术输入生僻字难以掌握,简化输入规则又会产生大量的生僻字选项,需要多次的翻页才能找到要输入的生僻字的问题,提供一种输入生僻字的方法,所述方法包括以下步骤:Therefore, the present invention is difficult to grasp for inputting uncommon words in the prior art, simplifying input rules and generating a large number of uncommon words, requiring multiple page flipping to find the problem of uncommon words to be input, and providing a method for inputting uncommon words. The method includes the following steps:
步骤1:用户输入Step 1: User input
用户激活生僻字输入法系统,输入要输入生僻字的一个成字部件的全拼或非成字部件拼音,所述生僻字输入法系统内,所有汉语拼音相同的成字部件的全拼或非成字部件拼音组成的生僻字组成
一个生僻字集合,在一个生僻字集合内,成字部件的全拼或非成字部件拼音相同的生僻字组成生僻字子集;The user activates the uncommon word input method system, and inputs the spelling of the full spelling or non-wording part of a character part to be input into the uncommon word. In the system of the uncommon word input method, all the Chinese characters of the same Chinese pinyin are all spelled or not. The composition of the characters is composed of uncommon words composed of pinyin
A collection of uncommon words. In a collection of uncommon words, the uncommon words of the full-spelling or non-word-forming parts of the word-making parts form a subset of the uncommon words;
步骤2:调取生僻字集合并显示Step 2: Retrieve the collection of uncommon words and display
生僻字输入法系统的输入法程序根据步骤1输入的汉语拼音调取对应的生僻字集合,并将所述生僻字集合内的所有生僻字子集进行编号并显示在输入界面内;The input method of the uncommon word input method system retrieves the corresponding uncommon word set according to the Chinese pinyin input in step 1, and numbers and displays all the uncommon word subsets in the uncommon word set in the input interface;
步骤3:用户选择生僻字上屏Step 3: The user selects the uncommon word on the screen
用户根据基本字选择对应的生僻字子集并按其编号对应的数字键,输入界面内显示选择的生僻字子集的所有生僻字并对其中每个生僻字进行编号,用户按要选择的生僻字的编号对应的数字键,要选择的生僻字上屏。The user selects the corresponding subset of the rare words according to the basic words and presses the number key corresponding to the number, inputs all the uncommon words of the selected subset of the rare words in the input interface, and numbers each of the uncommon words, and the user selects the uncommonly selected ones. The number of the word corresponds to the number key, and the uncommon word to be selected is screened.
进一步的,所述步骤2中生僻字子集竖向排列并按顺序编号。Further, in the step 2, the subset of the characters are vertically arranged and numbered sequentially.
进一步的,所述步骤3中生僻字竖向排列并按顺序编号,每个生僻字后面标注其读音及对应的通假字等标注信息,所述标注信息只用于标注,不随生僻字上屏。Further, in the step 3, the uncommon characters are vertically arranged and numbered sequentially, and each of the uncommon words is marked with the pronunciation information such as the pronunciation and the corresponding fake characters, and the annotation information is only used for labeling, and does not follow the screen of the uncommon words.
本发明的有益效果在于:绝大多数生僻字是由两个或两个以上的成字部件或非成字部件组成的,用户只要知道其中的一个成字部件或非成字部件的汉语拼音,输入后就可以利用本发明的方法,先选择生僻字子集,再选择生僻字,避免了大量的翻页查找工作,生僻字输入准确方便且速度快,本方法也可以用于常用汉字输入,但是普通的输入法一般包括了常用汉字字库,每个汉语拼音对应的选项并不多,因此本发明更适合生僻字的输入,尤其适合于计算机终端进行古籍文献的整理时使用,也可用于智能手机终端,满足一些用户喜欢使用生僻字的需求。The invention has the beneficial effects that the vast majority of the uncommon words are composed of two or more word-making parts or non-word-making parts, and the user only needs to know the Chinese pinyin of one of the word-making parts or the non-word-making parts. After inputting, the method of the invention can be used to select a subset of uncommon words and then select uncommon words, thereby avoiding a large number of page turning search work, and the input of the uncommon words is accurate and convenient, and the speed is fast, and the method can also be used for common Chinese character input. However, the common input method generally includes a common Chinese character font, and there are not many options corresponding to each Chinese pinyin. Therefore, the present invention is more suitable for the input of uncommon words, and is particularly suitable for use in computer terminals for organizing ancient documents, and can also be used for intelligence. The mobile terminal meets the needs of some users who like to use uncommon words.
图1是输入“vba”显示的示意图Figure 1 is a schematic diagram of the input "vba" display
图2是按键“3”的示意图Figure 2 is a schematic view of the button "3"
图3是程序流程图。Figure 3 is a program flow chart.
下面结合附图对本发明的具体实施方式进行说明:The specific embodiments of the present invention are described below with reference to the accompanying drawings:
如图1所示,一种输入生僻字的方法,包括以下步骤:As shown in FIG. 1, a method for inputting a rare word includes the following steps:
步骤1:用户输入Step 1: User input
在计算机终端或智能手机终端,用户激活生僻字输入法系统,通过键盘,输入拼音“vba”,生僻字输入法系统内,所有汉语拼音为“ba”的成字部件的全拼或非成字部件拼音组成的生僻字组成一个生僻字集合,在一个生僻字集合内,成字部件的全拼或非成字部件拼音相同的生僻字组成生僻
字子集,包括“罷、八、巴、霸、罢”等成字部件或非成字部件;In the computer terminal or the smart phone terminal, the user activates the uncommon word input method system, and inputs the pinyin "vba" through the keyboard, and the all-word or non-word of all the Chinese characters with the pinyin of "ba" in the uncommon word input method system. The uncommon characters composed of the parts of the phonetic alphabet form a collection of uncommon words. In a collection of uncommon words, the uncommon words of the same parts of the word parts or the non-word parts are composed of uncommon characters.
a subset of words, including "stop, eight, bar, tyrant, stop" and other word-making components or non-word-making components;
步骤2:调取生僻字集合并显示Step 2: Retrieve the collection of uncommon words and display
生僻字输入法系统的输入法程序根据步骤1输入的汉语拼音“ba”调取对应的生僻字集合,并将生僻字集合内的所有生僻字子集进行编号,并显示在输入界面内,显示结果如图2所示。The input method of the uncommon word input method system retrieves the corresponding uncommon word set according to the Chinese pinyin "ba" input in step 1, and numbers all the uncommon word subsets in the uncommon word set, and displays them in the input interface, and displays The result is shown in Figure 2.
其中,生僻字子集竖向排列并按顺序编号,这样的排序方式使用户更加方便查找。Among them, the subset of uncommon words is arranged vertically and numbered sequentially, and this sorting method makes the user more convenient to find.
步骤3:用户选择生僻字上屏Step 3: The user selects the uncommon word on the screen
用户根据基本字选择对应的生僻字子集并按其编号对应的数字键“3”,输入界面内显示选择的生僻字子集的所有生僻字并对其中每个生僻字进行编号,显示结果图3所示。The user selects the corresponding subset of the rare words according to the basic words and presses the number key "3" corresponding to the number, inputs all the uncommon words of the selected subset of the rare words in the input interface, and numbers each of the uncommon words, and displays the result map. 3 is shown.
其中,生僻字竖向排列并按顺序编号,每个生僻字后面标注其读音及对应的通假字等标注信息,标注信息只用于标注,不随生僻字上屏。用户可以在输入生僻字的同时,获取该生僻字的读音及对应的通假字等标注信息。Among them, the uncommon words are arranged vertically and numbered sequentially, and each uncommon word is marked with the pronunciation information such as the pronunciation and the corresponding fake characters. The annotation information is only used for labeling, and does not follow the screen of the uncommon word. The user can obtain the pronunciation information of the uncommon word and the corresponding fake word while inputting the uncommon word.
用户按要选择的生僻字的编号对应的数字键“4”,要选择的生僻字“祓”上屏,标注信息不上屏。The user presses the number key "4" corresponding to the number of the uncommon word to be selected, and the uncommon word "祓" to be selected is displayed on the screen, and the label information is not on the screen.
上述具体实施方式只列举了对应汉语拼音“ba”的一部分生僻字,实际上还有许多符合条件的生僻字没有列入,如果将其全部列入,则本发明的优势将更加得到体现。The above specific implementation only enumerates some of the uncommon characters corresponding to the Chinese phonetic alphabet "ba". In fact, there are still many uncommon characters that are eligible to be included. If all of them are included, the advantages of the present invention will be more reflected.
以上所述是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明所述原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。
The above is a preferred embodiment of the present invention, and it should be noted that those skilled in the art can also make several improvements and retouchings without departing from the principles of the present invention. It should also be considered as the scope of protection of the present invention.
Claims (4)
- 一种输入生僻字的方法,所述的生僻字是国标一级3775之外的全部汉字包括日本汉字及韩国汉字,并且生僻字的结构是“通用字+非成字部件”,其中通用字是国标1级和2级的6780个汉字之中的简体字/繁体字或者异体字,其特征在于,所述生僻字中,具有相同汉语拼音的“通用字”的生僻字,组成一个生僻字集合,在该集合内,包括同一个通用字构成的生僻字,组成一个生僻字子集;所述的方法包括以下步骤:A method for inputting uncommon words, the uncommon characters are all Chinese characters except the national standard level 3775, including Japanese characters and Korean characters, and the structure of the uncommon words is "general words + non-word parts", wherein the common words are Simplified/traditional or variant of the 6780 Chinese characters of the national standard 1 and 2, characterized in that in the uncommon word, the uncommon words of the "universal word" with the same Chinese pinyin form a collection of uncommon words. Within the set, a rare word composed of the same universal word is formed to form a subset of the uncommon word; the method includes the following steps:步骤1:用户输入Step 1: User input用户激活生僻字输入法系统,输入“v”后,连续输入所述的生僻字包括的“通用字”的拼音的全拼编码;The user activates the uncommon word input method system, and after inputting "v", continuously inputs the full spelling code of the pinyin of the "common word" included in the uncommon word;步骤2:调取生僻字集合并显示Step 2: Retrieve the collection of uncommon words and display输入法读取生僻字集合,并将由同一个通用字构成的生僻字子集编号、排序并显示在输入界面内;The input method reads the set of uncommon words, and numbers, sorts and displays the subset of the uncommon words consisting of the same universal word in the input interface;步骤3:调取生僻字子集并显示Step 3: Retrieve the subset of uncommon words and display用户键入某候选数字键,该候选数字编号在步骤2的输入界面对应的同一类生僻字子集内的全部生僻字进一步显示在输入界面内,而且所述的每一个生僻字被全部编号、排序;The user inputs a candidate number key, and all the uncommon words in the same type of rare word subset corresponding to the input interface of step 2 are further displayed in the input interface, and each of the uncommon words is numbered and sorted. ;步骤4:用户选择生僻字上屏Step 4: The user selects the uncommon word on the screen用户键入某候选数字键,步骤3的输入界面内对应的生僻字上屏。The user types a candidate number key, and the corresponding uncommon word in the input interface of step 3 is on the screen.
- 如权利要求1所述的输入生僻字的方法,其特征在于,所述步骤2中,生僻字子集竖向排序并按顺序编号。The method of inputting a rare word according to claim 1, wherein in the step 2, the subset of the characters are vertically sorted and numbered sequentially.
- 如权利要求1所述的输入生僻字的方法,其特征在于,所述步骤3中,生僻字竖向排序并按顺序编号。The method of inputting a rare word according to claim 1, wherein in the step 3, the uncommon words are vertically sorted and numbered sequentially.
- 如权利要求3所述的输入生僻字的方法,其特征在于,所述步骤3中,每个生僻字后面标注其读音及对应的通假字等标注信息,所述标注信息只用于标注,不随生僻字上屏。 The method for inputting a rare word according to claim 3, wherein in the step 3, each of the uncommon words is followed by an annotation information such as a pronunciation and a corresponding false word, and the annotation information is only used for labeling, and does not follow Uncommon words on the screen.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510319307.2 | 2015-06-11 | ||
CN201510319307.2A CN105425976A (en) | 2015-06-11 | 2015-06-11 | Rarely-used Chinese character input method |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2016197265A1 true WO2016197265A1 (en) | 2016-12-15 |
Family
ID=55504232
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2015/000407 WO2016197265A1 (en) | 2015-06-11 | 2015-06-15 | Method for inputting rarely-used characters |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN105425976A (en) |
WO (1) | WO2016197265A1 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111444680A (en) * | 2020-04-25 | 2020-07-24 | 中信银行股份有限公司 | Rarely-used word encoding expansion method and device, storage medium and electronic equipment |
CN112698734A (en) * | 2020-12-24 | 2021-04-23 | 维沃移动通信有限公司 | Candidate word display method and device and electronic equipment |
CN117151041A (en) * | 2023-10-27 | 2023-12-01 | 成方金融科技有限公司 | PDF (Portable document Format) generation method, device, equipment and storage medium compatible with rarely used words |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107145244A (en) * | 2017-04-26 | 2017-09-08 | 北京汉王数字科技有限公司 | A kind of special-shaped characters input method, device and electronic equipment |
CN112199031B (en) * | 2020-10-15 | 2022-08-05 | 科大讯飞股份有限公司 | Input method, device, equipment and storage medium |
CN114637407A (en) * | 2020-12-15 | 2022-06-17 | 中国联合网络通信集团有限公司 | Chinese character input method, system, computer device and storage medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101923421A (en) * | 2009-06-09 | 2010-12-22 | 邓桂成 | Candidate Chinese character display method and system, as well as Chinese character input method and system |
CN102566769A (en) * | 2010-12-13 | 2012-07-11 | 腾讯科技(深圳)有限公司 | Chinese character input method and Chinese character input system |
CN102736741A (en) * | 2011-04-12 | 2012-10-17 | 腾讯科技(深圳)有限公司 | Pinyin input method and system of Chinese characters |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7382358B2 (en) * | 2003-01-16 | 2008-06-03 | Forword Input, Inc. | System and method for continuous stroke word-based text input |
CN1719390A (en) * | 2005-07-18 | 2006-01-11 | 王宏源 | Character searching and logging method for obtaining rarely used word using Chinese character full information and full text retrieval |
CN101236461A (en) * | 2008-02-25 | 2008-08-06 | 黄健风 | Chinese characters input method based on font style characteristic |
CN102541281A (en) * | 2010-12-22 | 2012-07-04 | 张家港市赫图阿拉信息技术有限公司 | Method for imputing knotty characters |
-
2015
- 2015-06-11 CN CN201510319307.2A patent/CN105425976A/en active Pending
- 2015-06-15 WO PCT/CN2015/000407 patent/WO2016197265A1/en active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101923421A (en) * | 2009-06-09 | 2010-12-22 | 邓桂成 | Candidate Chinese character display method and system, as well as Chinese character input method and system |
CN102566769A (en) * | 2010-12-13 | 2012-07-11 | 腾讯科技(深圳)有限公司 | Chinese character input method and Chinese character input system |
CN102736741A (en) * | 2011-04-12 | 2012-10-17 | 腾讯科技(深圳)有限公司 | Pinyin input method and system of Chinese characters |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111444680A (en) * | 2020-04-25 | 2020-07-24 | 中信银行股份有限公司 | Rarely-used word encoding expansion method and device, storage medium and electronic equipment |
CN111444680B (en) * | 2020-04-25 | 2023-05-16 | 中信银行股份有限公司 | Encoding expansion method and device for rarely used words, storage medium and electronic equipment |
CN112698734A (en) * | 2020-12-24 | 2021-04-23 | 维沃移动通信有限公司 | Candidate word display method and device and electronic equipment |
CN112698734B (en) * | 2020-12-24 | 2024-04-26 | 维沃移动通信有限公司 | Candidate word display method and device, and electronic device |
CN117151041A (en) * | 2023-10-27 | 2023-12-01 | 成方金融科技有限公司 | PDF (Portable document Format) generation method, device, equipment and storage medium compatible with rarely used words |
CN117151041B (en) * | 2023-10-27 | 2024-02-27 | 成方金融科技有限公司 | PDF (Portable document Format) generation method, device, equipment and storage medium compatible with rarely used words |
Also Published As
Publication number | Publication date |
---|---|
CN105425976A (en) | 2016-03-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
TWI468954B (en) | Method for classifying and retrieving recurring graphemic components found in chinese-type characters and method for classifying and retrieving chinese-type characters based on said recurring graphemic components in electronic and non-electronic contexts | |
WO2016197265A1 (en) | Method for inputting rarely-used characters | |
Hermjakob et al. | Out-of-the-box universal Romanization tool uroman | |
CN103109250B (en) | A kind of Chinese character simple stroke input method and Chinese character simple stroke descriptor index method | |
Sodhar et al. | Identification of issues and challenges in romanized Sindhi text | |
US20150088486A1 (en) | Written language learning using an enhanced input method editor (ime) | |
Sharma et al. | Word prediction system for text entry in Hindi | |
CN103376910A (en) | Coincident-code-free Shuangpin input method | |
Lu | Computers and Chinese writing systems | |
CN102750000A (en) | Binary syllabification input method | |
TW201512969A (en) | Method of increasing handwriting input efficiency | |
Greenwood | International cultural differences in software | |
Ghosh et al. | Labeling of query words using conditional random field | |
CN103984420B (en) | A kind of Tibetan language intelligent input method based on phonetic | |
JP2010073177A (en) | Globally compatible japanese input system | |
Iglesias-Franjo et al. | TIR over Egyptian hieroglyphs | |
US12307023B2 (en) | Chinese character input method and system and keyboard | |
US20240184381A1 (en) | Chinese character input method and system and keyboard | |
Joshi et al. | Input Scheme for Hindi Using Phonetic Mapping | |
CN103576891A (en) | One key fast typing method | |
JP7223450B2 (en) | Automatic translation device and automatic translation program | |
CN110502128B (en) | Chinese character multi-element input method and system | |
TWI230870B (en) | New Chinese input method | |
JP2008217770A (en) | Language data display system, language data display method, and language data display program | |
JP2008262248A (en) | Character search method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 15894536 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 15894536 Country of ref document: EP Kind code of ref document: A1 |