WO2016197265A1

WO2016197265A1 - Method for inputting rarely-used characters

Info

Publication number: WO2016197265A1
Application number: PCT/CN2015/000407
Authority: WO
Inventors: 周连惠
Original assignee: 周连惠
Priority date: 2015-06-11
Filing date: 2015-06-15
Publication date: 2016-12-15
Also published as: CN105425976A

Abstract

The present invention relates to the technical field of computer information processing. Disclosed is a method for inputting rarely-used characters. The present invention is to resolve the problems in the prior art of difficulty in mastering the input of rarely-used characters, generation of a huge number of rarely-used character options when an input rule is simplified, and failure to find a rarely-used character to be inputted without flicking through multiple pages. The method comprises the following steps: step 1: a user performs input; step 2: a rarely-used character set is invoked and displayed; and step 3: the user selects a rarely-used character and allows the rarely-used character to be displayed on a screen. The method is suitable for inputting rarely-used characters on a computer terminal or a smart phone terminal.

Description

A method for inputting uncommon characters

Technical field

The invention relates to a method for inputting uncommon characters, and belongs to the technical field of computer information processing.

Background art

The first count of Chinese characters was made in the Han Dynasty by Xu Shen in the "Shuowen Jiezi", which included 9,353 characters. Later, the "Yupian" compiled by Gu Yewang in the Southern Dynasties recorded 16,917 characters, and the "Daguang Yihui Yupian", a revision based on it, is said to contain 22,726 characters. After that, the Song Dynasty "Leipian" collected more, with 31,319 characters; another officially compiled Song Dynasty work, the "Jiyun", contains 53,525 characters and was once the book with the most characters. Some other dictionaries also contain large numbers of characters: the Kangxi Dictionary of the Qing Dynasty has 47,035; Japan's "Dai Kan-Wa Jiten" has 48,902, plus 1,062 in an appendix; Taiwan's "Zhongwen Da Cidian" has 49,905; and the "Hanyu Da Zidian" has 54,678. The largest collection published in the 20th century is the "Zhonghua Zihai", with 85,000 characters. Among computer character encoding standards, the basic character set of the Unicode CJK Unified Ideographs contains 20,902 simplified Chinese, traditional Chinese, Japanese, and Korean characters, and with its two extension areas the total is nearly 70,000. In fact, the number of Chinese characters is far greater than 70,000: the Chinese character database of Beijing Guoan Information Equipment Co., Ltd. contains 91,251 characters, while the Japanese "Konjaku Mojikyo" contains nearly 150,000.

An input method refers to the input encoding scheme rather than the software used to enter text. For example, the Hanyu Pinyin scheme widely used in Chinese input methods and the zhuyin phonetic symbols used in Taiwan can both serve as encoding schemes for Chinese character input, giving the pinyin input method and the zhuyin input method. Pinyin input has a natural advantage over other input methods: every Chinese person educated in modern times spends considerable time learning pinyin or zhuyin before learning characters, so the pinyin originally used to annotate characters can easily be reused as their input code. Another advantage is that pinyin is close to the spoken language, so users can adapt to it in a short time. However, pinyin input has a fatal weakness: the duplicate-code rate for single characters is very high, and even for phrases it is high. When inputting Chinese characters, the user often has to turn through many pages to find the required character, so input efficiency is low and most users are dissatisfied.

The Input Method Editor (IME) is a program that lets users enter the thousands of characters used in Asian languages with a standard "104-key" keyboard. The system input method files of an IME include the input method program, a dictionary/lexicon (for composing ideographs), and a coding scheme. When the user presses keys, the IME engine attempts to determine which character(s) the keystrokes should be converted into.

Most Chinese people know about 4,000 Chinese characters, comprising the 3,776 level-1 national standard characters and some level-2 national standard characters; therefore, almost all characters outside these four thousand are uncommon characters. So-called uncommon characters are those whose correct pronunciation and meaning most people do not know. It is therefore difficult to input them on a computer or smartphone with the pinyin input method, which is a major bottleneck for the collation of ancient books and the informatization of Chinese characters.

According to GB18030-2000, composite characters have the following structures:

1. Left-right structure, left-middle-right structure;

2. Upper-lower structure, upper-middle-lower structure;

3. Fully enclosed structure;

4. Upward-enclosing structure and downward-enclosing structure;

5. Right-enclosing structure, upper-right-enclosing structure, and lower-right-enclosing structure;

6. Left-enclosing structure;

7. Nested structure.

With a pinyin input method it is very difficult to input an uncommon character. Looking the character up in a dictionary interrupts the input and reduces efficiency, and guessing can give the wrong pronunciation. For example, most people may misread "埭" as "隶 (li)", when its correct pronunciation is "dài"; such cases are not rare. The Chinese patent "Chinese Pinyin Input Method", application number 200710065842.5, provides the following input rules. For uncommon characters with a left-right, upper-lower, left-middle-right, or upper-middle-lower structure whose parts are themselves characters, such as "玺 (尔+玉)", the user inputs "eryu?". For uncommon characters with a "Chinese character + radical/component" structure, the user inputs the full pinyin of the character-forming part plus the first letter of the pinyin of the non-character-forming part; if there is more than one non-character-forming part, the first letter of any one of them is used. The pinyin readings of non-character-forming parts follow the national standard. For example, "菝 (bá)" consists of "拔" and "艹" (cao), so entering "bac?" finds "菝".
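The prior-art coding rule described above can be sketched roughly as follows. This is only an illustration inferred from the two examples in the text ("eryu?" and "bac?"); the function name `prior_art_code` and the `(pinyin, is_character)` representation of a part are assumptions, not part of the cited patent.

```python
from typing import List, Tuple

def prior_art_code(parts: List[Tuple[str, bool]]) -> str:
    """Build a query code from (toneless_pinyin, is_character) pairs.

    Character-forming parts contribute their full pinyin. Of the
    non-character-forming parts, only one (here: the first listed)
    contributes the first letter of its pinyin. A trailing '?' ends the code.
    """
    full = "".join(pinyin for pinyin, is_char in parts if is_char)
    initial = next((pinyin[0] for pinyin, is_char in parts if not is_char), "")
    return f"{full}{initial}?"

print(prior_art_code([("er", True), ("yu", True)]))    # 玺 = 尔 + 玉 -> eryu?
print(prior_art_code([("ba", True), ("cao", False)]))  # 菝 = 拔 + 艹 -> bac?
```

Note that the rule only works if the user already knows the standard reading of every part involved.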

In actual use this causes problems. Some people do not know the correct national standard reading of a non-character-forming part and therefore cannot work out its pinyin code; for example, for a character pronounced "dī" that contains the part "氐", most people are probably not sure how to read "氐". If the pronunciation code of the non-character-forming part is not entered, a large number of characters with duplicate codes is produced. For readability, the candidate window of an input method currently displays 5 to 7 candidates on average, so the user usually has to keep paging through candidates to find the target uncommon character. With a character set of 100,000 characters, the number of duplicate-code characters can be imagined.

Since people may not know the pinyin of every character-forming or non-character-forming part of an uncommon character, inputting uncommon characters remains difficult. If the input rules are simplified so that the user enters the full pinyin of only one character-forming or non-character-forming part of the uncommon character, a large number of candidate uncommon characters is produced, and multiple page flips are needed to find the one to be input. That approach is suitable only when few parts share the same pinyin and the number of uncommon characters is small.
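To make the paging cost concrete, the number of pages for a flat candidate list is a ceiling division of the candidate count by the window size. The sketch below uses the 5 and 7 candidates per page mentioned above; the figure of 200 candidates for one pinyin code is a made-up number for illustration only.

```python
def pages_needed(candidates: int, per_page: int) -> int:
    """Pages required to show `candidates` entries, `per_page` at a time."""
    return -(-candidates // per_page)  # ceiling division on integers

# 200 is a hypothetical candidate count for a single pinyin code.
for per_page in (5, 7):
    print(per_page, pages_needed(200, per_page))  # prints "5 40", then "7 29"
```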

Summary of the invention

To address the problems in the prior art that inputting uncommon characters is difficult to master, and that simplifying the input rules produces so many candidate uncommon characters that multiple page flips are needed to find the one to be input, the present invention provides a method for inputting uncommon characters. The method includes the following steps:

Step 1: User input

The user activates the uncommon-character input method system and inputs the full pinyin of one character-forming part or non-character-forming part of the uncommon character to be input. In the system, all uncommon characters that contain a character-forming or non-character-forming part with the same pinyin form an uncommon-character set. Within a set, the uncommon characters that share the same character-forming or non-character-forming part form an uncommon-character subset;

Step 2: Retrieve and display the uncommon-character set

The uncommon-character input method system retrieves the uncommon-character set corresponding to the pinyin input in step 1, then numbers all the uncommon-character subsets in that set and displays them in the input interface;

Step 3: The user selects an uncommon character and commits it to the screen

The user identifies the desired uncommon-character subset by its base character and presses the number key matching its number. The input interface then displays all uncommon characters in the selected subset, each with a number. The user presses the number key matching the desired uncommon character, and that character is committed to the screen.

Further, in step 2, the uncommon-character subsets are arranged vertically and numbered sequentially.

Further, in step 3, the uncommon characters are arranged vertically and numbered sequentially, and each uncommon character is followed by annotation information such as its pronunciation and a corresponding substitute (loan) character. The annotation information is for labeling only and is not committed to the screen with the uncommon character.
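The grouping described in the steps above (a set per pinyin, a subset per shared part) can be pictured as a two-level index. The following is a minimal sketch under stated assumptions: the record layout, the names `build_index` and `numbered`, and the sample characters are all illustrative and are not taken from the patent's figures or data.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# Hypothetical record: (character, part it contains, toneless pinyin of that part)
Record = Tuple[str, str, str]

SAMPLE_RECORDS: List[Record] = [
    ("叭", "八", "ba"),
    ("扒", "八", "ba"),
    ("岜", "巴", "ba"),
    ("笆", "巴", "ba"),
    ("跋", "犮", "ba"),
    ("祓", "犮", "ba"),
    ("玺", "尔", "er"),
]

def build_index(records: Iterable[Record]) -> Dict[str, Dict[str, List[str]]]:
    """Group characters into sets keyed by pinyin, then subsets keyed by part."""
    index: Dict[str, Dict[str, List[str]]] = defaultdict(dict)
    for char, part, pinyin in records:
        index[pinyin].setdefault(part, []).append(char)
    return dict(index)

def numbered(items: Iterable[str]) -> List[Tuple[int, str]]:
    """Number items sequentially from 1, as shown to the user."""
    return list(enumerate(items, start=1))

index = build_index(SAMPLE_RECORDS)
print(numbered(index["ba"]))        # numbered subsets for pinyin "ba"
print(numbered(index["ba"]["犮"]))  # numbered characters in one subset
```

With this layout the user only ever scans two short numbered lists instead of one long flat list.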

The beneficial effects of the invention are as follows. The vast majority of uncommon characters are composed of two or more character-forming or non-character-forming parts, and the user only needs to know the pinyin of one of those parts. After inputting it, the user first selects an uncommon-character subset and then selects the uncommon character, which avoids a large amount of page-turning and makes the input of uncommon characters accurate, convenient, and fast. The method can also be used to input common Chinese characters. However, ordinary input methods generally include only a common character set, in which each pinyin has few candidates, so the present invention is better suited to inputting uncommon characters. It is particularly suitable for computer terminals used to collate ancient documents, and can also be used on smart mobile terminals to meet the needs of users who like to use uncommon characters.

Description of the drawings

Figure 1 is a schematic diagram of the display after "vba" is input;

Figure 2 is a schematic diagram of the display after the key "3" is pressed;

Figure 3 is a program flow chart.

Detailed description

The specific embodiments of the present invention are described below with reference to the accompanying drawings:

As shown in FIG. 1, a method for inputting uncommon characters includes the following steps:

Step 1: User input

On a computer terminal or smartphone terminal, the user activates the uncommon-character input method system and types "vba" on the keyboard. In the system, all uncommon characters that contain a character-forming or non-character-forming part whose pinyin is "ba" form one uncommon-character set. Within the set, the uncommon characters that share the same character-forming or non-character-forming part form an uncommon-character subset; the subsets include those for parts such as "stop, eight, bar, tyrant, stop";

Step 2: Retrieve and display the uncommon-character set

The uncommon-character input method system retrieves the uncommon-character set corresponding to the pinyin "ba" input in step 1, numbers all the uncommon-character subsets in that set, and displays them in the input interface. The result is shown in Figure 2.

The uncommon-character subsets are arranged vertically and numbered sequentially, which makes them easier for the user to scan.

Step 3: The user selects an uncommon character and commits it to the screen

The user identifies the desired uncommon-character subset by its base character and presses the matching number key, "3". The input interface then displays all uncommon characters in the selected subset, each with a number. The result is shown in Figure 3.

The uncommon characters are arranged vertically and numbered sequentially, and each is followed by annotation information such as its pronunciation and a corresponding substitute (loan) character. The annotation information is for labeling only and is not committed to the screen with the uncommon character. The user thus learns the pronunciation of the uncommon character and its corresponding substitute character while inputting it.

The user presses the number key "4" matching the desired uncommon character, and the selected uncommon character "祓" is committed to the screen without its annotation information.
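The interaction in this embodiment (type "vba", press "3", press "4") can be sketched as three lookups over a two-level index. Everything below is a hedged illustration: the sample index is stand-in data rather than the content of the patent's figures, only the pronunciation part of the annotation is modelled, and the names `parse_trigger`, `list_subsets`, `list_characters`, and `commit` are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

TRIGGER = "v"  # prefix that switches to uncommon-character input

# Stand-in data: pinyin -> part -> [(character, pronunciation annotation)]
Index = Dict[str, Dict[str, List[Tuple[str, str]]]]
INDEX: Index = {
    "ba": {
        "八": [("叭", "bā"), ("扒", "bā"), ("趴", "pā")],
        "巴": [("岜", "bā"), ("笆", "bā"), ("粑", "bā")],
        "犮": [("跋", "bá"), ("魃", "bá"), ("茇", "bá"), ("祓", "fú")],
    }
}

def parse_trigger(keys: str) -> Optional[str]:
    """Return the pinyin typed after the trigger, or None if it is absent."""
    if keys.startswith(TRIGGER) and len(keys) > len(TRIGGER):
        return keys[len(TRIGGER):]
    return None

def list_subsets(index: Index, pinyin: str) -> List[Tuple[int, str]]:
    """Step 2: numbered subsets (one per shared part) for the pinyin."""
    return list(enumerate(index.get(pinyin, {}), start=1))

def list_characters(index: Index, pinyin: str, subset_no: int) -> List[Tuple[int, str, str]]:
    """Step 3a: numbered characters of the chosen subset, with annotations."""
    part = list(index[pinyin])[subset_no - 1]
    return [(n, ch, note) for n, (ch, note) in enumerate(index[pinyin][part], start=1)]

def commit(index: Index, pinyin: str, subset_no: int, char_no: int) -> str:
    """Step 3b: return only the character; the annotation is not committed."""
    _, char, _annotation = list_characters(index, pinyin, subset_no)[char_no - 1]
    return char

pinyin = parse_trigger("vba")
print(list_subsets(INDEX, pinyin))
print(list_characters(INDEX, pinyin, 3))
print(commit(INDEX, pinyin, 3, 4))
```

In this stand-in data, pressing "3" and then "4" yields "祓", and the pronunciation shown beside it in the candidate list is dropped on commit.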

The above embodiment lists only some of the uncommon characters corresponding to the pinyin "ba". In fact, many more uncommon characters qualify for inclusion, and if all of them were included, the advantages of the present invention would be even more apparent.

The above is a preferred embodiment of the present invention. It should be noted that those skilled in the art can make several improvements and refinements without departing from the principles of the present invention, and these should also be regarded as falling within the scope of protection of the present invention.

Claims

1. A method for inputting uncommon characters, wherein the uncommon characters are all Chinese characters other than the 3775 national standard level-1 characters, including Japanese and Korean Chinese characters, and the structure of an uncommon character is "common character + non-character-forming part", wherein the common character is a simplified, traditional, or variant form of one of the 6780 Chinese characters of national standard levels 1 and 2; characterized in that, among the uncommon characters, those whose "common character" has the same pinyin form an uncommon-character set, and within the set, the uncommon characters composed of the same common character form an uncommon-character subset; the method includes the following steps:

Step 1: User input

The user activates the uncommon-character input method system and, after inputting "v", goes on to input the full pinyin code of the "common character" contained in the uncommon character;

Step 2: Retrieve and display the uncommon-character set

The input method reads the uncommon-character set, and numbers, sorts, and displays in the input interface the uncommon-character subsets, each consisting of uncommon characters that share the same common character;

Step 3: Retrieve and display the uncommon-character subset

The user presses a candidate number key, and all the uncommon characters in the corresponding uncommon-character subset shown in the input interface of step 2 are then displayed in the input interface, each numbered and sorted;

Step 4: The user selects an uncommon character and commits it to the screen

The user presses a candidate number key, and the corresponding uncommon character shown in the input interface of step 3 is committed to the screen.
2. The method for inputting uncommon characters according to claim 1, wherein in step 2 the uncommon-character subsets are sorted vertically and numbered sequentially.
3. The method for inputting uncommon characters according to claim 1, wherein in step 3 the uncommon characters are sorted vertically and numbered sequentially.
4. The method for inputting uncommon characters according to claim 3, wherein in step 3 each uncommon character is followed by annotation information such as its pronunciation and a corresponding substitute (loan) character, and the annotation information is for labeling only and is not committed to the screen with the uncommon character.