WO2016197265A1 - Method for inputting rarely used characters
- Publication number
- WO2016197265A1 (application PCT/CN2015/000407)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- uncommon
- word
- words
- characters
- input
- Prior art date
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/02—Input arrangements using manually operated switches, e.g. using keyboards or dials
- G06F3/023—Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
Definitions
- The invention relates to a method for inputting uncommon words (rarely used Chinese characters) and belongs to the technical field of computer information processing.
- Some dictionaries collect even more characters: the Kangxi Dictionary of the Qing Dynasty contains 47,035 characters; Japan's Dai Kan-Wa Jiten ("Dahan and Dictionary") contains 48,902 characters plus an appendix of 1,062; Taiwan's Zhongwen Da Cidian ("Chinese Dictionary") contains 49,905 characters.
- The Hanyu Da Zidian (also rendered "Chinese Dictionary") contains 54,678 characters.
- The dictionary with the most characters published in the 20th century is the Zhonghua Zihai, which contains 85,000 characters.
- The basic CJK (Chinese, Japanese and Korean) Unified Ideographs character set in Unicode contains 20,902 characters, covering simplified Chinese, traditional Chinese, Japanese, and Korean characters. Together with two extension areas, the total comes to nearly 70,000 characters. In fact, the number of Chinese characters is far greater than 70,000.
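The 20,902 figure can be checked with simple arithmetic. The sketch below assumes that the basic character set corresponds to the code point range U+4E00 through U+9FA5; these endpoints are not stated in this document, and the helper name `in_basic_cjk` is hypothetical.

```python
# Assumption: the "basic character set" is the code point range
# U+4E00..U+9FA5 (the endpoints are not given in the text above).
BASIC_START = 0x4E00
BASIC_END = 0x9FA5


def in_basic_cjk(ch: str) -> bool:
    """Return True if a single character lies in the assumed basic range."""
    return BASIC_START <= ord(ch) <= BASIC_END


# Number of code points in an inclusive range is end - start + 1.
basic_count = BASIC_END - BASIC_START + 1

print(basic_count)         # 20902
print(in_basic_cjk("八"))  # True
print(in_basic_cjk("A"))   # False
```

Characters outside this range (for example those in the extension areas) would need additional ranges, which are deliberately left out of this sketch.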
- The Chinese character database of Beijing Guoan Information Equipment Co., Ltd. contains 91,251 Chinese characters, while Japan's Konjaku Mojikyo ("text mirror") contains nearly 150,000 Chinese characters.
- "Input method" here refers to the input coding scheme rather than the software used to input text. For example, the Hanyu Pinyin scheme widely used in Chinese input methods, and the phonetic symbols used in Taiwan, can each serve as the coding scheme of a Chinese character input method, giving rise to the pinyin input method or the phonetic-symbol input method for Chinese characters.
- The Pinyin input method has a natural advantage over other input methods, because every Chinese person educated in modern times spends a great deal of time learning Hanyu Pinyin or phonetic symbols before learning Chinese characters.
- The pinyin already learned for Chinese characters can therefore easily be used as the input code for Chinese characters. Another advantage of the Pinyin input method is that it is close to spoken language, so users can adapt to it in a short time.
- The Pinyin input method has a fatal weakness: the duplicate-code (homophone) rate for single characters is quite high, and it remains high even for phrases. Input efficiency is consequently low, and most users are dissatisfied.
- An Input Method Editor (IME) is a program that lets users enter the thousands of characters of Asian languages using a standard "104-key" keyboard. An IME's system input method files include the input method program, a dictionary/thesaurus (for composing ideograms), and the coding scheme. When the user types keystrokes, the IME engine attempts to determine which character(s) the keystrokes should be converted into.
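To make the homophone problem concrete, the following minimal sketch shows the kind of lookup an IME engine performs and how the number of candidates for one pinyin code drives page turning. The toy dictionary, the page size of 5, and all function names are hypothetical and are not taken from any real IME.

```python
from typing import Dict, List

# Hypothetical toy dictionary: pinyin code -> characters sharing that code.
# A real IME dictionary is far larger, which makes the problem worse.
TOY_DICT: Dict[str, List[str]] = {
    "ba": ["八", "把", "吧", "爸", "巴", "拔", "罢", "霸"],
    "ma": ["吗", "妈", "马", "骂", "麻"],
}

PAGE_SIZE = 5  # assumed number of candidates shown per page


def candidates(pinyin: str) -> List[str]:
    """Return all candidate characters for a pinyin code."""
    return TOY_DICT.get(pinyin, [])


def pages_needed(pinyin: str, page_size: int = PAGE_SIZE) -> int:
    """Number of candidate pages for a pinyin code (ceiling division)."""
    n = len(candidates(pinyin))
    return -(-n // page_size)


def page_of(pinyin: str, ch: str, page_size: int = PAGE_SIZE) -> int:
    """1-based page on which ch appears; raises ValueError if absent."""
    return candidates(pinyin).index(ch) // page_size + 1


print(pages_needed("ba"))   # 2
print(page_of("ba", "八"))  # 1
print(page_of("ba", "霸"))  # 2
```

With only eight homophones the last candidate already falls on the second page; with the tens of thousands of characters mentioned above, a rarely used character can sit many pages deep.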
- The present invention addresses the following problems in the prior art: methods for inputting uncommon words are difficult to master, simplified input rules produce a large number of candidate uncommon words, and multiple page turns are needed to find the uncommon word to be input. To solve these problems, it provides a method for inputting uncommon words.
- the method includes the following steps:
- Step 1 User input
- The user activates the uncommon word input method system and inputs the full pinyin spelling of one word-making part or non-word-making part of the uncommon word to be input.
- In the uncommon word input method system, the uncommon words built from all word-making parts or non-word-making parts that share the same pinyin form an uncommon word set.
- Within that set, the uncommon words built from the same word-making part or non-word-making part form an uncommon word subset;
- Step 2 Retrieve the collection of uncommon words and display
- The uncommon word input method system retrieves the uncommon word set corresponding to the pinyin input in Step 1, numbers all uncommon word subsets in that set, and displays them in the input interface;
- Step 3 The user selects the uncommon word on the screen
- The user picks the uncommon word subset that matches the basic word (the part whose pinyin was input) and presses the number key corresponding to its number. The input interface then displays all uncommon words of the selected subset, each with its own number; the user presses the number key corresponding to the desired uncommon word, and that word is output to the screen.
- The uncommon word subsets are arranged vertically and numbered sequentially.
- The uncommon words are likewise arranged vertically and numbered sequentially, and each is marked with annotation information such as its pronunciation and the corresponding fake characters. The annotation information is for labeling only and is not output to the screen with the uncommon word.
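The three steps can be sketched as a two-level lookup: pinyin of a part, then the part itself, then the word. The code below is a minimal illustration under stated assumptions. The index contents, the ordering of subsets, and the function names `list_subsets`, `list_words`, and `pick_word` are all hypothetical and are not taken from the patent.

```python
from typing import Dict, List, Tuple

# Hypothetical toy index: pinyin of a part -> {part: words containing it}.
# The outer value models an "uncommon word set"; each inner list models an
# "uncommon word subset". The data is illustrative only.
INDEX: Dict[str, Dict[str, List[str]]] = {
    "ba": {
        "八": ["仈", "朳"],
        "巴": ["岜", "笆", "粑"],
        "霸": ["灞", "壩"],
    },
}


def list_subsets(pinyin: str) -> List[Tuple[int, str]]:
    """Step 2: number the subsets (identified by part) for a pinyin."""
    subsets = INDEX.get(pinyin, {})
    return [(i, part) for i, part in enumerate(subsets, start=1)]


def list_words(pinyin: str, subset_no: int) -> List[Tuple[int, str]]:
    """Step 3, first key press: number the words of the chosen subset."""
    part = dict(list_subsets(pinyin))[subset_no]
    return list(enumerate(INDEX[pinyin][part], start=1))


def pick_word(pinyin: str, subset_no: int, word_no: int) -> str:
    """Step 3, second key press: return the word to output to the screen."""
    return dict(list_words(pinyin, subset_no))[word_no]


print(list_subsets("ba"))     # [(1, '八'), (2, '巴'), (3, '霸')]
print(list_words("ba", 2))    # [(1, '岜'), (2, '笆'), (3, '粑')]
print(pick_word("ba", 2, 3))  # 粑
```

The point of the two-level structure is that each list the user sees stays short: the user scans a handful of parts, then a handful of words, rather than one long flat candidate list.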
- The beneficial effect of the invention is as follows. The vast majority of uncommon words are composed of two or more word-making parts or non-word-making parts, and the user only needs to know the pinyin of one of those parts.
- With the method of the invention, the user first selects an uncommon word subset and then selects the uncommon word, which avoids extensive page turning. Input of uncommon words is accurate, convenient, and fast, and the method can also be used to input common Chinese characters.
- Common input methods generally cover only the common Chinese character set, so each pinyin has relatively few candidates. The invention is therefore better suited to inputting uncommon words. It is particularly suitable for computer terminals used to collate ancient documents, and it can also be used on smart mobile terminals to meet the needs of users who like to use uncommon words.
- Figure 1 is a schematic diagram of the display after "vba" is input
- Figure 2 is a schematic diagram of the display after the "3" key is pressed
- Figure 3 is a program flow chart.
- A method for inputting uncommon words includes the following steps:
- Step 1 User input
- The user activates the uncommon word input method system and inputs the pinyin "vba" through the keyboard. In the uncommon word input method system, the uncommon words built from all word-making parts or non-word-making parts whose pinyin is "ba" form an uncommon word set.
- Within that set, the uncommon words built from the same word-making part or non-word-making part form an uncommon word subset; the parts include "stop, eight, bar, tyrant, stop" and other word-making parts or non-word-making parts;
- Step 2 Retrieve the collection of uncommon words and display
- The uncommon word input method system retrieves the uncommon word set corresponding to the pinyin "ba" input in Step 1, numbers all uncommon word subsets in that set, and displays them in the input interface; the result is shown in Figure 2.
- The uncommon word subsets are arranged vertically and numbered sequentially, which makes them easier for the user to look through.
- Step 3 The user selects the uncommon word on the screen
- The user picks the uncommon word subset that matches the basic word and presses the corresponding number key "3". The input interface then displays all uncommon words of the selected subset, each with its own number; the result is shown in Figure 3.
- The uncommon words are arranged vertically and numbered sequentially, and each is marked with annotation information such as its pronunciation and the corresponding fake characters.
- The annotation information is for labeling only and is not output to the screen with the uncommon word. The user thus learns the pronunciation of the uncommon word and its corresponding fake characters while inputting it.
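The display and commit behaviour of the embodiment can be sketched as follows. The text shows only the input "vba" and the pinyin "ba"; treating the leading "v" as a mode prefix is an assumption made here for illustration. The `Candidate` class, the function names, and the sample candidates with their annotations are likewise hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Candidate:
    char: str        # the word that is output to the screen
    annotation: str  # display-only note, e.g. the pronunciation


def parse_input(keys: str, prefix: str = "v") -> Optional[str]:
    """Assumption: a leading 'v' selects uncommon word mode and the rest is
    the pinyin of a part. The text itself only shows the input "vba"."""
    if keys.startswith(prefix) and len(keys) > len(prefix):
        return keys[len(prefix):]
    return None


def render(cands: List[Candidate]) -> str:
    """Vertical, sequentially numbered list with annotations."""
    return "\n".join(
        f"{i}. {c.char}  [{c.annotation}]" for i, c in enumerate(cands, start=1)
    )


def commit(cands: List[Candidate], number_key: int) -> str:
    """Only the word is output; the annotation is never committed."""
    if not 1 <= number_key <= len(cands):
        raise ValueError(f"no candidate numbered {number_key}")
    return cands[number_key - 1].char


# Hypothetical candidates of one subset; the annotations are placeholders.
subset = [Candidate("岜", "bā"), Candidate("笆", "bā"), Candidate("粑", "bā")]

print(parse_input("vba"))  # ba
print(render(subset))
print(commit(subset, 3))   # 粑
```

Keeping the annotation in a separate field from the word makes the "labeling only" rule structural: `commit` has no way to leak the annotation into the output text.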
Landscapes
- Engineering & Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Document Processing Apparatus (AREA)
Abstract
The present invention relates to the technical field of computer information processing and concerns a method for inputting rarely used characters. It aims to solve the prior-art problems that the input of rarely used characters is difficult to master, that a very large number of candidate rarely used characters is generated when an input rule is simplified, and that the rarely used character to be input cannot be found without turning pages multiple times. The method comprises the following steps: step 1: a user makes an input; step 2: a set of rarely used characters is retrieved and displayed; and step 3: the user selects a rarely used character, which is then displayed on the screen. The method is suitable for inputting rarely used characters on a computer terminal or a smartphone terminal.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510319307.2A CN105425976A (zh) | 2015-06-11 | 2015-06-11 | 一种输入生僻字的方法 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2016197265A1 (fr) | 2016-12-15 |
Family
ID=55504232
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2015/000407 WO2016197265A1 (fr) | Method for inputting rarely used characters | 2015-06-11 | 2015-06-15 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN105425976A (fr) |
WO (1) | WO2016197265A1 (fr) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111444680A (zh) * | 2020-04-25 | 2020-07-24 | 中信银行股份有限公司 | 生僻字的编码扩展方法、装置、存储介质及电子设备 |
CN112698734A (zh) * | 2020-12-24 | 2021-04-23 | 维沃移动通信有限公司 | 候选词显示方法及装置、电子设备 |
CN117151041A (zh) * | 2023-10-27 | 2023-12-01 | 成方金融科技有限公司 | 兼容生僻字的pdf生成方法、装置、设备和存储介质 |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107145244A (zh) * | 2017-04-26 | 2017-09-08 | 北京汉王数字科技有限公司 | 一种异形字符输入方法、装置及电子设备 |
CN112199031B (zh) * | 2020-10-15 | 2022-08-05 | 科大讯飞股份有限公司 | 输入方法、装置、设备及存储介质 |
CN114637407A (zh) * | 2020-12-15 | 2022-06-17 | 中国联合网络通信集团有限公司 | 汉字输入方法、系统、计算机设备及存储介质 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101923421A (zh) * | 2009-06-09 | 2010-12-22 | 邓桂成 | 候选汉字显示方法和系统以及汉字输入方法和系统 |
CN102566769A (zh) * | 2010-12-13 | 2012-07-11 | 腾讯科技(深圳)有限公司 | 汉字输入方法及系统 |
CN102736741A (zh) * | 2011-04-12 | 2012-10-17 | 腾讯科技(深圳)有限公司 | 一种汉字的拼音输入方法及系统 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7382358B2 (en) * | 2003-01-16 | 2008-06-03 | Forword Input, Inc. | System and method for continuous stroke word-based text input |
CN1719390A (zh) * | 2005-07-18 | 2006-01-11 | 王宏源 | 利用汉字全信息通过全文检索获取生僻字的检字录入方法 |
CN101236461A (zh) * | 2008-02-25 | 2008-08-06 | 黄健风 | 一种基于字形特征的汉字输入法 |
CN102541281A (zh) * | 2010-12-22 | 2012-07-04 | 张家港市赫图阿拉信息技术有限公司 | 一种输入疑难字的方法 |
2015
- 2015-06-11 CN CN201510319307.2A patent/CN105425976A/zh active Pending
- 2015-06-15 WO PCT/CN2015/000407 patent/WO2016197265A1/fr active Application Filing
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111444680A (zh) * | 2020-04-25 | 2020-07-24 | 中信银行股份有限公司 | 生僻字的编码扩展方法、装置、存储介质及电子设备 |
CN111444680B (zh) * | 2020-04-25 | 2023-05-16 | 中信银行股份有限公司 | 生僻字的编码扩展方法、装置、存储介质及电子设备 |
CN112698734A (zh) * | 2020-12-24 | 2021-04-23 | 维沃移动通信有限公司 | 候选词显示方法及装置、电子设备 |
CN112698734B (zh) * | 2020-12-24 | 2024-04-26 | 维沃移动通信有限公司 | 候选词显示方法及装置、电子设备 |
CN117151041A (zh) * | 2023-10-27 | 2023-12-01 | 成方金融科技有限公司 | 兼容生僻字的pdf生成方法、装置、设备和存储介质 |
CN117151041B (zh) * | 2023-10-27 | 2024-02-27 | 成方金融科技有限公司 | 兼容生僻字的pdf生成方法、装置、设备和存储介质 |
Also Published As
Publication number | Publication date |
---|---|
CN105425976A (zh) | 2016-03-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
TWI468954B (zh) | 分類及檢索在中文型文字中找到的重現字形偏旁及根據在電子及非電子本文中的重現字形偏旁分類及檢索中文型文字之方法及電子字典 | |
WO2016197265A1 (fr) | Method for inputting rarely used characters | |
Hermjakob et al. | Out-of-the-box universal Romanization tool uroman | |
CN103109250B (zh) | 一种汉字简易笔划输入法和汉字简易笔划检索法 | |
Sodhar et al. | Identification of issues and challenges in romanized Sindhi text | |
US20150088486A1 (en) | Written language learning using an enhanced input method editor (ime) | |
Sharma et al. | Word prediction system for text entry in Hindi | |
CN103376910A (zh) | 无重码双拼输入法 | |
Lu | Computers and Chinese writing systems | |
CN102750000A (zh) | 双拼王输入法 | |
TW201512969A (zh) | 增進手寫輸入效率之方法 | |
Greenwood | International cultural differences in software | |
Ghosh et al. | Labeling of query words using conditional random field | |
CN103984420B (zh) | 一种基于拼音的藏文智能输入法 | |
JP2010073177A (ja) | 国際対応型日本語入力システム | |
Iglesias-Franjo et al. | TIR over Egyptian hieroglyphs | |
US12307023B2 (en) | Chinese character input method and system and keyboard | |
US20240184381A1 (en) | Chinese character input method and system and keyboard | |
Joshi et al. | Input Scheme for Hindi Using Phonetic Mapping | |
CN103576891A (zh) | 一键快打字 | |
JP7223450B2 (ja) | 自動翻訳装置及び自動翻訳プログラム | |
CN110502128B (zh) | 一种汉字多元输入方法及系统 | |
TWI230870B (en) | New Chinese input method | |
JP2008217770A (ja) | 言語データ表示システム、言語データ表示方法、及び言語データ表示プログラム | |
JP2008262248A (ja) | 文字検索方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 15894536; Country of ref document: EP; Kind code of ref document: A1 |
 | NENP | Non-entry into the national phase | Ref country code: DE |
 | 122 | Ep: pct application non-entry in european phase | Ref document number: 15894536; Country of ref document: EP; Kind code of ref document: A1 |