JPH05334490A

JPH05334490A - Table recognizing device

Info

Publication number: JPH05334490A
Application number: JP4161858A
Authority: JP
Inventors: Katsuhiko Itonori; 糸乘勝彦
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 1992-05-29
Filing date: 1992-05-29
Publication date: 1993-12-17
Anticipated expiration: 2014-07-28
Also published as: JP2926066B2

Abstract

PURPOSE:To provide a table recognizing device capable of accurately segmenting respective frames constituting a table even in the case of a table omitting a part or the whole of vertical and horizontal ruled lines. CONSTITUTION:The device is provided with a character block extracting means 11 for extracting a character block from a table image and a positional relation identifying means 12 for identifying positional relation between character blocks extracted by the means 11 and outputting data expressing the structure of the table. Since the structure of the table is recognized by using the arrangement of character blocks, the structure of the table can be accurately recognized even when part or the whole of vertical and horizontal ruled lines is omitted.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は文書画像処理の分野にお
いて、表画像から表の構造を認識する表認識装置に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a table recognition device for recognizing a table structure from a table image in the field of document image processing.

【０００２】[0002]

【従来の技術】従来の表認識の方式としては、表領域の
周辺分布を用いる方式や、表を構成する罫線をベクトル
線分に変換して、罫線で囲まれた矩形枠を抽出する方式
が知られている。周辺分布を使用する方式として例えば
特開平２−６１７７５公報記載のものがあり、ベクトル
線分を使用する方式として例えば特開平１−１２９３５
８公報記載のものがある。2. Description of the Related Art Conventional table recognition methods include a method using a peripheral distribution of a table area and a method of converting a ruled line forming a table into a vector line segment and extracting a rectangular frame surrounded by the ruled line. Are known. A method using the marginal distribution is disclosed in, for example, Japanese Patent Laid-Open No. 2-61775, and a method using a vector line segment is, for example, Japanese Patent Laid-Open No. 1-21935.
8 publications are available.

【０００３】特開平２−６１７７５公報記載の周辺分布
を使用する方式は、表領域の画像の周辺分布をとり、そ
の周辺分布のヒストグラムからある閾値以上の高さを持
つ山から罫線の位置を推定し、罫線の位置が表の最も外
側にある外枠の罫線を取り出す。次にこの外枠に両端を
接する罫線を求め、その罫線により外枠を複数の矩形枠
に分割する。さらに、分割された各矩形枠内に対して同
様の処理を再帰的に施すことにより、罫線で囲まれた矩
形枠を抽出する。後者の特開平１−１２９３５８公報記
載の方式は、ベクトル線分を追跡して取り出した各矩形
枠の位置関係を調べることで表の認識を行なう。The method using the peripheral distribution described in Japanese Patent Laid-Open No. 2-61775 takes the peripheral distribution of an image in a table area and estimates the position of a ruled line from a mountain having a height higher than a certain threshold from the histogram of the peripheral distribution. Then, the ruled line of the outer frame having the ruled line position on the outermost side of the table is taken out. Next, a ruled line contacting both ends of this outer frame is obtained, and the outer frame is divided into a plurality of rectangular frames by the ruled line. Further, the same processing is recursively performed on each of the divided rectangular frames to extract a rectangular frame surrounded by ruled lines. The latter method disclosed in Japanese Patent Laid-Open No. 1-129358 recognizes a table by tracing the vector line segments and checking the positional relationship between the extracted rectangular frames.

【０００４】これらの方式は、表を構成する罫線に省略
が無いことを前提としているが、実際に文書中に使用さ
れる表には罫線の一部が省略されているものも結構多
い。特開平２−２６４３８６公報記載の方式においては
表の両脇の罫線が省略されている場合でも、正しく矩形
枠を取り出せる方式である。すなわち、表画像から取り
出した縦罫線、横罫線から表の両脇に罫線があるかを判
別し、無い場合に表の両脇に縦罫線を仮想的に生成する
方式である。These systems are based on the assumption that the ruled lines that make up the table are not omitted, but there are quite a few tables that are actually used in a document in which some of the ruled lines are omitted. In the method described in Japanese Patent Laid-Open No. 2-264386, even if the ruled lines on both sides of the table are omitted, the rectangular frame can be taken out correctly. That is, it is a method of determining whether there are ruled lines on both sides of the table from the vertical ruled lines and the horizontal ruled lines extracted from the table image and virtually generating vertical ruled lines on both sides of the table when there is no ruled line.

【０００５】[0005]

【発明が解決しようとする課題】従来、文書中に使用さ
れている表には、様々な形態のものがある。図２はその
例を示すもので、同図（ａ）の表は全ての罫線が揃った
表、（ｂ）は両脇の罫線が省略された表、（ｃ）および
（ｄ）は両脇の罫線の他にも省略されている縦罫線、横
罫線がある表、（ｅ）は全ての罫線が省略された表であ
る。このうち（ａ）および（ｂ）の各表に関しては従来
の技術によって対応可能であるが、（ｃ）（ｄ）の表の
ように両脇の罫線の他にも省略されている縦罫線、横罫
線がある場合および（ｅ）の表のように全ての罫線が省
略されている場合には表の構造を正確に認識して、表と
して意味のある単位で文字列を取り出すことができなか
った。本発明の目的は、縦罫線、横罫線の一部または全
部に省略のあるような表であっても、表を構成する各枠
を正確に切り出すことのできる、表認識装置を提供する
ことにある。Conventionally, there are various types of tables used in documents. FIG. 2 shows an example thereof. The table in FIG. 2 (a) is a table in which all ruled lines are aligned, (b) is a table in which ruled lines on both sides are omitted, and (c) and (d) are both sides. In addition to the above ruled lines, a table having vertical ruled lines and horizontal ruled lines which are omitted, and (e) is a table in which all ruled lines are omitted. Of these, the tables of (a) and (b) can be dealt with by the conventional technique, but vertical ruled lines omitted in addition to the ruled lines on both sides as in the tables of (c) and (d), When there are horizontal ruled lines and when all ruled lines are omitted like the table in (e), it is not possible to correctly recognize the structure of the table and extract the character strings in meaningful units as a table. It was An object of the present invention is to provide a table recognition device capable of accurately cutting out each frame forming a table even in a table in which some or all of vertical and horizontal ruled lines are omitted. is there.

【０００６】[0006]

【課題を解決するための手段および作用】本発明の表認
識装置は、表画像から文字ブロックを抽出する文字ブロ
ック抽出手段（図１の１１、図８の８１）と、前記文字
ブロック抽出手段により抽出された文字ブロック相互の
位置関係を識別し、表の構造を表すデータを出力する位
置関係識別手段（図１の１２、図８の８２）とを基本的
な構成として備えたものである。この発明によれば、文
字ブロック抽出手段により抽出された文字ブロック相互
の位置関係を位置関係識別手段により識別する。表にお
ける文字ブロックは表の構成要素として一般に規則正し
く整列した位置関係にあるので、文字ブロック相互の位
置関係を見ることにより表の構造を認識できる。従来
は、表の罫線のみに着目して表を構成する枠を求めてい
たので、縦罫線、横罫線の一部または全部に省略のある
ような表の構造を正確に認識することができないという
問題があったが、本発明によれば文字ブロックの並びを
用いて表の構造を認識するので、その問題は解消でき
る。The table recognition device of the present invention comprises a character block extracting means (11 in FIG. 1 and 81 in FIG. 8) for extracting a character block from a table image, and the character block extracting means. It is provided with a positional relationship identifying means (12 in FIG. 1, 82 in FIG. 8) for identifying the positional relationship between the extracted character blocks and outputting data representing the structure of the table as a basic configuration. According to the present invention, the positional relationship between the character blocks extracted by the character block extracting means is identified by the positional relationship identifying means. Since the character blocks in the table generally have a positional relationship in which they are regularly arranged as constituent elements of the table, the structure of the table can be recognized by looking at the positional relationship between the character blocks. In the past, since the frame forming the table was sought by paying attention only to the ruled lines of the table, it is impossible to accurately recognize the structure of the table in which some or all of the vertical ruled lines and the horizontal ruled lines are omitted. Although there is a problem, according to the present invention, since the structure of the table is recognized by using the arrangement of the character blocks, the problem can be solved.

【０００７】本発明の一態様によれば、前記の基本的な
構成において、前記文字ブロック抽出手段は、文字の書
かれている画素の塊を囲む矩形領域を求める文字矩形抽
出手段（図１の１１１）と、その文字矩形抽出手段で求
めた各文字矩形間の距離を求めて、その距離がある閾値
より小さな文字矩形を全て１つの文字ブロックとして統
合する文字ブロック矩形抽出手段（図１の１１２）を備
えている。その閾値は全体の文字矩形間の距離の統計を
調べて決めたり、文字の幅を基準にしてその何％という
ようにして決めたりすればよい。According to one aspect of the present invention, in the basic configuration described above, the character block extracting means is a character rectangle extracting means (see FIG. 1) for obtaining a rectangular area surrounding a block of pixels in which characters are written. 111) and the distance between the respective character rectangles obtained by the character rectangle extracting means, and all the character rectangles whose distances are smaller than a certain threshold are integrated as one character block. Character block rectangle extracting means (112 in FIG. 1). ) Is provided. The threshold value may be determined by examining statistics of the distance between the entire character rectangles, or by determining what percentage of the width of the character is the standard.

【０００８】本発明の他の態様によれば、前記の基本的
な構成において、前記文字ブロック抽出手段は、表中の
文字と罫線を分離して、罫線をベクトル化する罫線ベク
トル化手段（図８の８１１）と、罫線ベクトル化手段に
より得られた罫線のベクトルデータを基に文字が書かれ
ているべき矩形領域を文字領域として抽出する文字領域
抽出手段（図８の８１２）と、その文字領域抽出手段で
求めた各文字領域に対して、文字の書かれている画素の
塊を囲む矩形領域を求める文字矩形抽出手段（図８の８
１３）と、その文字矩形抽出手段で求めた各文字矩形間
の距離を求めて、ある閾値より小さな文字矩形を全て１
つの文字ブロックとして統合する文字ブロック矩形抽出
手段（図８の８１４）とを備えている。これは前の段落
（０００７）で説明した文字ブロック抽出手段に、罫線
ベクトル化手段と文字領域抽出手段とを付加した構成の
ものである。この態様によれば、罫線ベクトル化手段で
罫線を求め、文字領域抽出手段により罫線により挟まれ
た領域を調べて文字が書かれるべき各文字領域を把握
し、その各文字領域において文字矩形を抽出するように
したので、文字ブロックを精度良く抽出することができ
る。According to another aspect of the present invention, in the above basic structure, the character block extracting means separates the characters in the table from the ruled lines and vectorizes the ruled lines (see FIG. 811), a character area extraction means (812 in FIG. 8) for extracting a rectangular area in which a character should be written as a character area based on the ruled line vector data obtained by the ruled line vectorization means, and the character. For each character area obtained by the area extracting means, a character rectangle extracting means (8 in FIG. 8) for obtaining a rectangular area surrounding a block of pixels in which characters are written.
13) and the distance between the character rectangles obtained by the character rectangle extraction means, and all character rectangles smaller than a certain threshold are set to 1
And a character block rectangle extraction unit (814 in FIG. 8) that is integrated as one character block. This has a configuration in which ruled line vectorization means and character area extraction means are added to the character block extraction means described in the previous paragraph (0007). According to this aspect, the ruled line vectorization unit obtains the ruled line, the character region extraction unit checks the region sandwiched by the ruled lines to grasp each character region in which the character is to be written, and the character rectangle is extracted in each of the character regions. As a result, the character block can be accurately extracted.

【０００９】本発明の他の態様によれば、前記の基本的
な構成において、前記位置関係識別手段は、文字ブロッ
ク抽出処理により抽出された文字ブロック矩形を構成枠
とし、その構成枠の行方向の並びを識別する行抽出手段
（図１の１２１、図８の８２１）と、構成枠識別手段で
抽出した表を構成する構成枠の列方向の並びを識別する
列抽出手段（図１の１２２、図８の８２２）とを備えて
いる。また、そのさらに具体的態様においては、前記行
抽出手段は各構成枠の中心のｙ座標が所定の誤差範囲で
同一である構成枠の群を同一の行として抽出するよう構
成され、前記列抽出手段は各構成枠の中心のｘ座標が所
定の誤差範囲で同一である構成枠の群を同一の列として
抽出するよう構成される。表における文字ブロックは一
般に表の行および列に沿って配置されているので、この
ように文字ブロック矩形を構成枠として行および列方向
の並びを調べ、行および列にグループ化することにより
表構造の構成要素を抽出することができる。According to another aspect of the present invention, in the above basic structure, the positional relationship identifying means uses a character block rectangle extracted by the character block extraction process as a constituent frame, and a line direction of the constituent frame. Row extracting means (121 in FIG. 1, 821 in FIG. 8) for identifying the arrangement of columns, and column extracting means for identifying the arrangement in the column direction of the constituent frames that make up the table extracted by the constituent frame identifying means (122 in FIG. 1). , 822 in FIG. 8). Further, in a more specific aspect thereof, the row extracting means is configured to extract a group of constituent frames having the same y-coordinate of the center of each constituent frame within a predetermined error range as the same row, and extract the column. The means are arranged to extract groups of constituent frames in which the x-coordinate of the center of each constituent frame is the same within a predetermined error range as the same column. Since the character blocks in a table are generally arranged along the rows and columns of the table, the table structure is determined by grouping rows and columns by checking the arrangement in the row and column directions using the character block rectangle as a frame. Can be extracted.

【００１０】さらに、本発明の他の態様では、前記の基
本的な構成において、さらに罫線によって囲まれる矩形
枠を抽出する矩形枠抽出手段を設け、位置関係識別手段
において罫線で囲まれた矩形枠と表中の文字ブロックを
同等に扱い各位置関係を識別するようにしたものであ
る。すなわち、この表認識装置は、表画像から文字ブロ
ックを抽出する文字ブロック抽出手段（図１１の１１
３）と、表画像から表を構成する罫線によって囲まれる
矩形枠を抽出する矩形枠抽出手段（図１１の１１２）
と、前記矩形枠抽出手段により抽出された矩形枠および
前記文字ブロック抽出手段により抽出された文字ブロッ
ク相互の位置関係を識別し、表の構造を表すデータを作
成する位置関係識別手段（図１１の１１４）とを備えて
いる。表の罫線で囲まれた矩形枠と表中の文字ブロック
を同等に扱うことにより、罫線で囲まれていない表中の
枠であっても、文字ブロックとして表の中の１つの構成
要素であると識別されるので、図２における（ａ），
（ｂ）の表はもちろん、（ｃ），（ｄ），（ｅ）の表も
正確に認識することができる。Further, according to another aspect of the present invention, in the above basic structure, a rectangular frame extracting means for extracting a rectangular frame surrounded by ruled lines is further provided, and the rectangular frame surrounded by the ruled lines in the positional relationship identifying means. The character blocks in the table are treated equally and each positional relationship is identified. That is, this table recognition device is a character block extracting means (11 in FIG. 11) for extracting a character block from a table image.
3) and a rectangular frame extracting means for extracting a rectangular frame surrounded by ruled lines forming a table from the table image (112 in FIG. 11).
And a positional relationship identifying means for identifying the positional relationship between the rectangular frame extracted by the rectangular frame extracting means and the character blocks extracted by the character block extracting means, and creating data representing the structure of the table (see FIG. 11). 114) and. By treating the rectangular frame surrounded by the ruled line of the table and the character block in the table equally, even a frame in the table not surrounded by the ruled line is one component in the table as a character block. (A) in FIG. 2,
Not only the table of (b) but also the tables of (c), (d), and (e) can be recognized accurately.

【００１１】上記発明において、矩形枠抽出手段は、罫
線画像をベクトルデータに変換する罫線ベクトル化手段
（図８の１１２１）と、その罫線ベクトル化手段により
出力された罫線ベクトルの接続関係を基に矩形枠を求め
る第１の矩形枠抽出手段（図１１の１１２１）と、一端
が他のいずれの罫線ベクトルにも接続されていない罫線
ベクトルから一部の罫線が省略された矩形枠を抽出する
第２の矩形枠抽出手段（図１１の１１２３）とを備えて
いる。In the above invention, the rectangular frame extraction means is based on the connection relation between the ruled line vectorization means (1121 in FIG. 8) for converting the ruled line image into vector data and the ruled line vector output by the ruled line vectorization means. First rectangular frame extraction means (1121 in FIG. 11) for obtaining a rectangular frame, and a rectangular frame in which some ruled lines are omitted from a ruled line vector whose one end is not connected to any other ruled line vector 2 rectangular frame extracting means (1123 in FIG. 11).

【００１２】上記発明において、位置関係識別手段は、
その一態様によれば、前記矩形抽出手段により抽出した
表の罫線から構成される矩形枠と文字ブロック抽出処理
により抽出された文字ブロック矩形枠とから表を構成す
る構成枠を識別する構成枠識別手段（図１１の１１４
１）と、その構成枠識別手段で抽出した表を構成する構
成枠の行方向の並びを識別する行抽出手段（図１１の１
１４２）と、構成枠識別手段で抽出した表を構成する構
成枠の列方向の並びを識別する列抽出手段（図１１の１
１４３）を備えている。また、その構成枠識別手段は、
具体的態様においては、前記矩形抽出手段により抽出し
た矩形枠については、その矩形枠内の文字ブロックを抽
出し、複数の文字ブロックがあったときは、その複数の
文字ブロックをそれぞれ構成枠と決定し、単一の文字ブ
ロックがあったときは矩形枠を構成枠と決定するもので
ある。このように構成枠を決定（認識）することによ
り、図２の（ｄ）のように一部に罫線が省略されている
罫線の矩形枠があっても、表の構成要素である構成枠を
正確に決定することができる。In the above invention, the positional relationship identifying means is
According to one aspect thereof, a component frame identification for identifying a component frame forming a table from a rectangular frame formed by the ruled lines of the table extracted by the rectangle extraction means and a character block rectangular frame extracted by the character block extraction processing. Means (114 in FIG. 11)
1) and the row extracting means (1 in FIG. 11) for identifying the arrangement in the row direction of the constituent frames forming the table extracted by the constituent frame identifying means.
142) and column extraction means (1 in FIG. 11) for identifying the arrangement in the column direction of the constituent frames that make up the table extracted by the constituent frame identification means.
143). Further, the constituent frame identification means is
In a specific aspect, with respect to the rectangular frame extracted by the rectangular extraction means, a character block within the rectangular frame is extracted, and when there are a plurality of character blocks, the plurality of character blocks are respectively determined as constituent frames. However, when there is a single character block, the rectangular frame is determined as the constituent frame. By determining (recognizing) the constituent frame in this way, even if there is a rectangular frame of ruled lines in which the ruled lines are partially omitted as shown in FIG. Can be accurately determined.

【００１３】[0013]

【Example】

（第１の実施例）図１は本発明の第１の実施例の構成を
示す図である。この実施例の表認識装置は、一連の文字
からなる文字ブロックの配置状態を調べて表の構造を認
識するものであって、図１に示すように表画像中の文字
画像から文字ブロックを抽出する文字ブロック抽出部１
１と、文字ブロック抽出部１１により抽出された文字ブ
ロック相互の位置関係を識別し表の構造を表すデータを
得る位置関係識別部１２とを備えている。(First Embodiment) FIG. 1 is a diagram showing the configuration of the first embodiment of the present invention. The table recognition device of this embodiment is for recognizing the structure of a table by checking the arrangement state of a character block consisting of a series of characters, and extracting the character block from the character image in the table image as shown in FIG. Character block extraction unit 1
1 and a positional relationship identifying unit 12 for identifying the positional relationship between the character blocks extracted by the character block extracting unit 11 and obtaining data representing the structure of the table.

【００１４】文字ブロック抽出部１１は、文字の書かれ
ている画素の塊を囲む矩形領域を求める文字矩形抽出処
理部１１１と、その文字矩形抽出処理部１１１で求めた
各文字矩形間の距離を求めて、その距離がある閾値より
小さな文字矩形を文字ブロックとして統合する文字ブロ
ック矩形抽出処理部１１２からなっている。また、位置
関係識別部１２は、文字ブロック抽出処理により抽出さ
れた文字ブロック矩形を構成枠として受け取り、その構
成枠の行方向の並びを識別する行抽出処理部１２１と、
構成枠の列方向の並びを識別する列抽出処理部１２２
と、位置関係の識別結果を記憶する表構造記憶部１２３
からなっている。The character block extraction unit 11 calculates a character rectangle extraction processing unit 111 that obtains a rectangular area that surrounds a block of pixels in which characters are written, and a distance between each character rectangle obtained by the character rectangle extraction processing unit 111. A character block rectangle extraction processing unit 112 that obtains and integrates a character rectangle whose distance is smaller than a certain threshold as a character block. Further, the positional relationship identifying unit 12 receives a character block rectangle extracted by the character block extracting process as a constituent frame, and a line extraction processing unit 121 for identifying the arrangement in the row direction of the constituent frame,
Column extraction processing unit 122 for identifying the arrangement of the constituent frames in the column direction
And a table structure storage unit 123 that stores the identification result of the positional relationship.
It consists of

【００１５】以上のように構成された本実施例の各部の
処理について、詳細に説明する。本実施例で処理の対象
とする画像は、イメージスキャナなどの画像入力装置に
より入力された表を含む文書画像から表領域が分離され
て得られた表画像である。表領域の分離手段は画面上で
マウスのようなポインティングデバイスにより操作者が
指定するものや、画像の属性を基に自動的に分離する表
領域分離装置（例えば、特開平２−２１０５８６号公報
参照）などがあり、いずれも公知の技術である。文字矩
形抽出処理部１１１は、表画像中の文字画像部分に対し
て、図３の（ａ），（ｂ）に示すように、字の書かれて
いる画素の塊３１，３２，３３，３４を囲む矩形領域３
５，３６，３７，３８を求める。すなわち、表の画像が
背景の画素値が０、文字／線の画素値が１で書かれてい
る時、画素値が１である塊を取り出してその矩形領域を
求める。このとき、２つの矩形領域が重なってるとき
は、図３の（ｂ）のように２つの矩形領域３７，３８を
包含できるような矩形領域３９で表す。なお、文字の矩
形領域を抽出する方法は、良く知られている技術（例え
ば、特開平２−２６７６７８号公報参照）であるので詳
細な説明は省略する。The processing of each section of the present embodiment configured as above will be described in detail. The image to be processed in this embodiment is a table image obtained by separating a table area from a document image including a table input by an image input device such as an image scanner. The table area separating means is a table area separating device that is specified by an operator on a screen with a pointing device such as a mouse, or a table area separating device that automatically separates images based on image attributes (see, for example, Japanese Patent Laid-Open No. 2-210586). ) And the like, all of which are known techniques. The character rectangle extraction processing unit 111, as shown in (a) and (b) of FIG. 3, for the character image portion in the front image, a block of pixels 31, 32, 33, 34 in which characters are written. Rectangular area 3 surrounding
Find 5, 36, 37, 38. That is, when the table image is written with a background pixel value of 0 and a character / line pixel value of 1, a block having a pixel value of 1 is taken out to obtain its rectangular area. At this time, when the two rectangular areas overlap each other, they are represented by a rectangular area 39 which can include the two rectangular areas 37 and 38 as shown in FIG. 3B. The method of extracting the rectangular area of the character is a well-known technique (see, for example, Japanese Patent Laid-Open No. 2-267678), and detailed description thereof will be omitted.

【００１６】さらに文字ブロック矩形抽出処理部１１２
では、文字矩形抽出処理部１１２で求めた各文字矩形間
の距離を求めて、ある閾値より小さな文字矩形を全て１
つの文字ブロックとして統合する処理を行なう。この処
理で用いる閾値は、全体の文字矩形間の距離の統計を調
べて決めてもいいし、文字の大きさの数％として決めて
もよく、ここでは特に閾値の決定方法については定めな
い。この処理を図４の（ａ）に示す罫線のない表に適用
した時の結果は、同図（ｂ）のようになる。これらの処
理の結果得られた文字ブロックの矩形枠はそれぞれに識
別子が付され矩形枠の位置（ｘ座標，ｙ座標）、幅、高
さ等がデータとして適宜のメモリに蓄積される。Further, the character block rectangle extraction processing unit 112.
Then, the distance between the character rectangles calculated by the character rectangle extraction processing unit 112 is calculated, and all character rectangles smaller than a certain threshold are set to 1
Performs the process of integrating as one character block. The threshold used in this processing may be determined by examining statistics of the distances between the entire character rectangles, or may be determined as a few% of the size of the character, and the method of determining the threshold is not specified here. When this process is applied to the table without ruled lines shown in FIG. 4A, the result is as shown in FIG. The rectangular frame of the character block obtained as a result of these processes is given an identifier, and the position (x coordinate, y coordinate), width, height, etc. of the rectangular frame is stored as data in an appropriate memory.

【００１７】図５ａおよび図５ｂは、文字ブロック矩形
抽出処理部１１２の処理のフローを示す図である。図５
ａは文字矩形をブロックにまとめるための前記閾値を求
めるための処理手順を示すものである。処理に必要な定
数や中間結果を格納する格納部として、定数Ｎ、文字矩
形の幅の集計結果を格納する変数ｓｕｍ_w、文字矩形の
高さの集計結果を格納する変数ｓｕｍ_h、幅および高さ
の閾値Ｔ_w，Ｔ_h、変数ｉが用意されている。まず、初期
設定としてＮには文字矩形抽出処理部１１１で抽出した
文字矩形の総数を設定し、ｓｕｍ_w、ｓｕｍ_h、およびｉ
はそれぞれ０に設定する（ステップ５０１）。そして、
ｉがＮを越えていないかどうかを判定し（ステップ５０
２）、ｉ＜Ｎのときは、ｓｕｍ_w、ｓｕｍ_hに文字矩形Ｃ
ｉの幅、高さを加算し（ステップ５０３）、その加算値
を２Ｎで除する（ステップ５０４）。そしてｉを１ずつ
増加させながら（ステップ５０５）、ｉがＮより大きく
なるまでステップ５０２〜５０５の処理を繰り返す。ｉ
がＮより大きくなったとき、幅および高さの閾値Ｔ_w，
Ｔ_hは文字矩形の幅の平均値の１／２の値として得られ
る。FIGS. 5a and 5b are diagrams showing the flow of processing of the character block rectangle extraction processing unit 112. Figure 5
Reference character a indicates a processing procedure for obtaining the threshold value for grouping the character rectangles into blocks. As a storage unit that stores constants and intermediate results necessary for processing, a constant N, a variable sum _w that stores the totaling result of the width of the character rectangle, a variable sum _h that stores the totaling result of the height of the character rectangle, width, and height Thresholds T _w and T _h and a variable i are prepared. First, as an initial setting, the total number of character rectangles extracted by the character rectangle extraction processing unit 111 is set to N, and sum _w , sum _h, and i are set.
Are set to 0 (step 501). And
It is determined whether i does not exceed N (step 50).
2) When i <N, sum _w and sum _h are character rectangles C
The width and height of i are added (step 503), and the added value is divided by 2N (step 504). Then, while increasing i by 1 (step 505), the processes of steps 502 to 505 are repeated until i becomes larger than N. i
Is greater than N, the width and height thresholds T _w ,
T _h is obtained as a value of ½ of the average width of the character rectangle.

【００１８】図５ａの処理で閾値が得られると、図５ｂ
の処理により文字矩形をブロックにまとめる処理を行
う。変数ｊおよびＢを０に設定する（ステップ５０
６）。文字矩形Ｃ_jはいずれかの文字ブロックＣＢに登
録済かを判定する（ステップ５０７）。登録済みであれ
ば、次の文字矩形を処理するため変数ｊを１だけ増加さ
せる（ステップ５１７）。ステップ５０７の判定の結
果、文字矩形Ｃ_jがまだ未登録であったなら、文字ブロ
ックＣＢ_Bに文字矩形Ｃ_jを登録する（ステップ５０
８）。この登録された文字矩形Ｃ_jは一つの文字ブロッ
クＣＢ_Bの先頭の文字矩形となる。次に、その登録した
文字と距離が閾値Ｔ_wあるいはＴ_h以内の距離にある文字
矩形を探して文字ブロックＣＢ_Bに登録する処理を行
う。そのため、先ず変数ｋをｊに設定する（ステップ５
０９）。そして文字矩形Ｃ_kはいずれかの文字ブロック
に登録済かどうかを調べる（ステップ５１０）。登録済
みでなければ、ＣＢ_BとＣ_kとの距離Ｄを求める（ステッ
プ５１１）。求めた距離Ｄが閾値Ｔ_wあるいはＴ_h以内の
距離にあるか否かを調べる（ステップ５１２）。距離Ｄ
が閾値Ｔ_wあるいはＴ_hの範囲内にあったならば、文字矩
形Ｃ_kを文字ブロックＣＢ_Bに追加し、ＣＢ_Bの大きさを
変更する（ステップ５１３）。ステップ５１０で、文字
矩形Ｃ_kが登録済みであると判定されたとき、ステップ
５１２で距離Ｄが閾値Ｔ_wあるいはＴ_hの範囲内にないと
判定されたとき、およびステップ５１３での追加の処理
を終えたときには、次の文字矩形を探すために、ｋ＝ｋ
＋１に設定し（ステップ５１４）、すべての文字矩形に
対する処理が終えたか否かを判定した後（ステップ５１
５）、まだ処理が終わっていないときはその設定した次
の文字矩形についてステップ５１０〜５１４の処理を繰
り返す。ステップ５１５の判定で、ｋ＜Ｎではなくなっ
たときは、次の文字ブロックを求めるために、Ｂ＝Ｂ＋
１とすると共に（ステップ５１６）、ｊ＝ｊ＋１とする
（ステップ５１７）。ｊ＜Ｎの間は、ステップ５０７〜
ステップ５１８の処理を続行し、ｊ＜Ｎでなくなったと
き処理を終了する（ステップ５１９）。When the threshold value is obtained by the process of FIG.
The process of grouping the character rectangles into blocks is performed. Set variables j and B to 0 (step 50).
6). It is determined whether the character rectangle C _j has been registered in any of the character blocks CB (step 507). If registered, the variable j is incremented by 1 to process the next character rectangle (step 517). Is determined in step 507, if the character rectangle C _j was still registered, registers the character rectangle C _j into character blocks CB _B (Step 50
8). The registered character rectangle C _j becomes the leading character rectangle of one character block CB _B. Next, the character rectangle whose distance from the registered character is within the threshold T _w or T _h is searched for and registered in the character block CB _B. Therefore, first, the variable k is set to j (step 5).
09). Then, it is checked whether or not the character rectangle C _k is already registered in any of the character blocks (step 510). If it has not been registered, the distance D between CB _B and C _k is calculated (step 511). It is checked whether or not the obtained distance D is within the threshold T _w or T _h (step 512). Distance D
If There were within the range of the threshold value T _w or T _h, and add the character rectangle C _k into character blocks CB _B, to change the size of the CB _B (step 513). When it is determined in step 510 that the character rectangle C _k has been registered, when it is determined in step 512 that the distance D is not within the range of the threshold T _w or T _h , and additional processing in step 513. When you have finished, to find the next character rectangle, k = k
After setting it to +1 (step 514), it is determined whether or not the processing for all the character rectangles has been completed (step 51).
5) If the processing is not finished yet, the processing of steps 510 to 514 is repeated for the set next character rectangle. When it is determined in step 515 that k <N is not satisfied, B = B + in order to obtain the next character block.
The value is set to 1 (step 516) and j = j + 1 (step 517). While j <N, steps 507-
The process of step 518 is continued, and when j <N is not satisfied, the process ends (step 519).

【００１９】次に、位置関係識別部１２の働きについて
説明する。位置関係識別部１２は、前述のように行抽出
処理部１２１、列抽出処理部１２２の３つの処理部から
なり、以下に順をおって説明する。この実施例では、文
字ブロック抽出部１１で抽出した文字ブロックをそのま
ま構成枠として登録する。図４の（ｃ）が構成枠を示す
ものである。Next, the function of the positional relationship identifying section 12 will be described. The positional relationship identifying unit 12 is composed of the three processing units, the row extraction processing unit 121 and the column extraction processing unit 122 as described above, and will be described below in order. In this embodiment, the character block extracted by the character block extraction unit 11 is registered as it is as a constituent frame. FIG. 4C shows the configuration frame.

【００２０】行抽出処理部１２１と列抽出処理部１２２
では、文字ブロック矩形抽出処理部１２１で抽出した文
字ブロック矩形を表を構成する構成枠とみなし、それら
の並びを識別する。図６ａおよび図６ｂは行抽出処理の
フロー、図７ａおよび図７ｂは列抽出処理のフローを示
す図である。同図に示すように、全ての構成枠の中心点
の座標を求め、行抽出処理では構成枠の中心点のＹ座標
がある誤差範囲内に並んでいる構成枠を表の行と識別
し、列抽出処理では構成枠の中心点のＸ座標がある誤差
範囲内に並んでいる構成枠を表の列と識別する。Row extraction processing section 121 and column extraction processing section 122
Then, the character block rectangles extracted by the character block rectangle extraction processing unit 121 are regarded as the constituent frames forming the table, and their arrangement is identified. 6a and 6b are flowcharts of the row extraction process, and FIGS. 7a and 7b are flowcharts of the column extraction process. As shown in the figure, the coordinates of the center points of all the constituent frames are calculated, and in the line extraction processing, the constituent frames in which the Y coordinates of the central points of the constituent frames are arranged within a certain error range are identified as rows of the table, In the column extraction processing, the constituent frames in which the X-coordinates of the center points of the constituent frames are arranged within a certain error range are identified as columns in the table.

【００２１】即ち、行抽出処理では、図６ａおよび図６
ｂに示すように、先ず構成枠の総数を変数Ｎに設定する
（ステップ６０１）。全ての構成枠の中心点のＹ座標を
求め，配列ＣＢに格納する（ステップ６０２）。全矩形
枠の中で最大の高さを持つものを探索し、その高さの１
／２を誤差範囲の閾値Ｔ_hの値とする（ステップ６０
３）。次に文字ブロックのＹ座標の配列ＣＢを昇順にソ
ートする（ステップ６０４）。そして、ｉ＝Ｇ＝０、ｙ
＝ＣＢ_iに設定し、行配列をクリアする（ステップ６０
５）。次に、配列ＣＢから一つの構成枠のＹ座標ＣＢ_i
を取り出し、行配列に登録済かどうかを判定し（ステッ
プ６０７）、登録されていない構成枠ＣＢ_iにたいして
は、ｙとの距離が閾値Ｔ_h以内の範囲にあるか否かを｜
ＣＢ_i−ｙ｜＜Ｔ_hの演算により判定し（ステップ６０
８）、ｙとの距離が閾値Ｔ_h以内の範囲にあった場合は
ＣＢ_iに対応する構成枠を行配列に格納する（ステップ
６０９）。文字ブロックが登録済みであった場合、およ
びｙとの距離が閾値Ｔ_h以内の範囲になかった場合は、
次の文字ブロックを取り出すためにｉ＝ｉ＋１とする
（ステップ６１０）。取り出した新しい文字ブロックに
対して同様の行配列への判定、登録処理（ステップ６０
７〜６０９）を行う。処理が進みｉ＜Ｎでなくなったら
（ステップ６０６の判定）、一つの行に対する抽出処理
が終了し、次の行の抽出処理を行うため図６ｂのフロー
へ進む。行配列の内容をＧ番目の行情報として出力する
（ステップ６１１）。次に行配列をクリアするととも
に、ｉ＝０、Ｇ＝Ｇ＋１に設定する（ステップ６１
２）。そして、Ｇ＋１番目の行の先頭となるべき構成枠
を探す。すなわち、構成枠を最初から一つずつ取り出
し、いずれかの行に登録済みか否かを判定し（ステップ
６１４）、最初に見つかった未登録の構成枠をＧ＋１番
目の行の先頭となるべき構成枠ｙとして指定するととも
に、ｉ＝０に設定し（ステップ６１６）、図６ａのステ
ップ６０６〜６１０の１行の抽出処理へ移る。なお、ス
テップ６１３においてｉ＜Ｎでないと判定されたとき、
すなわち未登録の構成枠がなくなったときは行の抽出処
理を終了する。That is, in the line extraction process, the process shown in FIGS.
As shown in b, first, the total number of constituent frames is set to a variable N (step 601). The Y coordinates of the center points of all the constituent frames are obtained and stored in the array CB (step 602). Search for the one with the maximum height in all the rectangular frames, and set the height to 1
/ 2 is the value of the error range threshold T _h (step 60)
3). Next, the array CB of Y coordinates of the character blocks is sorted in ascending order (step 604). Then, i = G = 0, y
= CB _i and clear the row array (step 60)
5). Next, the Y coordinate CB _i of one constituent frame from the array CB
Is taken out and it is judged whether or not it has been registered in the row array (step 607). For the unregistered component frame CB _i , it is judged whether or not the distance from y is within the threshold value T _h.
CB _i −y | <T _h is determined (step 60
8) If the distance from y is within the threshold value T _h , the configuration frame corresponding to CB _i is stored in the row array (step 609). When the character block has been registered, and when the distance from y is not within the range of the threshold value T _h ,
Set i = i + 1 to retrieve the next character block (step 610). Judgment and registration processing in the same row array for the new character block taken out (step 60)
7 to 609). When the process progresses and i <N is not satisfied (determination in step 606), the extraction process for one row ends, and the process proceeds to the flow of FIG. 6b to perform the extraction process for the next row. The contents of the row array are output as the Gth row information (step 611). Next, the row array is cleared, and i = 0 and G = G + 1 are set (step 61).
2). Then, the constituent frame that should be the head of the (G + 1) th row is searched. That is, the composition frames are taken out one by one from the beginning, and it is judged whether or not the composition frames are already registered in any of the rows (step 614), and the first unregistered composition frame found should be the head of the G + 1th row. The frame y is designated, and i = 0 is set (step 616), and the process proceeds to the extraction processing of one row in steps 606 to 610 of FIG. 6a. When it is determined in step 613 that i <N is not satisfied,
That is, when there is no unregistered component frame, the line extraction process is terminated.

【００２２】列抽出処理は、図７ａおよび図７ｂに示す
通りであり、行抽出処理とは行と列とが入れ替わりって
いる点を除けばほぼ同様の処理を行う。すなわち、全て
の構成枠の中心点のＸ座標を配列ＣＢに格納し（ステッ
プ７０２）、昇順にソートする（ステップ７０４）。全
矩形枠の中で最大の幅の１／２を誤差範囲の閾値Ｔ_wの
値とし（ステップ７０３）、ｉ＝Ｇ＝０、ｘ＝ＣＢ_i
に設定し、行配列をクリアする（ステップ７０５）。次
に、配列ＣＢに格納された構成枠ＣＢ_iを一つずつ取り
出し、登録されていない構成枠ＣＢ_iにたいしては、ｘ
との距離が誤差範囲の閾値Ｔ_w以内の範囲にあるか否か
を判定し（ステップ７０７〜７０８）、誤差範囲内にあ
った場合はＣＢ _iに対応する構成枠を列配列に格納する
（ステップ７０９）。一つの列に対する抽出処理が終了
したら、次の列の抽出処理を行うため図７ｂのフローへ
進む。次に、次の列の先頭となるべき未登録の最初の構
成枠を探し（ステップ７１４〜７１５）、見つかった
ら、図７ａのステップ７０６〜７１０の１列の抽出処理
へ移る。未登録の構成枠がなくなったときは列の抽出処
理を終了する。The column extraction process is shown in FIGS. 7a and 7b.
It ’s the same as the row extraction process
Except for the fact that it is present, almost the same processing is performed. That is, all
The X coordinate of the center point of the component frame of is stored in the array CB (step
702) and sort in ascending order (step 704). all
½ of the maximum width in the rectangular frame is the threshold T of the error range_wof
Value (step 703), i = G = 0, x = CB_i
To clear the row array (step 705). Next
And the configuration frame CB stored in the array CB._iTake one by one
Outgoing and unregistered component frame CB_iFor x
The distance T is the threshold T of the error range_wWhether it is within the range
Is determined (steps 707 to 708) and is within the error range.
CB if _iStore the configuration frame corresponding to to the column array
(Step 709). Extraction process for one column ends
Then, to perform the extraction process of the next column, go to the flow of FIG. 7b.
move on. Next, the first unregistered structure that should be the beginning of the next column.
I searched for a frame (steps 714-715) and found it
Et al., The extraction process of one column in steps 706 to 710 of FIG. 7a.
Move to. Column extraction process when there are no unregistered components
End the reason.

【００２３】行抽出処理部１２１および列抽出処理部１
２２の処理により、図４の（ｄ）（ｅ）に示すように構
成枠は行と列にグループ化される。その出力データは、
例えば、構成枠を表す識別番号に、行番号と列番号を与
えた形式で表構造記憶部１２３に記憶され、任意のシス
テム例えばワープロで利用可能な状態となる。Row extraction processing section 121 and column extraction processing section 1
By the processing of 22, the constituent frames are grouped into rows and columns as shown in (d) and (e) of FIG. The output data is
For example, it is stored in the table structure storage unit 123 in a format in which the row number and the column number are given to the identification number representing the configuration frame, and the system can be used in an arbitrary system such as a word processor.

【００２４】以上に説明したように、この第１の実施例
は、文字ブロック抽出部１１で抽出した文字ブロックを
表の構成枠とし、その並びにより行および列からなる表
の構造を抽出するので、図４の（ａ）に示すような全く
罫線のない表であっても、表構造を認識することができ
る。なお、罫線のある表の場合でも、この第１の実施例
により同様に文字ブロックのみに基づいて表構造を認識
することができる。As described above, according to the first embodiment, the character blocks extracted by the character block extraction unit 11 are used as a table configuration frame, and the table structure including the rows and columns is extracted. The table structure can be recognized even in a table having no ruled lines as shown in FIG. Even in the case of a table having ruled lines, the table structure can be recognized based on only the character blocks by the first embodiment.

【００２５】（第２の実施例）図８は本発明の第２の実
施例の構成を示す図である。この実施例の表認識装置
は、罫線を基に表における文字領域を抽出し、その文字
領域内で文字ブロックを抽出し、抽出した文字ブロック
の配置状態を調べて表の構造を認識するものであって、
図１に示す第１の実施例の構成と同様に、表画像から文
字ブロックを抽出する文字ブロック抽出部８１と、文字
ブロック抽出部８１により抽出された文字ブロック相互
の位置関係を識別し表構造を表すデータを生成する位置
関係識別部８２とからなる基本構成を備えている。そし
て、この第２の実施例は、第１の実施例とは、文字ブロ
ック抽出部８１の構成が異なり、文字矩形抽出処理部８
１３の前段に、罫線ベクトル化処理部８１１および文字
領域抽出処理部８１２からなる文字領域を求めるための
手段が付加されている。その罫線ベクトル化処理部８１
１は、表中の文字と罫線を分離して、罫線をベクトル化
するものである。また、文字領域抽出処理部８１２は、
罫線ベクトル化処理部８１１により得られた罫線のベク
トルデータを基に文字が書かれているべき矩形領域を文
字領域として抽出するものである。(Second Embodiment) FIG. 8 is a diagram showing the configuration of the second embodiment of the present invention. The table recognition device of this embodiment is for recognizing the structure of a table by extracting a character area in a table based on a ruled line, extracting a character block in the character area, and checking the arrangement state of the extracted character block. There
Similar to the configuration of the first embodiment shown in FIG. 1, a character block extraction unit 81 for extracting character blocks from a table image and a positional relationship between the character blocks extracted by the character block extraction unit 81 are identified to form a table structure. And a positional relationship identifying unit 82 that generates data representing The second embodiment is different from the first embodiment in the configuration of the character block extraction unit 81, and the character rectangle extraction processing unit 8 is different.
A unit for determining a character area including a ruled line vectorization processing unit 811 and a character area extraction processing unit 812 is added to the preceding stage of 13. The ruled line vectorization processing unit 81
1 is to separate the characters and ruled lines in the table and vectorize the ruled lines. Further, the character area extraction processing unit 812
Based on the ruled line vector data obtained by the ruled line vectorization processing unit 811, a rectangular area in which a character should be written is extracted as a character area.

【００２６】表の罫線だけをベクトル化するには表を構
成する線の部分と文字の部分とに分ける必要がある。こ
の分離処理は、図形中の文字と線分を分離する処理と同
様の既存の手法を用いることができる。なお、本願出願
人が先に特許出願した特願平３−２９０２９９号「文字
／図形分離装置」（発明者清水昇）の技術を用いた場
合は、誤りの少ない正確な分離処理をより高速に行うこ
とができる。その文字／図形分離装置について簡単に説
明する。これは、図９に示すように、入力画像における
各黒画素塊の二以上の特徴を抽出する特徴抽出部９１
と、その特徴抽出手段９１の特徴抽出結果を利用して初
期クラスタ中心を求める初期クラスタ中心決定部９２
と、特徴抽出部９１の特徴抽出結果と初期クラスタ中心
決定部９２の決定結果とを利用してクラスタリングする
ことにより領域の判定を行う領域判定部９３とを備えて
いる。各黒画素塊の特徴量としては、たとえば黒画素塊
の面積、偏平率、輪郭線の複雑さなどを用いることがで
きる。特徴抽出部９１でこのような特徴量が抽出される
と、次に初期クラスタ中心決定部９２は、抽出された黒
画素塊の特徴量の分布を用いて初期クラスタの中心を求
める。領域判定部９３は、抽出された黒画素塊の２以上
の特徴量に対して、初期クラスタ中心決定部９２により
求められた初期クラスタ中心を用いて、クラスタリング
を行って各黒画素塊の属すべき領域を判定する。In order to vectorize only the ruled lines of the table, it is necessary to divide the table into lines and characters. For this separation processing, an existing method similar to the processing for separating a character and a line segment in a figure can be used. It should be noted that, when the technique of the Japanese Patent Application No. 3-290299 “Character / figure separation device” (inventor Noboru Shimizu), which was previously filed by the applicant of the present application, is used, accurate separation processing with few errors can be performed faster. It can be carried out. The character / graphic separation device will be briefly described. As shown in FIG. 9, this is a feature extraction unit 91 that extracts two or more features of each black pixel block in the input image.
And an initial cluster center determination unit 92 that obtains an initial cluster center using the feature extraction result of the feature extraction means 91.
And a region determination unit 93 that determines a region by performing clustering using the feature extraction result of the feature extraction unit 91 and the determination result of the initial cluster center determination unit 92. As the feature amount of each black pixel block, for example, the area of the black pixel block, the flatness ratio, the complexity of the contour line, or the like can be used. When such a feature amount is extracted by the feature extraction unit 91, the initial cluster center determination unit 92 then obtains the center of the initial cluster using the distribution of the extracted feature amounts of the black pixel blocks. The area determination unit 93 uses the initial cluster centers determined by the initial cluster center determination unit 92 for two or more feature amounts of the extracted black pixel clusters, and should belong to each black pixel cluster. Determine the area.

【００２７】分離された表の罫線の領域は、２値画像
を、端点、折れ線、交差点、分岐点などの特徴点を始点
および終点とするベクトルデータに変換する。このベク
トルデータに変換する方法は既存の技術（例えば、信学
技報ＰＲＬ８３−８、ＰＲＬ８５−２４、ＰＲＬ８６−
８９、特開平２−２１０５８６号公報、特開平２−１０
５２６５号公報等参照）を用いればよいのでここでは説
明を省略する。The ruled line area of the separated table converts the binary image into vector data having start points and end points of characteristic points such as end points, polygonal lines, intersections, and branch points. The method for converting this vector data is based on existing technology (for example, Technical Report PRL83-8, PRL85-24, PRL86-).
89, JP-A-2-210586, JP-A-2-10
5265, etc.) may be used, and the description thereof is omitted here.

【００２８】図１０ａおよび図１０ｂは文字領域抽出処
理部８１２の抽出処理のフローを示す図である。罫線ベ
クトル化処理部８１１で得られた罫線ベクトルを縦罫線
ＶＲと横罫線ＨＲに分け（ステップ１００１）、それぞ
れをカウントして、縦罫線の数をＶに格納し、横罫線を
Ｈに格納する（ステップ１００２）。横罫線の有無を判
定し（ステップ１００３）、横罫線がなければ文字領域
数Ｒを１に設定し、領域の大きさを入力画像の大きさと
する（ステップ１００９）。横罫線があれば文字領域数
ＲをＨ−１に設定し、ｉを０に設定する（ステップ１０
０４）。次に、横罫線をＹ座標の昇順にソートする（ス
テップ１００５）。そして、Ｙ座標の小さい順から文字
領域に番号を割り付けて行く。すなわち、ｉ番目の横罫
線とｉ＋１番目の横罫線で区切られる領域をｉ番目の文
字領域とする（ステップ１００７）。ｉ＜Ｒでなくなっ
たら（ステップ１００６）、番号の割り付けが終わり、
図１０ｂに示す垂直方向の罫線による文字領域の処理に
移る。FIGS. 10a and 10b are diagrams showing the flow of the extraction processing of the character area extraction processing unit 812. The ruled line vector obtained by the ruled line vectorization processing unit 811 is divided into vertical ruled lines VR and horizontal ruled lines HR (step 1001), each is counted, the number of vertical ruled lines is stored in V, and the horizontal ruled lines are stored in H. (Step 1002). The presence / absence of a horizontal ruled line is determined (step 1003). If there is no horizontal ruled line, the number of character areas R is set to 1 and the size of the area is set as the size of the input image (step 1009). If there is a horizontal ruled line, the number of character areas R is set to H-1, and i is set to 0 (step 10).
04). Next, the horizontal ruled lines are sorted in ascending order of the Y coordinate (step 1005). Then, numbers are assigned to the character areas in ascending order of the Y coordinate. That is, the area delimited by the i-th horizontal ruled line and the i + 1-th horizontal ruled line is set as the i-th character area (step 1007). When i <R is not satisfied (step 1006), the number assignment is completed,
The process moves to the processing of the character area by the vertical ruled lines shown in FIG. 10b.

【００２９】縦罫線の有無を判定し（ステップ１０１
０）、縦罫線がなければ文字領域数Ｒを１に設定し、領
域の大きさを入力画像の大きさとする（ステップ１０１
８）。縦罫線があれば、図１０ａの処理で求めた横罫線
による文字領域数Ｒの内容をＲ１に移し、ＲにはＲ＋Ｖ
−１を設定し、ｉ，ｊ，ｋをそれぞれ０に設定する（ス
テップ１０１１）。次に、縦罫線をＸ座標の昇順にソー
トする（ステップ１０１２）。そして横罫線で区切られ
た各領域ごとに、縦罫線で区切られた領域を求めて行く
（ステップ１０１４〜１０１７）。すなわち、横罫線で
区切られたｊ番目の領域について、ｉ番目とｉ＋１番目
の縦罫線で区切られる領域をｋ番目の文字領域とする
（ステップ１０１３）。この番号付けを順次ｉおよびｋ
を１ずつ増加しながら、ｊ番目の領域に縦罫線で区切ら
れた未処理の領域がなくなったと判定されるまで、繰り
返す（ステップ１０１４，１０１５）。そして、つぎの
横罫線で区切られた領域について処理するため、ｊを１
だけ増加させるとともにｉを０にクリアする。そして横
罫線で区切られた領域の最後のものについて処理が終わ
るまで、すなわちｊ＜Ｒ１ではなくなったと判定される
まで、ステップ１０１３〜１０１７を繰り返す。以上の
ようにして、罫線で区切られた文字領域が抽出され、そ
の結果は文字矩形抽出処理部８１３へ渡される。The presence or absence of vertical ruled lines is determined (step 101
0), if there is no vertical ruled line, the number R of character areas is set to 1 and the size of the area is set as the size of the input image (step 101).
8). If there is a vertical ruled line, the content of the number R of character areas by the horizontal ruled line obtained in the processing of FIG. 10A is transferred to R1, and R + V
-1 is set, and i, j, and k are set to 0 (step 1011). Next, the vertical ruled lines are sorted in ascending order of the X coordinate (step 1012). Then, for each area delimited by the horizontal ruled line, the area delimited by the vertical ruled line is obtained (steps 1014 to 1017). That is, regarding the j-th area delimited by the horizontal ruled lines, the area delimited by the i-th and i + 1-th vertical ruled lines is set as the k-th character area (step 1013). This numbering is sequentially i and k
Is incremented by 1 and repeated until it is determined that there is no unprocessed area delimited by the vertical ruled line in the j-th area (steps 1014 and 1015). Then, in order to process the area separated by the next horizontal ruled line, j is set to 1
And i is cleared to 0. Then, steps 1013 to 1017 are repeated until the processing is completed for the last one of the areas delimited by the horizontal ruled lines, that is, until it is determined that j <R1 is not satisfied. As described above, the character area delimited by the ruled lines is extracted, and the result is passed to the character rectangle extraction processing unit 813.

【００３０】文字矩形抽出処理部８１３以降の処理部の
動作は、基本的には第１の実施例と同じである。ただ、
文字矩形抽出処理および文字ブロック抽出処理は、文字
領域抽出処理部８１２により抽出された文字領域の情報
を用いて行われる。従って、文字矩形の抽出が容易にな
り、しかも確実となるとともに、文字ブロックについて
も、罫線を挟んで近接している文字を一つのブロックと
して検出する誤りがなくなり、文字ブロックを確実に抽
出することができる。The operation of the processing units after the character rectangle extraction processing unit 813 is basically the same as that of the first embodiment. However,
The character rectangle extraction processing and the character block extraction processing are performed using the information of the character area extracted by the character area extraction processing unit 812. Therefore, the extraction of the character rectangle becomes easier and more reliable, and with respect to the character block, there is no error in detecting characters that are close to each other with the ruled line as one block, and the character block can be reliably extracted. You can

【００３１】（第３の実施例）図１１は本発明の第３の
実施例を示すブロック図である。この実施例の表認識装
置は、表画像に含まれる文字部分と罫線部分を分離する
文字・罫線分離処理部１１１０と、文字・罫線分離処理
部１１１０により分離された罫線画像から表を構成する
罫線によって囲まれる矩形枠を抽出する矩形枠抽出部１
１２０と、文字・罫線分離処理部１１１０により分離さ
れた文字画像から表を構成する文字ブロック矩形枠を抽
出する文字ブロック抽出部１１３０と、矩形枠抽出部１
１２０により抽出された矩形枠および文字ブロック抽出
部１１３０により抽出された文字ブロック相互の位置関
係を識別し表の構造を表すデータを作成する位置関係識
別部１１４０と、位置関係識別部１１４０により識別さ
れた表の構造を表すデータを記憶する表構造記憶部１１
４４とを備えている。(Third Embodiment) FIG. 11 is a block diagram showing a third embodiment of the present invention. The table recognition device according to this embodiment includes a character / ruled line separation processing unit 1110 for separating a character portion and a ruled line portion included in a table image, and a ruled line forming a table from the ruled line image separated by the character / ruled line separation processing unit 1110. Rectangular frame extraction unit 1 for extracting a rectangular frame surrounded by
120, a character block extraction unit 1130 for extracting a character block rectangular frame forming a table from the character images separated by the character / ruled line separation processing unit 1110, and a rectangular frame extraction unit 1
The rectangular frame extracted by 120 and the positional relationship identifying unit 1140 that identifies the positional relationship between the character blocks extracted by the character block extracting unit 1130 and creates data representing the structure of the table, and the positional relationship identifying unit 1140 Table structure storage unit 11 for storing data representing the structure of the opened table
44 and.

【００３２】矩形枠抽出部１１２０は、罫線画像をベク
トル化する罫線ベクトル化処理部１１２１と、罫線ベク
トル化処理部１１２１の出力する罫線ベクトルを基に、
罫線ベクトルにより囲まれた完全な矩形枠を抽出する完
全矩形枠抽出処理部１１２２と、罫線の一部が省略され
て矩形枠の一部がない不完全な矩形枠を抽出し足りない
ところを補って矩形枠とする不完全矩形枠抽出処理部１
１２３とを備えている。The rectangular frame extraction unit 1120, based on the ruled line vectorization processing unit 1121 for vectorizing the ruled line image and the ruled line vector output from the ruled line vectorization processing unit 1121,
A complete rectangular frame extraction processing unit 1122 that extracts a complete rectangular frame surrounded by a ruled line vector, and an incomplete rectangular frame in which some of the ruled lines are omitted and there is no part of the rectangular frame is extracted to compensate for the lack. Incomplete rectangular frame extraction processing unit 1 that creates a rectangular frame
And 123.

【００３３】文字ブロック抽出部１１３０は、字領域抽
出処理部１１３１と、文字矩形抽出処理部１１３１と、
文字ブロック矩形抽出処理部矩形抽出部１１とを備えて
いる。文字領域抽出処理部１１３１は、矩形枠抽出部１
１２０により得られた矩形枠により囲まれた領域をそれ
ぞれ文字領域と決定し、文字・罫線分離処理部１１１０
からの文字画像を各文字領域ごとに切り出し、文字矩形
抽出処理部１１３２へ渡す。文字矩形抽出処理部１１３
２は、文字の書かれている画素の塊を囲む矩形領域を求
めるものである。文字ブロック矩形抽出処理部１１３３
は、文字矩形抽出処理部１１３２で求めた各文字矩形間
の距離を求めて、その距離がある閾値より小さな文字矩
形を全て１つの文字ブロックとして統合するものであ
る。The character block extraction unit 1130 includes a character area extraction processing unit 1131, a character rectangle extraction processing unit 1131,
The character block rectangle extraction processing unit rectangle extraction unit 11 is provided. The character area extraction processing unit 1131 is a rectangular frame extraction unit 1.
Each of the areas surrounded by the rectangular frame obtained by 120 is determined as a character area, and the character / ruled line separation processing unit 1110 is executed.
The character image from is cut out for each character area and passed to the character rectangle extraction processing unit 1132. Character rectangle extraction processing unit 113
2 is for obtaining a rectangular area surrounding a block of pixels in which characters are written. Character block rectangle extraction processing unit 1133
Is to obtain the distance between the character rectangles obtained by the character rectangle extraction processing unit 1132 and integrate all the character rectangles whose distance is smaller than a certain threshold value as one character block.

【００３４】また、位置関係識別部１１４０は、矩形枠
抽出部１１２０により抽出された矩形枠および前記文字
ブロック抽出１１３により抽出された文字ブロック相互
の位置関係を識別するものであって、構成枠識別処理部
１１４１と、行抽出処理部１１４２と、列抽出処理部１
１４３と、それらの抽出結果を記憶する表構造記憶部１
１４１からなっている。構成枠識別処理部１１４１は、
前記矩形抽出手段により抽出した表の罫線から構成され
る矩形枠と文字ブロック抽出手段により抽出された文字
ブロック矩形枠から表を構成する構成枠を識別するもの
である。行抽出処理部１１４２は、構成枠識別処理部１
１４１で抽出した表を構成する構成枠の行方向の並びを
識別し、列抽出処理部１１４３は、構成枠識別処理部１
１４１で抽出した表を構成する構成枠の列方向の並びを
識別するものである。The positional-relationship identifying unit 1140 identifies the positional relationship between the rectangular frame extracted by the rectangular-frame extracting unit 1120 and the character blocks extracted by the character block extracting 113. Processing unit 1141, row extraction processing unit 1142, column extraction processing unit 1
143 and a table structure storage unit 1 for storing the extraction results thereof
It consists of 141. The configuration frame identification processing unit 1141
The constituent frame forming the table is identified from the rectangular frame composed of the ruled lines of the table extracted by the rectangle extracting means and the character block rectangular frame extracted by the character block extracting means. The line extraction processing unit 1142 includes the component frame identification processing unit 1
The arrangement in the row direction of the constituent frames forming the table extracted in 141 is identified, and the column extraction processing unit 1143 determines that the constituent frame identification processing unit 1
It identifies the arrangement in the column direction of the constituent frames that make up the table extracted in 141.

【００３５】以上のように構成された本実施例の動作に
ついて説明する。文字・罫線分離処理部１１１０は、図
形中の文字と線分を分離する処理と同様の既存の手法を
用いることができる。なお、第２の実施例において挙げ
た特願平３−２９０２９９号「文字／図形分離装置」の
技術を用いると、誤りの少ない正確な分離処理をより高
速に行うことができる。ここで分離した罫線画像の情報
は矩形枠抽出部１１２０に出力され、文字画像の情報は
文字ブロック抽出部１１３０へ出力される。The operation of this embodiment configured as described above will be described. The character / ruled line separation processing unit 1110 can use an existing method similar to the processing for separating a character and a line segment in a figure. By using the technique of Japanese Patent Application No. 3-290299 "Character / Figure Separation Device" mentioned in the second embodiment, an accurate separation process with few errors can be performed at a higher speed. The information of the ruled line image separated here is output to the rectangular frame extraction unit 1120, and the information of the character image is output to the character block extraction unit 1130.

【００３６】罫線画像は罫線ベクトル化処理部１１２１
でベクトル化される。すなわち、２値画像を、端点、折
れ線、交差点、分岐点などの特徴点を始点および終点と
するベクトルデータに変換する。このベクトルデータに
変換する方法は前掲の既存の技術を用いればよい。変換
された罫線ベクトルデータは、罫線ベクトルにより囲ま
れた完全な矩形枠を抽出する完全矩形枠抽出処理部１１
２２と、罫線の一部が省略されて矩形枠の一部がない不
完全な矩形枠を抽出し足りないところを補って矩形枠と
する不完全矩形枠抽出処理部１１２３とに渡される。The ruled line image is a ruled line vectorization processing unit 1121.
Is vectorized with. That is, the binary image is converted into vector data having characteristic points such as end points, polygonal lines, intersections, and bifurcation points as start points and end points. As a method of converting this vector data, the existing technology described above may be used. The converted ruled line vector data is a complete rectangular frame extraction processing unit 11 for extracting a complete rectangular frame surrounded by the ruled line vector.
22 and an incomplete rectangular frame extraction processing unit 1123 that extracts an incomplete rectangular frame in which a part of the ruled lines is omitted and a part of the rectangular frame is omitted to form a rectangular frame.

【００３７】完全矩形枠抽出処理部１１２２は、罫線ベ
クトルデータを基に罫線ベクトルにより囲まれた完全な
矩形枠を取り出す。図１２ａおよび１２ｂはその処理の
フローチャートである。表の矩形枠は、１つの水平ベク
トルデータの左右に垂直ベクトルデータが接続し、さら
にその下に水平ベクトルデータが接続していることか
ら、各水平ベクトルデータを調べて、条件を満たすベク
トルデータを図１４に示す矩形枠構成表に記入する。ま
ず、表を構成する全てのベクトルデータの数を計数する
（ステップ１２０１）。以下のステップ１２０２からス
テップ１２１２の処理を全てのベクトルデータに対して
適用する。矩形枠の上罫線となる水平ベクトルデータＶ
_iを捜す（ステップ１２０３）。これは、ベクトルデー
タと水平線とのなす角度がある閾値以下であることから
水平なベクトルデータを見つけることができる。ここで
みつけた水平ベクトルデータＶ_iは、ｋ番目の矩形枠の
上罫線となる可能性があるので、矩形枠構成表１４１の
ｋ番目の矩形枠の上罫線の欄にこのベクトルデータＶ_i
を登録する（ステップ１２０４）。次に矩形枠Ｗ_kの右
側の辺を構成するベクトルデータを捜す（ステップ１２
０５）。すなわち、ベクトルデータＶ_iの右端の端点に
接し、かつベクトルデータＶ_iに接していないほうの端
点がベクトルデータＶ_iより下にあるような垂直ベクト
ルデータをみつける処理を行なう。垂直ベクトルデータ
は、垂線とのなす角度がある閾値以下であることから容
易に求めることができる。このステップで見つけたベク
トルデータは矩形枠Ｗ_kの右罫線を構成する可能性があ
るので、矩形枠構成表１４１のｋ番目の矩形枠の右罫線
の欄に登録する（ステップ１２０６）。同様に矩形枠Ｗ
_kの左罫線を捜し（ステップ１２０７）、矩形枠構成表
１４１のｋ番目の矩形枠の左罫線の欄に登録する（ステ
ップ１２０８）。さらに、いま求めた右罫線、左罫線の
下側に接するベクトルデータを見つけ（ステップ１２０
９）、矩形枠構成表１４１のｋ番目の矩形枠の下罫線の
欄に登録する（ステップ１２１０）。以上の処理のう
ち、１つでも罫線が見つからない場合は、矩形枠構成表
１４１のｋ番目の矩形枠のすべての登録を破棄して、他
のベクトルデータで構成される矩形枠を登録できるよう
にリセットする。以上の処理を図１３の表に適用した時
の矩形枠構成表１４１は図１４のようになる。また、他
の例として図１５のような表に対する処理では、矩形枠
構成表１４１は図１６のようになる。さらに、この後の
処理に便利なように矩形枠構成表１４１を、各矩形枠の
左上隅のＸ座標、Ｙ座標と矩形の幅と高さで表す矩形枠
テーブル１７１に書き換える。図１４の矩形枠テーブル
は図１７の（ａ）のようになる。The complete rectangular frame extraction processing unit 1122 extracts a complete rectangular frame surrounded by the ruled line vector based on the ruled line vector data. 12a and 12b are a flow chart of the process. In the rectangular frame in the table, vertical vector data is connected to the left and right of one horizontal vector data, and horizontal vector data is connected below it. Therefore, each horizontal vector data is examined and vector data satisfying the condition is searched. Fill in the rectangular frame configuration table shown in FIG. First, the number of all vector data forming the table is counted (step 1201). The processes of steps 1202 to 1212 below are applied to all vector data. Horizontal vector data V that is the upper ruled line of the rectangular frame
Search for _i (step 1203). This is because the horizontal vector data can be found because the angle between the vector data and the horizontal line is less than or equal to a certain threshold. Since the horizontal vector data V _i found here may become the upper ruled line of the k-th rectangular frame, the vector data V _{i is written in} the column of the upper ruled line of the k-th rectangular frame of the rectangular frame configuration table 141.
Is registered (step 1204). Next, the vector data forming the right side of the rectangular frame W _k is searched (step 12).
05). That is, in contact with the right end of the end point of the vector data V _i, and the end point of more not in contact with the vector data V _i performs the processing of finding the vertical vector data such that below the vector data V _i. The vertical vector data can be easily obtained because the angle formed by the perpendicular is less than or equal to a certain threshold. Since the vector data found in this step may form the right ruled line of the rectangular frame W _k , it is registered in the right ruled line of the k-th rectangular frame of the rectangular frame configuration table 141 (step 1206). Similarly, a rectangular frame W
The left ruled line of _k is searched (step 1207) and registered in the column of the left ruled line of the kth rectangular frame of the rectangular frame configuration table 141 (step 1208). Further, find the vector data that touches the lower sides of the right and left ruled lines that have just been found (step 120).
9), it is registered in the column of the lower ruled line of the kth rectangular frame of the rectangular frame configuration table 141 (step 1210). If at least one ruled line is not found in the above processing, all the registrations of the kth rectangular frame in the rectangular frame configuration table 141 are discarded, and a rectangular frame composed of other vector data can be registered. Reset to. The rectangular frame configuration table 141 when the above processing is applied to the table of FIG. 13 is as shown in FIG. Further, as another example, in the processing on the table shown in FIG. 15, the rectangular frame configuration table 141 becomes as shown in FIG. Further, the rectangular frame configuration table 141 is rewritten to a rectangular frame table 171 represented by the X coordinate, Y coordinate of the upper left corner of each rectangular frame, and the width and height of the rectangle for convenience of the subsequent processing. The rectangular frame table of FIG. 14 is as shown in FIG.

【００３８】不完全矩形枠抽出処理部１１２３は、罫線
の一部が省略されて矩形枠の一部がない不完全な矩形枠
を抽出し足りないところを補って表の矩形枠として取り
出す。図１８ａおよび１８ｂはその処理のフローチャー
トである。まず、完全矩形枠抽出処理部１１２２により
抽出された矩形枠の要素として矩形枠構成表に登録され
ているベクトルデータ以外の未登録ベクトルを抽出する
（ステップ１８０１〜１８０６）。そのために、まず、
Ｎにベクトルデータの総数を設定し、ｉ＝ｋ＝０にクリ
アする（ステップ１８０１）。ベクトルデータＶ_iを取
り出し、矩形枠構成表に登録されているか否かを調べ
（ステップ１８０３）、登録されていなければベクトル
列ＶＶに登録するとともに（ステップ１８０４）、カウ
ンタｋにより計数する（ステップ１８０５）。そして次
のベクトルを取り出すためにｉ＝ｉ＋１とする（ステッ
プ１８０６）。ベクトルデータＶ_iが矩形枠構成表に登
録されていた場合には、そのまま次のベクトルの処理に
移る（ステップ１８０６）。ｉがＮに達したとき、すな
わちすべてのベクトルについて未登録ベクトルの登録処
理が終わったら（ステップ１８０２）、未登録ベクトル
列ＶＶ内で、最も近い２つの端点を結ぶ水平／垂直なベ
クトルを補う（ステップ１８０７）。その補った数をｎ
とする。ベクトルの総数ｋをｋ＋ｎとし、またｉ＝ｍ＝
０にクリアする（ステップ１８０８）。矩形枠の上罫線
となる水平ベクトルデータＶＶ_iを捜す（ステップ１８
１０）。これは、ベクトルデータと水平線とのなす角度
がある閾値以下であることから水平なベクトルデータを
見つけることができる。ここでみつけた水平ベクトルデ
ータＶＶ_iは、ｍ番目の矩形枠Ｗ_mの上罫線となる可能性
があるので、不完全矩形枠構成表のｍ番目の矩形枠の上
罫線の欄にこのベクトルデータＶＶ_iを登録する（ステ
ップ１８１１）。次に矩形枠Ｗ_mの右側の辺を構成する
ベクトルデータを捜す（ステップ１８１２）。すなわ
ち、ベクトルデータＶＶ_iの右端の端点に接し、かつベ
クトルデータＶＶ_iに接していないほうの端点がベクト
ルデータＶ_iより下にあるような垂直ベクトルデータを
みつける処理を行なう。垂直ベクトルデータは、垂線と
のなす角度がある閾値以下であることから容易に求める
ことができる。このステップで見つけたベクトルデータ
は矩形枠Ｗ_mの右罫線を構成する可能性があるので、不
完全矩形枠構成表のｍ番目の矩形枠の右罫線の欄に登録
する（ステップ１８１３）。同様に矩形枠Ｗ_mの左罫線
を捜し（ステップ１８１４）、不完全矩形枠構成表のｍ
番目の矩形枠の左罫線の欄に登録する（ステップ１８１
５）。さらに、いま求めた右罫線、左罫線の下側に接す
るベクトルデータを見つけ（ステップ１８１６）、不完
全矩形枠構成表のｍ番目の矩形枠Ｗ_mの下罫線の欄に登
録する（ステップ１８１７）。以上の処理のうち、１つ
でも罫線が見つからない場合は、不完全矩形枠構成表の
ｍ番目の矩形枠Ｗ_mのすべての登録を破棄して、他のベ
クトルデータで構成される矩形枠を登録できるようにリ
セットする。図２０の（ｂ）は不完全矩形枠構成表の例
を示すもので、これは図１９の表の不完全矩形枠部分を
表すものである。さらに、この後の処理に便利なように
不完全矩形枠構成表を、各矩形枠の左上隅のＸ座標、Ｙ
座標と矩形の幅と高さで表す矩形枠テーブルに書き換え
る。The incomplete rectangular frame extraction processing unit 1123 extracts an incomplete rectangular frame in which a part of the ruled lines is omitted and a part of the rectangular frame is omitted, and the insufficient part is supplemented to extract it as a rectangular frame of the table. 18a and 18b are a flow chart of the process. First, unregistered vectors other than the vector data registered in the rectangular frame configuration table as the elements of the rectangular frame extracted by the complete rectangular frame extraction processing unit 1122 are extracted (steps 1801 to 1806). To do that, first
The total number of vector data is set in N, and i = k = 0 is cleared (step 1801). The vector data V _i is taken out, and it is checked whether or not it is registered in the rectangular frame configuration table (step 1803). If it is not registered, it is registered in the vector column VV (step 1804) and counted by the counter k (step 1805). ). Then, i = i + 1 is set to retrieve the next vector (step 1806). If the vector data V _i is registered in the rectangular frame configuration table, the process moves to the next vector as it is (step 1806). When i reaches N, that is, when the registration processing of the unregistered vector is completed for all the vectors (step 1802), the horizontal / vertical vector connecting the two nearest endpoints in the unregistered vector string VV is complemented ( Step 1807). The supplemented number is n
And Let k + n be the total number of vectors and i = m =
It is cleared to 0 (step 1808). Search for horizontal vector data VV _i which is the upper ruled line of the rectangular frame (step 18).
10). This is because the horizontal vector data can be found because the angle between the vector data and the horizontal line is less than or equal to a certain threshold. Here found horizontal vector data VV _i is, m-th rectangular frame W so on is likely to be a border of _m, the vector data in the column on borders of the m-th rectangular frame incomplete rectangular frame configuration table to register the VV _i (step 1811). Then search for vector data constituting the right side of the rectangular frame W _m (step 1812). That is, in contact with the right end of the end point of the vector data VV _i, and the end point of more not in contact with the vector data VV _i performs the processing of finding the vertical vector data such that below the vector data V _i. The vertical vector data can be easily obtained because the angle formed by the perpendicular is less than or equal to a certain threshold. Since the vector data found in this step may form the right ruled line of the rectangular frame W _m , it is registered in the right ruled line of the m-th rectangular frame in the incomplete rectangular frame configuration table (step 1813). Similarly looking left border of the rectangular frame W _m (step 1814), incomplete rectangular frame configuration table m
It is registered in the left ruled line of the second rectangular frame (step 181).
5). Further, the vector data which touches the lower side of the right ruled line and the left ruled line thus obtained is found (step 1816), and registered in the column of the lower ruled line of the m-th rectangular frame W _m of the incomplete rectangular frame configuration table (step 1817). .. If at least one ruled line is not found in the above processing, all the registrations of the m-th rectangular frame W _{m in} the incomplete rectangular frame configuration table are discarded, and a rectangular frame composed of other vector data is deleted. Reset to allow registration. FIG. 20B shows an example of an incomplete rectangular frame configuration table, which represents the incomplete rectangular frame portion of the table in FIG. Further, the incomplete rectangular frame configuration table is set to the X coordinate, Y coordinate of the upper left corner of each rectangular frame for convenience of the subsequent processing.
Rewrite to the rectangular frame table expressed by coordinates, width and height of rectangle.

【００３９】次に文字ブロック抽出部１１３０の処理に
ついて説明する。文字領域抽出処理部１１３１では、表
の中で罫線で区切られ、文字が書かれているべき矩形を
見つけ文字領域テーブルに登録する。本実施例では、完
全矩形枠抽出部および不完全矩形枠抽出部により矩形枠
が抽出されているので、これを文字領域テーブルに登録
すればよい。図１９の例では、２個の完全矩形枠に囲ま
れた文字領域と、４個の不完全矩形枠内の文字領域とが
得られる。他の例としては、図２１の（ａ）のような罫
線の不足している表に対してこの処理は、図２１の
（ｂ）のように罫線を補い、図２１の（ｃ）のように複
数の文字ブロックを包含する文字領域２１１を抽出す
る。この後の処理は、ここで求めた文字領域ごとに処理
を進める。このように文字領域を得て、文字領域ごとに
文字ブロックの抽出を行うようにすることにより罫線を
またぐような文字ブロックの抽出を防ぐことができる。Next, the processing of the character block extraction unit 1130 will be described. The character area extraction processing unit 1131 finds a rectangle in which a character should be written and which is delimited by ruled lines, and registers it in the character area table. In the present embodiment, since the rectangular frame is extracted by the complete rectangular frame extraction unit and the incomplete rectangular frame extraction unit, it may be registered in the character area table. In the example of FIG. 19, a character area surrounded by two complete rectangular frames and a character area within four incomplete rectangular frames are obtained. As another example, for a table lacking ruled lines as shown in FIG. 21A, this processing is performed by supplementing the ruled lines as shown in FIG. 21B and then as shown in FIG. A character area 211 including a plurality of character blocks is extracted. Subsequent processing proceeds for each of the character areas obtained here. By thus obtaining the character regions and extracting the character blocks for each of the character regions, it is possible to prevent the extraction of the character blocks that straddle the ruled line.

【００４０】次の文字矩形抽出処理１１３２について説
明する。ここでは、文字領域抽出処理部１１３１で求め
た各文字領域に対して、文字の書かれている画素の塊を
囲む矩形領域を求める。すなわち、表の画像が背景の画
素値が０、文字／線の画素値が１で書かれている時、画
素値が１である塊を取り出してその矩形領域を求める。
２つの矩形領域が重なってるときは、図３の（ｂ）のよ
うに２つの矩形領域３７，３８を包含できるような矩形
領域３９で表す。なお、文字の矩形領域を抽出する方法
は、既存の技術であるので詳細な説明は省略する。Next, the character rectangle extraction processing 1132 will be described. Here, for each character area obtained by the character area extraction processing unit 1131, a rectangular area enclosing a block of pixels in which characters are written is obtained. That is, when the table image is written with a background pixel value of 0 and a character / line pixel value of 1, a block having a pixel value of 1 is taken out to obtain its rectangular area.
When the two rectangular areas overlap, they are represented by a rectangular area 39 that can include the two rectangular areas 37 and 38 as shown in FIG. Since the method of extracting the rectangular area of the character is an existing technique, detailed description thereof will be omitted.

【００４１】さらに文字ブロック矩形抽出処理部１１３
３では、文字矩形抽出処理部１１３２で求めた各文字矩
形間の距離を求めて、ある閾値より小さな文字矩形を全
て１つの文字ブロックとして統合する処理を行なう。そ
の処理の詳細は、第１の実施例における文字ブロック矩
形抽出処理部１１２の処理と基本的には同じであり、図
５ａおよび図５ｂのフローチャートに示されている。こ
のフローチャートについては第１の実施例において既に
説明したので、ここでの説明は省略する。ただ、第１の
実施例の場合はブロックに統合するか否かを前記閾値の
みにより判定していたが、本実施例は文字領域の情報を
参照して同じ文字領域にある場合にのみ一つの文字ブロ
ックに統合する。これにより罫線をまたぐような文字ブ
ロックの抽出を防ぐことができる。Further, the character block rectangle extraction processing unit 113
In step 3, the distance between the character rectangles calculated by the character rectangle extraction processing unit 1132 is calculated, and all the character rectangles smaller than a certain threshold are integrated into one character block. The details of the processing are basically the same as the processing of the character block rectangle extraction processing unit 112 in the first embodiment, and are shown in the flowcharts of FIGS. 5a and 5b. Since this flow chart has already been described in the first embodiment, description thereof will be omitted here. However, in the case of the first embodiment, whether or not to integrate into a block is determined only by the threshold value. However, this embodiment refers to the information of the character area and only if the same character area exists. Integrate into a character block. As a result, it is possible to prevent the extraction of character blocks that cross ruled lines.

【００４２】最後に位置関係識別部１１４０の働きにつ
いて説明する。位置関係識別１１４はさらに、構成枠識
別処理部１１４１、行抽出処理１１４２、列抽出処理１
１４３の３つの処理部からなる。構成枠識別処理では、
実際に表の構造を構成する枠は、表の罫線から構成され
る矩形枠なのか、文字ブロックの枠なのかを識別し、選
択する処理である。矩形枠の内部には少なくとも１つ以
上の文字ブロックが存在しているはずなので、矩形枠の
内部にある文字ブロックの数を計数して、２つ以上の文
字ブロックが確認された場合は、文字ブロックを構成枠
として登録し、また、１つの文字ブロックだけが存在す
る場合は、矩形枠を構成枠として登録する。Finally, the operation of the positional relationship identifying section 1140 will be described. The positional relationship identification 114 further includes a component frame identification processing unit 1141, a row extraction processing 1142, a column extraction processing 1
It is composed of three processing units 143. In the component frame identification process,
The frame that actually forms the structure of the table is a process of identifying and selecting whether the frame is a rectangular frame composed of ruled lines of the table or a frame of a character block. Since there should be at least one character block inside the rectangular frame, if the number of character blocks inside the rectangular frame is counted and two or more character blocks are confirmed, the A block is registered as a constituent frame, and when only one character block exists, a rectangular frame is registered as a constituent frame.

【００４３】図２２は上記の構成枠識別処理の詳細を示
すフロー図である。Ｎに完全矩形枠の総数、Ｍに文字ブ
ロックの総数を設定し、完全矩形枠ｗの配列の要素を指
定する変数ｉ、識別した構成枠の配列の要素を指定する
変数Ｃ、各完全矩形枠に含まれる文字ブロックを計数す
る変数ｓをそれぞれ０に設定する（ステップ２２０
１）。文字ブロックＣＢ_jの配列の要素を指定する変数
ｊと変数ｓを０に設定する（ステップ２２０２）。文字
ブロックＣＢ_jを取り出し、完全矩形枠Ｗ_iに含まれるか
否かを判定し（ステップ２２０３）、含まれる場合には
ｓをインクリメントし（ステップ２２０４）、含まれな
い場合には何もしない。そして、次の文字ブロックを取
り出すためｊをインクリメントする（ステップ２２０
５）。以上のステップ２２０４〜２２０５の処理を、順
次、未処理の文字ブロックがなくなったと判定される
（ステップ２２０６）まで繰り返す。このようにして、
ひとつの完全矩形枠について、すべての文字ブロックを
調べ終わったら、その完全矩形枠に含まれる文字ブロッ
クの数ｓが複数あるか否かを判定し（ステップ２２０
７）、複数あっだ場合には、完全矩形枠Ｗ_iに含まれる
ｓ個の文字ブロックを構成枠として登録する（ステップ
２２０８）。ｓ個登録したのでＣをＣ＋ｓとする（ステ
ップ２２０９）。一方、その完全矩形枠Ｗ_iに含まれる
文字ブロックの数ｓが１であったときは、完全矩形枠Ｗ
_iをＣ番目の構成枠として登録し（ステップ２２１
０）、Ｃをインクリメントする（ステップ２２１１）。
以上の処理により、ひとつの完全矩形枠について、関連
する構成枠を識別したら、次に完全矩形枠について同様
の処理を行うためｉ＝ｉ＋１とし、ステップ２２０２に
戻る。ｉ＜Ｎでないとの判定（ステップ２２１３）がな
されると、すべての処理が終了する。図２３の（ａ）お
よび（ｂ）に、構成枠を抽出した結果の一例を示す。同
図（ａ）は表の例、（ｂ）は（ａ）の表から抽出した構
成枠を示す。FIG. 22 is a flow chart showing details of the above-mentioned component frame identification processing. A variable i that specifies the elements of the array of the complete rectangular frame w, where N is the total number of complete rectangular frames and M is the total number of character blocks, variable C that specifies the elements of the identified constituent frame array, and each complete rectangular frame Each variable s for counting the character blocks included in the is set to 0 (step 220).
1). The variables j and s that specify the elements of the array of the character block CB _j are set to 0 (step 2202). The character block CB _j is extracted, and it is determined whether or not it is included in the complete rectangular frame W _i (step 2203). If it is included, s is incremented (step 2204), and if it is not included, nothing is done. Then, j is incremented to fetch the next character block (step 220).
5). The above steps 2204 to 2205 are sequentially repeated until it is determined that there are no unprocessed character blocks (step 2206). In this way
When all the character blocks have been checked for one complete rectangular frame, it is determined whether or not there are a plurality of character blocks s included in the complete rectangular frame (step 220).
7) If there are a plurality of them, s character blocks included in the complete rectangular frame W _i are registered as constituent frames (step 2208). Since s pieces have been registered, C is set to C + s (step 2209). On the other hand, when the number s of character blocks included in the complete rectangular frame W _i is 1, the complete rectangular frame W _i
_i is registered as the Cth component frame (step 221).
0) and C are incremented (step 2211).
Through the above processing, when the related constituent frame is identified for one complete rectangular frame, i = i + 1 is set to perform similar processing for the complete rectangular frame, and the process returns to step 2202. When it is determined that i <N is not satisfied (step 2213), all the processes are completed. 23A and 23B show an example of the result of extracting the constituent frames. FIG. 10A shows an example of the table, and FIG. 9B shows the constituent frames extracted from the table of FIG.

【００４４】行抽出処理部１１４２と列抽出処理部１１
４３では、構成枠識別処理部１１４１で抽出した表を構
成する枠の並びを識別する。すなわち、全ての構成枠の
中心点の座標を求め、行抽出処理部１１４２では構成枠
の中心点のＹ座標がある誤差範囲内に並んでいる構成枠
を表の行と識別し、列抽出処理１１４３では構成枠の中
心点のＸ座標がある誤差範囲内に並んでいる構成枠を表
の列と識別する。この処理の詳細は、行抽出処理フロー
を図６ａおよび図６ｂに示し、列抽出処理フローを図７
ａおよび図７ｂに示す。これらの処理の詳細な説明は、
第１の実施例により説明したところと同じである。行抽
出処理１５２、列抽出処理１５３の結果を図２４と図２
５に示す。Row extraction processing section 1142 and column extraction processing section 11
At 43, the arrangement of the frames forming the table extracted by the configuration frame identification processing unit 1141 is identified. That is, the coordinates of the center points of all the constituent frames are obtained, and the row extraction processing unit 1142 identifies the constituent frames in which the Y coordinates of the central points of the constituent frames are arranged within a certain error range as a row of the table, and performs the column extraction processing. In 1143, the constituent frames in which the X coordinates of the center points of the constituent frames are arranged within a certain error range are identified as columns in the table. The details of this processing are shown in FIG. 6A and FIG. 6B for the row extraction processing flow and in FIG.
a and FIG. 7b. For a detailed description of these processes,
This is the same as that described in the first embodiment. The results of the row extraction processing 152 and the column extraction processing 153 are shown in FIGS.
5 shows.

【００４５】このような処理を行なった後に、抽出した
行と列の並びで順に番号付けを行ない、この番号付けに
従って、ワープロの表のデータを記述することで、画像
として入力した表をワープロで編集が可能な表に変換す
ることが可能である。また、構成枠を用いて文字を切り
出すことにより、図２４の（ａ）に示した表のように表
の両脇に罫線が不足している場合、図２４の（ｂ）のよ
うに表の第２列目と第３列目が罫線からなる矩形枠で、
残りは文字ブロックであるような構成枠が抽出できる。
線の不足している表も容易に文字認識装置へ入力するこ
とも可能となる。たとえば、図２４の（ａ）に示した表
のように表の両脇に罫線が不足している場合、図２４の
（ｂ）のように表の第２列目と第３列目が罫線からなる
矩形枠で、残りは文字ブロックであるような構成枠が抽
出できる。また、図２５の（ａ）に示した表のように縦
の罫線が全てと横罫線の一部が省略されている場合に
は、同図（ｂ）のように構成枠は全て文字ブロックとな
る。また、図２６の（ａ）に示した表のように表の縦罫
線、横罫線の一部が省略されている場合、表の中に罫線
からなる矩形枠を抽出することができるが、さらにその
内部に複数の文字ブロックを含んでいるために、図２６
の（ｂ）のようにその構成枠は全て文字ブロックとな
る。図２４、図２５、図２６に示したようにどのタイプ
の表に対しても、本実施例は表の行と列の構造を正確に
取り出すことができる。本実施例では少なくとも罫線が
書かれている表を対象として説明したが、同様の処理を
行なうことにより、罫線をまったく含まない表に対して
も適用しうるものである。さらに、明示的に表として書
かれていない文章、たとえば箇条書の文書に対して、表
としての構造を付加することも可能である。After performing such processing, the extracted rows and columns are numbered in order, and the data of the word processor table is described according to this numbering, so that the table input as an image can be written in the word processor. It is possible to convert it into a table that can be edited. Further, when the characters are cut out by using the configuration frame, when the ruled lines are insufficient on both sides of the table as shown in the table of FIG. 24A, the table of FIG. The second and third columns are rectangular frames with ruled lines,
The rest can be extracted as constituent blocks that are character blocks.
It is also possible to easily input a table lacking lines into the character recognition device. For example, when there are insufficient ruled lines on both sides of the table as shown in the table of FIG. 24 (a), the second and third columns of the table have ruled lines as shown in FIG. 24 (b). It is possible to extract a constituent frame which is a rectangular frame consisting of, and the rest is a character block. Further, when all the vertical ruled lines and part of the horizontal ruled lines are omitted as in the table shown in FIG. 25A, the constituent frames are all character blocks as shown in FIG. 25B. Become. Further, when a part of the vertical ruled lines and the horizontal ruled lines of the table is omitted as in the table shown in FIG. 26A, a rectangular frame composed of ruled lines can be extracted in the table. Due to the fact that it contains multiple character blocks within it, FIG.
As shown in (b), all the constituent frames are character blocks. For any type of table as shown in FIGS. 24, 25, and 26, the present embodiment can accurately extract the row and column structure of the table. In the present embodiment, at least the table in which the ruled lines are written has been described as an object, but by performing the same processing, the present invention can be applied to a table including no ruled lines at all. Furthermore, it is possible to add a structure as a table to a sentence that is not explicitly written as a table, for example, a document of a clause.

【００４６】[0046]

【発明の効果】本発明によれば、文字ブロック抽出手段
により抽出された文字ブロック相互の位置関係を位置関
係識別手段により識別する。表における文字ブロックは
表の構成要素として一般に規則正しく整列した位置関係
にあるので、文字ブロック相互の位置関係を見ることに
より表の構造を認識できる。従来は、表の罫線のみに着
目して表を構成する枠を求めていたので、縦罫線、横罫
線の一部または全部に省略のあるような表の構造を正確
に認識することができないという問題があったが、本発
明によれば文字ブロックの並びを用いて表の構造を認識
するので、その問題は解消できる。According to the present invention, the positional relationship between the character blocks extracted by the character block extracting means is identified by the positional relationship identifying means. Since the character blocks in the table generally have a positional relationship in which they are regularly arranged as constituent elements of the table, the structure of the table can be recognized by looking at the positional relationship between the character blocks. In the past, since the frame forming the table was sought by paying attention only to the ruled lines of the table, it is impossible to accurately recognize the structure of the table in which some or all of the vertical ruled lines and the horizontal ruled lines are omitted. Although there is a problem, according to the present invention, since the structure of the table is recognized by using the arrangement of the character blocks, the problem can be solved.

【００４７】また、本発明の文字領域抽出手段を設けた
態様によれば、文字領域抽出手段により罫線の情報を用
いて文字領域を抽出し、文字領域ごとに文字ブロックを
抽出する。したがって、罫線を挟んで近接した文字矩形
を一つのブロックとして抽出されるおそれはなく、文字
ブロックを精度よく抽出することができ、ひいては表の
構造を正確に認識することができる。According to the aspect in which the character area extracting means of the present invention is provided, the character area extracting means extracts the character area by using the information of the ruled line, and the character block is extracted for each character area. Therefore, there is no possibility that character rectangles that are close to each other with a ruled line in between will be extracted as one block, the character blocks can be extracted with high accuracy, and the structure of the table can be accurately recognized.

【００４８】本発明において、罫線によって囲まれる矩
形枠を抽出する矩形枠抽出手段を設け、位置関係識別手
段において罫線で囲まれた矩形枠と表中の文字ブロック
を同等に扱い各位置関係を識別するようにした態様のも
のにおいては、表の罫線で囲まれた矩形枠と表中の文字
ブロックを同等に扱うことにより、罫線で囲まれていな
い表中の枠であっても、文字ブロックとして表の中の１
つの構成要素であると識別されるので、罫線の一部また
は全部が省略された表も、罫線が全部揃っている表と同
様に正確に認識することができる。In the present invention, a rectangular frame extracting means for extracting a rectangular frame surrounded by ruled lines is provided, and the positional relationship identifying means treats the rectangular frame surrounded by the ruled lines and the character blocks in the table equally and identifies each positional relationship. In such a mode, by treating the rectangular frame surrounded by the ruled lines of the table and the character block in the table equally, even the frame in the table not surrounded by the ruled lines is treated as the character block. 1 in the table
Since it is identified as one component, a table in which some or all of the ruled lines are omitted can be recognized as accurately as a table in which all the ruled lines are complete.

[Brief description of drawings]

【図１】本発明の第１の実施例の構成を示す図FIG. 1 is a diagram showing a configuration of a first exemplary embodiment of the present invention.

【図２】（ａ）〜（ｅ）は文書中で使われる表の例を
示す図2A to 2E are diagrams showing examples of tables used in a document.

【図３】（ａ）および（ｂ）は文字矩形の例を示した
図3A and 3B are diagrams showing an example of a character rectangle.

【図４】（ａ）〜（ｅ）は罫線がすべて省略された表
の認識を説明するための図4A to 4E are views for explaining recognition of a table in which all ruled lines are omitted.

【図５ａ】文字ブロックの抽出処理のフローを示す図FIG. 5a is a diagram showing a flow of character block extraction processing.

【図５ｂ】文字ブロックの抽出処理のフローを示す図
（図５ａの続き）FIG. 5b is a diagram showing a flow of character block extraction processing (continuation from FIG. 5a).

【図６ａ】行抽出処理のフローを示す図FIG. 6a is a diagram showing a flow of a line extraction process.

【図６ｂ】行抽出処理のフローを示す図（図６ａの続
き）FIG. 6b is a diagram showing a flow of line extraction processing (continuation of FIG. 6a).

【図７ａ】列抽出処理のフローを示す図FIG. 7a is a diagram showing a flow of column extraction processing.

【図７ｂ】列抽出処理のフローを示す図（図７ａの続
き）FIG. 7b is a diagram showing a flow of column extraction processing (continuation from FIG. 7a).

【図８】本発明の第２の実施例の構成を示す図FIG. 8 is a diagram showing a configuration of a second exemplary embodiment of the present invention.

【図９】[Figure 9]

【図１０ａ】第２の実施例における文字領域抽出処理
のフローを示す図FIG. 10a is a diagram showing a flow of character region extraction processing in the second embodiment.

【図１０ｂ】第２の実施例における文字領域抽出処理
のフローを示す図（図１０ａの続き）FIG. 10b is a diagram showing a flow of character area extraction processing in the second embodiment (sequel to FIG. 10a).

【図１１】本発明の第３の実施例の構成を示す図FIG. 11 is a diagram showing a configuration of a third exemplary embodiment of the present invention.

【図１２ａ】完全矩形枠抽出処理部の処理フローを示
す図FIG. 12a is a diagram showing a processing flow of a complete rectangular frame extraction processing unit.

【図１２ｂ】完全矩形枠抽出処理部の処理フローを示
す図（図１２ａの続き）FIG. 12b is a diagram showing a processing flow of the complete rectangular frame extraction processing unit (continuation of FIG. 12a).

【図１３】表を構成するベクトルデータの例を示す図FIG. 13 is a diagram showing an example of vector data forming a table.

【図１４】矩形枠構成表の一例を示す図を示す図FIG. 14 is a diagram showing a diagram showing an example of a rectangular frame configuration table.

【図１５】表を構成するベクトルデータの他の例を示
す図FIG. 15 is a diagram showing another example of vector data forming a table.

【図１６】矩形枠構成表の他の例を示す図FIG. 16 is a diagram showing another example of the rectangular frame configuration table.

【図１７】（ａ）矩形枠テーブルおよび（ｂ）文字領
域テーブルの一例を示す図FIG. 17 is a diagram showing an example of (a) a rectangular frame table and (b) a character area table.

【図１８ａ】不完全矩形枠抽出処理のフローを示す図FIG. 18a is a diagram showing a flow of incomplete rectangular frame extraction processing.

【図１８ｂ】不完全矩形枠抽出処理のフローを示す図
（図１８ａの続き）FIG. 18b is a diagram showing a flow of incomplete rectangular frame extraction processing (continuation of FIG. 18a).

【図１９】一部の罫線が省略された表を構成するベク
トルデータの例を示す図FIG. 19 is a diagram showing an example of vector data forming a table in which some ruled lines are omitted.

【図２０】図１９の表に対応する矩形枠構成表の例を
示すもので、（ａ）は完全矩形枠構成表、（ｂ）は不完
全矩形枠構成表をそれぞれ示す図20 shows an example of a rectangular frame configuration table corresponding to the table of FIG. 19, (a) showing a complete rectangular frame configuration table, and (b) showing an incomplete rectangular frame configuration table.

【図２１】文字領域の抽出を説明するための図FIG. 21 is a diagram for explaining extraction of a character area.

【図２２】構成枠識別処理部の処理のフローを示す図FIG. 22 is a diagram showing a processing flow of a component frame identification processing unit.

【図２３】構成枠、行および列の抽出の例を示した図
で、（ａ）は右端の縦罫線が省略された表の例、（ｂ）
は抽出された構成枠、（ｃ）は抽出された行、（ｄ）は
抽出された列をそれぞれ示す。FIG. 23 is a diagram showing an example of extraction of a configuration frame, rows, and columns, (a) is an example of a table in which the vertical ruled line at the right end is omitted, (b)
Indicates an extracted constituent frame, (c) indicates an extracted row, and (d) indicates an extracted column, respectively.

【図２４】構成枠、行および列の抽出の他の例を示し
た図で、（ａ）は左右両端の縦罫線が省略された表の
例、（ｂ）は抽出された構成枠、（ｃ）は抽出された
行、（ｄ）は抽出された列をそれぞれ示す。FIG. 24 is a diagram showing another example of extraction of a configuration frame, rows and columns, where (a) is an example of a table in which vertical ruled lines at the left and right ends are omitted, and (b) is an extracted configuration frame ( c) shows the extracted row, and (d) shows the extracted column, respectively.

【図２５】構成枠、行および列の抽出の他の例を示し
た図で、（ａ）は縦罫線がすべて省略された表の例、
（ｂ）は抽出された構成枠、（ｃ）は抽出された行、
（ｄ）は抽出された例をそれぞれ示す。FIG. 25 is a diagram showing another example of extraction of a configuration frame, rows and columns, (a) is an example of a table in which all vertical ruled lines are omitted,
(B) is the extracted configuration frame, (c) is the extracted line,
(D) shows each extracted example.

【図２６】構成枠、行および列の抽出の他の例を示し
た図で、（ａ）は縦罫線および横罫線の一部が省略され
た表の例、（ｂ）は抽出された構成枠、（ｃ）は抽出さ
れた行、（ｄ）は抽出された列をそれぞれ示す。FIG. 26 is a diagram showing another example of extraction of a configuration frame, rows, and columns, (a) is an example of a table in which some vertical and horizontal ruled lines are omitted, and (b) is an extracted configuration. A frame, (c) shows an extracted row, and (d) shows an extracted column, respectively.

[Explanation of symbols]

１１，８１…文字ブロック抽出部、１１１，８１３…文
字矩形抽出処理部、１１２，８１４…文字ブロック矩形
抽出処理部、１２，８２…位置関係識別部、１２１，８
２１…行抽出処理部、１２２，８２２…列抽出処理部、
１２３，８２３…表構造記憶部、８１１…罫線ベクトル
化処理部、８１２…文字領域抽出処理部、１１１０…文
字・罫線分離処理部、１１２０…矩形枠抽出部、１１２
１…罫線ベクトル化処理部、１１２２…完全矩形枠抽出
処理部、１１２３…不完全矩形抽出処理部、１１３０…
文字ブロック抽出部、１１３２…文字矩形抽出処理部、
１１３３…文字ブロック矩形抽出処理部、１１４０…位
置関係識別部、１１４１…構成枠識別処理部、１１４２
…行抽出処理部、１１４３…列抽出処理部、１１４４…
表構造記憶部、３１，３２，３３，３４…黒画素塊、３
５，３６，３９…文字矩形、１６１…構成枠、１７１…
矩形枠テーブル、１７２…文字領域テーブル２１１…文
字領域。11, 81 ... Character block extraction unit, 111, 813 ... Character rectangle extraction processing unit, 112, 814 ... Character block rectangle extraction processing unit, 12, 82 ... Positional relationship identification unit, 121, 8
21 ... Row extraction processing unit, 122, 822 ... Column extraction processing unit,
123, 823 ... Table structure storage unit, 811 ... Ruled line vectorization processing unit, 812 ... Character area extraction processing unit, 1110 ... Character / ruled line separation processing unit, 1120 ... Rectangular frame extraction unit, 112
1 ... Ruled line vectorization processing unit, 1122 ... Complete rectangular frame extraction processing unit, 1123 ... Incomplete rectangle extraction processing unit, 1130 ...
Character block extraction unit 1132 ... Character rectangle extraction processing unit,
1133 ... Character block rectangle extraction processing unit, 1140 ... Positional relationship identification unit, 1141 ... Configuration frame identification processing unit, 1142
... Row extraction processing unit, 1143 ... Column extraction processing unit, 1144 ...
Table structure storage unit, 31, 32, 33, 34 ... Black pixel block, 3
5, 36, 39 ... Character rectangle, 161, ... Composition frame, 171 ...
Rectangular frame table, 172 ... Character area table 211 ... Character area.

Claims

[Claims]

1. A character block extracting unit for extracting a character block from a table image, and a positional relationship identifying unit for identifying a positional relationship between the character blocks extracted by the character block extracting unit and generating data representing a table structure. A table recognition device having means.

2. A character / ruled line separating means for separating a front image into a character image and a ruled line image is provided, and the character image separated by the character / ruled line separating means is input to the character block extracting means. The table recognition device according to claim 1.

3. The character block extracting means calculates a value of 1 based on a character rectangle extracting means for obtaining a rectangular area surrounding a block of pixels in which characters are written and a distance between the character rectangles obtained by the character rectangle extracting means. The table recognition device according to claim 1, further comprising a character block rectangle extraction unit that integrates the above character rectangles as a character block.

4. The character block rectangle extraction means obtains the distance between the character rectangles obtained by the character rectangle extraction means, and integrates consecutive character rectangle groups with a distance smaller than a certain threshold value into one character block. The table recognition device according to claim 3, wherein the table recognition device is performed.

5. The character block extraction means is a character / ruled line separation means for separating characters in the table from the ruled lines, a ruled line vectorization means for vectorizing the ruled lines separated by the character / ruled line separation means, and a ruled line vectorization. 1 based on the distance between each character rectangle obtained by the character rectangle extraction means and the character area extraction means for extracting a rectangular area in which a character should be written as a character area based on the ruled line vector data obtained by the means.
A character block rectangle extracting means for integrating the above-mentioned character rectangles as a character block is provided.
The described table recognition device.

6. The positional relationship identifying means regards the character block rectangle extracted by the character block extracting means as a constituent frame of a table, and the row extracting means for identifying the arrangement of the constituent frame in the row direction; and the constituent frame. 2. The table recognition device according to claim 1, further comprising a column extracting means for identifying the arrangement in the column direction.

7. The row extracting means extracts a group of constituent frames having the same y-coordinate of the center of each constituent frame within a predetermined error range as the same row, and the column extracting means extracts each constituent frame. 7. The table recognition device according to claim 6, wherein groups of constituent frames having the same x-coordinate of the center within a predetermined error range are extracted as the same column.

8. A rectangular frame extracting means for extracting a rectangular frame surrounded by ruled lines forming a table from a target table area, a character block extracting means for extracting a character block from the target table area, and the rectangular frame. A table recognition device comprising: a rectangular frame extracted by the extraction means; and a positional relationship identification means for identifying the positional relationship between the character blocks extracted by the character block extraction means.

9. A character / ruled line separating means for separating a front image into a character image and a ruled line image is provided, and the character image separated by the character / ruled line separating means is input to the character block extracting means to extract the rectangle. 9. The table recognition device according to claim 8, wherein the ruled line image separated by the character / ruled line separating means is input to the means.

10. The rectangular frame extraction means is based on a ruled line vectorization means for converting the ruled line image separated by the character / ruled line separation means into vector data, and a connection relationship between the ruled line vectors output by the ruled line vectorization means. And a second rectangular frame extracting means for estimating a rectangular frame in which some of the ruled lines are omitted from a ruled line vector whose one end is not connected to any other ruled line vector. 9. The table recognition device according to claim 8, further comprising means.

11. The character block extracting means obtains a character rectangle extracting means for obtaining a rectangular area surrounding a block of pixels in which characters are written, and a distance between each character rectangle obtained by the character rectangle extracting means, 9. The table recognition device according to claim 8, further comprising character block rectangle extraction means for integrating one or more character rectangles as a character block based on the distance.

12. The character block extracting means obtains the character area extracting means for extracting a rectangular area in which a character is to be written as a character area based on the output of the rectangular frame extracting means, and the character area extracting means. For each character area, one or more character rectangles based on the distance between the character rectangle extracting means for obtaining a rectangular area enclosing a block of pixels in which characters are written and each character rectangle obtained by the character rectangle extracting means. 9. The table recognition device according to claim 8, further comprising a character block rectangle extraction unit that integrates as a character block.

13. The positional relationship identifying means identifies a constituent frame forming a table from a rectangular frame composed of ruled lines of the table extracted by the rectangle extracting means and a character block rectangle extracted by the character block extracting means. And a row extracting means for identifying the arrangement in the row direction of the constituent frames constituting the table extracted by the constituent frame identifying means, and a column direction of the constituent frames constituting the table extracted by the constituent frame identifying means. 9. The table recognition device according to claim 8, further comprising a column extraction unit for identifying a row.

14. The component frame identifying means extracts a character block in the rectangular frame extracted by the rectangular extracting means, and when there are a plurality of character blocks, the plurality of character blocks are extracted. 14. The table recognizing device according to claim 13, wherein each of the above is determined as a constituent frame, and when there is a single character block, a process of determining a rectangular frame as a constituent frame is performed.

15. The line extracting means is arranged at the center y of each constituent frame.
A group of constituent frames whose coordinates are the same within a predetermined error range is extracted as the same row, and the column extracting means of the constituent frames whose x-coordinates of the centers of the respective constituent frames are the same within a predetermined error range. 14. The table recognition device according to claim 13, wherein the groups are extracted as the same column.