CN118467549B - A visual data connection method and device based on data model - Google Patents
- Publication number
- CN118467549B (application CN202410937870.5A)
- Authority
- CN
- China
- Prior art keywords
- data
- tables
- entity
- metadata
- model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2282—Tablespace storage structures; Management thereof
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/26—Visual data mining; Browsing structured data
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
- G06F16/287—Visualization; Browsing
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Technical Field
The present application relates to the field of data processing, and in particular to a visual data connection method and device based on a data model.
Background Art
Existing mainstream data connection tools mainly read the data in data tables, together with related data files, from a source database, convert the table structure and field formats, and then store the result in a target database.
In practice, however, several data tables (entities) generally need to be organized into a data model in order to describe a business object completely.
When ordinary data connection approaches are used to connect data based on a complex data model, several difficulties arise: the human-computer interaction for configuring the connection is complicated, the demands on the user's technical skill are high, and the associations between data tables or data entities are hard to describe fully.
Summary of the Invention
In view of the above technical problems, it is necessary to provide a visual data connection method and device based on a data model. For complex, heterogeneous external data structures, the method generalizes a universal way of describing a connection data model, assists users in building that model, and improves the completeness and convenience of data connection configuration.
A visual data connection method based on a data model, the method comprising:
obtaining and completing the description information of all data tables at the data source end, and marking the table type of each data table, the table types including entity table, relationship table, and dictionary table;
for a business object to be connected, selecting from the data tables at the data source end an entity table that contains identifying information of the business object as the central table of a connection data model, selecting entity tables and dictionary tables that are related to the central table and can describe attributes of the business object and adding them to the connection data model, and configuring the relationship information between the data tables in the connection data model;
storing the data table description information in the connection data model, the description information of the data fields in the data tables, and the relationship information between the data tables into a pre-built multi-dimensional metadata description template to obtain a metadata description model corresponding to the business object; wherein the multi-dimensional metadata description template comprises a data table metadata table, a data field metadata table, and a data table relationship metadata table; the data table metadata table describes the metadata of all data tables in the data model corresponding to a business data object, one business data object corresponding to one data model; the data field metadata table describes the metadata of all data fields contained in each data table of the data model; and the data table relationship metadata table describes the metadata of the relationships between all data tables in the data model;
configuring and designing a connection template in combination with the metadata description model, and completing the data connection based on the connection template.
A visual data connection device based on a data model, the device comprising:
a description information acquisition module, configured to obtain and complete the description information of all data tables at the data source end, and to mark the table type of each data table, the table types including entity table, relationship table, and dictionary table;
a connection data model construction module, configured to, for a business object to be connected, select from the data tables at the data source end an entity table that contains identifying information of the business object as the central table of a connection data model, select entity tables and dictionary tables that are related to the central table and can describe attributes of the business object and add them to the connection data model, and configure the relationship information between the data tables in the connection data model;
a description template filling module, configured to store the data table description information in the connection data model, the description information of the data fields in the data tables, and the relationship information between the data tables into a pre-built multi-dimensional metadata description template to obtain a metadata description model corresponding to the business object; wherein the multi-dimensional metadata description template comprises a data table metadata table, a data field metadata table, and a data table relationship metadata table; the data table metadata table describes the metadata of all data tables in the data model corresponding to a business data object, one business data object corresponding to one data model; the data field metadata table describes the metadata of all data fields contained in each data table of the data model; and the data table relationship metadata table describes the metadata of the relationships between all data tables in the data model;
a connection task execution module, configured to configure and design a connection template in combination with the metadata description model, and to complete the data connection based on the connection template.
A computer device, comprising a memory and a processor, the memory storing a computer program, wherein the processor implements the following steps when executing the computer program:
obtaining and completing the description information of all data tables at the data source end, and marking the table type of each data table, the table types including entity table, relationship table, and dictionary table;
for a business object to be connected, selecting from the data tables at the data source end an entity table that contains identifying information of the business object as the central table of a connection data model, selecting entity tables and dictionary tables that are related to the central table and can describe attributes of the business object and adding them to the connection data model, and configuring the relationship information between the data tables in the connection data model;
storing the data table description information in the connection data model, the description information of the data fields in the data tables, and the relationship information between the data tables into a pre-built multi-dimensional metadata description template to obtain a metadata description model corresponding to the business object; wherein the multi-dimensional metadata description template comprises a data table metadata table, a data field metadata table, and a data table relationship metadata table; the data table metadata table describes the metadata of all data tables in the data model corresponding to a business data object, one business data object corresponding to one data model; the data field metadata table describes the metadata of all data fields contained in each data table of the data model; and the data table relationship metadata table describes the metadata of the relationships between all data tables in the data model;
configuring and designing a connection template in combination with the metadata description model, and completing the data connection based on the connection template.
The visual data connection method and device based on a data model include the following: a connection data model description method is proposed that describes the characteristics of complex data structures from multiple dimensions, achieving comprehensive and complete metadata recording and management for complex business data objects; based on the design of the connection data model, a matching visual modeling method is proposed that supports fast and accurate construction of the connection data model; and a comprehensive connection configuration template is designed that supports automatic generation and execution of connection code from the template's configuration information. The present invention constructs a general, extensible connection data model that does not depend on any particular data management system, so as to accommodate complex, heterogeneous data structures and to keep the data description complete while controlling model complexity. On top of this description method, a dynamic, intuitive, and concise human-computer interaction method is designed to guide users through the data connection work quickly and conveniently.
Brief Description of the Drawings
FIG. 1 is a schematic flowchart of a visual data connection method based on a data model in one embodiment;
FIG. 2 is a schematic diagram of the data connection operating procedure in one embodiment;
FIG. 3 is a schematic diagram of selecting a central table and dragging it to the center of the interface in one embodiment;
FIG. 4 is a schematic diagram of dragging related dictionary tables and entity tables onto the interface and configuring their relationships with the central table in one embodiment;
FIG. 5 is a diagram of the internal structure of a computer device in one embodiment.
Detailed Description
To make the purpose, technical solution, and advantages of the present application clearer, the application is described in further detail below with reference to the drawings and embodiments. It should be understood that the specific embodiments described here serve only to explain the application and do not limit it.
In one embodiment, as shown in FIG. 1, a visual data connection method based on a data model is provided, comprising the following steps:
Step 102: obtain and complete the description information of all data tables at the data source end, and mark the table type of each data table.
The description information of all data tables, mainly the table name and table comment, is obtained from the data source end and can be displayed as a list, yielding a list of source-end table descriptions. In this list, the type of each table is configured manually (mainly one of three types: entity table, relationship table, or dictionary table), and the table description can be supplemented (for example, with the table's Chinese name or a table comment).
"All data tables" here means all data tables that can be obtained from the data source end. For example, if the data source end is the database of an information management system, then once a connection to that database is authorized, the description information of all data tables in the database can be obtained, subject to database permissions.
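Step 102 can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that the data source end is a SQLite database; the table names, the `TableDescription` class, and the `list_source_tables` helper are hypothetical and do not come from the patent. The table type and comment are filled in by hand, mirroring the manual marking step.

```python
import sqlite3
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class TableType(Enum):
    ENTITY = "entity"
    RELATIONSHIP = "relationship"
    DICTIONARY = "dictionary"


@dataclass
class TableDescription:
    name: str
    comment: str = ""                        # completed manually by the operator
    table_type: Optional[TableType] = None   # marked manually by the operator


def list_source_tables(conn: sqlite3.Connection) -> list:
    """Read the names of all user tables visible through this connection."""
    rows = conn.execute(
        "SELECT name FROM sqlite_master "
        "WHERE type = 'table' AND name NOT LIKE 'sqlite_%' ORDER BY name"
    ).fetchall()
    return [TableDescription(name=row[0]) for row in rows]


# Hypothetical source database used only for this sketch.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE aircraft (id INTEGER PRIMARY KEY, name TEXT, type_code TEXT);
    CREATE TABLE aircraft_type (code TEXT PRIMARY KEY, label TEXT);
    CREATE TABLE aircraft_sensor (aircraft_id INTEGER, sensor_id INTEGER);
    """
)
tables = {t.name: t for t in list_source_tables(conn)}

# Manual marking, done here in code instead of in a UI list.
tables["aircraft"].table_type = TableType.ENTITY
tables["aircraft_type"].table_type = TableType.DICTIONARY
tables["aircraft_sensor"].table_type = TableType.RELATIONSHIP
tables["aircraft"].comment = "Basic aircraft information"
```

Other database systems expose their catalogs differently, so the catalog query would need to change for a different source.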
Step 104: for a business object to be connected, select from the data tables at the data source end an entity table that contains identifying information of the business object as the central table of the connection data model, select entity tables and dictionary tables that are related to the central table and can describe attributes of the business object and add them to the connection data model, and configure the relationship information between the data tables in the connection data model.
In real database applications, a business object is often described by more than one entity table. It may have several components and need to be described by multiple associated entity tables together with other types of data tables. For example, describing the attributes of a given aircraft model requires a basic information table for the aircraft, a technical parameter table for the sensors fitted to the aircraft, a technical parameter table for the engines fitted to the aircraft, and a large number of dictionary tables (aircraft type, country, and so on).
Step 106: store the data table description information in the connection data model, the description information of the data fields in the data tables, and the relationship information between the data tables into a pre-built multi-dimensional metadata description template to obtain a metadata description model corresponding to the business object.
The multi-dimensional metadata description template comprises a data table metadata table, a data field metadata table, and a data table relationship metadata table. A data connection should fetch the data of one or more business objects on demand rather than simply migrate the entire content of the data tables. This requires a complete description of the data tables, table relationships, and in-table data elements that correspond to the business object, so as to ensure the integrity and consistency of the connected data. The metadata description is recorded in three metadata tables: the data table metadata table, the data field metadata table, and the data table relationship metadata table. The three tables are described as follows:
The data table metadata table describes the metadata of all data tables in the data model corresponding to a business data object. One business data object corresponds to one data model.
The data field metadata table describes the metadata of all data fields contained in each data table of the data model.
The data table relationship metadata table describes the metadata of the relationships between all data tables in the data model.
The multi-dimensional metadata description model of "table information + field information (within each table) + table relationship information" provides comprehensive and complete metadata recording and management for complex business data objects. It describes the data structure to be connected fully and ensures that the connected data is complete and consistent in its business semantics.
Step 108: configure and design a connection template in combination with the metadata description model, and complete the data connection based on the connection template.
A connection template is the collection of all configuration information involved in the data connection process. Each kind of configuration information is reusable. A visual interface guides the user to select, add, or modify the various kinds of configuration information, which are then combined into a complete connection template. During the connection process, data processing or computation can be carried out based on the template information to support completion of the data connection.
The visual data connection method described above is well suited to scenarios in which the required data content must be connected precisely for a given business data object, and it is highly general and extensible. Provided that the connection data model and connection template are described in detail, connection code can be generated and executed automatically.
It should be understood that although the steps in the flowchart of FIG. 1 are shown in sequence as indicated by the arrows, they are not necessarily executed in that order. Unless explicitly stated herein, there is no strict ordering constraint on these steps, and they may be executed in other orders. Moreover, at least some of the steps in FIG. 1 may comprise multiple sub-steps or stages. These sub-steps or stages are not necessarily completed at the same moment but may be executed at different times, and they are not necessarily executed sequentially but may be executed in turn or alternately with other steps or with at least some of the sub-steps or stages of other steps.
In one embodiment, in the multi-dimensional metadata description template:
each row of the data table metadata table records the identifier, name, description, and table type of one specific data table, the table types including entity table, relationship table, and dictionary table;
each row of the data field metadata table records the identifier, name, owning data table, description, data type, length, precision, dimension (unit of measure), access control level, and format constraint (which may be specified as a regular expression) of one specific data field;
each row of the data table relationship metadata table records the relationship between two specific data tables, namely the identifiers of the two data tables, the identifiers of the data fields through which the two tables reference each other, and the type of relationship between them, the relationship types including one-to-many, many-to-one, one-to-one, and many-to-many.
To accommodate unstructured data, a row of the data table metadata table may record either the description of a structured data table in a database or the overall name and identifier of a group of related files. Such files include unstructured data files such as videos and pictures, and semi-structured data files such as XML or Excel files. The group of related files is treated as one data table, in which each file is an entity with a unique identifier (which may be its access path).
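The three metadata tables can be sketched as plain Python record types. The attribute names below are illustrative assumptions derived from the row contents listed above; the patent does not prescribe a concrete schema. The `satisfies_format` helper is likewise hypothetical and only shows how a regular-expression format constraint might be checked.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class TableMeta:
    table_id: str
    name: str
    description: str
    table_type: str  # "entity", "relationship" or "dictionary"


@dataclass
class FieldMeta:
    field_id: str
    name: str
    table_id: str                        # owning data table
    description: str
    data_type: str
    length: Optional[int] = None
    precision: Optional[int] = None
    unit: Optional[str] = None           # dimension / unit of measure
    access_level: Optional[str] = None
    format_regex: Optional[str] = None   # format constraint


@dataclass
class RelationMeta:
    left_table_id: str
    right_table_id: str
    left_field_id: str
    right_field_id: str
    relation_type: str  # "1:N", "N:1", "1:1" or "N:N"


def satisfies_format(field: FieldMeta, value: str) -> bool:
    """Check a value against the field's regular-expression constraint, if any."""
    if field.format_regex is None:
        return True
    return re.fullmatch(field.format_regex, value) is not None


# A structured table and a group of files, both recorded as "data tables".
aircraft = TableMeta("t1", "aircraft", "Basic aircraft information", "entity")
photos = TableMeta("t2", "aircraft_photos", "Group of image files", "entity")

# Hypothetical field with a hypothetical registration-number format.
tail_no = FieldMeta("f1", "tail_no", "t1", "Registration number", "string",
                    length=6, format_regex=r"[A-Z]-\d{4}")
```

With these definitions, `satisfies_format(tail_no, "B-1234")` holds while `satisfies_format(tail_no, "1234")` does not.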
In one embodiment, selecting from the data tables at the data source end an entity table that contains identifying information of the business object as the central table of the connection data model, selecting entity tables and dictionary tables that are related to the central table and can describe attributes of the business object and adding them to the connection data model, and configuring the relationship information between the data tables in the connection data model comprises:
Selecting from the data tables at the data source end an entity table that contains identifying information of the business object as the central table of the connection data model, and displaying it on the visual interface with a first icon. The central table is dragged onto the interface and may be marked with a circle.
Selecting entity tables that are related to the central table and can describe attributes of the business object as the other entity tables of the connection data model, and displaying them on the visual interface with a second icon. On the visual interface, lines are drawn between the existing entity tables (the central table and the other entity tables), and each line is labeled with the associated fields of the two entity tables it joins. If two entity tables are associated through a relationship table, the line must be labeled with that relationship table and with the fields that associate each of the two entity tables with it. The other entity tables are dragged onto the interface and may be marked with an ellipse. "Existing entity tables" here means the central table already dragged onto the graphical interface and shown with the first icon (which is itself an entity table) plus the other entity tables shown with the second icon. The lines include those between the central table and other entity tables as well as those between the other entity tables themselves.
Selecting dictionary tables that are related to the central table and can describe attributes of the business object as the dictionary tables of the connection data model, and displaying them on the visual interface with a third icon. On the visual interface, lines are drawn between the existing entity tables and the dictionary tables, and each line is labeled with the associated fields of the entity table and dictionary table it joins. The dictionary tables are dragged onto the interface and may be marked with a diamond. Entity tables are linked to dictionary tables, but dictionary tables need not be linked to each other, because a dictionary table only holds the value range of an enumerated field.
Clicking the icon of any table in the diagram shows the definition of each data field in that table, including the field name, field comment, field type, whether it is nullable, and field length. This information may be obtained directly from the data source end and can be further supplemented. Once the graphical configuration of the data model is complete, the connection data model is saved. The description of the connection data model configured on the visual interface is stored in the three metadata tables defined by the connection data model description method.
Each data model has exactly one central table, and multiple data models can be configured on top of the table information of the same data source end. For example, if the data source end is the database of a course scheduling system, one data model can be centered on student information, another on course information, and another on department information. Each data model is configured by following the steps above in order; that is, performing those operations creates a new data model over the full table space of the data source end. In summary, based on the design of the connection data model, a matching visual modeling method is proposed that lets the operator build the connection data model quickly and accurately on a graphical interface by dragging, clicking, and configuring. Icons of different shapes represent data tables, lines represent the relationships between them, and right-click menus on icons and lines support editing and configuring the tables and their relationships. The main operations are configuring data table types, choosing the model's central table, laying out the model's data tables by drag and drop, configuring the format of each data field in each data table, and configuring the relationships between the data tables.
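The modeling rules above (one central table per model, circle, ellipse, and diamond icons, labeled lines, and no lines between dictionary tables) can be captured in a small in-memory structure. The following Python sketch is an illustration under assumptions: the class and method names are hypothetical, and the student, course, enrollment, and gender tables are invented to match the course scheduling example.

```python
from dataclasses import dataclass, field
from typing import Optional

# Icon shape for each role, as suggested in the description.
SHAPES = {"central": "circle", "entity": "ellipse", "dictionary": "diamond"}


@dataclass
class Edge:
    left: str
    right: str
    left_field: str
    right_field: str
    via: Optional[str] = None  # relationship table, if the link goes through one


@dataclass
class ConnectionDataModel:
    central: str
    nodes: dict = field(default_factory=dict)  # table name -> role
    edges: list = field(default_factory=list)

    def __post_init__(self):
        self.nodes[self.central] = "central"

    def add_table(self, name: str, role: str) -> None:
        if role not in ("entity", "dictionary"):
            raise ValueError("only entity and dictionary tables can be added")
        if name == self.central:
            raise ValueError("the central table is already in the model")
        self.nodes[name] = role

    def link(self, left, right, left_field, right_field, via=None) -> None:
        for name in (left, right):
            if name not in self.nodes:
                raise KeyError(name)
        if self.nodes[left] == "dictionary" and self.nodes[right] == "dictionary":
            raise ValueError("dictionary tables are not linked to each other")
        self.edges.append(Edge(left, right, left_field, right_field, via))

    def shape_of(self, name: str) -> str:
        return SHAPES[self.nodes[name]]


# A student-centered model over a hypothetical course scheduling database.
model = ConnectionDataModel(central="student")
model.add_table("course", "entity")
model.add_table("gender", "dictionary")
model.link("student", "course", "student_id", "course_id", via="enrollment")
model.link("student", "gender", "gender_code", "code")
```

A second model centered on `course` would simply be another `ConnectionDataModel` instance built over the same source tables.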
In one embodiment, configuring and designing a connection template in combination with the metadata description model, and completing the data connection based on the connection template, comprises:
Creating a connection task and associating a connection template with it. The connection template contains data source information, data destination information, metadata description model information, connection filtering rules, connection data conversion rules, connection data validation rules, and connection data quality evaluation rules.
Automatically parsing the connection template to execute the connection task.
The connection template (including the connection data model) produced through guided visual configuration holds a large amount of configuration information: data source configuration, data destination configuration, connection data model information, connection filtering rules, connection data conversion rules, connection data validation rules, connection data quality evaluation rules, connection performance parameters, and so on. During actual execution, the template's configuration information can be converted automatically into executable code and run, achieving automatic data connection based on the connection template.
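As a rough illustration of how such a template might be held in memory and reused across tasks, the sketch below models it as a Python dataclass that can be serialized to JSON. Every field name, value, and address here is a hypothetical placeholder; the patent lists the kinds of information a template contains but not a storage format.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class ConnectionTemplate:
    source: dict                 # data source type, address, name, description
    destination: dict            # data destination type, address, name, description
    data_model: str              # name of a saved metadata description model
    filter_rules: list = field(default_factory=list)
    conversion_rules: list = field(default_factory=list)
    validation_rules: list = field(default_factory=list)
    quality_rules: list = field(default_factory=list)
    performance: dict = field(default_factory=dict)


@dataclass
class ConnectionTask:
    name: str
    template: ConnectionTemplate


template = ConnectionTemplate(
    source={"type": "database", "address": "db://source.example/courses"},
    destination={"type": "data warehouse", "address": "db://local.example/dw"},
    data_model="student_model",
    filter_rules=[{"field": "enroll_year", "op": ">=", "value": 2020}],
    performance={"batch_size": 500},
)

# The same template can back several tasks, which is the reuse described above.
nightly = ConnectionTask("nightly_students", template)
backfill = ConnectionTask("backfill_students", template)

serialized = json.dumps(asdict(template), indent=2)
```

Keeping the template as plain data makes it straightforward for a separate component to parse it and generate the executable connection code.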
在一个实施例中,引接模板是指数据引接过程中涉及的所有相关配置信息的集合,包括数据源信息、数据目的信息、引接数据模型信息、引接过滤规则、引接数据转换规则、引接数据校验规则、引接数据质量评价规则、引接性能参数配置信息等。In one embodiment, a connection template refers to a collection of all relevant configuration information involved in the data connection process, including data source information, data destination information, connection data model information, connection filtering rules, connection data conversion rules, connection data verification rules, connection data quality evaluation rules, connection performance parameter configuration information, etc.
Data source information includes the data source type, data source access address, data source name, and data source description. During data connection, the source end may be a database, a data file, a data interface, or a similar form.
Data destination information includes the data destination type, data destination access address, data destination name, and data destination description. During data connection, the destination end is generally a local database or data warehouse.
Connection data model information: the connection data model is an application-oriented description of a data set that combines multiple data entities (tables) at the source end so that they can support information representation and operation. It includes the description of the constituent entities (tables), the relationships between entities (tables), the structure within each entity (table), and the format and description of the data elements (fields) within each entity (table). The connection data model information is the result of the configuration performed in step 104.
Connection filtering rules are built on the central table of the connection data model and are associated with the connection task. By specifying filter conditions on the data fields of the central table, the data content to be connected is filtered out of the central table, and the required data content is then extracted from the related tables using the relationship information between data tables recorded in the metadata description model.
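The filtering step above can be illustrated with a minimal sketch, assuming the source end is a relational database. The table names, field names, and the `RELATIONS` structure are hypothetical stand-ins for the relationship metadata; the patent does not specify how the filter is translated into a query.

```python
import sqlite3

# Hypothetical relationship metadata: (central key field, related table, related key field).
RELATIONS = [("type_id", "aircraft_type", "id")]

def build_filter_query(central_table, conditions, relations):
    """Build a parameterized SELECT that filters the central table and joins related tables.

    Table and field names are assumed to come from trusted metadata, so they are
    interpolated directly; only the filter values are passed as parameters.
    """
    joins = " ".join(
        f"LEFT JOIN {rel_table} ON {central_table}.{key} = {rel_table}.{rel_key}"
        for key, rel_table, rel_key in relations
    )
    where = " AND ".join(f"{central_table}.{field} {op} ?" for field, op, _ in conditions)
    params = [value for _, _, value in conditions]
    sql = f"SELECT * FROM {central_table} {joins}"
    if where:
        sql += f" WHERE {where}"
    return sql, params

# Small in-memory source database used only for demonstration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE aircraft_type (id INTEGER PRIMARY KEY, type_name TEXT);
CREATE TABLE aircraft (id INTEGER PRIMARY KEY, model TEXT, type_id INTEGER, max_speed REAL);
INSERT INTO aircraft_type VALUES (1, 'transport'), (2, 'trainer');
INSERT INTO aircraft VALUES (1, 'A-1', 1, 800.0), (2, 'B-2', 2, 450.0), (3, 'C-3', 1, 600.0);
""")

sql, params = build_filter_query("aircraft", [("max_speed", ">=", 600.0)], RELATIONS)
rows = conn.execute(sql, params).fetchall()
```

Here the condition is applied only to the central table, and the dictionary table is reached through the recorded relationship rather than through a second, independently written filter.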
Connection data conversion rules configure and record the format conversion rules for corresponding data fields at the source and destination ends, and support automatic conversion from the source data format to the destination data format. The data structures at the source and destination ends may differ, for example because the two ends use different types of databases or data files. If a source field has the format number, the destination may need it converted to decimal; if a source field is a string (for example, from Excel), the destination may need it converted to a numeric format.
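As a rough illustration of such conversion rules, the sketch below maps each field to a target format and applies a converter per field. The rule names (`decimal`, `numeric`, `text`) and the `CONVERTERS` table are assumptions made for this example, not formats defined by the patent.

```python
from decimal import Decimal

# Hypothetical converters keyed by destination format name.
CONVERTERS = {
    "decimal": lambda v: Decimal(str(v)),
    # Strings exported from spreadsheets may carry spaces or thousands separators.
    "numeric": lambda v: float(str(v).strip().replace(",", "")),
    "text": str,
}

def convert_record(record, rules):
    """Apply per-field conversion rules; fields without a rule pass through unchanged."""
    out = {}
    for field, value in record.items():
        target = rules.get(field)
        if target is None or value is None:
            out[field] = value
        else:
            out[field] = CONVERTERS[target](value)
    return out

converted = convert_record(
    {"wingspan": 35.8, "max_speed": " 1,200 ", "model": "A-1"},
    {"wingspan": "decimal", "max_speed": "numeric"},
)
```

Keeping the rules as data rather than code is what lets a saved template be reused for another source with the same field formats.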
Connection data verification rules are used to check the conformity, completeness, and consistency of the data after it has been connected to the local end. Once the data is local, it needs to be compiled so that problems in the data can be found and resolved, mainly with respect to conformity, completeness, and consistency. Starting from the table relationships and in-table business rules of the selected data model, the verification rules can be further refined so that data problems are detected automatically.
Connection data quality evaluation rules are used to evaluate the quality of the connected data as a whole in order to decide whether to keep the data. After the data has been connected to the local end, its quality can be evaluated to quantify the overall quality level. When the quality level is poor, the specific causes are analyzed, and the data is recompiled or reconnected accordingly to improve its quality. The quality inspection rules are the basis of quality evaluation; configuring them mainly involves setting the scoring rule and weight for each type of problem.
Connection performance parameter information is used to configure the performance parameters applied when a connection task is executed, including whether multithreading is used and the size of the data cache.
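The template components listed above could be grouped into a single serializable structure. The following is a minimal sketch under that assumption; the class names, field names, addresses, and default values are illustrative and do not come from the patent.

```python
import json
from dataclasses import asdict, dataclass, field

@dataclass
class Endpoint:
    """Source or destination description: type, access address, name, description."""
    kind: str            # e.g. "database", "file", "interface"
    address: str
    name: str
    description: str = ""

@dataclass
class ConnectionTemplate:
    """All configuration needed to run one connection task (hypothetical layout)."""
    source: Endpoint
    destination: Endpoint
    data_model_id: str
    filter_rules: list = field(default_factory=list)
    conversion_rules: dict = field(default_factory=dict)
    validation_rules: list = field(default_factory=list)
    quality_rules: dict = field(default_factory=dict)
    multithreaded: bool = False   # performance parameter
    cache_size_mb: int = 64       # performance parameter

template = ConnectionTemplate(
    source=Endpoint("database", "db.example.invalid:5432/aircraft", "source_db"),
    destination=Endpoint("database", "localhost:5432/warehouse", "local_dw"),
    data_model_id="aircraft_platform",
    filter_rules=[("max_speed", ">=", 600)],
    conversion_rules={"wingspan": "decimal"},
)

# Serializing the template is what makes it storable and reusable across tasks.
serialized = json.dumps(asdict(template))
```

A saved template of this kind can be loaded, adjusted, and associated with a new task instead of being configured from scratch.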
It can be understood that this method is highly general and is applicable to connecting a wide variety of complex, heterogeneous data. By flexibly configuring the connection template information, including the connection data model, the connection data source, and the connection filter conditions, the reusability and extensibility of the connection configuration are improved.
In summary, the present invention constructs a general, extensible connection data model that does not depend on any particular data management system, so as to accommodate complex and heterogeneous data structures while keeping the data description complete and the model complexity under control. Based on this data model description method, a dynamic, intuitive, and concise human-computer interaction method is designed to guide users through the data connection work quickly and conveniently.
In one embodiment, the connection data verification rules mainly include field verification, entity verification, and relationship verification.
Field verification includes non-null checks, value range checks, format checks, and the like.
Entity verification mainly includes completeness checks on the data elements of an entity and duplicate checks on entity data.
Relationship verification mainly includes consistency checks on relationships between entities (including dictionaries), that is, consistency checks on relationships between entity tables and on relationships between entity tables and dictionary tables.
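The three kinds of verification can be sketched as three small functions. This is an illustration only: the rule dictionary layout, the issue tuples, and the sample rows are assumptions, and the entity check shown covers duplicate detection but not element completeness.

```python
import re

def check_fields(rows, rules):
    """Field verification: non-null, value range, and format (regular expression)."""
    issues = []
    for i, row in enumerate(rows):
        for name, rule in rules.items():
            value = row.get(name)
            if value is None:
                if rule.get("required"):
                    issues.append((i, name, "null"))
                continue
            lo, hi = rule.get("range", (None, None))
            if (lo is not None and value < lo) or (hi is not None and value > hi):
                issues.append((i, name, "range"))
            pattern = rule.get("pattern")
            if pattern and not re.fullmatch(pattern, str(value)):
                issues.append((i, name, "format"))
    return issues

def check_duplicates(rows, key):
    """Entity verification: report rows whose key value has already been seen."""
    seen, issues = set(), []
    for i, row in enumerate(rows):
        if row[key] in seen:
            issues.append((i, key, "duplicate"))
        seen.add(row[key])
    return issues

def check_references(rows, field, valid_keys):
    """Relationship verification: each reference must exist in the related or dictionary table."""
    return [(i, field, "dangling") for i, row in enumerate(rows) if row[field] not in valid_keys]

rows = [
    {"model": "A-1", "max_speed": 800, "type_id": 1},
    {"model": "A-1", "max_speed": -5, "type_id": 9},
    {"model": None, "max_speed": 600, "type_id": 2},
]
rules = {
    "model": {"required": True, "pattern": r"[A-Z]-\d+"},
    "max_speed": {"range": (0, 4000)},
}
field_issues = check_fields(rows, rules)
entity_issues = check_duplicates(rows, "model")
relation_issues = check_references(rows, "type_id", {1, 2})
```

Each issue is reported as a (row index, field, problem type) tuple so that the results of all three checks can be merged into one list for later quality scoring.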
In one embodiment, configuring the connection data quality evaluation rules includes setting the scoring rule and weight for each type of quality problem.
Quality problems mainly include:
1) Problems recorded while monitoring the connection process, such as missed data and incorrect data conversion;
2) Problems found during data compilation by applying the connection data verification rules, such as incorrect data formats, incorrect precision of data values, and non-conforming data content;
3) Problems found and recorded after manually inspecting the data structure and content.
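One possible way to combine the three problem sources into a single score is a weighted issue rate. The formula, category names, weights, and acceptance threshold below are all assumptions for illustration; the patent only states that each problem type has a scoring rule and a weight.

```python
def quality_score(issue_counts, total_records, weights):
    """Return a score in [0, 100]: 100 minus the weighted issue rate across categories.

    issue_counts and weights are keyed by problem category. The issue rate of each
    category is capped at 1.0 so that one category cannot push the score below zero.
    """
    if total_records <= 0:
        raise ValueError("total_records must be positive")
    total_weight = sum(weights.values())
    penalty = sum(
        weights[cat] * min(issue_counts.get(cat, 0) / total_records, 1.0)
        for cat in weights
    ) / total_weight
    return round(100.0 * (1.0 - penalty), 2)

# Hypothetical weights for the three problem sources described above.
WEIGHTS = {"process": 5, "validation": 3, "manual": 2}
score = quality_score({"process": 10, "validation": 20, "manual": 0}, 100, WEIGHTS)

# Hypothetical acceptance threshold for keeping the connected data.
keep_data = score >= 80
```

With these numbers the process problems contribute 5 x 0.10 and the validation problems 3 x 0.20 to the penalty, out of a total weight of 10.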
To raise the level of normalization and standardization of connection operations on external data with complex structures, and to ensure data integrity and consistency, a general data connection process specification is proposed on the basis of the data model and connection template design described above. It consists of three main steps: connection task creation and template configuration, connection task execution and process monitoring, and connection data compilation and quality evaluation. See Figure 2.
1) Connection task creation and template configuration
First, create a connection task. If a usable connection template exists, associate it with the task; if no reusable template exists, create a new one.
Configuring a connection template involves configuring the data source information for the source and destination ends, the connection data model information, the connection filter conditions, the connection conversion rules, and so on. The template as a whole, and each configuration step, can be configured quickly by looking up configuration information already saved in the system, selecting it, and adjusting it.
In addition, execution parameters can be configured for the connection task, including scheduled execution and execution performance parameters.
2) Connection task execution and process monitoring
Once the connection task has been configured, it can be executed.
When the task is executed, the finely described connection template is parsed automatically to generate code, and the code is run to complete the data connection process.
During execution, activities such as template verification, template parsing, and connection code execution produce process information, which must be recorded in the connection task log.
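The logging behavior described above can be sketched with Python's standard `logging` module. The step names mirror the three activities mentioned in the text; the `run_task` function, the `ListHandler` class, and the placeholder steps are hypothetical and stand in for whatever the real system does at each stage.

```python
import logging

class ListHandler(logging.Handler):
    """Collects log messages in memory; a real task log would go to a file or table."""
    def __init__(self):
        super().__init__()
        self.records = []

    def emit(self, record):
        self.records.append(record.getMessage())

def run_task(template, steps, log):
    """Run named steps in order, recording process information for each in the task log."""
    for name, step in steps:
        log.info("start %s", name)
        try:
            step(template)
        except Exception:
            log.exception("failed %s", name)
            return False
        log.info("done %s", name)
    return True

def validate(template):
    if "source" not in template:
        raise ValueError("template has no source")

# Parsing and execution are placeholders here; only validation does real work.
STEPS = [
    ("template validation", validate),
    ("template parsing", lambda template: None),
    ("code execution", lambda template: None),
]

log = logging.getLogger("connection_task")
log.setLevel(logging.INFO)
log.propagate = False
handler = ListHandler()
log.addHandler(handler)

ok = run_task({"source": "db"}, STEPS, log)
```

Because every step writes a start and an end entry, the same log can later be scanned for process problems when the data quality is evaluated.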
To monitor the progress of a connection task, a graphical interactive interface may optionally be provided that shows the user the connection progress, data source information, storage location, elapsed time, remaining time, and similar information.
3) Connection data compilation and quality evaluation
After the connection task has been executed and the external data has been saved to the local database, the data must be compiled and its quality evaluated; only data that meets the quality requirements is retained.
First, data verification rules can be set so that the connected data is verified automatically and problems in the data are found. These rules fall mainly into two categories: field verification rules and data table verification rules.
Then, based on the problems found, data processing rules can be defined to perform operations such as batch filling and batch replacement.
Quality problems can also be corrected by manually inspecting and compiling the data; in that case the verification rules can detect the problems automatically and prompt the user.
Finally, the data quality problems obtained and recorded through the various channels are reviewed together: quality problems that occurred during connection, taken from the connection monitoring log files; problems found by the verification rules during compilation; and problems recorded manually during compilation. On this basis the quality of the connected data is evaluated as a whole to decide whether to keep the data.
In summary, on the basis of the data model and connection template design, a general data connection process specification is proposed, covering connection task creation, connection template configuration (and reuse), connection task execution and process monitoring, and connection data compilation and quality evaluation. It supports normalized, standardized connection of external data with complex structures and ensures data integrity and consistency.
The application and effect of the method of the present invention are illustrated below, taking as an example the connection of an aircraft platform and technical parameter data model from a certain system.
1) Analysis of the data structure to be connected
Platform and technical parameter data for certain aircraft models must be connected from a database. The relevant data tables include entity tables, such as the aircraft model information table, the technical parameter table for sensors carried on aircraft, and the technical parameter table for engines fitted to aircraft, as well as dictionary tables such as aircraft type and country. Because aircraft models and sensor models, and aircraft models and engine models, are in many-to-many relationships, dedicated aircraft-sensor and aircraft-engine relationship tables must be added to describe these relationships.
The data tables mentioned above and their data elements are analyzed in Table 1.
Table 1 Composition of the aircraft platform and technical parameter data structure
Note: The data tables and data elements actually used to describe aircraft platforms and technical parameters are fairly complex; the table above lists only part of them as an example.
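The role of the dedicated relationship tables can be shown with a small SQLite sketch. Since the contents of Table 1 are not reproduced here, every table name, column name, and data value below is a hypothetical placeholder chosen only to show the many-to-many pattern.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE aircraft (id INTEGER PRIMARY KEY, model TEXT);
CREATE TABLE sensor (id INTEGER PRIMARY KEY, model TEXT);
-- Relationship table: one row per (aircraft, sensor) pairing.
CREATE TABLE aircraft_sensor (
    aircraft_id INTEGER REFERENCES aircraft(id),
    sensor_id INTEGER REFERENCES sensor(id),
    PRIMARY KEY (aircraft_id, sensor_id)
);
INSERT INTO aircraft VALUES (1, 'A-1'), (2, 'B-2');
INSERT INTO sensor VALUES (10, 'radar-X'), (11, 'camera-Y');
INSERT INTO aircraft_sensor VALUES (1, 10), (1, 11), (2, 10);
""")

def sensors_of(conn, aircraft_model):
    """Follow the relationship table from an aircraft model to its sensor models."""
    rows = conn.execute(
        """
        SELECT s.model
        FROM aircraft a
        JOIN aircraft_sensor x ON x.aircraft_id = a.id
        JOIN sensor s ON s.id = x.sensor_id
        WHERE a.model = ?
        ORDER BY s.model
        """,
        (aircraft_model,),
    ).fetchall()
    return [r[0] for r in rows]
```

In this sample data one sensor model is shared by two aircraft and one aircraft carries two sensors, which is exactly the case a single foreign-key column cannot represent.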
2) Design of the metadata structure for storing data model information
The information of the connection data model is recorded in three metadata tables: the data table metadata table, the data field metadata table, and the data table relationship metadata table, shown in Table 2, Table 3, and Table 4 below.
Table 2 Data table metadata table
Table 3 Data field metadata table
Table 4 Data table relationship metadata table
3) Visual construction of the connection data model
The data model is built on a visual interface. Figures 3 and 4 illustrate the visual configuration process.
In Figure 3, the left side of the visual interface lists the relevant data tables, each marked as an entity table, relationship table, or dictionary table. The aircraft basic information table is dragged onto the canvas as the central table and displayed as a circular icon. Right-clicking the icon allows the relationships between this table and other data tables to be configured.
In Figure 4, the relationships between the central table and two other data tables have been configured. The oval icon represents an entity table (the sensor technical parameter table), the diamond icon represents a dictionary table (the aircraft type table), and each line between icons represents the association between the two tables.
Table description information such as the name and type of the data table represented by each icon is stored in the data table metadata table and is automatically assigned a unique identifier. Information such as the name and format of each data field in those tables is stored in the data field metadata table and is also automatically assigned a unique identifier. The association information on each connecting line, including the unique identifiers of the data tables on both sides and the unique identifiers of the associated data fields, is stored in the data table relationship metadata table.
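The way icons and connecting lines map onto the three metadata tables can be sketched with an in-memory store. The class, its attribute names, and the use of UUIDs as unique identifiers are assumptions for illustration; the actual columns of Tables 2 to 4 are not reproduced in this text.

```python
import uuid

class MetadataStore:
    """In-memory stand-in for the three metadata tables."""

    def __init__(self):
        self.tables = {}      # table id -> table description (data table metadata)
        self.fields = {}      # field id -> field description (data field metadata)
        self.relations = []   # one entry per connecting line (relationship metadata)

    def add_table(self, name, table_type, fields):
        """Register a table icon; return its id and a mapping of field name to field id."""
        table_id = uuid.uuid4().hex
        self.tables[table_id] = {"name": name, "type": table_type}
        field_ids = {}
        for field_name, fmt in fields.items():
            field_id = uuid.uuid4().hex
            self.fields[field_id] = {"table_id": table_id, "name": field_name, "format": fmt}
            field_ids[field_name] = field_id
        return table_id, field_ids

    def add_relation(self, left, right):
        """left and right are (table_id, field_id) pairs from the two ends of a line."""
        self.relations.append({
            "left_table": left[0], "left_field": left[1],
            "right_table": right[0], "right_field": right[1],
        })

store = MetadataStore()
aircraft, a_fields = store.add_table(
    "aircraft_basic_info", "entity", {"id": "integer", "type_id": "integer"}
)
types, t_fields = store.add_table(
    "aircraft_type", "dictionary", {"id": "integer", "name": "text"}
)
store.add_relation((aircraft, a_fields["type_id"]), (types, t_fields["id"]))
```

Storing relations by identifier rather than by name means a table or field can be renamed in the model without breaking the recorded associations.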
4) Data connection based on the connection process specification
The entire data connection process is completed according to the data connection process specification given in the Summary of the Invention.
In one embodiment, a visual data connection device based on a data model is provided, including a description information acquisition module, a connection data model construction module, a description template filling module, and a connection task execution module, wherein:
The description information acquisition module is used to obtain and complete the description information of all data tables at the data source end and to mark the table type of each data table; the table types include entity table, relationship table, and dictionary table;
The connection data model construction module is used, for the business object to be connected, to select from the data tables at the data source end an entity table containing identifying information of the business object as the central table of the connection data model, to select entity tables and dictionary tables that are related to the central table and can describe attributes of the business object and add them to the connection data model, and to configure the relationship information between the data tables in the connection data model;
The description template filling module is used to store the data table description information in the connection data model, the description information of the data fields in the data tables, and the relationship information between the data tables into a pre-built multi-dimensional metadata description template, obtaining the metadata description model corresponding to the business object; wherein the multi-dimensional metadata description template specifically includes a data table metadata table, a data field metadata table, and a data table relationship metadata table; the data table metadata table describes the metadata of all data tables in the data model corresponding to a business data object; one business data object corresponds to one data model; the data field metadata table describes the metadata of all data fields contained in each data table in the data model; the data table relationship metadata table describes the metadata of the relationships between all data tables in the data model;
The connection task execution module is used to configure and design a connection template in combination with the metadata description model and to complete data connection based on the connection template.
For the specific limitations of the visual data connection device based on a data model, refer to the limitations of the visual data connection method based on a data model above, which are not repeated here. Each module in the device may be implemented in whole or in part by software, hardware, or a combination thereof. The modules may be embedded in or independent of a processor in a computer device in hardware form, or may be stored in a memory of the computer device in software form, so that the processor can call and execute the operations corresponding to each module.
In one embodiment, a computer device is provided. The computer device may be a terminal, and its internal structure may be as shown in Figure 5. The computer device includes a processor, a memory, a network interface, a display screen, and an input device connected via a system bus. The processor provides computing and control capabilities. The memory includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for running the operating system and the computer program stored in the non-volatile storage medium. The network interface is used to communicate with external terminals over a network connection. When executed by the processor, the computer program implements a visual data connection method based on a data model. The display screen may be a liquid crystal display or an electronic ink display. The input device may be a touch layer covering the display screen, a button, trackball, or touchpad on the housing of the computer device, or an external keyboard, touchpad, or mouse.
Those skilled in the art will understand that the structure shown in Figure 5 is only a block diagram of the part of the structure related to the solution of the present application and does not limit the computer device to which the solution is applied. A specific computer device may include more or fewer components than shown in the figure, combine certain components, or have a different arrangement of components.
In one embodiment, a computer device is provided, including a memory and a processor. The memory stores a computer program, and the processor implements the steps of the method in the above embodiments when executing the computer program.
In one embodiment, a computer-readable storage medium is provided, on which a computer program is stored. When executed by a processor, the computer program implements the steps of the method in the above embodiments.
Those of ordinary skill in the art will understand that all or part of the processes in the methods of the above embodiments can be completed by a computer program instructing the relevant hardware. The computer program may be stored in a non-volatile computer-readable storage medium and, when executed, may include the processes of the method embodiments described above. Any reference to memory, storage, a database, or other media used in the embodiments provided in this application may include non-volatile and/or volatile memory. Non-volatile memory may include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory may include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDR SDRAM), enhanced SDRAM (ESDRAM), Synchlink DRAM (SLDRAM), Rambus direct RAM (RDRAM), direct Rambus dynamic RAM (DRDRAM), and Rambus dynamic RAM (RDRAM).
The technical features of the above embodiments may be combined arbitrarily. For brevity, not all possible combinations of the technical features in the above embodiments are described; however, as long as a combination of these technical features involves no contradiction, it should be considered within the scope of this specification.
The above embodiments express only several implementations of the present application. Their description is relatively specific and detailed, but it should not be construed as limiting the scope of the invention patent. It should be noted that a person of ordinary skill in the art can make a number of variations and improvements without departing from the concept of the present application, and these all fall within the protection scope of the present application. Therefore, the protection scope of this patent application shall be determined by the appended claims.
Claims (8)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202410937870.5A CN118467549B (en) | 2024-07-12 | 2024-07-12 | A visual data connection method and device based on data model |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN118467549A CN118467549A (en) | 2024-08-09 |
| CN118467549B true CN118467549B (en) | 2024-10-11 |
Family
ID=92167249
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202410937870.5A Active CN118467549B (en) | 2024-07-12 | 2024-07-12 | A visual data connection method and device based on data model |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN118467549B (en) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119917582A (en) * | 2025-04-02 | 2025-05-02 | 南京安夏电子科技有限公司 | Design method of user-oriented visual data model |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN115757689A (en) * | 2022-09-21 | 2023-03-07 | 中国人民解放军军事科学院军事科学信息研究中心 | Information query system, method and equipment |
| CN116991923A (en) * | 2023-06-21 | 2023-11-03 | 中国电子科技集团公司第十五研究所 | Drag type data model visual modeling system and method based on browser |
Family Cites Families (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20210397763A1 (en) * | 2020-06-22 | 2021-12-23 | Honeywell International Inc. | System for dynamic configuration of devices/equipment using templates |
| CN113934820A (en) * | 2021-10-14 | 2022-01-14 | 广州广电运通金融电子股份有限公司 | Visual processing system, method, storage medium and terminal for unstructured data |
| CN114237564A (en) * | 2021-12-20 | 2022-03-25 | 平安不动产有限公司 | Data visualization method, device, equipment and medium |
| CN115827921A (en) * | 2022-12-06 | 2023-03-21 | 中电科大数据研究院有限公司 | Data fusion system, data fusion method and device, computer storage medium |
| CN117389541B (en) * | 2023-12-13 | 2024-02-23 | 中国人民解放军国防科技大学 | Configuration system and device for generating template based on dialogue retrieval |
-
2024
- 2024-07-12 CN CN202410937870.5A patent/CN118467549B/en active Active
Also Published As
| Publication number | Publication date |
|---|---|
| CN118467549A (en) | 2024-08-09 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US9372667B2 (en) | Ontology driven requirements engineering system and method | |
| US9182963B2 (en) | Computerized migration tool and method | |
| US11080305B2 (en) | Relational log entry instituting system | |
| US20120330911A1 (en) | Automatic generation of instantiation rules to determine quality of data migration | |
| US20060184867A1 (en) | Method for reusing definitions in documents and monitoring thereof | |
| CN118467549B (en) | A visual data connection method and device based on data model | |
| US12265949B2 (en) | Dynamically controlling case model structure using case fragments | |
| CN112799718A (en) | A method, device, electronic device and storage medium for generating an enumeration document | |
| US20250156384A1 (en) | Migration platform for legacy database migration with a large language model | |
| Krogstie | Capturing enterprise data integration challenges using a semiotic data quality framework | |
| CN116775685A (en) | A data processing method, task scheduling method, device and storage medium | |
| CN118312175A (en) | Electronic batch record template implementation method based on dynamic form | |
| Calvanese et al. | Extracting event data from document-driven enterprise systems | |
| CN113553821B (en) | A multi-scale business model construction method, system, medium, device, and terminal | |
| US7865335B2 (en) | System and methods for integrating authoring with complexity analysis for computing system operation procedures | |
| CN106021209A (en) | Source data tracking management system and comprehensive compilation management system for technical publications | |
| Mäder | Rule-based maintenance of post-requirements traceability | |
| Eichelberger et al. | A comprehensive survey of UML compliance in current modelling tools | |
| JP6336922B2 (en) | Business impact location extraction method and business impact location extraction device based on business variations | |
| Wautelet et al. | Investigating Quality Attributes in Behavior-Driven Development Scenarios: An Evaluation Framework and an Experimental Supporting Tool | |
| Guerra-García et al. | Developing web applications with awareness of data quality elements–DQAWA | |
| Wiesner et al. | An ontology-based environment for effective collaborative and concurrent process engineering | |
| JP4312485B2 (en) | Parts information management program and parts information management apparatus | |
| CN120066468B (en) | Data development system and method based on template configuration | |
| Cheney | Provenance, XML and the scientific web |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||