
CN106294772B - The buffer memory management method of distributed memory columnar database - Google Patents


Info

Publication number
CN106294772B
Authority
CN
China
Prior art keywords
cache
caching
node
weight
queue
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610659223.8A
Other languages
Chinese (zh)
Other versions
CN106294772A (en)
Inventor
段翰聪
闵革勇
张建
郑松
詹文翰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Electronic Science and Technology of China
Original Assignee
University of Electronic Science and Technology of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Electronic Science and Technology of China filed Critical University of Electronic Science and Technology of China
Priority to CN201610659223.8A priority Critical patent/CN106294772B/en
Publication of CN106294772A publication Critical patent/CN106294772A/en
Application granted granted Critical
Publication of CN106294772B publication Critical patent/CN106294772B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G: PHYSICS
    • G06: COMPUTING OR CALCULATING; COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20: Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24: Querying
    • G06F16/245: Query processing
    • G06F16/2455: Query execution
    • G06F16/24552: Database cache management

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a cache management method for a distributed in-memory columnar database, comprising: establishing a buffer queue in the cache master node; cutting, with each physical task as root node, the physical execution plan in which it resides, to obtain the cache computation trajectory corresponding to each physical task; building a cache feature tree in the cache master node according to the cache computation trajectory of each physical task; when a query request arrives, parsing the SQL statement into a physical execution plan by the query execution engine; traversing each node of the physical execution plan level by level starting from its root node, and judging whether the cache computation trajectory corresponding to each physical task matches the cache feature tree; if it matches, reading the cached actual data of the physical task directly from a cache slave node, and otherwise computing the physical task. The cache management method for a distributed in-memory columnar database provided by the invention rapidly detects whether the cache is hit through an efficient cache matching algorithm, improving query efficiency.

Description

The buffer memory management method of distributed memory columnar database
Technical field
The present invention relates to the field of computer software, and in particular to a cache management method for a distributed in-memory columnar database.
Background technique
With the development of the information age, data volumes are growing explosively, and extracting valuable information from this mass of data is a major challenge facing society today. On-line analytical processing (OLAP, On-Line Analytical Processing) systems have demonstrated powerful data analysis capabilities and are widely used in commercial fields such as banking, telecommunications, and stock exchanges.
A distributed in-memory columnar database supporting an OLAP system allows users to extract and analyze valuable information from mass data along multiple dimensions; this information may be a simple report, or it may be a complex analysis result. As query statements grow more complex, queries take longer to execute, and highly complex query statements appear very frequently in distributed in-memory columnar databases that support OLAP systems.
Query requests in a database are strongly correlated semantically, so partial query results are likely to have appeared in historical queries. A distributed in-memory columnar database therefore introduces a cache management system to save certain historical query information, reducing the number of times repeated queries are executed. Fig. 1 is a structural diagram of the cache management system of a distributed in-memory columnar database. The cache management system includes a query execution engine (Query Engine) 11, a cache master node (Cache Master) 12, a standby node (Standby Cache Master) 13, and at least one cache slave node (Cache Slave) 14. The query execution engine 11 is responsible for parsing and executing user SQL requests and returning query results. The cache master node 12 manages all cache slave nodes 14, maintains cache metadata, evicts caches according to the relevant algorithms, and maintains cache consistency. The standby node 13 periodically synchronizes all cache metadata from the cache master node 12; when the cache master node 12 fails, the standby node immediately takes its place and continues to provide the cache service. The cache slave nodes 14 store the actual cached data and respond to the query execution engine 11's requests to read caches.
How to cache the most frequently accessed data as far as possible in a distributed setting, and thereby accelerate database queries, is exactly the problem a distributed cache management system must solve.
Summary of the invention
The problem to be solved by the invention is how to cache the most frequently accessed data as far as possible in a distributed setting and thereby accelerate database queries.
The present invention is achieved through the following technical solutions:
A cache management method for a distributed in-memory columnar database, where the cache management system of the distributed in-memory columnar database includes a query execution engine, a cache master node, and at least one cache slave node. The cache management method includes: establishing a buffer queue in the cache master node, each element of which corresponds to the cache metadata of one physical task; cutting, with each physical task as root node, the physical execution plan in which it resides, to obtain the cache computation trajectory corresponding to each physical task; building a cache feature tree in the cache master node according to the cache computation trajectories of the physical tasks; when a query request arrives, parsing the SQL statement into a physical execution plan by the query execution engine; traversing each node of the physical execution plan level by level starting from its root node, and judging whether the cache computation trajectory corresponding to each physical task matches the cache feature tree; if it matches, reading the cached actual data of the physical task directly from a cache slave node, and otherwise computing the physical task.
The invention uses the cache computation trajectory to uniquely identify a cache. When a query request arrives, it is only necessary to match the cache computation trajectory of each physical task against the cache feature tree to rapidly detect whether the cache is hit, which to a certain extent reduces the computation of repeated tasks in the distributed database, saves query time, and improves query efficiency.
Further, the elements of the buffer queue are arranged in descending order of weight.
Further, the weight of each element in the buffer queue is obtained as W_i = q_i × (a × S_i + b × P_i), where W_i is the weight of the i-th element; q_i is the weight factor of the physical task corresponding to the i-th element; S_i is the space-time ratio of the i-th element, computed from t_i, the time of the longest path to the root node in the cache computation trajectory of the i-th element, k_i, the storage-strategy constant of the i-th element, and m_i, the system storage space actually occupied by the i-th element; P_i is the hit frequency of the i-th element, computed from n_i, the historical hit count of the i-th element, d_i, the interval since its last hit, and v_i, its average hit interval; a is the impact factor of the space-time ratio on the weight; b is the impact factor of the hit frequency on the weight; and i is a positive integer.
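The text gives W_i = q_i × (a × S_i + b × P_i) explicitly, but the exact formulas for S_i and P_i appear only as images in the original and are not reproduced here. The sketch below therefore assumes plausible forms (recompute cost per byte for S_i, a recency-scaled hit rate for P_i); only the outer weight formula and the rule P_i = 1 for a freshly inserted element come from the text.

```python
def element_weight(t_i, k_i, m_i, n_i, d_i, v_i,
                   q_i=1.0, a=1.0, b=1.0, first_insert=False):
    """Weight W_i = q_i * (a*S_i + b*P_i) for one buffer-queue element.

    t_i: time (ms) of the longest path to the root in the element's
         cache computation trajectory; k_i: storage-strategy constant;
    m_i: bytes actually occupied; n_i/d_i/v_i: historical hit count,
         interval since last hit, average hit interval.
    The forms of S_i and P_i below are assumptions for illustration.
    """
    s_i = (t_i * k_i) / m_i        # assumed space-time ratio: recompute cost per byte
    if first_insert:
        p_i = 1.0                  # P_i = 1 when the element first enters the queue
    else:
        p_i = n_i * v_i / d_i      # assumed: more hits, more recent -> higher frequency
    return q_i * (a * s_i + b * p_i)
```

Tuning a up strengthens the space-time term, as the embodiment suggests for memory-constrained systems.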
Further, the cache management method further includes performing the following steps when the cache management system receives a storage request for a new cache: step S1, updating the weight of each element in the buffer queue; step S2, judging whether the current remaining space of the cache management system is sufficient to store the new cache; if it is sufficient, executing step S3, in which the cache master node notifies a cache slave node to store the new cache, puts the new cache metadata into the buffer queue, and records the cache computation trajectory of the new cache in the cache feature tree; if it is not sufficient, executing step S4, judging whether the weight of the new cache is greater than the weight of the last element in the buffer queue; if it is not greater, executing step S5, in which the cache master node notifies the cache slave node to refuse to store the new cache; if it is greater, executing step S6, judging whether any query operation is still using the last element in the buffer queue; if some query operation is still using it, executing step S7, marking the last element in the buffer queue as to-be-deleted; if no query operation is using it, executing step S8, in which the cache master node deletes the last element in the buffer queue, reclaims the storage space occupied by the evicted cache in the cache management system, deletes the cache computation trajectory corresponding to that element from the cache feature tree, and returns to step S2.
Further, the cache management method further includes: setting a corresponding reference count for each element in the buffer queue; when an element is hit for the first time, setting its reference count to 1 and starting a timer; if, before the timer expires, the cache master node receives feedback that an SQL statement query using the element has finished, decrementing the element's reference count by 1 and resetting the timer; and when the element's reference count equals 0, closing the timer.
Further, judging whether any query operation is still using the last element in the buffer queue is judging whether the reference count corresponding to the last element in the buffer queue is 0.
Further, the cache computation trajectory of each physical task includes the result computation time of each node, the result storage size, and the data transmission time represented by each edge.
Further, all tables involved in the nodes of the cache feature tree have version numbers.
Compared with the prior art, the present invention has the following advantages and benefits:
The cache management method for a distributed in-memory columnar database provided by the invention rapidly detects whether the cache is hit through an efficient cache matching algorithm, and guarantees system availability and stability through a reasonable cache life-cycle algorithm, which to a certain extent reduces the computation of repeated tasks in the distributed database, saves query time and storage space, and improves query efficiency.
Detailed description of the invention
The drawings described here provide a further understanding of the embodiments of the invention and constitute part of the application; they do not limit the embodiments of the invention. In the drawings:
Fig. 1 is a structural diagram of the cache management system of a distributed in-memory columnar database;
Fig. 2 is a structural diagram of the physical execution plan of the embodiment of the invention;
Fig. 3 is a structural diagram of a cache computation trajectory of the embodiment of the invention;
Fig. 4 is a structural diagram of the cache feature tree of the embodiment of the invention;
Fig. 5 is a structural diagram of the computation trajectory of T2-Join of the embodiment of the invention;
Fig. 6 is a structural diagram of the computation trajectory of T3-Join of the embodiment of the invention;
Fig. 7 is a structural diagram of the cache matching result of the embodiment of the invention;
Fig. 8 is a flow diagram of the cache eviction method of the embodiment of the invention.
Specific embodiment
To make the objectives, technical solutions, and advantages of the present invention clearer, the invention is described in further detail below with reference to the embodiments and drawings. The exemplary embodiments of the invention and their description are used only to explain the invention and are not a limitation of the invention.
Embodiment
This embodiment provides a cache management method for a distributed in-memory columnar database. The structure of the cache management system of the distributed in-memory columnar database is shown in Fig. 1 and includes a query execution engine, a cache master node, a standby node, and at least one cache slave node.
When a query request arrives, the query execution engine parses the SQL statement into a physical execution plan represented as a DAG. Each node in the physical execution plan represents one physical task; physical tasks are divided into GetColumn, Join, Filter, Group, BuildRow, and so on, and each edge represents the transfer of a computed result between two physical tasks. The physical execution plan of a typical query statement (SELECT A.id FROM A, B WHERE A.id = B.id AND A.id ≤ 100 AND B.id ≤ 80) is shown in Fig. 2. In the cache management system, the granularity of cached data is the computed result of a single physical task. When the cached result is that of BuildRow, the final query result of the whole SQL statement has been cached. This embodiment uses the cache computation trajectory to uniquely identify a cache.
First, a buffer queue is established in the cache master node, each element of which corresponds to the cache metadata of one physical task. As mentioned above, the granularity of cached data is the computed result of a single physical task. When the buffer queue is first established, the cache management system has enough storage space; therefore, when a cache request is received, the computed result of the physical task is stored directly in a cache slave node, and the cache metadata of the physical task becomes an element of the buffer queue.
Next, with each physical task as root node, the physical execution plan in which it resides is cut to obtain the cache computation trajectory corresponding to each physical task. Specifically, each physical task whose result is to be cached corresponds to a physical execution plan. Within that plan, the plan is cut with the physical task requesting caching as root node to obtain a subgraph, and this subgraph is the cache computation trajectory corresponding to that root node. The cache computation trajectory records all information from the raw data to the generation of the cache, including the result computation time of each node, the result storage size, and the data transmission time represented by each edge. Each node of the cache computation trajectory is a feature point, and all feature points and their relations constitute the feature of the cache. Taking the physical execution plan shown in Fig. 2 as an example, if the user wants to cache the computed result of the T3-Join physical task, the plan is cut with that task node as root node, and the resulting cache computation trajectory is shown in Fig. 3.
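The cutting step above amounts to collecting the sub-DAG reachable below the chosen root task, together with the per-node and per-edge timing data the trajectory records. A minimal sketch, with illustrative node names and fields (the text does not specify a data layout):

```python
from dataclasses import dataclass, field

@dataclass
class PlanNode:
    """One physical task in the execution-plan DAG (fields illustrative)."""
    name: str                    # e.g. "T3-Join", "T5-GetColumn"
    compute_ms: float            # result computation time of this node
    result_bytes: int            # result storage size
    children: list = field(default_factory=list)  # (child, transfer_ms) pairs

def cut_trajectory(root):
    """Cut the plan at `root`: collect every node and edge reachable
    below it. This subgraph is the cache computation trajectory that
    uniquely identifies the cached result of `root`."""
    nodes, edges = [], []
    stack = [root]
    while stack:
        node = stack.pop()
        nodes.append(node.name)
        for child, transfer_ms in node.children:
            edges.append((child.name, node.name, transfer_ms))
            stack.append(child)
    return nodes, edges
```

For the Fig. 2 example, cutting at T3-Join would yield the T3/T5/T6 subgraph of Fig. 3.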
A cache feature tree is then built in the cache master node according to the cache computation trajectories of the physical tasks. The cache feature tree records the feature points of each cache and their relations; it is organized hierarchically by table relations, where the layer number represents the number of tables involved at that layer and the bottom layer involves only one table. Each node of the cache feature tree represents one cache feature point, lines between feature points indicate their mutual dependencies, and the feature of a cache is constituted by a series of feature points and their relations.
To prevent cached data from becoming inconsistent with the raw data in the database, all tables involved in the nodes of the cache feature tree have version numbers, and matching can proceed only when the table version numbers agree. When a user attempts to read or write a cache, the input contains the latest version numbers of the tables involved in the current distributed database; when the cache master node finds that the version of some table has expired, it deletes all caches related to that table. A typical cache feature tree is shown in Fig. 4, where a solid box indicates that the cache described by the feature point exists and a dashed box indicates that it does not; the latter exists only to guarantee the feature integrity of the cache it belongs to.
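The version-based invalidation just described can be sketched as follows; the feature-point identifiers and the dictionary layout are illustrative assumptions, not the patent's data structures:

```python
class CacheFeatureTree:
    """Minimal sketch of version tracking in the feature tree: every
    feature point records the tables it involves and their version
    numbers; a stale table invalidates every related cache."""

    def __init__(self):
        self.points = {}          # feature-point id -> {"tables": {name: version}}

    def add_point(self, point_id, tables):
        self.points[point_id] = {"tables": dict(tables)}

    def invalidate_stale(self, current_versions):
        """Delete every feature point involving a table whose recorded
        version differs from the current one; return the deleted ids."""
        stale = [pid for pid, p in self.points.items()
                 if any(current_versions.get(t, v) != v
                        for t, v in p["tables"].items())]
        for pid in stale:
            del self.points[pid]
        return stale
```

For example, bumping table A's version removes every feature point that involves A while leaving B-only points intact.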
When a query request arrives, the query execution engine parses the SQL statement into a physical execution plan, represented as a DAG of physical tasks. How a query execution engine parses an SQL statement into a physical execution plan is known to those skilled in the art and is not described here.
Each node of the physical execution plan is traversed level by level starting from its root node, and the cache computation trajectory of each physical task is checked for a match against the cache feature tree. Cache matching includes two cases: exact matching and partial matching. Exact matching means that all feature points of the requested cache computation trajectory and their relations can be found in the cache feature tree; partial matching is essentially the same, the only difference being that the root node of the cache computation trajectory is a subset of some feature point in the cache feature tree. If there is a match, the cached actual data of the physical task is read directly from a cache slave node; otherwise the physical task is computed.
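The hit condition above, that every feature point and every relation of the trajectory must be found in the feature tree, can be sketched as below. The per-node comparison (including the subset test used for partial matches) is assumed to be done separately and supplied as a mapping; that interface is an illustrative assumption.

```python
def match(traj_nodes, traj_edges, feature_of, feature_relations):
    """Check one cache computation trajectory against the feature tree.

    `feature_of` maps a trajectory node to its matching feature point
    (or None on a miss); `feature_relations` is the set of
    (child feature, parent feature) lines in the tree. Returns True
    only if every node and every edge of the trajectory is found."""
    for node in traj_nodes:
        if feature_of.get(node) is None:
            return False                      # one missed feature point -> miss
    for child, parent in traj_edges:
        if (feature_of[child], feature_of[parent]) not in feature_relations:
            return False                      # relation absent -> miss
    return True
```

This reproduces the embodiment's examples: T3-Join hits because relations (1_1, 2_1) and (1_2, 2_1) exist, while T2-Join misses because (1_5, 2_1) does not.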
On the basis of the cache feature tree described in Fig. 4, taking the SQL statement described by Fig. 2 (SELECT A.id FROM A, B WHERE A.id = B.id AND A.id ≤ 100 AND B.id ≤ 80) as an example, each node is traversed level by level starting from the root node of the physical execution plan, and the corresponding cache matching steps are as follows:
Detect whether T1-BuildRow hits the cache. The cache computation trajectory of T1-BuildRow is the entire DAG; starting from the root node, each feature point is scanned level by level, and as soon as one feature point misses, the task misses the cache. Since node T1 involves the two tables A and B, level 2 of the feature tree is searched for this feature point; the actual result is that there is none, so T1-BuildRow misses the cache.
Detect whether T2-Join hits the cache. The cache computation trajectory of T2-Join is shown in Fig. 5. Node T2 matches feature point 2_1 in the cache feature tree, node T5 matches feature point 1_1, and node T4 is a subset of feature point 1_5, but the relation (1_5, 2_1) does not exist, so T2-Join misses the cache.
Detect whether T3-Join hits the cache. The cache computation trajectory of T3-Join is shown in Fig. 6. Node T3 matches feature point 2_1 in the cache feature tree, node T6 matches feature point 1_2, and node T5, as established above, matches feature point 1_1; the relation (T5, T3) matches the relation (1_1, 2_1) and the relation (T6, T3) matches the relation (1_2, 2_1), so T3-Join matches exactly.
Node T4 is a subset of feature point 1_5, i.e. T4-GetColumn matches partially; node T5 matches feature point 1_1, so T5-GetColumn matches exactly; node T6 matches feature point 1_2, so T6-GetColumn matches exactly.
After the above steps, the cache matching result of the SQL statement is obtained as shown in Fig. 7. As can be seen from Fig. 7, the query execution engine only needs to compute the T1 and T2 physical tasks to complete the query of the whole SQL statement, greatly improving query efficiency.
Since the storage space of the cache management system is limited, after the storage space is fully occupied, existing caches must be evicted when a storage request for a new cache is received. Cache eviction is one of the cores of a cache management system; it decides the swapping in and out of caches and thereby affects the cache hit rate and the stability of the cache service. In this embodiment, the elements of the buffer queue are arranged in descending order of weight, with the smallest weight at the tail of the queue. Whenever a cache is evicted, the element is popped from the tail.
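The descending-weight queue with tail eviction can be sketched as follows; the storage layout is an illustrative assumption, and a real implementation would also re-sort after the weight refresh of step S1, which this sketch omits by sorting on insert only:

```python
import bisect

class BufferQueue:
    """Buffer queue kept in descending weight order; eviction pops the
    tail, i.e. the element with the smallest weight."""

    def __init__(self):
        self._items = []          # (weight, metadata), descending by weight

    def insert(self, weight, metadata):
        # keep descending order: bisect over negated weights
        keys = [-w for w, _ in self._items]
        self._items.insert(bisect.bisect_right(keys, -weight), (weight, metadata))

    def tail(self):
        return self._items[-1] if self._items else None

    def evict_tail(self):
        return self._items.pop()
```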
Suppose t_i is the time (unit: ms) of the longest path to the root node in the cache computation trajectory of the i-th element, m_i is the system storage space (unit: bytes) actually occupied by the i-th element, and k_i is the storage-strategy constant of the i-th element; different storage strategies differ in how cached data is read and how long that takes. The space-time ratio S_i of the i-th element is computed from these quantities.
Suppose d_i is the interval (unit: ms) since the last hit of the i-th element, n_i is the historical hit count of the i-th element, and v_i is the average hit interval (unit: ms) of the i-th element. The hit frequency P_i of the i-th element is computed from these quantities; if the i-th element has just been added to the queue for the first time, P_i = 1.
In conclusion in the buffer queue i-th element weight calculation formula are as follows: Wi=qi×(a×Si+b× Pi).Wherein, WiFor the weight of i-th element, qiFor the weight factor of the corresponding physical tasks of i-th element.I-th element pair The physical tasks answered are more complicated, the weight factor q of the corresponding physical tasks of i-th elementiValue it is bigger.Such as Join task pair The value of the corresponding weight factor of value ratio GetColumn task for the weight factor answered is big.Under default situations, i-th element pair The weight factor q for the physical tasks answerediEqual to 1.0.A and b is constant, respectively represent space-time than with hit frequency to weight Impact factor.If system memory space is smaller, the value of a can be tuned up, increase the influence that space-time compares weight, it is on the contrary then Turn a down.
In this embodiment, a corresponding reference count is set for each element in the buffer queue, representing the number of query operations using that cache. When an element is hit for the first time, its reference count is set to 1 and a timer is started, whose default value can be configured according to the actual situation, for example 30 seconds. If, before the timer expires, the cache master node receives feedback that some SQL statement query using the element has finished, the element's reference count is decremented by 1 and the timer is reset. When the element's reference count equals 0, the timer is closed. The purpose of the timer is to prevent a failure of the query execution engine from leaving the cache master node waiting forever for the query-finished feedback.
When a cache is to be evicted, if its reference count is not 0, some query operation is still using it; deleting it immediately could cause the query execution engine to fail to obtain the cache, incurring extra time for task failure recovery. The evicted cache is therefore only marked as to-be-deleted, and is really deleted only when its reference count reaches 0.
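The reference count plus expiry timer can be sketched as below. The text only specifies setting the count to 1 on the first hit; incrementing on later hits, the injectable clock, and the `in_use` check are illustrative assumptions.

```python
import time

class RefCount:
    """Reference count with an expiry timer for one cached element.

    The timer guards against a crashed query engine that never sends
    its query-finished feedback: once it expires, the count no longer
    blocks deletion."""

    def __init__(self, timeout_s=30.0, clock=time.monotonic):
        self.count = 0
        self.deadline = None
        self._timeout = timeout_s
        self._clock = clock

    def on_hit(self):
        self.count = 1 if self.count == 0 else self.count + 1
        self.deadline = self._clock() + self._timeout   # (re)start the timer

    def on_query_finished(self):
        self.count -= 1
        if self.count == 0:
            self.deadline = None                        # close the timer
        else:
            self.deadline = self._clock() + self._timeout  # reset it

    def in_use(self):
        """True while some query still holds the cache and the timer
        has not expired."""
        if self.count == 0:
            return False
        return self.deadline is not None and self._clock() < self.deadline
```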
Fig. 8 is a flow diagram of the cache eviction method of the embodiment of the invention. When the system receives a storage request for a new cache, the cache master node performs the following steps:
Step S1, update the weight of each element in the buffer queue. Since the interval since each element's last hit grows over time, the weights of the elements in the buffer queue must be updated first.
Step S2, judge whether the current remaining space of the cache management system is sufficient to store the new cache.
If it is sufficient, execute step S3: the cache master node notifies a cache slave node to store the new cache, puts the new cache metadata into the buffer queue, and records the cache computation trajectory of the new cache in the cache feature tree.
If it is not sufficient, execute step S4: judge whether the weight of the new cache is greater than the weight of the last element in the buffer queue.
If the weight of the new cache is not greater than the weight of the last element in the buffer queue, execute step S5: the cache master node notifies the cache slave node to refuse to store the new cache.
If the weight of the new cache is greater than the weight of the last element in the buffer queue, execute step S6: judge whether any query operation is still using the last element in the buffer queue, that is, judge whether the reference count corresponding to the last element in the buffer queue is 0.
If some query operation is still using the last element in the buffer queue, execute step S7: mark the last element in the buffer queue as to-be-deleted. When the reference count corresponding to that element reaches 0, the element is deleted from the buffer queue.
If no query operation is using the last element in the buffer queue, execute step S8: the cache master node deletes the last element in the buffer queue, reclaims the storage space occupied by the evicted cache in the cache management system, deletes the cache computation trajectory corresponding to that element from the cache feature tree, and returns to step S2.
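Steps S2 through S8 form a loop (S8 returns to S2), which can be sketched as one function; step S1's weight refresh is assumed to be done by the caller, and the handling after a to-be-deleted mark (step S7) is not detailed in the text, so the sketch simply reports the decision. The callback interface is an illustrative assumption.

```python
def handle_new_cache(queue, free_space, new_weight, new_size, in_use, evict):
    """Eviction loop of Fig. 8. `queue` is a list of (weight, size, id)
    in descending weight order; `in_use(id)` checks the reference count;
    `evict(id)` deletes the tail cache and its trajectory from the
    feature tree. Returns the decision taken for the new cache."""
    while True:
        if free_space >= new_size:             # S2 -> S3: store the new cache
            return "store"
        if not queue:                          # nothing left to evict (assumed)
            return "reject"
        tail_weight, tail_size, tail_id = queue[-1]
        if new_weight <= tail_weight:          # S4 -> S5: refuse storage
            return "reject"
        if in_use(tail_id):                    # S6 -> S7: defer deletion
            return "mark-to-delete"
        queue.pop()                            # S6 -> S8: evict tail, retry S2
        evict(tail_id)
        free_space += tail_size
```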
The specific embodiments described above further detail the objectives, technical solutions, and beneficial effects of the present invention. It should be understood that the foregoing is merely a specific embodiment of the invention and is not intended to limit the scope of protection of the invention; any modification, equivalent substitution, improvement, and the like made within the spirit and principles of the invention shall be included within the scope of protection of the invention.

Claims (5)

1.一种分布式内存列式数据库的缓存管理方法,所述分布式内存列式数据库的缓存管理系统包括查询执行引擎、缓存主控节点以及至少一个缓存从节点,其特征在于,所述缓存管理方法包括:1. A cache management method of a distributed memory columnar database, the cache management system of the distributed memory columnar database comprises a query execution engine, a cache master node and at least one cache slave node, characterized in that the cache Management methods include: 在缓存主控节点中建立缓存队列,所述缓存队列中每项元素对应为一个物理任务的缓存元数据;A cache queue is established in the cache master control node, and each element in the cache queue corresponds to the cache metadata of a physical task; 以每个物理任务为根节点切割其所在的物理执行计划以获得每个物理任务对应的缓存计算轨迹;每个物理任务对应的缓存计算轨迹包括每个节点的结果计算时间、结果存储大小以及每条边代表的数据传输时间;Take each physical task as the root node to cut the physical execution plan where it is located to obtain the cache calculation track corresponding to each physical task; the cache calculation track corresponding to each physical task includes the result calculation time, result storage size and The data transfer time represented by the edge; 根据每个物理任务对应的缓存计算轨迹在缓存主控节点中构建缓存特征树;Build a cache feature tree in the cache master node according to the cache calculation trajectory corresponding to each physical task; 在查询请求到来时,查询执行引擎将SQL语句解析成物理执行计划;When a query request arrives, the query execution engine parses the SQL statement into a physical execution plan; 从物理执行计划的根节点开始层次遍历物理执行计划中每个节点,判断每个物理任务对应的缓存计算轨迹是否与所述缓存特征树匹配;Starting from the root node of the physical execution plan, traverse each node in the physical execution plan hierarchically, and determine whether the cache calculation track corresponding to each physical task matches the cache feature tree; 若匹配,直接从缓存从节点中读取该物理任务的缓存实际数据,否则计算该物理任务;If it matches, directly read the actual data of the physical task from the cache slave node, otherwise calculate the physical task; 
所述缓存队列中每项元素按权重从大到小的顺序排列;所述缓存队列中每项元素的权重根据Wi=qi×(a×Si+b×Pi)获得,其中,Wi为第i项元素的权重,qi为第i项元素对应的物理任务的权重因子,Si为第i项元素的时空比且ti为第i项元素对应的缓存计算轨迹中到根节点最长路径的时间,ki为第i项元素对应的存储策略常量,mi为第i项元素实际所占的系统存储空间,Pi为第i项元素的命中频率且ni为第i项元素的历史命中次数,di为第i项元素距离上一次命中的时间间隔,vi为第i项元素平均命中时间间隔,a为时空比对权重的影响因子,b为命中频率对权重的影响因子,i为正整数。Each element in the cache queue is arranged in descending order of weight; the weight of each element in the cache queue is obtained according to W i =q i ×(a×S i +b×P i ), wherein, Wi is the weight of the i -th element, qi is the weight factor of the physical task corresponding to the i -th element, S i is the space-time ratio of the i-th element and t i is the time from the longest path to the root node in the cache calculation track corresponding to the i -th element, ki is the storage policy constant corresponding to the i -th element, mi is the system storage space actually occupied by the i-th element, P i is the hit frequency of the i-th element and n i is the number of historical hits of the i-th element, d i is the time interval from the i-th element to the previous hit, vi is the average hit time interval of the i -th element, a is the influence factor of the space-time ratio on the weight, b is the impact factor of the hit frequency on the weight, i is a positive integer. 2.根据权利要求1所述的分布式内存列式数据库的缓存管理方法,其特征在于,还包括当所述缓存管理系统收到新缓存的存储请求时进行如下步骤:2. 
2. The cache management method according to claim 1, characterized by further comprising, when the cache management system receives a storage request for a new cache:

Step S1: updating the weight of every element in the cache queue;

Step S2: determining whether the current remaining space of the cache management system is sufficient to store the new cache;

if the current remaining space is sufficient, performing Step S3: the cache master node notifies a cache slave node to store the new cache, puts the metadata of the new cache into the cache queue, and records the computation trajectory of the new cache in the cache feature tree;

if the current remaining space is not sufficient, performing Step S4: determining whether the weight of the new cache is greater than the weight of the last element in the cache queue;

if the weight of the new cache is not greater than the weight of the last element, performing Step S5: the cache master node notifies the cache slave node to refuse to store the new cache;

if the weight of the new cache is greater than the weight of the last element, performing Step S6: determining whether a query operation is still using the last element in the cache queue;

if a query operation is still using the last element, performing Step S7: marking the last element in the cache queue as to-be-deleted;
if no query operation is using the last element, performing Step S8: the cache master node deletes the last element from the cache queue, reclaims the storage space occupied by the evicted cache, deletes the computation trajectory corresponding to that element from the cache feature tree, and returns to Step S2.

3. The cache management method according to claim 2, characterized by further comprising:

setting a reference count for each element in the cache queue; when an element is hit for the first time, setting its reference count to 1 and starting a timer;

if, before the timer expires, the cache master node receives feedback that an SQL query using the element has finished, decrementing the element's reference count by 1 and resetting the timer;

when the element's reference count reaches 0, stopping the timer.

4. The cache management method according to claim 3, characterized in that determining whether a query operation is still using the last element in the cache queue is determining whether the reference count of that element is 0.
5. The cache management method according to claim 1, characterized in that the tables involved in all nodes of the cache feature tree have version numbers.
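Claims 2 through 4 together describe an admission-and-eviction loop guarded by reference counts. The sketch below is one plausible reading of steps S2-S8 under assumed names; Step S1 (re-weighting every element) is left to the caller, and because the claim does not state what follows the S7 mark, the sketch simply declines the new cache at that point.

```python
def admit(cache_queue, capacity, used, new):
    """Steps S2-S8 of claim 2 as a loop. `cache_queue` is a list of
    dicts kept in descending 'weight' order; each entry also carries
    'size' and 'refcount' (claims 3-4: in use iff refcount > 0).
    Returns (stored, used)."""
    while used + new["size"] > capacity:           # S2: not enough room
        if not cache_queue:                        # nothing left to evict
            return False, used
        victim = cache_queue[-1]                   # lowest-weight element
        if new["weight"] <= victim["weight"]:      # S4 -> S5: refuse storage
            return False, used
        if victim["refcount"] > 0:                 # S6 -> S7: still in use
            victim["to_delete"] = True             # mark; reclaim later
            return False, used
        cache_queue.pop()                          # S8: evict and reclaim...
        used -= victim["size"]                     # ...then retry S2
    cache_queue.append(new)                        # S3: store the new cache
    cache_queue.sort(key=lambda e: e["weight"], reverse=True)
    return True, used + new["size"]
```

For example, with capacity 10 and 8 units used, a new cache of weight 6 and size 4 evicts an unused weight-5 victim of size 4 and is admitted, while a new cache of weight 4 is refused outright.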
CN201610659223.8A 2016-08-11 2016-08-11 The buffer memory management method of distributed memory columnar database Active CN106294772B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610659223.8A CN106294772B (en) 2016-08-11 2016-08-11 The buffer memory management method of distributed memory columnar database

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610659223.8A CN106294772B (en) 2016-08-11 2016-08-11 The buffer memory management method of distributed memory columnar database

Publications (2)

Publication Number Publication Date
CN106294772A CN106294772A (en) 2017-01-04
CN106294772B true CN106294772B (en) 2019-03-19

Family

ID=57669593

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610659223.8A Active CN106294772B (en) 2016-08-11 2016-08-11 The buffer memory management method of distributed memory columnar database

Country Status (1)

Country Link
CN (1) CN106294772B (en)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107329814B (en) * 2017-06-16 2020-05-26 电子科技大学 A Distributed Memory Database Query Engine System Based on RDMA
CN111949210B (en) * 2017-06-28 2024-11-15 华为技术有限公司 Metadata storage method, system and storage medium in distributed storage system
US12135655B2 (en) 2017-07-27 2024-11-05 International Business Machines Corporation Saving track metadata format information for tracks demoted from cache for use when the demoted track is later staged into cache
US10691566B2 (en) 2017-07-27 2020-06-23 International Business Machines Corporation Using a track format code in a cache control block for a track in a cache to process read and write requests to the track in the cache
US11036641B2 (en) 2017-08-09 2021-06-15 International Business Machines Corporation Invalidating track format information for tracks demoted from cache
US10579532B2 (en) 2017-08-09 2020-03-03 International Business Machines Corporation Invalidating track format information for tracks in cache
CN107784103A (en) * 2017-10-27 2018-03-09 北京人大金仓信息技术股份有限公司 A kind of standard interface of access HDFS distributed memory systems
CN109376014B (en) * 2018-10-19 2021-07-02 郑州云海信息技术有限公司 A distributed lock manager implementation method and system
CN109492005A (en) * 2018-11-07 2019-03-19 郑州云海信息技术有限公司 A kind of B+ tree read buffer method and relevant apparatus
CN110389965B (en) * 2018-11-30 2023-03-14 上海德拓信息技术股份有限公司 Multidimensional data query and cache optimization method
CN110119275B (en) * 2019-05-13 2021-04-02 电子科技大学 A Distributed Memory Columnar Database Compilation Executor Architecture
CN110162272B (en) * 2019-05-23 2020-06-12 北京邮电大学 Memory computing cache management method and device
CN113312385B (en) * 2020-07-07 2025-02-28 阿里巴巴集团控股有限公司 Cache operation method, device and system, storage medium and operation device
CN112528081B (en) * 2020-12-24 2021-09-14 长沙翔宇信息科技有限公司 Space situation relative motion trajectory multilevel loading display method and device
CN114817262B (en) * 2022-04-27 2023-03-28 电子科技大学 Graph traversal algorithm based on distributed graph database
CN117909258B (en) * 2024-03-18 2024-05-14 北京开源芯片研究院 Optimization method and device for processor cache, electronic equipment and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103177057A * 2011-12-20 2013-06-26 SAP AG Many core algorithms for in-memory column store databases
CN103970870A * 2014-05-12 2014-08-06 Huawei Technologies Co., Ltd. Database query method and server


Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Optimization and Implementation of the Execution Engine in DWMS Column Storage; Zhang Qi; China Master's Theses Full-text Database, Information Science and Technology; 2012-06-15; main text pp. 12-50
Query Optimization for RDF Data Based on Semantic Caching; Xu Fangfang; China Master's Theses Full-text Database, Information Science and Technology; 2015-07-15; main text pp. 3-35 and 43
Research on Execution Plan Caching in the Dameng Embedded Database; Wang Zhufeng; China Master's Theses Full-text Database, Information Science and Technology; 2012-06-15; full text

Also Published As

Publication number Publication date
CN106294772A (en) 2017-01-04

Similar Documents

Publication Publication Date Title
CN106294772B (en) The buffer memory management method of distributed memory columnar database
CN102521406B (en) Distributed query method and system for complex task of querying massive structured data
CN102521405B (en) Massive structured data storage and query methods and systems supporting high-speed loading
CN104301360B (en) A kind of method of logdata record, log server and system
US9189506B2 (en) Database index management
US8732163B2 (en) Query optimization with memory I/O awareness
EP2746970B1 (en) Timeline index for managing temporal data
CN111143389A (en) Transaction execution method and device, computer equipment and storage medium
CN111159252A (en) Transaction execution method, apparatus, computer equipment and storage medium
KR101496179B1 (en) System and method for searching information based on data absence tagging
US9576038B1 (en) Consistent query of local indexes
US10417265B2 (en) High performance parallel indexing for forensics and electronic discovery
US10754854B2 (en) Consistent query of local indexes
Kolchinsky et al. Lazy evaluation methods for detecting complex events
CN107077492A (en) The expansible transaction management based on daily record
CN106484906A (en) A kind of distributed objects storage system flash back method and device
CN109313640A (en) Method and system for data base optimization
CN105740472A (en) Distributed real-time full-text search method and system
CN110309233A (en) Method, apparatus, server and the storage medium of data storage
CN105574054B (en) A kind of distributed caching range query method, apparatus and system
CN105740445A (en) A database query method and device
CN104317957B (en) A kind of open platform of report form processing, system and report processing method
CN102955792A (en) Method for implementing transaction processing for real-time full-text search engine
KR101429046B1 (en) Method for searching, inputting, deleting and garbage collecting of data in database having key-value structure
CN104854587A (en) Maintenance of active database queries

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant