CN119088761B - An electronic management method and system for international standardization organization system documents - Google Patents
An electronic management method and system for international standardization organization system documents Download PDFInfo
- Publication number
- CN119088761B CN119088761B CN202411185578.9A CN202411185578A CN119088761B CN 119088761 B CN119088761 B CN 119088761B CN 202411185578 A CN202411185578 A CN 202411185578A CN 119088761 B CN119088761 B CN 119088761B
- Authority
- CN
- China
- Prior art keywords
- file
- access
- files
- data
- storage
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/13—File access structures, e.g. distributed indices
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/16—File or folder operations, e.g. details of user interfaces specifically adapted to file systems
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/17—Details of further file system functions
- G06F16/172—Caching, prefetching or hoarding of files
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/17—Details of further file system functions
- G06F16/1734—Details of monitoring file system events, e.g. by the use of hooks, filter drivers, logs
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/18—File system types
- G06F16/185—Hierarchical storage management [HSM] systems, e.g. file migration or policies thereof
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention relates to the technical field of data management, in particular to an electronic management method and system for an international organization system file, comprising the following steps: and collecting access time, size and frequency information of the file based on the recorded international standardization organization system file, analyzing the information and the current access mode of the file, and obtaining file access data. According to the invention, the system can adjust the storage position in real time according to the use frequency and the size of the file, thereby reducing the data retrieval time and optimizing the access path. The real-time updating of the file position enables the frequently accessed file to be closer to the data processing core, and the delay of data transmission is effectively reduced. By analyzing the future access trend of the file and adjusting the expected access strategy accordingly, the adaptability of the system to future demands is improved, and the backup plan and the resource allocation are optimized. The dynamic storage adjustment mode can ensure the efficiency of resource allocation, predicts the resource pre-allocation during high load period, and effectively prevents the overload of the system.
Description
Technical Field
The invention relates to the technical field of data management, in particular to an electronic management method and system for an international organization system file.
Background
Data management involves the acquisition, validation, storage, protection, and processing of data to ensure accessibility, reliability, and timeliness of the data. Including creation, storage, maintenance, use, and deletion of data. The field is used in a variety of industries, including financial services, healthcare, government and education, for the purpose of improving organization efficiency and decision making capability by optimizing data flow.
The method is a method specially used for managing and maintaining the international organization for standardization system files. The method processes documents and records related to the stored files in an electronic way, so that the documents and records are more systematic and easy to search. The method has the main purposes of improving traceability and security of the file, optimizing the storage and access processes of the file and supporting quality management and compliance verification. Through electronic management, an organization can ensure that its standardized files are always up-to-date and can easily be shared with both internal and external stakeholders.
However, the prior art mainly focuses on the electronic processing and static storage of files, and is not timely enough for the frequently-changing file access modes and demand adjustment responses. When processing highly flexible data streams, resource utilization may be less than optimal, e.g. resource shortages may occur during periods of unpredictable high access demand. In addition, in the aspects of predicting future data trend and adjusting backup strategies based on the trend, resources cannot be dynamically adjusted when critical moments are encountered, and the speed and efficiency of data recovery are affected. For example, during times of unforeseen high demand, it is difficult for the system to efficiently allocate sufficient resources, affecting data processing and access speed, and overall business operational efficiency.
Disclosure of Invention
The invention aims to solve the defects in the prior art, and provides an electronic management method and system for an international organization system file.
In order to achieve the purpose, the invention adopts the following technical scheme that the method for electronically managing the international organization system file comprises the following steps:
S1, collecting access time, size and frequency information of a file based on a recorded international standardization organization system file, analyzing the information and a current access mode of the file, and obtaining file access data;
S2, based on the file access data, sorting the priority of the files, identifying the files needing to be processed preferentially, and generating a file priority list;
S3, reconfiguring file storage layout through the file priority list, moving the files accessed for multiple times to a more preferential access point, monitoring and recording migration operation data, and generating file position update records;
S4, analyzing future access trend of the file based on the file position update record and the historical access data, identifying trend change and access peak value, and generating file access prediction data according to the expected access strategy of the file adjusted by the analysis content;
S5, extracting key information from the file access prediction data, rearranging a backup time table and storage resource allocation according to a predicted demand time point, and creating a dynamic storage adjustment plan;
and S6, executing configuration adjustment operation according to the dynamic storage adjustment plan, including remapping the file to a new storage area and starting a new backup plan, and generating an optimized file storage configuration.
The file access data comprises access time points, byte sizes of files and access frequency counts of each file, the file priority list comprises file indexes, associated storage requirements and expected access frequencies which are arranged according to the frequency of demands, the file position update records comprise migration time from a current position to a new position, storage positions before and after migration and migration influence evaluation of each file, the file access prediction data comprises a predicted access pattern diagram, key time points and an access density increasing interval, the dynamic storage adjustment plan comprises adjustment details of a file storage strategy, optimal setting of a backup period and expected resource reconfiguration, and the optimized file storage configuration comprises a new file storage path, updated backup execution time and adjusted resource utilization rate.
The invention improves, based on the recorded international standardization organization system file, collects the access time, size and frequency information of the file, analyzes the information and the current access mode of the file, and obtains the specific steps of the file access data as follows:
S101, collecting access time, size and frequency information of a file based on a recorded international standardization organization system file, and generating a basic access data set;
s102, cleaning the collected data based on the basic access data set, eliminating abnormal values and repeated records, and generating a cleaned access data set;
And S103, carrying out data analysis based on the cleaned access data set, evaluating the current access mode of the file, including multiple access time periods and file size distribution, and generating file access data.
The invention improves, based on the file access data, the files are prioritized, the files needing to be processed preferentially are identified, and the specific steps for generating the file priority list are as follows:
S201, based on the file access data, calculating the access priority of each file, and generating an initial priority index by using the file size and the access frequency as key parameters;
S202, sorting a file list by adopting a least recently used algorithm based on the initial priority index, optimizing the access speed and generating an adjusted file list;
And S203, identifying files which are processed preferentially based on the adjusted file list, and generating a file priority list.
The invention is improved in that the file storage layout is reconfigured through the file priority list, the files accessed for multiple times are moved to a more preferential access point, migration operation data are monitored and recorded, and the specific steps for generating the file position update record are as follows:
S301, based on the file priority list, listing all files accessed for many times, designating the files as migration objects, and performing file selection and marking operation to generate a file identifier to be migrated;
S302, executing migration operation on the file in the file identifier to be migrated, wherein the migration operation comprises updating a file index and a storage path, and generating a file migration execution record;
And S303, based on the file migration execution record, monitoring the data transmission speed and success rate in the file migration process, and generating a file location update record.
The invention improves, based on the file position update record and the historical access data, analyzes the future access trend of the file, identifies trend change and access peak value, adjusts the expected access strategy of the file according to the analysis content, and generates the file access prediction data as follows:
S401, based on the file position update record and the historical access data, predicting a future access mode of the file by using a cyclic neural network, detecting the change of the file access after migration, including frequency increase and decrease and time periods in the access set, and generating an access frequency change prediction result;
S402, extracting a key time period and a file type from the access frequency change prediction result, identifying a key access peak time period and a file type, and generating a key change factor identification result;
And S403, adjusting the expected access strategy of the file based on the key change factor identification result, wherein the expected access strategy comprises optimizing a storage path and backup frequency, and generating file access prediction data.
The invention is improved in that the cyclic neural network is formed according to the formula:
Wherein E' represents the sum of prediction errors, n represents the total number of files, P i represents the number of predicted accesses of the ith file, A i represents the number of accesses of the ith file, P i-Ai represents the difference between the predicted value and the current value, F di represents the document importance index of the ith file, log (1+F di) logarithmically transforms the document importance index, S i represents the size of the ith file, and max (S) is the maximum file size observed in all files, and α is an adjustment parameter.
The invention improves, extract the key information in the predicted data based on the said file access, rearrange and stand-by schedule and storage resource allocation according to the predicted demand time point, the concrete step to establish dynamic storage and adjust the plan is as follows:
s501, identifying a key time point and resource allocation requirements based on the file access prediction data, and aiming at a predicted peak period, planning resource adjustment and backup time to generate a key time point analysis result;
S502, acquiring data from the analysis result of the key time point, redesigning and adjusting a backup time table, and generating an adjusted backup and resource configuration;
and S503, implementing the adjusted backup and resource configuration, updating the configuration setting of the storage management system, applying a new schedule and resource allocation strategy, and generating a dynamic storage adjustment plan.
The invention improves, according to the said dynamic storage adjustment plan, carry out the configuration adjustment operation, including the specific steps of the file storage configuration after remapping the file to the new storage area and starting the new backup plan after optimizing:
S601, identifying files needing to update the mapping according to the dynamic storage adjustment plan, and executing updating operation of the files, including modifying storage positions of the files, and generating file mapping updating execution records;
s602, starting a new backup plan, including setting time and resources of a backup task, monitoring the execution evaluation configuration effect of the first backup, and generating a new backup execution record;
S603, analyzing the file mapping update execution record and the new backup execution record, analyzing the performance of the whole configuration, and generating the optimized file storage configuration.
An international organization for standardization system file electronic management system, the system comprising:
the data collection and analysis module collects access time, size and frequency information of the file based on the file log, analyzes the data to reveal a current file access mode, and iteratively analyzes future access trends of the file by combining historical access data to generate a file access trend analysis result;
The priority ordering module performs priority ordering on the files based on the file access trend analysis result, selects files needing to be processed preferentially by using the access frequency and the file size of the files as key parameters, and generates a file priority list;
The storage layout adjustment module utilizes the file priority list to reconfigure the file storage layout, moves the files accessed for multiple times to a priority access point, monitors and records migration operation data at the same time, and generates a file position update record;
the dynamic storage management module adjusts a backup time table and storage resource allocation based on the file position update record and the file access trend analysis result to generate a dynamic storage adjustment plan;
And the system configuration implementation module executes the dynamic storage adjustment plan, remaps the file to a new storage area, starts a new backup plan and monitors the whole process to generate the optimized file storage configuration.
Compared with the prior art, the invention has the advantages and positive effects that:
In the invention, the system can adjust the storage position in real time according to the use frequency and the size of the file, thereby reducing the data retrieval time and optimizing the access path. The real-time updating of the file position enables the frequently accessed file to be closer to the data processing core, and the delay of data transmission is effectively reduced. By analyzing the future access trend of the file and adjusting the expected access strategy accordingly, the adaptability of the system to future demands is improved, and the backup plan and the resource allocation are optimized. The dynamic storage adjustment mode can ensure the efficiency of resource allocation, predicts the resource pre-allocation during high load period, and effectively prevents the overload of the system. The performance of the file management system and the security of the data are enhanced, so that the management process is more efficient and accurate.
Drawings
FIG. 1 is a flow chart of an electronic management method for documents of an International organization for standardization system;
FIG. 2 is a schematic diagram of the refining process of step S1 of the present invention;
FIG. 3 is a schematic diagram of the refinement flow of step S2 of the present invention;
FIG. 4 is a schematic diagram of the refinement flow of step S3 of the present invention;
FIG. 5 is a schematic diagram of the refinement flow of step S4 of the present invention;
FIG. 6 is a schematic diagram of the refinement flow of step S5 of the present invention;
FIG. 7 is a schematic diagram of the refinement flow of step S6 of the present invention;
fig. 8 is a block diagram of an electronic management system for international organization for standardization system files according to the present invention.
Detailed Description
The present invention will be described in further detail with reference to the drawings and examples, in order to make the objects, technical solutions and advantages of the present invention more apparent. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the invention.
In the description of the present invention, it should be understood that the terms "length," "width," "upper," "lower," "front," "rear," "left," "right," "vertical," "horizontal," "top," "bottom," "inner," "outer," etc. indicate or are based on the orientation or positional relationship shown in the drawings, merely to facilitate description of the invention and simplify description, and do not indicate or imply that the devices or elements referred to must have a specific orientation, be configured and operated in a specific orientation, and thus should not be construed as limiting the invention. Furthermore, in the description of the present invention, the meaning of "a plurality" is two or more, unless explicitly defined otherwise.
Referring to FIG. 1, the invention provides a technical scheme that an electronic management method for an international organization system file comprises the following steps:
S1, collecting access time, size and frequency information of a file based on a recorded international standardization organization system file, analyzing the information and a current access mode of the file, and obtaining file access data;
s2, based on file access data, sorting the priority of the files, identifying the files needing to be processed preferentially, and generating a file priority list;
s3, reconfiguring file storage layout through a file priority list, moving the files accessed for multiple times to a more preferential access point, monitoring and recording migration operation data, and generating a file position update record;
s4, based on the file position update record and the historical access data, analyzing future access trend of the file, identifying trend change and access peak value, and adjusting the expected access strategy of the file according to the analysis content to generate file access prediction data;
S5, extracting key information from the file access prediction data, rearranging a backup schedule and storage resource allocation according to a predicted demand time point, and creating a dynamic storage adjustment plan;
And S6, executing configuration adjustment operation according to the dynamic storage adjustment plan, including remapping the file to a new storage area and starting a new backup plan, and generating the optimized file storage configuration.
The file access data comprises access time points, byte sizes of files and access frequency counts of each file, the file priority list comprises file indexes, associated storage requirements and expected access frequencies which are arranged according to the frequency of the demands, the file position update record comprises migration time of each file from a current position to a new position, storage positions before and after migration and migration influence evaluation, the file access prediction data comprises a predicted access pattern diagram, key time points and an access density increase interval, the dynamic storage adjustment plan comprises adjustment details of a file storage strategy, optimal setting of a backup period and expected resource reconfiguration, and the optimized file storage configuration comprises a new file storage path, updated backup execution time and adjusted resource utilization rate.
Referring to fig. 2, based on the recorded international organization system file, access time, size and frequency information of the file are collected, and the information and current access mode of the file are analyzed, so as to obtain file access data, which comprises the following specific steps:
s101, collecting access time, size and frequency information of a file based on a recorded international standardization organization system file, wherein the process of generating a basic access data set is as follows;
first, the system extracts the access time stamp, the file size and the access times of each file from the log file, and constructs an initial data matrix. The data matrix uses the file identifier as a row index and the time stamp, size and frequency as column attributes. For each file, the total number of accesses N, the average size S of the file and the time stamp T of the last access are calculated by the following specific calculation method Where k is the number of records of a certain file, N is the total number of accesses, and o is the index of the record. Average size of fileWhere s o is the file size per access. In this formula, S represents the average file size, S o is the size of the file at the o-th access, and k represents the total number of accesses. The last access time t=max (T 1,t2,…,tk), where T i is the access timestamp, T represents the last access time, and T i is the i-th access timestamp. And generating a row of basic access data set for each file by using the N, S and T obtained through calculation.
S102, cleaning the collected data based on the basic access data set, removing abnormal values and repeated records, and generating a cleaned access data set according to the process;
Data cleansing includes culling file size zero records, abnormally large or small file size records, and deleting duplicate file access records. The reasonable range of the file size is set to [ L, U ] where L and U are the lower and upper limits of the file size, respectively. The rule of outlier rejection is to delete records whose size is not within [ L, U ] i.e. if s o < L or s o > U, then reject the record. Here, L denotes a minimum file size limit, U denotes a maximum file size limit, and s i is the file size of a record. For repeated access records, only the latest access record is kept. And reconstructing the cleaned access data set by using the rule, so as to ensure the accuracy and usability of the data.
S103, carrying out data analysis based on the cleaned access data set, and evaluating the current access mode of the file, wherein the current access mode comprises multiple access time periods and file size distribution, and the flow of generating file access data is as follows;
And applying statistical analysis to the cleaned data to evaluate the access mode of the file. Calculating the mean value of file access frequency And standard deviation sigma N, mean of file sizeAnd standard deviation sigma S. These statistics help identify changes in access patterns and distribution of file sizes. The formula is as follows, the mean value and standard deviation of the file access frequency are respectively:And Where m is the total number of files and N j is the number of accesses to the jth file.Representing the average number of accesses, σ N representing the standard deviation of the number of accesses, j being the index of the file. The mean and standard deviation of the file sizes are respectively:And Wherein the method comprises the steps ofRepresenting the average file size, σ S representing the standard deviation of the file size, S j being the size of the j-th file. Thereby helping to identify those files that have an abnormal access pattern or size.
Referring to fig. 3, based on file access data, files are prioritized, files to be preferentially processed are identified, and a file priority list is generated as follows:
s201, based on file access data, calculating access priority of each file, and using file size and access frequency as key parameters, wherein the process of generating an initial priority index is as follows;
And calculating the access priority of each file by using the size S and the access frequency N of the file. This calculation takes the form of a weighted sum giving higher access frequency weight w N and higher file size weight w S. The calculation formula of the access priority index P is shown as P i=wN·Ni+wS·Si, wherein P i represents the access priority of the ith file, N i is the access frequency of the ith file, and S i is the size of the ith file. Weights w N and w S are adjusted according to system requirements to reflect the relative importance of access frequency and file size to priority. This step ensures that each file is given a preliminary priority score.
S202, sorting the file list by adopting a least recently used algorithm based on an initial priority index, optimizing the access speed, and generating an adjusted file list by adopting the following steps of;
The files are ordered using a least recently used algorithm. The algorithm sorts the files according to the latest access time T of the files and the calculated access priority P. The goal of file sorting is to place frequently accessed and voluminous files at the front of the list. The ranking function F is defined as F i=αcPi-β·Ti, α and β are tuning parameters for balancing the effect of priority score and last access time. T i is the last access timestamp of the file, converted to an inverse relative to the current time to ensure that the last accessed file is ranked higher. And meanwhile, the file list is dynamically optimized according to the actual access mode and the storage characteristic.
S203, identifying files to be processed preferentially based on the adjusted file list, wherein the flow of generating the file priority list is as follows;
By referencing the priority index F i of the file with the average priority of the whole file And a dynamic threshold is set to determine whether the file is a priority handling file. Will be more dynamic and adapt to the current file access pattern. The formula can be defined as: where Y i represents the priority handling index of the ith file, F i is the ranking index of the files, Is the average of the priority indices F of all files and e is a small positive constant to avoid the situation where the divisor is zero. By quantifying the performance of each file relative to the average priority. The processing priority of a file is no longer a simple binary decision (i.e., priority or not), but rather a continuous metric, allowing finer adjustment of the file access policy. In practice, a ratio threshold may be set, such as k, and a file is marked as prioritized when the Y i value of the file is above this threshold.
Referring to fig. 4, the specific steps of reconfiguring a file storage layout through a file priority list, moving a file accessed multiple times to a more preferential access point, monitoring and recording migration operation data, and generating a file location update record are as follows:
S301, based on a file priority list, listing all files accessed for many times, designating the files as migration objects, and performing file selection and marking operation to generate a flow of file identification to be migrated as follows;
The system first extracts statistics of all files from the integrated file access and priority database. And calculating the access weight of each file, wherein the formula is as follows: Where γ, δ and ζ are dynamically adjusted weight coefficients, and T min is the earliest access time point within the investigation period. Next, a dynamic threshold θ ' is set, calculated as θ ' =μ W+k·σW file is marked as a migration candidate if its weight W exceeds θ '. The method integrates various access characteristics of the files, and accurately selects the files needing to optimize the storage position.
S302, executing migration operation on files in a file identifier to be migrated, wherein the migration operation comprises updating file indexes and storage paths, and generating a file migration execution record;
And executing actual migration operation on the marked files to be migrated. The system updates the current storage location of each file to the new optimized location. The selection of the new location is based on the size of the file and the frequency of access. In updating the index and storage path, the formula I new=α·Iold +β is used, where I new is the updated file index, I old is the original index, and α and β are adjustment coefficients. It is ensured that the file index reflects the new storage location after migration.
S303, based on the file migration execution record, monitoring the data transmission speed and success rate in the file migration process, and generating a file position update record by the following steps;
The system monitors real-time data of file migration, including data transmission speed V and success rate R. Calculated by the following formula: And Where D represents the amount of data migrated, T is the time required for migration, and S success and S total are the number of files successfully migrated and the total number of files attempted to be migrated, respectively. These data help update the location records of the files and ensure that the latest location of each file is accurately recorded in the file management system, optimizing storage management and improving the response speed of the system.
Referring to fig. 5, based on the file location update record and the historical access data, analyzing future access trend of the file, identifying trend change and access peak, adjusting the expected access policy of the file according to the analysis content, and generating file access prediction data comprises the following specific steps:
s401, based on file position update records and historical access data, predicting a future access mode of a file by using a cyclic neural network, detecting the change of file access after migration, wherein the process of generating an access frequency change prediction result comprises the steps of increasing or decreasing the frequency and concentrating the access time period;
The system analyzes the location update record and the historical access data of the file using a recurrent neural network. The network input includes features extracted from historical access frequency and recent positional variation of the file. These features are used to train a neural network model, predicting the access patterns of the file over a period of time in the future. During model training, the goal is to minimize the prediction error.
The recurrent neural network follows the formula:
Where E' represents the prediction error sum, which provides an adjusted error estimate, taking into account factors of file importance and size, for estimating the overall performance of the prediction model. n represents the total number of files used to calculate the average error, ensuring consistency of error assessment and rationality of comparison. To sum the symbols, we mean that the cumulative calculation is done for all files, ensuring that the error for each file is taken into account in the total error. P i represents the predicted number of accesses to the ith file, which is the result of the model output for comparison with the actual number of accesses. A i represents the actual number of accesses to the ith file to verify the accuracy of the prediction. P i-Ai represents the difference between the predicted value and the actual value, the square of which provides a quantized representation of the error. F di represents the document importance index of the ith file, which is an index for measuring the importance of the file, and affects the weight of the error, and is used for adjusting the contribution of the importance of the file to the prediction error. log (1+F di) carries out logarithmic transformation on the document importance index to smooth the influence of the importance index and avoid the overlarge influence of extreme values. S i indicates the size of the i-th file, which is used to influence the weight of the prediction error, so that the prediction error of the large file and the small file are differently considered. max S) the maximum file size observed in all files, for normalizing the effect of file size. Alpha is an adjustment parameter used for controlling the influence degree of the file size on the prediction error, and can be adjusted according to actual requirements.
The calculation process is as follows:
Calculate the value of P i-Ai:
P i and a i represent the predicted and actual access times, respectively, for the ith file. And calculating the difference between the two values to obtain a basic value of the prediction error.
Calculating a file importance logarithm adjustment factor 1+log (1+F di):
F di is the document importance index of the ith file, and may be derived according to the use frequency, modification frequency or other business rules of the file. Adding 1 to F di, taking the logarithm, and adding 1 again to adjust the influence of the prediction error. Such a transformation ensures that even when F di is 0, the denominator will not be 0, while logarithmic transformation can reduce the excessive impact of high importance files.
Calculating normalized impact of file size
S i is the size of the i-th file, and max (S) is the maximum size among all files. First, the ratio of each file size to the maximum file size is calculated, and then the ratio is raised to the power of alpha, where alpha is a preset parameter for adjusting the influence degree of the file size.
Calculating the adjusted error of the single file in a combined way:
The final E' value is an adjusted prediction error index integrating the file importance and the file size. The accuracy of the predictive model can be evaluated more accurately, especially on the predictive performance of different types of files. The method considers the business importance and the actual size of the file, so that the error assessment is more public and targeted. This result has an important role in optimizing the file management system, which can direct the system to more effectively adjust resource allocation and backup policies, particularly during high load or critical periods, ensuring access to critical files and data security.
S402, extracting a key time period and a file type from an access frequency change prediction result, and identifying a key access peak time period and a file type, wherein the process of generating a key change factor identification result is as follows;
the key time period in which access peaks occur in the predictions and the associated file type are identified. The analysis is based on the statistical distribution characteristics of the access frequency, and is realized by calculating the access frequency mean mu and standard deviation sigma of each time period and file type: And Wherein, And (3) representing the file access amount in the ith 1 th time period, and t is the total number of the investigated time periods. By analyzing μ and σ for each period and file type, the system can efficiently identify key access peak periods and file types.
S403, based on the key change factor identification result, adjusting the expected access strategy of the file, wherein the process of generating file access prediction data comprises the steps of optimizing a storage path and backup frequency;
And according to the identified key change factors, adjusting the storage path and the backup frequency of the file so as to optimize the access efficiency and the data security. The adjustment policy optimizes the storage path using a dynamic adjustment algorithm based on the predicted access pattern. The basis of the adjustment is a weighted score of the file access frequency and the file size, and the calculation formula is as follows, wherein Z=ζV '+ηS', V 'and S' respectively represent the access frequency and the size of the file, and ζ and η are weights dynamically adjusted according to the file access mode and the system storage strategy. The use of this formula helps the system dynamically adjust the storage location and backup frequency of each file to accommodate changes in access patterns, thereby optimizing overall system performance.
Referring to fig. 6, key information is extracted from the file access prediction data, and the backup schedule and storage resource allocation are rearranged according to the predicted demand time point, so as to create a dynamic storage adjustment plan, which comprises the following specific steps:
s501, identifying a key time point and resource allocation requirements based on file access prediction data, and planning resource adjustment and backup time aiming at a predicted peak period, wherein the process of generating a key time point analysis result is as follows;
The system determines key points in time, particularly predicted access peak periods, by analyzing file access prediction data. These points in time reflect moments when resource demand may increase significantly. To plan resource adjustment and backup, the system calculates the required amount of resources R' using the following formula: Where lambda is the resource demand coefficient, Is the predicted access amount in the j 1 th predicted peak period, and k 1 is the number of peak periods. This calculation helps determine the amount of resources that need to be increased at each critical point in time and the timing of the backup operation.
S502, obtaining data from analysis results of key time points, redesigning and adjusting a backup time table, and generating an adjusted backup and resource configuration flow as follows;
The system redesigns and adjusts the backup schedule and resource configuration. By evaluating the resource demand and current resource utilization at each key point in time, the system will calculate a resource reconfiguration index C, whose calculation formula is:
where μ is the resource adjustment coefficient, Is the resource requirement at the i 2 th point in time,Is the corresponding time interval length, and n 1 is the total key time point number. This step ensures that the backup schedule and resource configuration closely correspond to the predicted access pattern, optimizing system performance and data security.
S503, implementing the adjusted backup and resource allocation, updating the configuration setting of the storage management system, applying a new time table and resource allocation strategy, and generating a flow of a dynamic storage adjustment plan as follows;
the system implements the adjusted backup and resource configuration. To ensure that all settings are applied correctly and that the system is efficient to run, the storage configuration is optimized using a dynamic adjustment formula: wherein v is an adjustment intensity parameter, Is the backup data amount at the m 1 th time point,Is the corresponding amount of configuration resources, q is the total number of key points in time. In this way, the configuration settings of the storage management system are updated and new schedules and resource allocation policies are applied, ensuring that the system responds quickly to changing access demands.
Referring to fig. 7, according to the dynamic storage adjustment plan, a configuration adjustment operation is performed, including remapping a file to a new storage area and enabling a new backup plan, and the specific steps for generating an optimized file storage configuration are as follows:
s601, identifying files needing to update the mapping according to a dynamic storage adjustment plan, and executing updating operation of the files, wherein the process of generating file mapping updating execution records comprises the steps of modifying storage positions of the files;
the system identifies and lists files that need to update the map according to the dynamic storage adjustment plan that has been formulated. For these files, performing the update operation includes modifying its storage location. In this process, a new storage location is calculated using the following formula, ensuring that the storage efficiency of each file is maximized, M new=ξ1·Lcurrent+η1·Fusage, where M new is the new mapping location of the file, L current is the current logical location of the file, F usage is an evaluation of the frequency of use of the file, and ζ 1 and η 1 are adjustment coefficients. Each step of this process is recorded to form a file map update execution record for subsequent review and analysis.
S602, starting a new backup plan, wherein the new backup plan comprises the steps of setting the time and the resource of a backup task, monitoring the execution evaluation configuration effect of the first backup, and generating a new backup execution record as follows;
The system initiates a new backup plan that includes setting specific times and resources for the backup tasks. The new backup plan schedules the resource requirements of the backup tasks according to the following formula: Where B req is the total amount of resources required for the backup task, ρ is the unit data backup resource demand coefficient, Is the data amount of the nth 2 files, and N' is the number of files contained in the backup plan. After start-up, the execution of the first backup is carefully monitored and the associated configuration effects are evaluated by the new backup execution record, which helps to ensure the validity of the backup strategy.
S603, analyzing the file mapping update execution record and the new backup execution record, analyzing the performance of the whole configuration, and generating an optimized file storage configuration flow as follows;
The file map update execution record and the new backup execution record are analyzed to evaluate the performance of the overall configuration. By integrating these data, the overall configuration efficiency is calculated using the following formula: wherein E total is the efficiency of the overall configuration, Is the amount of resources consumed by the q 1 th configuration task,Is the data throughput of the corresponding task, and Q is the total number of tasks considered. This analysis helps determine whether further optimization measures are needed to improve the performance and efficiency of the storage configuration. By the method, the optimized file storage configuration is finally generated, and the response speed and the resource utilization efficiency of the system are improved.
Referring to fig. 8, an electronic management system for documents of the international organization for standardization system, the system includes:
the data collection and analysis module collects access time, size and frequency information of the file based on the file log, analyzes the data to reveal a current file access mode, and iteratively analyzes future access trends of the file by combining historical access data to generate a file access trend analysis result;
The priority ordering module performs priority ordering on the files based on the file access trend analysis result, selects files needing to be processed preferentially by using the access frequency and the file size of the files as key parameters, and generates a file priority list;
the storage layout adjustment module utilizes the file priority list to reconfigure the file storage layout, moves the files accessed for multiple times to the priority access point, monitors and records migration operation data at the same time, and generates a file position update record;
the dynamic storage management module adjusts a backup time table and storage resource allocation based on the file position update record and the file access trend analysis result to generate a dynamic storage adjustment plan;
The system configuration implementation module executes the dynamic storage adjustment plan, remaps the files to the new storage area, enables the new backup plan and monitors the whole process, and generates the optimized file storage configuration.
The present invention is not limited to the above embodiments, and any equivalent embodiments which can be changed or modified by the technical disclosure described above can be applied to other fields, but any simple modification, equivalent changes and modification made to the above embodiments according to the technical matter of the present invention will still fall within the scope of the technical disclosure.
Claims (8)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202411185578.9A CN119088761B (en) | 2024-08-27 | 2024-08-27 | An electronic management method and system for international standardization organization system documents |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202411185578.9A CN119088761B (en) | 2024-08-27 | 2024-08-27 | An electronic management method and system for international standardization organization system documents |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN119088761A CN119088761A (en) | 2024-12-06 |
| CN119088761B true CN119088761B (en) | 2025-04-15 |
Family
ID=93697197
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202411185578.9A Active CN119088761B (en) | 2024-08-27 | 2024-08-27 | An electronic management method and system for international standardization organization system documents |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN119088761B (en) |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119377997A (en) * | 2024-12-25 | 2025-01-28 | 中国标准化研究院 | A standard electronic archive management method and system based on artificial intelligence |
| CN119537324A (en) * | 2025-01-23 | 2025-02-28 | 珠海飞企耀点科技有限公司 | A multi-level quality document management method based on big data |
| CN119829817A (en) * | 2025-03-19 | 2025-04-15 | 南京审计大学 | File information management system based on cloud |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN106528761A (en) * | 2016-11-04 | 2017-03-22 | 郑州云海信息技术有限公司 | File caching method and apparatus |
| CN117807425A (en) * | 2024-03-01 | 2024-04-02 | 创略智造(河北)工业设计有限公司 | Intelligent data analysis method and system |
Family Cites Families (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP4105260B2 (en) * | 1997-08-25 | 2008-06-25 | 富士通株式会社 | Information processing device |
| US10360076B1 (en) * | 2016-10-06 | 2019-07-23 | Sprint Communications Company L.P. | Prioritized rebalancing of distributed file system |
| CN110187840A (en) * | 2019-05-30 | 2019-08-30 | 北京奇艺世纪科技有限公司 | A kind of data migration method, device, server and storage medium |
| CN110968272B (en) * | 2019-12-16 | 2021-01-01 | 华中科技大学 | Time sequence prediction-based method and system for optimizing storage performance of mass small files |
| CN113849457B (en) * | 2021-08-25 | 2024-04-05 | 湘潭大学 | Multi-data center dynamic copy placement method based on neural network |
| US12287721B2 (en) * | 2022-01-27 | 2025-04-29 | International Business Machines Corporation | Storage management and usage optimization using workload trends |
| CN117827761A (en) * | 2023-12-12 | 2024-04-05 | 天翼云科技有限公司 | LSTM-based HDFS multi-stage copy method and system |
| CN118349185A (en) * | 2024-05-13 | 2024-07-16 | 天津盛鲸科技有限公司 | Large-scale distributed storage management and maintenance system |
| CN118427158B (en) * | 2024-07-04 | 2024-10-11 | 广州劲源科技发展股份有限公司 | File development and utilization management system based on artificial intelligence technology |
-
2024
- 2024-08-27 CN CN202411185578.9A patent/CN119088761B/en active Active
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN106528761A (en) * | 2016-11-04 | 2017-03-22 | 郑州云海信息技术有限公司 | File caching method and apparatus |
| CN117807425A (en) * | 2024-03-01 | 2024-04-02 | 创略智造(河北)工业设计有限公司 | Intelligent data analysis method and system |
Also Published As
| Publication number | Publication date |
|---|---|
| CN119088761A (en) | 2024-12-06 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN119088761B (en) | An electronic management method and system for international standardization organization system documents | |
| CN118761745B (en) | OA collaborative workflow optimization method applied to enterprise | |
| CN118467186B (en) | Multi-tenant server-free platform resource management method and system | |
| CN119201620B (en) | Cloud computing analysis method, device and equipment of SaaS system and storage medium | |
| CN118939438A (en) | Intelligent scheduling method and system for heterogeneous equipment system | |
| CN117667585B (en) | A method and system for evaluating operation and maintenance efficiency based on operation and maintenance quality management database | |
| CN118585944B (en) | A method for determining abnormalities in hazardous waste management data | |
| CN119443611A (en) | Task allocation method for flexible labor | |
| CN119620934B (en) | Optical and electromagnetic storage method and system based on hierarchical management | |
| CN119383095A (en) | Traffic control and load balancing methods in network communications | |
| CN118916648B (en) | An information service platform and method for intelligent fusion of big data | |
| CN118244994B (en) | Historical data storage method and device based on cloud computing | |
| CN118521239A (en) | Big data-based E-commerce logistics distribution optimization system and method | |
| CN116070249B (en) | Asset data intelligent monitoring management system and method | |
| CN113610225A (en) | Quality evaluation model training method and device, electronic equipment and storage medium | |
| CN118710225B (en) | Automatic filing method and system for personnel files | |
| CN118013043B (en) | File data management method, device, equipment and storage medium | |
| CN116820767A (en) | Cloud resource management method and device, electronic equipment and storage medium | |
| CN120104308B (en) | Cloud platform resource scheduling optimization method and system based on multi-chip architecture | |
| CN120196680B (en) | Industrial data asset distributed storage method, medium and system | |
| CN117808602B (en) | Hot account billing method and related device based on sub-account expansion | |
| CN117193675B (en) | Solid-state storage management system based on distributed computing capacity | |
| CN120428929B (en) | Cold and hot data exchange method, system and storage medium based on optical storage | |
| CN120104308A (en) | Cloud platform resource scheduling optimization method and system based on multi-chip architecture | |
| CN119987966A (en) | Task scheduling method and system based on power big data model |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |