Introduction

The last several decades have seen a sharp increase in the creation of biopharmaceuticals, especially protein products, although these advances have been accompanied by difficulties in formulation1,2,3. Parenteral administration is the primary method of introducing biopharmaceuticals into the human body4. Biomolecules, however, are often less stable in liquid form than in solid form. It has been demonstrated that keeping biopharmaceuticals in solid form can help preserve molecular stability and increase product shelf life5.

Transporting products is also made easier by the elimination of water. In solid products, disaccharide excipients are typically used during drying to preserve the structure of biomolecules and guarantee stability. Either the vitrification hypothesis or the water replacement theory is frequently used to describe the stabilizing mechanisms of excipients6.

Biomolecule stability is significantly influenced by the type of excipient used7. A variety of methods with varying drying efficiencies have been proposed and developed for the solid biopharmaceutical development process. Fully characterizing biopharmaceuticals and their surrounding matrix in the solid state can be difficult because most characterization methods were created for liquids8. Dried biopharmaceuticals typically require a more involved development process than solution products9.

Biopharmaceuticals in the solid state have long been developed through the process of lyophilization, also known as freeze drying10,11,12. Compared to other drying techniques like spray drying, it is a more mature method. Lyophilization, however, is a labor-intensive batch process that consumes a lot of energy. A key goal in the development of biopharmaceuticals is to preserve the stability of the formulation over a long period of time. However, various stresses brought on by the drying process can promote aggregation, resulting in visible and sub-visible particles in the finished product8,13.

The application of machine learning (ML) has proven to be a powerful tool for simulating and predicting complex parameters. Ensemble models, particularly those built on decision trees, have become highly favored for accurately capturing intricate relationships between inputs and target variables14,15. In this study, we address the complex task of predicting the concentration of one particular chemical, denoted as C (mol/m3), using the coordinates X(m), Y(m), and Z(m). With a large dataset of over 46,000 data points, we aim to use advanced modeling techniques to accurately estimate C at any given point within the defined spatial domain.

The models used in this study include Decision Tree (DT), Ridge Regression (RR), and Support Vector Regression (SVR), each with its own advantages in modeling non-linear relationships between coordinates and chemical concentrations. Decision Tree models divide data into hierarchical decision nodes, allowing for easy interpretation of complex relationships. Ridge Regression, on the other hand, uses regularization to prevent overfitting, which is especially useful in high-dimensional datasets. Support Vector Regression, an extension of the Support Vector Machine algorithm, is effective at capturing nonlinear relationships by projecting data points into a higher-dimensional space.

To enhance the accuracy of these models, hyper-parameter optimization is conducted using the innovative Dragonfly Algorithm (DA). This optimization technique aims to adjust the parameters for each model, thereby improving their predictive performance.

This study presents a novel approach to hyperparameter optimization in the context of lyophilization modeling by combining the DA with a generalizability-specific objective function: the mean 5-fold R² score. This methodology is employed for the first time to improve the generalizability of machine learning models used for predicting chemical concentrations in pharmaceutical drying processes. The emphasis on model generalization, alongside the use of DA for hyperparameter tuning, marks a significant advancement over previous work, where such an integrated approach has not been explored. Moreover, this work demonstrates that the DA-enhanced SVR model not only outperforms traditional machine learning models but also exhibits superior generalization compared to earlier mechanistic models applied to similar contexts.

An extensive investigation of the application of DT, RR, and SVR models to chemical concentration prediction is presented in this work. The following sections detail the methodology, results, and discussion of how well these models estimate C, illuminating the complex relationship between chemical distribution and spatial position.

Materials and methods

Dataset

The dataset comprises over 46,000 data points with the coordinates X(m), Y(m), and Z(m) as inputs and the corresponding concentration (C) as the target output. Preprocessing involved outlier removal (using Isolation Forest), normalizing features (using a Min-Max scaler), and randomly splitting the data into train (~ 80%) and test (~ 20%) sets. The methodology developed by Alqarni11 is used for building the models. The pairwise distributions of dataset variables are shown in Fig. 115. The data used in this study can be accessed in the Supplementary File.
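As a minimal sketch of this preprocessing, assuming a scikit-learn workflow; the column names and the synthetic stand-in frame are illustrative, not the published dataset:

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import MinMaxScaler

# Synthetic stand-in for the real (outlier-filtered) dataset from the
# Supplementary File; column names are assumptions for illustration.
rng = np.random.default_rng(42)
data = pd.DataFrame(rng.random((1000, 4)), columns=["X", "Y", "Z", "C"])

X = data[["X", "Y", "Z"]].to_numpy()
y = data["C"].to_numpy()

# Random ~80/20 train-test split, as described above
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

# Min-Max scaling, fit on the training set only to avoid leakage
scaler = MinMaxScaler().fit(X_train)
X_train, X_test = scaler.transform(X_train), scaler.transform(X_test)
```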

The concentration distribution of moisture within the sample was estimated via numerical solution of the mass transfer equation, using the finite element method. The numerical simulations solved the convection-diffusion equation in unsteady-state mode. Moreover, conductive heat transfer was coupled to the mass transfer to track temperature variations across the sample over time. The model thus tracks the variation of moisture content and temperature with time and location, and the resulting data were used in the machine learning analysis.
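For reference, generic forms of the governing equations assumed here are shown below; the exact coefficients, source terms, and boundary conditions of the original simulation are not reproduced:

$$\frac{\partial C}{\partial t}+\mathbf{u}\cdot\nabla C=D\nabla^{2}C$$

$$\rho c_{p}\frac{\partial T}{\partial t}=\nabla\cdot\left(k_{t}\nabla T\right)$$

where \(\mathbf{u}\) is the velocity field, \(D\) the diffusion coefficient, \(\rho\) the density, \(c_{p}\) the specific heat capacity, and \(k_{t}\) the thermal conductivity.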

Fig. 1. Dataset pair plot.

Preprocessing

The Isolation Forest (IF) algorithm is an unsupervised, ensemble-based method that calculates an anomaly score for every point, thereby identifying outliers. The data are recursively split at random along particular features of interest16.

A key advantage of the IF algorithm is its information processing methodology. By employing decision trees to isolate anomalies instead of evaluating the complete set of points, the algorithm efficiently saves time and space17,18. Its subsampling approach builds each tree on a random subset of the data, which keeps the necessary number of partitions small.

In the preprocessing stage, a total of 973 data points were identified and removed as outliers using the IF algorithm to enhance the performance of ML models. This unsupervised method effectively detects anomalies by calculating an isolation score for each data point, ensuring that only relevant and reliable data were used for model training18. Additionally, a contamination parameter of 0.02 was applied, which is a reasonable and widely used threshold in outlier detection tasks.
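A minimal sketch of this outlier-removal step, assuming scikit-learn; the stand-in data frame is illustrative:

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import IsolationForest

# Synthetic stand-in for the raw dataset; in practice the real X, Y, Z, C
# columns would be used here.
rng = np.random.default_rng(0)
data = pd.DataFrame(rng.random((46000, 4)), columns=["X", "Y", "Z", "C"])

# contamination=0.02 matches the threshold reported above
iso = IsolationForest(contamination=0.02, random_state=42)
labels = iso.fit_predict(data)   # -1 marks outliers, 1 marks inliers
data_clean = data[labels == 1]   # the flagged points (~2%) are discarded
```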

Machine learning models

Decision tree regression (DT)

The Decision Tree (DT) regression algorithm operates on the core concept of breaking down a complex problem into a series of simpler tasks, each potentially leading to a more interpretable solution19,20. Decision trees are made up of a hierarchy of conditions that follow a sequential process from the tree’s root to its terminal nodes. The structure of decision trees is comprehensible and clear. Following training, decision trees allow the creation of logical rules that, by repeatedly splitting datasets into subgroups, can be used to forecast new datasets21.

A DT model is produced by repeatedly splitting the training set. Using particular criteria, the algorithm divides the data at each internal node, starting with the root node and continuing until a stopping condition is reached. Each leaf node then yields a simple, local regression model.
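A minimal sketch of such a regressor, assuming scikit-learn; the depth and leaf settings are placeholders rather than the DA-tuned values of Table 1, and X_train/y_train follow the naming of the preprocessing sketch above:

```python
from sklearn.tree import DecisionTreeRegressor

# Placeholder hyperparameters; the DA-tuned values are reported in Table 1
dt = DecisionTreeRegressor(max_depth=12, min_samples_leaf=5, random_state=42)
dt.fit(X_train, y_train)
y_pred_dt = dt.predict(X_test)
```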

Ridge regression (RR)

Ridge Regression is a linear approach utilized for analyzing labels by exploring their statistical correlations22. Relative to the ordinary least squares method (LSM), RR can produce estimates with lower variance. When implementing RR, the regression coefficients tend to be more consolidated, leading to fewer complications. A mathematical expression characterizing the ridge regression estimator is given below, where the ridge parameter k is typically varied over the range 0 to 1, X and Y denote the input and output data matrices, and M symbolizes the identity matrix23.

$$RR_{k}=\left(X^{\prime}X+kM\right)^{-1}X^{\prime}Y$$
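A small numpy sketch of this closed-form estimator, as a direct transcription of the formula rather than the library implementation used in the study:

```python
import numpy as np

def ridge_coefficients(X, Y, k):
    """Closed-form ridge estimator: (X'X + kM)^(-1) X'Y, with M the identity."""
    M = np.eye(X.shape[1])
    return np.linalg.solve(X.T @ X + k * M, X.T @ Y)
```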

Support vector regression (SVR)

SVM is a useful and reliable tool, offering nonlinear function approximation and strong generalization ability24. Support Vector Regression (SVR) is a regression technique based on SVM, which utilizes a non-linear support vector regressor with a kernel function \(K(x_{i},x)\) for a dataset \(\{(x_{i},y_{i})\}_{i=1}^{n}\), represented as25:

$$y=f(x)=\sum_{i=1}^{n}\omega_{i}K(x_{i},x)+b$$

where \(\omega_{i}\) are the weights and b denotes the bias term. The non-linear SVR primal optimization problem (with \(\epsilon\)-insensitive loss) can be written as25:

$$\min_{w,b,\xi_{i},\xi_{i}^{*}}\quad\frac{1}{2}\left\|w\right\|^{2}+C\sum_{i=1}^{n}\left(\xi_{i}+\xi_{i}^{*}\right)$$

subject to

$$y_{i}-\langle w,\phi(x_{i})\rangle-b\le\epsilon+\xi_{i},$$
$$\langle w,\phi(x_{i})\rangle+b-y_{i}\le\epsilon+\xi_{i}^{*},$$
$$\xi_{i},\;\xi_{i}^{*}\ge 0,\qquad i=1,\dots,n$$

where \(\phi(\cdot)\) denotes the (possibly implicit) feature mapping induced by the kernel \(K(x_{i},x_{j})=\langle\phi(x_{i}),\phi(x_{j})\rangle\).

By employing Lagrange multipliers \(a_{i}^{*}\) and \(a_{i}\), the dual form of the non-linear SVR can be expressed as26:

$$y=f(x)=\sum_{i=1}^{n}\left(a_{i}^{*}-a_{i}\right)K(x_{i},x)+b$$

Various kernel functions such as linear, Gaussian, polynomial, and sigmoid have been proposed. Previous research indicates that the Gaussian kernel is effective and provides accurate results. The widely used Gaussian kernel is defined as27:

$$K(x,x_{i})=\exp\left(-\frac{\left\|x-x_{i}\right\|^{2}}{2\sigma^{2}}\right)$$

In this equation, \(\sigma\) represents the width of the Gaussian kernel.
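A minimal sketch of an RBF-kernel SVR in scikit-learn; the hyperparameter values are placeholders rather than the DA-optimized values of Table 1, and the variable names follow the preprocessing sketch above:

```python
from sklearn.svm import SVR

# Placeholder hyperparameters; the DA-tuned values are reported in Table 1.
# In scikit-learn the RBF kernel is exp(-gamma * ||x - x_i||^2), so gamma
# corresponds to 1 / (2 * sigma^2) in the notation above.
svr = SVR(kernel="rbf", C=10.0, epsilon=0.01, gamma="scale")
svr.fit(X_train, y_train)
y_pred = svr.predict(X_test)
```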

Hyperparameter optimization (DA)

The swarming behavior of dragonflies served as inspiration for the development of the Dragonfly Algorithm (DA). It was proposed in 2016 as an alternative to well-established optimization algorithms like PSO and GA. The DA is a population-based optimization procedure that models the foraging behavior of dragonflies28.

The Dragonfly Algorithm uses a swarm of dragonflies to explore the search space. Each dragonfly embodies a prospective solution to the optimization problem, and individuals interact and communicate to discover the optimal solution. The algorithm also integrates a mutation operator, which helps in exploring novel regions of the search space.

DA represents each dragonfly by a position vector and a velocity vector. The position vector encodes one candidate solution to the optimization task, while the velocity vector specifies the direction and rate of travel29. Every solution is assessed for quality using a fitness function; solutions with higher fitness values are more likely to proliferate and flourish30,31.

Fig. 2. Workflow of DA algorithm.

The primary workflow of the DA, illustrated in Fig. 2, begins with the initialization of a population of dragonflies, each representing a potential solution. These individuals iteratively update their positions and velocities based on five key behavioral factors (separation, alignment, cohesion, attraction toward food, and distraction from enemies) that collectively simulate natural swarming dynamics. At each iteration, candidate solutions are evaluated using the defined fitness function, and the swarm progressively converges toward more optimal regions of the search space. The figure highlights how exploration and exploitation phases are balanced, with mutation operators further assisting in avoiding local optima. This structured workflow underpins the algorithm's effectiveness in tuning hyperparameters for complex machine learning models.

The Dragonfly Algorithm has been shown to be impactful in solving a broad range of optimization problems, including image segmentation, function optimization, and feature selection. In many cases, it outperforms other optimization algorithms like PSO and genetic algorithms32.

For hyperparameter optimization, the DA was employed with a population size of 40 dragonflies and a maximum of 120 iterations, while convergence was determined by a tolerance threshold of 1.2e-4 in the fitness improvement across consecutive iterations. In this study, the fitness function was defined as the mean 5-fold R² score of the model generated by each candidate solution, ensuring emphasis on generalization rather than overfitting. Compared with alternative meta-heuristic methods such as Genetic Algorithms (GA) and Particle Swarm Optimization (PSO), DA demonstrated competitive convergence behavior and offered better exploration–exploitation balance for the studied problem.
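A sketch of the fitness function used during tuning, the mean 5-fold R² of a candidate SVR; the DA driver itself (population 40, 120 iterations, tolerance 1.2e-4) is assumed to come from a separate implementation and is not shown:

```python
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVR

def fitness(candidate, X_train, y_train):
    """Mean 5-fold R^2 of an SVR built from one dragonfly's position vector."""
    C, epsilon, gamma = candidate  # hypothetical encoding of the position
    model = SVR(kernel="rbf", C=C, epsilon=epsilon, gamma=gamma)
    scores = cross_val_score(model, X_train, y_train, cv=5, scoring="r2")
    return scores.mean()           # DA seeks to maximize this value
```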

Evaluation metrics

Several performance metrics are used to compare and rank the outcomes of the optimized models in this section. The equations for these performance metrics are as follows33 (a brief computational sketch is given after the list):

1. Coefficient of Determination (R² Score):

$$R^{2}=1-\frac{\sum\left(C_{\text{true}}-C_{\text{pred}}\right)^{2}}{\sum\left(C_{\text{true}}-\bar{C}_{\text{true}}\right)^{2}}$$

where \(C_{\text{true}}\) are the true concentration values, \(C_{\text{pred}}\) are the predicted concentration values, and \(\bar{C}_{\text{true}}\) is the mean of the true concentration values.

2. Root Mean Square Error (RMSE):

$$\text{RMSE}=\sqrt{\frac{1}{n}\sum\left(C_{\text{true}}-C_{\text{pred}}\right)^{2}}$$

where n stands for the number of data points.

3. Mean Absolute Error (MAE):

$$\text{MAE}=\frac{1}{n}\sum\left|C_{\text{true}}-C_{\text{pred}}\right|$$

4. Maximum Error:

$$\text{Max Error}=\max\left(\left|C_{\text{true}}-C_{\text{pred}}\right|\right)$$
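As noted above, a brief sketch of these four metrics, assuming scikit-learn; y_test/y_pred follow the naming of the earlier sketches:

```python
import numpy as np
from sklearn.metrics import (max_error, mean_absolute_error,
                             mean_squared_error, r2_score)

r2   = r2_score(y_test, y_pred)
rmse = np.sqrt(mean_squared_error(y_test, y_pred))
mae  = mean_absolute_error(y_test, y_pred)
mx   = max_error(y_test, y_pred)
```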

Results and discussion

This section presents the results obtained from the application of three models—DT, RR, and SVR—to predict the concentration distribution in a pharmaceutical drying process. A range of performance metrics, including the Coefficient of Determination (R²), RMSE, MAE, and Maximum Error, were used to evaluate the models. The final optimized hyperparameters obtained using DA are listed in Table 1.

Table 1 Final optimized hyperparameters.

Model performance metrics

The R² scores for both the train and test subsets are summarized in Table 2. The SVR model demonstrated the highest R² scores, indicating superior predictive accuracy and generalization compared to the DT and RR models.

Table 2 R2 scores of final models.

Table 3 compares the RMSE, MAE, and Maximum Error rates on the test data. The SVR model consistently outperformed the other models, achieving the lowest error rates across all metrics.

Table 3 Error rates on test data.

To demonstrate the impact of DA hyperparameter tuning, we compared the performance of each model before and after optimization, as shown in Table 4. For all three models, optimization resulted in noticeable improvements. Specifically, the SVR model demonstrated the most significant enhancement, with the R² test score increasing from 0.9047 to 0.9992 after applying the DA. Similarly, both the Decision Tree and Ridge Regression models also showed improvements in their R² scores, highlighting the effectiveness of DA in fine-tuning hyperparameters and improving predictive accuracy across all models. These results emphasize the critical role of hyperparameter optimization in enhancing model performance and generalization, particularly for complex tasks such as predicting concentration in pharmaceutical processes.

Table 4 Comparison of models with optimized hyperparameters and default values.

Comparative analysis

Figs. 3, 4 and 5 display the actual concentration values contrasted with the predicted concentration values generated by the three models. SVR, optimized by the Dragonfly Algorithm, outperformed both the Decision Tree and Ridge Regression models across all evaluated metrics. The exceptionally high R² scores for SVR (0.999234 on the test set and 0.999187 on the training set) demonstrate its ability to accurately model the underlying patterns in the data. The minimal difference between the train and test R² scores also indicates that the SVR model generalizes well to unseen data, reducing the risk of overfitting.

Fig. 3. True concentration values compared to predicted concentration values using DT.

Fig. 4. True concentration values compared to predicted concentration values using RR.

Fig. 5. True concentration values compared to predicted concentration values using SVR.

The error metrics further validate the SVR model’s performance. The RMSE of 1.2619E-03 and MAE of 7.78946E-04 are notably lower than those of the other models, reflecting the model’s precision in predicting the concentration of the chemical compound. Additionally, the maximum error for SVR was the lowest at 5.18029E-03, indicating consistent accuracy even for the most challenging predictions.

Overall, the SVR model improved by the Dragonfly Algorithm accurately and reliably predicts chemical concentration in a spatial context, rendering it a suitable option for applications in chemical engineering and related domains. Thus, we designate this model as the top performer in our study. Through this model, the distinct impact of coordinates on the output is illustrated in Figs. 6, 7 and 8.

Although the SVR model clearly outperformed DT and RR in terms of predictive accuracy, it requires higher computational resources due to kernel operations and hyperparameter tuning. In contrast, DT and RR offer faster training and inference, which may be advantageous for real-time monitoring scenarios where rapid feedback is critical. Therefore, while SVR is most suitable for offline modeling and optimization, DT or RR could serve as lightweight alternatives in industrial drying systems where computational efficiency is prioritized over marginal gains in accuracy.

To further ensure robustness and mitigate possible bias from a single train–test split, a 5-fold cross-validation was performed on the optimized SVR model. The mean R² across folds was 0.99891, with fold-wise scores ranging from 0.99873 to 0.99912, indicating consistently high predictive accuracy. Similarly, the average RMSE and MAE values across folds were 1.39E-03 and 8.42E-04, respectively, both remaining close to the previously reported single-split results. These findings confirm that the SVR model maintains strong generalization ability across multiple data partitions, further supporting its suitability for industrial drying system simulations.
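A sketch of this robustness check; best_svr is a stand-in name for the DA-optimized SVR, and X, y for the preprocessed dataset of the earlier sketches:

```python
from sklearn.model_selection import cross_validate

cv = cross_validate(best_svr, X, y, cv=5,
                    scoring=("r2", "neg_root_mean_squared_error",
                             "neg_mean_absolute_error"))
print(cv["test_r2"].mean(),                            # mean 5-fold R^2
      -cv["test_neg_root_mean_squared_error"].mean(),  # mean RMSE
      -cv["test_neg_mean_absolute_error"].mean())      # mean MAE
```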

The excellent accuracy of the SVR model compared to DT and RR can be attributed to its ability to model complex non-linear relationships between spatial coordinates and concentration values more effectively. While the Decision Tree algorithm is highly interpretable, it often struggles with smooth function approximation and can overfit local patterns in dense datasets, even with pruning. Ridge Regression, being a linear method, is inherently limited in capturing non-linear trends present in the drying process, which involves intricate interactions between mass transfer dynamics and spatial variables. In contrast, SVR leverages the kernel trick, particularly the optimized radial basis function (RBF) kernel, to project data into a higher-dimensional space, enabling it to capture subtle non-linear dependencies. Hyperparameter optimization further enhanced its ability to generalize, as reflected by its near-perfect R² scores and minimal error metrics. This demonstrates that SVR is inherently better suited for modeling the highly non-linear relationships in pharmaceutical drying simulations.

To further examine model robustness, we performed a simple uncertainty analysis by perturbing the input coordinates with small random fluctuations (± 0.48–1% of their values). The SVR predictions remained highly stable, with R² values decreasing only marginally (from 0.9992 to 0.9987) and RMSE increasing by less than 8%. This negligible degradation highlights the model’s resilience to input noise and indicates reliable generalization under realistic experimental variations. Such robustness is particularly important for industrial applications, where measurement inaccuracies and process disturbances are unavoidable.
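A sketch of this perturbation check: test inputs are jittered by a small random relative amount and the metrics recomputed. The ±1% bound and the svr/X_test/y_test names follow the earlier sketches:

```python
import numpy as np
from sklearn.metrics import r2_score

rng = np.random.default_rng(0)
noise = rng.uniform(-0.01, 0.01, size=X_test.shape)  # up to ±1% relative jitter
X_noisy = X_test * (1.0 + noise)

r2_noisy = r2_score(y_test, svr.predict(X_noisy))    # compare against clean R^2
```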

Visualization interpretations

To better understand the predictive behavior of the optimized models, several visualizations were generated that illustrate the relationships between the spatial coordinates (X, Y, Z) and the predicted concentration values in the system. These plots provide intuitive insights into concentration gradients, spatial dependencies, and the impact of individual variables on model predictions, complementing the numerical performance metrics reported earlier.
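A sketch of how a 1D slice like Fig. 6 could be produced: sweep one coordinate while holding the others fixed, then predict with the trained SVR. The axis range and the use of matplotlib are assumptions, and svr/scaler follow the earlier sketches:

```python
import matplotlib.pyplot as plt
import numpy as np

x_sweep = np.linspace(0.0, 0.02, 200)                  # illustrative X range
grid = np.column_stack([x_sweep,
                        np.zeros_like(x_sweep),        # Y = 0 fixed
                        np.full_like(x_sweep, 0.01)])  # Z = 0.01 fixed

plt.plot(x_sweep, svr.predict(scaler.transform(grid)))
plt.xlabel("X (m)")
plt.ylabel("Predicted C (mol/m^3)")
plt.show()
```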

Figure 6 shows how the X-coordinate affects the predicted concentration (C) with Y = 0 and Z = 0.01 fixed. The non-linear curve reveals smooth concentration changes along the X-axis, indicating its role in the drying process. The SVR model accurately captures these variations, with peaks at specific X values reflecting spatial gradients in the sample.

Fig. 6. Concentration-X(m) partial dependency (y = 0, z = 0.01).

Figure 7 displays the Y-coordinate's impact on concentration (C) with X = 0 and Z = 0.01 held constant. The plot shows a non-linear trend, with concentration varying across Y values. The SVR model effectively detects these changes, highlighting areas of higher solute retention and the Y-coordinate's influence on drying dynamics. The lowest observed concentration can be attributed to the concentration gradient across the sample, arising from mass transfer by molecular diffusion.

Fig. 7. Concentration-Y(m) partial dependency (x = 0, z = 0.01).

Figure 8 shows the Z-coordinate’s effect on predicted concentration (C) with X = 0 and Y = 0 fixed. Concentration decreases nearly linearly with increasing Z, suggesting a strong, consistent decline along the Z-axis, likely tied to moisture removal in drying.

Fig. 8. Concentration-Z(m) partial dependency (x = 0, y = 0).

Figs. 9, 10 and 11 are 3D plots illustrating the combined effects of spatial coordinates on predicted concentration (C) in the pharmaceutical drying process, modeled by the SVR. Figure 9 shows the X and Z coordinates’ impact (Y = 0 fixed), displaying a non-linear surface with Z driving steeper concentration gradients, indicating its strong influence. Figure 10 depicts X and Y’s effect (Z = 0.01 constant), revealing a non-linear surface with concentration peaks, highlighting complex X-Y interactions. Figure 11 illustrates Y and Z’s influence (X = 0 fixed), with Z dominating the concentration decline and Y contributing subtler variations. The SVR model accurately captures these non-linear spatial dependencies, showcasing its effectiveness in simulating drying dynamics.

Fig. 9. Concentration as a function of X(m) and Z(m) - Constant Y = 0.

Fig. 10. Concentration as a function of X(m) and Y(m) - Constant Z = 0.01.

Fig. 11. Concentration as a function of Y(m) and Z(m) - Constant X = 0.

In addition, the 3D plot in Fig. 12 clearly shows a concentration gradient and effectively illustrates that Z(m) has the dominant effect on the predicted concentration. The solute concentration can be observed to change during the drying process, which is the outcome of mass transfer inside the sample. The model can also be built to track concentration versus time, to determine when the target point has been reached and the drying process can be stopped. Overall, the results reveal that combining mass transfer modeling and machine learning is a useful strategy for optimizing the freeze-drying process and finding the optimum drying time11. A similar trend has been observed for prediction of the temperature distribution in the freeze-drying process via ML11,15.

Fig. 12. Relationship between X(m), Y(m), and Z(m) with predicted concentration.

While this work concentrated on three representative baseline models (DT, RR, and SVR) to provide a transparent comparison under consistent optimization conditions, we recognize that more advanced approaches such as Random Forest, Gradient Boosting, and Neural Networks could potentially deliver further improvements. Moreover, emerging paradigms like graph representation learning have shown promising results in engineering applications and may offer powerful ways to capture spatial dependencies in future studies34,35. Nevertheless, for the present work, we intentionally restricted the scope to single models to maintain interpretability, reduce computational burden, and clearly demonstrate the performance gains achievable through Dragonfly Algorithm–based hyperparameter tuning.

Despite the promising outcomes, some limitations should be acknowledged. First, the study relied on a simulated dataset, and real experimental validation would be necessary to confirm the robustness of the proposed models under practical drying conditions. Second, although the Dragonfly Algorithm enhanced predictive accuracy, its computational cost may limit scalability in large-scale industrial systems. Finally, the present work focused exclusively on spatial concentration prediction; future investigations could extend the framework to incorporate temporal dynamics, variability in excipient composition, or process disturbances commonly encountered in pharmaceutical drying environments.

Conclusion

In this paper, we used a large dataset to make accurate predictions of concentration (C). We employed the Isolation Forest method for outlier detection and removal, followed by training three machine learning models to predict concentration (C). Hyperparameter optimization was performed using the DA.

This work shows that, in predicting concentration (C) based on the coordinates (X, Y, Z), SVR optimized using the DA considerably outperforms the DT and RR models. With an RMSE of 1.2619E-03, an MAE of 7.78946E-04, and an R² test score of 0.999234, the SVR model showed the highest accuracy. These results provide important new information for chemical engineering applications and validate SVR as a highly accurate and dependable technique for spatial concentration prediction.