A self-driving physical vapor deposition system making sample-specific decisions on the fly

Zheng, Yuanlong Bill; Blake, Connor; Mravac, Layla; Zhang, Fengxue; Chen, Yuxin; Yang, Shuolong

doi:10.1038/s41524-025-01805-0

Article
Open access
Published: 05 November 2025

npj Computational Materials volume 11, Article number: 327 (2025) Cite this article

Abstract

We present an autonomous physical vapor deposition system that integrates hardware automation, in-situ optical spectroscopy, and Bayesian machine learning into a complete self-driving laboratory framework making decisions on the fly. Using silver thin films as a model material, our platform efficiently navigates a complex parameter space through active learning. By introducing a thin physical layer, termed the calibration layer, the machine learning models adapt to sample-specific conditions on the fly and reliably predict the deposition conditions to achieve user-specified optical properties. Moreover, from the high-throughput experimental data, the algorithm systematically captures the complex parameter-property relationships that are challenging to deduce by conventional trial-and-error methods. This study demonstrates the potential of self-driving laboratories for both reducing human labor and gaining new understanding of materials, providing a streamlined approach to enable self-driving physical vapor deposition systems.

Introduction

The implementation of self-driving laboratories in materials science is a promising approach to accelerate material discovery and optimization^1,2,3. In physical vapor deposition (PVD) of thin-film materials, the traditional human-led process encompasses numerous cycles of selecting deposition parameters, performing deposition, characterizing film properties, and re-adjusting deposition parameters accordingly. The field eagerly needs to harness the capabilities of machine learning (ML) and robotics to streamline and accelerate this iterative process.

Several studies have sought to integrate ML with the PVD process. Typically, these approaches involved training ML models that map deposition parameters, such as substrate temperature, deposition rate, and flux ratio, to material properties, such as stoichiometry^4,5,6, electrical conductivity^7,8,9, surface morphology^10,11, crystallinity^{12,13,14,15,16,17}, and superconducting critical temperatures¹⁸. The trained models are then used to predict material properties, with Bayesian optimization (BO) frequently employed to autonomously determine the deposition parameters for subsequent samples^{9,19,20,21,22,23}. Yet, PVD is intrinsically sensitive to subtle differences in substrate conditions and chamber environments, challenging the notion of a definitive mapping between deposition parameters and sample properties, and making traditional BO-based models difficult to implement^8,18. This calls for sample-specific, on-the-fly decision making to determine the optimal deposition condition.

Beyond the incorporation of ML algorithms, the realization of self-driving PVD hinges on the complete automation of hardware systems. PVD systems require high vacuum (HV) or ultra-high vacuum (UHV) environments, which present significant challenges for fully automating sample transfer and characterization processes. As such, most studies in the field of ML-assisted thin-film deposition still rely on traditional manual handling of samples, which limits sample throughput and hinders the realization of fully autonomous PVD^9,18,24. Shimizu et al. demonstrated a system fully automating the deposition of Nb-doped TiO₂ films and the minimization of their resistance²⁵. However, the system requires a complex multi-chamber setup with sophisticated transfer mechanisms, which limits its large-scale deployment. Harris et al. developed an automated pulsed laser deposition (PLD) system with a wheel-like sample holder that holds up to 10 samples, though this capacity is insufficient for high-throughput experiments and reloading samples requires human intervention²³. Therefore, achieving self-driving PVD systems with a streamlined setup is crucial for the development of the field²⁶.

In this work, we extend the concept of ML-aided PVD by developing a fully self-driving PVD platform. Our system integrates a UHV chamber with a 72-slot robotic sample handling system, in-situ optical characterization, and machine learning into a closed-loop workflow. We demonstrate the autonomous deposition of silver thin films with optical reflectivities deviating from user-specified targets by less than 0.025 in an average of 2.3 attempts. Additionally, the platform employs a calibration strategy that systematically captures the effect of fluctuations in the deposition conditions on the film properties. It further uncovers nuanced relationships, such as those between effusion cell temperature and sample absorptivity, and explores the extent to which the optical spectrum can be engineered. These results validate a scalable self-driving PVD platform and highlight the transformative potential of self-driving laboratories to drastically accelerate material discovery and optimization.

Results

System design

To demonstrate the principles of self-driving PVD, we seek to fabricate silver thin films with user-specified optical properties. The optical properties of silver thin films are extremely sensitive to deposition parameters. In general, effusion cell temperature sets the arrival rate of adatoms, and deposition time determines the film thickness and microstructure. Reflectivity rises with thickness and approaches the bulk limit as the film becomes optically opaque²⁷. Ultra-thin silver first nucleates as isolated islands that gradually coalesce into a continuous layer as the thickness increases, shifting the effective dielectric function during this process²⁸. Higher cell temperatures increase adatom flux, shortening surface-relaxation times, leading to a smaller void fraction and a larger extinction coefficient²⁹. Grain size further tunes free-electron scattering and thus both the real and imaginary parts of the refractive index^30,31. Because these mechanisms are intertwined, they are difficult to model with simple physical laws, which warrants ML-driven material optimization and makes silver thin films an ideal testbed for self-driving PVD.

The automated PVD system with in-situ optical characterization capabilities is described in Methods (Fig. 1a). The deposition parameters are effusion cell temperature (T) and deposition time (t). The transmitted (P_t) and reflected power (P_r) of the deposited silver thin films are measured, and we define the reflected power ratio ${\mathscr{R}}$ and absorptivity ${\mathscr{A}}$:

$${\mathscr{R}}=\frac{{P}_{r}}{{P}_{r}+{P}_{t}},\quad {\mathscr{A}}=\frac{{P}_{i}-{P}_{r}-{P}_{t}}{{P}_{i}}$$

where P_i denotes the incident power. For convenience, the reflected power ratio and the absorptivity at 443 nm are denoted as ${{\mathscr{R}}}_{443}$ and ${{\mathscr{A}}}_{443}$, respectively, and similarly for other wavelengths. To maximize data acquisition efficiency, the system cycles through the wavelengths in sequence during deposition and performs a measurement at each one (Supplementary Fig. 1). Each complete cycle takes 98 seconds, as determined by the linear rail speed and a 5-second measurement period per wavelength.
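
The two definitions above can be written as a short sketch. The function names and the power readings are hypothetical and chosen only for illustration; they are not taken from the authors' MATLAB scripts.

```python
def reflected_power_ratio(p_r, p_t):
    """Reflected power ratio R = P_r / (P_r + P_t)."""
    total = p_r + p_t
    if total <= 0:
        raise ValueError("P_r + P_t must be positive")
    return p_r / total


def absorptivity(p_i, p_r, p_t):
    """Absorptivity A = (P_i - P_r - P_t) / P_i."""
    if p_i <= 0:
        raise ValueError("P_i must be positive")
    return (p_i - p_r - p_t) / p_i


# Hypothetical power readings (arbitrary but consistent units).
p_i, p_r, p_t = 10.0, 6.0, 3.0
R = reflected_power_ratio(p_r, p_t)  # 6 / 9
A = absorptivity(p_i, p_r, p_t)      # 1 / 10
```

Note that the reflected power ratio is normalized by the non-absorbed power rather than by the incident power, so the two quantities are related by R(1 - A) = P_r / P_i.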

**Fig. 1: A self-driving physical vapor deposition system for silver thin-film deposition.**

All hardware control and data acquisition are managed by MATLAB scripts, which allow the system to deposit and characterize up to 72 samples consecutively without human intervention. The optical characterization data are fed into the ML algorithm, which predicts the deposition condition required for the sample to attain user-specified ${\mathscr{R}}$ values at targeted wavelengths. Our optical measurement workflow differs from previous thin-film deposition works that implemented in-situ, real-time optical reflectivity measurements: there, the measurement results served as a monitor of the film thickness²³ or provided parameters for solving equations that govern the deposition process³², and were not used to train an ML model.

Calibration layer

In thin-film deposition, slight variations in substrate and chamber conditions can significantly affect deposition dynamics³³. These factors, such as substrate surface roughness and the composition of the residual gas in the chamber, cannot be measured exhaustively and can make the thin-film deposition process irreproducible^8,18. Since noise in the training data can significantly degrade ML model performance, the field of ML-assisted thin-film deposition needs a systematic approach to effectively account for these “hidden parameters”.

To account for this variability, we introduce a physical calibration layer. For each sample, we first deposit a calibration layer under a set of universal conditions: an effusion cell temperature of 875 °C and a deposition time of 1000 seconds. This initial layer, approximately 5 nm thick, serves as a probe of the intrinsic variations in the deposition condition. By measuring the ${{\mathscr{R}}}_{689}$ of this layer, denoted as ${{\mathscr{R}}}_{c}$, we obtain a quantitative indicator that partially captures the effect of the hidden parameters and enables the self-driving system to adapt to the specific substrate and chamber conditions on the fly. Figure 2 schematically illustrates this two-stage process: the calibration layer deposition, followed by the primary deposition with measurements at alternating wavelengths. Table 1 provides a snippet of the dataset structure. Note that ${{\mathscr{R}}}_{c}$, though a measured value, is treated as an input parameter of the model in the dataset. For further details and performance assessment of this approach, please refer to Supplementary Information Section 2.

**Fig. 2: Illustration of the data acquisition process for each iteration in the active learning stage.**

Table 1 Exemplary dataset

Machine learning setup

We employ Gaussian Process Regression (GPR), a flexible, non-parametric, and probabilistic machine learning method, for mapping the deposition parameters to the optical properties of the silver thin films. Unlike conventional regression techniques that output a single predicted value, GPR provides a probabilistic framework that estimates both the predicted mean and the associated uncertainty³⁴. Denoting the GPR output as

$${{\rm{GPR}}}_{y}({\bf{x}})=\left({\mu }_{y}({\bf{x}}),\,{\sigma }_{y}({\bf{x}})\right),$$

we refer to μ_y as the “predicted mean” and σ_y as the “predicted uncertainty.” Two separate models, ${{\rm{GPR}}}_{{\mathscr{R}}}$ and ${{\rm{GPR}}}_{{\mathscr{A}}}$, are trained for the reflected power ratio ${\mathscr{R}}$ and the absorptivity ${\mathscr{A}}$, respectively, each employing a Radial Basis Function (RBF) kernel. In both cases, the input vector is

$${\bf{x}}=\left(T,\,t,\,\lambda ,\,{{\mathscr{R}}}_{c}\right),$$

where T is deposition temperature, t is deposition time, λ is incident wavelength, and ${{\mathscr{R}}}_{c}$ represents the calibration-layer reflectance. The corresponding model outputs are $({\mu }_{{\mathscr{R}}},{\sigma }_{{\mathscr{R}}})$ and $({\mu }_{{\mathscr{A}}},{\sigma }_{{\mathscr{A}}})$.

Our approach adopts the explore-then-commit strategy widely used in BO^35,36,37 and is divided into two stages: active learning and adaptive testing. In the active learning stage, the ground-truth growth conditions and optical characterization data are used to train ML models, which then determine the next point to explore according to the model uncertainty (Fig. 1a). This process efficiently navigates the complex input space. In the adaptive testing stage, the trained model guides the system to fabricate silver thin films with user-specified optical properties and constantly adapts to its own prediction errors (Fig. 1b). The model is continuously updated with testing data, enabling ongoing refinement and improved predictive accuracy.

Active learning

The model is first initialized by pre-training on 9 samples, for which the deposition parameter T is uniformly sampled within the range of [820, 880] °C and each sample is deposited for a time up to

$$t_{{\max}}(T) = 3.22 \times 10^{14} \times e^{-0.0285T (^{\circ} C)} ({\text{seconds}})$$

(1)

where ${t}_{\max }$ is determined such that each sample reaches a thickness yielding an ${\mathscr{R}} > 0.8$ for all wavelengths. This ensures the generation of data over a wide range of ${\mathscr{R}}$ values for training the model, while also avoiding the unnecessary time spent depositing films until ${\mathscr{R}}$ asymptotically approaches 1. The functional form of ${t}_{\max }(T)$ is motivated by the fact that vapor pressure (and hence the deposition rate) increases exponentially with temperature.
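
As a quick numerical illustration of Eq. (1), the sketch below evaluates the maximum deposition time at the two ends of the temperature range. The function name is hypothetical; only the constants come from Eq. (1).

```python
import math


def t_max_seconds(temp_c):
    """Eq. (1): maximum deposition time in seconds for a cell temperature in deg C."""
    return 3.22e14 * math.exp(-0.0285 * temp_c)


t_cold = t_max_seconds(820.0)  # longest deposition, at the lowest temperature
t_hot = t_max_seconds(880.0)   # shortest deposition, at the highest temperature
ratio = t_cold / t_hot         # equals exp(0.0285 * 60), independent of the prefactor

print(f"t_max(820 C) = {t_cold:.0f} s, t_max(880 C) = {t_hot:.0f} s, ratio = {ratio:.2f}")
```

Under Eq. (1), raising the cell temperature by 60 °C shortens the maximum deposition time by a factor of exp(1.71), roughly 5.5, which is consistent with an exponentially increasing deposition rate.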

The system proceeds to the active learning stage. After each sample’s calibration layer is deposited and ${{\mathscr{R}}}_{c}$ is measured, T for the subsequent deposition is selected within the range of [820, 880] °C according to:

$${T}_{{\rm{selected}}}=\arg \mathop{\max }\limits_{T}\left\{\overline{{\sigma }_{{\mathscr{R}}}}(T,{{\mathscr{R}}}_{c})\right\}.$$

(2)

where

$$\overline{{\sigma }_{{\mathscr{R}}}}(T,{{\mathscr{R}}}_{c})=\frac{\sqrt{\mathop{\sum }\nolimits_{t = 0}^{{t}_{\max }(T)}{\sum }_{\lambda }{\sigma }_{{\mathscr{R}}}{(T,{{\mathscr{R}}}_{c},t,\lambda )}^{2}}}{{t}_{\max }(T)}.$$

(3)

Here, $\overline{{\sigma }_{{\mathscr{R}}}}(T,{{\mathscr{R}}}_{c})$ represents the uncertainty of ${\mathscr{R}}$ averaged over time and wavelengths. Because every deposition yields data at all measurement times and wavelengths, as depicted in Fig. 2, we collapse ${\sigma }_{{\mathscr{R}}}(T,{{\mathscr{R}}}_{c},t,\lambda )$ into $\overline{{\sigma }_{{\mathscr{R}}}}(T,{{\mathscr{R}}}_{c})$. The 98-second data collection interval is short enough that missing a point of high interest is not a concern. Moreover, ${{\mathscr{R}}}_{c}$ is a measured variable reflecting the substrate and chamber conditions and cannot be chosen by the model. Hence, this BO problem, originally complex and 4-dimensional, is effectively reduced to a 1-dimensional constrained optimization problem. The model selects the value of T that maximizes $\overline{{\sigma }_{{\mathscr{R}}}}$, thereby capturing information at the point in the T space with the greatest uncertainty. The sample is then deposited at T_selected and measured at a series of times until $t={t}_{\max }({T}_{{\rm{selected}}})$, specified by Eq. (1).
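
The selection rule in Eqs. (2) and (3) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the argmax is found by a simple grid search over T, the time axis is sampled once per 98-second cycle, and `toy_sigma` is a hypothetical stand-in for the GPR predicted uncertainty.

```python
import math

WAVELENGTHS_NM = (443, 514, 689, 781, 817)  # laser wavelengths listed in Methods
CYCLE_S = 98.0                              # one measurement cycle, from the text


def t_max_seconds(temp_c):
    """Eq. (1): maximum deposition time in seconds for a cell temperature in deg C."""
    return 3.22e14 * math.exp(-0.0285 * temp_c)


def averaged_uncertainty(sigma, temp_c, r_c):
    """Eq. (3): root of the summed squared uncertainty, divided by t_max(T)."""
    t_end = t_max_seconds(temp_c)
    n_cycles = int(t_end // CYCLE_S)
    total = 0.0
    for k in range(n_cycles + 1):  # assumed sampling: t = 0, 98, 196, ... <= t_max
        t = k * CYCLE_S
        for lam in WAVELENGTHS_NM:
            total += sigma(temp_c, t, lam, r_c) ** 2
    return math.sqrt(total) / t_end


def select_temperature(sigma, r_c, t_lo=820.0, t_hi=880.0, n_grid=61):
    """Eq. (2): return the candidate T with the largest averaged uncertainty."""
    step = (t_hi - t_lo) / (n_grid - 1)
    candidates = [t_lo + i * step for i in range(n_grid)]
    return max(candidates, key=lambda temp_c: averaged_uncertainty(sigma, temp_c, r_c))


def toy_sigma(temp_c, t, lam, r_c):
    """Hypothetical uncertainty model: the region near 850 deg C is poorly explored."""
    return 0.02 + 0.5 * math.exp(-((temp_c - 850.0) / 5.0) ** 2)


t_selected = select_temperature(toy_sigma, r_c=0.03)
print(f"Next deposition temperature: {t_selected:.1f} C")
```

Because the calibration-layer value is fixed by measurement before the search starts, only T is scanned, which is what makes the problem one-dimensional.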

Figure 3(a) shows the evolution of $\overline{{\sigma }_{{\mathscr{R}}}}$ during active learning. After 8 iterations, $\overline{{\sigma }_{{\mathscr{R}}}}$ becomes more uniformly distributed over the entire parameter space and has an average value of 0.032. The uncertainty does not decrease further below this level. The remaining uncertainty is likely due to hidden parameters unaccounted for by the calibration layer as well as measurement noise (e.g., uncertainty in laser power measurements). The maximum of $\overline{{\sigma }_{{\mathscr{R}}}}$ in the parameter space converges to 0.056 after 8 iterations (Fig. 3(b)), signaling the appropriate point to terminate the active learning process.

**Fig. 3: Performance of active learning in the fabrication of silver thin films.**

Since the goal of the system is to achieve arbitrary ${\mathscr{R}}$ requests at the specified wavelengths, we employ active learning to comprehensively sample the full parameter space. Since every target ${\mathscr{R}}$ falls within the range the model has already explored, acquisition functions that seek to push the search outside this range (e.g., expected improvement) are not appropriate for this task^38,39.

Adaptive testing: single-wavelength ${\mathscr{R}}$ targets

We select 5 random single-wavelength targets ${{\mathscr{R}}}_{\lambda }^{{\rm{target}}}$, one for each laser wavelength. For a given ${{\mathscr{R}}}_{c}$, there exist infinitely many (T, t) pairs that can achieve the requested ${\mathscr{R}}$ at the specified wavelength. This degeneracy is removed by also aiming for the minimum ${{\mathscr{A}}}_{\lambda }$. The loss function for each single-wavelength target is defined as

$$\begin{array}{rcl}{{\mathscr{L}}}_{\lambda }&=&{\left({\mu }_{{\mathscr{R}},\lambda }-{{\mathscr{R}}}_{\lambda }^{{\rm{target}}}\right)}^{2}+4{\sigma }_{{\mathscr{R}},\lambda }^{2}+{\mu }_{{\mathscr{A}},\lambda }^{2}+4{\sigma }_{{\mathscr{A}},\lambda }^{2}\\ &&\,\text{if}\,| {\mu }_{{\mathscr{R}},\lambda }-{{\mathscr{R}}}_{\lambda }^{{\rm{target}}}| > 0.01:\\ &&{{\mathscr{L}}}_{\lambda }\mapsto {{\mathscr{L}}}_{\lambda }+100{\left({\mu }_{{\mathscr{R}},\lambda }-{{\mathscr{R}}}_{\lambda }^{{\rm{target}}}\right)}^{2}\end{array}$$

(4)

The uncertainties of predicted ${\mathscr{R}}$ and ${\mathscr{A}}$ are added to the loss function to penalize deposition conditions with high uncertainties. The loss increases rapidly when the difference between the target and predicted ${\mathscr{R}}$ exceeds 0.01 to prioritize proximity to the target value of ${\mathscr{R}}$.
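
Eq. (4) translates directly into a small function. The sketch below is illustrative only: the function name and the example values for the predicted means and uncertainties are hypothetical.

```python
def single_wavelength_loss(mu_r, sigma_r, mu_a, sigma_a, r_target):
    """Eq. (4): squared target error plus uncertainty and absorptivity penalties."""
    err = mu_r - r_target
    loss = err ** 2 + 4 * sigma_r ** 2 + mu_a ** 2 + 4 * sigma_a ** 2
    if abs(err) > 0.01:
        # Extra penalty once the predicted R is more than 0.01 from the target.
        loss += 100 * err ** 2
    return loss


# Hypothetical model outputs for two candidate deposition conditions.
on_target = single_wavelength_loss(mu_r=0.50, sigma_r=0.03, mu_a=0.05, sigma_a=0.02, r_target=0.50)
off_target = single_wavelength_loss(mu_r=0.53, sigma_r=0.03, mu_a=0.05, sigma_a=0.02, r_target=0.50)
print(on_target, off_target)
```

With these example values, a 0.03 miss in the predicted reflected power ratio raises the loss by more than an order of magnitude, which is the prioritization the text describes.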

The loss function is minimized using the Adam optimizer⁴⁰. The set of deposition parameters (T, t) at the minimum of the loss function is used for deposition. After deposition, the model is updated with the new measurement data, and the sample’s measured ${{\mathscr{R}}}_{\lambda }$ is compared to ${{\mathscr{R}}}_{\lambda }^{{\rm{target}}}$. If

$$| {{\mathscr{R}}}_{\lambda }^{{\rm{measured}}}-{{\mathscr{R}}}_{\lambda }^{{\rm{target}}}| < 0.025$$

(5)

the deposition is considered successful, and the system moves to the next target. If unsuccessful, the model adapts to the new data and re-attempts the target until success. The threshold of 0.025 is chosen because this level of accuracy is sufficient for most applications and closely matches our model’s predictive performance (mean absolute error in pre-training cross-validation ≈ 0.029, see Supplementary Information Fig. 3), making it both practically meaningful and realistically attainable.

Table 2 displays the results for 5 single-wavelength targets. It took 2 attempts on average for a target to be successfully achieved. For each of the 10 samples deposited during this stage, the model predicts ${\mathscr{R}}$ at each of the 5 wavelengths. Across these 50 predictions, the mean absolute error (MAE) between the model predictions and measured results is 0.0246, which demonstrates the accuracy of the prediction. Moreover, the average model-predicted uncertainty over the parameter space is 0.0267, and its proximity to the MAE indicates that the model’s estimate of its own uncertainty is well calibrated.

Table 2 Adaptive testing results

To benchmark the effectiveness of the calibration layer and active learning, we perform a control experiment without these methods. In the control experiment, the model only goes through the pre-training process, in which 17 samples are grown at T between 820 and 880 °C in 3.75 °C increments, and for t as specified in Eq. (1). The system is then requested to produce silver thin films that satisfy the same 5 single-wavelength targets. Although the control experiment uses the same number of training samples as the previous experiment, it requires on average 3.6 attempts to successfully achieve each target. The MAE between the model predictions and measured results is 0.0618, also significantly higher than in the previous experiment (Fig. 3(c)). The control experiment hence demonstrates the superior performance of the model using calibration layers and active learning.

With strong predictive power, the trained models provide insights into the optimal deposition strategies that are otherwise difficult to deduce via human trial-and-error or physical intuition. These models can be used to explore various high-dimensional constrained optimization problems. As an illustration, we attempt to build on the single-wavelength target testing and generalize the strategy of choosing the effusion cell temperature T that minimizes ${{\mathscr{A}}}_{\lambda }$ for any ${{\mathscr{R}}}_{c}$, ${{\mathscr{R}}}_{\lambda }^{{\rm{target}}}$ and its associated λ. Our results show that the optimal temperature for minimizing ${{\mathscr{A}}}_{\lambda }$ varies significantly with ${{\mathscr{R}}}_{c}$. At lower ${{\mathscr{R}}}_{c}$ values, the model consistently predicts 880 °C to be the optimal temperature (Fig. 4). However, at higher ${{\mathscr{R}}}_{c}$ values, the trend of optimal temperature versus ${{\mathscr{R}}}_{c}$ is more complex and depends on the ${{\mathscr{R}}}_{\lambda }^{{\rm{target}}}$ and its associated λ. As ${{\mathscr{R}}}_{c}$ increases, the optimal temperature could increase (Fig. 4(a)), be relatively constant (Fig. 4(b)), decrease then increase (Fig. 4(c)), or decrease then remain constant (Fig. 4(d)). Note that T physically mostly reflects the deposition rate. This indicates that at different ${{\mathscr{R}}}_{c}$, or deposition environment, the deposition rate has an intricate effect on the silver thin films’ optical constants at various λ. These findings underscore the importance of adaptive decision making for each deposition process on the fly.

**Fig. 4: Offset plots of predicted absorptance ${\mathscr{A}}$ versus effusion cell temperature T for various ${{\mathscr{R}}}_{c}$ values at specific ${{\mathscr{R}}}_{\lambda }^{{\rm{target}}}$ and its associated λ.**

Such nuanced parameter-property maps would be challenging to discern through traditional methods, particularly when balancing multiple process variables and film characteristics. The models trained with the self-driving setup enable systematic exploration of complex relationships between parameters, showing their potential to uncover subtle dependencies that have eluded purely human-led research⁴¹.

Adaptive testing: multi-wavelength ${\mathscr{R}}$ targets

The autonomous PVD system also enables the fabrication of silver thin films satisfying multi-wavelength ${\mathscr{R}}$ targets, effectively specifying a desired spectrum. Depending on the application, one might aim for an ${\mathscr{R}}$ that remains relatively constant across wavelengths to produce a broadband beam splitter, or an ${\mathscr{R}}$ that varies sharply to form a high/low-pass optical filter. As an illustration, we define 2 multi-wavelength targets: $({{\mathscr{R}}}_{443}^{{\rm{target}}}=0.85,{{\mathscr{R}}}_{781}^{{\rm{target}}}=0.47)$ and $({{\mathscr{R}}}_{443}^{{\rm{target}}}=0.85,{{\mathscr{R}}}_{781}^{{\rm{target}}}=0.35)$. These choices exemplify efforts to achieve either a shallow or steep slope in ${{\mathscr{R}}}_{\lambda }$ vs λ.

The loss function for a multi-wavelength target is defined as:

$${\mathscr{L}}=\sum _{\lambda \in {\lambda }_{{\rm{targeted}}}}{\left({\mu }_{{\mathscr{R}},\lambda }-{{\mathscr{R}}}_{\lambda }^{{\rm{target}}}\right)}^{2}$$

(6)

As the number of wavelengths specified in the target increases, it is not guaranteed that there exists a set of (T, t), for a given ${{\mathscr{R}}}_{c}$, that produces a film with the desired optical properties. Therefore, if any ${\mu }_{{\mathscr{R}},\lambda }$ is still more than 0.01 away from its target after the loss function has been minimized, the algorithm determines that the target is unachievable with the given ${{\mathscr{R}}}_{c}$. It aborts the current sample, reports the result as “abort”, and moves on to the next substrate.
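
The multi-wavelength loss of Eq. (6) and the abort rule described above can be sketched as follows. The function names, the dictionary layout keyed by wavelength in nm, and the example predictions are assumptions made for illustration.

```python
def multi_wavelength_loss(mu_r, target):
    """Eq. (6): sum of squared errors between predicted and target R."""
    return sum((mu_r[lam] - target[lam]) ** 2 for lam in target)


def feasibility_decision(mu_r_at_optimum, target, tol=0.01):
    """Return "abort" if any predicted R at the loss minimum misses its target by more than tol."""
    worst = max(abs(mu_r_at_optimum[lam] - target[lam]) for lam in target)
    return "abort" if worst > tol else "deposit"


# Second multi-wavelength target from the text.
target = {443: 0.85, 781: 0.35}

# Hypothetical predictions at the loss minimum for two different calibration-layer values.
mu_low_rc = {443: 0.851, 781: 0.42}
mu_high_rc = {443: 0.852, 781: 0.355}

print(feasibility_decision(mu_low_rc, target))   # abort
print(feasibility_decision(mu_high_rc, target))  # deposit
```

The check runs on model predictions only, so an infeasible sample is rejected after the calibration layer and before any further deposition time is spent on it.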

After the deposition, it is considered successful if

$$\frac{1}{N}\sum _{\lambda \in {\lambda }_{{\rm{targeted}}}}\left\vert {{\mathscr{R}}}_{\lambda }^{{\rm{measured}}}-{{\mathscr{R}}}_{\lambda }^{{\rm{target}}}\right\vert < 0.025$$

(7)

where N is the number of wavelengths being targeted. If successful, the system moves to the next target. If unsuccessful, the model takes account of the new data and re-attempts the target until success.
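
The success criterion of Eq. (7) is a mean absolute error over the targeted wavelengths; for a single wavelength it has the same form as Eq. (5). A minimal sketch, with hypothetical function name and measured values:

```python
def is_successful(measured, target, threshold=0.025):
    """Eq. (7): mean absolute deviation over targeted wavelengths below the threshold."""
    errors = [abs(measured[lam] - target[lam]) for lam in target]
    return sum(errors) / len(errors) < threshold


# First multi-wavelength target from the text, with hypothetical measurements.
target = {443: 0.85, 781: 0.47}
measured = {443: 0.86, 781: 0.44}
multi_ok = is_successful(measured, target)          # mean error = 0.02

# Single-wavelength case (N = 1), again with a hypothetical measurement.
single_ok = is_successful({689: 0.53}, {689: 0.50})  # error = 0.03
```

Because Eq. (7) averages over wavelengths, a sample can pass even when one wavelength deviates by more than 0.025, as the 781 nm value does in the example.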

Table 2 displays the results using multi-wavelength targets. Our system achieves the two multi-wavelength targets in 6 deposition attempts, while for 4 other samples it decides that the ${{\mathscr{R}}}_{c}$ is unfavorable to achieve the target. Moreover, Fig. 5 displays the bounds of attainable spectra predicted by the model, given the accessible ${{\mathscr{R}}}_{c}$ values during this multi-wavelength testing. The 2 successful experimental spectra effectively explored this space of the spectrum, showcasing the versatility of the system in engineering the film’s optical response.

**Fig. 5: Predicted and experimental spectral bounds for silver thin films deposited with ${{\mathscr{R}}}_{443}$ fixed at 0.85.**

Furthermore, we investigate the rationale behind the system’s decision to abort certain deposition attempts, specifically for the second multi-wavelength target $({{\mathscr{R}}}_{443}^{{\rm{target}}}=0.85,{{\mathscr{R}}}_{781}^{{\rm{target}}}=0.35)$. The algorithm assesses feasibility based on the measured ${{\mathscr{R}}}_{c}$ of the sample. To evaluate this, we plot the minimum achievable ${{\mathscr{R}}}_{781}$ as a function of ${{\mathscr{R}}}_{c}$ while constraining ${{\mathscr{R}}}_{443}$ = 0.85 (Fig. 6a). The model predicts that for ${{\mathscr{R}}}_{c} < 0.0292$, the target ${{\mathscr{R}}}_{781}=0.35$ is unattainable, leading the system to abort the first few deposition attempts, which yield low ${{\mathscr{R}}}_{c}$ values. Later samples with higher ${{\mathscr{R}}}_{c}$ values show feasibility of meeting the target, prompting the system to proceed with the deposition (Fig. 6b).

**Fig. 6: Rationale for on-the-fly decision making on deposition attempts.**

Note that the data points in Fig. 6(a) are generated via brute-force iteration over all deposition parameter combinations, guaranteeing identification of the global minimum of ${{\mathscr{R}}}_{781}$ for each ${{\mathscr{R}}}_{c}$. In contrast, during experiments the algorithm selects deposition parameters on the fly by minimizing the loss function in Eq. (6), so it may converge to a local rather than the global minimum. This explains why the system still decides to abort samples 1, 2, and 4, even though their ${{\mathscr{R}}}_{c}$ values are slightly above the minimum threshold required to achieve the target. While this limitation could be addressed with a more efficient optimization algorithm, the overall decision-making trend of the algorithm remains rational and effective.

This decision-making process illustrates an advanced feature of the self-driving setup. The system not only executes experiments based on user-defined targets but also critically evaluates the feasibility of these targets. By aborting experiments unlikely to succeed, the system optimizes the use of time and resources, thereby establishing a more efficient and intelligent experimental workflow.

Discussion

We have demonstrated a fully self-driving physical vapor deposition (PVD) system that integrates advanced hardware automation, in-situ optical spectroscopy, and Bayesian machine learning to achieve targeted silver thin-film growth. This high-throughput setup enabled the deposition of 38 samples during the training and testing phases and the collection of over 20,000 data points without human intervention. By leveraging active learning and incorporating a calibration layer to account for hidden parameters, our system autonomously navigates a complex parameter space to reliably deposit films with optical properties that closely match user-specified targets.

We choose to work with silver thin films because they represent a simple material system that retains the challenging aspects of thin-film deposition. Although the functionality of the current setup is limited by factors such as the lack of control over substrate temperature, which will be addressed in future studies, the methods we have demonstrated (using pre-training to map the parameter space, active learning to minimize model uncertainty, and adaptive optimization to achieve specific targets) are broadly applicable to a wide range of thin-film deposition tasks.

Moreover, the in-situ optical measurements can be extended to various other characterization techniques, including spectroscopic methods such as spectroscopic ellipsometry and diffraction measurements such as reflective high-energy electron diffraction (RHEED) and low-energy electron diffraction (LEED). Because ellipsometry, RHEED, and LEED operate in situ, they lend themselves to high-throughput data collection and are attractive candidates for integration with ML^{16,42,43,44,45,46,47,48}. The calibration layer approach, in particular, can be expanded by incorporating additional checkpoints along the deposition trajectory to optimize in a higher-dimensional parameter space. As the number of checkpoints increases, this strategy could eventually lead to a real-time adaptive control framework, where deposition conditions are continuously updated based on immediate feedback. Our work constitutes a key step toward realizing such continuously self-adjusting thin-film growth processes.

Ultimately, our system not only streamlines experimentation, but also exhibits advanced features that pave the way for future self-driving laboratory frameworks. It demonstrates high-dimensional constrained optimization to achieve specified targets, an on-the-fly calibration-layer strategy that accounts for hidden deposition conditions, and real-time feasibility assessment that intelligently terminates unpromising trials. Together, these capabilities markedly elevate the system’s degree of autonomy and extend the frontier of current self-driving laboratories.

Methods

Self-driving physical vapor deposition system

The self-driving PVD system incorporates a shadow mask beneath a 72-slot sample handling system, ensuring that only one sample is exposed to the deposition source at a time (Fig. 1a). Silver (99.999%, Thermo Fisher) is deposited onto double-side polished BK7 glass (MTI) substrates using an effusion cell (MBE-Komponenten) at a base pressure of <5 × 10⁻⁹ mbar and a deposition pressure of 1 × 10⁻⁸ mbar.

The reflectivity and absorptivity of the silver thin films are characterized using five p-polarized lasers with wavelengths (λ) of 443, 514, 689, 781, and 817 nm (Coherent StingRay). The lasers are mounted on a linear rail pointing at the substrate with an incident angle of 45 degrees.

Gaussian process regression model

Let ${\bf{X}}\in {{\mathbb{R}}}^{N\times 4}$ denote the matrix of standardized inputs (temperature T, time t, wavelength λ, and calibration layer reflectance ${{\mathscr{R}}}_{c}$), and ${\bf{y}}\in {{\mathbb{R}}}^{N}$ the vector of ${\mathscr{R}}$ or ${\mathscr{A}}$. A Gaussian-process prior is placed on the latent function $f:{{\mathbb{R}}}^{4}\to {\mathbb{R}}$:

$$f({\bf{x}})\, \sim \,{\mathcal{GP}}\left(\mu ({\bf{x}}),\,k({\bf{x}},{{\bf{x}}}^{{\prime} })\right),\quad \mu ({\bf{x}})=0,$$

where μ(x) = 0 is the prior mean, and

$$k({\bf{x}},{{\bf{x}}}^{{\prime} })={\sigma }_{f}^{2}\exp \left(-\frac{1}{2}{({\bf{x}}-{{\bf{x}}}^{{\prime} })}^{\top }{\Lambda }^{-1}({\bf{x}}-{{\bf{x}}}^{{\prime} })\right)$$

$$\Lambda ={\rm{diag}}\left({\ell }_{1}^{2},\ldots ,{\ell }_{4}^{2}\right)$$

Here f_i = f(x_i) denotes the latent function value at x_i, ${\sigma }_{f}^{2}$ is the signal variance determining Var[f(x)] under the prior, and ℓ_j is the length-scale along the jth input dimension.
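
The kernel above, with one length-scale per input dimension, can be evaluated with a few lines of code. This is a minimal sketch; the function name and the example hyperparameter values are hypothetical and are not the fitted values from the study.

```python
import math


def rbf_kernel(x, x_prime, signal_var, length_scales):
    """RBF kernel with Lambda = diag(l_1^2, ..., l_d^2), one length-scale per dimension."""
    scaled_sq_dist = sum(
        ((a - b) / ell) ** 2 for a, b, ell in zip(x, x_prime, length_scales)
    )
    return signal_var * math.exp(-0.5 * scaled_sq_dist)


# Standardized inputs (T, t, lambda, R_c); all numbers are hypothetical.
x1 = (0.0, 0.0, 0.0, 0.0)
x2 = (0.5, 0.0, 0.0, 0.0)      # differs from x1 only along T
length_scales = (0.5, 1.0, 1.0, 1.0)

k_same = rbf_kernel(x1, x1, signal_var=2.0, length_scales=length_scales)
k_shift = rbf_kernel(x1, x2, signal_var=2.0, length_scales=length_scales)
```

For identical inputs the kernel returns the signal variance, and for two points separated by exactly one length-scale along a single dimension it returns the signal variance multiplied by exp(-1/2).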

Observations are assumed noisy:

$${y}_{i}={f}_{i}+{\varepsilon }_{i},\quad {\varepsilon }_{i} \sim {\mathcal{N}}(0,{\sigma }_{n}^{2}),$$

so that

$$p({y}_{i}| {f}_{i})={\mathcal{N}}\left({y}_{i}| {f}_{i},\,{\sigma }_{n}^{2}\right),$$

where σ_n is the noise standard deviation.
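
Under this Gaussian noise model, the predictive mean and variance of f at a new input follow in closed form from standard Gaussian-process regression (Rasmussen & Williams). The sketch below shows that textbook computation with NumPy. It is not the code used in this study; the helper names, toy data, and hyperparameter values are assumptions made for illustration.

```python
import numpy as np

def se_ard(A, B, sf2, ell):
    """Kernel matrix between rows of A and rows of B (ARD squared exponential)."""
    A, B = A / ell, B / ell
    d2 = np.sum(A**2, axis=1)[:, None] + np.sum(B**2, axis=1)[None, :] - 2.0 * A @ B.T
    return sf2 * np.exp(-0.5 * np.maximum(d2, 0.0))

def gp_predict(X, y, X_new, sf2, ell, sn2):
    """Posterior mean and variance of the latent f at X_new, zero prior mean."""
    K_y = se_ard(X, X, sf2, ell) + sn2 * np.eye(len(X))   # K + sigma_n^2 I
    L = np.linalg.cholesky(K_y)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))   # (K + sigma_n^2 I)^{-1} y
    K_s = se_ard(X, X_new, sf2, ell)
    mean = K_s.T @ alpha
    V = np.linalg.solve(L, K_s)
    var = sf2 - np.sum(V**2, axis=0)                      # add sn2 for a noisy observation
    return mean, var

# Toy data (assumed): two training points in the 4-D standardized input space.
X = np.array([[0.0, 0.0, 0.0, 0.0], [1.0, 0.0, 0.0, 0.0]])
y = np.array([1.0, -1.0])
X_new = np.array([[0.0, 0.0, 0.0, 0.0], [10.0, 10.0, 10.0, 10.0]])
mean, var = gp_predict(X, y, X_new, sf2=1.0, ell=np.ones(4), sn2=1e-4)
```

At the first query point, which coincides with a training input, the prediction stays close to the observed value with small variance; at the second, far from all data, the prediction reverts to the zero prior mean with variance close to σ_f².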

Hyperparameters $\theta =\{{\sigma }_{f}^{2},{\ell }_{1},{\ell }_{2},{\ell }_{3},{\ell }_{4},{\sigma }_{n}^{2}\}$ are learned by minimizing the negative log-marginal likelihood

$${\mathcal{L}}(\theta )=-\log p({\bf{y}}| {\bf{X}},\theta ),$$

using the Adam optimizer with a learning rate of 0.1. Each optimizer step over all N points constitutes one epoch.
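
The training procedure can be sketched as follows. This is an illustrative NumPy version under several assumptions that the text does not specify: hyperparameters are optimized in log space to keep them positive, gradients are approximated by central finite differences as a stand-in for automatic differentiation, a small jitter is added to the diagonal for numerical stability, and the data set, initial values, and epoch count are synthetic placeholders. Only the objective, the choice of Adam, and the learning rate of 0.1 are taken from the description above.

```python
import numpy as np

def kernel_matrix(X, log_sf2, log_ell):
    """ARD squared-exponential kernel matrix over the rows of X."""
    Z = X / np.exp(log_ell)
    sq = np.sum(Z**2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * Z @ Z.T
    return np.exp(log_sf2) * np.exp(-0.5 * np.maximum(d2, 0.0))

def nlml(params, X, y):
    """Negative log-marginal likelihood -log p(y | X, theta) for a zero-mean GP."""
    d = X.shape[1]
    log_sf2, log_ell, log_sn2 = params[0], params[1:1 + d], params[1 + d]
    K_y = kernel_matrix(X, log_sf2, log_ell) + (np.exp(log_sn2) + 1e-6) * np.eye(len(y))
    L = np.linalg.cholesky(K_y)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))
    return 0.5 * y @ alpha + np.sum(np.log(np.diag(L))) + 0.5 * len(y) * np.log(2 * np.pi)

def num_grad(f, p, eps=1e-5):
    """Central finite-difference gradient (assumed stand-in for autodiff)."""
    g = np.zeros_like(p)
    for i in range(len(p)):
        step = np.zeros_like(p)
        step[i] = eps
        g[i] = (f(p + step) - f(p - step)) / (2 * eps)
    return g

def fit_adam(X, y, epochs=100, lr=0.1, b1=0.9, b2=0.999, eps=1e-8):
    """Full-batch Adam on the log-hyperparameters; one step is one epoch."""
    params = np.zeros(X.shape[1] + 2)  # [log sf2, log ell_1..ell_d, log sn2], all start at log(1)
    m, v = np.zeros_like(params), np.zeros_like(params)
    history = []
    for t in range(1, epochs + 1):
        g = num_grad(lambda p: nlml(p, X, y), params)
        m = b1 * m + (1 - b1) * g
        v = b2 * v + (1 - b2) * g**2
        params = params - lr * (m / (1 - b1**t)) / (np.sqrt(v / (1 - b2**t)) + eps)
        history.append(nlml(params, X, y))
    return params, history

# Synthetic placeholder data: 40 points in 4 standardized dimensions.
rng = np.random.default_rng(0)
X = rng.normal(size=(40, 4))
y = np.sin(X[:, 0]) + 0.5 * X[:, 1] + 0.1 * rng.normal(size=40)
initial_loss = nlml(np.zeros(6), X, y)
params, history = fit_adam(X, y)
```

Comparing `history[-1]` with `initial_loss` gives a quick check that the optimizer has lowered the objective. In practice an automatic-differentiation framework would replace `num_grad`.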

Data availability

Data sets generated during the current study are available from the corresponding author on reasonable request.

Code availability

Code used in the current study is available from the corresponding author on reasonable request.

References

Tom, G. et al. Self-driving laboratories for chemistry and materials science. Chem. Rev. 124, 9633–9732 (2024).
Häse, F., Roch, L. M. & Aspuru-Guzik, A. Next-generation experimentation with self-driving laboratories. Trends Chem. 1, 282–291 (2019).
Abolhasani, M. & Kumacheva, E. The rise of self-driving labs in chemical and materials sciences. Nat. Synth. 2, 483–492 (2023).
Wakabayashi, Y. K. et al. Stoichiometric growth of SrTiO₃ films via Bayesian optimization with adaptive prior mean. APL Mach. Learn. 1, 026104 (2023).
Fébba, D. M. et al. Autonomous sputter synthesis of thin film nitrides with composition controlled by Bayesian optimization of optical plasma emission. APL Mater. 11, 071119 (2023).
Johnson, N. S., Mishra, A. A., Kirsch, D. J. & Mehta, A. Active learning for rapid targeted synthesis of compositionally complex alloys. Materials 17, 4038 (2024).
Ishiyama, T., Nozawa, K., Nishida, T., Suemasu, T. & Toko, K. Bayesian optimization-driven enhancement of the thermoelectric properties of polycrystalline III-V semiconductor thin films. NPG Asia Mater. 16, 17 (2024).
Shrivastava, A., Kalaswad, M., Custer, J. O., Adams, D. P. & Najm, H. N. Bayesian optimization for stable properties amid processing fluctuations in sputter deposition. J. Vac. Sci. Technol. A 42, 033408 (2024).
Wakabayashi, Y. K. et al. Machine-learning-assisted thin-film growth: Bayesian optimization in molecular beam epitaxy of SrRuO₃ thin films. APL Mater. 7, 101114 (2019).
Messecar, A. S., Durbin, S. M. & Makin, R. A. Quantum and classical machine learning investigation of synthesis–structure relationships in epitaxially grown wide band gap semiconductors. MRS Commun. 14, 660–666 (2024).
Shen, C. et al. Machine-learning-assisted and real-time-feedback-controlled growth of InAs/GaAs quantum dots. Nat. Commun. 15, 2724 (2024).
Kim, H. J. et al. Machine-learning-assisted analysis of transition metal dichalcogenide thin-film growth. Nano Converg. 10, 10 (2023).
Provence, S. R. et al. Machine learning analysis of perovskite oxides grown by molecular beam epitaxy. Phys. Rev. Mater. 4, 083807 (2020).
Guevarra, D. et al. Materials structure–property factorization for identification of synergistic phase interactions in complex solar fuels photoanodes. npj Comput. Mater. 8, 57 (2022).
Ni, Z. & Matsui, H. Phase control of heterogeneous Hf_XZr_(1-X)O₂ thin films by machine learning. Jpn. J. Appl. Phys. 61, SH1009 (2022).
Liang, H. et al. Application of machine learning to reflection high-energy electron diffraction images for automated structural phase mapping. Phys. Rev. Mater. 6, 063805 (2022).
Ament, S. et al. Autonomous materials synthesis via hierarchical active learning of nonequilibrium phase diagrams. Sci. Adv. 7, eabg4930 (2021).
Ohkubo, I. et al. Realization of closed-loop optimization of epitaxial titanium nitride thin-film growth via machine learning. Mater. Today Phys. 16, 100296 (2021).
Wakabayashi, Y. K. et al. Bayesian optimization with experimental failure for high-throughput materials growth. npj Comput. Mater. 8, 180 (2022).
Packwood, D. Bayesian Optimization for Materials Science (Springer Singapore, Singapore, 2017).
Shahriari, B., Swersky, K., Wang, Z., Adams, R. P. & De Freitas, N. Taking the human out of the loop: a review of Bayesian optimization. Proc. IEEE 104, 148–175 (2015).
Greenhill, S., Rana, S., Gupta, S., Vellanki, P. & Venkatesh, S. Bayesian optimization for adaptive experimental design: a review. IEEE Access 8, 13937–13948 (2020).
Harris, S. B. et al. Autonomous synthesis of thin film materials with pulsed laser deposition enabled by in situ spectroscopy and automation. Small Methods 8, 2301763 (2024).
Kusne, A. G. et al. On-the-fly closed-loop materials discovery via Bayesian active learning. Nat. Commun. 11, 5966 (2020).
Shimizu, R., Kobayashi, S., Watanabe, Y., Ando, Y. & Hitosugi, T. Autonomous materials synthesis by machine learning and robotics. APL Mater. 8, 111110 (2020).
Lo, S. et al. Review of low-cost self-driving laboratories in chemistry and materials science: the “frugal twin” concept. Digit. Discov. 3, 842–868 (2024).
Sun, X., Hong, R., Hou, H., Fan, Z. & Shao, J. Optical properties and structures of silver thin films deposited by magnetron sputtering with different thicknesses. Chin. Opt. Lett. 4, 366–369 (2006).
Zhao, P., Su, W., Wang, R., Xu, X. & Zhang, F. Properties of thin silver films with different thickness. Phys. E: Low. Dimens. Syst. Nanostruct. 41, 387–390 (2009).
Todorov, R., Lozanova, V., Knotek, P., Černošková, E. & Vlček, M. Microstructure and ellipsometric modelling of the optical properties of very thin silver films for application in plasmonics. Thin Solid Films 628, 22–30 (2017).
Savaloni, H. & Khakpour, A. R. Substrate temperature dependence on the optical properties of Cu and Ag thin films. Eur. Phys. J. Appl. Phys. 31, 101–112 (2005).
Savaloni, H. & Firouzi-Arani, M. Dependence of the optical properties of UHV deposited silver thin films on the deposition parameters and their relation to the nanostructure of the films. Philos. Mag. 88, 711–736 (2008).
Harris, S. B. et al. Online Bayesian state estimation for real-time monitoring of growth kinetics in thin film synthesis. Nano Lett. 25, 2444–2451 (2025).
Faeth, B. D. et al. Incoherent Cooper pairing and pseudogap behavior in single-layer FeSe/SrTiO₃. Phys. Rev. X 11, 021054 (2021).
Rasmussen, C. E. & Williams, C. K. I. Gaussian Processes for Machine Learning (The MIT Press, 2006).
Hillel, E., Karnin, Z. S., Koren, T., Lempel, R. & Somekh, O. Distributed exploration in multi-armed bandits. Adv. Neural Inform. Process. Syst. 26, 854–862 (2013).
Garivier, A., Lattimore, T. & Kaufmann, E. On explore-then-commit strategies. Adv. Neural Inform. Process. Syst. 29, 784–792 (2016).
Nie, G., Agarwal, M., Umrawal, A. K., Aggarwal, V. & Quinn, C. J. An explore-then-commit algorithm for submodular maximization under full-bandit feedback. In Proceedings of the Conference on Uncertainty in Artificial Intelligence, 1541–1551 (PMLR, 2022).
Gan, W., Ji, Z. & Liang, Y. Acquisition functions in Bayesian optimization. In Proceedings of the 2nd International Conference on Big Data & Artificial Intelligence & Software Engineering (ICBASE), 129–135 (IEEE, 2021).
Wilson, J., Hutter, F. & Deisenroth, M. Maximizing acquisition functions for Bayesian optimization. In Advances in Neural Information Processing Systems Vol. 31 (2018).
Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization https://arxiv.org/abs/1412.6980 (2017).
Kobayashi, W., Otsuka, T., Wakabayashi, Y. K. & Tei, G. Physics-informed Bayesian optimization suitable for extrapolation of materials growth. npj Comput. Mater. 11, 36 (2025).
Harris, S. B., Gemperline, P. T., Rouleau, C. M., Vasudevan, R. K. & Comes, R. B. Deep learning with reflection high-energy electron diffraction images to predict cation ratio in Sr_2xTi_2(1−x)O₃ thin films. Nano Lett. 25, 5867–5874 (2025).
Price, C. C. et al. Predicting and accelerating nanomaterial synthesis using machine learning featurization. Nano Lett. 24, 14862–14867 (2024).
Kaspar, T. C. et al. Machine-learning-enabled on-the-fly analysis of RHEED patterns during thin film deposition by molecular beam epitaxy. J. Vac. Sci. Technol. A 43, 032702 (2025).
Houser, E., McKnight, T. V., Redwing, J. M. & Peiris, F. C. Modeling the coverage of MoS₂ and WS₂ thin films using in-situ spectroscopic ellipsometry. J. Cryst. Growth 640, 127741 (2024).
Lee, K. K. et al. Using neural networks to construct models of the molecular beam epitaxy process. IEEE Trans. Semiconduct. Manuf. 13, 34–45 (2000).
Gliebe, K. & Sehirlioglu, A. Distinct thin film growth characteristics determined through comparative dimension reduction techniques. J. Appl. Phys. 130, 125301 (2021).
Vasudevan, R. K., Tselev, A., Baddorf, A. P. & Kalinin, S. V. Big-data reflection high energy electron diffraction analysis for understanding epitaxial film growth processes. ACS Nano 8, 10899–10908 (2014).

Acknowledgements

We thank Supratik Guha for helpful discussions. This work was primarily supported by the University of Chicago Big Idea Generator Seed Grant, with additional hardware support from the National Science Foundation (NSF CNS-2019131). Collaboration between the Yang and Chen groups was also supported by the National Science Foundation (NSF ECCS-2427944). This work made use of the shared facilities at the University of Chicago Materials Research Science and Engineering Center, supported by the National Science Foundation under award number DMR-2011854. This work made use of the Pritzker Nanofabrication Facility at the Pritzker School of Molecular Engineering at the University of Chicago, which receives support from the Soft and Hybrid Nanotechnology Experimental (SHyNE) Resource (NSF ECCS-2025633), a node of the National Science Foundation’s National Nanotechnology Coordinated Infrastructure.

Author information

Authors and Affiliations

Department of Physics, University of Chicago, Chicago, IL, USA
Yuanlong Bill Zheng
Pritzker School of Molecular Engineering, University of Chicago, Chicago, IL, USA
Yuanlong Bill Zheng, Connor Blake, Layla Mravac & Shuolong Yang
Department of Computer Science, University of Chicago, Chicago, IL, USA
Fengxue Zhang & Yuxin Chen

Authors

Yuanlong Bill Zheng
Connor Blake
Layla Mravac
Fengxue Zhang
Yuxin Chen
Shuolong Yang

Contributions

Y.B.Z. and S.Y. conceived the idea. Y.B.Z., C.B., and L.M. built the experimental setup. Y.B.Z., C.B., and F.Z. designed and deployed the machine learning workflow. Y.B.Z., C.B., L.M., and S.Y. wrote the manuscript. S.Y. and Y.C. supervised the project. All authors contributed to the discussion and review of the manuscript.

Corresponding author

Correspondence to Shuolong Yang.

Ethics declarations

Competing interests

The authors declare the following competing interest: The subject of this manuscript is protected in a pending patent application with the USPTO filed by The University of Chicago.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

About this article

Cite this article

Zheng, Y.B., Blake, C., Mravac, L. et al. A self-driving physical vapor deposition system making sample-specific decisions on the fly. npj Comput Mater 11, 327 (2025). https://doi.org/10.1038/s41524-025-01805-0

Received: 11 June 2025
Accepted: 13 September 2025
Published: 05 November 2025
Version of record: 05 November 2025
DOI: https://doi.org/10.1038/s41524-025-01805-0