Abstract
Laser powder bed fusion (L-PBF) parts often require post-processing prior to use in engineering applications to improve mechanical properties and modify the as-built surface topography. The ability to tune the L-PBF process parameters to obtain designer as-built surface topography could reduce the need for post-processing. However, the relationship between the as-built surface topography and the L-PBF process parameters is currently not well-understood. In this paper, we derive data-driven models from surface topography data and L-PBF process parameters using machine learning (ML) algorithms. The prediction accuracy of the data-driven models derived from ML algorithms exceeds that of the multivariate regression benchmark because the latter does not always capture the complex relationship between the as-built surface topography parameters and the corresponding L-PBF process parameters in a single best-fit equation. Data-driven models based on decision tree (interpretable) and artificial neural network (non-interpretable) algorithms display the highest prediction accuracy. We also show experimental evidence that thermocapillary convection and melt track overlap are important drivers of the formation of as-built surface topography.
1 Introduction
Laser powder bed fusion (L-PBF) is an additive manufacturing (AM) process in which a laser selectively melts and fuses metal powder particles in a layer-by-layer fashion to create macroscale parts with free-form geometry [1,2], e.g., to manufacture complex parts for biomedical [3] and aerospace [4] applications. In recent years, research and development of the L-PBF process has expanded the portfolio of available metal powders [5], increased process speed, and improved print quality by tuning the process parameters to increase resolution, reduce porosity, or reduce the as-built surface roughness [6].
Despite these advancements, L-PBF parts still require post-processing before use in engineering applications to reduce residual stress [7], improve mechanical properties [8], and modify the as-built surface topography [9]. Common post-processing methods include [10]; heat treatment to reduce porosity [11–13], chemical cleaning to remove unmelted metal powder particles from the as-built surfaces [14,15], and mechanical processing to physically modify the surface topography [16,17]. Even with these post-processing methods, tailoring the surface topography remains difficult. Hence, the ability to tune the L-PBF process parameters to obtain designer as-built surface topography is of interest, because it would reduce the need for costly post-processing. However, the relationship between the as-built surface topography and the L-PBF process parameters is currently not well-understood. Thus, we attempt to address this problem by deriving data-driven models from as-built surface topography data and the corresponding L-PBF process parameters using machine learning (ML) algorithms. This knowledge is important to contribute to transitioning L-PBF from a prototyping to a production manufacturing process.
Others have tried to predict the as-built surface topography from the L-PBF process parameters using trial-error, multivariate regression, or ML algorithms. Trial-error methodologies phenomenologically show the effect of the L-PBF process parameter on the surface topography without the use of formal statistical analysis. For instance, Eidt et al. [18] determined that increasing bulk laser power decreases the areal surface roughness (Sa, Sq, Sv, and Sp) of as-built Inconel 718 surfaces because the melt pool size increases with increasing bulk laser power, which smoothens the surface topography. Similarly, Triantaphyllou et al. [19] distinguished up-facing from down-facing as-built Ti–6Al–4V surfaces because the surface topography shows a predominance of valleys and peaks, respectively, which alters the skewness parameter Ssk.
Multivariate regression relates surface topography and L-PBF process parameters by means of a best-fit equation. Several publications documented the effect of L-PBF process parameters on the average surface roughness (Ra or Sa) of as-built surfaces and showed that Ra increases with increasing contour laser power [16], increasing hatch spacing (Calignano et al. [16]), and increasing build orientation [20] for up-facing surfaces. Alternatively, for down-facing surfaces, Ra increases with decreasing contour laser power [21] and decreasing build orientation [20]. Few publications also studied the effect of L-PBF process parameters on other R- and S- parameters. For instance, Fox et al. [22] measured the effect of build orientation on the surface peak count (Rpc), mean element profile width (Rsm), and mean element profile height (Rc) of as-built down-facing surfaces. They concluded that Rpc decreases and Rsm and Rc increase with increasing build orientation as a result of an increasing number of unmelted metal powder particles that adhere to the as-built surface. Whip et al. [23] observed that Sa, Sv, Smr2, and Svk decrease with increasing contour laser power and decreasing scan speed because of increasing melt track overlap. In our research group, Detwiler et al. [24] used multivariate regression to show that R- and S-parameters inaccurately characterize the surface topography of as-built L-PBF surfaces compared to deterministic surface topography parameters (summit density, standard deviation of summit heights, and mean summit radius).
Recently, researchers have also related the as-built surface topography to the L-PBF process parameters using data-driven models derived from ML algorithms. For instance, Khorasani et al. [25] derived a data-driven model using an artificial neural network (ANN) to predict the average surface roughness (Sa) of Ti–6Al–4V as-built surfaces as a function of L-PBF contour laser power, hatch spacing, scan speed, and pattern angle. They determined that Sa increases with increasing contour laser power and decreasing scan speed. Cao et al. [26] derived a data-driven model using a whale optimization algorithm (WOA) that enables minimizing the average surface roughness (Sa) of 316L steel as-built surfaces by tuning laser power, scan speed, and layer thickness. They determined that layer thickness shows the greatest effect on Sa. Similarly, Özel et al. [27] related the as-built surface topography parameters (Sa, Sq, Ssk, and Sku) of nickel 625 alloy to the energy density, contour laser power, and scan speed, using genetic programming and an ANN, and determined that models based on genetic programming show a higher prediction accuracy than those derived from an ANN.
Trial-error methods are time-consuming and do not necessarily lead to an optimal solution. Multivariate regression methods do not always capture the potentially complex relationship between surface topography parameters and L-PBF process parameters in a single best-fit equation that covers the entire range of the process parameters [28]. Additionally, the relationship and interaction between parameters can be challenging to visualize and interpret [29]. Publications that use ML algorithms to derive data-driven models that relate surface topography and process parameters only discuss ANN and genetic programming algorithms, without considering interpretable ML algorithms [25,27]. Furthermore, those publications focus solely on R- and S-parameters to describe the surface topography, even though it is well-known that those parameters do not uniquely describe the surface topography [30]. In contrast, deterministic surface topography parameters are based on actual surface features rather than averages or statistics and, thus, represent the surface topography more accurately than traditional R- and S-parameters [31,32].
Hence, the objective of this paper is to compare the prediction accuracy of data-driven models derived from as-built surface topography and corresponding L-PBF process parameters, using different interpretable and non-interpretable ML algorithms. We determine the deterministic surface topography parameters of as-built Inconel 718 surfaces, manufactured with different L-PBF process parameters, and we compare the prediction accuracy of different data-driven models with a multivariate regression benchmark. Additionally, using the results of the data-driven models, we explain the physical mechanisms that drive the experimental observations. The data-driven models are useful to tune L-PBF process parameters and manufacture as-built surfaces with designer surface topography, reducing the need for post-processing of as-built L-PBF parts.
2 Methods and Materials
2.1 Specimen Fabrication.
We use a 3D systems ProX SMP 320 L-PBF device and recycled Inconel 718 powder (ASTM F1877, mean particle diameter of 39.98 µm [33]) to manufacture the specimens considered in this study (ASTM E466-07, nominal geometry 101.60 mm × 19.05 mm × 3.10 mm [34]). Figure 1 schematically illustrates the location and orientation (α = 0 deg, α = 60 deg, and α = 90 deg) of the specimens on the build plate, with respect to the recoater and gas flow directions. The S1 and S2, and S3 and S4 as-built surfaces correspond to the inlet and outlet of the recoater and gas flow directions, respectively. Inert gas flow prevents oxidation and clears emissions from the process chamber to maintain a free path between the laser and the powder bed [35]. S5 is the top surface of the specimens, parallel to the build plate.
Each specimen is manufactured with different L-PBF process parameters: bulk laser power (115 W ≤ P ≤ 465 W), laser scan speed (620 mm/s ≤ v ≤ 1770 mm/s), layer thickness (t = 30 µm or t = 60 µm), and build orientation (0 deg ≤ α ≤ 90 deg). The contour laser power varies depending on the layer thickness (P = 115 W for t = 30 µm and P = 165 W for t = 60 µm) and contour scan speed (v = 625 mm/s), hatch spacing (h = 100 µm), and laser spot size (50 µm) remain constant for all specimens. The parameter ranges are chosen to maintain the laser energy density Eρ = P/vht within the recommended range for Inconel 718 (30 J/mm3 ≤ Eρ ≤ 90 J/mm3). Table 1 lists the L-PBF process parameters for each of the 24 specimens used in this work. We note that these 24 specimens are a subset of 75 specimens used in previous publications related to fatigue testing [36,37] and also for multivariate regression analysis of the surface topography [24]. The as-built surfaces remained unaltered during earlier experiments.
Specimen number | Contour laser power Pc (W) | Bulk laser power Pb (W) | Bulk laser scan speed vb (mm/s) | Layer thickness t (μm) | Build orientation α (deg) |
---|---|---|---|---|---|
1 | 115 | 220 | 1180 | 30 | 60 |
2 | 115 | 220 | 1180 | 30 | 60 |
3 | 115 | 330 | 1770 | 30 | 0 |
4 | 115 | 115 | 620 | 30 | 90 |
5 | 115 | 115 | 620 | 30 | 90 |
6 | 115 | 168 | 1475 | 30 | 0 |
7 | 115 | 168 | 1475 | 30 | 0 |
8 | 115 | 275 | 1200 | 30 | 0 |
9 | 115 | 115 | 915 | 30 | 60 |
10 | 115 | 330 | 1475 | 30 | 60 |
11 | 115 | 168 | 1180 | 30 | 90 |
12 | 115 | 200 | 800 | 30 | 90 |
13 | 115 | 275 | 1770 | 30 | 60 |
14 | 165 | 315 | 1050 | 60 | 60 |
15 | 165 | 165 | 850 | 60 | 0 |
16 | 165 | 390 | 1050 | 60 | 60 |
17 | 165 | 465 | 1400 | 60 | 90 |
18 | 165 | 240 | 1250 | 60 | 60 |
19 | 165 | 390 | 1450 | 60 | 90 |
20 | 115 | 220 | 1180 | 30 | 90 |
21 | 165 | 315 | 1050 | 60 | 0 |
22 | 165 | 315 | 1050 | 60 | 90 |
23 | 165 | 315 | 1050 | 60 | 90 |
24 | 165 | 200 | 1000 | 60 | 90 |
Specimen number | Contour laser power Pc (W) | Bulk laser power Pb (W) | Bulk laser scan speed vb (mm/s) | Layer thickness t (μm) | Build orientation α (deg) |
---|---|---|---|---|---|
1 | 115 | 220 | 1180 | 30 | 60 |
2 | 115 | 220 | 1180 | 30 | 60 |
3 | 115 | 330 | 1770 | 30 | 0 |
4 | 115 | 115 | 620 | 30 | 90 |
5 | 115 | 115 | 620 | 30 | 90 |
6 | 115 | 168 | 1475 | 30 | 0 |
7 | 115 | 168 | 1475 | 30 | 0 |
8 | 115 | 275 | 1200 | 30 | 0 |
9 | 115 | 115 | 915 | 30 | 60 |
10 | 115 | 330 | 1475 | 30 | 60 |
11 | 115 | 168 | 1180 | 30 | 90 |
12 | 115 | 200 | 800 | 30 | 90 |
13 | 115 | 275 | 1770 | 30 | 60 |
14 | 165 | 315 | 1050 | 60 | 60 |
15 | 165 | 165 | 850 | 60 | 0 |
16 | 165 | 390 | 1050 | 60 | 60 |
17 | 165 | 465 | 1400 | 60 | 90 |
18 | 165 | 240 | 1250 | 60 | 60 |
19 | 165 | 390 | 1450 | 60 | 90 |
20 | 115 | 220 | 1180 | 30 | 90 |
21 | 165 | 315 | 1050 | 60 | 0 |
22 | 165 | 315 | 1050 | 60 | 90 |
23 | 165 | 315 | 1050 | 60 | 90 |
24 | 165 | 200 | 1000 | 60 | 90 |
2.2 Surface Topography Measurement and Post-processing.
Optical profilometry, confocal laser scanning microscopy (CLSM), and scanning electron microscopy have been used to measure the surface topography of L-PBF specimens [9,38]. We selected CLSM (Olympus LEXT OLS5000) with a 1.8 mm × 1.8 mm field-of-view, 0.625 µm lateral (20× optical zoom), and 0.006 µm vertical resolution, based on the size of surface topography features, resolution, and accuracy [9,38], and after performing a convergence analysis.
Figure 2(a) schematically shows a specimen, relative to the original fatigue specimen, and Fig. 2(b) identifies the measurement locations on the as-built surfaces (squares with numbers). We measure the surface topography in three different locations on each of the three (specimen 3, 6, 7, 8, 15, 21) or four (all other specimens) as-built surfaces of each specimen; i.e., we perform 9 or 12 measurements per specimen, or (6 × 3 × 3) + (18 × 4 × 3) = 270 measurements for all 24 specimens. The fracture and sectioned surfaces of the specimen are not as-built and, thus, are excluded from the analysis. Furthermore, the six specimens that are oriented under 0 deg with respect to the build plate (specimen 3, 6, 7, 8, 15, and 21) only have three as-built surfaces because the bottom surface is oriented toward the build plate and consists of support material. Figures 2(c) and 2(d) show typical as-built surface topography measurements z = f(x,y) of specimen 2. We post-process the surface topography data by removing tilt and by digitally high-pass filtering the roughness from the entire surface topography following ISO 25178-2 [39].
2.3 Deterministic Surface Topography Parameters.
We quantify the deterministic surface topography parameters from each post-processed surface topography measurement. Deterministic surface topography parameters are based on the entire surface instead of one or multiple traces for R-, S-parameters [39] and statistical methods [40]. A nine-point peak (9PP) identification scheme [31,32] first identifies the N summits of the surface topography as the local maxima. Then, we calculate the standard deviation of summit heights σs from the N summit heights, and the summit density η = N/An, with An the surface topography measurement area (1.8 mm × 1.8 mm). The curvature of each summit i in two orthogonal directions x and y is κx,i = d2z/dx2 and κy,i = d2z/dy2, and the radius of curvature ρi of that summit is the inverse of the average of its κx and κy, i.e., ρi = −[(κx,i + κy,i)/2]−1. The mean summit radius Rs is the arithmetic mean of all individual summit radii. Finally, β = ηRsσs is a dimensionless roughness parameter.
2.4 Data-Driven Models, Machine Learning Algorithms, and Prediction Error Metrics.
We derive data-driven models that relate the deterministic surface topography parameters to the L-PBF process parameters (forward model) and vice versa (inverse model). Different physical phenomena dominate the surface topography of each as-built surface of a specimen, depending on its orientation with respect to the inert gas flow (S3 to S4) and the recoater motion (S1–S2). Thus, we derive data-driven models using different ML algorithms for each as-built surface (S1–S5) and each deterministic surface topography parameter (η, Rs, σs).
We use the hold-out method to determine the prediction error of each model and randomly split the data set 80/20 in training and validation data, which is commonly used by others [41]. Furthermore, we used a group shuffle to avoid separating the three repeat measurements of any as-built surface from each other. We note that the data sets for each as-built surface vary between 18 (S5) and 72 (S3 and S4) data points. Hence, because we have small data sets we use each surface topography measurement as an individual datapoint, rather than averaging the three repeat measurements per as-built surface, which would render the dataset even smaller.
However, we previously concluded based on a one-way ANOVA and a Tukey honestly significant difference (HSD) test [24] that only the deterministic surface topography parameters of S5 showed statistically significant differences with those of S1–S4. Therefore, when explaining the physical phenomena that drive the formation of the as-built surface topography (Sec. 3.2), we consider the surface topography parameters of S1–S4 and those of S5 as two separate data sets.
3 Results and Discussion
3.1 Prediction Accuracy of Data-Driven Models Compared to the Multivariate Regression Benchmark.
Table 2 shows an overview of the prediction error metrics (R2, RMSE, and MAE, as defined in Eqs. (1)–(3)), for each deterministic surface topography parameter (η, Rs, and σs), each as-built surface (S1–S5), and for each of the data-driven model and the multivariate regression benchmark. We highlight the ML algorithm that creates the data-driven model with the highest R2 and the lowest RMSE/MAE in gray for each deterministic surface topography parameter and as-built surface.
Surface topography parameter | Prediction error metric | k-Nearest neighbors | Decision trees | Random forest | Support vector machine | Artificial neural network | Multivariate regression | |
---|---|---|---|---|---|---|---|---|
Summit density, η | S1 | R2 | 0.582 | 0.621 | 0.619 | 0.635 | 0.531 | 0.444 |
RMSE | 0.011 | 0.010 | 0.011 | 0.010 | 0.012 | 0.013 | ||
MAE | 0.009 | 0.008 | 0.008 | 0.008 | 0.009 | 0.010 | ||
S2 | R2 | 0.784 | 0.813 | 0.803 | 0.812 | 0.654 | 0.623 | |
RMSE | 0.012 | 0.011 | 0.011 | 0.011 | 0.014 | 0.014 | ||
MAE | 0.009 | 0.008 | 0.008 | 0.008 | 0.011 | 0.011 | ||
S3 | R2 | 0.710 | 0.741 | 0.696 | 0.718 | 0.602 | 0.390 | |
RMSE | 0.011 | 0.011 | 0.011 | 0.011 | 0.013 | 0.016 | ||
MAE | 0.009 | 0.009 | 0.009 | 0.009 | 0.011 | 0.013 | ||
S4 | R2 | 0.596 | 0.633 | 0.619 | 0.611 | 0.542 | 0.532 | |
RMSE | 0.012 | 0.011 | 0.011 | 0.011 | 0.012 | 0.012 | ||
MAE | 0.009 | 0.008 | 0.009 | 0.009 | 0.010 | 0.010 | ||
S5 | R2 | 0.768 | 0.697 | 0.693 | 0.662 | 0.698 | 0.750 | |
RMSE | 0.019 | 0.019 | 0.019 | 0.020 | 0.019 | 0.019 | ||
MAE | 0.015 | 0.016 | 0.016 | 0.017 | 0.016 | 0.016 | ||
Mean summit radius, Rs | S1 | R2 | 0.822 | 0.786 | 0.779 | 0.772 | 0.718 | 0.635 |
RMSE | 0.005 | 0.005 | 0.005 | 0.005 | 0.006 | 0.007 | ||
MAE | 0.004 | 0.004 | 0.004 | 0.004 | 0.005 | 0.006 | ||
S2 | R2 | 0.930 | 0.952 | 0.922 | 0.945 | 0.721 | 0.760 | |
RMSE | 0.005 | 0.004 | 0.005 | 0.005 | 0.011 | 0.011 | ||
MAE | 0.004 | 0.003 | 0.004 | 0.004 | 0.008 | 0.008 | ||
S3 | R2 | 0.752 | 0.748 | 0.719 | 0.734 | 0.644 | 0.340 | |
RMSE | 0.007 | 0.007 | 0.008 | 0.008 | 0.009 | 0.012 | ||
MAE | 0.006 | 0.006 | 0.006 | 0.006 | 0.007 | 0.010 | ||
S4 | R2 | 0.610 | 0.671 | 0.653 | 0.611 | 0.477 | 0.401 | |
RMSE | 0.008 | 0.007 | 0.007 | 0.008 | 0.009 | 0.009 | ||
MAE | 0.006 | 0.005 | 0.006 | 0.006 | 0.007 | 0.007 | ||
S5 | R2 | 0.498 | 0.477 | 0.492 | 0.471 | 0.472 | 0.430 | |
RMSE | 0.009 | 0.010 | 0.010 | 0.010 | 0.010 | 0.011 | ||
MAE | 0.008 | 0.008 | 0.008 | 0.008 | 0.008 | 0.009 | ||
Standard deviation of summit heights, σs | S1 | R2 | 0.843 | 0.820 | 0.821 | 0.808 | 0.825 | 0.671 |
RMSE | 0.704 | 0.732 | 0.720 | 0.739 | 0.715 | 1.124 | ||
MAE | 0.565 | 0.587 | 0.571 | 0.621 | 0.567 | 0.943 | ||
S2 | R2 | 0.911 | 0.940 | 0.917 | 0.676 | 0.896 | 0.825 | |
RMSE | 0.730 | 0.583 | 0.709 | 1.391 | 0.731 | 1.190 | ||
MAE | 0.568 | 0.465 | 0.563 | 1.038 | 0.569 | 0.861 | ||
S3 | R2 | 0.750 | 0.753 | 0.732 | 0.668 | 0.735 | 0.514 | |
RMSE | 1.268 | 1.263 | 1.298 | 1.392 | 1.297 | 1.714 | ||
MAE | 0.983 | 0.968 | 1.023 | 1.123 | 1.011 | 1.411 | ||
S4 | R2 | 0.667 | 0.699 | 0.682 | 0.589 | 0.695 | 0.568 | |
RMSE | 1.107 | 1.024 | 1.040 | 1.194 | 1.031 | 1.316 | ||
MAE | 0.810 | 0.742 | 0.782 | 0.930 | 0.744 | 1.048 | ||
S5 | R2 | 0.719 | 0.725 | 0.735 | 0.700 | 0.725 | 0.738 | |
RMSE | 2.352 | 2.346 | 2.296 | 2.373 | 2.346 | 2.111 | ||
MAE | 2.012 | 2.010 | 2.000 | 2.015 | 2.010 | 1.799 |
Surface topography parameter | Prediction error metric | k-Nearest neighbors | Decision trees | Random forest | Support vector machine | Artificial neural network | Multivariate regression | |
---|---|---|---|---|---|---|---|---|
Summit density, η | S1 | R2 | 0.582 | 0.621 | 0.619 | 0.635 | 0.531 | 0.444 |
RMSE | 0.011 | 0.010 | 0.011 | 0.010 | 0.012 | 0.013 | ||
MAE | 0.009 | 0.008 | 0.008 | 0.008 | 0.009 | 0.010 | ||
S2 | R2 | 0.784 | 0.813 | 0.803 | 0.812 | 0.654 | 0.623 | |
RMSE | 0.012 | 0.011 | 0.011 | 0.011 | 0.014 | 0.014 | ||
MAE | 0.009 | 0.008 | 0.008 | 0.008 | 0.011 | 0.011 | ||
S3 | R2 | 0.710 | 0.741 | 0.696 | 0.718 | 0.602 | 0.390 | |
RMSE | 0.011 | 0.011 | 0.011 | 0.011 | 0.013 | 0.016 | ||
MAE | 0.009 | 0.009 | 0.009 | 0.009 | 0.011 | 0.013 | ||
S4 | R2 | 0.596 | 0.633 | 0.619 | 0.611 | 0.542 | 0.532 | |
RMSE | 0.012 | 0.011 | 0.011 | 0.011 | 0.012 | 0.012 | ||
MAE | 0.009 | 0.008 | 0.009 | 0.009 | 0.010 | 0.010 | ||
S5 | R2 | 0.768 | 0.697 | 0.693 | 0.662 | 0.698 | 0.750 | |
RMSE | 0.019 | 0.019 | 0.019 | 0.020 | 0.019 | 0.019 | ||
MAE | 0.015 | 0.016 | 0.016 | 0.017 | 0.016 | 0.016 | ||
Mean summit radius, Rs | S1 | R2 | 0.822 | 0.786 | 0.779 | 0.772 | 0.718 | 0.635 |
RMSE | 0.005 | 0.005 | 0.005 | 0.005 | 0.006 | 0.007 | ||
MAE | 0.004 | 0.004 | 0.004 | 0.004 | 0.005 | 0.006 | ||
S2 | R2 | 0.930 | 0.952 | 0.922 | 0.945 | 0.721 | 0.760 | |
RMSE | 0.005 | 0.004 | 0.005 | 0.005 | 0.011 | 0.011 | ||
MAE | 0.004 | 0.003 | 0.004 | 0.004 | 0.008 | 0.008 | ||
S3 | R2 | 0.752 | 0.748 | 0.719 | 0.734 | 0.644 | 0.340 | |
RMSE | 0.007 | 0.007 | 0.008 | 0.008 | 0.009 | 0.012 | ||
MAE | 0.006 | 0.006 | 0.006 | 0.006 | 0.007 | 0.010 | ||
S4 | R2 | 0.610 | 0.671 | 0.653 | 0.611 | 0.477 | 0.401 | |
RMSE | 0.008 | 0.007 | 0.007 | 0.008 | 0.009 | 0.009 | ||
MAE | 0.006 | 0.005 | 0.006 | 0.006 | 0.007 | 0.007 | ||
S5 | R2 | 0.498 | 0.477 | 0.492 | 0.471 | 0.472 | 0.430 | |
RMSE | 0.009 | 0.010 | 0.010 | 0.010 | 0.010 | 0.011 | ||
MAE | 0.008 | 0.008 | 0.008 | 0.008 | 0.008 | 0.009 | ||
Standard deviation of summit heights, σs | S1 | R2 | 0.843 | 0.820 | 0.821 | 0.808 | 0.825 | 0.671 |
RMSE | 0.704 | 0.732 | 0.720 | 0.739 | 0.715 | 1.124 | ||
MAE | 0.565 | 0.587 | 0.571 | 0.621 | 0.567 | 0.943 | ||
S2 | R2 | 0.911 | 0.940 | 0.917 | 0.676 | 0.896 | 0.825 | |
RMSE | 0.730 | 0.583 | 0.709 | 1.391 | 0.731 | 1.190 | ||
MAE | 0.568 | 0.465 | 0.563 | 1.038 | 0.569 | 0.861 | ||
S3 | R2 | 0.750 | 0.753 | 0.732 | 0.668 | 0.735 | 0.514 | |
RMSE | 1.268 | 1.263 | 1.298 | 1.392 | 1.297 | 1.714 | ||
MAE | 0.983 | 0.968 | 1.023 | 1.123 | 1.011 | 1.411 | ||
S4 | R2 | 0.667 | 0.699 | 0.682 | 0.589 | 0.695 | 0.568 | |
RMSE | 1.107 | 1.024 | 1.040 | 1.194 | 1.031 | 1.316 | ||
MAE | 0.810 | 0.742 | 0.782 | 0.930 | 0.744 | 1.048 | ||
S5 | R2 | 0.719 | 0.725 | 0.735 | 0.700 | 0.725 | 0.738 | |
RMSE | 2.352 | 2.346 | 2.296 | 2.373 | 2.346 | 2.111 | ||
MAE | 2.012 | 2.010 | 2.000 | 2.015 | 2.010 | 1.799 |
Note: Models with the highest R2 and lowest RMSE and MAE are highlighted in gray.
We note that we maximize the prediction accuracy of the data-driven models by optimizing the parameters of the respective ML algorithms. Hence, we determine that the decision tree algorithm has unrestricted tree depth with a minimum of two deterministic surface topography measurements for every decision node and at least one deterministic surface topography measurement for every leaf node. The random forest algorithm averages six decision trees with the same parameters as the individual decision tree. The k-nearest neighbors algorithm estimates the deterministic surface topography parameters from the k = 3 nearest neighbors (using the Euclidean distance), with all neighbors weighted equally. The support vector machine uses a radial basis function kernel [42], and the ANN uses the limited-memory Broyden–Fletcher–Goldfarb–Shanno (LBFGS) solver with twenty neurons [43]. Table 3 summarizes the deterministic surface topography parameters used to derive the data-driven models, and lists the average value of each parameter (over three replicate measurements) for each specimen and each as-built surface (S1–S5).
From Table 2, we observe that the prediction accuracy of the data-driven models derived from ML algorithms exceeds that of the multivariate regression benchmark for all data sets except the standard deviation of summit heights of S5. On average, the R2 metric of data-driven models exceeds that of the corresponding multivariate benchmark by 23.7%. This is likely because multivariate regression analysis requires the best-fit equation (e.g., linear, polynomial, or logarithmic) to apply to the entire parameter range and solution domain. In contrast, ML algorithms compartmentalize the parameter range and solution domain to determine the best fit in each compartment of the solution domain, thus affording more flexibility than multivariate regression analysis to relate the surface topography parameters to the L-PBF process parameters.
The prediction accuracy of models derived from interpretable ML algorithms exceeds those derived from non-interpretable algorithms in all cases except one (η of S1), which could be the result of the simplicity of the dataset (i.e., a linear fit resulted in the highest R2 with multivariate regression) [41]. The data-driven models derived from the decision tree ML algorithm show the highest prediction accuracy of all interpretable ML algorithms, with an R2 metric that on average exceeds the multivariate regression benchmark by 28.3%. Decision tree ML algorithms are more tolerant to missing and irrelevant values than other algorithms, including KNN. In addition, random forest algorithms sometimes over-segregate a small data set [44]. Interpretable ML algorithms are also beneficial because they allow a user to comprehend how the data-driven model arrives at its prediction. The model derived from the ANN algorithm shows the highest prediction accuracy of all non-interpretable ML algorithms, with an R2 metric that on average exceeds the multivariate regression benchmark by 15.0%. ANN algorithms typically outperform SVM algorithms when handling multiple input parameters [44].
We show typical results obtained from the data-driven models to illustrate the results of Table 2. We define the nondimensional power Pnd as the ratio of contour and bulk laser power and nondimensional scan speed vnd as the ratio of the contour and bulk laser scan speed. Figure 3 depicts the summit density η as a function of the nondimensional laser scan speed vnd and the nondimensional laser power Pnd for S1–S4 based on data-driven models derived using (a) the ANN algorithm (R2 = 0.534) and (b) the decision tree algorithm (R2 = 0.587). Figure 3 illustrates the results of the data-driven models. Figure 3(a) shows continuous results, whereas Fig. 3(b) depicts discrete results that correspond to the different leaves of the decision tree. However, we qualitatively observe similar trends in the results from both models.
Figure 4 illustrates a decision tree for summit density η of S1–S4. The algorithm is trained on 80% of the data (252 data points) and predicts the summit density η based on Pnd and vnd. We prune the decision tree to only show the first nine decision nodes (white) to reduce figure size and complexity. Each decision node shows the number of data points n at that node, and each end node or leaf (gray) shows the predicted summit density η for that tree branch and the number of data points at that leaf. The decision tree illustrates the hierarchy of decision-making to predict the summit density η, based on the L-PBF process parameters. In this example, we observe that the decision nodes with the highest hierarchy involve vnd rather than Pnd, indicating that vnd has a more substantial effect on η than Pnd.
3.2 Physical Phenomena Determining the As-Built Surface Topography.
The surface topography measurements and the results of the data-driven models show that the as-built surface topography depends on the nondimensional laser scan speed vnd and the nondimensional laser power Pnd, which determine the melt pool geometry (width and depth) and the temperature gradient between the melt pool and the previously melted tracks [45]. Specifically, we explain the relationship between the as-built surface topography and the melt pool geometry and temperature gradient through thermocapillary convection and melt track overlap [45] based on results from the data-driven models (see, e.g., Fig. 3), and we provide experimental evidence in support of this discussion in Fig. 6.
Figure 5(a) schematically shows a cross-sectional view of a single laser track, illustrating the temperature gradient between the hot melt pool and the cold solidified melt track. The surface tension of the molten material decreases with increasing temperature [46], and thus, the temperature gradient within the melt pool creates a surface tension gradient. As a result, the liquid melt pool flows from low to high surface tension, i.e., toward previously melted and solidified material, and it forms a smooth surface deviation or the so-called globule. This phenomenon is often referred to as thermocapillary convection [47], and its importance increases with increasing melt pool temperature or with increasing melt pool size, which occurs with increasing Pnd or decreasing vnd. Globules decrease the summit density η and increase the standard deviation of summit heights σs, which we schematically illustrate in Fig. 5(a).
Figure 5(b) schematically shows a top view and cross-sectional view of a laser melt track, illustrating the effect of insufficient melt pool overlap on the as-built surface topography. When the melt pool size does not fully cover the previously melted track, insufficient overlap exists between adjacent melt tracks, which may break up the melt track (so-called balling driven by minimizing the surface energy of the melt pool), create spatter, and leave trenches with unmelted metal powder particles between adjacent melt tracks. Decreasing melt track overlap increases the standard deviation of summit heights σs as a result of trenches with unmelted metal powder particles between adjacent melt tracks. In addition, it decreases the summit density η because the unmelted metal powder particle density increases with decreasing melt track overlap and each unmelted metal powder particles covers multiple summits on the as-built surface, as the radius of an unmelted metal powder particle (mean radius, 39.98 µm) is much larger than that of the mean summit radius (on the order of 0.1–1.0 µm, see, e.g., Fig. 6). Furthermore, the mean summit radius Rs decreases because the melt pool temperature decreases with decreasing melt track overlap, thus increasing surface tension and increasing roughness [46]. Even though the number of unmelted metal powder particles increases with decreasing melt track overlap, its effect on the mean summit radius Rs is negligible as the number of unmelted metal powder particles is several orders of magnitude smaller than the number of summits.
Figure 5(c) schematically shows a top view and cross-sectional view of multiple laser tracks, illustrating the effect of melt track overlap on the as-built surface topography. Increasing overlap increases remelting of adjacent melt tracks, thus decreasing the depth of trenches between them, melting unmelted metal powder particles and, thus, “smoothening” the as-built surface. As a result, the mean summit radius Rs increases and the standard deviation of summit heights σs decreases with increasing melt track overlap. Furthermore, the summit density η increases because the unmelted metal powder particle density decreases with increasing melt track overlap and each unmelted metal powder particle covers multiple summits on the as-built surface.
Figure 6 shows the two parameters (thermocapillary effect and overlap between adjacent melt tracks) and the three deterministic surface topography parameters (η, Rs, and σs) in matrix format. We use confocal and optical microscopy images and surface topography measurements to substantiate the effects we schematically illustrate in Fig. 5. In Fig. 6, the confocal and optical microscopy images, as well as the surface traces, represent indicative subsets of the entire surface, whereas the surface topography parameters reflect the entire surface topography measurement. Figure 6(a) shows optical microscopy and Fig. 6(b) shows confocal microscopy images of as-built surface S1 of specimen 14 (Pnd = 0.367 and vnd = 0.369) compared to specimen 24 (Pnd = 0.767 and vnd = 0.415); i.e., we increase the laser power but maintain a constant laser scan speed, and specimen 14 (Pnd = 0.367 and vnd = 0.369) compared to specimen 1 (Pnd = 0.366 and vnd = 0.269); i.e., we maintain constant laser power but decrease laser scan speed. In both comparisons, we increase the temperature gradient between the melt pool and previously solidified material, thus increasing the formation of globules (highlighted in red) due to thermocapillary convection. We observe that the summit density η decreases as a result of the presence of globules. Similarly, we observe that the standard deviation of summit heights σs increases with an increasing number of globules.
Figure 6(c) shows optical microscopy and Fig. 6(e) shows confocal microscopy images of as-built surface S1 of specimen 23 (Pnd = 0.367 and vnd = 0.369) and specimen 12 (Pnd = 0.435 and vnd = 0.654); i.e., we increase the laser scan speed, which reduces the size of the melt pool and, correspondingly, reduces the overlap between adjacent melt tracks. We identify and highlight unmelted metal powder particles in red (specimen 23: 69.1 unmelted metal powder particles/mm2 and specimen 12: 251.9 unmelted metal powder particles/mm2), which primarily reside in the trenches between adjacent melt tracks. We observe that summit density η decreases with decreasing melt track overlap because the increasing number of unmelted metal powder particles cover multiple summits. Furthermore, the presence of unmelted metal powder particles and trenches between adjacent tracks increases the “spikiness” of the surface (i.e., the mean summit radius Rs decreases and the standard deviation of summit heights σs increases), as illustrated in the traces shown in Fig. 6(d) and the standard deviation of summit heights σs shown in Fig. 6(e). Figure 6(f) shows optical microscopy and Fig. 6(h) shows confocal microscopy images of as-built surface S1 of specimen 18 (Pnd = 0.143 and vnd = 0.143) and specimen 11 (Pnd = 0.581 and vnd = 0.269); i.e., we increase the laser power, which increases the size of the melt pool and increases overlap between adjacent melt tracks, thus increasing remelting and, correspondingly, reducing the number of unmelted metal powder particles. We highlight unmelted metal powder particles in red (specimen 18: 177.8 unmelted metal powder particles/mm2 and specimen 11: 90.1 unmelted metal powder particles/mm2). We observe that the summit density η increases and the mean summit radius Rs increases with increasing melt track overlap. Figure 6(h) shows confocal microscopy images of both surfaces and illustrates that the standard deviation of summit heights σs decreases with increasing melt track overlap because the size of the trenches between adjacent melt tracks, and correspondingly, the number of unmelted metal powder particles decreases.
4 Conclusions
We have derived data-driven models that relate deterministic surface topography parameters to L-PBF process parameters using five supervised ML algorithms, and we have compared the prediction accuracy of the data-driven models to a multivariate regression benchmark model. We conclude that:
All supervised ML algorithms considered in this work derive data-driven models with prediction accuracy that exceeds that of the multivariate regression model, using the R2, RMSE, and MAE metrics, because a multivariate regression model does not always capture the complex relationship between the deterministic surface topography parameters and the L-PBF process parameters in a single best-fit equation.
The data-driven models derived from decision tree (interpretable) and ANN (non-interpretable) algorithms provide the highest prediction accuracy.
We show experimental evidence that thermocapillary convection and overlap between adjacent melt tracks are the main drivers of the deterministic surface topography parameters, i.e., the summit density η, the mean summit radius Rs, and the standard deviation of summit heights σs.
Acknowledgment
We gratefully acknowledge support from the Department of Defense, Office of Local Defense Community Cooperation, under award no. ST1605-21-04.
Conflict of Interest
There are no conflicts of interest.
Data Availability Statement
The data sets generated and supporting the findings of this article are obtainable from the corresponding author upon reasonable request.
Appendix
Table 3 shows the deterministic surface topography parameters (η, Rs, and σs) for each specimen and as-built surface S1–S5. The value of each deterministic surface topography parameter is the average of the three replicates of each as-built surface (surface topography measurement locations are shown in Fig. 2).
Deterministic surface topography parameters (η/Rs/σs) | |||||
---|---|---|---|---|---|
Specimen | S1 | S2 | S3 | S4 | S5 |
1 | 0.142/0.100/6.612 | 0.125/0.078/9.288 | 0.150/0.096/6.229 | 0.151/0.099/6.207 | – |
2 | 0.135/0.109/6.198 | 0.114/0.086/8.864 | 0.122/0.098/7.329 | 0.122/0.097/7.227 | – |
3 | – | – | 0.110/0.084/10.37 | 0.113/0.090/9.580 | 0.128/0.083/13.05 |
4 | 0.109/0.080/11.16 | 0.120/0.097/8.255 | 0.118/0.087/9.782 | 0.133/0.101/6.987 | – |
5 | 0.104/0.075/11.70 | 0.120/0.100/7.515 | 0.131/0.097/7.078 | 0.123/0.086/8.726 | – |
6 | – | – | 0.112/0.078/11.43 | 0.116/0.088/10.05 | 0.142/0.079/14.716 |
7 | – | – | 0.113/0.068/12.835 | 0.129/0.079/10.63 | 0.160/0.082/13.29 |
8 | – | – | 0.113/0.077/12.079 | 0.126/0.089/9.961 | 0.150/0.098/8.482 |
9 | 0.144/0.106/6.750 | 0.134/0.062/11.74 | 0.148/0.076/9.048 | 0.145/0.071/10.17 | – |
10 | 0.157/0.091/7.675 | 0.127/0.069/10.75 | 0.146/0.074/9.234 | 0.125/0.076/9.938 | – |
11 | 0.158/0.113/5.585 | 0.116/0.076/10.60 | 0.150/0.104/5.908 | 0.135/0.090/7.927 | – |
12 | 0.128/0.077/10.03 | 0.135/0.094/7.715 | 0.122/0.081/10.04 | 0.149/0.104/6.262 | – |
13 | 0.139/0.097/7.561 | 0.119/0.075/10.88 | 0.127/0.072/11.46 | 0.138/0.084/9.354 | – |
14 | 0.177/0.109/5.017 | 0.121/0.087/8.613 | 0.142/0.103/6.838 | 0.141/0.101/6.686 | – |
15 | – | – | 0.145/0.106/6.908 | 0.134/0.095/8.395 | 0.198/0.087/8.813 |
16 | 0.167/0.116/5.097 | 0.141/0.098/6.760 | 0.184/0.108/5.390 | 0.164/0.106/6.177 | – |
17 | 0.138/0.098/7.119 | 0.166/0.119/4.987 | 0.165/0.106/6.266 | 0.158/0.105/6.119 | – |
18 | 0.137/0.093/7.870 | 0.148/0.095/7.003 | 0.159/0.105/6.231 | 0.151/0.102/6.283 | – |
19 | 0.144/0.103/6.326 | 0.168/0.118/5.093 | 0.146/0.109/6.144 | 0.149/0.105/6.278 | – |
20 | 0.139/0.095/7.213 | 0.164/0.116/5.133 | 0.164/0.110/5.488 | 0.157/0.109/5.604 | – |
21 | – | – | 0.159/0.113/5.676 | 0.160/0.116/5.349 | 0.183/0.096/7.097 |
22 | 0.157/0.103/5.993 | 0.171/0.118/4.908 | 0.164/0.107/5.766 | 0.171/0.117/4.855 | – |
23 | 0.143/0.100/6.549 | 0.161/0.115/5.298 | 0.170/0.099/6.399 | 0.157/0.097/6.953 | – |
24 | 0.155/0.107/5.677 | 0.168/0.118/4.879 | 0.156/0.092/7.480 | 0.179/0.098/6.361 | – |
Deterministic surface topography parameters (η/Rs/σs) | |||||
---|---|---|---|---|---|
Specimen | S1 | S2 | S3 | S4 | S5 |
1 | 0.142/0.100/6.612 | 0.125/0.078/9.288 | 0.150/0.096/6.229 | 0.151/0.099/6.207 | – |
2 | 0.135/0.109/6.198 | 0.114/0.086/8.864 | 0.122/0.098/7.329 | 0.122/0.097/7.227 | – |
3 | – | – | 0.110/0.084/10.37 | 0.113/0.090/9.580 | 0.128/0.083/13.05 |
4 | 0.109/0.080/11.16 | 0.120/0.097/8.255 | 0.118/0.087/9.782 | 0.133/0.101/6.987 | – |
5 | 0.104/0.075/11.70 | 0.120/0.100/7.515 | 0.131/0.097/7.078 | 0.123/0.086/8.726 | – |
6 | – | – | 0.112/0.078/11.43 | 0.116/0.088/10.05 | 0.142/0.079/14.716 |
7 | – | – | 0.113/0.068/12.835 | 0.129/0.079/10.63 | 0.160/0.082/13.29 |
8 | – | – | 0.113/0.077/12.079 | 0.126/0.089/9.961 | 0.150/0.098/8.482 |
9 | 0.144/0.106/6.750 | 0.134/0.062/11.74 | 0.148/0.076/9.048 | 0.145/0.071/10.17 | – |
10 | 0.157/0.091/7.675 | 0.127/0.069/10.75 | 0.146/0.074/9.234 | 0.125/0.076/9.938 | – |
11 | 0.158/0.113/5.585 | 0.116/0.076/10.60 | 0.150/0.104/5.908 | 0.135/0.090/7.927 | – |
12 | 0.128/0.077/10.03 | 0.135/0.094/7.715 | 0.122/0.081/10.04 | 0.149/0.104/6.262 | – |
13 | 0.139/0.097/7.561 | 0.119/0.075/10.88 | 0.127/0.072/11.46 | 0.138/0.084/9.354 | – |
14 | 0.177/0.109/5.017 | 0.121/0.087/8.613 | 0.142/0.103/6.838 | 0.141/0.101/6.686 | – |
15 | – | – | 0.145/0.106/6.908 | 0.134/0.095/8.395 | 0.198/0.087/8.813 |
16 | 0.167/0.116/5.097 | 0.141/0.098/6.760 | 0.184/0.108/5.390 | 0.164/0.106/6.177 | – |
17 | 0.138/0.098/7.119 | 0.166/0.119/4.987 | 0.165/0.106/6.266 | 0.158/0.105/6.119 | – |
18 | 0.137/0.093/7.870 | 0.148/0.095/7.003 | 0.159/0.105/6.231 | 0.151/0.102/6.283 | – |
19 | 0.144/0.103/6.326 | 0.168/0.118/5.093 | 0.146/0.109/6.144 | 0.149/0.105/6.278 | – |
20 | 0.139/0.095/7.213 | 0.164/0.116/5.133 | 0.164/0.110/5.488 | 0.157/0.109/5.604 | – |
21 | – | – | 0.159/0.113/5.676 | 0.160/0.116/5.349 | 0.183/0.096/7.097 |
22 | 0.157/0.103/5.993 | 0.171/0.118/4.908 | 0.164/0.107/5.766 | 0.171/0.117/4.855 | – |
23 | 0.143/0.100/6.549 | 0.161/0.115/5.298 | 0.170/0.099/6.399 | 0.157/0.097/6.953 | – |
24 | 0.155/0.107/5.677 | 0.168/0.118/4.879 | 0.156/0.092/7.480 | 0.179/0.098/6.361 | – |
Note: “–“ indicates that an as-built surface did not exist for a specific specimen.