Enhanced production of alkane hydroxylase from Penicillium chrysogenum SNP5 (MTCC13144) through feed-forward neural network and genetic algorithm
AMB Express volume 12, Article number: 28 (2022)
Alkane hydroxylase (AlkB), a membrane-bound enzyme has high industrial demand; however, its economical production remains challenging due to its intrinsic nature and co-factor dependency. In the current study, various critical process parameters for optimum production of AlkB have been optimized through feed forward neural network (FFNN) and genetic algorithm (GA) models using Penicillium chrysogenum SNP5 (MTCC13144). AlkB specific activity under preliminary un-optimized conditions i.e., 1% hexadecane, 7.4 pH, 11 days incubation time, 28 °C incubation temperature and 1 ml of inoculum size was 100 U/mg. ‘One variable at a time’ (OVAT) strategy was used to identify optimum physicochemical parameters and then its output data was fed to develop a model of FFNN with ‘6-12-1’ topology. Outputs of FFNN were further optimized through GA to minimize errors and intensify search level. This has provided superior predictive performances with 0.053 U/mg overall mean absolute percentage error (MAPE), 6.801 U/mg root mean square errors (RMSE), and 0.987 overall correlation coefficient (R). The AlkB specific activity improved by 3.5-fold, i.e., from 100 U/mg under preliminary un-optimized conditions to 351.32 U/mg under optimum physicochemical conditions obtained through FFNN-GA hybrid method, i.e., hexadecane (carbon source): 1.56% v/v, FeSO4: 0.63 mM, incubation temperature: 27.40 °C, pH: 7.38, incubation time: 12.35 days and inoculums size: 1.33 ml. The developed process would be a stepping stone to fulfill the high industrial demands of Alkane hydroxylase.
The membrane-bound alkane hydroxylase (AlkB) is a versatile biocatalyst that introduces molecular oxygen in inert alkanes with regio and stereoselectivity. The AlkB system is comprised of three subunits: AlkB, soluble rubredoxin reductase, and soluble rubredoxin. AlkB incorporates molecular oxygen from O2 to the alkane and the remaining oxygen gets reduced to water by electrons released from rubredoxin on the action of rubredoxin reductase (Eidani et al. 2012). Various divergent forms of Alkane hydroxylases ubiquitously found worldwide viz. soluble methane monooxygenase (sMMO) and copper-containing methane monooxygenase (pMMO) are capable of oxidizing hydrocarbons ranging from C1 to C8 (Van Beilen and Funhoff 2007). While, integral membrane-bound AlkB insert oxygen in C5 to C16 (Aliakbari et al. 2014), other forms such as cytochrome P450, LadA, or AlmA assimilate oxygen to alkanes larger than C20 (Wang and Shao 2012) and sometimes opt for medium-chain alkanes as substrate (Xu et al. 2015).
AlkB has tremendous market demand in synthesizing industrially important molecules such as secondary metabolites, steroids, polyketides, pharmaceutical compounds, cosmetics, fragrance and agrochemical intermediates, etc. (Rojo 2010; Ramu et al. 2012). It has some other promising applications like conversion of petroleum waste into activated intermediates, bioremediation and biotransformation. In general, either whole cell or partially purified AlkB may be used in biotransformation depending upon the requirements. Ramu et al. (2012) explored this enzyme in a whole-cell biotransformation system by recombinant expression in Escherichia coli to regioselectively synthesize 2,2-, 3,3- and 4,4-difluorooctan-1-ols, from simple and inexpensive starting materials. However, opting for whole cells for oxidation becomes challenging due to the slow uptake of a lipophilic substrate which results in the production of toxic compounds and a low oxygen transfer rate (Ayala and Toress 2004). Alkane hydroxylases have been well explored in therapeutics also, where it is used to treat inflammation, vascular liver diseases and peroxisome disorders of fatty acid metabolism.
The alkane hydroxylase system is distributed in a wide range of bacterial strains (Burkholderia, Pseudomonas, Acinetobacter, Alcanivorax, and Rhodococcus strains) and a few fungal strains like Aspergillus sp. (Nie et al. 2014; Kadri et al. 2018). Its overproduction has been achieved through overexpressing the all gene of both the gram-negative and gram-positive bacteria into another host; however, rubredoxin and rubredoxin reductase were reported as their essential cofactors (Luo et al. 2015). It has been reported by Kadri et al. (2018) and Al-Hawash et al. (2018) that media engineering and selection of appropriate carbon sources improved AlkB specific activity in Alcanivorax borkumensis and Aspergillus sp. RFC-1. From the results and observations of earlier studies, it was observed that the bottleneck of the process is its intrinsic nature and cofactor dependency. Therefore, to overcome these bottlenecks various interdisciplinary approaches and techniques have to be put together to achieve optimum specific activity of AlkB economically. Being membrane-bound, its yield remains dependent on cell growth and concentration; and further cofactors are essential for its functions, hence, its yield could be enhanced through optimizing the physicochemical critical parameters, which influence cell growth most. The global scientific and industrial world has been witnessing the increasing use of FFNN and GA together for process optimization for enhanced product yield.
An experimental design, i.e., screening and optimization design is considered pivotal to computer-assisted design-guided statistical exercise. The aims of factor screening and optimization can be accomplished by opting for a design-guided experimental strategy using selected experimental designs. Experimental designs are modeled by selecting appropriate mathematical models like linear, quadratic and cubic to generate 2D and 3D-response variables to figure out inter and intra factorial interactions. To search for optimum yield or solution, various numerical and graphical optimization techniques such as FFNN, desirability function and overlay plot are opted, which are located in design and control spaces. Design space is a multidimensional combination of input and response variables to determine the optimal solution with high accuracy and quality.
In bioprocesses optimization (Negi et al. 2020), pattern recognition in spectrum data, functional analysis of genomes and proteomes (Wardah et al. 2019) and their nonlinear functions are designed through FFNN (May et al. 2011). Many studies reported that FFNN has better efficiency, accuracy and yield as compared to the other statistical optimization methods such as RSM (Prakash Maran and Priya 2015). Genetic algorithms (GAs) are randomly determined search methods based on some basic operations like selection, reproduction or crossover, and mutation as natural genetics to find out the best fitness value/outcome (Murthy 2012). GA has also been well explored by many researchers to achieve optimum process parameters for enhancing product yield in various biological systems (Kana et al. 2012). In previous studies, coupled FFNN-GA system has been effectively used for optimizing the production of cellulase (Chang et al. 2011) and glutaminase (Sathish and Prakasham 2010). These studies concluded that FFNN-GA coupled system has better proficiency with minimum errors compared to the other optimization methodologies. Hence FFNN-GA coupled system is emerging as an effective tool in optimization studies (Singh et al. 2017).
In the present study, FFNN-GA coupled system was used to optimize fermentation parameters to achieve maximum specific activity of AlkB from Penicillium chrysogenum SNP5. FFNN was used for the training of experimental data and GA was used for the optimization of input variables further with the help of weight and biases generated from the neural network.
Material and methods
Microorganism and media chemicals
Penicillium chrysogenum SNP5 (MTCC13144) strain was locally isolated from grease contaminated soil of the diesel loco shed and identified by Microbial Type Culture Collection, Chandigarh, India. Its ITS/5.8S rRNA and the β-tubulin gene sequences have been submitted to GenBank and Bankit with accession numbers: OL336466 and OL352703 respectively. Triton X-100, Phenylmethylsulfonyl fluoride (PMSF), Lauryldimethylamine oxide (LDAO), Nicotinamide adenine dinucleotide (NAD) + hydrogen (NADH), other medium components and Czapek-dox medium were procured from Sisco Research Laboratories Pvt. Ltd. (Mumbai, India). Hexadecane, Digitonin and THB (Tetrahydrobiopterin) were procured from TCI Chemicals Pvt. Ltd. (India).
Production of alkane hydroxylase under submerged fermentation (SmF)
Production of AlkB was performed in a 250 ml Erlenmeyer flask with a working volume of 100 ml. Initially, hexadecane 1% (v/v), 0.5% YEPD, 0.1% glucose and 1 mM TBH were dissolved in 50 ml of Czapek-dox broth (NaNO3—2.5 g/l, KH2PO4—1.0 g/l, MgSO4⋅7H2O—0.5 g/l, KCl—0.5 g/l, FeSO4—0.45 g/l) and final volume of 100 ml was maintained by using distilled water. The flasks were autoclaved at 121 °C, 121psi for 15 min. Penicillium chrysogenum SNP5 (MTCC13144) strain was cultivated on potato dextrose agar (PDA) slant and incubated for 6–7 days. The inoculum was prepared with sterile distilled water by maintaining the spore’s concentration of 1.4 × 107 spores/ml. Each flask was inoculated with 1.0 ml of spore suspension and incubated at 28 °C for 11 days, and growth was observed.
Extraction of AlkB and its activity measurement
After fermentation, the cells were harvested by centrifuging the fermented broth at 7826g for 10 min to separate the cells. The cell pellets containing AlkB were washed two times with Tris–HCl buffer (pH 7.4) and lysed to recover the enzyme by using ultrasonicator (Model SKL-500D) Ningbo Haishu Sklon Electronics Instrument Co, Ltd. (Mainland, China) at 70 kHz using 9 s on 9 s off pulsating cycle for 5 min in lysis buffer (150 mM NaCl, 20% (v/v) glycerol, 50 mM Tris HCl, 1 mM digitonin, 2% Triton X-100 and 1 mM PMSF) and then centrifuged at 11,269g for 15 min. The clear supernatant was collected, and the remaining cell pellet was resonicated followed by centrifugation, and both the supernatant were pooled to serve as crude AlkB enzyme for further study.
AlkB activity in the crude extract was measured by a continuous method using NADH as cofactor and hexadecane as substrate by a modified protocol of McKenna and Coon (1970). Reaction mixture contained 100 mM Tris HCl buffer (pH—7.4), 0.035% LDAO (1.5 CMC), 20% glycerol, 100 μl crude enzyme, 1 mM hexadecane. A mixture lacking NADH was used as a negative control. The reaction mixture was then incubated at room temperature for 20 min. The reaction was initiated by adding NADH to a final concentration of 50 μM. The rate of NADH consumption was determined by monitoring the change in absorbance at 340 nm at room temperature for 10 min. One unit is defined as the amount of enzyme required for the consumption of 1 μM of NADH (ε340 = 6220 M−1 cm−1) per min.
Optimization of process parameters by OVAT method
SmF optimization experiments were planned according to OVAT method with six selected process parameters as mentioned in Table 1 i.e., hexadecane concentration (as carbon source), incubation temperature, pH of the media, incubation time, inoculum size and metal ion concentration (FeSO4). Tested ranges of variables were hexadecane (0.25–4% v/v), temperature (20–38 °C), pH (4–9), incubation time (5–14 days), inoculum size (0.25–3.0 ml) and FeSO4 (0.05–0.8 mM).
Modeling of optimization process by FFNN
The feed-forward neural network along with the back-propagation learning algorithm has been employed in this study to optimize nonlinear data obtained from the ‘one variable at a time’ method and to reduce the experimental error. The network consists of input, hidden and output layers along with additional nodes termed as the bias (biasI and biasH). The connection between each layer has been denoted as weight (weightH and weightO). Tan sigmoid functions have been used for optimum output. The network outcomes such as weights and biases have been expressed with the following equation (Norgaard 2000; Izadifar 2005).
In the present study, six process parameters were selected, i.e., hexadecane (carbon source) concentration, temperature, pH, incubation time, inoculum size and metal ion concentration (i.e., FeSO4) to enhance the AlkB productivity. Process parameters were initially optimized with the ‘OVAT’ method by considering the upper and lower limits of each parameter (Table 1). Total 47 experimental sets were performed using the OVAT method, and later it was extended upto 103 sets using the regression equation. Out of 103 sets, 73 (~ 70%) were selected for training, 15 (~ 15%) were used for validation, and the rest of the 15% data were used for testing in FFNN modeling (Table 2). The neural network was trained by using MATLAB R2020a (The MathWorks, Inc., Natick, MA, USA). Levenberg–Marquardt (trainlm) algorithm was used and numbers of hidden neurons were increased one by one to obtain the best correlation. The best training run was determined by the coefficient R for training, validation and test, which describes the extent of back-propagation in the modeled network. The mean absolute percentage error (MAPE) and root mean square error (RMSE) were calculated using the experimental output and predicted output as described previously (Zhang and Fang 2006).
Optimization with a GA was carried out using FFNN outputs (i.e., weights and biases) by assigning fitness functions to each population. The global optimum was localized on objective function using genetic algorithm outputs. Our main objective was to find optimum input variables for the highest specific activity of AlkB by fixing the lower and upper bound of input variables (Table 1). For optimization of neural network output, the GA toolbox of MATLAB R2020a (The MathWorks, Inc., Natick, MA, USA) was used to achieve optimum conditions in the given range of input variables. GA optimization parameters, i.e., population size as default (i.e., 50 for five or fewer otherwise 200), crossover probability (i.e., 0.8), mutation probability (i.e., 0.01) and the maximum number of generations (i.e., 500) were considered based on literature (Prakasham et al. 2011).
Our previous studies revealed that Penicillium chrysogenum SNP5 (MTCC13144) has great potential for conversion of hydrocarbons of complex grease waste into fatty acids (Kumar et al. 2012; Kumari et al. 2017). Therefore, in the current study, Penicillium chrysogenum SNP5 has been explored for the production of AlkB, which is a key player in the uptake of hydrocarbons as a carbon source. AlkB specific activity is growth associated, therefore, depends on various fermentation parameters. In this study, an initial experimental setup was done by taking 1% (v/v) hexadecane, 0.5% (w/v) YEPD, 0.1% (w/v) glucose as carbon source, and 1 mM TBH as a modulator for inducing higher production of AlkB under physicochemical conditions of pH (7.4), incubation time (11 days), incubation temperature (28 °C) and inoculum size (1 ml). The maximum AlkB specific activity found with this setup was 100 U/mg. Further, to find optimum conditions for improved AlkB specific activity, these parameters were varied between the assigned lower and upper bounds (Table 1), and the experimental layout was prepared by the ‘OVAT’ method (Irfan et al. 2014) where one factor was varied at a time keeping others constant (Table 2). A total of 103 experimental sets were obtained and the data were analyzed using an FFNN, where predicted values were compared with experimental outputs. It was observed that the AlkB specific activity altered with variation in different fermentation conditions (Table 2).
Further, FFNN was trained one by one with Levenberg–Marquardt (Rumelhart et al. 1986), Bayesian (MacKay 1992), Conjugate (Powell 1977), and scaled conjugate gradient (Moller 1993) methods. Out of all these methods, Levenberg–Marquardt back-propagation with trainlm algorithm showed a better R value (i.e., for Training: 0.987, validation: 0.984, test: 0.988, and overall: 0.987) shown in (Fig. 1) along with experimental outputs (41.9 to 198.94 U/mg) and simulated output (39.14 to 200.03 U/mg). The mean absolute percentage error (MAPE) and root mean square error (RMSE) observed were 0.053 and 6.801 U/mg. The final weights and biases values were optimized by minimizing the network error (Table 3), and the optimum result was found in a ‘6-12-1’ FFNN network topology for this study (Fig. 2).
The network performance by mean square error (MSE) (Fig. 3) and network error by error histogram (Fig. 4) were also analyzed. The network performance plot shows that the mean square error for training, validation and test converged after the 9th epoch (Fig. 3). The error histogram showed that most of the training errors occurred between − 9.77 and 6.8 values (Fig. 4). The overall outcome of the neural network showed the goodness of the ‘6-12-1’ neural topology and excellent correlation to train the input parameter data for the production of AlkB.
The outputs obtained from a neural network were optimized to get the best optimum input parameters for maximum AlkB specific activity using a genetic algorithm because existing algorithms possess only local optimization solutions for a nonlinear function, whereas, a genetic algorithm exhibits a global solution. After large numbers of genetic algorithms trials, the five best input conditions were selected (Table 4), which could depict the fittest possible input conditions. All these conditions were employed for further verification by setting up experiments, followed by the comparison of experimental AlkB specific activity with genetic algorithm outputs. An increase in AlkB specific activity was observed by 77.4% (198.94 to 351.32 U/mg) when FFNN outputs were optimized with GA.
Figure 5 depicts the optimum output of GA optimization along with the contribution of each variable where fitness values were expressed in terms of the mean value. The best fitness was obtained at the 161th generation at which fitness value and mean value were found aligned at a constant rate. Similar results have been reported by Subba Rao et al. (2008) for optimization of protease yield and Pappu and Gummadi (2017) for xylitol production.
Based on the results obtained after FFNN-GA optimization, surface contour plots were generated to examine the impact of one variable on another using fitness function with MATLAB R2020a. The maximum AlkB specific activity observed with the combined effect of two variables at the optimum environment is shown in Fig. 6. The variation in AlkB specific activity could be seen with the variation of hexadecane from 0.25 to 3% and incubation temperature from 20 to 38 °C (Fig. 6a). Maximum AlkB specific activity was observed with 1.5% of hexadecane as carbon source and incubation temperature of 38 °C. Further increase in hexadecane concentration beyond 1.5% led to a decrease in activity, which indicated that hexadecane concentration and incubation temperature together have a higher impact over AlkB production. Figure 6b indicated variation in AlkB specific activity with variation in inoculum size (1–3 ml) and variation in supplementation of hexadecane (0.5–3%) in the medium. The maximum AlkB specific activity was observed up to 1 ml of inoculum size and beyond this, there was a sharp decline in its specific activity. However, hexadecane concentration from 1 to 3% didn’t influence AlkB specific activity much. This indicated that inoculum size played the role of key regulator for AlkB specific activity when varied in combination with hexadecane concentration. The pattern of the contour plot between pH and incubation temperature indicated that temperature in the range of 20–35 °C did not influence AlkB specific activity significantly, whereas pH above 5 has shown a negative impact on it (Fig. 6c). In Fig. 6d combined impact of incubation period and inoculum size on AlkB specific activity is shown, which indicates that less than 8 days of incubation and more than 1.5 ml of inoculum size have the least impact on AlkB specific activity. From Fig. 6e, it can be observed that AlkB specific activity remained constant with 0.1–0.8 mM FeSO4 concentration and decreased with hexadecane concentration of more than 1.5%. Interaction of inoculum size (1–3 ml) and incubation temperature (20 °C to 38 °C) showed maximum activity near 30 °C temperature and 2.5 ml of inoculum size (Fig. 6f) and reflected that both parameters were collectively regulating AlkB specific activity. An incubation time (7–11.5 days) with pH (4.5–8.5) together had a positive impact on activity, whereas, an incubation time of more than 12 days didn’t have any impact irrespective of the pH value (Fig. 6g). From Fig. 6h it could be concluded that AlkB specific activity was maximum with 1.5% of hexadecane with 8–12 days of incubation, however, it sharply declined beyond 1.5% of hexadecane. The contour surface plots (Fig. 6) show that each combination/individual fermentation parameter has an effective contribution to the overall AlkB specific activity.
Considering promising industrial applications of AlkB, process parameters were optimized for the production of AlkB from Penicillium chrysogenum SNP5. It has shown good growth on hexadecane, and its uptake as a carbon source was confirmed by AlkB specific activity of 100 U/mg and the presence of fatty acids in ferment. Banu et al. (2010) and Kadri et al. (2018) have also reported similar results. Submerged fermentation was chosen because it provides larger surface area and high oxygen availability which assisted proper growth of Penicillium chrysogenum SNP5 and facilitated the uptake of hexadecane as a carbon source. With submerged fermentation, separation of cell biomass and extraction of membrane-bound AlkB is easier than solid-state fermentation (Flores-Flores et al. 2011). It is well established that yield of enzymes varies with critical process parameters and microbial strains because the growth and metabolism of microbes are dependent on the various physicochemical environment, nutritional factors and combinatorial impact of various process parameters (Vishwanatha et al. 2010; Saxena and Singh 2011; Narra et al. 2012). Six input variables were considered to study their individual and combined effect on AlkB specific activity through OVAT. Hexadecane was used as a carbon source in the media, as AlkB had shown high specificity for it, hence could act as an inducer for AlkB production. The pH of the media becomes crucial in case of submerged fermentation and AlkB is quite susceptible to pH as well. As AlkB is integral protein (membrane bound), hence its yield is biomass dependant and growth associated, therefore, incubation temperature and period would play critical role in achieving optimum yield. The number of viable cells in the inoculum ensures rapid proliferation and biomass synthesis which results in increased production of enzymes, hence, inoculums size was selected as an input parameter. FeSO4 was selected as one of the input parameters because AlkB is a nonheme iron-containing enzyme whose catalytic property is strongly dependent upon iron. Further, data obtained from OVAT were used for FFNN-GA to achieve optimum yield with the limited experimental data.
FFNN-GA optimization showed very promising results in terms of AlkB specific activity. The results obtained from FFNN like correlation charts (Fig. 1), performance plot (Fig. 3) and error histogram (Fig. 4) indicated that training of neural network was done very accurately with network topology of ‘6-12-1’ (Fig. 2). These results are close to the published results (Das et al. 2015; Negi et al. 2020; Suryawanshi et al. 2020). A high degree of accuracy of optimization is attributed to the selection of appropriate training algorithms (i.e., trainlm), several hidden layers and the size of ‘one-variable-at-a-time’ data for the training of the network.
The GA optimization utilizes the FFNN outputs to provide the global optimum solution for non-linear problems. Results obtained after GA optimization (Table 4) clearly indicated that there was significant increase in AlkB specific activity when it was cross-validated with experimental data, which reveals that the used fitness function generated the best fitness values for optimum AlkB specific activity. This could be possible due to significant weights and biases value obtained after network training and selection of appropriate values of GA parameters (i.e., population size, crossover probability, mutation probability and the number of generations, etc.). Similar results have been published by Badhwar et al. (2020) and Prakasham et al. (2011).
The 77.4% significant improvement from FFNN-GA optimization reveals that GA has efficiently used its reproduction function, crossover function and mutation functions to generate the good strings, new populations whereas iteration process might have able to find out the best global optimal solutions. The GA might have identified slight changes in inoculum size, incubation time, pH of the media and metal ion concentration as key regulators and generated the several combinations of these critical parameters to provide the highest yield. Specially, FeSO4 has higher impact on enhanced AlkB yield due to its dependency on iron for catalytic activity.
An interactive contour plot generated with the help of GA outputs (Fig. 6) indicated that the best optimum output could be achieved by generating an infinite number of combinations of two input variables keeping other variables at their optimized level. A similar study has also been reported by Salim et al. (2019). Enhanced AlkB specific activity with an increase in hexadecane concentration and incubation temperature (Fig. 6a) suggested that uptake of hexadecane was easier at higher temperatures which induced the higher production of AlkB. On the other hand, the effects of hexadecane and inoculum size on AlkB specific activity (Fig. 6b) suggested that up to 1 ml of inoculum size was sufficient for the utilization of 1–3% of hexadecane, which might be due to a higher percentage of cell viability in a spore suspension. Figure 6c indicated high sensitivity of AlkB towards variation in pH-temperature combo, hence, a slight change in pH from 7 reduced AlkB activity, which might be due to a change in the ionic state of the active site of AlkB. Figure 6d indicates that AlkB production started after 8 days of incubation. This might be due to the availability of simple carbon source (glucose) in the media which supported its initial growth and only after the consumption of glucose present in the media, hexadecane utilization might have started, which resulted in the production of AlkB. An increase in hexadecane concentration decreased AlkB activity (Fig. 6e), which could be due to its toxicity and hydrophobicity. Combination of inoculum size and temperature (Fig. 6f) had shown less impact on the AlkB specific activity beyond inoculum size of 1.5, whereas; incubation time and pH (Fig. 6g) together influenced AlkB specific activity much more. Figure 6h suggested that higher hexadecane concentration reduced the AlkB activity due to excess substrate accumulation of substrate toxicity.
An overall observation from contour plots indicates that specific activity of AlkB was influenced by hexadecane concentration in combination with some other parameters in the following orders: metal ion concentration > incubation time > inoculum size > incubation temperature (Fig. 6). The wide variation in enzyme yield shown in Table 2 (i.e. minimum 39.44 U/mg for 56th set and maximum 198.94 U/mg) emphasizes the significance of the machine learning-based optimization approach for the cost-effective production of membrane-bound enzymes. A similar data (i.e., minimum 71.33 U/ml, maximum 218.28 U/ml), has been reported by Sathish and Prakasham (2010). Specific activity of AlkB was improved by 77.4% (i.e. from 198.94 to 351.32 U/mg) when FFNN output enzyme production data was further optimized using GA. Sathish and Prakasham (2010) also reported a 47% improvement in the yield of glutaminase after ANN output enzyme production data was further optimized using GA. An overall 3.5-fold increase in AlkB specific activity (from 100 U/mg under preliminary un-optimized conditions to 351.32 U/mg after FFNN-GA optimization) has been achieved using FFNN-GA hybrid method in this study. Subba Rao et al. (2008) had also reported more than 2.5-fold improvement in alkaline protease yield using FFNN–GA hybrid methodology. From the statistical observation, the R- Value of 0.987 of FFNN training exhibited a better correlation between predicted and experimental data with ‘6-12-1’ FFNN topology. The overall smallest values of mean absolute percentage error (MAPE) and root mean square error (RMSE) of 0.053 and 6.801, respectively suggested that chosen network had good approximation and generalization aspects for the optimization of AlkB yield.
From the overall findings in this study, it can be concluded that OVAT strategy alone is not capable to find out optimal conditions for enhancing AlkB yield due to the requirement of a large number of experiments and lack of determination of interactions among various factors. FFNN-GA coupled optimization approach significantly enhanced the AlkB specific activity. The FFNN model ‘6-12-1’ showed the best prediction accuracy after training with the Levenberg–Marquardt (trainlm) algorithm. These findings signify the utility of the FFNN-GA approach for the enhanced production of Alkane hydroxylase from Penicillium chrysogenum SNP5 and optimization of other bioprocesses in the enzyme industry.
Availability of data and materials
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.
Feed Forward Neural Network
Mean absolute percentage error
Root mean square error
Mean square error
One variable at a time
Nicotinamide adenine dinucleotide (NAD) + hydrogen (H)
Potato dextrose agar
Critical micelle concentration
Al-Hawash AB, Zhang J, Li S, Liu J, Ghalib HB, Zhang X, Ma F (2018) Biodegradation of n-hexadecane by Aspergillus sp. RFC-1 and its mechanism. Ecotoxicol Environ Saf 164:398–408. https://doi.org/10.1016/j.ecoenv.2018.08.049
Aliakbari E, Tebyanian H, Hassanshahian M, Kariminik A (2014) Degradation of alkanes in contaminated sites. Int J Adv Biol Biomed Res 2:1620–1637
Ayala M, Torres E (2004) Enzymatic activation of alkanes: constraints and prospective. Appl Catal A-Gen 272:1–13. https://doi.org/10.1016/j.apcata.2004.05.046
Badhwar P, Kumar A, Yadav A, Kumar P, Siwach R, Chhabra D, Dubey KK (2020) Improved pullulan production and process optimization using novel GA–ANN and GA–ANFIS hybrid statistical tools. Biomolecules. https://doi.org/10.3390/biom10010124
Banu AR, Devi MK, Gnanaprabhal GR, Pradeep BV, Palaniswamy M (2010) Production and characterization of pectinase enzyme from Penicillium chrysogenum. Indian J Sci Technol 3:377–381
Chang C, Xu G, Yang J, Wang D (2011) Optimization of cellulase production using agricultural wastes by artificial neural network and genetic algorithm. Chem Prod Process Model. https://doi.org/10.2202/1934-2659.1553
Das S, Bhattacharya A, Haldar S, Ganguly A, Gu S, Ting YP, Chatterjee PK (2015) Optimization of enzymatic saccharification of water hyacinth biomass for bio-ethanol: comparison between artificial neural network and response surface methodology. Sustain Mater Technol 3:17–28. https://doi.org/10.1016/j.susmat.2015.01.001
Eidani SZ, Shahraki MK, Gasemisakha F, Hahsemi M, Bambai B (2012) Cloning and expression of alkane hydroxylase-1 from Alcanivorax borkumensis in Escherichia coli. Toxicol Ind Health 28:560–565. https://doi.org/10.1177/0748233711416953
Flores-Flores TC, Gutiérrez-Rojas M, Revah S, Favela-Torres E (2011) Comparative study for oxygenases produced by Aspergillus niger, ATCC 9642, in solid-state and submerged fermentation. Rev Mex Ing Quim 10:189–207
Irfan M, Nadeem M, Syed Q (2014) One-factor-at-a-time (OFAT) optimization of xylanase production from Trichoderma viride-IR05 in solid-state fermentation. J Radiat Res Appl Sci 7:317–326. https://doi.org/10.1016/j.jrras.2014.04.004
Izadifar M (2005) Neural network modeling of trans isomer formation and unsaturated fatty acid changes during vegetable oil hydrogenation. J Food Eng 66:227–232. https://doi.org/10.1016/j.jfoodeng.2004.03.010
Kadri T, Rouissi T, Magdouli S, Brar SK, Hegde K, Khiari Z, Daghrir R, Lauzon JM (2018) Production and characterization of novel hydrocarbon degrading enzymes from Alcanivorax borkumensis. Int J Biol Macromol 112:230–240. https://doi.org/10.1016/j.ijbiomac.2018.01.177
Kana EBG, Oloke JK, Lateef A, Oyebanji A (2012) Comparative evaluation of Artificial Neural Network coupled Genetic Algorithm and Response Surface Methodology for modeling and optimization of citric acid production by Aspergillus niger MCBN297. Chem Eng Trans 27:397–402. https://doi.org/10.3303/CET1227067
Kumar S, Mathur A, Singh V, Nandy S, Khare SK, Negi S (2012) Bioremediation of waste cooking oil using a novel lipase produced by Penicillium chrysogenum SNP5 grown in solid medium containing waste grease. Bioresour Technol 120:300–304. https://doi.org/10.1016/j.biortech.2012.06.018
Kumari A, Ahmad R, Negi S, Khare SK (2017) Biodegradation of waste grease by Penicillium chrysogenum for production of fatty acid. Bioresour Technol 226:31–38. https://doi.org/10.1016/j.biortech.2016.12.006
Luo Q, He Y, Hou DY, Zhang JG, Shen XR (2015) GPo1 alkB gene expression for improvement of the degradation of diesel oil by a bacterial consortium. Braz J Microbiol 46:649–657. https://doi.org/10.1590/S1517-838246320120226
MacKay DJC (1992) Bayesian interpolation. Neural Comput 4:415–447. https://doi.org/10.1162/neco.1922.214.171.1245
May R, Dandy G, Maier H (2011) Review of input variable selection methods for artificial neural networks. Artif Neural Netw Methodol Adv Biomed Appl 10:16004
McKenna EJ, Coon MJ (1970) Enzymatic ω-oxidation. J Biol Chem 245:3882–3889. https://doi.org/10.1016/s0021-9258(18)62932-1
Møller MF (1993) A scaled conjugate gradient algorithm for fast supervised learning. In: Neural Networks, pp 525–533
Murthy CA (2012) Genetic algorithms: basic principles and applications
Narra M, Dixit G, Divecha J, Madamwar D, Shah AR (2012) Production of cellulases by solid state fermentation with Aspergillus terreus and enzymatic hydrolysis of mild alkali-treated rice straw. Bioresour Technol 121:355–361. https://doi.org/10.1016/j.biortech.2012.05.140
Negi S, Jain S, Raj A (2020) Combined ANN/EVOP factorial design approach for media screening for cost-effective production of alkaline proteases from Rhizopus oryzae (SN5)/NCIM-1447 under SSF. AMB Express. https://doi.org/10.1186/s13568-020-00996-7
Nie Y, Chi CQ, Fang H, Liang JL, Lu SL, Lai GL, Tang YQ, Wu XL (2014) Diverse alkane hydroxylase genes in microorganisms and environments. Sci Rep 4:1–11. https://doi.org/10.1038/srep04968
Nørgaard M (2000) Neural network system identification version 2
Pappu SMJ, Gummadi SN (2017) Artificial neural network and regression coupled genetic algorithm to optimize parameters for enhanced xylitol production by Debaryomyces nepalensis in bioreactor. Biochem Eng J 120:136–145. https://doi.org/10.1016/j.bej.2017.01.010
Powell MJD (1977) Restart procedures for the conjugate gradient method. Math Program 12:241–254. https://doi.org/10.1007/BF01593790
Prakash Maran J, Priya B (2015) Comparison of response surface methodology and artificial neural network approach towards efficient ultrasound-assisted biodiesel production from muskmelon oil. Ultrason Sonochem 23:192–200. https://doi.org/10.1016/j.ultsonch.2014.10.019
Prakasham RS, Sathish T, Brahmaiah P (2011) Imperative role of neural networks coupled genetic algorithm on optimization of biohydrogen yield. Int J Hydrogen Energy 36:4332–4339. https://doi.org/10.1016/j.ijhydene.2011.01.031
Ramu R, Chang CW, Chou HH, Wu LL, Chiang CH, Yu SSF (2012) Erratum: Regio-selective hydroxylation of gem-difluorinated octanes by alkane hydroxylase (AlkB) [Tetrahedron Letters (2011) 52(23) 2950-2953]. Tetrahedron Lett 53:5458. https://doi.org/10.1016/j.tetlet.2012.07.099
Rojo F (2010) Handbook of hydrocarbon and lipid microbiology
Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representations by back-propagating errors. Nature 323(6088):533–536
Salim N, Santhiagu A, Joji K (2019) Process modeling and optimization of high yielding l-methioninase from a newly isolated Trichoderma harzianum using response surface methodology and artificial neural network coupled genetic algorithm. Biocatal Agric Biotechnol 17:299–308. https://doi.org/10.1016/j.bcab.2018.11.032
Sathish T, Prakasham RS (2010) Enrichment of glutaminase production by Bacillus subtilis RSP-GLU in submerged cultivation based on neural network—genetic algorithm approach. J Chem Technol Biotechnol 85:50–58. https://doi.org/10.1002/jctb.2267
Saxena R, Singh R (2011) Amylase production by solid-state fermentation of agro-industrial wastes using Bacillus sp. Braz J Microbiol 42:1334–1342. https://doi.org/10.1590/S1517-83822011000400014
Singh V, Haque S, Niwas R, Srivastava A, Pasupuleti M, Tripathi CKM (2017) Strategies for fermentation medium optimization: an in-depth review. Front Microbiol. https://doi.org/10.3389/fmicb.2016.02087
Subba Rao C, Sathish T, Mahalaxmi M, Suvarna Laxmi G, Sreenivas Rao R, Prakasham RS (2008) Modelling and optimization of fermentation factors for enhancement of alkaline protease production by isolated Bacillus circulans using feed-forward neural network and genetic algorithm. J Appl Microbiol 104:889–898. https://doi.org/10.1111/j.1365-2672.2007.03605.x
Suryawanshi N, Sahu J, Moda Y, Eswari JS (2020) Optimization of process parameters for improved chitinase activity from Thermomyces sp. by using artificial neural network and genetic algorithm. Prep Biochem Biotechnol 50:1031–1041. https://doi.org/10.1080/10826068.2020.1780612
Van Beilen JB, Funhoff EG (2007) Alkane hydroxylases involved in microbial alkane degradation. Appl Microbiol Biotechnol 74:13–21. https://doi.org/10.1007/s00253-006-0748-0
Vishwanatha KS, Rao AGA, Singh SA (2010) Acid protease production by solid-state fermentation using Aspergillus oryzae MTCC 5341: optimization of process parameters. J Ind Microbiol Biotechnol 37:129–138. https://doi.org/10.1007/s10295-009-0654-4
Wang W, Shao Z (2012) Genes involved in alkane degradation in the Alcanivorax hongdengensis strain A-11-3. Appl Microbiol Biotechnol 94:437–448. https://doi.org/10.1007/s00253-011-3818-x
Wardah W, Khan MGM, Sharma A, Rashid MA (2019) Protein secondary structure prediction using neural networks and deep learning: a review. Comput Biol Chem 81:1–8. https://doi.org/10.1016/j.compbiolchem.2019.107093
Xu J, Liu H, Liu J, Liang R (2015) Isolation and characterization of Pseudomonas aeruginosa strain SJTD-2 for degrading long-chain n-alkanes and crude oil. Wei Sheng Wu Xue Bao=acta Microbiologica Sinica 55:755–763
Zhang G, Fang B (2006) A uniform design-based back propagation neural network model for amino acid composition and optimal pH in G/11 xylanase. J Chem Technol Biotechnol 81:1185–1189. https://doi.org/10.1002/jctb.1510
The authors are grateful to the Department of Biotechnology, Government of India and Ministry of Human Resource Development, Government of India, for providing financial assistance and Motilal Nehru National Institute of Technology, Allahabad, India providing facilities and space to carry out this study.
No funding is available.
Ethics approval and consent to participate
This article does not contain any studies with human participants or animals performed by any of the authors.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Das, S., Negi, S. Enhanced production of alkane hydroxylase from Penicillium chrysogenum SNP5 (MTCC13144) through feed-forward neural network and genetic algorithm. AMB Expr 12, 28 (2022). https://doi.org/10.1186/s13568-022-01366-1
- Alkane hydroxylase (AlkB)
- Feed Forward Neural Network (FFNN)
- Genetic algorithm (GA)
- Penicillium chrysogenum SNP5
- Submerged fermentation (SmF)