Textile Industry Wastewaters From Jetpur, Gujarat, India, Are Dominated by Shewanellaceae, Bacteroidaceae, and Pseudomonadaceae Harboring Genes Encoding Catalytic Enzymes for Textile Dye Degradation

Textile industries play an important role in uplifting the national economies worldwide. Nevertheless, they generate a huge amount of intensive colored effluent, which is a serious threat to the environment. The microbial communities present in these highly polluted environmental sites help in remediating pollutants naturally. However, little is known about their genes and enzymes in the textile wastewater systems. In this study, we explored the microbial community structure and their functional capability in three different wastewater systems, i.e., industry sites, effluent treatment plant (ETP), and common effluent treatment plant (CETP). Our findings based on shotgun metagenomics highlight the varied bacterial diversity at the three industry sites. Overall, the major dominant phyla in the industry site and CETP samples were Proteobacteria and Bacteroidetes, while in the ETP site, Firmicutes, Cyanobacteria, and Proteobacteria were predominant. The final discharge sample site was having a higher proportion of the Proteobacteria and Bacteroidetes. Aeromonas caviae, Desulfovibrio desulfuricans, Klebsiella pneumoniae, Pseudomonas stutzeri, Shewanella decolorationis, Shewanella oneidensis, Shewanella putrefaciens, and Vibrio cholera were the abundant species across the three sites. Furthermore, this research study identified the key microbial genes encoding enzymes having a known role in textile dye and aromatic compound degradation. Functional annotation of the shotgun metagenome samples indicates the presence of reductase, azoreductase, nitrate/nitrite reductase, and oxidoreductase enzyme encoding genes. Our findings provide the shotgun metagenomics-based approach for mining the textile dye degrading genes and genomic insights into the bioremediation of textile industrial effluent.


INTRODUCTION
The environmental pollution and toxic substances produced by the industrial manufacturing process are major challenges faced worldwide. Textile industries heavily rely on water during pre-processing of raw material and dyeing and printing of fabric and simultaneously consume a huge amount of synthetic chemicals and pigments. The pollutants generated from several steps involved in the textile industries create various ecological problems due to the generation of a large amount of wastewater containing pollutants. Textile industry wastewater contains a variety of persistent coloring pollutants and synthetic dyes, phenols, formaldehyde, phthalates, surfactants, chlorophenol and aromatic compounds and heavy metals such as lead (Pb), cadmium (Cd), arsenic (As), chromium (Cr), zinc (Zn), and nickel (Ni), chlorides, and sulfates (Hossain et al., 2018;Kishor et al., 2021). Synthetic chemicals, pigments, and dyes add to the complexity of the wastewater effluents and sludge, causing damage to the environment (Chen et al., 2021;Manickam and Vijay, 2021;Methneni et al., 2021).
Subsequently, the usage of synthetic dye has been increased in the textile and dyeing industry due to their cost-effectiveness, high stability to light, temperature, and detergents, and resistance to microbial attack as compared to natural dye (El-Kassas and Mohamed, 2014;Anantharaman et al., 2016;Kumar et al., 2019). Almost 10,000 different synthetic dyes are currently manufactured, mainly azo dyes, and are frequently used by the textile and various other industries for dyeing because they are more economically stable than natural dyes (Xu et al., 2007;He et al., 2016;Garg and Tripathi, 2017). Azo dyes are the largest class of synthetic dyes, which are characterized by the presence of azo (-N N-) groups in their structure (Verma et al., 2012;Yaseen and Scholz, 2019). These chemical dyes are toxic and highly persistent in the environment and their metabolic products are mutagenic and carcinogenic in nature (Xu et al., 2007). Therefore, understanding azo dye degrading enzymes and functional genes is crucial for elucidating their molecular mechanisms for the development of efficient bioremediation technology (Dixit and Garg, 2018). Further, investigating the complex dynamics of microbial diversity and community structure of the textile wastewater treatment sites can provide crucial information regarding novel molecular pathways, biosynthetic gene clusters (BGCs), and textile dye degrading genes encoding enzymes.
Microorganisms play an important role in the degradation of various pollutants through their adapted metabolic and enzymatic activity. However, the potential relationship between bacterial taxa and the niches occupied by specific microorganisms in textile wastewater and sludge is largely unknown. Therefore, the characterization and identification of microbes and their genes mediating textile dye degradation can be an ecofriendly way to treat such a complex industry effluent. Therefore, this study was undertaken to explore the microbial community and their potential genes, which may have a role in the bioremediation of textile industry effluent. Therefore, for this study, we collected textile industry effluent samples from the ETP and CETP located in Jetpur, Gujarat, India, and carried out whole-genome shotgun sequencing.

Sampling Sites
The textile industry wastewater samples were collected from Jetpur, Gujarat, India (21°37′36.42″N, 70°37′33.03″E), which is one of the largest textile industry hubs in India. The geotagged images of sampling site locations are depicted in Figure 1, while Supplementary Table S1 provides the details of the sampling site with approximate operational members and wastewater treatment capacity. The textile wastewater samples were collected in the sterile 50 ml falcon tubes from a total of nine sampling points representing three different sites, namely, textile industry site, effluent treatment plant (ETP), and common effluent treatment plant (CETP) site. The samples were stored at 4°C for downstream processing. The samples from industry sites (IND_A and IND_B) were collected from the print houses of the textile industry sites, while wastewater samples were collected from the common effluent treatment plant (CETP) and effluent treatment plant (ETP) sites.

Deoxyribonucleic Acid Isolation and Shotgun Metagenomics Sequencing
The total genomic DNA was extracted using the DNeasy PowerWater DNA isolation kit (QIAGEN) as per the manufacturer's protocol. The quality of total genomic DNA was checked on 0.8% agarose gel visualized on a gel documentation system (Bio-Rad). The quantity and purity of the DNA were assessed using the QIAxpert System (QIAGEN) and Qubit 4 Fluorometer (Invitrogen). Approximately 250 ng of genomic DNA was used for the shotgun library preparation using the Ion Xpress Plus Fragment Library Kit (Thermo Fisher Scientific) according to the manufacture's protocol. For the library size selection, we used E-Gel SizeSelect II, 2% agarose gel electrophoresis (Invitrogen), and Agencourt AMPure XP kit (Beckman Coulter) was used for purification. Shotgun metagenome sequencing was carried out using the Ion torrent S5 platform with 400 bp chemistry and Ion 530 chip.

Data Pre-Processing and Microbial Community Structure Analysis
Shotgun metagenome sequencing raw data files were uploaded on the MG-RAST version 4.0.3 (Aziz et al., 2008) server for taxonomic and functional analysis. After quality control and filtering (dereplication, removing the artificial sequences, and ambiguous bases), sequences with the "Best Hit" criteria were used for the taxonomical analysis with default parameter (i.e., E-value of 1 × 10 −5 , the minimum identity of 60%, and a minimum length of 15 amino acids). For the functional annotation, each of the metagenomic reads was annotated against the NCBI reference sequence (RefSeq) database in MG-RAST with default parameters (i.e., E-value of 1 × 10 −5 with 60% identity).

Shotgun Metagenome Assembly and Annotation
For functional annotation, reads were further processed using the SqueezeMeta bioinformatics analysis pipeline (Tamames and Puente-Sánchez, 2019). The co-assembly mode was used for FIGURE 1 | Wastewater effluent samples were collected from the Jetpur textile industry cluster from Gujarat, India, which is a major hub of textile-based industries. The sampling sites included were effluent treatment plant (ETP, 1) (A) and common effluent treatment plant (CETP, 1) (B), where the final discharge samples were treated in the lagoon by the phytoremediation process with the nearby plant species and wash area from the dyeing and print house. The reddish/orange color indicates the washing area around the effluent treatment plant.
Frontiers in Environmental Science | www.frontiersin.org September 2021 | Volume 9 | Article 720707 the annotation of genes in each metagenome sample. MEGAHIT v1.2.9 (Li D. et al., 2015) and Prodigal (Hyatt et al., 2010) were used for the assembly and gene prediction, respectively. The predicted genes were searched for homology against the KEGG database for the functional annotation (Kanehisa et al., 2016) of the genes in the contigs. The genes encoding for the enzymes involved in the textile dye degradation were identified from the nine different sampling points representing three different sites, and the biosynthetic gene clusters were reconstructed based on the enzyme encoding genes (Supplementary Table S2). For the identification of specific functional gene prediction, KEGG IDs were manually filtered based on the criteria of their role in involvement in the gene clusters known for textile degradation with predicted function based on Enzyme Commission (EC) number.

Statistical Analysis
Estimation of parametric and non-parametric diversity indices was processed using the PAST v4.0.3 software (PAST Hammer et al., 2001). Further, based on the KEGG and COG functional annotation, the metagenome samples were clustered following the principal component analysis (PCA). PCA plot and sample clustering dendrogram were done using the Microbiome Analyst tool (Dhariwal et al., 2017), and the heatmap of the xenobiotic compound degrading genes was constructed using Clustvis (Metsalu and Vilo, 2015).

Estimation of Richness and Diversity of the Microbial Community
The sequencing output generated from shotgun sequencing is tabulated in Supplementary Table S3. The rarefaction curve depicts the species richness and sequencing depth in the shotgun metagenome samples. Parametric and non-parametric diversity indices of the microbial community in the textile industry effluent, ETP, and CETP are shown in

Abundance and Microbial Diversity Analysis
Microbial diversity of the textile industry wastewater samples from three different sample collection sites indicated the  Figure 2. It is interesting to note that the dominance of Bacteroidetes in the industry effluent sites (IND_A and IND_B) is also evident from the literature where the dominant taxonomic abundance was possibly involved in the cleavage of azo dye compounds under anaerobic circumstances (Li C. et al., 2015;Waktole et al., 2020;Samuchiwal et al., 2021), highlighting the bioremediation potential of the taxonomically dominant microbial communities in the textile industry effluents. The inlet sample from the effluent treatment plant contains a mixture of compounds used during the textile printing and processing and provides better opportunities for the microbial communities to thrive on a multitude of compounds. Unclassified sequences (derived from Bacteria) were abundant in ETP_INLET (52.79%) which is the effluent treatment plant inlet site, followed by CETP_PS_OUTLET (33.46%) at the primary stage outlet site, CETP_SS_INLET (36.44%) at the secondary stage inlet site (IND_A) (34.54%), textile industry site (CETP_PS_INLET) (24.55%), primary stage inlet (CETP_SS_OUTLET) (19.97%) secondary stage outlet, CETP_Final_Discharge (19.52%) at the final outlet site of the common effluent treatment plant site, and ETP_OUTLET (17.56%) and IND_B (7.24%) sample at the industry site. It shows that as the treatment of wastewater processes, the proportion of uncultured bacteria is reduced. This may be because very few uncultured bacteria from the initial load have adapted to the conditions in the treatment plant and may play a crucial role in the degradation of waste from the textile industries. However, the role of the unclassified sequences needs to be explored further as microbial activity and adaptability in the textile industry effluents provide a biological mechanism for the reduction of the azo group of compounds. Bacterial enzymes, especially oxidoreductases, play an important role in the degradation of synthetic dyes and the dominant microbial community structure at the family level of classification reveals key species known for adaptability under highly toxic wastewater. At the family level, Shewanellaceae (48.95%), Bacteroidaceae (29.86%), and Campylobacteraceae (6.16%) were found in textile industry site A (IND_A) sample. On the other hand, in the textile industry site B (IND_B) sample, Bacteroidaceae (92.67%), Shewanellaceae (3.21%), and Aeromonadeaceae (1.48%) were dominant. Further exploration at the family level of taxonomic dominance highlighted the possible role of catalytic enzymes encoded by the genes present in these microbial communities and significantly highlighted the structural and functional performance to adapt and degrade the textile dyes, complex aromatic and benzene ring compounds, and synthetic pigments. Shewanellaceae and Bacteroidaceae are well-known taxonomic families that are characterized from the industrial sludge and wastewater Frontiers in Environmental Science | www.frontiersin.org September 2021 | Volume 9 | Article 720707 treatment plants, process industries, and fermenting and tannery waste treatment sites that contain highly toxic and complex wastewater generated during the processing steps (Xu et al., 2007;Meerbergen et al., 2017). Therefore, we anticipate the potential role and decipher the metagenomics features identified through dominant taxonomic abundance in the textile industry wastewater samples collected from the industry sites, common effluent treatment plant (CETP), and effluent treatment plant (ETP). Similarly, in the effluent treatment plant (ETP) inlet sample (ETP_INLET), Rhodocyclaceae (51.65%) and Ectothiorhodospiraceae (48.34%) were dominant, while in the sample ETP_OUTLET, Ectothiorhodospiraceae (34.14%), Oceanospirillaceae (15.89%), and Piscirickettsiaceae (15.46%) were dominant. Simultaneously, the dominant microbial populations were analyzed, which are associated with different treatment processes of the effluent treatment plant (ETP), common effluent treatment plant (CETP), and dyeing and printing wash area of the local textile print houses as revealed by the taxonomic abundance. Furthermore, the relative abundance of Bacteroidaceae was higher in the CETP_PS_INLET (66.94%) primary stage inlet of the CETP than that of the CETP_PS_OUTLET (23.38%) primary stage outlet of the common effluent treatment plant. Moreover, the Desulfovibrionaceae (9.41%) and Enterobacteriaceae (7.05%) were also present in CETP_PS_INLET, whereas 30.18% remained unassigned. It was observed that in the CETP_SS_INLET and CETP_SS_OUTLET samples, Pseudomonaceae and Flavobacteriaceae were 1.20%. Desulfovibrionaceae and Pseudomonaceae have a relative abundance >1%. However, CETP_SS_OUTLET showed the predominance of Bacteroidaceace (71.52%) among all the metagenome datasets, while the dominant bacterial family in CETP_Final_Discharge was represented by Bacteroidaceae (53.07%), Enterobacteriaceae (10.47%), and Desulfovibrionaceae (8.63%), as mentioned in Figure 3. Furthermore, the potential role of the abundant taxonomic families is also interesting to understand in nutrient recycling and biogeochemical cycling processes during the treatment process. The results indicated that multiple deterministic processes, such as effluent load, biogeochemical cycling process, nutrient requirement, adaptive capacity, and biological competition, possibly established the microbial community profiles in textile industry sites, CETP, and ETP. Moreover, based on microbial diversity assessment, purposeful bacteria such as sulfur and nitrogen cycling-related bacteria were widely distributed in the textile wastewater process showing strong biological associations among them (Watsuntorn et al., 2019). At the genus level, the top three genera in textile industry site A (Ind_A) metagenome sample were Shewanella (53.43%), Bacteroides (17.30%), and Denitrovibrio (6.13%); in textile  industry site B (IND_B), they were Bacteroides (76.33%), Shewanella (7.62%), and Pseudomonas (2.26%). In the effluent treatment plant (ETP) sites, Bacillus (30.10%), Polaromonas (7.09%), and Acidovorax (6.81%) were dominant at the ETP inlet site (ETP_INLET), while Cyanothece (9.39%), Oscillatoria (8.56%), and Microcoleus (8.46%) were dominant in ETP_OUTLET. Bacteroides (41.43%), Desulfovibrio (17.13%), Sulfurospirillum (12.58%), Aeromonas (5.16%), and Shewanella (3.79%) were dominant in CETP_PS_INLET, whereas Leadbetterella (11.62%), Hyphomonas (9.72%), and Pseudomonas (7.63%) were dominant in CETP_PS_OUTLET. Similarly, Pseudomonas (12.29%) and Hyphomonas (10.95%) were dominant in the CETP_SS_INLET, while Bacteroides (36.70%), Desulfovibrio (8.72%), Clostridium (6.32%), Pseudomonas (4.45%), and Denitrovibrio (2.77%) were abundant in the common effluent treatment plant secondary stage outlet site (CETP_SS_OUTLET). Lastly, in the final wastewater discharge site (CETP_Final_Discharge), the relative abundance of the genus Bacteroides (24.52%) decreases, whereas genera Desulfovibrio (9.78%), Pseudomonas (8.38%), and Clostridium (7.02%) increased as compared to the textile industry effluent sites (IND_A and IND_B) sampling points, as shown in Figure 4. The major species present in the sample textile industry (IND_A) were identified as Shewanella xiamenensis (46.88%), Bacteroides graminisolvens (29.86%), and Sulfurospirillum sp. MES (6.16%), whereas Bacteroides graminisolvens (92.67%), Shewanella xiamenensis (2.54%), and Aeromonascaviae (1.48%) were predominant in textile industry site (IND_B), and effluent treatment plant (ETP) has an abundance of the species Thauera sp. (51.65%), Ectothiorhodospira sp. PHS-1 (48.34%), Ectothiorhodospira sp. PHS-1 (34.14%), Thioalkalimicrobium aerophilum (12.27%), and Nitrincola lacisaponensis (12.11%). Novel strains of Gramnegative, non-motile, non-spore-forming rods Bacteroides graminisolvens are reported to be xylanolytic under anaerobe isolated from methanogenic bioreactors at an optimal pH of 7.2 and the optimum growth temperature was 30-35°C (Hatamoto et al., 2014). Moreover, Bacteroides graminisolvens (65.56%), Klebsiella sp. (6.88%), and Desulfovibrio desulfuricans (5.81%), Bacteroides graminisolvens (23.38%), and Flavobacterium sasangense (16.14%) were dominant in the primary stage inlet site (CETP_PS_INLET). On the other hand, Pseudomonas stutzeri (14.06%) and Flavobacterium sasangense (12.01%) were dominant in the secondary stage inlet (CETP_SS_INLET). At the species level, in the secondary stage outlet (CETP_SS_OUTLET), Bacteroides graminisolvens (71.58%), Desulfovibrio desulfuricans (5.49%), and Pseudomonas stutzeri (5.18%) were dominant. In the CETP_Final_Discharge sample, the dominance of Bacteroides graminisolvens (53.07%) and Klebsiella sp. (10.47%) was observed. The roles of identified catalytic enzyme encoding genes from the sample collection site and the dominant Frontiers in Environmental Science | www.frontiersin.org September 2021 | Volume 9 | Article 720707 bacterial communities, as reflected in the relative abundance (%) of the determined biosynthetic gene clusters, have to be studied and understood because they might play a significant role in the complete breakdown of the complex chemical dyes during the treatment process. Furthermore, based on the enriched metabolic pathway analysis and enzymatic abundance features, potential candidate gene families indicate xenobiotic degradation, aromatic compound, and synthetic pigment degradation, as highlighted in Figure 5. Furthermore, understanding complex microbial interactions at the genomic level could help develop suitable microbial technologies for the removal of recalcitrant compounds (Dai et al., 2020). Enzymatic degradation of azo dyes, polyaromatic compounds, and phenolic complexes by nitroreductase and azoreductase has been widely studied and reported to help in the process of remediating textile industry-based wastewater and treatment process (Khan and Malik, 2018;Yesilada et al., 2018;Mishra et al., 2020). These observations demonstrated that oxidoreductase, nitroreductase, and azoreductase enzymes from microorganisms were effectively helpful in the decolorization and degradation of synthetic textile dyes. The dominant microbial communities further highlight the significant role in the textile dye degradation process and remain the possible source of the dye degrading genes encoding the catalytic enzymes during the treatment process. Further, enzymatic reduction of azo dyes requires the addition of cofactors as electron donors for the reductive cleavage resulting in the production of aromatic amines, which are more potentially toxic health and environmental hazards. Similarly, an engineering approach and photocatalytic-based integrated design for optimal activity and efficient detoxification of wastewater containing harmful substances can be a helpful strategy at highly variable environmental conditions (Mate and Alcalde, 2015;Cardoso et al., 2016).

Enzymes Involved in Mediating Textile Industry Effluent Treatment Process
The microbial diversity of the textile industry effluents harbors the dye degrading genes and remains completely unexplored in the context of the uncultured microbial community dynamics. This research study findings indicate that the identified dominant bacterial communities were known for the bioremediation potential for the heavy metals, aromatic compounds, synthetic dyes and pigments, chemicals, and xenobiotic compounds from the highly polluted sites (Dixit and Garg, 2019;Redfern et al., 2019). Bacterial isolates help mediate the treatment process since they are capable of utilizing degraded byproducts of compounds as electron acceptors, including oxygen, iron, manganese, sulfate, nitrate, and nitrite. Microbial enzymes play an important role in degrading various xenobiotic compounds (Shah and Jha, 2013;Mate and Alcalde, 2015). Moreover, enzymes including oxidoreductases, monooxygenases, dioxygenases, peroxidases, and laccases have an important role in the bioremediation of various xenobiotic compounds. The details of the predicted and annotated features are shown in Supplementary Table S7 from the shotgun metagenome assembly, while Figure 6 highlights the read normalized relative abundance of the enzyme classes gene families identified from the different metagenome samples from textile industry effluent sites, common effluent treatment plant (CETP), and effluent treatment plant (ETP) sites.
Microbial oxidoreductases can mediate the degradation of synthetic and harmful pollutants, reduce the toxicity caused by xenobiotic chemical compounds, and reduce heavy metals present in the textile industry wastewater through their oxidation and reduction potential. Similarly, monooxygenases and dioxygenases can play a key role in the degradation and detoxification of aromatic amines, phenols, and benzoate compounds through hydroxylation and aromatic ring cleavage, while peroxidases help in bioremediation processes due to their structural stability and capacity to oxidize a wide range of chemical substrates (Forgacs et al., 2004). Similarly, Laccases can act on different kinds of polluting substances, such as aromatic hydrocarbons, paints, polymers, synthetic dyes, chemicals, and pesticide substances, via oxidation reactions, decarboxylation, and demethylation, which further can oxidize phenols and polyphenols, and oxidation and reduction of heavy metals, polyamines, azo group compounds, and aryl diamines groups present in the textile industry wastewater treatment process (Liu et al., 2011;Mate and Alcalde, 2015;Gaur et al., 2018;Yesilada et al., 2018). Similarly, NADH: quinone oxidoreductase (complex I) is a multi-subunit integral membrane enzyme found in both bacteria and eukaryotic cell organelles. Even though little is known about the unconventional role of these enzymes in the textile wastewater treatment process, they play a critical role in bacterial adaptation to the diverse energetic lifestyles in biogeochemical cycling processes under harsh environmental conditions at polluted sites, including aerobic and anaerobic conditions. Comparative genomic studies suggest a wide role for NADH: quinone oxidoreductase in re-oxidizing NADH produced from various biological and catabolic reactions, including the tricarboxylic acid (TCA) cycle and fatty acid beta-oxidation. Together, these findings highlight the importance of enzymes and their encoding genes in shaping diverse physiological functions across the diverse microbial communities and domains mediating the bioremediation process in the industrial wastewater treatment process of the industry effluents (Kaur et al., 2018;Yang et al., 2020). Furthermore, the presence of the nitro group (-NO 2 ) compounds in the textile wastewater process highlights the importance of the nitrate/nitrite oxidoreductase class of enzymes. Synthetic pigments and chemical compounds are part of the textile dye industry and printing houses. Therefore, aromatic and benzene ring nitro-compounds pollution has been a serious threat in environmental sites and water bodies, particularly due to the industrial wastewater being released into freshwater ecosystems. Bacterial heterotrophic nitrifier and aerobic denitrifier such as Klebsiella pneumoniae were isolated and reported from domestic wastewater and were also identified in the analysis of the metagenome samples with higher abundance in various samples at species level taxonomic identification suggesting a much broader role in bioremediation process at harsh environmental conditions Frontiers in Environmental Science | www.frontiersin.org September 2021 | Volume 9 | Article 720707 (Dixit and Garg, 2018). The overall abundance of the assigned enzyme classes to the assembled contigs of the assembled shotgun metagenome samples. The oxidoreductases, transferases, hydrolases, lyases, isomerases, and ligases were present in different sampling sites collected from Jetpur, Gujarat, as mentioned in Supplementary Table S8. The bioremediation process by bacterial species may further mitigate changes to the nitrogen and sulfur cycling process, thus helping maintain the survival fitness and nutrient cycling balance of the prokaryotic community with adapted functions in the wastewater treatment process. The key genes encoding for nitrate/nitrite reductase were and were nirA, nirB, nirD, nirI, nirK, nirS, NIT-6, nrfA, nrfG, nrfH, narG, narZ, nxrA, narH, narY, and nxrB, and their abundance was quantified from the metagenome samples. These findings highlight the significance and the important role of nitrite reductases as important enzymes with better bioremediation potential for the textile industry effluents. Similarly, azoreductases act on the azo group of compounds and help in the degradation and bioremediation process of the toxic compounds present in the textile industry effluent and wastewater in a non-toxic and ecofriendly approach for the effective treatment of the hazardous chemical substances present in the effluent sites. This research study reveals that the important genes encoding azoreductase enzymes were acpD and azoR encoding FMN-dependent NADH-azoreductase known for the bioremediation potential for the industry effluent sites. The part of the textile dyes used for dying fabrics does not bind to the fabric, and as a result, it is released in the wastewater effluent (Robinson et al., 2001). These dyes can be harmful and carcinogenic in nature (Alves de Lima et al., 2007); therefore, their removal from the industry effluent is essential to reduce the ecological threats. All these serious concerns have prompted scientific research into both mechanical and electrochemical reduction of azo dyes and the use of both aerobic and anaerobic bacteria for bioremediation as integrated technology for efficient process development. Apart from the synthetic dyes and pigments, textile industries also utilize other intermediate chemicals, such as sodium hydroxide (NaOH), oxalic acid, hydrogen peroxide, and acetic acid, during the pre-processing of textile fabrics before the dyeing and printing processes, which may react with the chemical dyes and form recalcitrant compounds that can hinder the treatment process and influence the functional potential of the microbial diversity of the effective microbial population Stolz, 2001;Xu et al., 2007;Wijetunga et al., 2010;Ghodake et al., 2011;Yang et al., 2020). Therefore, metagenome studies can better highlight and aid the selection criteria for the microbial consortium development for the effective treatment process of the wastewater synthetic dyes, pigments, and chemicals. The detection and identification of the aldehyde oxidoreductase and 2-oxoacid ferredoxin oxidoreductase enzymes also highlight the role of the electron acceptor components essential in the functional potential of the specific enzyme activity in the wastewater treatment process, thus effective in the

Biosynthetic Gene Clusters and Textile Dye Degradation in Effluent Treatment Process
Metagenome analysis from the textile wastewater samples indicates that it is crucial for the microorganisms in dyecontaminated sites to adapt and perform survival functions under highly polluted environmental conditions. Furthermore, the genes encoding the transporter functions are ABC transporters, amino acid substrate-binding proteins, peptidases, permeases, and transferases. The relative abundance (%) of the key oxidoreductase enzymes found in the textile industry effluent sites, common effluent treatment plant (CETP), and effluent treatment plant (ETP) is given in Figure 7. The dominant catalytic enzymes identified in the industry effluent sites (IND_A and IND_B) were NADH-quinone oxidoreductase, NADH oxidoreductase, and pyruvate-ferredoxin oxidoreductase.
The dominant microbial enzymes in the ETP sites were represented by NADH-quinone oxidoreductase and pyruvateferredoxin/flavodoxin oxidoreductase. Similarly, the CETP dataset showed the dominance of NADH-quinone oxidoreductase and pyruvate-ferredoxin/flavodoxin oxidoreductase. In contrast, the sample collected from the final discharge site (CETP_Final_Discharge) showed the dominance of the uncharacterized oxidoreductase apart from the NADHquinone oxidoreductase and pyruvate-ferredoxin/flavodoxin oxidoreductase. These enzymes perform basic metabolic functions and maintain osmoregulatory processes for bacterial adaptation and survival in polluted sites where xenobiotic compounds are abundant. Based on the gene mining and combining the tricarboxylic acid (TCA) cycle, β-oxidation and electron transport chain processes of the microbial communities were deciphered from the taxonomic abundance; the majority of the research studies were explained from the perspective of the important textile dye degrading enzymes in the metabolic pathway at the individual gene level. However, only a few literature reports have discussed the degrading enzymes, their encoding functional genes along with the source of the bacterial Frontiers in Environmental Science | www.frontiersin.org September 2021 | Volume 9 | Article 720707 species, mechanism of action in each step of the degradation pathway, and pilot-scale end-of-pipe removal of volatile aromatic compounds from textile dyeing wastewater treating plant (Freedman et al., 2005;Chen B. et al., 2009;Chen B.-Y. et al., 2009;Yaseen and Scholz, 2019). Therefore, these proposed gene clusters further provide a deeper insight into the molecular degradation mechanism of textile dyes during the industrial wastewater treatment process.

Metabolic Pathways and Mechanism for the Biodegradation of Azo Dyes in Textile Industry Effluents
Multiple copies of known genes encoding enzymes recognized for dye degradation were detected from the textile industry wastewater samples using the shotgun metagenomics approach, which further points towards the key role of integrated biosynthetic gene clusters (BGCs) for the synthetic pigments and chemical dye degradation during the wastewater treatment process. The results of the metagenome dataset collected from the industrial sites were dominated by the presence of oxidoreductase and reductase classes of enzymes. Therefore, sequence-based screening of shotgun metagenomics combined with the database annotations such as Kyoto Encyclopedia of Genes and Genomes (KEGG) and Cluster of Orthologous Groups of Proteins (COG) provides a robust approach for identification and characterization of the abundance of the enriched genes encoding for aromatic compound catabolism. Furthermore, wastewater microbial communities are often harbored by mobile genetic elements, such as plasmids, transposons, and integrative and conjugative elements that effectively disseminate the catabolic traits to phylogenetically distinct and diverse bacterial species (Ying et al., 2002;Jain et al., 2020). Previous research studies have shown that functional genes are a powerful biomarker for determining the potential of indigenous microbial communities to degrade xenobiotic compounds (More et al., 2014;Miao et al., 2015;Zhang et al., 2019;An et al., 2020;Li et al., 2020) during the wastewater and industrial effluent treatment process. Figure 8 shows the genes involved in proposed biosynthetic gene clusters (BGCs) for the textile dye degradation in the textile industry effluents. BCG-A indicates the involvement of 14 genes encoding different enzymes for the bioremediation potential in the wastewater treatment process. The higher abundance of the methyl-accepting chemotaxis proteins (MCPs) encoding genes significantly points towards the role of adaptation of the bacterial communities in the polluted environmental conditions, which further act as chemotaxis signal transducer protein sensing oxygen and were indicative of redox potential of the industry effluent sites (Engström and Hazelbauer, 1980). According to metagenome studies, textile dye degrading pathways are rich and abundant, with higher copy numbers of dye degrading enzyme encoding genes than the whole genome sequences. Therefore, we further speculate that these research findings shall considerably enhance the known gene pool diversity of the textile dye degrading genes and their encoded enzymes from the shotgun metagenomics approach. Thus, the results of this research study suggest that textile industry effluents are probably influencing the occurrence and abundance of synthetic and xenobiotic compound degrading genes across the sampling points. At the same time, the functional potential of the enzyme encoding needs to be further examined for critical gaps in the preliminary experiment in the design of the treatment strategies with scope to improve the efficiency of the textile industry wastewater treatments plants. Furthermore, validation of the polyaromatic hydrocarbon (PAH) and textile dye degradation functional genes in treatment sites can be a useful approach for biomonitoring the bioremediation process at the industry effluent sites (Zhang et al., 2019). Additionally, combined with the metatranscriptomic approach, it can be used to understand the significant role of the functional genes contributing to microbial communities for textile degradation during active metabolic processes. Also, systematic studies on the effect of azo dye gradients on azo dye-related degradation genes are required to assess the role of azo dyes on the proliferation and prevalence of azo dye-related degradation genes in environmental sites (Sho et al., 2004;Suenaga et al., 2009;Zhuang et al., 2020;Srinivasan and Sadasivam, 2021;Wang et al., 2021). Furthermore, in the continuum of the recent research studies and the context of the textile industry-based shotgun metagenomics approach, we have tried to address the functional potential of the microbial community structure dominant in the industrial wastewater, ETP, and CETP sites. Thus, leveraging the genomic attributes defining the microbial population and genes encoding the catalytic enzymes could help in the ecological and efficient treatment process.

CONCLUSION
This research study highlights the overall dominance of the Shewanella decolorationis, Pseudomonas stutzeri, Aeromonas caviae, Desulfovibrio desulfuricans, and Klebsiella pneumoniae at the species level in all metagenome samples in all industrial and ETP sites. The hidden diversity is an untapped source of the biotechnological intervention based on the enzymatic degradation of the recovery of the useful products through recycling and process optimization specifically suited to the industrial wastewater treatment process. This further makes them highly suited for bioremediation of contaminated sites and waste removal process, and hundreds of uncharacterized cultivable species remain unexplored from these highly polluted environmental sites. This study reports the shotgun metagenome-based approach to decipher the microbial community from textile industry effluent treatment process and industry sites revealing the relative abundance of the significantly important genes encoding catalytic enzymes useful for the bioremediation of wastewater. Further, it highlights the integrated genomics approach by exploring the textile dye degrading genes and microbial community structure from the entire metagenome gene pool of the cultured and uncultivated microorganisms by shotgun sequencing. These research findings will help in designing efficient biological methods for the treatment of textile industry wastewaters.

DATA AVAILABILITY STATEMENT
The raw reads generated via sequencing of the samples from the textile industry have been submitted to the National Centre for Biotechnology Information (NCBI) under the Sequence Read Archive (SRA) under BioProject PRJNA655112.

AUTHOR CONTRIBUTIONS
DK was responsible for sample collection, performing the experiments, data analysis, and manuscript writing; ZP contributed to sample collection, performing the experiments, and data analysis; PP was responsible for the scientific discussion and writing of the manuscript; RP was responsible for data analysis and writing and editing of the manuscript; AP contributed to the scientific discussion, editing, and critical review; MJ was responsible for the scientific discussion, editing, and critical review; CJ was responsible for the conceptualization, experiment design, supervision, scientific discussion, editing, and critical review.

FUNDING
This work was supported by the Department of Science and Technology (DST), Government of Gujarat, Gandhinagar, India.