Formalin-Fixed Fish Larvae Could Be Effectively Identified by DNA Barcodes: A Case Study on Thousands of Specimens in South China Sea

Delimiting ichthyoplankton is fundamental work for monitoring the recruitment process and identifying the spawning and nursing grounds of fishes. Nevertheless, it is extremely difficult to identify the fish during the early stages at the species level based on morphological characters because of the paucity of diagnostic features. In this study, we investigated the fish larval community through large-scale ecosystemic sampling in South China Sea (SCS) during 2013 and 2017 using DNA barcodes. To maintain the morphologies of fish larvae, we preserved the larvae in formalin and developed a technique to recover their DNA. Among the 3,500 chosen larvae, we successfully extracted DNA from 2,787 larval samples and obtained 1,006 high-quality sequences. Blast searches showed that 408 larvae (i.e., 40.5%) could be unambiguously identified to species, 413 larvae (i.e., 41.1%) were ambiguously species delimitation, and 185 larvae (i.e., 18.4%) showed a low match similarity with target sequences. A total of 101 species were identified, among which 38 and 33 species corresponded to demersal and reef-associated species, whereas the remaining 30 species corresponded to benthopelagic, pelagic-oceanic, bathypelagic, and pelagic-neritic species. High-quality larval photographs of the 101 diagnosed species showed intact morphological characters and thus provided a reference for identifying fish species during the early stages based on morphological characters. Our study highlighted the possibility of recovering and amplifying DNA from formalin-fixed samples and provided new insight into the fish larval community in the SCS.


INTRODUCTION
Perennial and continuous anthropogenic impacts, for example, global warming, water extraction, species invasions, overfishing, and habitat degradation, have been documented to severely decrease fish diversity in both the sea and inland water bodies (Olden et al., 2007;Rahel et al., 2008). South China Sea (SCS), an important biodiversity hot spot for marine ecosystems, has been reported to harbor more than 3,300 fish species (Alien et al., 2000;Morton and Blackmore, 2001;Shao et al., 2008;Huang et al., 2015). However, the illegal fishing, mangrove damage resulting from irrational usage and deforestation, uncontrolled wastewater emissions, and degeneration of coral reefs constantly occurring in the SCS over the last several decades have put the sustainability of fish populations at serious risk (Macusi et al., 2011;Huang et al., 2015); therefore, protecting the fish resources in the SCS is an urgent task.
Larvae, the early phase of fish without initiative migration potential, offer an ideal opportunity to obtain important information on reproductive biology, preferred reproduction sites and times, and population recruitment success rates (Bialetzki et al., 2005;Cao et al., 2007;Reynalte-Tataje et al., 2012). This information is also comparatively important for ecological monitoring and developing management and conservation plans. However, empirical studies on larvae are currently limited by the difficulty in accurately delimiting larvae at the species level based on morphological characters, even for expert taxonomists. The main limiting factor is plastic and labile diagnostic characters at various development stages (Choat and Doherty, 1993;Cao et al., 2007). Accordingly, a reliable and efficient approach that can identify fish larvae to species is urgently needed.
DNA barcoding, an identification method using molecular makers, has created new opportunities for the delimitation of fish larvae, regardless of their developmental stage and plastic morphological characters (Hubert et al., 2010(Hubert et al., , 2015Valdez-Moreno et al., 2010;Ko et al., 2013;Frantine-Silva et al., 2015;Kimmerling et al., 2018). Many previous studies consistently demonstrated that DNA barcoding could be a reliable molecular method of identifying fish larvae (Hubert et al., 2010(Hubert et al., , 2015Valdez-Moreno et al., 2010;Ko et al., 2013;Frantine-Silva et al., 2015). For example, Valdez-Moreno et al. (2010) used DNA barcodes to connect adults and early life stages of marine fishes in the Yucatan Peninsula, Mexico. Hubert et al. (2015) successfully identified 373 of 505 coral reef fish larvae in open waters in the vicinity of Moorea by applying DNA barcodes. In addition, an increasing number of sequences of fish taxa deposited in an extensive database (BOLD: The Barcode of Life Data System) 1 will aid in delimiting both adult and larvae. By early 2020, more than 19,000 fish species including more than 260,000 specimens had been deposited in the BOLD database (Ratnasingham and Hebert, 2007).
Most earlier DNA-based studies that focused on fish larvae used high concentrations of alcohol to preserve samples (Hubert et al., 2010(Hubert et al., , 2015Valdez-Moreno et al., 2010;Ko et al., 2013;Frantine-Silva et al., 2015;Kimmerling et al., 2018). However, alcohol will damage the morphological features of fish larvae to some degree, which influences the morphological studies of larvae. As a result, to maintain the morphological characters of fish larvae, we stored larvae in aqueous formalin and attempted to extract sufficient DNA for sequence amplification. This technique can overcome the major challenge of recovering DNA sequences from formalin-fixed samples. Generally known, formalin fixation of specimens will damage DNA via fragmentation, base modification, and cross-linkage within the DNA itself or between DNA and proteins (Quach et al., 2004;Wong et al., 2014). Randomly and fragmented formalin-fixed DNA poses a difficulty in Sanger sequencing.
To date, several DNA barcode studies have been conducted on the adult fish in the SCS and its adjacent areas (Wang et al., 2009(Wang et al., , 2012Zhang and Hanner, 2012;Chang et al., 2017;Hou et al., 2018), providing a large number of reference sequences for the fish fauna in this sea. Here, we sampled fish larvae from 230 sites in the SCS from 2013 to 2017 (Figure 1), preserved them in neutral formalin solution, and sequenced standardized molecular tags located in the mitochondrial cytochrome c oxidase I (COI) gene for larval delimitation. The aims of this study were (i) to assess the feasibility of recovering and amplifying DNA fragments from formalin-fixed samples, (ii) to determine how many samples can be successfully identified through DNA barcodes, and (iii) to evaluate the value of information on the reproduction and recruitment of fish populations and the distribution patterns of ichthyoplankton communities.

Sampling
Extensive ichtyoplankton sampling was conducted during a 5year fishery survey, which covered six regions in the SCS (i.e., Xuwen National Coral Reef Nature Reserve, the Gulf of Tokin, the Xisha Islands, the northern SCS, the southwestern shelf area, and the Sunda Shelf area; Figure 1) during 2013 and 2017. We collected fish larvae from a total of 230 sites with a seabed water depth ranging from 3 to 1,638 m. Two types of nets, i.e., set and zooplankton nets, were used for larval sampling. The set net was used to sample specimens on the edge of the Xuwen National Coral Reef Nature Reserve established for coral reef conservation. With respect to the remaining sites, larvae were collected using an 80-cm-diameter zooplankton net (length = 2.7 m, mesh size = 0.505 mm, and bore diameter = 400 µm) deployed in vertical and horizontal trawls. For vertical trawls, nets were lowered down to approximately the sea bottom if the seabed water depth was less than 200 m and dragged with a hauling speed of 1.5 m/s. Each site was sampled twice or thrice per survey voyage. For horizontal trawls, nets were dragged for 15 min at a speed of 1.5-2.2 knots/h. Given that the sampling boats were 35-45 m long, turbulent water created by the propeller could lead to serious disturbances of abundance and diversity, even if a sampling rope more than 30 m long were tired on the stern. Thus, we affixed a steel hanger of 5.5 m long and 9 cm in diameter to both sides of the midship, which began from the four outsidewelded rings. Among the four rings, two rings were used to hang FIGURE 1 | Sampling locations for fish larvae from five areas. Each station yielded one or more samples, and the stations that failed to obtain larval COI sequences were not shown.
the net on the ship, and the remaining two rings were used to suspend the zooplankton net. An auxiliary line connected to the front plankton net could be used to recycle nets and complete operations multiple times (Supplementary Figure 1).
At the nature reserve of Xuwen Coral Reef national, fish larvae were sampled monthly between 5 PM and 6 AM from March 2013 to March 2015. We collected four, four, and two samples for the Gulf of Tokin, the northern SCS, and the Sunda Shelf area, respectively. In the mouth of Gulf of Tokin to northwestern Xisha Islands and Southwest shelf area, only one sampling was conducted. Fish larvae were mainly sampled between 7 PM and 6 AM from November 2013 to September 2017. A total of 205,000 fish larvae were obtained during our 5-year survey.

Preservation, Photography, DNA Extraction, and Polymerase Chain Reaction Amplification
For the purpose of obtaining undamaged morphological characteristics of fish larvae for photography and sequencing, the ichthyoplankton samples were instantly preserved in 4% neutral formalin solution. Given that formalin fixation can lead to DNA corrosion and damage and thus hamper amplification of the target COI gene, especially in fragile larvae, we developed a formalin-based preservation and fixation technique for ocean fish larvae. We first added nearly 0.175 g of NaOH powder to neutralize 500 mL of seawater and then used the neutralized seawater to prepare a 4% neutral formalin solution. The 4% formalin solution was preserved at 4 • C for 24 h before being applied to preserve fish larvae. Samples preserved in the neutral formalin solution still required cold preservation (4 • C) for less than 10 days.
In the laboratory, larvae with different morphological features were manually selected to produce silhouette images. A total of 3,500 fish larvae were chosen during our 5-year sampling. The selected larvae were initially photographed using a Zeiss microscope (Axioplan 2 imaging E) and then used for DNA extraction (Figure 2). After the photography was completed, each sample was preserved in GTE solution (100 mmol/L glycine, 10 mmol/L Tris, and 1 mmol/L EDTA) to eliminate the formalin utilizing a 2-mL centrifuge tube and given a serial number. The preservation fluid was replaced once per day for 7 days. The samples were moved into a 1.5-mL centrifuge tube, air dried for 3-5 h, and then boiled by adding 1 mL of GTE solution for 20 min. Samples were dried using absorbent paper; placed in a bioclean centrifuge tube; mixed with 30 µL sodium dodecyl sulfate, 10 µL DTT, 20 µL protease K, and 200 µL STE; and heated in a 55 • C water bath for 4-5 h. When the solution was clear, the samples were centrifuged at 1,500 revolutions/min (rpm) for 30 s, mixed fully with 300 µL of NaCl solution (6 M) for 20 min, and centrifuged for 15 min at 13,000 rpm. Then, 400 µL of supernatant liquor was extracted into a new bioclean centrifuge tube. We then added 400 µL of isopropanol, mixed the solution slightly for 10 min, stored the mixture at −20 • C for 2-3 h, and centrifuged the mixture for 15 min at 13,000 rpm. After that, we removed the supernatant liquor, added 800 µL of 70% ethanol, centrifuged the solution for 10 min at 13,000 rpm, and removed the supernatant liquor. Finally, the samples were air dried for 60 min and resolved in 30 µL of TE.
C Oxidase I sequences (∼648 bp) were amplified and sequenced using the universal primers FishF1 and FishR1 (Ward et al., 2005). The polymerase chain reaction (PCR) contained approximately 100 ng of template DNA, 1 µL of each primer (10 pmol), 4 µL of 10 × reaction buffer, 1.2 µL of dNTPs (10 mM each), and 5.0 U of Taq DNA polymerase in a total volume of 40 µL. The PCR conditions and processes include initial denaturation at 95 • C for 2 min, 35 cycles at 94 • C for 30 s, 54 • C for 30 s, and 72 • C for 1 min, and a final extension at 72 • C for 10 min. The PCR products were sequenced bidirectionally on an ABI 3730XL DNA system following the manufacturer's protocol (Perkin-Elmer Applied Biosystems, Foster City, CA, United States).

Data Analyses
We initially checked the tracer files and assembled sequences using SEQMAN in Lasergene version 7.0 (DNASTAR Inc., Madison, WI, United States). High-quality sequences were aligned and manually edited using MEGA v6.0 (Tamura et al., 2013). Collection data, sequences, tracer files, and primer details were deposited in the BOLD database.
Larval identification was performed by blast searches in BOLD. We retained reference sequences of both the first and second best interspecific matches in BOLD and documented FIGURE 2 | Photographs of the specimens that produced good-quality sequences from formalin-fixed samples and that were identified to species taxa in the SCS. Frontiers in Marine Science | www.frontiersin.org the percentages of sequence matches. Three criteria similar to those used in Hubert et al. (2015) were adopted in our study: (i) sequences with greater than 99% similarity and exhibiting greater than 1% divergence between sequences and the nearest neighbor species were unambiguously tagged with that species name (case I); (ii) if the genetic divergence was less than the 1% threshold, we considered identification to species uncertain (case II); and (iii) sequences with matches less than 99% were deemed unidentified samples (case III). Canonical 98% similarity and 2% genetic divergence criteria were utilized. Genetic distance was calculated in MEGA6 based on the Kimura 2-parameter model (K2P, Kimura, 1980). A neighbor-joining (NJ) tree based on the K2P model was reconstructed to illustrate lineage diversity via phylogenetic topology. The NJ tree was constructed with 1,000 bootstrap replicates using MEGA v6.0. To compare the results of NJ tree, maximum-likelihood (ML) tree was implemented in RAXML-VI-HPC (Stamatakis, 2006) with the GTR + I + G model and 1,000 non-parametric bootstrap replicates. The optimal nucleotide substitution model (GTR + I + G) was selected based on the Akaike information criterion in MRMODELTEST version 2.3 (Nylander, 2004). Furthermore, to group specimens into operational taxonomic units (OTUs), Automatic Barcode Gap Discovery (ABGD) (Puillandre et al., 2012) was performed online 2 using the default value for relative gap width (X = 1.5) and K2P distance metrics.

RESULTS
During our 5-year sampling period, we collected more than 205,000 larvae, among which 3,500 larvae were chosen for the molecular experiment on the basis of their morphological characters. A total of 2,787 larvae (i.e., 80%) were successfully amplified, and only 1,006 samples (36%) showed high-quality sequences with no evidence of insertions, deletions, or stop codons. The larvae collected from the southwestern shelf area failed to produce high-quality DNA or sequences as the preservation time in the aqueous formalin was too long (>30 days). The 1,006 sequences obtained had a length of 604 nucleotides after alignment and trimming of noisy sites, which were subjected to subsequent analyses. Sample information and sequences for the 1,006 larvae were deposited in BOLD (see text footnote 1, Project Codes LFSCS).
Using rigorous 99% similarity and 1% genetic divergence among species thresholds, we found that approximately 81.6% of the sequences (821/1,006) had a match of 99-100%. The match results for nearest neighbors ranged from 81.24 to 100%. Among the 821 sequences, only 408 sequences fit the criteria of case I (i.e., ∼40.5%). A total of 413 sequences showed percentages of similarity with both the best match and the nearest neighbor ranging from 99 to 100% and genetic divergence from the nearest neighbor of less than 1%, indicating that 41.1% of the samples were not ambiguously assigned species names (case II). Less than 99% similarity between the analyzed sequences and the best-matching sequences was shown for the remaining 185 sequences, suggesting that 18.4% of the samples could not be identified to species (case III; Supplementary Figure 2 and Supplementary Table 1). In contrast, using a canonical 98% similarity threshold and 2% divergence among species (Hebert et al., 2003;April et al., 2011), 287 (i.e., ∼28.5%), 567 (i.e., ∼56.4%), and 152 (i.e., ∼15.1%) sequences were assigned to cases I, II, and III, respectively, with the largest proportion of sequences classified as case II.
Among the 101 identified species, the Xuwen coral reef area harbored 23 species, the Gulf of Tonkin harbored 53 species, and the northern SCS harbored 55 species. Because of DNA extraction failure or low-quality sequences, only one species (two individuals from the Xisha Islands and one individual from the Sunda Shelf area) each was obtained from the Xisha Islands and Sunda Shelf areas. Venn diagram analysis showed that the Xuwen coral reef area, Gulf of Tonkin, and northern SCS shared only three species, i.e., Omobranchus punctatus (Valenciennes, 1836), Sillago asiatica (Mckay, 1982), and Terapon jarbua (Forsskål, 1775). In addition, the Xuwen coral reef area, Gulf of Tonkin, and northern SCS harbored 9, 25, and 38 exclusive species, respectively (Figure 3). Using habitat data for adult fish species downloaded from FishBase 3 , 38, 33, 12, 9, 5, and 4 species were demersal, reef associated, benthopelagic, pelagic-oceanic, bathypelagic, and pelagic-neritic, respectively (Figure 4).
According to morphological features, we assigned 1,006 larvae to 213 morphological forms. The NJ tree based on the K2P distance model and ML tree consistently yielded 241 lineages with high support values (Supplementary Figures 3, 4). The ABGD analyses conducted with the K2P models consistently produced initial partitions with 241 OTUs (P = 0.0215-0.0599).

DNA Extraction and Sequence Amplification From Formalin-Fixed Larvae
To obtain high-quality morphological characteristics of fish larvae and to conduct further molecular analyses, we preserved the larvae in aqueous formalin rather than highly concentrated alcohol. A total of 3,500 larval photographs with corresponding serial numbers were obtained. Although formalin has many negative effects on DNA recovery and extraction from specimens (Quach et al., 2004;Wong et al., 2014), our study successfully extracted and amplified DNA from 73.3% of formalin-fixed larval samples and obtained a moderate number of highquality sequences, highlighting the opportunities to obtain genomic data from formalin-fixed specimens. As a result, our modified approaches to preserve fresh specimens and to extract and amplify DNA from formalin-fixed samples can be effectively utilized in molecular analyses that require intact morphological data.
To date, many reports have attempted to extract and amplify DNA from formalin-fixed samples and have made much progress (Wirgin et al., 1997;Xia et al., 2007;Zhang, 2010;Hykin et al., 2015;Chambers and Hebert, 2016). However, the success rates in most previous studies were lower than the rate yielded in our study. For example, Chambers and Hebert (2016) yielded only five high-quality sequences from 208 formalin-fixed samples. The relatively high success rate of DNA extraction and sequence amplification observed in our study might be attributed to the short preservation time (<10 days). Many earlier studies demonstrated that the quantities of extracted DNA and the success rates of sequencing decreased with the duration of formalin fixation (Legrand et al., 2002;Diaz-Viloria et al., 2005;Palero et al., 2010;Zhang, 2010). Failure to obtain high-quality DNA and sequences from the Xisha Islands, southwestern shelf area, and Sunda Shelf area in our study (formalin fixation > 10 days) provided a direct line of evidence. Furthermore, our study yielded high-quality sequences that exceeded 600 bp in length, whereas most past studies obtained fragmented sequences that were less than 500 bp in length (Palero et al., 2010;Zhang, 2010;Paireder et al., 2013). Previous documents showed that samples preserved in formalin for longer durations tended to yield short sequences (Palero et al., 2010;Zhang, 2010;Paireder et al., 2013). Given that such short DNA sequences provide limited information for genetic studies, researchers should reduce the duration of preservation in formalin as much as possible to ensure adequate sequence length in future genetic analyses.

DNA Barcodes of the Larvae in the SCS
Using a 99% similarity threshold and a 1% threshold for species boundaries, we successfully assigned 40.5% of sequences to species and delineated 101 unambiguous species. Furthermore, we provided a high-quality photographic library of fish larvae assigned to species. Because the fish larvae were fixed in formalin, their morphological features were better maintained than those of fish larvae preserved in ethanol. In past investigations with larval and/or planktonic delimitation based on molecular techniques, the samples were preserved in ethanol, which damaged their morphological characters (Frantine-Silva et al., 2015;Hubert et al., 2015;Ardura et al., 2016;Collet et al., 2018;Liu et al., 2018). Hence, our study is one of the first reports in which DNA-based species identification was applied to a large number of fish larvae fixed in formalin. Therefore, it provided a unique opportunity to examine their morphology. This ability to examine their morphology is particularly important, as the larval morphology of most SCS fishes remains unknown.
Our analyses revealed O. punctatus (Valenciennes, 1836), S. asiatica (McKay, 1982), and T. jarbua (Forsskål, 1775) in three sea areas. This is an expected phenomenon as the three species are widespread species that live in coastal oceans. Furthermore, many endemic species were found in the three sea regions (Figure 3). The main reason for this finding is the different ichthyofaunas of the three regions. Previous reports argued that the fish of the Xuwen National Coral Reef Nature Reserve, the Gulf of Tonkin, and the northern SCS belonged to different ichthyofaunas (Qiu, 1988;Qiu, 1990;Yang et al., 2008;Wang et al., 2011a,b). Reef fishes, demersal fishes, and pelagic-neritic fishes represented the vast majority of species, which largely agreed with the results of previous fishery surveys in the SCS (Qiu, 1988(Qiu, , 1990(Qiu, , 1996Ma et al., 2006).
The rate of successful identification in our study was similar to that in studies on fish larvae in the eastern Atlantic Ocean and Taiwanese waters (Ko et al., 2013;Ardura et al., 2016) but much lower than the rates obtained in many previous DNA barcode studies on fish larvae (Frantine-Silva et al., 2015;Hubert et al., 2015;Steinke et al., 2016;Liu et al., 2018), which may be mainly explained by the incompleteness of the DNA barcode reference libraries used for the SCS fishes. Despite the more than 3,300 fish species documented in the SCS, barcode sequences for fewer than 1,000 species have been published (Wang et al., 2009(Wang et al., , 2012Zhang and Hanner, 2012;Chang et al., 2017;Hou et al., 2018). Considering that our analyses uncovered 213 forms based on morphological features, 241 lineages through the NJ tree and 241 OTUs from ABGD, indicating that many specimens could not be assigned to species, we missed a large amount of the diversity in the larval fish community. As a result, further extensive studies on this unidentified diversity should be conducted in a timely manner.
A total of 41.1% of the sequences were more than 99% similar to sequences attributed to two different species (i.e., case II), suggesting that misidentification of these samples occurred. These cases may have resulted for the following reasons: hybridization, recent divergence, inadequate taxonomy, and erroneous identification (Meyer and Paulay, 2005;Ivanova et al., 2007). Hybridization between species is a major problem for DNA barcode identification because a hybrid will be identified as its maternal species. When closely related species only recently diverged, they can exhibit close genetic relationships or even share identical haplotypes in two isolated gene pools. Some examples of incomplete lineage sorting due to recent divergence have been recently described in the SCS and its adjacent areas (Chang et al., 2017;Hou et al., 2018), such as two species of Uranoscopus (U. oligolepis Bleeker, 1878 and U. kaianus Günther, 1880) observed by Hou et al. (2018) and multiple species of Thunnus found by Chang et al. (2017) and Hou et al. (2018). Inadequate taxonomy can lead to two different species sharing barcodes if the two nominal species derive from the same gene pool, in which case they should be treated as the same species. Erroneous identifications within the reference library could produce an apparent case of haplotype sharing. A large number of closely related species living in the SCS are difficult to delimit (Zhang and Hanner, 2012;Hou et al., 2018), and thus, specimens of the same species may be incorrectly given two different names by different identifiers. In addition, many sequences were identified based on specimens collected in different oceans and likely resulted from misidentification due to different retrieval systems and multiple independent identifications. Given this information, we argue that more geographically distinct populations of closely related species and species-specific markers should be included in studies to delimit these species with due consideration of spatial variation. In addition, explicit morphological characters among these species should be measured, and synonyms need to be revised through the efforts of researchers from different countries.
Approximately 18.4% of the sequences (case III) obtained poor matches (<99% similarity). Moreover, the match similarity of some sequences was even less than 90%. These results reflected the taxonomic incompleteness of the DNA barcode reference libraries in BOLD for the SCS fishes. Accordingly, these cases highlight that DNA barcodes for a large number of fish taxa in the SCS are still lacking, a gap that deserves more attention in future research. Establishing DNA barcode reference libraries for SCS fishes is an enormous task that will require efforts from many researchers from different countries over a long period of time. Additionally, cryptic diversity has a non-negligible effect on the failure to assign sequences to species. If cryptic speciation occurs, sequences of the same species will show deep genetic divergence and thus will not be matched. Cryptic diversity seems to be a common phenomenon within SCS fishes (Zhang and Hanner, 2012;Chang et al., 2017;Hou et al., 2018), e.g., Apogon ellioti (Day, 1875), Naucrates ductor (Linnaeus, 1758), and Psenopsis anomala (Temminck and Schlegel, 1758) (Hou et al., 2018).
Here, we successfully obtained high-quality DNA and corresponding barcode sequences from fish larvae fixed in formalin and linked the morphologies of fish species during the early stage with their molecular information, which facilitated the diagnosis of fish larvae based on morphological features. The approach used in this study may be extended to genetic studies that require both morphological and genomic data. Among the 1,006 analyzed sequences, 40.5% were unambiguously assigned to 101 species. The delimitation of fish larvae at the species level was ambiguous for approximately 41% of the sequences as misidentification occurred in the DNA barcode reference libraries. Furthermore, approximately 18.4% of the sequences were poorly matched with DNA barcode reference libraries because of the incompleteness of the libraries. To overcome this limitation, substantial efforts will be required to improve the completeness of DNA barcode reference libraries for SCS fishes. Although exact species names could not be obtained for many larvae, the identification of 101 species through the DNA barcode-based method provided new insight into the fish larval communities in the SCS.

DATA AVAILABILITY STATEMENT
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: GenBank MW379416 -MW379823.

ETHICS STATEMENT
The animal study was reviewed and approved by the Guangdong Ocean University.

AUTHOR CONTRIBUTIONS
GH and HZ analyzed the data and completed the first draft. GH, YC, SW, and JW performed the field survey. GH and WC conducted the experiments. WC and HZ provided the guidance on data analysis and structure. All authors contributed to the article and approved the submitted version.

ACKNOWLEDGMENTS
We would like to thank the editor and reviewers for their constructive comments on our manuscript. We thank numerous colleagues for their generous support of this study, the 14 crew members of Beiyu60011 and all the survey ships for their help.