236
Views
0
CrossRef citations to date
0
Altmetric
Research Paper

Transcriptional landscape of small non-coding RNAs reveals diversity of categories and functions in molluscs

ORCID Icon, , &
Pages 1-13 | Accepted 07 Dec 2022, Published online: 01 May 2024

ABSTRACT

Small non-coding RNAs (sncRNAs) are non-coding RNA molecules that play various roles in metazoans. Among the sncRNAs, microRNAs (miRNAs) guide post-translational gene regulation during cellular development, proliferation, apoptosis, and differentiation, while PIWI-interacting RNAs (piRNAs) suppress transposon activity to safeguard the genome from detrimental insertion mutagenesis. While an increasing number of piRNAs are being identified in the soma and germlines of various organisms, they are scarcely reported in molluscs. To unravel the small RNA (sRNA) expression patterns and genomic function in molluscs, we generated a comprehensive sRNA dataset by sRNA sequencing (sRNA-seq) of eight mollusc species. Abundant miRNAs were identified and characterized in all investigated molluscs, and ubiquitous piRNAs were discovered in both somatic and gonadal tissues in six of the investigated molluscs, which are more closely associated with transposon silencing. Tens of piRNA clusters were also identified based on the genomic mapping results, which varied among different tissues and species. Our dataset serves as important reference data for future genomic and genetic studies on sRNAs in these molluscs and related species, especially in elucidating the ancestral state of piRNAs in bilaterians.

Introduction

Small non-coding RNAs (sncRNAs) (<200 nucleotides (nt) long) are usually non-coding RNA molecules that play a crucial role in gene regulation at the transcriptional and post-transcriptional levels. They are mainly classified as either microRNAs (miRNAs), PIWI-interacting RNAs (piRNAs), or small interfering RNAs (siRNAs) [Citation1,Citation2]. Among these, only miRNAs have been extensively investigated in animals, while both piRNAs and siRNAs have not yet been sufficiently explored. The best-known function of miRNAs is regulation of gene expression, which affects development, proliferation, differentiation, apoptosis, oncogenesis, and tumorigenesis [Citation3]. In molluscs, miRNAs have been identified and characterized in Pinctada fucata [Citation4], Crassostrea gigas [Citation5], Chlamys farreri [Citation6], Littorina littorea [Citation7], Mytilus galloprovincialis [Citation8], and Hyriopsis cumingii [Citation9]. To date, only five miRNAs from Haliotis rufescens and 64 miRNAs from Lottia gigantea have been deposited in miRBase (Version 22.0) [Citation10]. A genome-wide profiling initiative has also been undertaken to discover miRNAs in molluscs [Citation11]. However, the research on the identification and characterization of miRNAs in molluscs is far from complete.

piRNAs are a widespread mechanism to effectively suppress transposable element (TE) activity to safeguard the genome from detrimental insertion mutagenesis [Citation12], a common issue in most animal germ lines. Interestingly, recent findings further imply that PIWI and piRNAs play vital roles in somatic cells [Citation13]. Numerous piRNAs have been identified and characterized from both the soma and germlines of Nematoda [Citation14], Porifera [Citation15], Platyhelminthes [Citation16], Annelida [Citation17], Cnidaria [Citation18,Citation19], Echinodermata [Citation17], Arthropods [Citation20], and Chordata [Citation21–23], whereas only a few piRNAs have been detected in molluscs [Citation24–27]. In fact, considering the somatic expression of piRNAs in many bilaterians, it is parsimonious to assume that somatic piRNA expression represents the ancestral state that was established in an early protostomian ancestor [Citation20,Citation24]. The development of next-generation sequencing (NGS) has made it possible to identify high-throughput miRNAs and piRNAs in non-model organisms by small RNA sequencing (sRNA-seq).

Mollusca is the second largest phylum within Metazoa and is widely distributed throughout limnetic, oceanic, and intertidal aquatic ecosystems. Molluscs are remarkably resilient to harsh environmental conditions, such as wide fluctuations in salinity, temperature, and hydraulic pressure, as well as microbe-rich environments and prolonged exposure to air [Citation11]. Recent advances in genomic studies of molluscs have provided insights into their biomineralization and immune and stress responses [Citation28–30]. Although thousands of genes and numerous genomes have been discovered in molluscs [Citation11], only a limited number of studies have addressed the diversity and function of sncRNAs. piRNAs were first identified in the somatic and gonadal tissues of the molluscs pacific oyster C. gigas and the great pond snail Lymnaea stagnalis [Citation24], and subsequently detected in the pearl oyster P. fucata [Citation25] and Biomphalaria glabrata snail [Citation26]. As limited somatic piRNAs have been detected in molluscs thus far, it is difficult to conclude whether the observed level of somatic piRNA expression represents the ancestral state.

In the present study, we constructed a total of 60 sRNA sequencing libraries from somatic and gonadal tissues of eight molluscs to identify novel miRNAs and putative piRNAs. Tens of known and novel miRNAs were identified and annotated in each species, enriching the research on miRNAs in molluscs. Furthermore, abundant putative piRNAs were also discovered in the somatic and gonadal tissues of the rock oysters Crossostrea nippona, scallop Mizuhopecten yessoensis, red abalone Haliotis rufescens, orient clam Meretrix lamarckii, and mussel Mytilus galloprovincialis. The discovery of putative piRNAs in various molluscs may contribute to the elucidation of the ancestral state of piRNAs in bilaterians. This study presents new data about small RNAs that are useful for future miRNA and piRNA studies in molluscs and related species. These data would also serve as an important reference for genomic and genetic studies in molluscs.

Materials and methods

Sample collection

Eight species of molluscs (C. nippona, M. yessoensis, H. rufescens, Meretrix lamarckii, M. galloprovincialis, Sinotaia quadrata, Anadara broughtonii, and Neptunea arthritica) were collected from Tokyo Bay and the nearby fishery market in October 2020 and cultured separately within an aquarium for one week. Ten cultured specimens of each species were dissected, and tissues were collected from their adductor muscles (bivalves) or feet (gastropods), gills, mantles, and gonads. The harvested tissues were stored in 2 ml tubes containing 15× volumes of NucleoProtect® RNA solution (TaKaRa, Shiga, Japan) for 12 h at 4°C and then preserved at −80°C until further use.

RNA extraction, cDNA library construction and illumina sequencing

Total RNA was extracted from each somatic and gonadal tissue using the ReliaPrepTM miRNA Cell and Tissue Miniprep System (Promega, WI, USA) according to the manufacturer’s instructions. The Qubit RNA Assay Kit in Qubit 3.0 Fluorometer (Life Technologies, CA, USA) and RNA ScreenTape Assay Kit in Agilent 2200 TapeStation (Agilent Technologies, Waldbronn, Germany) were used to assess the total RNA quantity and quality, respectively. Three isolated total RNA samples were mixed at equivalent concentrations to construct an sRNA-seq library for each tissue of the molluscs. Each tissue had two mixed libraries as replicates, except for the S. quadrata samples. sRNA-seq libraries were generated using the SMARTer smRNA-Seq Kit for Illumina (TaKaRa, CA, USA) following the manufacturer’s recommendations. The total RNA was first polyadenylated to provide a priming sequence for an oligo(dT) primer, and then the first strand of cDNA was synthesized by adding a second adapter sequence to the 3’ end. In the next step, full-length Illumina adapters were added during PCR amplification, including sequences required for clustering on an Illumina flow cell, Illumina TruSeq HT index sequences, and sequences bound by the Read Primer 1/2 sequencing primers. Library validation was performed using a High Sensitivity D1000 ScreenTape Kit in an Agilent 2200 TapeStation (Agilent Technologies). The cleanup products were purified, and size-selected using AMPure XP beads (Beckman Coulter, CA, USA) to obtain products <150 bp. The post-size-selection library was also validated using an Agilent 2200 TapeStation (Agilent Technologies), followed by sequencing on an Illumina HiSeq 2500 platform with a 50 bp single-end module (Macrogen, Tokyo).

Sequencing data pre-processing

To obtain high-quality sRNA reads, the raw data were filtered by removing: (1) reads containing a strong poly (A) tail, (2) reads containing poly (N), and (3) low-quality reads (average quality score ≤ 20). The 3 end adapter sequences were trimmed from sequencing reads using the fastx_toolkit (fastx_clipper – AAAAAAAA -c -l 15 -d 0 -Q 33) (http://hannonlab. cshl.edu/fastx_toolkit/index.html), followed by size filtration with a length range of 15–35 nt. The qualifying reads of these eight molluscs were then mapped to the reference genome sequences and Rfam database to remove the unmapped reads from the genome and rRNA, tRNA, snRNA, and snoRNA sequences using bowtie (v1.2.1, -f -k 3 -v2).

miRNA identification and annotation

The sRNA reads were pooled from different replicates and tissues for each species separately to perform miRNA prediction with miRBase 22.0 [Citation10] and miRDeep2 (v.2.0.0.8) [Citation31] using default parameters. The mapper.pl resource was used for preprocessing and genome mapping, miRDeep2.pl for de novo prediction, and quantifier.pl for expression profiling. Differentially expressed miRNAs were normalized to the total miRNA reads from each library between different tissues within the same species. The miRNA counts were normalized according to the reads per million (RPM) method.

miRNA target prediction

Based on the annotated genome results, 3’ UTR sequences were isolated from A. broughtonii and used for miRNA-target prediction. Target prediction for miRNAs was performed using miRanda [Citation32] and RNA22 [Citation33]. For the miRanda algorithm, an interacting score > 100 and a free energy <−10 kcal/mol were used to determine the sequence interactions between miRNAs and target genes. The RNA22 algorithm was used to determine the most favourable hybridization sites between miRNA and mRNA with a sensitivity of 63%, specificity of 61%, and seed size of 7, allowing a maximum of one unpaired base in the seed [Citation11]. Genes that were predicted to be targeted by the same miRNAs using two different algorithms were considered to be target genes of specific miRNAs. Subsequently, the predicted target genes were assessed with gene ontology (GO) and KEGG databases using Blastx with an E-value <10−5. Blast2GO [Citation34] was used to obtain GO annotation of the target genes based on Blastx hits. The network of miRNAs and their predicted target genes was drawn using NetworkAnalyst [Citation35].

piRNA identification and processing

After miRNA identification, the miRNA reads were removed from the clean reads using Seqkit (v0.7.3, common module) [Citation36]. Reads that did not match to any known ncRNAs were considered putative piRNAs and were subsequently identified as novel piRNAs. PPmeter [Citation24], unitas [Citation37], and proTRAC [Citation38] were used to quantify and compare the amount of ongoing ping-pong amplification, search ping-pong signatures, calculate a Z-score for the enrichment of 10bp overlaps according to Zhang et al. [Citation39], and predict genomic piRNA clusters using default parameters, respectively. Homologous piRNA clusters were detected using the NCBI BLASTn function. The relative expression patterns of piRNA clusters were calculated as mapped reads normalized for piRNA cluster length according to the transcripts per million reads (TPM) method [Citation40].

Transposon element annotation

For TE annotation, we performed a de novo prediction of repetitive elements in the reference genome of each species using RepeatMasker (v4.0.7) [Citation41] based on a de novo repeat library, constructed using RepeatModeler (v1.0.11) [Citation42]. The analysis was conducted with the entire repeat dataset including the repeats localized in the predicted piRNA clusters of the eight molluscs. The enrichment of transposon categories was compared between the genome and piRNA clusters. A t-test was used to compare the transposon percentages among whole genome sequences and piRNA clusters. As no reference genome available, the genomes of C. gigas [Citation43] were used as reference genome for C. nippona, Cyclina sinensis [Citation44] for M. lamarckii, and Pomacea canaliculata [Citation45] for S. quadrata, and N. arthritica, respectively.

Results

sRNA-seq in molluscs

A total of 262,882,185 raw reads were generated by Illumina sRNA-seq with a range of 21–46 million reads for each species (). After removing the low-quality and size-excluded reads, a total of 93,351,335 clean reads were obtained. In addition to the miRNA-sized (20–24 nt) reads detected, two different piRNA-sized reads (24–31 nt) were detected in the sRNA sequences of the molluscs. Two different size ranges of 25–27 nt and 29–31 nt were observed in the somatic and gonadal tissues of C. nippona, M. yessoensis, and M. lamarckii (; Figure S1A,B), whereas dromedary piRNA-sized reads of 25–27 nt in length were detected in M. galloprovincialis, H. rufescens, S. quadrata, A. broughtonii, and N. arthritica (, Figure S1C–F). The sequence composition of the clean reads displayed a first position nucleotide bias for uracil (U) (). sRNA identification and functional annotation results showed that non-coding RNAs, such as miRNA, snRNA, snoRNA, lncRNA, and coding nucleotides such as mRNA and rRNA were both detected by sRNA-seq (). However, a large proportion of the reads could not be annotated with the chosen reference of zebrafish (Danio rerio). In particular, numerous piRNA-sized reads could not be annotated since piRNA sequences are not conserved between species.

Figure 1. Characteristics of sRNAs in the investigated molluscs. (a) length distribution of clean reads in C. nippona; (b) length distribution of clean reads in M. galloprovincialis; (c) base composition at each site of the unique sRNA read from the 5’ terminal in C. nippona; (d) mapping statistics of clean reads among different RNA types in molluscs annotated by sRnatools using zebrafish (danio rerio) as the reference.

Figure 1. Characteristics of sRNAs in the investigated molluscs. (a) length distribution of clean reads in C. nippona; (b) length distribution of clean reads in M. galloprovincialis; (c) base composition at each site of the unique sRNA read from the 5’ terminal in C. nippona; (d) mapping statistics of clean reads among different RNA types in molluscs annotated by sRnatools using zebrafish (danio rerio) as the reference.

Table 1. Statistics of small RNA sequencing result of the eight investigated molluscs.

miRNA identification in molluscs

The clean reads were used to identify miRNAs in molluscs. A total of 72, 85, 69, 42, 73, 57, 42, and 43 miRNAs were identified from the eight mollusc species (Table S1). Among these miRNAs, ten were detected in all eight molluscs (): miR-92a, miR-317, miR-7024-3p, miR-4524a, miR-96-5p, miR-87, miR-2424, miR-365, miR-8935 and miR-146b-3p. Another ten miRNAs: miR-1990c-3p, miR-7171-5p, miR-1985, miR-22-3p, miR-2a, miR-9007, miR-876, miR-12336-5p, miR-193a, and miR-283 were also detected in five or more of the mollusc species. Based on the miRNA expression patterns, principal component analysis (PCA) clustered C. nippona, M. lamarckii, S. quadrata, A. broughtonii, and N. arthritica together, but segregated M. yessoensis, H. rufescens, and M. galloprovincialis (). The expression patterns of miRNAs varied among different tissues, and differed significantly between somatic and gonadal tissues in A. broughtonii, H. rufescens, and S. quadrata (; Figure S2), whereas the lengths and sequences of miRNAs were mostly conserved, especially in the seed region (2–8 bp). For instance, the mature sequence of miR-22-3p was conserved in all investigated mollusc species, with a very similar precursor sequence (). In addition, abundant isoforms were identified in numerous miRNAs in molluscs, as shown in .

Figure 2. miRNA identification in the investigated molluscs. (a) venn diagram of common and unique miRnas identified in the investigated molluscs; (b) Principal component analysis (PCA) of miRNA expression patterns in molluscs; (c) boxplot of miRNA expression density in molluscs. Adductor muscle collected from bivalves and foot muscle from gastropods; (d) alignment of precursors of miR-22-3p in molluscs; (e) alignment of precursors of miR-87 in molluscs; i1/i2/i3 indicates multiple miRNA isoforms; ‘pre’ represents precursor. The underline shows mature miRNA sequences.

Figure 2. miRNA identification in the investigated molluscs. (a) venn diagram of common and unique miRnas identified in the investigated molluscs; (b) Principal component analysis (PCA) of miRNA expression patterns in molluscs; (c) boxplot of miRNA expression density in molluscs. Adductor muscle collected from bivalves and foot muscle from gastropods; (d) alignment of precursors of miR-22-3p in molluscs; (e) alignment of precursors of miR-87 in molluscs; i1/i2/i3 indicates multiple miRNA isoforms; ‘pre’ represents precursor. The underline shows mature miRNA sequences.

miRNA target prediction in molluscs

To further understand the function of miRNAs in molluscs, we performed miRNA target prediction using miRanda and RNA22 for A. broughtonii. More than ten thousand target sites were predicted to be targeted by A. broughtonii miRNAs (). The genes predicted to interact with the same miRNAs using different algorithms were considered as strong candidates for miRNA targets and selected for further functional analysis. A total of 1,918 target sites were anticipated to be miRNA-mRNA interactions among 56 miRNAs and 1,223 genes in A. broughtonii. Most miRNAs were predicted to regulate multiple genes, with an average of 34 target genes. miR-317 had the greatest number of predicted target genes, followed by miR-2a, miR-8935, miR-3885-3p and miR-4187-5p (). miRNAs form a network with target genes associated with a complex process (). The enrichment analysis showed that 120 GO terms (Table S2) and 38 KEGG pathways () associated with metabolism, energy conversion, signal transduction, and multiple molecular and cellular processes were enriched in the predicted target genes of miRNA in A. broughtonii.

Figure 3. Target prediction of miRnas in A. broughtonii. (a) venn diagram of the number of predicted target sites between genes and miRNAs using miRanda and RNA22 algorithms. (b) the number of target genes for each miRNA. (c) miRnas-target genes regulatory network of miRnas in A. broughtonii. (d) kyoto encyclopaedia of genes and genomes (KEGG) pathways, terms, and statistics of the KEGG pathway enrichment analysis for the predicted target genes of miRnas in A. broughtonii.

Figure 3. Target prediction of miRnas in A. broughtonii. (a) venn diagram of the number of predicted target sites between genes and miRNAs using miRanda and RNA22 algorithms. (b) the number of target genes for each miRNA. (c) miRnas-target genes regulatory network of miRnas in A. broughtonii. (d) kyoto encyclopaedia of genes and genomes (KEGG) pathways, terms, and statistics of the KEGG pathway enrichment analysis for the predicted target genes of miRnas in A. broughtonii.

Identification and characteristics of piRNAs in mollusks

After removing the known ncRNAs, including the identified miRNA reads, the remaining reads were considered putative reads for piRNA processing. Abundant piRNA-sized reads with lengths of 24–32 nt were observed in the somatic and gonadal tissues of the molluscs (). Thousands of piRNAs were identified from the cluster loci in the genome (Supplementary sequences). In C. nippona, a total of 46 piRNA clusters (271.08 kb, 0.055% of the reference genome of C. gigas, assigned with 16.29% piRNAs) were identified using the reference genome of C. gigas, 7 piRNA clusters (49.08 kb, 0.005% of the genome) in M. yessoensis, 14 piRNA clusters (113.15 kb, 0.008% of the genome) in H. rufescens, 1 piRNA cluster (7.35kb, 0.001% of the reference genome of C. sinensis, assigned with 1.19% piRNAs) in M. lamarckii, 5 piRNA clusters (36.77kb, 0.003% of the genome) in M. galloprovincialis, and 6 piRNA clusters (44.41kb, 0.005% of the genome) in A. broughtonii (). However, no homologous sequence was detected among the piRNA clusters identified in molluscs. In addition, some putative piRNA sequences were not found in the identified piRNA clusters in molluscs.

Figure 4. Characteristics of piRNAs in the investigated molluscs. (a) length distribution of filtered known ncRNAs; (b) ping-pong pair reads of putative piRNAs; (c) ping-pong Z10 score for the enrichment of 10-bp overlaps; (d) ping-pong matrices illustrate frequent length-combinations of ping-pong pairs (sequences with a 10-bp 5’ overlap; E: putative piRNA length distribution and 1 U/10A bias for ping-pong reads.

Figure 4. Characteristics of piRNAs in the investigated molluscs. (a) length distribution of filtered known ncRNAs; (b) ping-pong pair reads of putative piRNAs; (c) ping-pong Z10 score for the enrichment of 10-bp overlaps; (d) ping-pong matrices illustrate frequent length-combinations of ping-pong pairs (sequences with a 10-bp 5’ overlap; E: putative piRNA length distribution and 1 U/10A bias for ping-pong reads.

Table 2. Identification of miRNAs and piRNAs in the investigated molluscs.

A significant 10 nt overlap between sense and antisense strands was detected in most of the molluscs except S. quadrata and N. arthritica, due to the lack of a reference genome for read mapping (), which is consistent with piRNAs generated by the ping-pong amplification mechanism. In gonadal tissues, most ping-pong pairs predominantly combined piRNAs of length 29/30 nt (27/28 nt in H. rufescens), suggesting homotypic ping-pong amplification (). In somatic tissues, ping-pong pairs combined piRNAs 29/30 nt and 25/26 nt in length, suggesting both heterotypic and homotypic ping-pong amplification. Furthermore, the ping-pong pairs were heavily biased towards uridine (U) at the 5 position and adenine (A) at position 10 ().

All putative piRNA reads from each species were re-mapped with the identified piRNA clusters to calculate the relative expression performance using the TPM method. In C. nippona (), M. lamarckii (), and M. yessoensis (), the piRNA clusters were highly expressed in the gonad tissues, whereas piRNA clusters were highly expressed in the somatic tissues of A. broughtonii (), H. rufescens (), and M. galloprovincialis (). In addition, different individuals also showed different expression patterns in these piRNA clusters, as seen in the gill and foot muscle of individual 2 which showed high expression in H. rufescens, and the gill tissue of individual 1 and mantle tissues of individuals 1 and 2 which showed high expression in M. galloprovincialis.

Figure 5. Heat map of expression levels of piRNA clusters in molluscs somatic and gonadal tissues. (a) C. nippona; (b) M. lamarckii; (c) M. yessoensis; (d) A. broughtonii; (e) H. rufescens; (f) M. galloprovincialis. ad: adductor muscle; Fo: foot muscle; gi: gill tissue; ma: mantle tissue; go: gonad.

Figure 5. Heat map of expression levels of piRNA clusters in molluscs somatic and gonadal tissues. (a) C. nippona; (b) M. lamarckii; (c) M. yessoensis; (d) A. broughtonii; (e) H. rufescens; (f) M. galloprovincialis. ad: adductor muscle; Fo: foot muscle; gi: gill tissue; ma: mantle tissue; go: gonad.

piRNA-mediated TE suppression

For TE annotation, the genomes of C. gigas and C. sinensis were used as reference genomes for C. nippona and M. lamarckii, respectively. The C. gigas genome contains 176.55 Mb repetitive sequences, accounting for 37% of the genome (); however, the identified piRNA clusters in C. nippona contained 22% transposon sequences (), which was lower than that of the reference genome of C. gigas. The piRNA clusters identified in M. lamarckii also contained a lower percentage of transposons compared to the reference genome of C. sinensis. However, the piRNA clusters identified in M. galloprovincialis, H. rufescens, M. galloprovincialis, and A. broughtonii had a higher percentage of transposons compared to reference genome sequences of the same species (); Figure S3). Annotation with transposon sequences revealed that they were rich in most piRNA clusters compared to the whole genome landscape, except in C. nippona and M. lamarckii, using the closely related reference genomes of C. gigas and C. sinensis (Fig. S4). Interestingly, the composition of transposons in piRNA clusters does not reflect the transposon landscape of the whole genome. Instead, piRNA clusters were rich in LTR/Pao, LTR/Gypsy, and DNA/Zator in C. nippona (), and LINE/Penelope, DNA/Harbinger, LTR/Gypsy, RC/Helitron, LINE/L1, and SINE/tRNA in M. yessoensis (), and a dominant unknown transposon detected in other molluscs (Fig. S3).

Figure 6. Transposon annotation in genome and piRNA clusters in the investigated molluscs. (a) representation of transposons in the genome of C. gigas, plotted by divergence [%] from transposon consensus. (b) representation of transposons within piRNA clusters of C. nippona plotted by divergence [%] from transposon consensus. (c) prominent transposons that are enriched or depleted in C. nippona piRNA clusters. (d) representation of transposons in the genome of M. yessoensis, plotted by divergence [%] from transposon consensus. (e) representation of transposons within piRNA clusters of M. yessoensis, plotted by divergence [%] from transposon consensus. (f) prominent transposons that are enriched or depleted in M. yessoensis piRNA clusters.

Figure 6. Transposon annotation in genome and piRNA clusters in the investigated molluscs. (a) representation of transposons in the genome of C. gigas, plotted by divergence [%] from transposon consensus. (b) representation of transposons within piRNA clusters of C. nippona plotted by divergence [%] from transposon consensus. (c) prominent transposons that are enriched or depleted in C. nippona piRNA clusters. (d) representation of transposons in the genome of M. yessoensis, plotted by divergence [%] from transposon consensus. (e) representation of transposons within piRNA clusters of M. yessoensis, plotted by divergence [%] from transposon consensus. (f) prominent transposons that are enriched or depleted in M. yessoensis piRNA clusters.

Discussion

Mollusca is the second largest phylum after Arthropoda, however the study of small RNAs is far from complete in molluscs. In the present study, a total of 60 sRNA-seq libraries were constructed for small RNA discovery in molluscs. Tens of known and novel miRNAs were identified from each species, and the characteristics and expression patterns of these miRNAs were investigated in different tissues and species. The target genes of miRNAs were also predicted using data from A. broughtonii, which further improved the understanding of miRNAs in molluscs. In addition, abundant putative piRNAs were detected in both somatic and gonadal tissue in molluscs. The dataset obtained from the present study can serve as important reference data for future genomic and genetic studies on small RNAs in molluscs.

A genome-wide computational prediction of homologous miRNAs was performed in 35 species of molluscs [Citation11], however it cannot reflect the situation of miRNA expression in vivo. In the present study, sRNA-seq was effectively used to detect ncRNAs in molluscs, enabling the discovery of known and novel miRNAs. With a limit of sequencing samples of tissue and development stage, it is impossible to discover every miRNA candidate, regardless of sequencing depth or which developmental stages are considered [Citation46]. An alternative method is to complement miRNA identification with genome scanning to obtain more comprehensive and reliable miRNA expression profiles and functional analysis. It is also worth discovering miRNAs in molluscs from an evolutionary perspective, as molluscs are an evolutionarily important, yet relatively under-examined group.

Among the identified miRNAs, ten miRNAs were detected in all investigated molluscs, and ten miRNAs were found in at least five mollusc species. In fact, miR-2 is an invertebrate-specific miRNA family that has many family members and plays a vital role in the immune response [Citation47]. miR-87 is a critical factor required in neurons to reactivate dendritic growth both in developmental remodelling and following injury in invertebrates [Citation48]. In contrast, miR-1990c-3p was specifically discovered in molluscs [Citation11], which were highly expressed in mantle tissues (Table S1). Matrix metalloproteinases (MMPs) were predicted to be targeted by miR-1990c-3p in P. fucata [Citation4], which indicated that they might play a role in biomineralization by regulating the formation of matrix proteins. These highly conserved miRNAs have a long evolutionary history, suggesting that they may play crucial roles in essential biological processes in many organisms.

The prediction of miRNA targets is of central importance for the functional understanding of miRNAs. However, functional studies of specific miRNAs are still rare in molluscs. To date, miR-15b [Citation49], miR-29a [Citation50], miR-183 [Citation51], and miR-2305 [Citation52] have been shown to participate in nacre formation in molluscs, whereas miR-124 is associated with neurogenesis in molluscs [Citation53,Citation54]. In the present study, we performed an algorithmic prediction of miRNA targets in A. broughtonii using multiple bioinformatics tools. Thousands of targets were identified based on the physical sequence interaction between miRNA and mRNA 3’ UTR [Citation55,Citation56]. Computational prediction of miRNA targets always yields a large proportion of false positives, especially in unverified miRNAs [Citation57]. Functional annotation and enrichment analyses of the predicted miRNA targets can provide a global understanding of miRNA function in a specific organism. A complex miRNA-mediated regulatory network was generated based on the predicted miRNA-mRNA interactions in A. broughtonii, but experimental work is required to confirm the authenticity of miRNA-targets. Hence, experimental verification is vital for target validation in this study.

In addition to the miRNAs detected in these molluscs, abundant piRNA-sized reads, 25–31 nt in length, were also discovered in the somatic and gonadal tissues of molluscs by sRNA-seq. To date, piRNAs have not yet been detected in Protozoa but have been in Nematoda [Citation14,Citation58], Porifera [Citation15,Citation17], and Chordata [Citation21–23,Citation59–61]. Intriguingly, ubiquitously expressed piRNAs were discovered in soma and germlines in most invertebrates, whereas very few piRNAs were discovered in the soma outside the germline in vertebrates. In molluscs, piRNAs were recently discovered to be ubiquitously expressed in both the somatic and gonadal tissues [Citation24–27]. In the present study, we also performed piRNA discovery in diverse Mollusca species, which will provide fundamental knowledge to elucidate the ancestral state of piRNAs in bilaterians from an evolutionary perspective.

The piRNAs newly identified from molluscs exhibit characteristics such as uridine at the 5’ terminal (1 U) and a length of 24–32 nt, which is consistent with piRNAs in most animals [Citation62]. Genomic mapping of piRNAs in Drosophila, zebrafish, mouse, and primates has shown that most piRNAs map within discrete genomic intervals, called piRNA clusters [Citation59,Citation63,Citation64]. In the primary biogenesis pathway, long piRNA precursors are transcribed from specific genomic loci piRNA clusters, cleaved, and modified by complex factors in the cytoplasm to form the piRNA-induced silencing complex (piRISC) [Citation65,Citation66]. Subsequently, the functional piRISC binds to specific sequences to participate in transcriptional gene silencing of TE in the nucleus or post-transcriptional gene silencing of the transcript undergoes a ping-pong amplification loop in the cytoplasm [Citation24,Citation67,Citation68]. A strong ping-pong amplification signal was detected through comprehensive computational analysis of the piRNA population of molluscs, which implies an important role of piRNA in post-transcriptional gene silencing in the cytoplasm.

In molluscs, tens of piRNA clusters with a size of 2.7 ~ 17 kb were identified based on genomic mapping data. The pronounced clustering suggests that piRNAs are processed from long primary transcripts, although the piRNA clusters constitute less than 0.1% of the genomes. In general, the piRNA clusters expressed were not substantially different among somatic tissues but varied significantly between somatic and gonadal tissues. The piRNAs and piRNA clusters also exhibited different expression patterns in the somatic and gonadal tissues of P. fucata [Citation25,Citation27] The tissular difference of piRNA expression might imply different functions of piRNAs in molluscs. The key function of piRNAs is to suppress transposon activity via transcriptional or post-transcriptional silencing mechanisms, thereby maintaining germline genomic integrity [Citation69,Citation70]. As TE annotation was limited in molluscs, we performed a de novo annotation of TE in the molluscs genomes however, a large number of identified transposons are not well classified. In addition, the reference genomes of molluscs have not been well annotated, regardless of the quality or quantity of the assembled genomes. To date, less than 50 mollusc genomes have been studied, which is a small proportion of total mollusc species [Citation11]. Of the eight species in this study, four species lacked their own reference genome, therefore closely related genomes were used as references. However, piRNAs show high diversity among species and as such closely related reference genomes can be used for miRNA detection, but are not appropriate for novel piRNA analysis. The piRNA clusters identified from H. rufescens, M. galloprovincialis, and A. broughtonii were significantly enriched in TE compared to the reference genomes, indicating the key role of piRNAs in TE silencing in molluscs. However, a decreased proportion of TE was detected in the piRNA clusters of C. nippona and M. lamarckii compared with a closely related reference genome. In the future, the reference genomes of C. nippona and M. lamarckii will enhance the functional analysis of piRNAs in these two species.

In addition to transposon silencing, the PIWI-associated piRISCs also plays an important role in genome rearrangement [Citation71], epigenetic regulation [Citation72], gene regulation and development [Citation73,Citation74], and virus defence [Citation75]. Metagenomic sequencing of small RNAs indicated the presence of endogenous RNA or DNA virus-derived piRNA expression in divergent animal phyla, including Cnidaria, Echinodermata, and Mollusca [Citation17]. More evidence on endogenous viral element-derived piRNAs supports the hypothesis that they mediate antiviral immunity, similarly to CRISPR RNAs in prokaryotes [Citation76,Citation77]. It has also been found that somatic piRNAs can regulate endogenous gene expression via miRNA-like pairing rules to regulate gene expression, especially in arthropods [Citation20,Citation74]. A previous study also revealed a piRNA-mediated maternal mRNA decay during the maternal-to-zygotic transition in the mosquito Aedes [Citation78]. Recent research also indicated that somatic piRNAs regulate endogenous gene expression in P. fucata somatic tissues [Citation27]. piRNA-mediated gene regulation will provide comprehensive insights into post-transcriptional gene silencing in animals. The piRNAs might have a great diversity of functions in molluscs, including but not limited to transposon silencing. Further studies are needed to fully understand piRNAs in mollusc somatic and gonadal tissues.

In summary, the miRNAs, piRNAs, and the piRNA clusters defined in our study provide important insights into the evolutionary origins and functional understanding of small non-coding RNAs in molluscs. These findings are useful for the identification of miRNAs and piRNAs in molluscs and other closely related species and contribute to the expanding knowledge base for miRNA and piRNA analysis in molluscs as well as for elucidating the ancestral state of somatic piRNAs in bilaterians.

Abbreviations

miRNA: microRNA; piRNA: PIWI-interacting RNA; piRISC: piRNA-induced silencing complex; siRNA: small interfering RNA; sncRNA: small non-coding RNA; sRNA-seq: small RNA sequencing; TE: transposon element.

Author contributions

S.H. designed and performed the experiments, analysed the data and drafted the manuscript. K.Y. and S.K. collected the samples and assisted with the data analysis. S.A. conceived the study, supervised the work, and co-wrote the manuscript. All authors reviewed the manuscript.

Supplemental material

Supplementary materials_R1.docx

Download MS Word (4.7 MB)

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The raw sequencing reads from smRNA-seq are available in the DNA Data Bank of Japan (DDBJ) Sequence Read Archive (DRA) under the accession number DRA012198.

Supplementary material

Supplemental data for this article can be accessed online at https://doi.org/10.1080/15476286.2024.2348893.

Additional information

Funding

This study was supported by grants from the National Key Research and Development Program of China [2018YFD0900601], the Japan Society for the Promotion of Science [JP24248034] and the Research Fellowship of the Japan Society for the Promotion of Science Postdoctoral Fellowship for Overseas Researchers [P20395].

References

  • Kim N. Small RNAs: classification, biogenesis, and function. Mol Cells. 2005;19(1):1–15. doi: 10.1016/S1016-8478(23)13130-X
  • Carninci P. Molecular biology: the long and short of RNAs. Nature. 2009;457(7232):974–975. doi: 10.1038/457974b
  • Kim V, Han J, Siomi M. Biogenesis of small RNAs in animals. Nat Rev Mol Cell Biol. 2009;10(2):126–139. doi: 10.1038/nrm2632
  • Huang SQ, Ichikawa Y, Yoshitake K, et al. Identification and characterization of microRnas and their predicted functions in biomineralization in the pearl oyster (pinctada fucata). Biology-Basel. 2019;8(2):47. doi: 10.3390/biology8020047
  • Xu F, Wang XT, Feng Y, et al. Identification of conserved and novel microRnas in the Pacific oyster Crassostrea gigas by deep sequencing. PLOS One. 2014;9(8):e104371. doi: 10.1371/journal.pone.0104371
  • Chen G, Zhang C, Jiang F, et al. Bioinformatics analysis of hemocyte miRNAs of scallop chlamys farreri against acute viral necrobiotic virus (AVNV). Fish Shellfish Immunol. 2014;37(1):75–86. doi: 10.1016/j.fsi.2014.01.002
  • Biggar K, Kornfeld S, Maistrovski Y, et al. MicroRNA regulation in extreme environments: differential expression of microRNAs in the intertidal snail Littorina littorea during extended periods of freezing and anoxia. Genom Proteom Bioinf. 2012;10(5):302–309. doi: 10.1016/j.gpb.2012.09.002
  • Yu D, Wu HF, Peng X, et al. Profiling of microRnas and mRNAs in marine mussel Mytilus galloprovincialis. Comp Biochem Phys C. 2020;230:108697. doi: 10.1016/j.cbpc.2019.108697
  • Wang YY, Duan SH, Wang GL, et al. Integrated mRNA and miRNA expression profile analysis of female and male gonads in hyriopsis cumingii. Sci Rep. 2021;11(1):665. doi: 10.1038/s41598-020-80264-7
  • Kozomara A, Birgaoanu M, Griffiths-Jones S. MiRBase: from microRNA sequences to function. Nucleic Acids Res. 2019;47(D1):D155–D162. doi: 10.1093/nar/gky1141
  • Huang SQ, Yoshitake K, Asaduzzaman M, et al. Discovery and functional understanding of miRNAs in molluscs: a genome-wide profiling approach. RNA Biol. 2021;18(11):1–14. doi: 10.1080/15476286.2020.1867798
  • Tóth KF, Pezic D, Stuwe E, et al. The piRNA pathway guards the germline genome against transposable elements. Adv Exp Med Biol. 2016;886:51–77.
  • Sato K, Siomi MC. The piRNA pathway in drosophila ovarian germ and somatic cells. Proc Jpn Acad Ser B. 2020;96(1):32–42. doi: 10.2183/pjab.96.003
  • Lee H, WF G, Shirayama M, et al. C. elegans piRnas mediate the genome-wide surveillance of germline transcripts. Cell. 2012;150(1):78–87. doi: 10.1016/j.cell.2012.06.016
  • Grimson A, Srivastava M, Fahey B, et al. Early origins and evolution of microRNAs and piwi-interacting RNAs in animals. Nature. 2008;455(7217):1193–1197. doi: 10.1038/nature07415
  • Lakshmanan V, Sujith TN, Bansal D, et al. Comprehensive annotation and characterization of planarian tRNA and tRNA-derived fragments (tRfs). RNA. 2021;27(4):477–495. doi: 10.1261/rna.077701.120
  • Waldron FM, Stone GN, Obbard DJ, et al. Metagenomic sequencing suggests a diversity of RNA interference-like responses to viruses across multicellular eukaryotes. PloS Genet. 2018;14(7):e1007533. doi: 10.1371/journal.pgen.1007533
  • Praher D, Zimmermann B, Genikhovich G, et al. Characterization of the piRNA pathway during development of the sea anemone nematostella vectensis. RNA Biol. 2017;14(12):1727–1741. doi: 10.1080/15476286.2017.1349048
  • Nong W, Cao JQ, Li Y, et al. Jellyfish genomes reveal distinct homeobox gene clusters and conservation of small RNA processing. Nat Commun. 2020;11(1):3051. doi: 10.1038/s41467-020-16801-9
  • Lewis S, Quarles K, Yang YJ, et al. Pan-arthropod analysis reveals somatic piRNAs as an ancestral defence against transposable elements. Nat Ecol Evol. 2018;2(1):174–181. doi: 10.1038/s41559-017-0403-4
  • Armisen J, Gilchrist MJ, Wilczynska A, et al. Abundant and dynamically expressed miRNAs, piRNAs, and other small RNAs in the vertebrate xenopus tropicalis. Genome Res. 2009;19(10):1766–1775. doi: 10.1101/gr.093054.109
  • Yan Z, Hu HY, Jiang X, et al. Widespread expression of piRNA-like molecules in somatic tissues. Nucleic Acids Res. 2011;39(15):6596–6607. doi: 10.1093/nar/gkr298
  • Roovers EF, Rosenkranz D, Mahdipour M, et al. Piwi proteins and piRnas in mammalian oocytes and early embryos. Cell Rep. 2015;10(12):2069–2082. doi: 10.1016/j.celrep.2015.02.062
  • Jehn J, Gebert D, Pipilescu F, et al. PIWI genes and piRnas are ubiquitously expressed in mollusks and show patterns of lineage-specific adaptation. Commun Bio. 2018;1(1):137. doi: 10.1038/s42003-018-0141-4
  • Huang SQ, Ichikawa Y, Igarashi Y, et al. Piwi-interacting RNA (piRNA) expression patterns in pearl oyster (pinctada fucata) somatic tissues. Sci Rep. 2019;9(1):247. doi: 10.1038/s41598-018-36726-0
  • Queiroz FR, Portilho LG, Jeremias WJ, et al. Deep sequencing of small RNAs reveals the repertoire of miRNAs and piRnas in Biomphalaria glabrata. Mem Inst Oswaldo Cruz. 2020;115:e190498. doi: 10.1590/0074-02760190498
  • Huang SQ, Ichikawa Y, Yoshitake K, et al. Conserved and widespread expression of piRNA-like molecules and PIWI-like genes reveal dual functions of transposon silencing and gene regulation in Pinctada fucata (Mollusca). Front Mar Sci. 2021;8:730556. doi: 10.3389/fmars.2021.730556
  • Li Y, Sun XQ, Hu XL, et al. Scallop genome reveals molecular adaptations to semi-sessile life and neurotoxins. Nat Commun. 2017;8(1):1721. doi: 10.1038/s41467-017-01927-0
  • Sun J, Zhang Y, Xu T, et al. Adaptation to deep-sea chemosynthetic environments as revealed by mussel genomes. Nat Ecol Evol. 2017;1(5):121. doi: 10.1038/s41559-017-0121
  • Peng J, Li Q, Xu L, et al. Chromosome-level analysis of the Crassostrea hongkongensis genome reveals extensive duplication of immune-related genes in bivalves. Mol Ecol Resour. 2020;20(4):980–994. doi: 10.1111/1755-0998.13157
  • Friedlander MR, Mackowiak SD, Li N, et al. miRdeep2 accurately identifies known and hundreds of novel microRNA genes in seven animal clades. Nucleic Acids Res. 2012;40(1):37–52. doi: 10.1093/nar/gkr688
  • Enright A, John B, Gaul U, et al. MicroRNA targets in Drosophila. Genome Biol. 2003;5(1):R1. doi: 10.1186/gb-2003-5-1-r1
  • Loher P, Rigoutsos I. Interactive exploration of RNA22 microRNA target predictions. Bioinformatics. 2012;28(24):3322–3323. doi: 10.1093/bioinformatics/bts615
  • Götz S, García-Gómez JM, Terol J, et al. High-throughput functional annotation and data mining with the Blast2GO suite. Nucleic Acids Res. 2008;36(10):3420–3435. doi: 10.1093/nar/gkn176
  • Zhou G, Soufan O, Ewald J, et al. NetworkAnalyst 3.0: a visual analytics platform for comprehensive gene expression profiling and meta-analysis. Nucleic Acid Res. 2019;47(W1):W234–41. doi: 10.1093/nar/gkz240
  • Shen W, Le S, Li Y, et al. SeqKit: a cross-platform and ultrafast toolkit for FASTA/Q file manipulation. PLOS One. 2016;11(10):e0163962. doi: 10.1371/journal.pone.0163962
  • Gebert D, Hewel C, Rosenkranz D. Unitas: the universal tool for annotation of small RNAs. BMC Genomics. 2017;18(1):644. doi: 10.1186/s12864-017-4031-9
  • Rosenkranz D, Zischler H. proTRAC – a software for probabilistic piRNA cluster detection, visualization and analysis. BMC Bioinf. 2012;13(1):5. doi: 10.1186/1471-2105-13-5
  • Zhang Z, Xu J, Koppetsch BS, et al. Heterotypic piRNA Ping-Pong requires qin, a protein with both E3 ligase and tudor domains. Mol Cell. 2011;44(4):572–584. doi: 10.1016/j.molcel.2011.10.011
  • Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinf. 2011;12(1):323. doi: 10.1186/1471-2105-12-323
  • Chen N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr Protoc Bioinform. 2004;5(1):1–14. doi: 10.1002/0471250953.bi0410s05
  • Flynn JM, Hubley R, Goubert C, et al. RepeatModeler2 for automated genomic discovery of transposable element families. Proc Natl Acad Sci USA. 2020;117(17):9451–9457. doi: 10.1073/pnas.1921046117
  • Zhang GF, Fang XD, Guo XM, et al. The oyster genome reveals stress adaptation and complexity of shell formation. Nature. 2012;490(7418):49–54. doi: 10.1038/nature11413
  • Wei M, Ge HX, Shao CW, et al. Chromosome-level clam genome helps elucidate the molecular basis of adaptation to a buried lifestyle. iScience. 2020;23(6):101148. doi: 10.1016/j.isci.2020.101148
  • Liu C, Zhang Y, Ren YW, et al. The genome of the golden apple snail Pomacea canaliculata provides insight into stress tolerance and invasive adaptation. Gigascience. 2018;7(9):giy101. doi: 10.1093/gigascience/giy101
  • Prochnik SE, Rokhsar DS, Aboobaker AA. Evidence for a microRNA expansion in the bilaterian ancestor. Dev Genes Evol. 2007;217(1):73–77. doi: 10.1007/s00427-006-0116-1
  • Marco A, Hooks K, Griffiths-Jones S. Evolution and function of the extended miR-2 microRNA family. RNA Biol. 2012;9(3):242–248. doi: 10.4161/rna.19160
  • Kitatani Y, Tezuka A, Hasegawa E, et al. Drosophila miR-87 promotes dendrite regeneration by targeting the transcriptional repressor Tramtrack69. PlOS Genet. 2020;16(8):e1008942. doi: 10.1371/journal.pgen.1008942
  • Chen H, Wang L, Zhou Z, et al. The comprehensive immunomo-dulation of NeurimmiRs in haemocytes of oyster Crassostrea gigas after acetylcholine and norepinephrine stimulation. BMC Genomics. 2015;16(1):942. doi: 10.1186/s12864-015-2150-8
  • Tian R, Zheng Z, Huang R, et al. miR-29a participated in nacre formation and immune response by targeting Y2R in Pinctada martensii. Int J Mol Sci. 2015;16(12):29436–29445. doi: 10.3390/ijms161226182
  • Zheng Z, Du X, Xiong X, et al. PmRunt regulated by Pm-miR-183 participates in nacre formation possibly through promoting the expression of collagen VI-like and Nacrein in pearl oyster pinctada martensii. PLoS One. 2017;12(6):e0178561. doi: 10.1371/journal.pone.0178561
  • Jiao Y, Zheng Z, Tian R, et al. Pm-miR-2305, participates in nacre formation by targeting pearlin in pearl oyster pinctada martensii. Int J Mol Sci. 2015;16(9):21442–21453. doi: 10.3390/ijms160921442
  • Rajasethupathy P, Fiumara F, Sheridan R, et al. Characterization of small RNAs in Aplysia reveals a role for miR-124 in constraining synaptic plasticity through CREB. Neuron. 2009;63(6):803–817. doi: 10.1016/j.neuron.2009.05.029
  • Walker S, Spencer GE, Necakpv A, et al. Identification and characterization of microRNAs during retinoic acid-induced regeneration of a molluscan central nervous system. Int J Mol Sci. 2018;19(9):2741. doi: 10.3390/ijms19092741
  • Fan X, Kurgan L. Comprehensive overview and assessment of computational prediction of microRNA targets in animals. Brief Bioinform. 2015;16(5):780–794. doi: 10.1093/bib/bbu044
  • Riffo-Campos AL, Riquelme I, Brebi-Mieville P. Tools for sequence-based miRNA target prediction: what to choose? Int J Mol Sci. 2016;17(12):1987–2005. doi: 10.3390/ijms17121987
  • Mockly S, Seitz H. Inconsistencies and limitations of current microRNA target identification methods. Methods Mol Biol. 2019;1970:291–314.
  • Quarato P, Singh M, Cornes E, et al. Germline inherited small RNAs facilitate the clearance of untranslated maternal mRNAs in C. elegans embryos. Nat Commun. 2021;12(1):1441. doi: 10.1038/s41467-021-21691-6
  • Houwing S, Kamminga LM, Berezikov E, et al. A role for Piwi and piRNAs in germ cell maintenance and transposon silencing in zebrafish. Cell. 2007;129(1):69–82. doi: 10.1016/j.cell.2007.03.026
  • Yang L, Ge Y, Cheng D, et al. Detection of piRnas in whitespotted bamboo shark liver. Gene. 2016;590(1):51–56. doi: 10.1016/j.gene.2016.06.008
  • Chirn GW, Rahman R, Sytnikova YA, et al. Conserved piRNA expression from a distinct set of piRNA cluster loci in eutherian mammals. PlOS Genet. 2015;11(11):e1005652. doi: 10.1371/journal.pgen.1005652
  • Hirano T, Iwasaki YW, Lin ZY, et al. Small RNA profiling and characterization of piRNA clusters in the adult testes of the common marmoset, a model primate. RNA. 2014;20(8):1223–1237. doi: 10.1261/rna.045310.114
  • Brennecke J, Aravin AA, Stark A, et al. Discrete small RNA-Generating loci as master regulators of transposon activity in Drosophila. Cell. 2007;128(6):1089–1103. doi: 10.1016/j.cell.2007.01.043
  • Girard A, Sachidanandam R, Hannon GJ, et al. A germline-specific class of small RNAs binds mammalian piwi proteins. Nature. 2006;442(7099):199–202. doi: 10.1038/nature04917
  • Izumi N, Tomari Y. Diversity of the piRNA pathway for nonself silencing: worm-specific piRNA biogenesis factors. Genes Dev. 2014;28(7):665–671. doi: 10.1101/gad.241323.114
  • Ross RJ, Weiner MM, Lin HF. PIWI proteins and PIWI-interacting RNAs in the soma. Nature. 2014;6505(7483):353–359. doi: 10.1038/nature12987
  • Vandewege MW, Platt RN, Ray DA, et al. Transposable element targeting by piRnas in Laurasiatherians with distinct transposable element histories. Genome Biol Evol. 2016;8(5):1327–1337. doi: 10.1093/gbe/evw078
  • Onishi R, Yamanaka S, Siomi MC. piRNA- and siRNA-mediated transcriptional repression in Drosophila, mice, and yeast: new insights and biodiversity. EMBO Rep. 2021;22(10):e53062. doi: 10.15252/embr.202153062
  • Iwasaki YW, Siomi MC, Siomi H. PIWI-Interacting RNA: its biogenesis and functions. Annu Rev Biochem. 2015;84(1):405–433. doi: 10.1146/annurev-biochem-060614-034258
  • Czech B, Munafò M, Ciabrelli F, et al. piRNA-guided genome defense: from biogenesis to silencing. Ann Rev Genet. 2018;52(1):131–157. doi: 10.1146/annurev-genet-120417-031441
  • Fang W, Wang X, Bracht JR, et al. Piwi-interacting RNAs protect DNA against loss during oxytricha genome rearrangement. Cell. 2012;151(6):1243–1255. doi: 10.1016/j.cell.2012.10.045
  • Rajasethupathy P, Antonov I, Sheridan R, et al. A role for neuronal piRNAs in the epigenetic control of memory-related synaptic plasticity. Cell. 2012;149(3):693–707. doi: 10.1016/j.cell.2012.02.057
  • Zhang D, Tu S, Stubna M, et al. The piRNA targeting rules and the resistance to piRNA silencing in endogenous genes. Science. 2018;359(6375):587–592. doi: 10.1126/science.aao2840
  • Cao Z, Rosenkranz D, Wu S, et al. Different classes of small RNAs are essential for head regeneration in the planarian Dugesia japonica. BMC Genomics. 2020;21(1):876. doi: 10.1186/s12864-020-07234-1
  • Leggewie M, Schnettler E. RNAi-mediated antiviral immunity in insects and their possible application. Curr Opin Virol. 2018;32:108–114. doi: 10.1016/j.coviro.2018.10.004
  • Ophinni Y, Palatini U, Hayashi Y, et al. piRNA-guided CRISPR-like immunity in eukaryotes. Trends Immunol. 2019;40(11):998–1010. doi: 10.1016/j.it.2019.09.003
  • Crava CM, Varghese FS, Pischedda E, et al. Population genomics in the arboviral vector Aedes aegypti reveals the genomic architecture and evolution of endogenous viral elements. Mol Ecol. 2021;30(7):1594–1611. doi: 10.1111/mec.15798
  • Halbach R, Miesen P, Joosten J, et al. A satellite repeat-derived piRNA controls embryonic development of aedes. Nature. 2020;580(7802):274–277. doi: 10.1038/s41586-020-2159-2