- Research article
- Open Access
A potential role for intragenic miRNAs on their hosts' interactome
© Hinske et al; licensee BioMed Central Ltd. 2010
- Received: 19 April 2010
- Accepted: 1 October 2010
- Published: 1 October 2010
miRNAs are small, non-coding RNA molecules that mainly act as negative regulators of target gene messages. Due to their regulatory functions, they have lately been implicated in several diseases, including malignancies. Roughly half of known miRNA genes are located within previously annotated protein-coding regions ("intragenic miRNAs"). Although a role of intragenic miRNAs as negative feedback regulators has been speculated, to the best of our knowledge there have been no conclusive large-scale studies investigating the relationship between intragenic miRNAs and host genes and their pathways.
miRNA-containing host genes were three times longer, contained more introns and had longer 5' introns compared to a randomly sampled gene cohort. These results are consistent with the observation that more than 60% of intronic miRNAs are found within the first five 5' introns. Host gene 3'-untranslated regions (3'-UTRs) were 40% longer and contained significantly more adenylate/uridylate-rich elements (AREs) compared to a randomly sampled gene cohort. Coincidentally, recent literature suggests that several components of the miRNA biogenesis pathway are required for the rapid decay of mRNAs containing AREs. A high-confidence set of predicted mRNA targets of intragenic miRNAs also shared many of these features with the host genes. Approximately 20% of intragenic miRNAs were predicted to target their host mRNA transcript. Further, KEGG pathway analysis demonstrated that 22 of the 74 pathways in which host genes were associated showed significant overrepresentation of proteins encoded by the mRNA targets of associated intragenic miRNAs.
Our findings suggest that both host genes and intragenic miRNA targets may potentially be subject to multiple layers of regulation. Tight regulatory control of these genes is likely critical for cellular homeostasis and absence of disease. To this end, we examined the potential for negative feedback loops between intragenic miRNAs, host genes, and miRNA target genes. We describe, how higher-order miRNA feedback on hosts' interactomes may at least in part explain correlation patterns observed between expression of host genes and intragenic miRNA targets in healthy and tumor tissue.
- miRNA Target
- Host Gene
- miRNA Gene
- Target Prediction
- Target Coverage
microRNAs (miRNAs) are small (~22-nt) functional RNA species that provide a newly appreciated layer of gene regulation with an important role in development, cellular homeostasis and pathophysiology. miRNAs are encoded in the genome and transcribed primarily in a Pol II-dependent manner , although Pol III-dependent transcription has also been reported [2, 3]. Roughly half of the known human microRNAs are found in intergenic regions of the genome, suggesting production of unique primary transcripts (pri-miRNAs) containing one or more miRNA hairpins under the control of independent promoter elements. The overwhelming majority of the other ~50% map to previously annotated intronic regions of protein coding genes, while a small number are even found within exons. The relationship between intragenic miRNAs and their host genes presents many unique questions regarding genomic organization, transcriptional regulation, processing and function.
The genomic organization of intragenic miRNAs exhibits a strong directional bias, such that these species are predominantly oriented on the same strand of the DNA as that of the host gene. The directional bias may prevent steric interference between RNA polymerases transcribing the host gene and the miRNA gene(s) ; however, the existence of individual antisense miRNA genes and miRNA gene clusters argues that the primary evolutionary pressure for the positional bias is co-regulation of the intronic miRNA and the host gene. Indeed, microarray analyses supports the hypothesis that intronic miRNAs are usually expressed in coordination with the host gene mRNA in human tissues [4, 5], strongly suggesting that co-transcription from the host gene promoter is the most common transcriptional mechanism under normal conditions. This assumption has lately successfully been employed to identify new miRNA targets . However, recent findings demonstrate that transcription of a subset of intronic miRNAs in H. sapiens can be initiated from internal promoters within operons independently from the host gene , suggesting that utilization of internal promoters must also be considered a viable alternative strategy for intronic miRNA gene transcription.
Large portions of miRNA processing are understood (for review see ). In brief, a ~70 nucleotide stem-loop precursor pre-miRNA is excised from a relatively long primary miRNA transcript, followed by export from the nucleus via Exportin-5 in a Ran-GTP-dependent manner. In the cytoplasm, pre-miRNAs are further processed into a ~22-nt miRNA/miRNA* duplex. In the case of intronic miRNAs, early steps in the miRNA biogenesis pathway are complicated by the requirement for proper pre-mRNA splicing and mature mRNA assembly of the host message. Recent bioinformatics and experimental work demonstrates that intronic miRNAs can be processed from intronic regions co-transcriptionally  prior to the splicing reaction . Interestingly, recent work suggests that several intragenic miRNAs undergo post-transcriptional regulation , and defects in this process have been associated with tumor development [10–14]. The nature of the differences in miRNA processing and associated defects between intergenic and intragenic miRNA species is not currently elucidated.
miRNA target recognition in mammals is mainly mediated via imperfect Watson-Crick base-pairing to cognate sites primarily located in the 3'-UTR of mRNA targets. Predicted and validated miRNA targets include a functionally diverse suite of genes that include many transcription factors and cell signaling proteins, suggesting a role for miRNAs in regulatory feedback loops [15–17]. Intragenic miRNAs present unique regulatory possibilities based on functional relationships with their host genes. It has been speculated that intronic miRNAs may directly target their host message or regulate transcription factors, in what is commonly designated "first-order" or "second-order" negative feedback, respectively . Recently published work  demonstrates that miR-338, encoded in an intron of the apoptosis-associated tyrosine kinase (AATK) gene, targets several genes that are functionally antagonistic to the AATK protein. Therefore, miR-338 serves the functional interest of the host in this case via a higher-order positive feedback system that downregulates expression of AATK repressors and enforces neuronal differentiation downstream of the kinase.
In the current manuscript, large-scale bioinformatics analyses of human intronic miRNAs related to genomic organization and characterization of miRNA host and target genes are presented. We identify characteristics of host genes and predicted targets, and present evidence that intragenic miRNAs may act as negative feedback regulatory elements of their hosts' interactome (i.e., they can regulate host gene neighbours in addition to host genes).
We integrated genomic and transcriptomic information to analyze properties of intragenic miRNAs themselves, their host genes, as well as their targets. We used all known miRNAs (based on miRBase), all known human transcripts (based on RefSeq), six different and highly established miRNA target prediction algorithms, as well as the gene and pathway annotation ontologies GO and KEGG.
Classification of miRNAs
Classification of miRNAs in the Genome of Different Species
miRNAs on Host Gene Strand
miRNAs on Opposite Host Strand
Positional Bias of Intragenic miRNAs
The orientation of the gene for an intronic miRNA depends significantly on the transcription direction of its host gene (p-value = 1.3 × 10-36 in χ2 test) as shown in Table 1. We found that 65.5% of host genes had miRNAs in the first five introns. Also, we confirmed that the observed distribution differs significantly from the expected distribution within the first five introns (p = 0.030 in χ2 test, additional file 2).
Characterization of Host Genes
Properties of Host and Target Genes
Total length (basepairs)
Length 5'UTR (basepairs)
Length 3'UTR (basepairs)
ARE (per kb)
5' UTR GC content
Gene expression can be pre- and post-transcriptionally controlled through regulatory motifs in their 3'-UTRs. Even though regulatory mechanisms are not well understood, two important concepts include regulation through miRNAs, and the role of adenylate/uridylate-rich elements (AREs) in mediating mRNA decay, which plays a significant role in cancer development [21–23]. We first compared the length of the 3'-UTRs of host genes to the length of 3'-UTRs of the random sample. Host genes have 40% longer 3'-UTRs (p-value < 0.01). In a second step, we counted occurrences of the pentamer AUUUA in these regions, normalized by the length of the 3'-UTRs. We found significantly more ARE units in host genes than in the random sample (p-value < 0.01). Since recently miRNA target genes have been shown to be larger than non-target genes , we analyzed total lengths and lengths of 3'-UTRs  for host genes predicted to be targeted by their intragenic miRNA and the remaining host genes separately. No significant difference in lengths between the two groups of host genes was observed (p-value = 0.3939), but genes in both groups were longer than genes in the control group (p-value = 1.552e-07 and p-value < 2.2e-16).3'-UTRs were longer in host genes predicted to be targets of their intronic miRNA than in host genes not predicted to be targets (p-value = 0.001) and control genes (p-value = 5.012e-07). In contrast, the 5'-UTRs of host genes were not significantly longer than the ones in the control group (p-value > 0.05).
The GO Biological Process (GOBP) and KEGG are ontologies that associate genes with, cellular processes and biochemical pathways, respectively, including disease pathways. When surveying GOBP for overrepresentation of miRNA host genes in certain categories, we found significant enrichment in gene regulatory, metabolic, neurogenic, and cytoskeletal processes, which reflects the broad range of diseases with which miRNAs have been associated [12, 26–34]. Additionally, we found that host genes were overrepresented in several signaling pathways, such as the MAPK, ErbB, VEGF, and the calcium signaling pathway.
Genomic Properties of Target Genes
We looked at genomic properties of a high-confidence set of targets for hosts of intronic miRNAs (prediction agreement ≥ 6) that would give us a set of similar size as the host genes. We then randomly sampled RefSeq transcripts to match chromosome and strand distribution as a control set and performed the analysis analogously to the analysis of genomic properties of the host genes themselves. Table 2 summarizes the results, revealing that the predicted targets have properties that are highly similar to those of host genes.
Relationship Between Intragenic miRNAs and Host Genes
We found that approximatelly 20% of intragenic miRNAs (56 of them, hosted in 49 distinct genes) are predicted to target their own host by at least two methods. This number is significantly higher than would be expected by chance alone (p-value < 0.001, obtained by random sampling). Furthermore, we assessed the robustness of our approach by following the above procedure while applying a voting method as the gold standard. We assigned each of the target prediction methods to one of two groups of equal size (n = 3) and required at least one vote from each group to consider that a prediction of a miRNA-host interaction. TarBase did not contain a single instance of miRNA-host interaction, so it was excluded from the analysis. Although the numbers of miRNAs predicted to target their own host varied (12 - 55), depending on which group they had been assigned to, in each case the observed number was significantly higher than would be expected by chance (p-value < 0.05, see also additional file 3). Given that host genes that were predicted to be targets of their intragenic miRNA have longer 3'-UTR regions, statistical significance of the number of hosts being targeted by their intronic miRNAs was assessed by repeated creation of sets of non-host control genes with similar 3'-UTR distribution (see Materials and Methods). In line with our previous observations, the number of hosts predicted to be targeted by their intragenic miRNAs (49) was significantly higher than expected by chance (p-value = 0.032).
Pathways with Overrepresentation of Genes Targeted by an Intronic miRNA
Host Genes in Pathway
ATF2; DDIT3; AKT2; FGF13; ARRB1; PPP3CA; PRKCA; CACNG8; RPS6KA2; MAP2K4; RPS6KA4
PPP3CA; PTK2; SEMA4G; SEMA3F; SLIT3; ABLIM2; SLIT2
Ubiquitin Mediated Proteolysis
HUWE1; WWP2; BIRC6; ITCH
COL3A1; AKT2; PRKCA; PTK2; TLN2
Regulation of Actin Cytoskeleton
CHRM2; FGF13; SSH1; PTK2
Chronic Myloid Leukemia
ERBB4; AKT2; PRKCA; PTK2; MAP2K4
Glycan Structures Biosynthesis
MGAT4B; FUT8; CSGLCA-T; GALNT10; HS3ST3A1
HTR2C; PRKCA; PRKG1
Non-Small Cell Lung Cancer
Small Cell Lung Cancer
PPP3CA; PRKCA; RPS6KA2
T-Cell Receptor Signaling
Implications for Cancer Pathogenesis
The correlation between host and target expression levels is shown in a two-bar plot. The first bar, labeled "N", represents the correlation between host and target in normal tissue. The second bar, labeled "T", represents the correlation between host and target in cancerous tissue. In the prostate cancer dataset, seven of the fifteen targets are more negatively correlated in healthy tissue than in cancer. In four cases (AKT3, AR, MAPK1, and CTNNB1), we could observe a significant negative correlation in normal tissue, which was either non-significant or was significantly positive in cancer. A similar pattern could be observed in the non small cell lung cancer pathway.
Since the first discovery of miRNAs, our understanding of biogenesis and regulation has exponentially grown. In the recent past, it has been estimated that miRNAs that reside in intronic or exonic regions of other genes may be the dominating class . However, functional aspects of intragenic miRNAs are still largely unknown.
It is generally believed that both host and miRNA share regulatory control [4–6], although a recent study found that transcription of roughly 30% of intragenic miRNAs may be initiated independently . After mapping miRNAs to known genes, we found that most intronic miRNAs are oriented in the same direction as their host gene, significantly more than would be expected by chance. Several hypotheses related to this preferential orientation have been suggested. First, most of intragenic miRNAs may not present their own promoter and be dependent to the transcription of their host gene. Second, miRNAs may present their own promoter, and directional bias may prevent physical interference between RNA polymerases transcribing the host gene and RNA polymerases transcribing the miRNA gene .
Conservation of miRNA-Host Pairs
miRNA-Host Pairs Predicted to Target own Host
miRNA-Host Pairs Not Predicted to Target own Host
Homo sapiens - Mus musculus
Homo sapiens - Canis familiaris
Homo sapiens - Gallus gallus
Loss of negative feedback control systems is a well-known mechanism by which cancer develops. Blenkiron and coworkers  recently suggested that miRNA processing might be disturbed in cancer. If expression levels of intragenic miRNAs are reduced, as observed by some authors [13, 45], subsequently important signalling pathways may lose inhibition and this may facilitate uncontrolled cell growth. In a recent study, Tavazoie et al. analyzed six miRNAs that were significantly under-expressed in breast cancer LM2 cells, as compared to normal breast tissue. Four of these miRNAs were intragenic . The authors reported that loss of the intronic miRNA hsa-miR-335, which resides in intron 2 of its host gene MEST, led to increased migration and invasion rates and hence increased metastatic capacity. Additionally, they could show that hsa-miR-126 (intron 7, host EGFL7) significantly reduced proliferation of breast cancer cells. Likewise, hsa-miR-151 has been shown to be downregulated in chronic myloid leukemia through BCR/ABL, and silencing its host gene PTK2 inhibits leukemogenesis . A similar pattern can for example be found for hsa-miR-504 and FGF13[49, 50].
Changes in miRNA biosynthesis such as those found in cancer can interfere with the coordination of expression of miRNA and host. Thus, a negative correlation between expression levels of host and genes targeted by its intragenic miRNA in normal tissue (given that the host is not targeted by the miRNA it contains) and a less negative or even positive correlation in cancerous tissue might be expected. This phenomenon was observed in two distinct datasets in different malignancies (Figure 2, additional file 4 and additional file 6). A key to pathogenesis of both entities is the phosphatidylinositol 3-kinase(PIK3)/AKT signaling pathway, deregulation of which has been reported in several cancers, including prostate cancer , lung cancer , ovarian cancer [53, 54], breast cancer [53, 55], and colon tumors . Whereas Noske et al. discovered that silencing AKT2 through RNA interference leads to reduction in ovarian cancer cell proliferation , Maroulakou and coworkers reported accelerated development of polyoma middle T and ErbB2/Neu-driven mammary adenocarcinomas in mice after AKT2 ablation . Although these findings would appear to be contradictory at first, they can be explained by an intragenic miRNA-driven negative regulatory loop that is disturbed in cancer. Whereas in the first experiment AKT2 was targeted on mRNA level (and therefore mimicking the role of the corresponding intronic miRNA), in the second experiment both host mRNA and miRNA (if it exists in mouse) were downregulated, and therefore may have disabled a potential negative feedback regulation by hsa-miR-641.
One must remember, however, that regulatory networks are far more complex in reality than what we are currently able to model. Transcription factors, enhancers, silencers, and epigenetic modifications play major roles in cancer development and may influence correlation among expression levels of hosts and targets. Also, target prediction methods are error prone, and at this point we can only speculate about the true nature of events and therefore plan to conduct further experiments in which the hypotheses presented can be tested. For example, by integrating different target prediction methods, roughly 20% of intragenic miRNAs were predicted to target their own host. Though this number is significantly higher than expected by chance, it still does not cover the majority of miRNAs. For one, this number may underestimate the true number of miRNAs targeting their own host due to limitations of target prediction methods. Additionally, it has lately been shown that transcription of one third of intronic miRNAs can be initiated independently of the host's transcription , in which case direct feedback cannot be claimed. Also, we only investigated feedback on the level of direct miRNA-host interaction and on the order of the interactome based on the KEGG database. However, knowledge about interaction of proteins is still limited and cotranscription of host and miRNA may enable more complex mechanisms. Limitations to current knowledge may also justify, why a significant fraction of predicted targets in do not show the expected behaviour. Indeed, Cyclin E and E2F in Figure 2 show opposite behavior than what we would expect. Neither of these genes might actually be a target of hsa-miR-641; there may also exist stronger regulating elements that control their expression, or the primary mode of silencing in that specific situation may be through translational repression. Nevertheless, it is interesting how key molecules in two different datasets displayed predicted correlation patterns.
Further experiments and biological validation of computational evidence presented here may have great implications, especially in cancer therapy. Modern therapies usually target central molecules, such as AKT and PI3K with some success. However, these techniques control only single elements in a cascade of complex signalling events. In summary, our findings encourage more focused research on intragenic miRNAs and their targets.
Classification of miRNAs
miRNA genomic coordinates from miRBase release 11 (April 2008) [58–60] were crossed to genomic coordinates of RNA Reference Sequences (http://0-www.ncbi.nlm.nih.gov.brum.beds.ac.uk/RefSeq; Release 31)  downloaded from UCSC Genome Browser http://genome.ucsc.edu. To each genomic mapped RefSeq sequence, a single gene was assigned. The subset of miRNAs whose coordinates mapped to an annotated gene was defined as intragenic. Intragenic miRNAs were classified as exonic when their coordinates overlapped with any observed exonic region, and intronic otherwise.
Host Genes' Intronic miRNA Distribution
Introns were sequentially enumerated based on gene orientation. For each intron number, host genes containing miRNAs in this intron were counted. We calculated the expected number of genes containing an intronic miRNAs in a given intron number by adding all intron lengths of introns with the respective intron number and dividing it by the summed length of all host genes' introns, thus accounting for intron frequency and length.
The Gene Ontology  classifications of all 246 host genes of intragenic miRNA genes that were located on the same strand as their host gene were surveyed using Cytoscape 2.6.0  and BiNGO 2.3 . We focused our attention on those categories that were disproportionately overrepresented. The setting "Hypergeometric test" was chosen to calculate the probability of observing an equal or greater number of genes in a given functional category than in the test set. The False Discovery Rate (FDR), which is the standard setting in BiNGO 2.3 , was controlled.
The statistical programming software R 2.7.1 was used in combination with Bioconductor [64, 65] packages AnnBuilder 1.18.0, KEGG.db version 2.2.0, and GOStats version 1.7.4 to acquire a list of pathways that were associated with one or more of the 246 host gene proteins.
Strategies to perform high-throughput miRNA target validation are still very limited. Therefore, target prediction algorithms are employed to allow large-scale assessment of miRNA-target interaction. However, usage of target prediction methods raises two difficulties. First, target prediction methods are known to suffer from a significant number of false positive predictions. We reasoned that a possible way to address this problem would be to estimate statistical significance by generating background distributions by the very same methods. Hence, if target predictions were too close to random, the mean of the generated background distribution should be close to the observed number, whereas a significant finding should not be affected by the absolute number of false positives. Second, different target prediction algorithms incorporate different types of information about miRNA target interactions. To overcome individual biases that may be introduced by one specific method and use the wide range of experimental knowledge gained, we integrated predictions from six current algorithms. Precalculated target predictions for TargetScan release 4.2  (April 2008), PITA  catalog version 6 (August 2008), MirTarget2 (mirDB) version 2.0 [68, 69] (December 2007), miRanda  (September 2008), RNA22  (November 2006) and PicTar 5-way  were downloaded. We also included TarBase version 5.0c  (June 2008) as a reference database for miRNA target interactions with published evidence; only targets with a "Support Type" value of either "True" or "Microarray" were selected. Some miRNA symbols did not exactly match entries in the database for various reasons, including use of non-official names or older miRBase releases. Whenever a miRNA symbol could not be found, matching was attempted to an extension such as "-1" or "a" (for example, hsa-mir-511 in mirTarget2 was matched to hsa-mir-511-1 and hsa-mir-511-2). If the miRNA symbol ended with a letter, it was removed to check for other matches (from the PicTar prediction list hsa-mir-128a matched to hsa-mir-128-1, hsa-mir-128-2, and hsa-mir-128-3 for example). Predictions for a miRNA symbol were ignored if no matches could be found. Due to the diversity of underlying principles, assumptions, and scoring systems, we defined the prediction agreement, i.e. the number of methods that agree on a certain miRNA target prediction, as a measure of confidence in the target prediction. In recent work, Selbach et al. measured changes in protein and mRNA expression after transfection and overexpression of five different miRNAs (hsa-miR-1, hsa-miR-16, hsa-miR-30a, hsa-miR-255, hsa-let-7b) in HeLa cells . We evaluated the different target prediction methods used in this study by measuring the abundance of predicted products (mRNA or the proteins encoded by these mRNAs, a continuous value) and assessing discrimination by areas under the ROC curve using the predicted targets as the binary outcome. All five miRNA datasets were pooled (see additional file 3 for details). The AUC (Area under Receiver Operator Characteristic (ROC) Curve) measures how well predictions and non-predictions can be discriminated at all possible thresholds, with a value of 0.5 indicating no discrimination and a value of 1 indicating perfect discrimination. Target prediction methods varied greatly in AUCs, ranging from 0.55 to 0.92 in protein measurements. With increasing prediction agreement, an almost linear increase in AUC can be observed, indicating that prediction agreement may be used as a proxy for the confidence of a predicted miRNA target interaction (Figure 3.
Gene Expression Datasets
Two publicly available mRNA expression datasets (GSE6956, GSE7670) were downloaded from the Gene Expression Omnibus http://0-www.ncbi.nlm.nih.gov.brum.beds.ac.uk/geo. We included 87 prostate samples (69 tumor and 18 healthy tissue samples)  and 60 lung samples (31 non-small-cell lung cancer and 29 healthy lung tissue samples) . Preprocessing was carried out using BioConductor packages [64, 65]. Data from protein and mRNA expression change after miRNA transfection experiments were downloaded from http://psilac.mdc-berlin.de.
Genomic Host and Target Gene Properties
In order to assess genomic properties of host genes (n = 246), we constructed a set of control genes (n = 2460) that would match chromosome and strand distribution of host genes in order to exclude structural differences due to chromosomal specificities. We defined miRNA target interactions predicted by at least 6 methods as "high confidence targets" (n = 326). These predictions cover 33 host genes and 43 miRNAs when at least six methods are required and 239 hosts and 272 miRNAs when at least 2 methods are required. Statistical testing was done using Mann-Whitney-U test. For the analysis of total length and 3'-UTR length of host genes, hosts were additionally split into two groups, dependent on whether they were predicted to be targets of their intragenic miRNA. We combined the Kruskal-Wallis rank sum test with post-hoc pairwise Mann-Whitney-U test with Bonferroni correction (p < 0.016 defined as significance cut-off for three pairwise comparisons). Assessment of host genes predicted to be targeted by their intragenic miRNAs was carried out as follows: Out of the 2460 control genes, we sampled 1000 sets of genes of size 246 that would match the host gene 3'-UTR length distribution (no significant difference in Mann-Whitney-U test). Intragenic miRNAs were assigned to genes in the sets and the number of genes predicted to be targeted by that miRNA was calculated. Similarly, the observed number of miRNAs predicted to target their own host was assessed by exchanging host genes for randomly chosen genes from predicted targets and recalculation of the number of miRNAs predicted to target their host. Robustness of this approach was tested by additionally requiring a vote from each of two groups of three prediction methods each.
HomoloGene database NCBI release 61 http://0-www.ncbi.nlm.nih.gov.brum.beds.ac.uk/homologene and mirBase release 11 (April 2008) [58–60] were used to identify homologous host genes in Homo sapiens, Canis familiaris, and Gallus gallus. Proportions of conserved miRNA-host gene pairs for miRNAs predicted and miRNAs not predicted to target their own host were calculated. Similarly, we used information on target site conservation from TargetScan to calculate the proportion of conserved targetsites of predicted target host interactions in the hosts' pathway and of those not in the hosts' pathway. Statistical significance was assessed using the 2-sample test for equality of proportions with continuity correction.
The union of predicted targets included more than 90% of all known human genes. Since target prediction methods are very different, they are difficult to compare. In this work, only targets that were predicted by at least two different methods were considered in the calculation of target coverage. This reduced the total number of predictions by almost 70%.
Analogously, the enrichment statistics for miRNAs targeting their own hosts were calculated, where S p was defined as the set of host genes, S t as the set of targets of the intragenic miRNAs of these host genes, and S i as the set of |S p | randomly sampled genes (out of the non-redundant set of predicted targets for these miRNAs). The R-package 'q-value' was used to account for multiple hypothesis testing by controlling the False Discovery Rate (FDR) to be < 10%.
We would like to thank David Singerman, Staal Vinterbo, Jihoon Kim, Erik Pitzer, and Leo Anthony Celi for all the helpful discussions and their valuable feedback.
Part of this work was included in L.C. Hinske's doctoral thesis at Ludwig-Maximilians-University Munich. This work was funded in part by grant FAS0703850 from the Komen Foundation. P. Galante was funded in part by grant D43TW007015 from the Fogarty International Center, NIH. L. Ohno-Machado was funded in part by grants R01LM009520 and U54HL108460 from the NIH. This work was furthermore conducted with support from Harvard Catalyst | The Harvard Clinical and Translational Science Center (NIH Award #UL1 RR 025758 and financial contributions from Harvard University and its affiliated academic health care centers). The content is solely the responsibility of the authors and does not necessarily represent the official views of Harvard Catalyst, Harvard University and its affiliated academic health care centers, the National Center for Research Resources, or the National Institutes of Health.
- Lee Y, Kim M, Han J, Yeom K-H, Lee S, Baek SH, Kim VN: MicroRNA genes are transcribed by RNA polymerase II. EMBO J. 2004, 23: 4051-4060. 10.1038/sj.emboj.7600385.PubMed CentralPubMedView ArticleGoogle Scholar
- Borchert GM, Lanier W, Davidson BL: RNA polymerase III transcribes human microRNAs. Nat Struct Mol Biol. 2006, 13: 1097-1101. 10.1038/nsmb1167.PubMedView ArticleGoogle Scholar
- Ozsolak F, Poling LL, Wang Z, Liu H, Liu XS, Roeder RG, Zhang X, Song JS, Fisher DE: Chromatin structure analyses identify miRNA promoters. Genes Dev. 2008, 22: 3172-3183. 10.1101/gad.1706508.PubMed CentralPubMedView ArticleGoogle Scholar
- Baskerville S, Bartel DP: Microarray profiling of microRNAs reveals frequent coexpression with neighboring miRNAs and host genes. RNA. 2005, 11: 241-247. 10.1261/rna.7240905.PubMed CentralPubMedView ArticleGoogle Scholar
- Rodriguez A, Griffiths-Jones S, Ashurst JL, Bradley A: Identification of mammalian microRNA host genes and transcription units. Genome Research. 2004, 14: 1902-1910. 10.1101/gr.2722704.PubMed CentralPubMedView ArticleGoogle Scholar
- Gennarino VA, Sardiello M, Avellino R, Meola N, Maselli V, Anand S, Cutillo L, Ballabio A, Banfi S: MicroRNA target prediction by expression analysis of host genes. Genome Research. 2009, 19: 481-490. 10.1101/gr.084129.108.PubMed CentralPubMedView ArticleGoogle Scholar
- Kim VN, Han J, Siomi MC: Biogenesis of small RNAs in animals. Nature Reviews Molecular Cell Biology. 2009, 10: 126-139. 10.1038/nrm2632.PubMedView ArticleGoogle Scholar
- Morlando M, Ballarino M, Gromak N, Pagano F, Bozzoni I, Proudfoot N: Primary microRNA transcripts are processed co-transcriptionally. Nat Struct Mol Biol. 2008Google Scholar
- Kim Y-K, Kim VN: Processing of intronic microRNAs. EMBO J. 2007, 26: 775-783. 10.1038/sj.emboj.7601512.PubMed CentralPubMedView ArticleGoogle Scholar
- Obernosterer G, Leuschner PJF, Alenius M, Martinez J: Post-transcriptional regulation of microRNA expression. RNA. 2006, 12: 1161-1167. 10.1261/rna.2322506.PubMed CentralPubMedView ArticleGoogle Scholar
- Blenkiron C, Goldstein LD, Thorne NP, Spiteri I, Chin S-F, Dunning MJ, Barbosa-Morais NL, Teschendorff AE, Green AR, Ellis IO: MicroRNA expression profiling of human breast cancer identifies new markers of tumor subtype. Genome Biol. 2007, 8: R214-10.1186/gb-2007-8-10-r214.PubMed CentralPubMedView ArticleGoogle Scholar
- Lee EJ, Baek M, Gusev Y, Brackett DJ, Nuovo GJ, Schmittgen TD: Systematic evaluation of microRNA processing patterns in tissues, cell lines, and tumors. RNA. 2008, 14: 35-42. 10.1261/rna.804508.PubMed CentralPubMedView ArticleGoogle Scholar
- Thomson JM, Newman M, Parker JS, Morin-Kensicki EM, Wright T, Hammond SM: Extensive post-transcriptional regulation of microRNAs and its implications for cancer. Genes Dev. 2006, 20: 2202-2207. 10.1101/gad.1444406.PubMed CentralPubMedView ArticleGoogle Scholar
- Merritt WM, Lin YG, Han LY, Kamat AA, Spannuth WA, Schmandt R, Urbauer D, Pennacchio LA, Cheng J-F, Nick AM: Dicer, Drosha, and outcomes in patients with ovarian cancer. N Engl J Med. 2008, 359: 2641-2650. 10.1056/NEJMoa0803785.PubMed CentralPubMedView ArticleGoogle Scholar
- Fujita S, Ito T, Mizutani T, Minoguchi S, Yamamichi N, Sakurai K, Iba H: miR-21 Gene expression triggered by AP-1 is sustained through a double-negative feedback mechanism. Journal of Molecular Biology. 2008, 378: 492-504. 10.1016/j.jmb.2008.03.015.PubMedView ArticleGoogle Scholar
- Li X, Carthew RW: A microRNA mediates EGF receptor signaling and promotes photoreceptor differentiation in the Drosophila eye. Cell. 2005, 123: 1267-1277. 10.1016/j.cell.2005.10.040.PubMedView ArticleGoogle Scholar
- Martinez NJ, Ow MC, Reece-Hoyes JS, Barrasa MI, Ambros VR, Walhout AJM: Genome-scale spatiotemporal analysis of Caenorhabditis elegans microRNA promoter activity. Genome Research. 2008, 18: 2005-2015. 10.1101/gr.083055.108.PubMed CentralPubMedView ArticleGoogle Scholar
- Li S-C, Tang P, Lin W-C: Intronic microRNA: discovery and biological implications. DNA and Cell Biology. 2007, 26: 195-207. 10.1089/dna.2006.0558.PubMedView ArticleGoogle Scholar
- Barik S: An intronic microRNA silences genes that are functionally antagonistic to its host gene. Nucleic Acids Research. 2008, 36: 5232-5241. 10.1093/nar/gkn513.PubMed CentralPubMedView ArticleGoogle Scholar
- Pruitt KD, Tatusova T, Maglott DR: NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Research. 2005, 33: D501-504. 10.1093/nar/gki025.PubMed CentralPubMedView ArticleGoogle Scholar
- Grzybowska E, Wilczynska A, Siedlecki J: Regulatory Functions of 3' UTRs. Biochemical and Biophysical Research Communications. 2001Google Scholar
- Mazumder B, Seshadri V, Fox PL: Translational control by the 3'-UTR: the ends specify the means. Trends in Biochemical Sciences. 2003, 28: 91-98. 10.1016/S0968-0004(03)00002-1.PubMedView ArticleGoogle Scholar
- Mignone F, Gissi C, Liuni S, Pesole G: Untranslated regions of mRNAs. Genome Biol. 2002Google Scholar
- Hu Z: Insight into microRNA regulation by analyzing the characteristics of their targets in humans. BMC Genomics. 2009, 10: 594-10.1186/1471-2164-10-594.PubMed CentralPubMedView ArticleGoogle Scholar
- Ulitsky I, Laurent LC, Shamir R: Towards computational prediction of microRNA function and activity. Nucleic Acids Research. 2010Google Scholar
- Divakaran V, Mann DL: The emerging role of microRNAs in cardiac remodeling and heart failure. Circ Res. 2008, 103: 1072-1083. 10.1161/CIRCRESAHA.108.183087.PubMed CentralPubMedView ArticleGoogle Scholar
- Eisenberg I, Eran A, Nishino I, Moggio M, Lamperti C, Amato AA, Lidov HG, Kang PB, North KN, Mitrani-Rosenbaum S: Distinctive patterns of microRNA expression in primary muscular disorders. Proc Natl Acad Sci USA. 2007, 104: 17016-17021. 10.1073/pnas.0708115104.PubMed CentralPubMedView ArticleGoogle Scholar
- Fiore R, Siegel G, Schratt G: MicroRNA function in neuronal development, plasticity and disease. Biochim Biophys Acta. 2008, 1779: 471-478.PubMedView ArticleGoogle Scholar
- Grassmann R, Jeang K-T: The roles of microRNAs in mammalian virus infection. Biochim Biophys Acta. 2008, 1779: 706-711.PubMed CentralPubMedView ArticleGoogle Scholar
- Hansen T, Olsen L, Lindow M, Jakobsen K, Ullum H: Brain Expressed microRNAs Implicated in Schizophrenia Etiology. PLoS ONE. 2007Google Scholar
- Kuhn DE, Nuovo GJ, Martin MM, Malana GE, Pleister AP, Jiang J, Schmittgen TD, Terry AV, Gardiner K, Head E: Human chromosome 21-derived miRNAs are overexpressed in down syndrome brains and hearts. Biochemical and Biophysical Research Communications. 2008, 370: 473-477. 10.1016/j.bbrc.2008.03.120.PubMed CentralPubMedView ArticleGoogle Scholar
- Ma L, Weinberg RA: MicroRNAs in malignant progression. Cell Cycle. 2008, 7: 570-572. 10.4161/cc.7.5.5547.PubMedView ArticleGoogle Scholar
- Shi X-B, Tepper CG, deVere White RW: Cancerous miRNAs and their regulation. Cell Cycle. 2008, 7: 1529-1538.PubMedView ArticleGoogle Scholar
- Tili E, Michaille J-J, Costinean S, Croce CM: MicroRNAs, the immune system and rheumatic disease. Nature clinical practice Rheumatology. 2008, 4: 534-541. 10.1038/ncprheum0885.PubMedView ArticleGoogle Scholar
- Ambs S, Prueitt RL, Yi M, Hudson RS, Howe TM, Petrocca F, Wallace TA, Liu C-G, Volinia S, Calin GA: Genomic profiling of microRNA and messenger RNA reveals deregulated microRNA expression in prostate cancer. Cancer Research. 2008, 68: 6162-6170. 10.1158/0008-5472.CAN-08-0144.PubMed CentralPubMedView ArticleGoogle Scholar
- Carè A, Catalucci D, Felicetti F, Bonci D, Addario A, Gallo P, Bang M-L, Segnalini P, Gu Y, Dalton ND: MicroRNA-133 controls cardiac hypertrophy. Nat Med. 2007, 13: 613-618. 10.1038/nm1582.PubMedView ArticleGoogle Scholar
- ENCODE Project Consortium, Birney E, Stamatoyannopoulos JA, Dutta A, Guigó R, Gingeras TR, Margulies EH, Weng Z, Snyder M, Dermitzakis ET: Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature. 2007, 447: 799-816. 10.1038/nature05874.View ArticleGoogle Scholar
- Xie X, Lu J, Kulbokas EJ, Golub TR, Mootha V, Lindblad-Toh K, Lander ES, Kellis M: Systematic discovery of regulatory motifs in human promoters and 3' UTRs by comparison of several mammals. Nature. 2005, 434: 338-345. 10.1038/nature03441.PubMed CentralPubMedView ArticleGoogle Scholar
- Golan D, Levy C, Friedman B, Shomron N: Biased hosting of intronic microRNA genes. Bioinformatics. 2010Google Scholar
- Castillo-Davis CI, Mekhedov SL, Hartl DL, Koonin EV, Kondrashov FA: Selection for short introns in highly expressed genes. Nat Genet. 2002, 31: 415-418.PubMedGoogle Scholar
- Pickering BM, Willis AE: The implications of structured 5' untranslated regions on translation and disease. Seminars in Cell & Developmental Biology. 2005, 16: 39-47.View ArticleGoogle Scholar
- Niwa R, Zhou F, Li C, Slack FJ: The expression of the Alzheimer's amyloid precursor protein-like gene is regulated by developmental timing microRNAs and their targets in Caenorhabditis elegans. Dev Biol. 2008, 315: 418-425. 10.1016/j.ydbio.2007.12.044.PubMed CentralPubMedView ArticleGoogle Scholar
- Sun Y, Bai Y, Zhang F, Wang Y, Guo Y, Guo L: miR-126 inhibits non-small cell lung cancer cells proliferation by targeting EGFL7. Biochemical and Biophysical Research Communications. 2010, 391: 1483-1489. 10.1016/j.bbrc.2009.12.098.PubMedView ArticleGoogle Scholar
- Nikolic I, Plate K-H, Schmidt MH: EGFL7 meets miRNA-126: an angiogenesis alliance. J Angiogenes Res. 2010, 2: 9-10.1186/2040-2384-2-9.PubMed CentralPubMedView ArticleGoogle Scholar
- Muralidhar B, Goldstein LD, Ng G, Winder DM, Palmer RD, Gooding EL, Barbosa-Morais NL, Mukherjee G, Thorne NP, Roberts I: Global microRNA profiles in cervical squamous cell carcinoma depend on Drosha expression levels. J Pathol. 2007, 212: 368-377. 10.1002/path.2179.PubMedView ArticleGoogle Scholar
- Tavazoie SF, Alarcón C, Oskarsson T, Padua D, Wang Q, Bos PD, Gerald WL, Massagué J: Endogenous human microRNAs that suppress breast cancer metastasis. Nature. 2008, 451: 147-152. 10.1038/nature06487.PubMed CentralPubMedView ArticleGoogle Scholar
- Agirre X, Jiménez-Velasco A, San José-Enériz E, Garate L, Bandrés E, Cordeu L, Aparicio O, Saez B, Navarro G, Vilas-Zornoza A: Down-regulation of hsa-miR-10a in chronic myeloid leukemia CD34+ cells increases USF2-mediated cell growth. Mol Cancer Res. 2008, 6: 1830-1840. 10.1158/1541-7786.MCR-08-0167.PubMedView ArticleGoogle Scholar
- Le Y, Xu L, Lu J, Fang J, Nardi V, Chai L, Silberstein LE: FAK silencing inhibits leukemogenesis in BCR/ABL-transformed hematopoietic cells. Am J Hematol. 2009, 84: 273-278. 10.1002/ajh.21381.PubMedView ArticleGoogle Scholar
- Missiaglia E, Dalai I, Barbi S, Beghelli S, Falconi M, della Peruta M, Piemonti L, Capurso G, Di Florio A, delle Fave G: Pancreatic endocrine tumors: expression profiling evidences a role for AKT-mTOR pathway. J Clin Oncol. 2010, 28: 245-255. 10.1200/JCO.2008.21.5988.PubMed CentralPubMedView ArticleGoogle Scholar
- Kano M, Seki N, Kikkawa N, Fujimura L, Hoshino I, Akutsu Y, Chiyomaru T, Enokida H, Nakagawa M, Matsubara H: miR-145, miR-133a and miR-133b: Tumor suppressive miRNAs target FSCN1 in esophageal squamous cell carcinoma. Int J Cancer. 2010Google Scholar
- Boormans JL, Hermans KG, van Leenders GJLH, Trapman J, Verhagen PCMS: An activating mutation in AKT1 in human prostate cancer. Int J Cancer. 2008, 123: 2725-2726. 10.1002/ijc.23787.PubMedView ArticleGoogle Scholar
- Forgacs E, Biesterveld EJ, Sekido Y, Fong K, Muneer S, Wistuba II, Milchgrub S, Brezinschek R, Virmani A, Gazdar AF, Minna JD: Mutation analysis of the PTEN/MMAC1 gene in lung cancer. Oncogene. 1998, 17: 1557-1565. 10.1038/sj.onc.1202070.PubMedView ArticleGoogle Scholar
- Bellacosa A, de Feo D, Godwin AK, Bell DW, Cheng JQ, Altomare DA, Wan M, Dubeau L, Scambia G, Masciullo V: Molecular alterations of the AKT2 oncogene in ovarian and breast carcinomas. Int J Cancer. 1995, 64: 280-285. 10.1002/ijc.2910640412.PubMedView ArticleGoogle Scholar
- Philp AJ, Campbell IG, Leet C, Vincan E, Rockman SP, Whitehead RH, Thomas RJ, Phillips WA: The phosphatidylinositol 3'-kinase p85alpha gene is an oncogene in human ovarian and colon tumors. Cancer Research. 2001, 61: 7426-7429.PubMedGoogle Scholar
- Sun M, Paciga JE, Feldman RI, Yuan Z, Coppola D, Lu YY, Shelley SA, Nicosia SV, Cheng JQ: Phosphatidylinositol-3-OH Kinase (PI3K)/AKT2, activated in breast cancer, regulates and is induced by estrogen receptor alpha (ERalpha) via interaction between ERalpha and PI3K. Cancer Research. 2001, 61: 5985-5991.PubMedGoogle Scholar
- Noske A, Kaszubiak A, Weichert W, Sers C, Niesporek S, Koch I, Schaefer B, Sehouli J, Dietel M, Lage H, Denkert C: Specific inhibition of AKT2 by RNA interference results in reduction of ovarian cancer cell proliferation: increased expression of AKT in advanced ovarian cancer. Cancer Lett. 2007, 246: 190-200. 10.1016/j.canlet.2006.02.018.PubMedView ArticleGoogle Scholar
- Maroulakou IG, Oemler W, Naber SP, Tsichlis PN: Akt1 ablation inhibits, whereas Akt2 ablation accelerates, the development of mammary adenocarcinomas in mouse mammary tumor virus (MMTV)-ErbB2/neu and MMTV-polyoma middle T transgenic mice. Cancer Research. 2007, 67: 167-177. 10.1158/0008-5472.CAN-06-3782.PubMedView ArticleGoogle Scholar
- Griffiths-Jones S, Saini HK, Van Dongen S, Enright AJ: miRBase: tools for microRNA genomics. Nucleic Acids Research. 2008, 36: D154-158. 10.1093/nar/gkm952.PubMed CentralPubMedView ArticleGoogle Scholar
- Griffiths-Jones S, Grocock RJ, Van Dongen S, Bateman A, Enright AJ: miRBase: microRNA sequences, targets and gene nomenclature. Nucleic Acids Research. 2006, 34: D140-144. 10.1093/nar/gkj112.PubMed CentralPubMedView ArticleGoogle Scholar
- Griffiths-Jones S: miRBase: the microRNA sequence database. Methods Mol Biol. 2006, 342: 129-138.PubMedGoogle Scholar
- Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000, 25: 25-29. 10.1038/75556.PubMed CentralPubMedView ArticleGoogle Scholar
- Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Research. 2003, 13: 2498-2504. 10.1101/gr.1239303.PubMed CentralPubMedView ArticleGoogle Scholar
- Maere S, Heymans K, Kuiper M: BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics. 2005, 21: 3448-3449. 10.1093/bioinformatics/bti551.PubMedView ArticleGoogle Scholar
- Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, Ellis B, Gautier L, Ge Y, Gentry J: Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 2004, 5: R80-10.1186/gb-2004-5-10-r80.PubMed CentralPubMedView ArticleGoogle Scholar
- Gentleman R, Carey V, Dudoit S, Ellis B, Gautier L: The Bioconductor Project. bepresscom. 2003Google Scholar
- Lewis BP, Shih I-h, Jones-Rhoades MW, Bartel DP, Burge CB: Prediction of mammalian microRNA targets. Cell. 2003, 115: 787-798. 10.1016/S0092-8674(03)01018-3.PubMedView ArticleGoogle Scholar
- Kertesz M, Iovino N, Unnerstall U, Gaul U, Segal E: The role of site accessibility in microRNA target recognition. Nat Genet. 2007, 39: 1278-1284. 10.1038/ng2135.PubMedView ArticleGoogle Scholar
- Wang X, El Naqa IM: Prediction of both conserved and nonconserved microRNA targets in animals. Bioinformatics. 2008, 24: 325-332. 10.1093/bioinformatics/btm595.PubMedView ArticleGoogle Scholar
- Wang X: miRDB: a microRNA target prediction and functional annotation database with a wiki interface. RNA. 2008, 14: 1012-1017. 10.1261/rna.965408.PubMed CentralPubMedView ArticleGoogle Scholar
- John B, Enright AJ, Aravin A, Tuschl T, Sander C, Marks DS: Human MicroRNA targets. PLoS Biol. 2004, 2: e363-10.1371/journal.pbio.0020363.PubMed CentralPubMedView ArticleGoogle Scholar
- Miranda K, Huynh T, Tay Y, Ang Y, Tam W, Thomson A, Lim B, Rigoutsos I: A Pattern-Based Method for the Identification of MicroRNA Binding Sites and Their Corresponding Heteroduplexes. Cell. 2006, 126: 1203-1217. 10.1016/j.cell.2006.07.031.PubMedView ArticleGoogle Scholar
- Krek A, Grün D, Poy MN, Wolf R, Rosenberg L, Epstein EJ, MacMenamin P, da Piedade I, Gunsalus KC, Stoffel M, Rajewsky N: Combinatorial microRNA target predictions. Nat Genet. 2005, 37: 495-500. 10.1038/ng1536.PubMedView ArticleGoogle Scholar
- Sethupathy P, Corda B, Hatzigeorgiou AG: TarBase: A comprehensive database of experimentally supported animal microRNA targets. RNA. 2006, 12: 192-197. 10.1261/rna.2239606.PubMed CentralPubMedView ArticleGoogle Scholar
- Selbach M, Schwanhäusser B, Thierfelder N, Fang Z, Khanin R, Rajewsky N: Widespread changes in protein synthesis induced by microRNAs. Nature. 2008, 455: 58-63. 10.1038/nature07228.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.