- Research article
- Open Access
The human ABC transporter pseudogene family: Evidence for transcription and gene-pseudogene interference
BMC Genomics volume 9, Article number: 165 (2008)
Pseudogenes are an integral component of the human genome. Little attention, however, has so far been paid to the phenomenon that some pseudogenes are transcriptionally active. Recently, we demonstrated that the human ortholog of the rodent testis-specific ATP-binding cassette (ABC) transporter Abca17 is a ubiquitously transcribed pseudogene (ABCA17P). The aim of the present study was to establish a complete inventory of all ABC transporter pseudogenes in the human genome and to identify transcriptionally active ABC transporter pseudogenes. Moreover, we tested the hypothesis that a regulatory interdependency exists between ABC transporter pseudogenes and their parental protein coding equivalents.
Systematic bioinformatic analysis revealed the existence of 22 ABC transporter pseudogenes within the human genome. We identified two clusters on chromosomes 15 and 16, respectively, which harbor almost half of all pseudogenes (n = 10). Available information from EST and mRNA databases and RT-PCR expression profiling indicate that a large portion of the ABC transporter pseudogenes (45%, n = 10) are transcriptionally active and some of them are expressed as alternative splice variants. We demonstrate that both pseudogenes of the pseudoxanthoma elasticum gene ABCC6, ABCC6P1 and ABCC6P2, are transcribed. ABCC6P1 and ABCC6 possess near-identical promoter sequences and their tissue-specific expression profiles are strikingly similar raising the possibility that they form a gene-pseudogene dual transcription unit. Intriguingly, targeted knockdown of the transcribed pseudogene ABCC6P1 resulted in a significant reduction of ABCC6 mRNA expression levels.
The human genome contains a surprisingly small number of ABC transporter pseudogenes relative to other known gene families. They are unevenly distributed across the chromosomes. Importantly, a significant portion of the ABC transporter pseudogenes is transcriptionally active. The downregulation of ABCC6 mRNA levels by targeted suppression of the expression of its pseudogene ABCC6P1 provides evidence, for the first time, for a regulatory interdependence of a transcribed pseudogene and its protein coding counterpart in the human genome.
Human ATP-binding cassette (ABC) transporters form a large family of structurally highly related membrane transport proteins. Based on sequence similarities, the human ABC transporter family is categorized into seven subfamilies, denoted ABC A-G, and consists of 48 members. ABC transporters translocate a wide range of substrates across cellular membranes, e.g. lipids, sugars, peptides, ions and xenobiotics such as anticancer drugs [1, 2]. Defective ABC transporters cause monogenetic disorders and contribute to the development of complex diseases . The physiological importance of ABC proteins is highlighted by the fact that mutations in 14 human ABC transporter genes have thus far been linked to a spectrum of unrelated hereditary diseases .
Beside protein coding genes for ABC transporters, it has been shown that the human genome contains several ABC transporter pseudogenes . However, the exact number of ABC transporter pseudogenes present in the human genome is still unclear. Generally, pseudogenes are defined as non-functional sequences that are present in the genome with high similarity to one or more paralogous genes. This lack of function is a result of either failure of transcription or translation, or production of a protein that differs in function from the protein encoded by the functional gene . Pseudogenes are thought to arise either by gene duplication and subsequent degeneration of the duplicated gene or by retroposition of a reversely transcribed mRNA of the paralogous gene into the genome. The latter are intronless and referred to as "processed" pseudogenes .
During the past years, we and others provided evidence for the existence of ABC transporter pseudogenes which are transcriptionally active [7–9]. For example, we recently demonstrated that the human ortholog of the rodent testis-specific ABC transporter Abca17, a protein coding gene, is a pseudogene (ABCA17P) which is transcribed into up to 20 alternative splice variants . These results raise the intriguing possibility that ABC transporter pseudogene transcripts may serve biological functions other than providing the genetic blueprints for protein biosynthesis.
In this study, we applied a comprehensive bioinformatic approach to identify all ABC transporter pseudogenes in the human genome and systematically searched for evidence of transcribed pseudogenes. We identified a total of 22 pseudogenes within the ABC transporter family with a significant subfraction (n = 10) exhibiting transcriptional activity. We find evidence for transcription of the two previously known ABCC6 pseudogenes ABCC6P1 and ABCC6P2, respectively, and report their gene structures. Moreover, we demonstrate, for the first time, a regulatory interdependency between ABCC6P1, and its translated counterpart, the pseudoxanthoma elasticum gene ABCC6.
The human genome contains 22 ABC transporter pseudogenes
To identify homologs of human ABC transporters, we performed a genome-wide search based on the sequences of the known 48 human and the five murine ABC transporter genes, for which no orthologous counterpart as yet have been documented in the human genome (Abca14-Abca17,Abcg3). The strategy relied on a BLAST homology search (cut-off E-value: 0.001) in which the ABC transporter nucleotide reference sequences were used as query sequences. These were compared with a version of the human genome (assembly March 2006, NCBI Build 36, hg18) in which the known exons of the ABC transporter query sequences were masked. Our search algorithm resulted in 3,322,841 primary hits. Detailed sequence analyses revealed that a surprisingly large fraction (33/48, 69%) of the known human ABC transporter cDNAs contain repetitive elements (Additional File 1). We then excluded all hits that were attributable to repetitive elements and assorted the remaining hits (n = 174) according to their chromosomal position. Neighbouring hits were merged to a single pseudogene candidate when they (i) displayed highest similarity to the same protein coding ABC transporter gene, (ii) were localized at a distance < 0.1 Mb apart from each other and (iii) were oriented in the same 5'→ 3' order as the matching protein coding ABC transporter nucleotide sequence. In those cases where a neighbouring hit was absent, a flanking 0.1 Mb genomic region was aligned with the respective ABC transporter nucleotide sequence using the more rigorous alignment tool "matcher" from the EMBOSS package. This approach warrants that more distantly related ABC transporter pseudogenes are overlooked at a minimal rate. Hits with > 70% identity across at least one complete exon of the respective ABC transporter nucleotide sequence, were considered homologous and evaluated by the above criteria (i-iii). Merged hits were regarded as pseudogenes when they encompassed at least two homologous exons of the protein-coding gene and had defective reading frames which preclude translation into polypeptides that meet the minimum structural requirements for functional ABC full-size or half-size transporters. Using this strategy, we identified a total of 22 loci in the human genome that represent degenerated copies of protein coding ABC transporter genes. They show > 70% similarity to the cDNA of their protein coding counterparts across a range of 159 – 3754 bp and contain the homologs of two up to 19 exons of the respective parental gene. The ABC transporter pseudogenes we identified could be assigned to all seven subfamilies, ABC A- ABC G (Table 1). We found that, within the genome, they are routinely present on chromosomes which also harbor protein coding ABC genes (Chrs. 1, 2, 3, 4, 7, 10, 14, 16, 21) with the exception of chrs. 15 and 22, respectively, from which protein coding ABC transporter genes are absent (Figure 1). Of note, half of the 22 ABC transporter pseudogenes are localized on chromosomes 15 (n = 4) and 16 (n = 7), respectively. Intriguingly, the seven pseudogenes on the short arm of chr. 16 (16p11.2-p13.3) are clustered within a segment of 30 Mb in the order tel-ABCA17P – ABCC6P2 – ABCC6P1 – ABCA14P – ABCA15P1 – ABCA15P2 – ABCD1P3-cer.
A total of 10 ABC transporter pseudogenes (45%) were intronless, directly flanked by repetitive sequences (as identified with the RepeatMasker track, UCSC Genome Browser), and most of them (n = 8) also exhibited poly(A)-tails. These structural features are diagnostic of processed pseudogenes  strongly suggesting that they originated from retrotransposed transcripts of ancestral functional ABC transporter genes (Table 1).
A significant portion of human ABC transporter pseudogenes are transcriptionally active
To assess which of the identified human ABC transporter pseudogenes are transcribed, we analyzed all 22 pseudogenic ABC transporter loci for homologies with EST or mRNA sequences using the UCSC Genome Browser. Using this approach 101 EST and/or mRNA sequences were identified that exhibited best hits (> 99% overall identity) with 10 of the 22 human ABC transporter pseudogenes. These sequences displayed higher sequence identity to their corresponding pseudogene locus than to any other location in the human genome. This indicates that a significant portion (45%) of the human ABC transporter pseudogenes is transcriptionally active (Table 1). Among these, n = 5 (50%) are processed pseudogenes. Consistent with our in-silico results, RT-PCR expression analysis of all ABC transporter pseudogene transcripts that span at least two exons demonstrated that the pseudogenes ABCA10P, ABCA15P2, ABCA17P, ABCC2P, ABCC6P1, ABCC6P2 and ABCG2P2 are in fact transcribed (Figure 2). Interestingly, we identified alternatively spliced transcripts in three out of the 10 transcribed pseudogenes (ABCA10P, ABCA15P2, ABCA17P).
Gene structures and expression profiling of ABCC6 and its pseudogenes ABCC6P1 and ABCC6P2 in various human tissues
Two pseudogenes, ABCC6P1 and ABCC6P2, have been reported for the pseudoxanthoma elasticum gene ABCC6, a member of the ABC C-subfamily . They are located 2.3 Mb centromeric (ABCC6P1) and 1.3 Mb (ABCC6P2) telomeric, respectively, of the protein coding ABCC6 gene (Figure 3). When we performed a detailed bioinformatic analysis of both pseudogenes, we retrieved a poly(A)-containing 2.7 kb mRNA from the GenBank NCBI database which had been isolated from human liver cells during the course of the Mammalian Gene Collection Program . The transcript [GenBank: BC075833] exhibits perfect homology with the 9 putative exons of ABCC6P1 indicating that this pseudogene is transcriptionally active. Comparison of the genomic sequence and the 2.7 kb transcript revealed that the ABCC6P1 gene consists of 10 exons (Figure 3). The ABCC6P1 intron sizes range from 0.1 to 5.2 kb. Exons 1–9 have perfect or near perfect identity (98.3%–100%) to the corresponding exons of the parental gene ABCC6. In contrast, the final exon (exon 10) of ABCC6P1 shows no homology to ABCC6. Due to the observed sequence disparity in exon 10, the transcripts of ABCC6P1 and ABCC6 can be reliably distinguished by RT-PCR. Moreover, it allows selective targeting of the pseudogene transcript using RNA interference strategies.
We also found in silico evidence for transcription of the second ABCC6 pseudogene, ABCC6P2. Consistent with this, our RT-PCR experiments using pooled mRNA from 20 human tissues confirmed expression of this pseudogene (Figure 2). ABCC6P2 is composed of four exons whose lengths vary between 59–183 bp which exhibit highest sequence similarity (98–100%) with the corresponding exons of its parental gene. Intron sizes of ABCC6P2 range from 0.1 to 1.7 kb. Interestingly, Genbank mRNA database searches revealed a 0.7 kb poly(A)-tail transcript which displays perfect identity with ABCC6P2 exons 1 and 2 plus additional 504 bp of the second intron (Figure 3). This transcript has been independently identified in the Mammalian Gene Collection Program  from the kidney [GenBank: BC015978].
We next tested whether and to which degree regulatory interdependencies exist between ABCC6 and ABCC6P1. For this, we performed expression profiling of both genes in various human tissues by real-time PCR. Expression levels were normalized to the expression of the housekeeping gene β-actin. We observed a similar gene expression pattern for ABCC6 and ABCC6P1 with highest expression levels in liver and kidney (Figure 4A, B). In most tissues ABCC6 mRNA levels were moderately higher (on average 2–4 fold) than those of ABCC6P1, however, ABCC6P1 was the predominant transcript in bone marrow and fetal brain (Figure 4C). Similar results were obtained when the expression levels were normalized to cyclophilin B, another housekeeping gene (data not shown).
The striking co-expression of the ABCC6 and ABCC6P1 raises the possibility that transcription of both genes is controlled by similar regulatory elements. When we compared the 5' regulatory region sequences of both genes, we found that they share an overall sequence similarity of 98.9% within the first 2.5 kb upstream of the start codon, the region that contains the putative core promoters (Additional File 2). Of note, 68 out of the 72 potential transcription factor binding sites, which we identified within the predicted ABCC6P1 promoter region, are fully intact. In particular, an NF-κB consensus binding sequence (pos. -233 bp), known to confer high-level ABCC6 expression in HepG2 hepatoma cells , is completely conserved in the putative ABCC6P1 promoter region. Three binding sites, a glucocorticoid response element (GR, pos. -2116 bp), an octamer binding factor (oct-1, pos. -1746 bp) and a GATA-3 consensus site (pos. -1269 bp), respectively, show single base exchanges. In addition, a USF binding site (pos. -622 bp) exhibits a nonamer-deletion which disrupts the respective consensus sequence. Intriguingly, single base exchanges give rise to five novel transcription factor binding sites (SRY -2381 bp, C/EBPb -1223 bp, GATA-2 -1124 bp, AML-2 -464 bp, C/EBP -316 bp) in the predicted ABCC6P1 promoter.
Comparison of the ABCC6 promoter sequence with the ABCC6P2 5' UTR revealed an even higher degree of homology (99.6%) than that observed for ABCC6P1. A total of 71 out of 72 transcription factor binding sites were fully conserved strongly suggesting that the ABCC6P2 expression pattern is highly similar to that of ABCC6 and ABCC6P1, respectively. We were unable, however, to quantitate ABCC6P2 expression levels by real-time PCR because the small size of the transcript and its exceedingly high sequence homology with ABCC6 and ABCC6P1 hampered specific detection of ABCC6P2 mRNA.
Targeted knockdown of the pseudogene ABCC6P1 results in downregulation of ABCC6 mRNA expression
The ABCC6/ABCC6P1 gene-pseudogene pair is unique in that (i) both components exhibit similar expression patterns, (ii) share the extensive similarity of their (putative) promoter regions and (iii) are positioned in close spatial proximity on chr. 16p12. The combination of these features raises the question as to whether regulatory interdependencies exists between ABCC6P1 and its paralog ABCC6. To address this question we tested whether silencing of ABCC6P1 has an effect on ABCC6 expression. For this, a siRNA-mediated gene knockdown approach targeting the non-homologous exon 10 of ABCC6P1 was used which allows selective silencing of ABCC6P1 but not ABCC6. Among several exon10 specific siRNAs, we selected the one by which the most efficient gene silencing could be obtained in HepG2 liver cells. Knockdown effects were normalized to the housekeeping genes β-actin and cyclophilin B, respectively. Using this approach, an average 69% (SD 4%, 95%-CI: 65–73%) knockdown of the ABCC6P1 pseudogene was achieved in six independent experiments (Figure 5, lane 3). Importantly, knockdown of ABCC6P1 led to a significant reduction of ABCC6 mRNA expression. The residual expression of ABCC6 was 84% (SD 7%, p = 0.003, 95%-CI: 76–91%), when normalized to β-actin, and 82% (SD 11%, p = 0.01, 95%-CI: 72–93%) relative to cyclophilin B, respectively (Figure 5, lane 4). No significant changes in ABCC6 or ABCC6P1 expression levels were observed in HepG2 cells when unspecific siRNA were used or lipid transfection agent alone were used (negative controls, data not shown). In contrast, GAPDH-specific siRNA (positive control) showed an 89% (95%-CI: 87–97%) knockdown relative to β-actin and cyclophilin B (Figure 5, lane 2). Similarity searches using the ABCC6P1 siRNA target sequence (21 bp) as a bait consistently showed mismatches > 5 bp throughout the entire promoter region and all exons and introns of ABCC6. Therefore, the siRNA mediated suppression of ABCC6 cannot be attributed to "co-targeting" of a common sequence motif in both genes. Our finding that silencing of ABCC6P1 has a suppressive effect on ABCC6 thus suggest that the expression of ABCC6P1 and its parental gene are interdependent.
In our study, we report the existence of 22 human ABC transporter pseudogenes. The total number of pseudogenes in the human genome has been estimated to ~20000 [13–15], whereas the predicted number of protein coding genes amounts to 20000 – 25000 . This estimation indicates a genome-wide gene/pseudogene ratio of app. 1:1. Given the existence of 48 functional human ABC transporters, the ABC transporter genes/pseudogene ratio is app. 2:1. Thus, the number of ABC transporter pseudogenes in the human genome is significantly lower than the expected average frequency of pseudogenes within currently identified gene-pseudogene families [17–19].
Our results indicate that the pseudogenes of the ABC transporter family are distributed unevenly across the chromosomes. Approximately one-half of the ABC transporter pseudogenes (n = 11) is dispersed across several chromosomes, whereas the other half is clustered on chr. 15 and 16, respectively. Of these, seven pseudogenes are located on the short arm of chr. 16 within a segment of 30 Mb (16p11.2-p13.3) and three processed (intronless) pseudogenes are clustered on the long arm of chr. 15. The ABC transporter pseudogene cluster on chr. 16p consists of three protein-coding ABC transporter genes, six duplicated pseudogenes and one processed pseudogene. Clustering of pseudogenes has been observed for other gene families in various vertebrates including the group of cytochrome P450 enzymes, α-globin, and the variable gene segments of the immunoglobulin heavy and light genes, respectively [17, 20, 21]. These gene clusters are usually composed of both protein coding genes and pseudogenes from the same gene family, which likely reflects the fact that pseudogenization is a mechanism that is highly associated with gene duplication events. The close spatial proximity of genes and pseudogenes suggests that the gene duplications are more recent evolutionary events because dispersion across the genome has not yet occurred. In light of this, it is reasonable to assume that the ABC transporter pseudogenes located on chr. 15 and 16 are indeed the result of gene duplication and subsequent pseudogenization of the previously functional ABC transporter genes. Of note, the rodent genome contains protein-coding genes for Abca14, Abca15  and Abca17 . It is thus conceivable that the human orthologs of these intact genes were functional for a certain time period after duplication before they ultimately became disabled. As for ABCC6P1 and ABCC6P2, the other two duplicated pseudogenes on chr 16, it remains unclear whether they arose by complete duplication of the parental ABCC6 gene and thus may have temporarily maintained their function or whether only fragments of this gene were duplicated ab ovo.
Unlike in the case of non-processed pseudogenes, the chromosomal distribution of processed pseudogenes generally appears to be random with the number of pseudogenes per chromosome being proportional to chromosomal length . This is consistent with the view that processed pseudogenes originate from nondirected retrotransposition of preexisting transcripts. Interestingly, the pseudogene cluster on chr. 15 contains three processed pseudogenes (ABCB10P1-3) which are localized in close proximity to each other (their parental gene ABCB10 is located on chr. 1). Because it is unlikely that the ABCB10P cluster is the result of three independent retrotransposition events targeting one and the same chromosomal segment, two of the three ABCB10 pseudogenes most likely arose by gene duplication. The homology analyses, which we conducted, indeed suggest that ABCB10P1 may have originated from ABCB10 by a retrotransposition process. Intriguingly, they also strongly support the view that ABCB10P2 arose by duplication of ABCB10P1 and ABCB10P3 is the result of the duplication of ABCB10P2 (Additional File 3). Because ABCB10P2 and ABCB10P3 most likely arose by duplication of an already disabled processed pseudogene, it is reasonable to assume that these pseudogenes never had the capacity to encode proteins. To our knowledge, the ABCB10P cluster thus represents the first example of the multiplication of a processed pseudogene in the human genome.
Importantly, we obtained bioinformatic evidence indicating that app. one-half (45%) of the human ABC transporter pseudogenes are transcriptionally active. Consistent with this, our RT-PCR based survey demonstrated that at least 7 out of the predicted transcriptionally active pseudogenes were in fact transcribed. Little attention has so far been paid to the phenomenon of pseudogene transcription  owing to the fact that pseudogenes are generally thought to be functionless genomic relics. However, this view is challenged by some recent studies. For example, a comprehensive study focussing on transcription of processed pseudogenes in the human genome revealed that a subfraction of app. 5% of all processed pseudogenes identified are transcribed . Moreover, analysis of pseudogene transcription on human chr. 22 using a combined bioinformatic/microarray approach demonstrated that up to 20% of the pseudogenes identified on this chromosome are transcriptionally active . The surprisingly high proportion of potentially transcribed pseudogenes questions the traditional view of pseudogenes and raises the intriguing possibility that transcribed pseudogenes may serve defined biological functions in the genome. The concept is supported by two recent studies which provide experimental evidence for pseudogene function. Korneev et al. reported direct interference of a gene and its transcribed pseudogene by demonstrating that the antisense pseudogene anti-NOS, whose transcript forms stable RNA-RNA complexes with the mRNA of nNOS, inhibits protein expression of its parental gene . Another study highlights the fact that pseudogene expression can exert a biological function that is unrelated to its ancestral gene as evidenced for the human Xist noncoding RNA gene, which initiates X chromosome inactivation and has evolved in eutherians by pseudogenization of the protein coding gene Lnx3 .
To test whether and to which degree regulatory interdependencies exist between transcribed ABC transporter pseudogenes and their protein coding counterparts, we selected the pseudoxanthoma elasticum gene, ABCC6, and its transcribed pseudogene ABCC6P1. This gene-pseudogene pair appeared to be an ideal candidate for interference studies because ABCC6 and ABCC6P1 exhibit near perfect sequence homology in their transcripts but, at the same time, can still be discriminated by RT-PCR due to non-homology disparity in the final exon (exon 10) of ABCC6P1. Moreover, this sequence disparity allowed the selective targeting of the pseudogene transcript using an siRNA approach. mRNA expression profiling of ABCC6 and ABCC6P1 revealed that both genes are consistently co-expressed in a wide range of human tissues with predominant expression in adult liver, kidney and fetal liver (in this order). The tissue expression pattern we observed for ABCC6 is in keeping with a recent genome-wide gene expression study involving 79 human tissues . Regrettably, this work does not include expression data on the pseudogene ABCC6P1. Our finding that ABCC6 and ABCC6P1 share near-perfect identity in their respective core promoter regions together with the observed co-expression pattern strongly suggests that the expression of the gene-pseudogene pair is governed by the same transcription factor machinery.
Intriguingly, we found that selective knockdown of ABCC6P1 leads to a reduction of ABCC6 mRNA expression suggesting that the expression of ABCC6P1 and its parental gene are interdependent. At this point, the regulatory mechanisms which underlie the observed gene-pseudogene interference remain speculative. It is well established that mRNA stability and degradation significantly depends on specific cis-acting elements in mRNA molecules. These cis-acting destabilizing elements, which can be found both in the UTR and the coding regions of mRNA are attacked by trans-acting machineries facilitating mRNA decay . Transcripts from distinct genes which exhibit extensive sequence homology (as in the case of gene-pseudogene pairs) and thus contain identical destabilizing cis-elements may compete for such trans-acting elements. It is thus conceivable that the ABCC6 and the ABCC6P1 transcripts due to their high sequence similarity and their topological proximity compete for the same cellular RNA degradation route. As a consequence, lower abundance in ABCC6P1 transcripts may thus lead to an increased ABCC6 mRNA degradation rate. It will be challenging to characterize in more detail the molecular interdependency between the ABCC6 and ABCC6P1 transcription units. Knowledge of ABCC6/ABCC6P1 regulatory interaction may be of potential relevance to clinical medicine because ABCC6 is the underlying gene defect in the syndrome pseudoxanthoma elasticum.
In this study, we established the existence of 22 ABC transporter pseudogenes in the human genome. Combined bioinformatic and experimental results indicate that a significant portion of these pseudogenes (45%) are transcribed and we provide evidence, for the first time, for expression of alternatively spliced pseudogene RNAs in the human genome. Moreover, we show that ABCC6P1, one of the pseudogenes of the pseudoxanthoma elasticum gene ABCC6, is co-expressed with ABCC6 in a variety of tissues and that knockdown of ABCC6P1 expression impacts the expression of ABCC6 strongly suggesting a regulatory interdependence between ABCC6P1 and its ancestral gene. The emerging concept of pseudogene transcription and functionality has recently found its reflection in the redefinition of pseudogenes as "genomic sequences that arise from functional genes but that cannot encode the same type of functional product [...] as the original gene" . Accordingly, a subclassification into (i) 'ghost' pseudogenes with some intermediate functionality, i.e. similar to a functional gene and transcribed and/or functional, and (ii) 'dead' pseudogenes that lack any functionality has recently been proposed. Our results presented here strongly support this novel differentiated view on the biology of pseudogenes.
Bioinformatic identification of human ABC transporter pseudogenes
To identify all ABC transporter pseudogenes in the human genome, we retrieved the complete sequence of the human genome  chromosome-wise from the UCSC Genome Browser (assembly March 2006, NCBI Build 36, hg18) [32, 33].
A stand-alone version of the program MaskSeq was used to mask off the exons of all human reference ABC transporter genes. For this, a list of the reference nucleotide sequence accession numbers of all human ABC transporters was used to retrieve the exon coordinates of the human protein coding ABC transporter genes from the Table Browser application of the UCSC Genome Browser . A Perl script was designed to extract and reformat the exon coordinates from the Table browser output for application in the program MaskSeq.
In addition to all human reference ABC transporter sequences, we downloaded the reference nucleotide sequences of all known mouse ABC transporters that have no apparent orthologous counterparts in the human genome (Abca14, Abca15, Abca16, Abca17, Abcg3). Using a local installation of the BLAST application (version 2.2.14), a homology search of the reference ABC transporter nucleotide sequences was performed against the human genome masked for ABC transporter exons. In this alignment, an E-value of 0.001 and the table output format "-m8" was applied. To eliminate hits originating from repetitive sequences, the repetitive sequences in the query ABC transporter nucleotide sequences were identified by applying the program RepeatMasker (version 3.1.6) using the database Repbase (version 11.12). An in-house written Perl script was used to exclude all hits that were located completely within repetitive sequences and those that extended less than 30 bp into the flanking regions of repetitive sequences. The remaining hits were then ordered along the chromosomes and merged to a single candidate pseudogene when neighbouring hits yielded highest homology to the same protein coding gene and when they were in the same 5'→3' orientation as their homologous ABC transporter nucleotide sequence. In the absence of a neighbouring hit, a genomic region of 0.1 Mb flanking the single hit was aligned with the respective ABC transporter nucleotide sequence using the more rigorous alignment tool "matcher" from the EMBOSS package. This approach allows for identification of more distantly related pseudogenes. Hits displaying > 70% identity over an entire exon of the respective ABC transporter nucleotide sequence, were also considered homologous and further evaluated. Merged hits were classified as candidate pseudogenes when (i) the retrieved sequences showed homologies to at least two exons of a protein coding gene and (ii) the merged hit contained sequence aberrations precluding its translation into a functional ABC transporter. Subsequently, all potential pseudogenes were analyzed using the UCSC Genome Browser  to exclude the possibility that candidate pseudogenes were part of a known protein coding gene. Additionally, the human mRNA and EST track of the UCSC Genome Browser were used to search for transcription of the identified ABC transporter pseudogenes. A locus was regarded a candidate transcriptionally active pseudogene when a homologous mRNA or EST sequence exhibited overall > 99% with the pseudogene but no other sites in the human genome ("best hit").
Sequence similarities and multiple sequence alignments were calculated using EMBOSS package algorithms [35, 36]. Potential transcription factor binding sites were identified utilizing tfsearch (version 1.3, threshold score 85.0, classification: vertebrate)  and the TRANSFAC database (release 3.3) .
Confirmation of pseudogene transcription
Aliquots of 1 μg pooled total RNA (Human Total RNA Master Panel II, Clontech) were reverse-transcribed in the presence of oligo-dT primers in a 20 μl RT reaction using the Omniscript RT Kit (Qiagen). cDNA was amplified using the HotStarTaq DNA polymerase Kit (Qiagen). The cycling conditions were 94°C 30s, 54–60°C 30s and 72°C 30s-3 min followed by a final extension step at 72°C for 10 min. Transcription of pseudogenes was authenticated utilizing the following primer sets: ABCA10P 5'-GATTTTGCCAGAGCCTCAGC-3', 5'-GAGTGCATTTGCTGCTTGGA-3', ABCA15P2 5'-AACTGGCATGGACCCAGTAG-3', 5'-CTTCCTCCAAAATGCCAAAC-3', ABCA17P 5'-TCACGCACTGTCTTTCCTTG-3', 5'-CCAGTACAATGGAACTGATGATG-3', ABCC2P 5'-GCCTGGATTCAGAATTGCAT-3', 5'-GCCCCACAAGAAAGCATAAA-3', ABCC6P1 5'-CCCACGACGACAGAAGG-3', 5'-TTCGGAAACCTTGGCTCA-3', ABCC6P2 5'-CCCACGACGACAGAAGG-3', 5'-TCTGTTTGGGAGAACCGTGT-3' and ABCG2P2 5'-GGTACTCCGCTGACACCTTC-3', 5'-TTTCTCAACTGGTTTTTGACCA-3'. PCR products were separated and analyzed on GelStar® (Cambrex) stained 0.8 – 1.5% agarose gels. Bands of interest were excised and the DNA was extracted using the QIAquick® Gel Extraction Kit (Qiagen). PCR runs resulting in a single band were purified using Microcon® YM-30 columns (Millipore) for subsequent sequence analysis. Purified amplification products were sequenced as published previously . Sequence data were analyzed using the program FinchTV v.1.3.1 (Geospiza Inc.) and the UCSC Genome Browser.
HepG2 cells were purchased from ATCC (no. HB-8065) and cultivated under standard conditions (5% CO2, 37°C) in ATCC complete growth medium (Eagle's medium) with 2 mM L-glutamine and Earle's BSS (1.5 g/L sodium bicarbonate, 0.1 mM non-essential amino acids, 1.0 mM sodium pyruvate, 90%; fetal bovine serum, 10%) containing 2% penicillin/streptomycin.
siRNA mediated knockdown of ABCC6P1 in HepG2 cells
21 mer-siRNAs were designed using the siRNA design tool from the Whitehead Institute for Biomedical Research , which target the sequence of exon 10 of the pseudogene ABCC6P1. This exon displays no homology (sequence identity < 20%) with its parental gene ABCC6 and thus allows selective targeting of ABCC6P1. siRNAs including positive (Silencer GAPDH siRNA) and negative controls (Silencer Negative Control #1 siRNA) were purchased from Ambion, Inc. After screening of several siRNAs, the following siRNAs which yielded the highest knockdown rates of ABCC6P1 were used in all further experiments: Sense 5'-CUCCCAGGGAUAUAAAUCAUU-3', and antisense 5'- UGAUUUAUAUCCCUGGGAGUU-3', respectively. HepG2 cells were reverse-transfected using 1.5 μl siRNA (20 μM) complexed with 3 μl siPORT™ NeoFX™ transfection agent (Ambion) in 100 μl OPTI-MEM buffer (Invitrogen). After 10 min incubation, the siRNA/lipid-complex was mixed with 900 μl of 105cells/ml HepG2 cells in culture medium and plated in 12 well plates. After 24 h incubation under standard conditions the cells were lysed, RNA was extracted (MagNA Pure LC RNA Isolation Kit – High Performance, Roche) and quantitated on a Nano Drop® spectrophotometer. All siRNA experiments were performed in duplicate, and the results of six independent siRNA experiments with ABCC6P1 knockdown rates > 60% were used for statistical analyses.
For gene expression profiling of ABCC6 and ABCC6P1 in different human tissues, 1 μg total RNA from each tissue type of Clontech's Human Total RNA Master Panel II, and for quantitation of transcription levels in the siRNA experiments, 150–250 ng total RNA from HepG2 cells were reverse transcribed using Omniscript (Qiagen) and oligo-dT primers. For quantitation of the transcription levels of selected genes real-time PCR was performed on a 7900HT Fast Real-Time PCR System (Applied Biosystems) using standard conditions. The following TaqMan-primer/probes (Applied Biosystems) were used: ABCC6P1 (designed by Applied Biosystems): forward primer 5'-CCATCACTGGCCTGGTGTA-3', reverse primer 5'-CTCCTGCTGAAATCATCTGAGATTAGT-3', FAM-reporter probe 5'-CTGTGGATGCCTTTCTG-3' (targeting exon boundary 9/10 of ABCC6P1), ABCC6 (Assay-ID: Hs01081199_m1, targeting exon boundary 25/26 of ABCC6), cyclophilin B (PPIB) (Assay-ID: Hs00168719_m1), β-actin (ACTB) (Assay-ID: Hs99999903_m1), GAPDH (Assay-ID: Hs99999905_m1). All RT-PCR experiments were performed at least in duplicate.
Statistic analyses and calculation of gene/pseudogene expression levels
Relative transcript levels were calculated using the ΔΔ-ct method for the two housekeeping genes β-actin and cyclophilin B. Means, p-values and confidence intervals were calculated using the software package SPSS.
Dean M: The genetics of ATP-binding cassette transporters. Methods Enzymol. 2005, 400: 409-429. 10.1016/S0076-6879(05)00024-8.
Kaminski WE, Piehler A, Wenzel JJ: ABC A-subfamily transporters: structure, function and disease. Biochim Biophys Acta. 2006, 1762: 510-524.
Wenzel JJ, Piehler A, Kaminski WE: ABC A-subclass proteins: gatekeepers of cellular phospho- and sphingolipid transport. Front Biosci. 2007, 12: 3177-3193. 10.2741/2305.
Dean M, Annilo T: Evolution of the ATP-binding cassette (ABC) transporter superfamily in vertebrates. Annu Rev Genomics Hum Genet. 2005, 6: 123-142. 10.1146/annurev.genom.6.080604.162122.
Mighell AJ, Smith NR, Robinson PA, Markham AF: Vertebrate pseudogenes. FEBS Lett. 2000, 468: 109-114. 10.1016/S0014-5793(00)01199-6.
Balakirev ES, Ayala FJ: Pseudogenes: are they "junk" or functional DNA?. Annu Rev Genet. 2003, 37: 123-151. 10.1146/annurev.genet.37.040103.103949.
Pulkkinen L, Nakano A, Ringpfeil F, Uitto J: Identification of ABCC6 pseudogenes on human chromosome 16p: implications for mutation detection in pseudoxanthoma elasticum. Hum Genet. 2001, 109: 356-365. 10.1007/s004390100582.
Yabuuchi H, Takayanagi S, Yoshinaga K, Taniguchi N, Aburatani H, Ishikawa T: ABCC13, an unusual truncated ABC transporter, is highly expressed in fetal human liver. Biochem Biophys Res Commun. 2002, 299: 410-417. 10.1016/S0006-291X(02)02658-X.
Piehler AP, Wenzel JJ, Olstad OK, Haug KB, Kierulf P, Kaminski WE: The human ortholog of the rodent testis-specific ABC transporter Abca17 is a ubiquitously expressed pseudogene (ABCA17P) and shares a common 5' end with ABCA3. BMC Mol Biol. 2006, 7: 28-10.1186/1471-2199-7-28.
Vanin EF: Processed pseudogenes: characteristics and evolution. Annu Rev Genet. 1985, 19: 253-272. 10.1146/annurev.ge.19.120185.001345.
Strausberg RL, Feingold EA, Grouse LH, Derge JG, Klausner RD, Collins FS, Wagner L, Shenmen CM, Schuler GD, Altschul SF, Zeeberg B, Buetow KH, Schaefer CF, Bhat NK, Hopkins RF, Jordan H, Moore T, Max SI, Wang J, Hsieh F, Diatchenko L, Marusina K, Farmer AA, Rubin GM, Hong L, Stapleton M, Soares MB, Bonaldo MF, Casavant TL, Scheetz TE, Brownstein MJ, Usdin TB, Toshiyuki S, Carninci P, Prange C, Raha SS, Loquellano NA, Peters GJ, Abramson RD, Mullahy SJ, Bosak SA, McEwan PJ, McKernan KJ, Malek JA, Gunaratne PH, Richards S, Worley KC, Hale S, Garcia AM, Gay LJ, Hulyk SW, Villalon DK, Muzny DM, Sodergren EJ, Lu X, Gibbs RA, Fahey J, Helton E, Ketteman M, Madan A, Rodrigues S, Sanchez A, Whiting M, Madan A, Young AC, Shevchenko Y, Bouffard GG, Blakesley RW, Touchman JW, Green ED, Dickson MC, Rodriguez AC, Grimwood J, Schmutz J, Myers RM, Butterfield YS, Krzywinski MI, Skalska U, Smailus DE, Schnerch A, Schein JE, Jones SJ, Marra MA: Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences. Proc Natl Acad Sci U S A. 2002, 99: 16899-16903. 10.1073/pnas.242603899.
Jiang Q, Matsuzaki Y, Li K, Uitto J: Transcriptional regulation and characterization of the promoter region of the human ABCC6 gene. J Invest Dermatol. 2006, 126: 325-335. 10.1038/sj.jid.5700065.
Bischof JM, Chiang AP, Scheetz TE, Stone EM, Casavant TL, Sheffield VC, Braun TA: Genome-wide identification of pseudogenes capable of disease-causing gene conversion. Hum Mutat. 2006, 27: 545-552. 10.1002/humu.20335.
Torrents D, Suyama M, Zdobnov E, Bork P: A genome-wide survey of human pseudogenes. Genome Res. 2003, 13: 2559-2567. 10.1101/gr.1455503.
Zhang Z, Harrison PM, Liu Y, Gerstein M: Millions of years of evolution preserved: a comprehensive catalog of the processed pseudogenes in the human genome. Genome Res. 2003, 13: 2541-2558. 10.1101/gr.1429003.
Consortium IHGS: Finishing the euchromatic sequence of the human genome. Nature. 2004, 431: 931-945. 10.1038/nature03001.
Nelson DR, Zeldin DC, Hoffman SM, Maltais LJ, Wain HM, Nebert DW: Comparison of cytochrome P450 (CYP) genes from the mouse and human genomes, including nomenclature recommendations for genes, pseudogenes and alternative-splice variants. Pharmacogenetics. 2004, 14: 1-18. 10.1097/00008571-200401000-00001.
Zhang Z, Harrison P, Gerstein M: Identification and analysis of over 2000 ribosomal protein pseudogenes in the human genome. Genome Res. 2002, 12: 1466-1482. 10.1101/gr.331902.
Zhang Z, Gerstein M: Identification and characterization of over 100 mitochondrial ribosomal protein pseudogenes in the human genome. Genomics. 2003, 81: 468-480. 10.1016/S0888-7543(03)00004-1.
McCormack WT, Hurley EA, Thompson CB: Germ line maintenance of the pseudogene donor pool for somatic immunoglobulin gene conversion in chickens. Mol Cell Biol. 1993, 13: 821-830.
Tufarelli C, Hardison R, Miller W, Hughes J, Clark K, Ventress N, Frischauf AM, Higgs DR: Comparative analysis of the alpha-like globin clusters in mouse, rat, and human chromosomes indicates a mechanism underlying breaks in conserved synteny. Genome Res. 2004, 14: 623-630. 10.1101/gr.2143604.
Chen ZQ, Annilo T, Shulenin S, Dean M: Three ATP-binding cassette transporter genes, Abca14, Abca15, and Abca16, form a cluster on mouse Chromosome 7F3. Mamm Genome. 2004, 15: 335-343. 10.1007/s00335-004-2281-8.
Ban N, Sasaki M, Sakai H, Ueda K, Inagaki N: Cloning of ABCA17, a novel rodent sperm-specific ABC (ATP-binding cassette) transporter that regulates intracellular lipid metabolism. Biochem J. 2005, 389: 577-585. 10.1042/BJ20050159.
Harrison PM, Zheng D, Zhang Z, Carriero N, Gerstein M: Transcribed processed pseudogenes in the human genome: an intermediate form of expressed retrosequence lacking protein-coding ability. Nucleic Acids Res. 2005, 33: 2374-2383. 10.1093/nar/gki531.
Zheng D, Zhang Z, Harrison PM, Karro J, Carriero N, Gerstein M: Integrated pseudogene annotation for human chromosome 22: evidence for transcription. J Mol Biol. 2005, 349: 27-45. 10.1016/j.jmb.2005.02.072.
Korneev SA, Park JH, O'Shea M: Neuronal expression of neural nitric oxide synthase (nNOS) protein is suppressed by an antisense RNA transcribed from an NOS pseudogene. J Neurosci. 1999, 19: 7711-7720.
Duret L, Chureau C, Samain S, Weissenbach J, Avner P: The Xist RNA gene evolved in eutherians by pseudogenization of a protein-coding gene. Science. 2006, 312: 1653-1655. 10.1126/science.1126316.
Su AI, Wiltshire T, Batalov S, Lapp H, Ching KA, Block D, Zhang J, Soden R, Hayakawa M, Kreiman G, Cooke MP, Walker JR, Hogenesch JB: A gene atlas of the mouse and human protein-encoding transcriptomes. Proc Natl Acad Sci U S A. 2004, 101: 6062-6067. 10.1073/pnas.0400782101.
Ross J: mRNA stability in mammalian cells. Microbiol Rev. 1995, 59: 423-450.
Zheng D, Gerstein MB: The ambiguous boundary between genes and pseudogenes: the dead rise up, or do they?. Trends Genet. 2007, 23: 219-224. 10.1016/j.tig.2007.03.003.
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann N, Stojanovic N, Subramanian A, Wyman D, Rogers J, Sulston J, Ainscough R, Beck S, Bentley D, Burton J, Clee C, Carter N, Coulson A, Deadman R, Deloukas P, Dunham A, Dunham I, Durbin R, French L, Grafham D, Gregory S, Hubbard T, Humphray S, Hunt A, Jones M, Lloyd C, McMurray A, Matthews L, Mercer S, Milne S, Mullikin JC, Mungall A, Plumb R, Ross M, Shownkeen R, Sims S, Waterston RH, Wilson RK, Hillier LW, McPherson JD, Marra MA, Mardis ER, Fulton LA, Chinwalla AT, Pepin KH, Gish WR, Chissoe SL, Wendl MC, Delehaunty KD, Miner TL, Delehaunty A, Kramer JB, Cook LL, Fulton RS, Johnson DL, Minx PJ, Clifton SW, Hawkins T, Branscomb E, Predki P, Richardson P, Wenning S, Slezak T, Doggett N, Cheng JF, Olsen A, Lucas S, Elkin C, Uberbacher E, Frazier M, Gibbs RA, Muzny DM, Scherer SE, Bouck JB, Sodergren EJ, Worley KC, Rives CM, Gorrell JH, Metzker ML, Naylor SL, Kucherlapati RS, Nelson DL, Weinstock GM, Sakaki Y, Fujiyama A, Hattori M, Yada T, Toyoda A, Itoh T, Kawagoe C, Watanabe H, Totoki Y, Taylor T, Weissenbach J, Heilig R, Saurin W, Artiguenave F, Brottier P, Bruls T, Pelletier E, Robert C, Wincker P, Smith DR, Doucette-Stamm L, Rubenfield M, Weinstock K, Lee HM, Dubois J, Rosenthal A, Platzer M, Nyakatura G, Taudien S, Rump A, Yang H, Yu J, Wang J, Huang G, Gu J, Hood L, Rowen L, Madan A, Qin S, Davis RW, Federspiel NA, Abola AP, Proctor MJ, Myers RM, Schmutz J, Dickson M, Grimwood J, Cox DR, Olson MV, Kaul R, Raymond C, Shimizu N, Kawasaki K, Minoshima S, Evans GA, Athanasiou M, Schultz R, Roe BA, Chen F, Pan H, Ramser J, Lehrach H, Reinhardt R, McCombie WR, de la BM, Dedhia N, Blocker H, Hornischer K, Nordsiek G, Agarwala R, Aravind L, Bailey JA, Bateman A, Batzoglou S, Birney E, Bork P, Brown DG, Burge CB, Cerutti L, Chen HC, Church D, Clamp M, Copley RR, Doerks T, Eddy SR, Eichler EE, Furey TS, Galagan J, Gilbert JG, Harmon C, Hayashizaki Y, Haussler D, Hermjakob H, Hokamp K, Jang W, Johnson LS, Jones TA, Kasif S, Kaspryzk A, Kennedy S, Kent WJ, Kitts P, Koonin EV, Korf I, Kulp D, Lancet D, Lowe TM, McLysaght A, Mikkelsen T, Moran JV, Mulder N, Pollara VJ, Ponting CP, Schuler G, Schultz J, Slater G, Smit AF, Stupka E, Szustakowski J, Thierry-Mieg D, Thierry-Mieg J, Wagner L, Wallis J, Wheeler R, Williams A, Wolf YI, Wolfe KH, Yang SP, Yeh RF, Collins F, Guyer MS, Peterson J, Felsenfeld A, Wetterstrand KA, Patrinos A, Morgan MJ, de JP, Catanese JJ, Osoegawa K, Shizuya H, Choi S, Chen YJ: Initial sequencing and analysis of the human genome. Nature. 2001, 409: 860-921. 10.1038/35057062.
Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, Haussler D: The human genome browser at UCSC. Genome Res. 2002, 12: 996-1006. 10.1101/gr.229102. Article published online before print in May 2002.
UCSC Genome Browser. http://genome ucsc edu/. 2006, [http://genome.ucsc.edu/]
Karolchik D, Hinrichs AS, Furey TS, Roskin KM, Sugnet CW, Haussler D, Kent WJ: The UCSC Table Browser data retrieval tool. Nucleic Acids Res. 2004, 32: D493-D496. 10.1093/nar/gkh103.
Rice P, Longden I, Bleasby A: EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000, 16: 276-277. 10.1016/S0168-9525(00)02024-2.
EMBOSS. 2006, [http://emboss.sourceforge.net/]
TFSEARCH. 2007, [http://www.cbrc.jp/research/db/TFSEARCH.html]
Heinemeyer T, Wingender E, Reuter I, Hermjakob H, Kel AE, Kel OV, Ignatieva EV, Ananko EA, Podkolodnaya OA, Kolpakov FA, Podkolodny NL, Kolchanov NA: Databases on transcriptional regulation: TRANSFAC, TRRD and COMPEL. Nucleic Acids Res. 1998, 26: 362-367. 10.1093/nar/26.1.362.
Piehler A, Kaminski WE, Wenzel JJ, Langmann T, Schmitz G: Molecular structure of a novel cholesterol-responsive A subclass ABC transporter, ABCA9. Biochem Biophys Res Commun. 2002, 295: 408-416. 10.1016/S0006-291X(02)00659-9.
siRNA at the Whitehead Institute for Biomedical Research. 2004, [http://jura.wi.mit.edu/bioc/siRNAext/]
Böhringer S, Gödde R, Böhringer D, Schulte T, Epplen JT: A software package for drawing ideograms automatically. Online Journal of Bioinformatics. 2002, 1: 51-59.
This work was supported by a grant from Helse ∅st RHF (200500428-243), Norway, and the Deutsche Forschungsgemeinschaft (KA 1078/4-1), Germany. We thank Runa Marie Grimholt for expert technical assistance.
APP, MH, PK, WEK designed the study. EJ, KBFH, PK, WEK supervised the project. APP, JJW performed bioinformatic analyses. APP, MH performed cell culture, siRNA and gene expression experiments. APP, MH, JJW, KBFH, WEK analyzed the data. APP, MH, EJ, WEK prepared the manuscript. All authors read and approved of the final manuscript.
Electronic supplementary material
Additional file 2: Comparison of the 5'UTR regions of ABCC6 , ABCC6P1 and ABCC6P2. The overall homology between the 5' upstream regions of ABCC6 and those of ABCC6P1 and ABCC6P2 is 98.8% and 99.5%, respectively. Transcription factor consensus binding sites are shown in green black. Importanty, all potential transcription factor binding sites within the ABCC6 promoter, with few exceptions (glucocorticoid response element GR -2116 bp, octamer binding factor oct-1 -1746 bp, GATA-3 consensus site -1269 bp and USF binding site -622 bp), are also present in ABCC6P1 and ABCC6P2. Non-homologous nucleotides are black white, the transcription start site of ABCC6 is indicated (+1). (PDF 58 KB)
About this article
Cite this article
Piehler, A.P., Hellum, M., Wenzel, J.J. et al. The human ABC transporter pseudogene family: Evidence for transcription and gene-pseudogene interference. BMC Genomics 9, 165 (2008) doi:10.1186/1471-2164-9-165
- HepG2 Cell
- UCSC Genome Browser
- Pseudoxanthoma Elasticum
- Potential Transcription Factor Binding Site
- ABCC6 Expression