Skip to main content

Table 2 Translated transcripts containing putative toxin sequences.

From: Characterization of the Conus bullatus genome and its venom-duct transcriptome

O-superfamily: C-C-CC-C-C

 

1.

MKLTCVAIVAVLLLTACQLITAEDSRGTQLHRALRKTTKLSVS TR C KGPGAK C LKTMYD CC KYS C SRGR C

 

2.

MKLTCVLIIAVLFLTAITADDSRDKQVYRAVGLIDKMRR IR ASEG C RKKGDR C GTHL CC PGLR C GSGRAGGA C RPPYN

 

3.

MKLMCVLIVSVLVLTACQLSTADDTRDKQKDRLVRLFRKKRDSSDSGLL PR T C VMFGSM C DKEEHSI CC YE C DYKKGI C V

 

4.

MKLTCVVIVAVLLLTACQLIIAEDSRGTQLHRALRKATKLSVS TR T C VMFGSM C DKEEHSI CC YE C DYKKGI C V

 

5.

MKLTCVLIVAVLFLTACQLATAENSREEQGYSAVRSSDQIQDSDLKL TK S C TDDFEP C EAGFEN CC SKS C FEFEDVYV C* GVSIDYYDSR

 

6.

MKLICVFIVAVLLLTACQLNAADDSRDTQKHRALRSTTKLSMS KK DS C VPDGDS C LFSRIP CC GT C SSRSKS C V*G

 

7.

MKLTCMMIVTVLFLTAWTFVTADDSTYGLKNLLPKARHEMMNPEAPKLNKK DE C SAPGAF C LIRPGL CC SEF C FFA C F[67]

 

8.

AEDSRGTQLHRALRKATKLSES TR C KRKGSS C RRTSYD CC TGS C RNGK C* G

 

9.

AVLLLTACQLITAEDSRDTQKHRALRSDTKLSMLT LR C ATYGKP C GIQND CC NI C DPARRT C T

 

10.

DSRGTQLHRALRKATILSVS AR C KLSGYR C KRPKQ CC NLS C GNYM C* G

 

11.

   ACQLITAEDSRGTQLHRALRSTSKVSK STS C VEAGSY C RPNVKL CC GF C SPYSKI C MNFPKN

 

12.

TAEDSRGTQLHRALRKATKLPVS TR C ITPGTR C KVPSQ CC RGP C KNGR C TPSPSEW

 

13.

AEDSRGTQLHRALRKTTKLSLS IR C KGPGAS C IRIAYN CC KYS C RNGK C S

 

14.

AACQLGTAASFARDKQDYPAVRSDGRQDSKDSTLDRIA KR C SEGGDF C SKNSE CC DKK C QDEGEGRGV C LIVPQNVILLH

M-superfamily: CC-C-C-CC

 

15.

MLKMGVLLFTFLVLFPLATLQLDADQPVERYADNKQDLNPD ER MIFLFGG CC RMSS C QPPPV C N CC AKQDLNPDER

 

16.

DQPADRPAERMQDDISSEQNPLLEKR VGER CC KNGKRG C GRW C RDHSR CC* GRR[17]

 

17.

GLY CC QPKPNGQMM C NRW C EINSR CC* GRR

A-superfamily: CC-C-C; CC-C-C-C-C

 

18.

MGMRMMFTVFLLIVLATTVVSFSTDDESDGSNEEPSADQTARSSMNR APG CC NNPA C VKHR C* G[68]

 

19.

MGMRMVFTVFLLVVLATTVVSFTSDRASDGRNAAANDKASDLAALA VR G CC HDIF C KHNNPDI C* G

 

20.

MGMRMRMMFTVFLLVVLANTVVSFPSDRDSDGADAEASDEPVEFER DENG CC WNPS C PRPR C T*GRR[68]

 

21.

DGANAEATDNKPGVFER DE KK CC WNRA C TRLVP C SK

 

22.

SDRASDGRNAAANDRASDLVALT VR G CC TYPP C AVLSPL C D

 

23.

MGMRMMVTVFLLGVLATTVVSLRSNRASDGRRGIVNK LNDLVPQYWTECC GRIGPH C SR C I C PEVV C PKN*G

 

24.

MGMRMMVTVFLLVVLATTVVSLRSNRASDGRRGIVNKLNDLVPK YWTECC GRIGPH C SR C I C PEVA C PKN*G

 

25.

MGMRMMVTVFPLVVLATTVVSLRSNRASDGRRGIVNKLNDLVPK YWTECC GRIGPH C SR C I C PGVV C PKR*G

 

26.

   LVVLATTVVSFRSNRASDGRKIAVNKRRRELVVPPG K LRE CC GRVGPM C PK C M C PPRR C

 

27.

ASDGRNAVVH ER APELVVTATTT CC GYDPMTI C PP C M C THS C PPKRKP*GRRND

J-superfamily

 

28.

   MTSVQSATCCCLLWLVLCVQLVTPDSPATAQLSRHLTAR VPVGPALAYA C SVM C AKGYDTVV C T C TRRRG*VVSSSI

Contryphan

 

29.

   MGKLTILVLVAAVLLSTQVMGQGDRDQPAARNAVPRDDNPGGASAKLMNLLHRSKCPWSPWC*G

Conkunitzin

 

30.

MEGRRFAAVLILPICMLAPGAVAS KR WTRPSV C NLPAESGTGTQSLKRFYYNSDKMQ C RTFIYKGNGGNDNNFPRTYD C QKK C LYRP*G

  1. Cysteine motifs are shown next to the superfamilies. The underlined residues indicate presumed propeptide cleavage site ascertained by analogy to previously isolated toxins; * indicate probable amidation at the C-terminal residue after cleavage of the following G residue. In the case of 23,24,25,26 where the propeptide cleavage site is uncertain, we have indicated the cleavage site at the basic residues (K) proximal to the presumed toxin sequence. The peptides Bu 7, 16, 18 and 20 have been previously characterized.