Detailed Description
The invention will be further described with reference to the following examples, which are given by way of illustration only, but the scope of the invention is not limited thereto.
EXAMPLE 1 construction of Yeast bait Strain
Randomly selecting sequence fragments on the PnSE1 promoter, wherein each fragment contains at least one main cis-acting element, and constructing the sequence fragments into the pAbAi vector by a PCR cloning mode. Specific primers were designed using primer premier 5.0 and selected SmaI and XhoI restriction enzyme sites using SnapGene 3.2.1 to predict optional cleavage sites within the sequence. The target fragment was amplified using KOD high fidelity enzyme and the PCR product was separated electrophoretically on a 1% agarose gel. And then cutting and recycling the target fragment. The target fragment and pAbAi vector were double digested with SmaI and XhoI, digested at 37℃for 6h, and then the digestion reaction was stopped by adding 5. Mu.L of 10X DNA loading buffer. And (3) carrying out electrophoresis separation on the enzyme digestion product on 1% agarose gel, and then carrying out gel digestion recovery treatment.
The fragment of interest was ligated overnight with the vector using T4 ligase and the ligation products were all transformed into E.coli DH 5. Alpha. Competent (100. Mu.L). Monoclone is selected and put into LB liquid culture medium (50 mg/L Amp), and is cultivated in an enlarged way, and then 1 mu L of bacterial liquid is taken as a template for bacterial liquid PCR verification. The positive clones obtained were verified to be extracted from the plasmid using TIANGEN TIANpure Mini Plasmid Kit II (Code No. DP107).
The bait vector constructed above was subjected to enzyme-tangential restriction with BstBI restriction enzyme. The linearized plasmid after purification is transferred into a yeast Y1H strain. Yeast bait strain identification was performed using the Matchmaker Insert Check PCR Mix I kit. Identification of successful Yeast colonies for expansion culture, selection of the appropriate Yeast decoy strain YE697 capable of being inhibited by AbA.
Example 2 selection of genes that interact with the PnSE1 Gene promoter
Total Notoginseng RNA was extracted with reference to TIANGEN RNAprep Pure plant total RNA extraction kit instructions. Using total notoginseng RNA as template, through SMART reverse transcription to synthesize cDNA first chain, using long distance PCR (LD-PCR) amplification technique to synthesize double-chain cDNA, using 1.2% agarose gel electrophoresis to detect, finally using CHROMA spin+TE-400 chromatographic column to purify, referring to Matchmaker Gold Yeast One-Hybrid Library Screening System reagent box instruction to construct notoginseng cDNA library.
The purified pseudo-ginseng cDNA library and pGADT7 vector are transformed into positive decoy yeast strain competent cells, and cultured for 3d at 30 ℃ on SD/-Leu/AbA (500 ng/mL) medium, and positive clones are selected and expanded to YPDA liquid medium. The small amount of extraction kit of the Biyundian yeast plasmid is used for extracting the yeast plasmid. PCR amplification was performed using the extracted yeast plasmid as a template, and the primers were universal primers T7 (5'-AATA CGACTCACTATAGGGCG-3') and 3-AD (5'-AGATGGTGCACGATGCACAG-3'). The PCR product obtained was subjected to sample-feeding sequencing. And (3) comparing the data obtained by the sequencing result in NCBI, pseudo-ginseng genome, transcriptome data and an Arabidopsis thaliana database to obtain a development and cell death related protein coding gene (development and cell death domain protein, DCD) which interacts with a PnSE1 gene promoter, wherein the base sequence is shown as SEQ ID NO.1, and the amino acid sequence of the coded protein is shown as SEQ ID NO. 2.
Example 3 bioinformatics analysis
The nucleotide and amino acid sequences of the above gene PnDCD were submitted to NCBI for analysis. The Open Reading Frame (ORF) sequence of PnDCD has 966bp and codes for 321 amino acids. The molecular weight was predicted to be 36583.21Daltons and the isoelectric point (pI) to be 8.95 using ExpASY on-line software (https:// web. ExPASy. Org/computer_pi /), indicating that the protein was an alkaline protein. Of these, 44 strongly basic amino acids (K, R), 38 strongly acidic amino acids (D, E), 94 hydrophobic amino acids (Hydrophobic Amino Acids) (A, I, L, F, W, V), and 99 polar amino acids (Polar Amino Acids) (N, C, Q, S, T, Y). The instability index (instability index, II) of the PnDCD protein was 37.79, the total average hydrophilic value (Grand average of hydropathicity, GRAVY) was-0.737, and the protein was stable hydrophilic. SMART on-line software (http:// SMART. Embl-heidelberg. De /) predictions show that the protein has no transmembrane structure (transmembrane domains), but has two low-copy regions (low complexness) located at 106-115 aa and 122-143 aa, respectively, of the predicted amino acid sequence.
The secondary structure of the PnDCD protein was predicted using on-line software SOPMA (https:// npsa-prabi.ibcp. Fr/cgi-bin/npsa_Automat. Pl. The three-dimensional structure of the PnDCD protein is predicted by utilizing on-line software SWISS-MODEL (http:// swissmodel. Expasy. Org /), and the using method is X-ray, and the respective rate is thatThe results are shown in FIG. 2. The template used was 2p5d.1.A, the Identity of the sequence (Seq Identity) was 15.53%, the status of the oligonucleotide (Oligo-state) was Monomer, the similarity of the sequence to the template sequence (Seq similarity) was 0.31, and the Coverage (Coverage) was 0.32.
The gene PnDCD was cloned and analyzed in many species. The amino acid sequence of PnDCD was aligned multiple times with the amino acid sequence of the gene in other plants in the NCBI database by software Clustal X and MEGA6.0 and a evolutionary tree was constructed, and specific species and protein sequence numbers are shown in Table 1. From the results of the evolutionary tree (FIG. 3), pnDCD is evolutionarily similar to the Rosaceae PdDCD and Salicaceae PdDCD.
TABLE 1 nucleotide sequences for constructing evolutionary trees
Example 4 Yeast in vivo validation
Based on the results of the yeast single hybridization, the gene sequence that interacts with the PnSE1 promoter was selected as the in vivo verification target for yeast. Specific primers (upstream primer: 5' -CG) were designed based on Notoginseng radix transcriptome dataGAATTCATGGAGAACATGAATAGCTTTTGG-3', downstream primer: 5' -CGGGATCCTCAACTTCCAAGCTTGCGCT-3'), ecoRI and BamHI cleavage sites were introduced into the sequence, and the gene was constructed into pGADT7 vector by PCR amplification using pseudo-ginseng cDNA as a template. The constructed recombinant vectors were transferred into yeast strain YE697, respectively, and interaction between PnDCD and PnSE1 promoter was verified by X-alpha-gal chromogenic reaction (FIG. 4).
Example 5 in vitro validation
Specific primers (upstream primer: 5' -CG) were designed based on Notoginseng radix transcriptome dataGGATCCATGGAGAACATGAATAGCTTTTGG-3', downstream primer: 5' -CGGAATTCTCAACTTCCAAGCTTGCGCT-3'), bamHI and EcoRI cleavage sites were introduced into the sequence, and PnDCD was constructed into the prokaryotic expression vector pET-32a by PCR amplification using Notoginseng radix cDNA as a template. And (3) extracting positive recombinant plasmids, and transforming competent cells of escherichia coli BL21 (DE 3). Positive clones were selected with LB medium (50 mg/L Amp) containing antibiotics. 200 mu L of positive clone bacterial liquid is inoculated into 5mL of LB liquid culture medium for expansion culture, when the bacterial liquid reaches a logarithmic growth phase (OD 600 = 0.5), IPTG is added to induce and express recombinant protein, the concentration of the IPTG is 1mmol/L, the proper induction time is 6h, and the induction temperature is 25 ℃. Finally, the obtained protein is subjected to gel electrophoresis by adopting SDS-PAGE gel electrophoresis technology, and the result is shown in figure 5. The size of the PnDCD fusion protein is 50KD, and after His tag proteins are removed, the band size is matched with the predicted molecular weight size of the PnDCD protein, namely about 36KD.
EXAMPLE 6 subcellular localization
According to the data of the pseudo-ginseng transcriptomeMeter-specific primer (upstream primer: 5' -TCC)CCCGGGATGGAGAACATGAATAGCTTTTGG-3', downstream primer: 5' -CGGGATCCACTTCCAAGCTTGCGCTTTAC-3'), smaI and BamHI cleavage sites were introduced into the sequence, and the gene was constructed into a 35S-sGFP vector by PCR amplification using Notoginseng cDNA as a template. And (3) transforming the constructed recombinant vector into DH5 alpha competence of the escherichia coli. Picking up monoclone to LB liquid culture medium (50 mg/L Kan), enlarging culture, then sucking 1 mu L bacterial liquid as template for bacterial liquid PCR verification. The positive clone obtained was verified to be plasmid-extracted with TIANGEN TIANpure Mini Plasmid Kit II (Code No. DP107) and then transformed with Agrobacterium tumefaciens strain GV3101. Agrobacterium H 2 B-RFP Strain and GV3101 Strain containing the target Gene according to 1:1 are injected into the same tobacco leaf after being evenly mixed according to the proportion, H 2 B-RFP strains can cause nuclei to appear red. The tobacco after injection was cultured for 3 days, and then sampled and placed under a laser confocal microscope for observation. Subcellular localization results showed that the green fluorescence expressed by the PnDCD protein was weak, which was distributed in both the nucleus and cytoplasm, indicating its localization in the cytoplasm and cell membrane (fig. 6).
Sequence listing
<110> university of Zhejiang
Zhejiang university of technology limited company Zhejiang of Hemsleya shapewear
<120> Gene PnDCD and its use in regulating saponin Synthesis
<160> 10
<170> SIPOSequenceListing 1.0
<210> 1
<211> 966
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 1
atggagaaca tgaatagctt ttggcaattg ggtgacgaga tccgaggatt gaaagtttct 60
gaggagcaca agtggttaat ggctgcttct aggttggctg agcagactag gtccaagggc 120
gagcggagga acaatcttga tctttcgaaa ggatctactg aaacaaagcc aagggataat 180
attgggttcc aggaagataa caaatttgaa agcctcaact tcaacatgtt aaatttggat 240
acaaaaatga atgaaaccat tgccaaaagt tctctcagga atagtgtgta caacatgaac 300
acggtgtctc agaaaaacaa tatcaacaac actgttaata tgaatggtac caagtataat 360
ggtaacaacc acaaaaagga ggccaccacc aacaacaatc acaacaacaa caactatgag 420
aatgccaatt cgatcagtgc tgttgacaaa aggtttaaga ccttacctgc agcagagaca 480
cttccgagaa atgaagttct tggtggatat atctttgttt gcaataatga tacaatgcag 540
gaggatctga agcgtcaact atttggttta ccaccaagat atagagattc tgttcgagct 600
ataacgccag gcttacctct gtttctatac aactacacca ctcaccagtt gcatggtatt 660
tttgaggctt cgggttttgg aggttccaac atcgatgcta ctgcttggga agataaaaaa 720
tgcaaagggg agtctaggtt tcctgctcag gttagggtcc gtgttagaaa aatctgcaag 780
gccttggagg aagatgcttt taggcctgta ttgcatcatt atgatggccc gaaatttcgt 840
cttgagcttt cagtacctga gacattagac ttgcttgatc tctgtgaaca agctggtgta 900
aaagtttcat gggtgccgtg tgtttttcaa caaatcgttt ccgtaaagcg caagcttgga 960
agttga 966
<210> 2
<211> 321
<212> PRT
<213> Artificial sequence (Artificial Sequence)
<400> 2
Met Glu Asn Met Asn Ser Phe Trp Gln Leu Gly Asp Glu Ile Arg Gly
1 5 10 15
Leu Lys Val Ser Glu Glu His Lys Trp Leu Met Ala Ala Ser Arg Leu
20 25 30
Ala Glu Gln Thr Arg Ser Lys Gly Glu Arg Arg Asn Asn Leu Asp Leu
35 40 45
Ser Lys Gly Ser Thr Glu Thr Lys Pro Arg Asp Asn Ile Gly Phe Gln
50 55 60
Glu Asp Asn Lys Phe Glu Ser Leu Asn Phe Asn Met Leu Asn Leu Asp
65 70 75 80
Thr Lys Met Asn Glu Thr Ile Ala Lys Ser Ser Leu Arg Asn Ser Val
85 90 95
Tyr Asn Met Asn Thr Val Ser Gln Lys Asn Asn Ile Asn Asn Thr Val
100 105 110
Asn Met Asn Gly Thr Lys Tyr Asn Gly Asn Asn His Lys Lys Glu Ala
115 120 125
Thr Thr Asn Asn Asn His Asn Asn Asn Asn Tyr Glu Asn Ala Asn Ser
130 135 140
Ile Ser Ala Val Asp Lys Arg Phe Lys Thr Leu Pro Ala Ala Glu Thr
145 150 155 160
Leu Pro Arg Asn Glu Val Leu Gly Gly Tyr Ile Phe Val Cys Asn Asn
165 170 175
Asp Thr Met Gln Glu Asp Leu Lys Arg Gln Leu Phe Gly Leu Pro Pro
180 185 190
Arg Tyr Arg Asp Ser Val Arg Ala Ile Thr Pro Gly Leu Pro Leu Phe
195 200 205
Leu Tyr Asn Tyr Thr Thr His Gln Leu His Gly Ile Phe Glu Ala Ser
210 215 220
Gly Phe Gly Gly Ser Asn Ile Asp Ala Thr Ala Trp Glu Asp Lys Lys
225 230 235 240
Cys Lys Gly Glu Ser Arg Phe Pro Ala Gln Val Arg Val Arg Val Arg
245 250 255
Lys Ile Cys Lys Ala Leu Glu Glu Asp Ala Phe Arg Pro Val Leu His
260 265 270
His Tyr Asp Gly Pro Lys Phe Arg Leu Glu Leu Ser Val Pro Glu Thr
275 280 285
Leu Asp Leu Leu Asp Leu Cys Glu Gln Ala Gly Val Lys Val Ser Trp
290 295 300
Val Pro Cys Val Phe Gln Gln Ile Val Ser Val Lys Arg Lys Leu Gly
305 310 315 320
Ser
<210> 3
<211> 21
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 3
aatacgactc actatagggc g 21
<210> 4
<211> 20
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 4
agatggtgca cgatgcacag 20
<210> 5
<211> 32
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 5
cggaattcat ggagaacatg aatagctttt gg 32
<210> 6
<211> 28
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 6
cgggatcctc aacttccaag cttgcgct 28
<210> 7
<211> 32
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 7
cgggatccat ggagaacatg aatagctttt gg 32
<210> 8
<211> 28
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 8
cggaattctc aacttccaag cttgcgct 28
<210> 9
<211> 33
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 9
tcccccggga tggagaacat gaatagcttt tgg 33
<210> 10
<211> 29
<212> DNA
<213> Artificial sequence (Artificial Sequence)
<400> 10
cgggatccac ttccaagctt gcgctttac 29