WO2023077482A1 - Combination of mnp markers of mycobacterium tuberculosis, primer pair combination, kit, and uses of combination, primer pair combination and kit - Google Patents
Combination of mnp markers of mycobacterium tuberculosis, primer pair combination, kit, and uses of combination, primer pair combination and kit Download PDFInfo
- Publication number
- WO2023077482A1 WO2023077482A1 PCT/CN2021/129160 CN2021129160W WO2023077482A1 WO 2023077482 A1 WO2023077482 A1 WO 2023077482A1 CN 2021129160 W CN2021129160 W CN 2021129160W WO 2023077482 A1 WO2023077482 A1 WO 2023077482A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- mnp
- mycobacterium tuberculosis
- combination
- markers
- marker
- Prior art date
Links
- 241000187479 Mycobacterium tuberculosis Species 0.000 title claims abstract description 123
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract 3
- 238000001514 detection method Methods 0.000 claims description 68
- 239000003550 marker Substances 0.000 claims description 63
- 230000007614 genetic variation Effects 0.000 claims description 16
- 238000007403 mPCR Methods 0.000 claims description 12
- 239000002773 nucleotide Substances 0.000 claims description 10
- 125000003729 nucleotide group Chemical group 0.000 claims description 10
- 238000010276 construction Methods 0.000 claims description 7
- 230000001580 bacterial effect Effects 0.000 claims description 3
- 101100184487 Caenorhabditis elegans mnp-1 gene Proteins 0.000 abstract description 2
- 238000012163 sequencing technique Methods 0.000 description 22
- 108700028369 Alleles Proteins 0.000 description 17
- 238000000034 method Methods 0.000 description 17
- 238000006243 chemical reaction Methods 0.000 description 13
- 241000894007 species Species 0.000 description 10
- 201000008827 tuberculosis Diseases 0.000 description 10
- 238000005516 engineering process Methods 0.000 description 9
- 238000012360 testing method Methods 0.000 description 9
- 230000003321 amplification Effects 0.000 description 8
- 238000003199 nucleic acid amplification method Methods 0.000 description 8
- 238000012216 screening Methods 0.000 description 8
- 230000035945 sensitivity Effects 0.000 description 8
- 238000012165 high-throughput sequencing Methods 0.000 description 7
- 238000003908 quality control method Methods 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 6
- 238000011156 evaluation Methods 0.000 description 6
- 238000002474 experimental method Methods 0.000 description 6
- 239000012634 fragment Substances 0.000 description 6
- 244000005700 microbiome Species 0.000 description 6
- 238000012408 PCR amplification Methods 0.000 description 5
- 244000000010 microbial pathogen Species 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000013461 design Methods 0.000 description 4
- 238000011109 contamination Methods 0.000 description 3
- 230000000813 microbial effect Effects 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 241000186359 Mycobacterium Species 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 108091036078 conserved sequence Proteins 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 238000002372 labelling Methods 0.000 description 2
- 238000007481 next generation sequencing Methods 0.000 description 2
- 108020004707 nucleic acids Proteins 0.000 description 2
- 102000039446 nucleic acids Human genes 0.000 description 2
- 150000007523 nucleic acids Chemical class 0.000 description 2
- 244000052769 pathogen Species 0.000 description 2
- 108091033319 polynucleotide Proteins 0.000 description 2
- 102000040430 polynucleotide Human genes 0.000 description 2
- 239000002157 polynucleotide Substances 0.000 description 2
- 230000002265 prevention Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 239000008223 sterile water Substances 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 241000589291 Acinetobacter Species 0.000 description 1
- 241000193738 Bacillus anthracis Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000588807 Bordetella Species 0.000 description 1
- 241000588832 Bordetella pertussis Species 0.000 description 1
- LZZYPRNAOMGNLH-UHFFFAOYSA-M Cetrimonium bromide Chemical compound [Br-].CCCCCCCCCCCCCCCC[N+](C)(C)C LZZYPRNAOMGNLH-UHFFFAOYSA-M 0.000 description 1
- 241001647372 Chlamydia pneumoniae Species 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- 241000606768 Haemophilus influenzae Species 0.000 description 1
- 241000701085 Human alphaherpesvirus 3 Species 0.000 description 1
- 241000046923 Human bocavirus Species 0.000 description 1
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 1
- 241000588747 Klebsiella pneumoniae Species 0.000 description 1
- 241000589248 Legionella Species 0.000 description 1
- 208000007764 Legionnaires' Disease Diseases 0.000 description 1
- 241000588655 Moraxella catarrhalis Species 0.000 description 1
- 241000202934 Mycoplasma pneumoniae Species 0.000 description 1
- 206010035664 Pneumonia Diseases 0.000 description 1
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 1
- 241000606701 Rickettsia Species 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- 241000191967 Staphylococcus aureus Species 0.000 description 1
- 241000194017 Streptococcus Species 0.000 description 1
- 241000193996 Streptococcus pyogenes Species 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000002238 attenuated effect Effects 0.000 description 1
- 229940065181 bacillus anthracis Drugs 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 239000000428 dust Substances 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 230000003203 everyday effect Effects 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 210000001035 gastrointestinal tract Anatomy 0.000 description 1
- 238000012268 genome sequencing Methods 0.000 description 1
- 229940047650 haemophilus influenzae Drugs 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000003147 molecular marker Substances 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 125000003835 nucleoside group Chemical group 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 102000054765 polymorphisms of proteins Human genes 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 108090000623 proteins and genes Proteins 0.000 description 1
- 230000005180 public health Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 210000002345 respiratory system Anatomy 0.000 description 1
- 238000007480 sanger sequencing Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 230000037380 skin damage Effects 0.000 description 1
- 238000013179 statistical model Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 238000002255 vaccination Methods 0.000 description 1
- 229960005486 vaccine Drugs 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/02—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving viable microorganisms
- C12Q1/04—Determining presence or kind of microorganism; Use of selective media for testing antibiotics or bacteriocides; Compositions containing a chemical indicator therefor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
Definitions
- the embodiment of the present invention relates to the field of biotechnology, in particular to a combination of MNP markers of Mycobacterium tuberculosis, a combination of primer pairs, a kit and applications thereof.
- Mycobacterium tuberculosis commonly known as Mycobacterium tuberculosis.
- Mycobacterium tuberculosis can invade susceptible organisms through respiratory tract, digestive tract or skin damage, causing tuberculosis in various tissues and organs.
- Mycobacterium tuberculosis can be inhaled through droplets or dust containing bacteria, so tuberculosis is more common.
- Tuberculosis is a worldwide disease and one of the most concerned public health problems. Although in recent years, with the production of vaccines and early vaccination of children, the incidence of tuberculosis has shown a downward trend, but its harm to human health is still very serious. Rapid and accurate detection and reporting of Mycobacterium tuberculosis is one of the important links in the prevention and control of tuberculosis, and is the basis for controlling the incidence and prevalence of tuberculosis.
- Mycobacterium tuberculosis is a typical pathogenic microorganism, and it is also a pathogenic microorganism that is often studied in the laboratory.
- the genetic variation of experimental microorganisms will lead to genetically different strains with the same name in different laboratories or in the same laboratory in different periods, resulting in non-reproducible and non-comparable experimental results.
- a consensus has been reached on the validation of human Hella cells prior to publication. Therefore, the detection of pathogenic microorganisms must not only be able to detect Mycobacterium tuberculosis sensitively and accurately, but also be able to detect low-frequency mutations in the population, which is a challenge to the existing detection technology.
- the classic detection methods of Mycobacterium tuberculosis including isolation and culture, PCR technology, whole genome and metagenomic sequencing, etc., have one or more problems in terms of time length, operation complexity, detection throughput, accuracy and sensitivity of detection of mutations, cost, etc. limited.
- the targeted molecular marker detection technology that combines ultra-multiplex PCR amplification and high-throughput sequencing can enrich target microorganisms in samples with low microbial content, avoiding the isolation and culture steps of whole-genome pathogenic bacteria and the disadvantages caused by metagenomic sequencing.
- a large amount of data waste and background noise have the advantages of less sample requirements, accurate diagnostic results, saving data volume, and detecting low-frequency variations.
- the molecular markers detected by existing targeted detection technologies mainly include SNP and SSR markers.
- SSR markers are recognized as the most polymorphic markers, but their number is small in microorganisms; SNP markers are huge in number, densely distributed, and are dimorphic markers, and the polymorphism of a single SNP marker is not enough to capture potential alleles in microbial populations genetic diversity. Therefore, the development of new molecular markers with high polymorphism and their detection technology has become an urgent technical problem to be solved.
- MNP markers refer to polymorphic markers caused by multiple nucleotides in an upper region of the genome. Compared with SSR markers and SNP markers, MNP markers have the following advantages: (1) rich allelic types, with 2 n allele types on a single MNP marker, higher than SSR and SNP markers; (2) strong species discrimination ability , only a small amount of MNP markers can be used to identify species and their taxonomic levels, and the identification of species is fine.
- the MNP labeling method based on super multiplex PCR combined with next-generation high-throughput sequencing technology to detect MNP markers has the following advantages: (1) The output is base sequence, without parallel experiments, and a standardized database can be built for sharing; (2) High Efficiency, using sample DNA barcodes, breaking through the limitation of the number of sequencing samples, and can type tens of thousands of MNP markers in hundreds of samples at one time; (3) High sensitivity, using multiplex PCR to detect multiple targets at a time, avoiding single Target amplification failures lead to high false negatives and low sensitivity; (4) high accuracy, using a second-generation high-throughput sequencer to sequence the amplified product hundreds of times.
- MNP marker and its detection technology can realize the classification and traceability of multi-allelic genotypes in populations, and has application potential in the identification of pathogenic microorganisms, fingerprint database construction, and genetic variation detection.
- MNP marker method can realize the classification and traceability of multi-allelic genotypes in populations, and has application potential in the identification of pathogenic microorganisms, fingerprint database construction, and genetic variation detection.
- MNP labeling there is no report on MNP labeling, and there is a lack of corresponding technology.
- the development, screening and application of MNP markers have a good application basis in plants.
- MNP markers and detection primers for identifying Mycobacterium tuberculosis in pathogenic microorganisms.
- the combination of markers and primers developed in the present invention will be used to formulate national standards for pathogen detection (plan number 20201830-T-469), which will be released by the end of 2021.
- the purpose of the present invention is to provide a MNP marker combination of Mycobacterium tuberculosis, a primer pair combination, a kit and its application, which can identify and detect mutations of Mycobacterium tuberculosis, and have multi-target, high-throughput, high-sensitivity and precision The typing effect.
- a MNP marker combination of Mycobacterium tuberculosis refers to the MNP marker combination that is screened on the Mycobacterium tuberculosis genome to distinguish it from other species and have multiple nucleosides within the species.
- nucleotide sequence marked by MNP-1-MNP-17 is specifically shown in NO.1-SEQ ID NO.17.
- the multiple PCR primer pair combination includes 17 pairs of primers, and the specific primer pair combination nucleotide sequence is as SEQ As shown in ID NO.18-SEQ ID NO.51, wherein ID NO.18-SEQ ID NO.34 is the upper primer, and ID NO.35-SEQ ID NO.51 is the lower primer.
- each MNP-labeled primer includes an upper primer and a lower primer, as shown in Table 1 of the specification.
- a detection kit for detecting the MNP marker combination of Mycobacterium tuberculosis includes the primer pair combination; further, the kit also includes Multiplex PCR Master Mix.
- the kit includes the primer pair combination; further, the kit also includes Multiplex PCR Master Mix.
- the MNP marker combination of Mycobacterium tuberculosis or the combination of multiple PCR primers or the detection kit in the identification of Mycobacterium tuberculosis and the construction of DNA fingerprint database Application in genetic variation detection.
- the kit of the present invention to carry out the first round of multiplex PCR amplification to the total DNA and the blank control, and the number of cycles is not higher than 25;
- the amplified product is purified, add sample tags and next-generation sequencing adapters based on the second round of PCR amplification; quantify the second-round amplified product after purification; detect multiple strains by combining the second-round amplified product, etc.
- High-throughput sequencing is carried out after the amounts are mixed; the sequencing results are compared to the reference sequence of Mycobacterium tuberculosis, and the number of detected sequences and genotype data in the total DNA are obtained.
- the quality control scheme and determination method is to use the DNA of Mycobacterium tuberculosis with known copy number as the detection sample, evaluate the sensitivity, accuracy and specificity of the kit to detect Mycobacterium tuberculosis, formulate the The quality control scheme and determination method when the kit detects Mycobacterium tuberculosis.
- the detection of genetic variation between strains includes using the above kit and method to obtain the genotype data of each of the 17 MNP markers of the strains to be compared. Through genotype comparison, analyze whether there are differences in the main genotypes of the strains to be compared on the 17 MNP markers. If there is a variation in at least one main genotype of the MNP marker in the strain to be compared, it is determined that there is a genetic variation between the two.
- single-plex PCR can also be used to amplify the 17 markers of the strains to be compared, and then perform Sanger sequencing on the amplified products.
- the sequence After obtaining the sequence, compare the genotypes of each MNP marker of the strains to be compared. right. If there are MNP markers with inconsistent main genotypes, there is variation among the strains to be compared. When detecting the genetic variation within the strain, it is judged by statistical model whether the MNP markers in the strain to be tested detect sub-genotypes other than the main genotype. If the test strain has a subgenotype in at least one MNP marker, it is determined that there is genetic variation within the test strain.
- the genotype data of the MNP marker of the Mycobacterium tuberculosis identified from the sample is entered into the database file to constitute the DNA fingerprint database of Mycobacterium tuberculosis;
- identify whether the Mycobacterium tuberculosis in the sample and the bacterial strain in the database have the main genotype in the MNP marker (with more than one MNP marker in one MNP marker).
- the genotype supported by 50% of the sequenced fragments) difference, Mycobacterium tuberculosis with main genotype difference in at least one MNP marker is a new variant type, which is included in the DNA fingerprint database.
- Mycobacterium tuberculosis typing it is to identify the Mycobacterium tuberculosis in the sample to be tested, and obtain the genotype of each MNP site; collect the genome sequence of Mycobacterium tuberculosis published on the Internet and the constructed
- the Mycobacterium tuberculosis DNA fingerprint database constitutes the Mycobacterium tuberculosis reference sequence library; compare the genotype of Mycobacterium tuberculosis in the sample to be tested with the reference sequence library of Mycobacterium tuberculosis, and screen for genetically consistent or most For close strains, the type of Mycobacterium tuberculosis in the sample to be tested is obtained. According to the comparison result with the reference sequence library, it is identified whether the Mycobacterium tuberculosis in the sample is an existing type or a new variant, so as to realize fine typing of Mycobacterium tuberculosis.
- the present invention has the following advantages:
- the invention provides an MNP marker combination of Mycobacterium tuberculosis, a primer pair combination, a kit and applications thereof.
- the provided 17 MNP markers of Mycobacterium tuberculosis and their primer combinations can be used for multiplex PCR amplification, combined with the next-generation sequencing platform to sequence the amplified products, meeting the high-throughput, high-efficiency,
- the requirements for high accuracy, high sensitivity and culture-free detection meet the requirements for the construction of a standard and shareable fingerprint database for Mycobacterium tuberculosis; meet the requirements for accurate detection of genetic variation between strains of Mycobacterium tuberculosis and the identification of pure Mycobacterium tuberculosis Synthetic and heterozygous needs.
- the present invention is the first in the field of Mycobacterium tuberculosis, and there are no relevant literature reports; MNP markers are mainly developed based on reference sequences, and large-scale identification of Mycobacterium tuberculosis from other species can be mined based on the resequencing data of the reported representative races of Mycobacterium tuberculosis , MNP markers with polymorphic and conserved sequences on both sides of Mycobacterium tuberculosis species; MNP marker detection primers suitable for multiplex PCR amplification can be designed through the conserved sequences on both sides of the MNP marker; then according to the test results of the standard, A set of primer combinations and detection methods with the largest polymorphism and high specificity and the best MNP marker compatibility can be screened out, and used for the detection of Mycobacterium tuberculosis, the construction of DNA fingerprints, and the genetic variation within and between strains In detection and other related applications, it provides technical support for scientific research, scientific monitoring and prevention of Myco
- Figure 1 is a schematic diagram of MNP marker polymorphism
- Fig. 2 is the flow chart of screening and primer design of Mycobacterium tuberculosis MNP marker
- Fig. 3 is the detection flowchart of MNP mark
- Example 1 The screening of Mycobacterium tuberculosis MNP marker combination and the design of multiple PCR amplification primers
- the step S1 specifically includes:
- the window uses 100-300bp as the window, and perform window translation with 1bp as the step size, and screen to obtain multiple candidate MNP marker regions, wherein the candidate MNP marker regions contain ⁇ 2 of the single nucleotides Variation markers, and the single nucleotide polymorphism markers do not exist on the 30bp sequences at both ends.
- step lengths can also be used when screening on the reference genome with a window of 100-300 bp.
- the step size is 1 bp, which is conducive to comprehensive screening.
- the multiple PCR amplification primers labeled with MNP are designed by primer design software.
- the primer design follows that the primers do not interfere with each other. All primers can be combined into a primer pool for multiple PCR amplification, that is, all designed primers can be used in one amplification reaction. normal expansion.
- the primers used to identify the MNP marker are shown in Table 1 and SEQ ID NO.18-SEQ ID NO.51.
- the method for detecting the MNP markers is to use a PCR master mix suitable for multiplex PCR, amplify all MNP markers at one time through multiplex PCR, sequence the amplified products through second-generation high-throughput sequencing, and analyze the sequencing data , evaluating the compatibility of the primer combination based on the detected markers.
- Mycobacterium tuberculosis DNA standard with known copy number (article number: BDS-BW-047, Guangzhou Bangdesheng Biotechnology Co., Ltd.) to prepare a mock sample of Mycobacterium tuberculosis, that is, tuberculosis with known copy number
- the mycobacterium standard was added to human genomic DNA (2ng/reaction) to prepare a template with 1000 copies/reaction, and the primer combination was used to perform repeated detection by the MNP marker detection method. Based on the principles of high species discrimination and species-specific MNP marker screening and high-efficiency amplification and stable amplification primer screening principles, 17 MNP markers and their detection primer pair combinations provided by the present invention were finally screened out.
- the DNA of Mycobacterium tuberculosis was added into Into human genomic DNA, prepare 1 copy/reaction, 10 copies/reaction, and 100 copies/reaction of M. tuberculosis mock samples. At the same time, an equal volume of sterile water was set as a blank control. In this embodiment, there are 4 samples in total, and 3 repeated libraries are constructed for each sample every day, and the detection is performed continuously for 4 days, that is, 12 sets of sequencing data are obtained for each sample, as shown in Table 2.
- the kit can accurately and sensitively detect Mycobacterium tuberculosis as low as 10 copies/reaction.
- the sequence aligned to Mycobacterium tuberculosis can be detected in 1 copy/reaction sample, covering at least 1 MNP marker. In some blank controls, the sequence of Mycobacterium tuberculosis was also detected. Due to the extreme sensitivity of the MNP marker detection method, data contamination during the detection process can easily lead to false positives. Therefore, the following quality control plan was formulated in this example.
- the quality control plan is as follows:
- the amount of sequencing data is greater than 5 million bases. The calculation is based on the fact that the number of MNP markers detected in each sample is 17, and the length of a sequencing fragment is 300 bases. Therefore, when the data volume is greater than 5 million bases, most samples can be guaranteed to cover each marker in one experiment. The number of sequencing fragments reaches 1000 times, ensuring accurate analysis of the base sequence of each MNP marker.
- Blank control noise index P nc/Nc, wherein nc and Nc respectively represent the number of sequencing fragments of Mycobacterium tuberculosis in the blank control.
- the signal index of the test sample S nt/Nt, wherein nt and Nt respectively represent the number of sequencing fragments of Mycobacterium tuberculosis in the test sample.
- the mean value of the noise index of Mycobacterium tuberculosis in the blank control is 0.1%
- the mean value of the signal index in the sample of 1 copy is 0.3%
- the signal of the sample of 1 copy and the blank control The average value of the noise ratio is 5.8. Therefore, the present invention stipulates that when the signal-to-noise ratio is greater than 10 times, it can be judged that the contamination in the detection system is acceptable.
- the average value of the signal-to-noise ratio of 10 copies of the sample and the blank control is 52.2, and in the 12 sets of data of 10 copies/reaction, at least 7 MNP markers can be stably detected, accounting for the total markers 41.2%. Therefore, in the case of ensuring accuracy, this standard stipulates that the signal-to-noise ratio threshold of Mycobacterium tuberculosis is 30, that is, when the signal-to-noise ratio of Mycobacterium tuberculosis in the sample is greater than 30, and the marker detection rate is greater than or equal to 40%. When , it is determined that the nucleotide of Mycobacterium tuberculosis has been detected in the sample.
- the kit provided by the present invention can sensitively detect 10 copies/reaction Mycobacterium tuberculosis.
- Artificial Mycobacterium tuberculosis and Acinetobacter spp. Adenovirus, Bacillus anthracis, Bordetella holbachii, Bordetella pertussis, Chlamydia pneumoniae, Mycoplasma pneumoniae, Epstein-Barr virus, Haemophilus influenzae, varicella zoster Viruses, cytomegalovirus, herpes simplex virus, human bocavirus, Klebsiella pneumoniae, Legionella, Moraxella catarrhalis, Pseudomonas aeruginosa, Rickettsia, Staphylococcus aureus, pneumonia Streptococcus and Streptococcus pyogenes DNA are mixed together in equimolar amounts to prepare a mixed template, and sterile water is used as a blank control, and the method provided by the invention is used to detect the mixed template.
- the MNP fingerprints of the 3 copies of BCG stored in laboratories A and C are consistent, and among the 4 copies of laboratory B, 1 copy (S-5) and the 9 copies of BCG tested together with the same batch are consistent.
- MNP markers The technical error rate of MNP markers is significantly lower than that of SNP markers, based on the fact that multiple errors are less likely to occur simultaneously than one error.
- the essence of the detection of genetic variation within Mycobacterium tuberculosis strains is to detect whether there is a minor allele other than the main genotype at the MNP site of the population, and to determine the authenticity of the detected minor allele.
- the authenticity evaluation of the secondary allelic type in this embodiment is carried out as follows: first, the allelic type with strand preference (the ratio of the number of sequencing sequences covered on the DNA double strand) is excluded according to the following rules: the strand preference is greater than 10 times, or more than 5 times different from the strand preference of the main allele type.
- the nucleotides of S-5 are mixed into S according to the following 8 ratios: In -4 nucleotides, artificial heterozygous samples were prepared, and each sample was detected 3 times, and a total of 24 sequencing data were obtained. Through accurate comparison with the genotypes of MNP markers of S-4 and S-5, the genotype of S-5 was detected in 24 artificial heterozygous samples, and the concentration of S-5 in the artificial heterozygous samples accounted for The ratio was as low as 1/1000, illustrating the applicability of the developed MNP marker assay for the identification of M. tuberculosis to detect individuals with a low proportion of genetic variation within the strain population.
- the DNA of all strains or samples used to construct the Mycobacterium tuberculosis DNA fingerprint database was extracted by conventional CTAB method and commercial kits, and the quality of DNA was detected by agarose gel and ultraviolet spectrophotometer. If the ratio of the absorbance value of the extracted DNA at 260nm to 230nm is greater than 2.0, the ratio of the absorbance value at 260nm to 280nm is between 1.6 and 1.8, the main band of DNA electrophoresis is obvious, and there is no obvious degradation and RNA residue, it means that the genomic DNA has reached Relevant quality requirements, follow-up experiments can be carried out.
- the MNP fingerprints of the S1-S10BCG strains were obtained. Compare the fingerprints of 10 strains pairwise, and enter the fingerprints of strains with at least one MNP fingerprint difference into the database file to form the MNP fingerprint database of Mycobacterium tuberculosis; The MNP fingerprints are compared with the established MNP fingerprint database, and the MNP fingerprints of the strains with different main genotypes are entered into the constructed MNP fingerprint database, and the constructed fingerprint database is continuously updated and enriched.
- the constructed MNP fingerprint database is based on the gene sequences of the detected strains, so it is compatible with all high-throughput sequencing data and has the characteristics of being completely co-constructed and shared.
- Embodiment 6 application in fine typing of Mycobacterium tuberculosis
- the reference sequence library of Mycobacterium tuberculosis which consists of the published genome sequence of Mycobacterium tuberculosis and the constructed DNA fingerprint database of Mycobacterium tuberculosis; use the primer combination and MNP marker site described in Example 2
- the detection method is to obtain the MNP fingerprint of Mycobacterium tuberculosis in each sample to be tested; compare the DNA fingerprint of each strain with the constructed reference sequence library, and screen to obtain the strain with the closest genetic distance to the sequence library; If the genotype is 100% identical to the existing strain, it is an existing variant, and if there is a main genotype difference in at least one MNP site, it is a new variant, and the fine typing of Mycobacterium tuberculosis can be achieved.
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- Physics & Mathematics (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Analytical Chemistry (AREA)
- Immunology (AREA)
- Plant Pathology (AREA)
- Toxicology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Disclosed in the present invention are a combination of MNP markers of mycobacterium tuberculosis, a primer pair combination and kit for detecting the combination of the MNP markers, and the uses of the combination, the primer pair combination and the kit. The combination of the MNP markers comprises markers MNP-1 to MNP-17 on an AL123456 genome, and the specific nucleotide sequences are as shown in SEQ ID NO. 1-SEQ ID NO. 17. The primer pair combination has nucleotide sequences as shown in SEQ ID NO. 18-SEQ ID NO. 51. The combination of the MNP markers can specifically identify mycobacterium tuberculosis and finely distinguish different varieties; and the primers do not interfere with each other.
Description
本发明实施例涉及生物技术领域,特别涉及一种结核分枝杆菌的MNP标记组合、引物对组合、试剂盒及其应用。The embodiment of the present invention relates to the field of biotechnology, in particular to a combination of MNP markers of Mycobacterium tuberculosis, a combination of primer pairs, a kit and applications thereof.
结核分枝杆菌(Mycobacterium tuberculosis),俗称结核杆菌。结核分枝杆菌可通过呼吸道、消化道或皮肤损伤侵入易感机体,引起多种组织器官的结核病。结核分枝杆菌可通过飞沫微滴或含菌尘埃的吸入,故以肺结核较为多见。结核病是一种世界性疾病,为最受关注的公共卫生问题之一。虽然近年来随着疫苗的产生和对儿童的早期接种,结核病的发病率呈下降趋势,但其对人类健康造成的危害仍十分严峻。快速、准确的结核分枝杆菌的检测、上报是结核病防控的重要环节之一,是控制结核病发病和流行的基础。Mycobacterium tuberculosis (Mycobacterium tuberculosis), commonly known as Mycobacterium tuberculosis. Mycobacterium tuberculosis can invade susceptible organisms through respiratory tract, digestive tract or skin damage, causing tuberculosis in various tissues and organs. Mycobacterium tuberculosis can be inhaled through droplets or dust containing bacteria, so tuberculosis is more common. Tuberculosis is a worldwide disease and one of the most concerned public health problems. Although in recent years, with the production of vaccines and early vaccination of children, the incidence of tuberculosis has shown a downward trend, but its harm to human health is still very serious. Rapid and accurate detection and reporting of Mycobacterium tuberculosis is one of the important links in the prevention and control of tuberculosis, and is the basis for controlling the incidence and prevalence of tuberculosis.
结核分枝杆菌是典型的病原微生物,也是实验室经常研究的病原微生物。实验微生物的遗传变异会导致不同实验室或同一实验室不同时期相同命名的菌株遗传上实际并不相同,从而导致实验结果的不可重现和不可比较。对人hella细胞在成果发表前进行认证已达成共识。因此对病原微生物的检测不仅要能灵敏准确的检出结核分枝杆菌,还要能够检测群体中的低频率的变异,这对现有的检测技术是个挑战。Mycobacterium tuberculosis is a typical pathogenic microorganism, and it is also a pathogenic microorganism that is often studied in the laboratory. The genetic variation of experimental microorganisms will lead to genetically different strains with the same name in different laboratories or in the same laboratory in different periods, resulting in non-reproducible and non-comparable experimental results. A consensus has been reached on the validation of human Hella cells prior to publication. Therefore, the detection of pathogenic microorganisms must not only be able to detect Mycobacterium tuberculosis sensitively and accurately, but also be able to detect low-frequency mutations in the population, which is a challenge to the existing detection technology.
经典的结核分枝杆菌检测方法,包括分离培养、PCR技术、全基因组和宏基因组测序等,在时长、操作复杂度、检测通量、检测变异的准确性和灵敏度、成本等方面存在一个或多个局限。融合超多重PCR扩增和高通量测序的靶向分子标记检测技术,可以在低微生物含量的样本中靶向的富集目标微生物,避免了全基因组的病原菌分离培养步骤和宏基因组测序带来的大量数据浪费和背景噪音,具有样本需要量少、诊断结果精确,节约数据量、检测低频变异的优势。The classic detection methods of Mycobacterium tuberculosis, including isolation and culture, PCR technology, whole genome and metagenomic sequencing, etc., have one or more problems in terms of time length, operation complexity, detection throughput, accuracy and sensitivity of detection of mutations, cost, etc. limited. The targeted molecular marker detection technology that combines ultra-multiplex PCR amplification and high-throughput sequencing can enrich target microorganisms in samples with low microbial content, avoiding the isolation and culture steps of whole-genome pathogenic bacteria and the disadvantages caused by metagenomic sequencing. A large amount of data waste and background noise have the advantages of less sample requirements, accurate diagnostic results, saving data volume, and detecting low-frequency variations.
现有的靶向检测技术检测的分子标记主要包括SNP和SSR标记。SSR标记是公认的多态性最高的标记,但在微生物中数量少;SNP标记数量巨大,分布密集,是二态性标记,单个SNP标记的多态性不足以捕获微生物种群中潜在的等位基因多样性。因此,开发高多态性的新型分子标记及其检测技术,成为亟待解决的技术问题。The molecular markers detected by existing targeted detection technologies mainly include SNP and SSR markers. SSR markers are recognized as the most polymorphic markers, but their number is small in microorganisms; SNP markers are huge in number, densely distributed, and are dimorphic markers, and the polymorphism of a single SNP marker is not enough to capture potential alleles in microbial populations genetic diversity. Therefore, the development of new molecular markers with high polymorphism and their detection technology has become an urgent technical problem to be solved.
本发明开发物种特异的新型分子标记-MNP标记。MNP标记是指在基因组上一段区域内由多个核苷酸引起的多态性标记。与SSR标记和SNP标记相比,MNP标记具有以下优势:(1)等位基因型丰富,单个MNP标记上有2
n种等位基因型,高于SSR和SNP;(2)物种区分能力强,只需要少量的MNP标记就能实现物种及其以下分类水平的鉴定,物种鉴定 精细。基于超多重PCR结合二代高通量测序技术检测MNP标记的MNP标记法具有以下优势:(1)输出的是碱基序列,无需平行实验,可构建标准化的数据库进行共享共用;(2)高效率,利用样品DNA条形码,突破测序样品数量的局限,可一次性对成百上千份样本的数万个MNP标记分型;(3)高灵敏度,利用多重PCR一次检测多个靶标,避免单个靶标扩增失败导致高的假阴性和低的灵敏度;(4)高准确性,利用二代高通量测序仪对扩增产物测序数百次。
The present invention develops a species-specific novel molecular marker-MNP marker. MNP markers refer to polymorphic markers caused by multiple nucleotides in an upper region of the genome. Compared with SSR markers and SNP markers, MNP markers have the following advantages: (1) rich allelic types, with 2 n allele types on a single MNP marker, higher than SSR and SNP markers; (2) strong species discrimination ability , only a small amount of MNP markers can be used to identify species and their taxonomic levels, and the identification of species is fine. The MNP labeling method based on super multiplex PCR combined with next-generation high-throughput sequencing technology to detect MNP markers has the following advantages: (1) The output is base sequence, without parallel experiments, and a standardized database can be built for sharing; (2) High Efficiency, using sample DNA barcodes, breaking through the limitation of the number of sequencing samples, and can type tens of thousands of MNP markers in hundreds of samples at one time; (3) High sensitivity, using multiplex PCR to detect multiple targets at a time, avoiding single Target amplification failures lead to high false negatives and low sensitivity; (4) high accuracy, using a second-generation high-throughput sequencer to sequence the amplified product hundreds of times.
鉴于以上优点和特性,MNP标记及其检测技术MNP标记法可实现群体生物多等位基因型的分类与溯源,在病原微生物的鉴定、指纹数据库构建、遗传变异检测等方面都具有应用潜力。目前在微生物中,尚未有关于MNP标记的报道,也缺乏相应的技术。MNP标记法的开发、筛选和应用在植物中具有较好的应用基础。In view of the above advantages and characteristics, MNP marker and its detection technology MNP marker method can realize the classification and traceability of multi-allelic genotypes in populations, and has application potential in the identification of pathogenic microorganisms, fingerprint database construction, and genetic variation detection. At present, in microorganisms, there is no report on MNP labeling, and there is a lack of corresponding technology. The development, screening and application of MNP markers have a good application basis in plants.
因此,亟需开发病原微生物用于鉴定结核分枝杆菌的MNP标记和检测引物。本发明所开发的标记和引物组合将用于制定病原体检测的国家标准(计划编号20201830-T-469),该国家标准将于2021年底发布。Therefore, there is an urgent need to develop MNP markers and detection primers for identifying Mycobacterium tuberculosis in pathogenic microorganisms. The combination of markers and primers developed in the present invention will be used to formulate national standards for pathogen detection (plan number 20201830-T-469), which will be released by the end of 2021.
发明内容Contents of the invention
本发明目的是提供一种结核分枝杆菌的MNP标记组合、引物对组合、试剂盒及其应用,可以对结核分枝杆菌进行鉴定和变异检测,具有多靶标、高通量、高灵敏和精细分型的效果。The purpose of the present invention is to provide a MNP marker combination of Mycobacterium tuberculosis, a primer pair combination, a kit and its application, which can identify and detect mutations of Mycobacterium tuberculosis, and have multi-target, high-throughput, high-sensitivity and precision The typing effect.
在本发明的第一方面,提供了一种结核分枝杆菌的MNP标记组合,所述MNP标记组合是指在结核分枝杆菌基因组上筛选的区分于其他物种且在物种内部具有多个核苷酸多态性的基因组区域,包括结核分枝杆菌参考序列Genebank编号为AL123456序列上MNP-1~MNP-17的17个标记。In the first aspect of the present invention, a MNP marker combination of Mycobacterium tuberculosis is provided, which refers to the MNP marker combination that is screened on the Mycobacterium tuberculosis genome to distinguish it from other species and have multiple nucleosides within the species. Genomic regions with acid polymorphisms, including 17 markers from MNP-1 to MNP-17 on the Mycobacterium tuberculosis reference sequence Genebank number AL123456.
上述技术方案中,MNP-1~MNP-17标记的核苷酸序列具体如NO.1-SEQ ID NO.17所示。In the above technical scheme, the nucleotide sequence marked by MNP-1-MNP-17 is specifically shown in NO.1-SEQ ID NO.17.
说明书表1对其进一步地进行了解释,其中标注的所述MNP标记的起始和终止位置是基于AL123456序列确定的。It is further explained in Table 1 of the specification, wherein the starting and ending positions of the marked MNP markers are determined based on the sequence of AL123456.
在本发明的第二方面,提供了一种用于检测所述MNP标记组合的多重PCR引物对组合,所述多重PCR引物对组合包括17对引物,具体的引物对组合核苷酸序列如SEQ ID NO.18-SEQ ID NO.51所示,其中ID NO.18-SEQ ID NO.34为上引物,ID NO.35-SEQ ID NO.51为下引物。In a second aspect of the present invention, there is provided a combination of multiple PCR primer pairs for detecting the MNP marker combination, the multiple PCR primer pair combination includes 17 pairs of primers, and the specific primer pair combination nucleotide sequence is as SEQ As shown in ID NO.18-SEQ ID NO.51, wherein ID NO.18-SEQ ID NO.34 is the upper primer, and ID NO.35-SEQ ID NO.51 is the lower primer.
上述技术方案中,每个MNP标记的引物包括上引物和下引物,具体如说明书表1所示。In the above technical scheme, each MNP-labeled primer includes an upper primer and a lower primer, as shown in Table 1 of the specification.
在本发明的第三方面,提供了一种用于检测所述结核分枝杆菌MNP标记组合的检测试 剂盒,所述试剂盒包括所述的引物对组合;进一步地,所述试剂盒还包括多重PCR预混液。以及所述的标记组合、引物对组合和试剂盒在非疹断目的的结核分枝杆菌检测中的应用,在制备用于结核分枝杆菌的检测产品中的应用。In a third aspect of the present invention, a detection kit for detecting the MNP marker combination of Mycobacterium tuberculosis is provided, the kit includes the primer pair combination; further, the kit also includes Multiplex PCR Master Mix. As well as the application of the marker combination, primer pair combination and kit in the detection of Mycobacterium tuberculosis for non-diagnostic purposes, and in the preparation of detection products for Mycobacterium tuberculosis.
在本发明的第四方面,提供了所述的结核分枝杆菌的MNP标记组合或者所述的多重PCR引物对组合或者所述的检测试剂盒在结核分枝杆菌的鉴定、DNA指纹数据库的构建、遗传变异检测中的应用。In the fourth aspect of the present invention, there is provided the MNP marker combination of Mycobacterium tuberculosis or the combination of multiple PCR primers or the detection kit in the identification of Mycobacterium tuberculosis and the construction of DNA fingerprint database , Application in genetic variation detection.
以上所述的应用中,首先是获取待测样本的细菌总DNA;利用本发明的试剂盒对所述总DNA和空白对照进行第一轮多重PCR扩增,循环数不高于25个;对扩增产物进行纯化后,进行基于第二轮PCR扩增的样本标签和二代测序接头添加;对第二轮扩增产物纯化后定量;检测多个菌株时通过将第二轮扩增产物等量混合后进行高通量测序;测序结果比对到所述的结核分枝杆菌的参考序列上,获取在所述总DNA的检测序列数目和基因型数据。根据在所述总DNA和所述空白对照获得的结核分枝杆菌测序序列数量和检出MNP标记的数目,对所述总DNA的测序数据进行数据质量控制和数据分析,获得检出MNP标记数目、覆盖每个所述MNP标记的测序序列数目和所述MNP标记基因型数据。In the above-mentioned application, at first be to obtain the bacterial total DNA of the sample to be tested; Utilize the kit of the present invention to carry out the first round of multiplex PCR amplification to the total DNA and the blank control, and the number of cycles is not higher than 25; After the amplified product is purified, add sample tags and next-generation sequencing adapters based on the second round of PCR amplification; quantify the second-round amplified product after purification; detect multiple strains by combining the second-round amplified product, etc. High-throughput sequencing is carried out after the amounts are mixed; the sequencing results are compared to the reference sequence of Mycobacterium tuberculosis, and the number of detected sequences and genotype data in the total DNA are obtained. According to the number of Mycobacterium tuberculosis sequencing sequences and the number of detected MNP markers obtained in the total DNA and the blank control, data quality control and data analysis are performed on the sequencing data of the total DNA to obtain the number of detected MNP markers , the number of sequencing sequences covering each of the MNP markers and the genotype data of the MNP markers.
当用于结核分枝杆菌鉴定时,根据在待测样品和空白对照中检出的结核分枝杆菌的测序序列数量和检出MNP位点的数目,进行质控后判定待测样品中是否含有结核分枝杆菌的核酸。其中,所述的质控方案和判定方法是以拷贝数已知的结核分枝杆菌的DNA为检测样本,评估所述试剂盒检测结核分枝杆菌的灵敏度、准确性和特异性,制定所述试剂盒检测结核分枝杆菌时的质控方案和判定方法。When used for the identification of Mycobacterium tuberculosis, according to the number of sequencing sequences of Mycobacterium tuberculosis detected in the sample to be tested and the blank control and the number of detected MNP sites, after quality control, it is determined whether the sample to be tested contains Nucleic acid of Mycobacterium tuberculosis. Wherein, the quality control scheme and determination method is to use the DNA of Mycobacterium tuberculosis with known copy number as the detection sample, evaluate the sensitivity, accuracy and specificity of the kit to detect Mycobacterium tuberculosis, formulate the The quality control scheme and determination method when the kit detects Mycobacterium tuberculosis.
当用于结核分枝杆菌遗传变异检测时,包括菌株间和菌株内部的遗传变异检测。菌株间的遗传变异检测包括利用所述的试剂盒和方法,获得待比较菌株各自在17个MNP标记的基因型数据。通过基因型比对,分析待比较菌株在所述17个MNP标记上的主基因型是否存在差异。若待比较菌株在至少一个MNP标记的主基因型存在变异,则判定两者存在遗传变异。作为一种备选方案,也可以通过单重PCR对待比较菌株的17个标记分别进行扩增,然后对扩增产物进行Sanger测序,获得序列后,对待比较菌株每个MNP标记的基因型进行比对。如果存在主基因型不一致的MNP标记,则待比较菌株之间存在变异。当检测菌株内部的遗传变异时,则通过统计模型判定在待测菌株所述的MNP标记是否检出主基因型以外的次基因型。若待测菌株在至少一个MNP标记存在次基因型,则判定待测菌株内部存在遗传变异。When used for the detection of genetic variation in Mycobacterium tuberculosis, it includes the detection of genetic variation between strains and within strains. The detection of genetic variation between strains includes using the above kit and method to obtain the genotype data of each of the 17 MNP markers of the strains to be compared. Through genotype comparison, analyze whether there are differences in the main genotypes of the strains to be compared on the 17 MNP markers. If there is a variation in at least one main genotype of the MNP marker in the strain to be compared, it is determined that there is a genetic variation between the two. As an alternative, single-plex PCR can also be used to amplify the 17 markers of the strains to be compared, and then perform Sanger sequencing on the amplified products. After obtaining the sequence, compare the genotypes of each MNP marker of the strains to be compared. right. If there are MNP markers with inconsistent main genotypes, there is variation among the strains to be compared. When detecting the genetic variation within the strain, it is judged by statistical model whether the MNP markers in the strain to be tested detect sub-genotypes other than the main genotype. If the test strain has a subgenotype in at least one MNP marker, it is determined that there is genetic variation within the test strain.
当用于构建结核分枝杆菌DNA指纹数据库时,将从样本中鉴定的结核分枝杆菌的所述 MNP标记的基因型数据,录入数据库文件,构成结核分枝杆菌的DNA指纹数据库;每次鉴定不同的样本时,通过和所述结核分枝杆菌的DNA指纹数据库比对,鉴定样本中的结核分枝杆菌是否和数据库中的菌株在所述MNP标记存在主基因型(在一个MNP标记具有超过50%测序片段支持的基因型)的差异,在至少1个MNP标记存在主基因型差异的结核分枝杆菌即为新的变异类型,收录进DNA指纹数据库。When used to construct the DNA fingerprint database of Mycobacterium tuberculosis, the genotype data of the MNP marker of the Mycobacterium tuberculosis identified from the sample is entered into the database file to constitute the DNA fingerprint database of Mycobacterium tuberculosis; When different samples, by comparing with the DNA fingerprint database of Mycobacterium tuberculosis, identify whether the Mycobacterium tuberculosis in the sample and the bacterial strain in the database have the main genotype in the MNP marker (with more than one MNP marker in one MNP marker). The genotype supported by 50% of the sequenced fragments) difference, Mycobacterium tuberculosis with main genotype difference in at least one MNP marker is a new variant type, which is included in the DNA fingerprint database.
当用于结核分枝杆菌分型时,是对待测样本中的结核分枝杆菌进行鉴定,获得每个所述MNP位点的基因型;收集网上公开的结核分枝杆菌的基因组序列和已构建的结核分枝杆菌DNA指纹数据库组成结核分枝杆菌参考序列库;将待测样本中结核分枝杆菌的基因型和所述结核分枝杆菌的参考序列库进行比对,筛选遗传上一致或最接近的菌株,获得待测样本中结核分枝杆菌的分型。根据同所述参考序列库的比对结果,鉴定样品中的结核分枝杆菌是已有的型还是新的变型,实现对结核分枝杆菌的精细分型。When used for Mycobacterium tuberculosis typing, it is to identify the Mycobacterium tuberculosis in the sample to be tested, and obtain the genotype of each MNP site; collect the genome sequence of Mycobacterium tuberculosis published on the Internet and the constructed The Mycobacterium tuberculosis DNA fingerprint database constitutes the Mycobacterium tuberculosis reference sequence library; compare the genotype of Mycobacterium tuberculosis in the sample to be tested with the reference sequence library of Mycobacterium tuberculosis, and screen for genetically consistent or most For close strains, the type of Mycobacterium tuberculosis in the sample to be tested is obtained. According to the comparison result with the reference sequence library, it is identified whether the Mycobacterium tuberculosis in the sample is an existing type or a new variant, so as to realize fine typing of Mycobacterium tuberculosis.
与现有技术相比,本发明具有以下优点:Compared with the prior art, the present invention has the following advantages:
本发明提供了一种结核分枝杆菌的MNP标记组合、引物对组合、试剂盒及其应用。所提供的结核分枝杆菌的17个MNP标记和其引物组合,可进行多重PCR扩增,融合二代测序平台进行扩增产物的测序,满足对结合分枝杆菌进行高通量、高效率、高准确性、高灵敏度和免培养检测的需求,满足结核分枝杆菌标准的、可共享的指纹数据库构建的需求;满足准确检测结核分枝杆菌菌株间遗传变异的需求和鉴定结核分枝杆菌纯合和杂合的需求。The invention provides an MNP marker combination of Mycobacterium tuberculosis, a primer pair combination, a kit and applications thereof. The provided 17 MNP markers of Mycobacterium tuberculosis and their primer combinations can be used for multiplex PCR amplification, combined with the next-generation sequencing platform to sequence the amplified products, meeting the high-throughput, high-efficiency, The requirements for high accuracy, high sensitivity and culture-free detection meet the requirements for the construction of a standard and shareable fingerprint database for Mycobacterium tuberculosis; meet the requirements for accurate detection of genetic variation between strains of Mycobacterium tuberculosis and the identification of pure Mycobacterium tuberculosis Synthetic and heterozygous needs.
本发明在结核分枝杆菌领域属于首创,并未见相关文献报道;MNP标记主要基于参考序列开发,根据已报道的结核分枝杆菌代表小种的重测序数据可以挖掘大规模的区分于其他物种、在结核分枝杆菌物种内部多态、两侧序列保守的MNP标记;通过MNP标记两侧的保守序列可以设计适用于于多重PCR扩增的MNP标记检测引物;再根据标准品的测试结果,可筛选出一套多态性最大、特异性高的一套MNP标记兼容性最好的引物组合以及检测方法,并用于结核分枝杆菌的检测、DNA指纹图谱构建,菌株内和菌株间遗传变异检测以及其他相关应用中,为结核分枝杆菌的科学研究、科学监测和防治提供技术支撑。The present invention is the first in the field of Mycobacterium tuberculosis, and there are no relevant literature reports; MNP markers are mainly developed based on reference sequences, and large-scale identification of Mycobacterium tuberculosis from other species can be mined based on the resequencing data of the reported representative races of Mycobacterium tuberculosis , MNP markers with polymorphic and conserved sequences on both sides of Mycobacterium tuberculosis species; MNP marker detection primers suitable for multiplex PCR amplification can be designed through the conserved sequences on both sides of the MNP marker; then according to the test results of the standard, A set of primer combinations and detection methods with the largest polymorphism and high specificity and the best MNP marker compatibility can be screened out, and used for the detection of Mycobacterium tuberculosis, the construction of DNA fingerprints, and the genetic variation within and between strains In detection and other related applications, it provides technical support for scientific research, scientific monitoring and prevention of Mycobacterium tuberculosis.
图1为MNP标记多态性原理图;Figure 1 is a schematic diagram of MNP marker polymorphism;
图2为结核分枝杆菌MNP标记的筛选和引物设计流程图;Fig. 2 is the flow chart of screening and primer design of Mycobacterium tuberculosis MNP marker;
图3为MNP标记的检测流程图;Fig. 3 is the detection flowchart of MNP mark;
为了便于理解本发明,下面结合附图和具体实施例,对本发明进行更详细的说明。附 图中给出了本发明的较佳的实施例。但是,本发明可以以许多不同的形式来实现,并不限于本说明书所描述的实施例。相反地,提供这些实施例的目的是使对本发明的公开内容的理解更加透彻全面。In order to facilitate the understanding of the present invention, the present invention will be described in more detail below in conjunction with the accompanying drawings and specific embodiments. Preferred embodiments of the invention are shown in the accompanying drawings. However, the present invention can be implemented in many different forms and is not limited to the embodiments described in this specification. On the contrary, these embodiments are provided to make the understanding of the disclosure of the present invention more thorough and comprehensive.
需要说明的是,除非另有定义,本说明书所使用的所有的技术和科学术语与属于本发明的技术领域的技术人员通常理解的含义相同。在本发明的说明书中所使用的术语只是为了描述具体的实施例的目的,不是用于限制本发明。It should be noted that, unless otherwise defined, all technical and scientific terms used in this specification have the same meaning as commonly understood by those skilled in the technical field of the present invention. Terms used in the description of the present invention are only for the purpose of describing specific embodiments, and are not used to limit the present invention.
除非另有特别说明,本发明实施例中用到的各种原材料、试剂、仪器和设备等,均可通过市场购买得到或者可通过现有方法制备得到。Unless otherwise specified, various raw materials, reagents, instruments and equipment used in the examples of the present invention can be purchased from the market or prepared by existing methods.
实施例1结核分枝杆菌MNP标记组合的筛选和多重PCR扩增引物的设计Example 1 The screening of Mycobacterium tuberculosis MNP marker combination and the design of multiple PCR amplification primers
S1、结核分枝杆菌MNP标记组合的筛选S1, Screening of Mycobacterium tuberculosis MNP marker combinations
基于网上公开的501个结核分枝杆菌不同分离株的基因组序列,通过序列比对,获得17个MNP标记。对于网上不存在基因组数据的物种,也可以通过高通量测序获得待检测微生物物种代表小种的基因组序列信息,其中高通量测序可以是全基因组或简化基因组测序。为了保证所筛选标记的多态性,一般使用至少10个分离株的基因组序列作为参考。Based on the genome sequences of 501 different isolates of Mycobacterium tuberculosis published online, 17 MNP markers were obtained through sequence alignment. For species that do not have genomic data on the Internet, the genome sequence information of representative races of the microbial species to be detected can also be obtained through high-throughput sequencing, where high-throughput sequencing can be whole genome or simplified genome sequencing. In order to ensure the polymorphism of the selected markers, the genome sequences of at least 10 isolates are generally used as references.
筛选的17个MNP标记如表1所示:The 17 MNP markers screened are shown in Table 1:
表1所述MNP标记以及检测引物在参考基因组上的起始位置The MNP markers described in Table 1 and the starting positions of the detection primers on the reference genome
所述步骤S1具体包括:The step S1 specifically includes:
选择所述结核分枝杆菌的一个所述代表小种作为参考基因组,将所述基因组序列和所述参考基因组进行序列比对,获得所述结核分枝杆菌的单核苷酸多态性标记;selecting one of the representative races of the Mycobacterium tuberculosis as a reference genome, and comparing the genome sequence with the reference genome to obtain the single nucleotide polymorphism marker of the Mycobacterium tuberculosis;
在所述参考基因组上,以100-300bp为窗口,以1bp为步长进行窗口平移,筛选获得多个候选MNP标记区域,其中,所述候选MNP标记区域含有≥2个所述单核苷酸变异标记,且两端各30bp的序列上均不存在所述单核苷酸多态性标记在所述候选多核苷酸多态性标记区域中筛选区分度DP≥0.2的区域作为MNP标记;其中,DP=d/t,t是在所述候选多核苷酸多态性标记区域中所有小种两两比较时的比较对数,d是在所述候选多核苷酸多态性标记区域中至少两个单核苷酸多态性差异的样品对数。On the reference genome, use 100-300bp as the window, and perform window translation with 1bp as the step size, and screen to obtain multiple candidate MNP marker regions, wherein the candidate MNP marker regions contain ≥ 2 of the single nucleotides Variation markers, and the single nucleotide polymorphism markers do not exist on the 30bp sequences at both ends. In the region of the candidate polynucleotide polymorphism markers, the regions with a discrimination degree DP≥0.2 are selected as MNP markers; wherein , DP=d/t, t is the comparison logarithm of all races in the region of the candidate polynucleotide polymorphism marker, and d is at least Sample logarithm of the difference between two SNPs.
作为一种可选的实施方式,在所述参考基因组上,以100-300bp为窗口进行筛选时,也可选用其他步长,本实施方式采用步长为1bp,有利于全面的筛选。As an optional implementation, other step lengths can also be used when screening on the reference genome with a window of 100-300 bp. In this embodiment, the step size is 1 bp, which is conducive to comprehensive screening.
S2、多重PCR扩增引物的设计S2, the design of multiple PCR amplification primers
通过引物设计软件设计所述MNP标记的多重PCR扩增引物,引物设计遵循引物间互不干扰,所有引物可以组合成引物池进行多重PCR扩增,即所有设计的引物可以在一个扩增反应中均正常扩增。The multiple PCR amplification primers labeled with MNP are designed by primer design software. The primer design follows that the primers do not interfere with each other. All primers can be combined into a primer pool for multiple PCR amplification, that is, all designed primers can be used in one amplification reaction. normal expansion.
该实施方式中,用于鉴定所述MNP标记的引物,如表1及SEQ ID NO.18-SEQ ID NO.51所示。In this embodiment, the primers used to identify the MNP marker are shown in Table 1 and SEQ ID NO.18-SEQ ID NO.51.
S3、引物组合的检测效率评估S3. Evaluation of detection efficiency of primer combinations
所述MNP标记的检测方法是采用适用于多重PCR的PCR预混液,通过多重PCR对所有MNP标记一次性进行扩增,通过二代高通量测序对扩增产物进行测序,对测序数据进行分析,根据检出的标记评价所述引物组合的兼容性。The method for detecting the MNP markers is to use a PCR master mix suitable for multiplex PCR, amplify all MNP markers at one time through multiplex PCR, sequence the amplified products through second-generation high-throughput sequencing, and analyze the sequencing data , evaluating the compatibility of the primer combination based on the detected markers.
使用购买的拷贝数已知的结核分枝杆菌DNA标准品(货物编号:BDS-BW-047,广州邦德盛生物科技有限公司),制备结核分枝杆菌模拟样本,即将拷贝数已知的结核分枝杆菌标准品加入到人基因组DNA(2ng/反应)中,制备成1000拷贝/反应的模板,使用所述的引物组合,通过所述的MNP标记检测方法进行重复检测。依据物种区分度高、物种特异的MNP标记筛选原则和高效扩增、扩增稳定的引物筛选等原则,最终筛选出本发明提供的17个MNP标记及其检测引物对组合。Use the purchased Mycobacterium tuberculosis DNA standard with known copy number (article number: BDS-BW-047, Guangzhou Bangdesheng Biotechnology Co., Ltd.) to prepare a mock sample of Mycobacterium tuberculosis, that is, tuberculosis with known copy number The mycobacterium standard was added to human genomic DNA (2ng/reaction) to prepare a template with 1000 copies/reaction, and the primer combination was used to perform repeated detection by the MNP marker detection method. Based on the principles of high species discrimination and species-specific MNP marker screening and high-efficiency amplification and stable amplification primer screening principles, 17 MNP markers and their detection primer pair combinations provided by the present invention were finally screened out.
实施例2所述MNP标记组合和试剂盒鉴定结核分枝杆菌的性能评估和阈值制定Performance evaluation and threshold formulation of MNP marker combination and kit for identifying Mycobacterium tuberculosis described in Example 2
本实施例中,使用购买的拷贝数已知的结核分枝杆菌DNA标准品(货物编号:BDS-BW-047,广州邦德盛生物科技有限公司),将所述结核分枝杆菌的DNA加入到人基因组DNA中,制备1拷贝/反应、10拷贝/反应和100拷贝/反应的结核分枝杆菌模拟样本。同时设置等体积的无菌水作为空白对照。本实施例共计有4个样本,每个样本每天构建3个重复文库,连续检测4天,即每个样本获得12组测序数据,具体如表2所示。根据12次重复实验中,在空白对照和结核分枝杆菌模拟样本中检出的结核分枝杆菌MNP标记的测序片段数目和标记数目,评估检测方法的重现性、准确性、灵敏度,制定质控体系污染和目标病原体检出的阈值。In this example, the DNA of Mycobacterium tuberculosis was added into Into human genomic DNA, prepare 1 copy/reaction, 10 copies/reaction, and 100 copies/reaction of M. tuberculosis mock samples. At the same time, an equal volume of sterile water was set as a blank control. In this embodiment, there are 4 samples in total, and 3 repeated libraries are constructed for each sample every day, and the detection is performed continuously for 4 days, that is, 12 sets of sequencing data are obtained for each sample, as shown in Table 2. According to the number of sequencing fragments and the number of markers of Mycobacterium tuberculosis MNP markers detected in the blank control and Mycobacterium tuberculosis mock samples in 12 repeated experiments, the reproducibility, accuracy and sensitivity of the detection method were evaluated, and the quality Thresholds for controlling system contamination and detection of target pathogens.
MNP标记的检测流程如图3所示。The detection process of MNP markers is shown in Figure 3.
表2所述MNP标记组合和试剂盒鉴定结核分枝杆菌的灵敏度、稳定性分析Sensitivity and stability analysis of MNP marker combinations and kits for identifying Mycobacterium tuberculosis described in Table 2
如表2所示,在1拷贝/反应的12组数据中,能检出1-2个MNP位点,而在空白对照 中部分也能检出1个位点;在10拷贝/反应的12组数据中,能稳定的检出至少7个MNP标记,远远的高于空白对照中检出的MNP位点数目,表明所述试剂盒能够稳定、灵敏地检测低至10拷贝/反应的结核分枝杆菌。As shown in Table 2, in the 12 sets of data of 1 copy/reaction, 1-2 MNP sites can be detected, and in the blank control part can also detect 1 site; in the 12 sets of 10 copies/reaction In the group data, at least 7 MNP markers can be stably detected, which is much higher than the number of MNP sites detected in the blank control, indicating that the kit can stably and sensitively detect tuberculosis as low as 10 copies/reaction mycobacteria.
2、MNP标记和试剂盒检测结核分枝杆菌的重现性和准确性评估2. Evaluation of the reproducibility and accuracy of MNP markers and kits for detecting Mycobacterium tuberculosis
基于两次重复中,共同检出标记的基因型是否可重现,评估MNP标记检测方法检测结核分枝杆菌的重现性和准确性。具体地,对100拷贝样品的12组数据分别进行两两比较,结果如表3所示,主基因型存在差异的MNP标记数目都为0;依据2次重复实验间可重现的基因型认为是准确的原则,准确率a=1-(1-r)/2=0.5+0.5r,r代表重现率,即主基因型可重现的标记数目占共有标记数目的比率。本项目重现性试验中每个样品不同文库间、不同建库批次间MNP标记主基因型的差异对数为0,重现率r=100%,准确率a=100%。Based on whether the genotypes of the co-detected markers were reproducible in two replicates, the reproducibility and accuracy of the MNP marker detection method for detecting Mycobacterium tuberculosis were evaluated. Specifically, 12 sets of data of 100 copies of samples were compared in pairs. The results are shown in Table 3. The number of MNP markers with differences in main genotypes is 0; It is the principle of accuracy, the accuracy rate a=1-(1-r)/2=0.5+0.5r, r represents the recurrence rate, that is, the ratio of the number of reproducible markers of the main genotype to the number of common markers. In the reproducibility test of this project, the logarithm of the difference of the main genotype of the MNP marker between different libraries and different database construction batches of each sample is 0, the reproducibility rate r=100%, and the accuracy rate a=100%.
表3结核分枝杆菌MNP标记检出方法的重现性和准确率评估Table 3 Evaluation of reproducibility and accuracy of MNP marker detection methods for Mycobacterium tuberculosis
基于此,所述试剂盒能够准确、灵敏地检测低至10拷贝/反应的结核分枝杆菌。Based on this, the kit can accurately and sensitively detect Mycobacterium tuberculosis as low as 10 copies/reaction.
3、MNP标记检测试剂盒检出结核分枝杆菌的阈值判定3. Threshold determination of Mycobacterium tuberculosis detected by MNP marker detection kit
如表3所示,在1个拷贝/反应的样本中能检出比对到结核分枝杆菌的序列,至少覆盖1个MNP标记。而在部分空白对照中也检出了结核分枝杆菌的序列。由于MNP标记检测方法的极度灵敏,因此检测过中的数据污染容易导致假阳性的产生。因此本实例中制定如下质控方案。As shown in Table 3, the sequence aligned to Mycobacterium tuberculosis can be detected in 1 copy/reaction sample, covering at least 1 MNP marker. In some blank controls, the sequence of Mycobacterium tuberculosis was also detected. Due to the extreme sensitivity of the MNP marker detection method, data contamination during the detection process can easily lead to false positives. Therefore, the following quality control plan was formulated in this example.
质控方案具体如下:The quality control plan is as follows:
1)测序数据量大于5百万碱基。测算依据是每个样品检测MNP标记的数目是17个, 一条测序片段的长度是300个碱基,所以当数据量大于5百万碱基时,大部分样品一次实验可以保证覆盖每个标记的测序片段数量达到1000倍,保证对每个MNP标记碱基序列的精准分析。1) The amount of sequencing data is greater than 5 million bases. The calculation is based on the fact that the number of MNP markers detected in each sample is 17, and the length of a sequencing fragment is 300 bases. Therefore, when the data volume is greater than 5 million bases, most samples can be guaranteed to cover each marker in one experiment. The number of sequencing fragments reaches 1000 times, ensuring accurate analysis of the base sequence of each MNP marker.
2)根据测试样品中的结核分枝杆菌的信号指数S和空白对照中结核分枝杆菌的噪音指数P判定污染是否可接受,其中:2) Determine whether the pollution is acceptable according to the signal index S of Mycobacterium tuberculosis in the test sample and the noise index P of Mycobacterium tuberculosis in the blank control, wherein:
空白对照噪音指数P=nc/Nc,其中nc和Nc分别代表空白对照中,结核分枝杆菌的测序片段的数量。Blank control noise index P=nc/Nc, wherein nc and Nc respectively represent the number of sequencing fragments of Mycobacterium tuberculosis in the blank control.
测试样品的信号指数S=nt/Nt,其中nt和Nt分别代表测试样品中,结核分枝杆菌的测序片段的数量。The signal index of the test sample S=nt/Nt, wherein nt and Nt respectively represent the number of sequencing fragments of Mycobacterium tuberculosis in the test sample.
3)计算测试样品中MNP标记的检出率,指的是检出标记数和总设计标记数的比值。3) Calculate the detection rate of MNP markers in the test sample, which refers to the ratio of the number of detected markers to the total number of designed markers.
如表4所示,结核分枝杆菌在空白对照中的噪音指数平均值是0.1%,而在1个拷贝的样品中的信号指数平均值是0.3%,1个拷贝的样品和空白对照的信噪比的平均值是5.8,因此,本发明规定当信噪比大于10倍时,可判定检测体系中的污染是可接受的。As shown in Table 4, the mean value of the noise index of Mycobacterium tuberculosis in the blank control is 0.1%, while the mean value of the signal index in the sample of 1 copy is 0.3%, the signal of the sample of 1 copy and the blank control The average value of the noise ratio is 5.8. Therefore, the present invention stipulates that when the signal-to-noise ratio is greater than 10 times, it can be judged that the contamination in the detection system is acceptable.
如表4所示,在10个拷贝的样品和空白对照的信噪比的平均值是52.2,在10拷贝/反应的12组数据中,能稳定的检出至少7个MNP标记,占总标记的41.2%。因此,在保证准确性的情况下,本标准规定结核分枝杆菌的信噪比判定阈值是30,即当样品中结核分枝杆菌的信噪比大于30,且标记检出率大于等于40%时,判定样本中检出了结核分枝杆菌的核苷酸。As shown in Table 4, the average value of the signal-to-noise ratio of 10 copies of the sample and the blank control is 52.2, and in the 12 sets of data of 10 copies/reaction, at least 7 MNP markers can be stably detected, accounting for the total markers 41.2%. Therefore, in the case of ensuring accuracy, this standard stipulates that the signal-to-noise ratio threshold of Mycobacterium tuberculosis is 30, that is, when the signal-to-noise ratio of Mycobacterium tuberculosis in the sample is greater than 30, and the marker detection rate is greater than or equal to 40%. When , it is determined that the nucleotide of Mycobacterium tuberculosis has been detected in the sample.
表4 4个样本各12次检测中结核分枝杆菌的信噪比Table 4 The signal-to-noise ratio of Mycobacterium tuberculosis in each of the 12 detections of 4 samples
因此本发明所提供的试剂盒能灵敏的检测到10copy/反应的结核分枝杆菌。Therefore, the kit provided by the present invention can sensitively detect 10 copies/reaction Mycobacterium tuberculosis.
表33、MNP标记检测试剂盒检测结核分枝杆菌的特异性评估Table 33. Specificity evaluation of MNP marker detection kit for detecting Mycobacterium tuberculosis
人为的将结核分枝杆菌和不动杆菌属、腺病毒、炭疽杆菌、霍氏鲍特菌、百日咳博德特氏菌、肺炎衣原体、肺炎支原体、EB病毒、流感嗜血杆菌、水痘带状疱疹病毒、巨细胞病毒、单纯疱疹病毒、人博卡病毒、肺炎克雷伯杆菌、军团菌属、卡他莫拉菌、铜绿假单胞菌、立克次氏体属、金黄色葡萄球菌、肺炎链球菌、酿脓链球菌的DNA等摩尔量的混在一起,制备混合模板,以无菌水作为空白对照,采用本发明所提供的方法对混合模板进行 检测。进行3个重复实验,在3个重复实验中都能特异地检出结核分枝杆菌的17个MNP位点,信噪比分别为743.5,812.4和752.6。按照所述的质控方案和判定阈值进行分析后,判定检出结核分支杆菌的核酸,表明所述MNP标记和所述试剂盒在复杂模板中检测目标微生物的高特异性。Artificial Mycobacterium tuberculosis and Acinetobacter spp., Adenovirus, Bacillus anthracis, Bordetella holbachii, Bordetella pertussis, Chlamydia pneumoniae, Mycoplasma pneumoniae, Epstein-Barr virus, Haemophilus influenzae, varicella zoster Viruses, cytomegalovirus, herpes simplex virus, human bocavirus, Klebsiella pneumoniae, Legionella, Moraxella catarrhalis, Pseudomonas aeruginosa, Rickettsia, Staphylococcus aureus, pneumonia Streptococcus and Streptococcus pyogenes DNA are mixed together in equimolar amounts to prepare a mixed template, and sterile water is used as a blank control, and the method provided by the invention is used to detect the mixed template. Three repeated experiments were carried out, and 17 MNP sites of Mycobacterium tuberculosis could be specifically detected in the three repeated experiments, and the signal-to-noise ratios were 743.5, 812.4 and 752.6, respectively. After analysis according to the quality control scheme and the determination threshold, it was determined that the nucleic acid of Mycobacterium tuberculosis was detected, indicating that the MNP marker and the kit have high specificity in detecting target microorganisms in complex templates.
实施例3、相同命名结核分枝杆菌菌株间的遗传变异检测Example 3, Genetic variation detection between the same named Mycobacterium tuberculosis strains
在实验研究中,相同命名的菌株的变异将导致实验结果的不可比较和不可重现。利用所述的试剂盒和MNP标记组合检测方法对华中农业大学、武汉病毒所和石河子大学提供的共计10份结核分枝杆菌的减毒菌株BCG进行检测,样本依次命名为S1-S10,每个样品的测序平均覆盖倍数达2103倍,每个菌株平均可以检出全部17个MNP标记(表5)。将10个菌株的指纹图谱进行两两比对,结果如错误!未找到引用源。所示,实验室A和C各自保存的3份BCG拷贝的MNP指纹图谱保持一致,实验室B的4份拷贝中,有1份(S-5)和同批次一起检测的9份BCG均存在部分标记的主基因型差异(表6)。这样的基因型差异有可能导致使用此样本所获得的实验结果的不可重现,影响影响实验结果的交流和共享性。使用S-5菌株的产生的实验结果建议注明它的指纹图谱。In experimental research, the variation of strains with the same name will lead to incomparable and irreproducible experimental results. A total of 10 attenuated strains of Mycobacterium tuberculosis BCG provided by Huazhong Agricultural University, Wuhan Institute of Virology and Shihezi University were detected using the kit and MNP marker combination detection method, and the samples were named S1-S10 in turn, each The average sequencing coverage of the samples reached 2103 times, and all 17 MNP markers could be detected on average for each strain (Table 5). The fingerprints of 10 strains were compared in pairs, and the result was wrong! Reference source not found. As shown, the MNP fingerprints of the 3 copies of BCG stored in laboratories A and C are consistent, and among the 4 copies of laboratory B, 1 copy (S-5) and the 9 copies of BCG tested together with the same batch are consistent. There were major genotype differences for some markers (Table 6). Such genotype differences may lead to irreproducible experimental results obtained using this sample, affecting the communication and sharing of experimental results. It is recommended to note its fingerprint for the experimental results generated using the S-5 strain.
表5不同实验室相同命名菌株的检测分析Table 5 Detection and analysis of the same named strains in different laboratories
表6不同实验室相同命名菌株的检测分析Table 6 Detection and analysis of the same named strains in different laboratories
实施例4、结核分枝杆菌菌株内部的遗传变异检测Example 4, Detection of genetic variation within Mycobacterium tuberculosis strains
作为群体生物,结核分枝杆菌群体内部部分个体发生变异,使群体不再纯合,形成异质的杂合群体,影响尤其是试验用微生物表型的稳定性和一致性。这种变异体在对群体进行分子标记检测时,表现为标记的主基因型外的等位基因型。当变异个体还未累积时,只占群体的极少部分,表现为低频率的等位基因型。低频率的等位基因型往往和技术错误混在一起,导致现有技术难以区分。本发明检测的是高多态性的MNP标记。基于多个错误同时发生的几率低于一个错误发生的几率,MNP标记的技术错误率显著低于SNP标记。对结核分枝杆菌菌株内部遗传变异的检测,其实质是检测群体的MNP位点是否存在主基因型以外的次等位基因型,且判定所检出的次等位基因型的真实性。As a group organism, some individuals within the Mycobacterium tuberculosis group mutate, making the group no longer homozygous and forming a heterogeneous heterozygous group, which affects especially the stability and consistency of the microbial phenotype used in the test. Such variants appear as alleles outside of the marker's major genotype when the population is tested for molecular markers. When the variant individual has not yet accumulated, it only accounts for a very small part of the population, showing a low frequency allele type. Low-frequency allele types are often mixed with technical errors, making them difficult to distinguish with existing techniques. The present invention detects highly polymorphic MNP markers. The technical error rate of MNP markers is significantly lower than that of SNP markers, based on the fact that multiple errors are less likely to occur simultaneously than one error. The essence of the detection of genetic variation within Mycobacterium tuberculosis strains is to detect whether there is a minor allele other than the main genotype at the MNP site of the population, and to determine the authenticity of the detected minor allele.
本实施例次等位基因型的真实性评估按如下进行:首先按照以下规则排除具有链偏好性(在DNA双链上覆盖的测序序列数的比值)的等位基因型:链偏好性大于10倍,或者与主等位基因型的链偏好性之差大于5倍。The authenticity evaluation of the secondary allelic type in this embodiment is carried out as follows: first, the allelic type with strand preference (the ratio of the number of sequencing sequences covered on the DNA double strand) is excluded according to the following rules: the strand preference is greater than 10 times, or more than 5 times different from the strand preference of the main allele type.
不存在链偏好性的基因型基于表7测序序列数目和比例判定其真实性。表7列出了基于BINOM.INV函数计算在α=99.9999%的概率保障下,e
max(n=1)和e
max(n≥2)分别为1.03%和0.0994%时,在各个标记中次等位基因型测序序列数目的临界值,只有次等位基因型的测序序列数目超过临界值时判定为真实的次等位基因型。当存在多个候选次等位基因时,对各候选等位基因型的P值进行多重校正,FDR<0.5%的候选等位基因判定是真实的次等位基因型。
The authenticity of genotypes without strand preference was determined based on the number and ratio of sequenced sequences in Table 7. Table 7 lists the calculations based on the BINOM.INV function under the probability guarantee of α=99.9999%, when e max (n=1) and e max (n≥2) are 1.03% and 0.0994%, respectively, in each marker The critical value of the sequence number of the allele type, only when the sequence number of the secondary allele type exceeds the critical value, it is determined as the real secondary allele type. When there are multiple candidate minor alleles, the P value of each candidate allele type is multiple-corrected, and the candidate alleles with FDR<0.5% are judged to be true minor allele types.
表7涉及到的参数e
max(n=1)和e
max(n≥2)指的是携带n个SNP的错误等位基因的测序序列数占该标记总测序序列数的最高比例。本实施例汇总在10个结核分枝杆菌纯合菌株中检测到的MNP标记的所有次等位基因型的错误率,e
max(n=1)和e
max(n≥2)分别为1.03%和0.0994%。
The parameters e max (n=1) and e max (n≥2) involved in Table 7 refer to the highest ratio of the number of sequencing sequences carrying the wrong alleles of n SNPs to the total number of sequencing sequences of the marker. This embodiment summarizes the error rates of all minor allele types of MNP markers detected in 10 homozygous strains of Mycobacterium tuberculosis, and emax (n=1) and emax (n≥2) are 1.03% respectively and 0.0994%.
表7部分测序深度下进行判定次等位基因型的临界值Table 7 Partial sequencing depth to determine the critical value of the suballelic genotype
按照上述参数,将S-5的核苷酸按照以下8个比例1/1000,3/1000,5/1000,7/1000,1/100,3/100,5/100,7/100混入S-4的核苷酸中,制备人工杂合样本,每个样本检测3次重复,获得共计24个测序数据。通过和S-4,S-5的MNP标记的基因型进行精准比对,在24个人工杂合样本中均检测到了S-5的基因型,人工杂合样本物中S-5的浓度占比低至1/1000,说明了所开发的用于鉴定结核分枝杆菌的MNP标记检测方法在检测菌株群体内部低比例遗传变异个体的适用性。According to the above parameters, the nucleotides of S-5 are mixed into S according to the following 8 ratios: In -4 nucleotides, artificial heterozygous samples were prepared, and each sample was detected 3 times, and a total of 24 sequencing data were obtained. Through accurate comparison with the genotypes of MNP markers of S-4 and S-5, the genotype of S-5 was detected in 24 artificial heterozygous samples, and the concentration of S-5 in the artificial heterozygous samples accounted for The ratio was as low as 1/1000, illustrating the applicability of the developed MNP marker assay for the identification of M. tuberculosis to detect individuals with a low proportion of genetic variation within the strain population.
实施例5结核分枝杆菌DNA指纹数据库的构建The construction of embodiment 5 Mycobacterium tuberculosis DNA fingerprint database
利用常规CTAB法、商业化试剂盒等方法提取用于构建结核分枝杆菌DNA指纹数据库的所有菌株或是样本的DNA,采用琼脂糖凝胶和紫外分光光度计检测DNA的质量。若所提取的DNA在260nm与230nm处的吸光度值的比值大于2.0,260nm与280nm吸光度 值比值介于1.6与1.8之间,DNA电泳主带明显,无明显降解和RNA残留,则说明基因组DNA达到相关的质量要求,可进行后续实验。The DNA of all strains or samples used to construct the Mycobacterium tuberculosis DNA fingerprint database was extracted by conventional CTAB method and commercial kits, and the quality of DNA was detected by agarose gel and ultraviolet spectrophotometer. If the ratio of the absorbance value of the extracted DNA at 260nm to 230nm is greater than 2.0, the ratio of the absorbance value at 260nm to 280nm is between 1.6 and 1.8, the main band of DNA electrophoresis is obvious, and there is no obvious degradation and RNA residue, it means that the genomic DNA has reached Relevant quality requirements, follow-up experiments can be carried out.
利用所述的试剂盒,获得了S1-S10BCG菌株的MNP指纹图谱。将10个菌株的指纹图谱进行两两比对,将存在至少1个MNP指纹差异的菌株的指纹图谱录入数据库文件,构成结核分支杆菌的MNP指纹数据库;在每次新的检测中获得的菌株的MNP指纹图谱,均执行同已构建的MNP指纹数据库进行比对,将主基因型存在差异的菌株的MNP指纹图谱录入构建的MNP指纹数据库,不断更新和充实所构建的指纹数据库。另外,所构建的MNP指纹数据库基于检测的菌株的基因序列,因此和所有的高通量测序数据兼容,具有完全可共建共享的特征。Using the kit, the MNP fingerprints of the S1-S10BCG strains were obtained. Compare the fingerprints of 10 strains pairwise, and enter the fingerprints of strains with at least one MNP fingerprint difference into the database file to form the MNP fingerprint database of Mycobacterium tuberculosis; The MNP fingerprints are compared with the established MNP fingerprint database, and the MNP fingerprints of the strains with different main genotypes are entered into the constructed MNP fingerprint database, and the constructed fingerprint database is continuously updated and enriched. In addition, the constructed MNP fingerprint database is based on the gene sequences of the detected strains, so it is compatible with all high-throughput sequencing data and has the characteristics of being completely co-constructed and shared.
实施例6、在结核分枝杆菌精细分型中的应用Embodiment 6, application in fine typing of Mycobacterium tuberculosis
首先构建结核分枝杆菌的参考序列库,由已经公开的结核分枝杆菌的基因组序列和已构建的结核分枝杆菌的DNA指纹数据库组成;利用实施例2所述的引物组合和MNP标记位点检测方法,获得每个待测样品中结核分枝杆菌的MNP指纹图谱;将每个菌株的DNA指纹图谱同构建的参考序列库进行比对,筛选获得和序列库中遗传距离最接近的菌株;和已有菌株的基因型100%相同的,为已有的变型,在至少一个MNP位点存在主基因型差异的,为新的变型,实现对结核分枝杆菌的精细分型。First construct the reference sequence library of Mycobacterium tuberculosis, which consists of the published genome sequence of Mycobacterium tuberculosis and the constructed DNA fingerprint database of Mycobacterium tuberculosis; use the primer combination and MNP marker site described in Example 2 The detection method is to obtain the MNP fingerprint of Mycobacterium tuberculosis in each sample to be tested; compare the DNA fingerprint of each strain with the constructed reference sequence library, and screen to obtain the strain with the closest genetic distance to the sequence library; If the genotype is 100% identical to the existing strain, it is an existing variant, and if there is a main genotype difference in at least one MNP site, it is a new variant, and the fine typing of Mycobacterium tuberculosis can be achieved.
如表4错误!未找到引用源。所示的10个菌株的基因型分析结果,S-5和其他9份菌株在5个MNP标记存在主基因型不同,将10个菌株分为2个型。因此,所述的方法对结合分枝杆菌基因型的分辨率达到了单碱基的水平,可以实现对样本中结核分枝杆菌进行精细分型。Such as Table 4 error! Reference source not found. In the genotype analysis results of the 10 strains shown, S-5 and the other 9 strains were different in the main genotypes of 5 MNP markers, and the 10 strains were divided into 2 types. Therefore, the method can achieve the single-base level of resolution of combined mycobacterium genotypes, and can realize fine typing of Mycobacterium tuberculosis in samples.
最后,还需要说明的是,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。Finally, it should also be noted that the term "comprises", "comprises" or any other variation thereof is intended to cover a non-exclusive inclusion such that a process, method, article or apparatus comprising a set of elements includes not only those elements, but also Other elements not expressly listed, or inherent to the process, method, article, or apparatus are also included.
尽管已描述了本发明实施例的优选实施例,但本领域内的技术人员一旦得知了基本创造性概念,则可对这些实施例作出另外的变更和修改。所以,所附权利要求意欲解释为包括优选实施例以及落入本发明实施例范围的所有变更和修改。Having described preferred embodiments of embodiments of the present invention, additional changes and modifications to these embodiments can be made by those skilled in the art once the basic inventive concept is appreciated. Therefore, the appended claims are intended to be construed to cover the preferred embodiment and all changes and modifications which fall within the scope of the embodiments of the present invention.
显然,本领域的技术人员可以对本发明实施例进行各种改动和变型而不脱离本发明实施例的精神和范围。这样,倘若本发明实施例的这些修改和变型属于本发明实施例权利要求及其等同技术的范围之内,则本发明实施例也意图包含这些改动和变型在内。Apparently, those skilled in the art can make various changes and modifications to the embodiments of the present invention without departing from the spirit and scope of the embodiments of the present invention. In this way, if the modifications and variations of the embodiments of the present invention fall within the scope of the claims of the embodiments of the present invention and equivalent technologies thereof, the embodiments of the present invention are also intended to include these modifications and variations.
Claims (9)
- 一种结核分枝杆菌的MNP标记组合,其特征在于,所述MNP标记组合包括17个标记,具体的核苷酸序列如SEQ ID NO.1-SEQ ID NO.17所示。A MNP marker combination of Mycobacterium tuberculosis, characterized in that the MNP marker combination includes 17 markers, and the specific nucleotide sequences are shown in SEQ ID NO.1-SEQ ID NO.17.
- 一种用于检测权利要求1所述结核分枝杆菌MNP标记组合的多重PCR引物对组合,其特征在于,所述多重PCR引物对组合包括17对引物,具体的引物对组合核苷酸序列如SEQ ID NO.18-SEQ ID NO.51所示。A multiplex PCR primer pair combination for detecting the Mycobacterium tuberculosis MNP marker combination described in claim 1, characterized in that the multiplex PCR primer pair combination includes 17 pairs of primers, and the specific primer pair combination nucleotide sequence is as follows: Shown in SEQ ID NO.18-SEQ ID NO.51.
- 一种用于检测权利要求1所述结核分枝杆菌MNP标记组合的检测试剂盒,其特征在于,所述检测试剂盒包括权利要求2所述的引物对组合。A detection kit for detecting the MNP marker combination of Mycobacterium tuberculosis according to claim 1, characterized in that the detection kit comprises the primer pair combination according to claim 2.
- 根据权利要求3所述的检测试剂盒,其特征在于,所述检测试剂盒还包括多重PCR预混液。The detection kit according to claim 3, characterized in that, the detection kit also includes a multiplex PCR master mix.
- 权利要求1所述的结核分枝杆菌MNP标记组合或权利要求2所述的引物对组合或权利要求3-4任一所述的检测试剂盒在非诊断目的的结核分枝杆菌检测中的应用。Application of the Mycobacterium tuberculosis MNP marker combination described in claim 1 or the primer pair combination described in claim 2 or the detection kit described in any one of claims 3-4 in the detection of Mycobacterium tuberculosis for non-diagnostic purposes .
- 权利要求1所述的结核分枝杆菌的MNP标记组合或权利要求2所述的引物对组合或权利要求3-4任一所述的检测试剂盒在制备用于结核分枝杆菌的检测产品中的应用。The MNP marker combination of mycobacterium tuberculosis described in claim 1 or the primer pair combination described in claim 2 or the detection kit described in any one of claims 3-4 is used in the detection product of preparing mycobacterium tuberculosis Applications.
- 权利要求1所述的结核分枝杆菌的MNP标记组合或权利要求2所述的引物对组合或权利要求3-4任一所述的检测试剂盒在在检测结核分枝杆菌菌株内部和菌株间遗传变异中的应用。The MNP marker combination of Mycobacterium tuberculosis described in claim 1 or the primer pair combination described in Claim 2 or the detection kit described in any one of Claims 3-4 are used in the detection of Mycobacterium tuberculosis strains inside and between bacterial strains Applications in Genetic Variation.
- 权利要求1所述的结核分枝杆菌的MNP标记组合或权利要求2所述的引物对组合或权利要求3-4任一所述的检测试剂盒在构建结核分枝杆菌数据库中的应用。Application of the MNP marker combination of Mycobacterium tuberculosis described in claim 1 or the primer pair combination described in Claim 2 or the detection kit described in any one of claims 3-4 in the construction of Mycobacterium tuberculosis database.
- 权利要求1所述的结核分枝杆菌的MNP标记组合或权利要求2所述的引物对组合或权利要求3-4任一所述的检测试剂盒在结核分枝杆菌精细分型检测中的应用。The application of the MNP marker combination of Mycobacterium tuberculosis described in claim 1 or the combination of primers described in Claim 2 or the detection kit described in any one of claims 3-4 in the detection of Mycobacterium tuberculosis fine typing .
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2021/129160 WO2023077482A1 (en) | 2021-11-06 | 2021-11-06 | Combination of mnp markers of mycobacterium tuberculosis, primer pair combination, kit, and uses of combination, primer pair combination and kit |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2021/129160 WO2023077482A1 (en) | 2021-11-06 | 2021-11-06 | Combination of mnp markers of mycobacterium tuberculosis, primer pair combination, kit, and uses of combination, primer pair combination and kit |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2023077482A1 true WO2023077482A1 (en) | 2023-05-11 |
Family
ID=86240560
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/129160 WO2023077482A1 (en) | 2021-11-06 | 2021-11-06 | Combination of mnp markers of mycobacterium tuberculosis, primer pair combination, kit, and uses of combination, primer pair combination and kit |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2023077482A1 (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5851763A (en) * | 1992-09-17 | 1998-12-22 | Institut Pasteur | Rapid detection of antibiotic resistance in mycobacterium tuberculosis |
US20040121366A1 (en) * | 2002-07-19 | 2004-06-24 | Keim Paul S. | High resolution typing system for pathogenic Mycobacterium tuberculosum |
CN101363799A (en) * | 2007-08-09 | 2009-02-11 | 上海主健生物工程有限公司 | Kit for evaluating medicine for tuberculosis |
CN102533959A (en) * | 2010-12-30 | 2012-07-04 | 华中农业大学 | Multiplex polymerase chain reaction (PCR) kit for identifying mycobacterium tuberculosis |
-
2021
- 2021-11-06 WO PCT/CN2021/129160 patent/WO2023077482A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5851763A (en) * | 1992-09-17 | 1998-12-22 | Institut Pasteur | Rapid detection of antibiotic resistance in mycobacterium tuberculosis |
US20040121366A1 (en) * | 2002-07-19 | 2004-06-24 | Keim Paul S. | High resolution typing system for pathogenic Mycobacterium tuberculosum |
CN101363799A (en) * | 2007-08-09 | 2009-02-11 | 上海主健生物工程有限公司 | Kit for evaluating medicine for tuberculosis |
CN102533959A (en) * | 2010-12-30 | 2012-07-04 | 华中农业大学 | Multiplex polymerase chain reaction (PCR) kit for identifying mycobacterium tuberculosis |
Non-Patent Citations (1)
Title |
---|
LYU, XIAODONG ET AL.: "Application of Multiple Nucleotide Polymorphism Analysis in the Chimerism Detection after Allogeneic Hematopoietic Stem Cell Transplantation", CHINESE JOURNAL OF HEMATOLOGY, vol. 40, no. 8, 31 August 2019 (2019-08-31), XP009545385 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113718057B (en) | MNP (MNP) marking site of EB (Epstein-Barr) virus, primer composition, kit and application | |
WO2023077482A1 (en) | Combination of mnp markers of mycobacterium tuberculosis, primer pair combination, kit, and uses of combination, primer pair combination and kit | |
WO2023077485A1 (en) | Mnp marker combination of chlamydia pneumoniae, primer pair combination, kit, and application thereof | |
WO2023077486A1 (en) | Mnp marker combination for mycoplasma pneumoniae, primer pair combination, kit, and use | |
WO2023077489A1 (en) | Mnp marker combination of yersinia pestis, primer pair combination, kit, and application thereof | |
WO2023077488A1 (en) | Mnp marker combination of streptococcus pneumonia, primer pair combination, kit and uses thereof | |
WO2023077487A1 (en) | Adenovirus mnp marker combination, primer pair combination, kit and use thereof | |
CN114277163B (en) | MNP (MNP) labeling combination of chlamydia pneumoniae, primer pair combination, kit and application of kit | |
CN115029477B (en) | MNP (MNP-associated protein) marker locus of human rhinovirus, primer composition, kit and application of MNP marker locus | |
CN115029479B (en) | MNP (MNP) marking site of Zika virus, primer composition, kit and application of MNP marking site | |
CN114790488B (en) | MNP (MNP) marking site of staphylococcus aureus, primer composition, kit and application of MNP marking site | |
CN114277162B (en) | MNP (MNP) labeling combination of mycobacterium tuberculosis, primer pair combination, kit and application of kit | |
CN114790486B (en) | MNP (MNP) marking site of bacillus anthracis, primer composition, kit and application of MNP marking site | |
CN114277165B (en) | MNP (MNP) marking combination of plague bacillus, primer pair combination, kit and application of MNP marking combination | |
CN114107563B (en) | MNP (human metapneumovirus) marker locus, primer composition, kit and application of MNP marker locus | |
CN114836550B (en) | MNP (MNP) marking site of klebsiella pneumoniae, primer composition, kit and application of MNP marking site | |
CN114214435B (en) | MNP (MNP) labeling combination of mycoplasma pneumoniae, primer pair combination, kit and application of MNP labeling combination | |
CN115029454B (en) | MNP (MNP) marking site of Moraxella catarrhalis, primer composition, kit and application of MNP marking site | |
CN113862383B (en) | MNP (MNP) marking site of bacillus subtilis, primer composition and application of MNP marking site | |
CN114836572B (en) | MNP (MNP) marker locus of paraenterovirus, primer composition, kit and application of MNP marker locus | |
CN114836573B (en) | MNP (MNP) marking site of measles virus, primer composition, kit and application of MNP marking site | |
CN114107525B (en) | MNP (MNP) marking site of pseudomonas aeruginosa, primer composition, kit and application of MNP marking site | |
CN114107562B (en) | MNP (human parainfluenza virus) marker locus, primer composition, kit and application of MNP marker locus | |
CN114703319B (en) | Primer group and kit for identifying human bocavirus and application | |
CN114790489B (en) | MNP (MNP) marking site of haemophilus influenzae, primer composition, kit and application of MNP marking site |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21962995 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21962995 Country of ref document: EP Kind code of ref document: A1 |