CN118671353A - Biomarker for AIS diagnosis and application thereof - Google Patents
Biomarker for AIS diagnosis and application thereof Download PDFInfo
- Publication number
- CN118671353A CN118671353A CN202410685032.3A CN202410685032A CN118671353A CN 118671353 A CN118671353 A CN 118671353A CN 202410685032 A CN202410685032 A CN 202410685032A CN 118671353 A CN118671353 A CN 118671353A
- Authority
- CN
- China
- Prior art keywords
- proteins
- ais
- protein
- plasma
- exosomes
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 239000000090 biomarker Substances 0.000 title claims abstract description 32
- 238000003745 diagnosis Methods 0.000 title claims abstract description 25
- 206010056292 Androgen-Insensitivity Syndrome Diseases 0.000 title claims abstract 20
- 208000032382 Ischaemic stroke Diseases 0.000 title claims abstract 20
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 190
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 190
- 210000001808 exosome Anatomy 0.000 claims abstract description 84
- 238000000034 method Methods 0.000 claims abstract description 42
- 239000002245 particle Substances 0.000 claims abstract description 26
- 210000003205 muscle Anatomy 0.000 claims abstract description 25
- 238000012216 screening Methods 0.000 claims abstract description 24
- 238000001514 detection method Methods 0.000 claims abstract description 21
- 101710181683 Cartilage intermediate layer protein 1 Proteins 0.000 claims abstract description 20
- 102100026622 Cartilage intermediate layer protein 1 Human genes 0.000 claims abstract description 20
- 102100039819 Actin, alpha cardiac muscle 1 Human genes 0.000 claims abstract description 19
- 101000959247 Homo sapiens Actin, alpha cardiac muscle 1 Proteins 0.000 claims abstract description 19
- 239000003550 marker Substances 0.000 claims abstract description 10
- 102000007079 Peptide Fragments Human genes 0.000 claims abstract description 7
- 108010033276 Peptide Fragments Proteins 0.000 claims abstract description 7
- 102000004506 Blood Proteins Human genes 0.000 claims abstract description 6
- 108010017384 Blood Proteins Proteins 0.000 claims abstract description 6
- 238000004949 mass spectrometry Methods 0.000 claims description 28
- 238000004458 analytical method Methods 0.000 claims description 26
- 230000002159 abnormal effect Effects 0.000 claims description 22
- 239000012528 membrane Substances 0.000 claims description 19
- 230000008569 process Effects 0.000 claims description 18
- 238000001228 spectrum Methods 0.000 claims description 15
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 14
- GWEVSGVZZGPLCZ-UHFFFAOYSA-N Titan oxide Chemical compound O=[Ti]=O GWEVSGVZZGPLCZ-UHFFFAOYSA-N 0.000 claims description 12
- 238000005516 engineering process Methods 0.000 claims description 12
- 238000002347 injection Methods 0.000 claims description 12
- 239000007924 injection Substances 0.000 claims description 12
- 150000002500 ions Chemical class 0.000 claims description 12
- 239000007791 liquid phase Substances 0.000 claims description 12
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 claims description 12
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 claims description 9
- 239000007788 liquid Substances 0.000 claims description 9
- 238000010801 machine learning Methods 0.000 claims description 9
- 239000002244 precipitate Substances 0.000 claims description 9
- 239000000243 solution Substances 0.000 claims description 8
- 206010039722 scoliosis Diseases 0.000 claims description 7
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 claims description 6
- 108091005804 Peptidases Proteins 0.000 claims description 6
- 239000004365 Protease Substances 0.000 claims description 6
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 claims description 6
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 claims description 6
- 239000006180 TBST buffer Substances 0.000 claims description 6
- 102000004142 Trypsin Human genes 0.000 claims description 6
- 108090000631 Trypsin Proteins 0.000 claims description 6
- 235000012538 ammonium bicarbonate Nutrition 0.000 claims description 6
- 235000019253 formic acid Nutrition 0.000 claims description 6
- 238000002955 isolation Methods 0.000 claims description 6
- 239000004005 microsphere Substances 0.000 claims description 6
- 238000012986 modification Methods 0.000 claims description 6
- 230000004048 modification Effects 0.000 claims description 6
- 238000012856 packing Methods 0.000 claims description 6
- 239000012071 phase Substances 0.000 claims description 6
- 238000005191 phase separation Methods 0.000 claims description 6
- 238000000108 ultra-filtration Methods 0.000 claims description 6
- 238000005406 washing Methods 0.000 claims description 6
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 5
- 210000004027 cell Anatomy 0.000 claims description 4
- 238000003776 cleavage reaction Methods 0.000 claims description 4
- 230000001419 dependent effect Effects 0.000 claims description 4
- 238000004451 qualitative analysis Methods 0.000 claims description 4
- 238000004445 quantitative analysis Methods 0.000 claims description 4
- 230000007017 scission Effects 0.000 claims description 4
- 102100025222 CD63 antigen Human genes 0.000 claims description 3
- 102100037904 CD9 antigen Human genes 0.000 claims description 3
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims description 3
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 claims description 3
- 102000004190 Enzymes Human genes 0.000 claims description 3
- 108090000790 Enzymes Proteins 0.000 claims description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N EtOH Substances CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 claims description 3
- 108010033040 Histones Proteins 0.000 claims description 3
- 101000934368 Homo sapiens CD63 antigen Proteins 0.000 claims description 3
- 101000738354 Homo sapiens CD9 antigen Proteins 0.000 claims description 3
- 101000729289 Homo sapiens Ribose-5-phosphate isomerase Proteins 0.000 claims description 3
- 101000613251 Homo sapiens Tumor susceptibility gene 101 protein Proteins 0.000 claims description 3
- 108090000144 Human Proteins Proteins 0.000 claims description 3
- 102000003839 Human Proteins Human genes 0.000 claims description 3
- 239000002033 PVDF binder Substances 0.000 claims description 3
- 229930040373 Paraformaldehyde Natural products 0.000 claims description 3
- 239000004793 Polystyrene Substances 0.000 claims description 3
- 102100031139 Ribose-5-phosphate isomerase Human genes 0.000 claims description 3
- 102100040879 Tumor susceptibility gene 101 protein Human genes 0.000 claims description 3
- COQLPRJCUIATTQ-UHFFFAOYSA-N Uranyl acetate Chemical class O.O.O=[U]=O.CC(O)=O.CC(O)=O COQLPRJCUIATTQ-UHFFFAOYSA-N 0.000 claims description 3
- 125000000218 acetic acid group Chemical group C(C)(=O)* 0.000 claims description 3
- 230000001640 apoptogenic effect Effects 0.000 claims description 3
- 230000005540 biological transmission Effects 0.000 claims description 3
- 239000000872 buffer Substances 0.000 claims description 3
- 238000004422 calculation algorithm Methods 0.000 claims description 3
- 229910052799 carbon Inorganic materials 0.000 claims description 3
- 238000005119 centrifugation Methods 0.000 claims description 3
- 238000010276 construction Methods 0.000 claims description 3
- 229910052802 copper Inorganic materials 0.000 claims description 3
- 239000010949 copper Substances 0.000 claims description 3
- 238000004925 denaturation Methods 0.000 claims description 3
- 230000036425 denaturation Effects 0.000 claims description 3
- 238000001962 electrophoresis Methods 0.000 claims description 3
- 238000010828 elution Methods 0.000 claims description 3
- 230000007717 exclusion Effects 0.000 claims description 3
- 230000007062 hydrolysis Effects 0.000 claims description 3
- 238000006460 hydrolysis reaction Methods 0.000 claims description 3
- 239000012160 loading buffer Substances 0.000 claims description 3
- 239000012139 lysis buffer Substances 0.000 claims description 3
- 238000001819 mass spectrum Methods 0.000 claims description 3
- 239000011159 matrix material Substances 0.000 claims description 3
- 239000000203 mixture Substances 0.000 claims description 3
- 239000002105 nanoparticle Substances 0.000 claims description 3
- 230000003647 oxidation Effects 0.000 claims description 3
- 238000007254 oxidation reaction Methods 0.000 claims description 3
- 239000012188 paraffin wax Substances 0.000 claims description 3
- 229920002866 paraformaldehyde Polymers 0.000 claims description 3
- 238000000918 plasma mass spectrometry Methods 0.000 claims description 3
- 229920002223 polystyrene Polymers 0.000 claims description 3
- 229920002981 polyvinylidene fluoride Polymers 0.000 claims description 3
- 239000002243 precursor Substances 0.000 claims description 3
- 238000002360 preparation method Methods 0.000 claims description 3
- 238000000751 protein extraction Methods 0.000 claims description 3
- 230000000717 retained effect Effects 0.000 claims description 3
- 238000000926 separation method Methods 0.000 claims description 3
- 210000002966 serum Anatomy 0.000 claims description 3
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 claims description 3
- 230000003595 spectral effect Effects 0.000 claims description 3
- 239000006228 supernatant Substances 0.000 claims description 3
- 238000012546 transfer Methods 0.000 claims description 3
- 238000004627 transmission electron microscopy Methods 0.000 claims description 3
- 239000012588 trypsin Substances 0.000 claims description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 claims description 3
- 238000001262 western blot Methods 0.000 claims description 3
- 238000013399 early diagnosis Methods 0.000 abstract description 4
- 208000022567 adolescent idiopathic scoliosis Diseases 0.000 description 104
- 239000000523 sample Substances 0.000 description 25
- 230000014509 gene expression Effects 0.000 description 18
- 201000010099 disease Diseases 0.000 description 9
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 9
- 238000000513 principal component analysis Methods 0.000 description 8
- 238000010586 diagram Methods 0.000 description 7
- 230000027455 binding Effects 0.000 description 6
- 230000031018 biological processes and functions Effects 0.000 description 6
- 230000004879 molecular function Effects 0.000 description 6
- 238000011160 research Methods 0.000 description 5
- 238000001356 surgical procedure Methods 0.000 description 5
- 230000001413 cellular effect Effects 0.000 description 4
- 230000008506 pathogenesis Effects 0.000 description 4
- 230000001105 regulatory effect Effects 0.000 description 4
- 210000004369 blood Anatomy 0.000 description 3
- 239000008280 blood Substances 0.000 description 3
- 238000007621 cluster analysis Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 241000533867 Fordia Species 0.000 description 2
- 108010026552 Proteome Proteins 0.000 description 2
- 230000005856 abnormality Effects 0.000 description 2
- 238000003766 bioinformatics method Methods 0.000 description 2
- 239000013060 biological fluid Substances 0.000 description 2
- 238000013211 curve analysis Methods 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 230000028993 immune response Effects 0.000 description 2
- 238000001294 liquid chromatography-tandem mass spectrometry Methods 0.000 description 2
- 239000002207 metabolite Substances 0.000 description 2
- 238000002705 metabolomic analysis Methods 0.000 description 2
- 230000001431 metabolomic effect Effects 0.000 description 2
- 238000001964 muscle biopsy Methods 0.000 description 2
- 238000003068 pathway analysis Methods 0.000 description 2
- 239000000092 prognostic biomarker Substances 0.000 description 2
- 238000007637 random forest analysis Methods 0.000 description 2
- 210000003296 saliva Anatomy 0.000 description 2
- 230000019491 signal transduction Effects 0.000 description 2
- CCEKAJIANROZEO-UHFFFAOYSA-N sulfluramid Chemical group CCNS(=O)(=O)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)F CCEKAJIANROZEO-UHFFFAOYSA-N 0.000 description 2
- 210000002700 urine Anatomy 0.000 description 2
- 206010003445 Ascites Diseases 0.000 description 1
- 102100040904 Beta-parvin Human genes 0.000 description 1
- 102000000905 Cadherin Human genes 0.000 description 1
- 108050007957 Cadherin Proteins 0.000 description 1
- BHPQYMZQTOCNFJ-UHFFFAOYSA-N Calcium cation Chemical compound [Ca+2] BHPQYMZQTOCNFJ-UHFFFAOYSA-N 0.000 description 1
- 102100037290 Coatomer subunit gamma-1 Human genes 0.000 description 1
- 206010050694 Congenital scoliosis Diseases 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 102100031780 Endonuclease Human genes 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 1
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 1
- 101710089384 Extracellular protease Proteins 0.000 description 1
- 101000613557 Homo sapiens Beta-parvin Proteins 0.000 description 1
- 101000952964 Homo sapiens Coatomer subunit gamma-1 Proteins 0.000 description 1
- 239000000232 Lipid Bilayer Substances 0.000 description 1
- 102000002151 Microfilament Proteins Human genes 0.000 description 1
- 108010040897 Microfilament Proteins Proteins 0.000 description 1
- 206010033799 Paralysis Diseases 0.000 description 1
- 230000010799 Receptor Interactions Effects 0.000 description 1
- 101100233916 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) KAR5 gene Proteins 0.000 description 1
- 206010058907 Spinal deformity Diseases 0.000 description 1
- 108010083111 Ubiquitin-Protein Ligases Proteins 0.000 description 1
- 102100020696 Ubiquitin-conjugating enzyme E2 K Human genes 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 210000000941 bile Anatomy 0.000 description 1
- 230000017531 blood circulation Effects 0.000 description 1
- 229910001424 calcium ion Inorganic materials 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 230000004087 circulation Effects 0.000 description 1
- 238000003759 clinical diagnosis Methods 0.000 description 1
- 238000005345 coagulation Methods 0.000 description 1
- 230000015271 coagulation Effects 0.000 description 1
- 230000002301 combined effect Effects 0.000 description 1
- 230000024203 complement activation Effects 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000002567 electromyography Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 210000002744 extracellular matrix Anatomy 0.000 description 1
- 230000014818 extracellular matrix organization Effects 0.000 description 1
- 108010036236 extracellular matrix receptor Proteins 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 229910021645 metal ion Inorganic materials 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 210000001087 myotubule Anatomy 0.000 description 1
- 230000002232 neuromuscular Effects 0.000 description 1
- 238000010238 partial least squares regression Methods 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 238000010827 pathological analysis Methods 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 230000035778 pathophysiological process Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000013610 patient sample Substances 0.000 description 1
- 230000009325 pulmonary function Effects 0.000 description 1
- 230000010837 receptor-mediated endocytosis Effects 0.000 description 1
- 210000003019 respiratory muscle Anatomy 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000012706 support-vector machine Methods 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 230000003827 upregulation Effects 0.000 description 1
- 238000012070 whole genome sequencing analysis Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6803—General methods of protein analysis not limited to specific proteins or families of proteins
- G01N33/6848—Methods of protein analysis involving mass spectrometry
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N15/00—Investigating characteristics of particles; Investigating permeability, pore-volume or surface-area of porous materials
- G01N15/02—Investigating particle size or size distribution
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N15/00—Investigating characteristics of particles; Investigating permeability, pore-volume or surface-area of porous materials
- G01N15/06—Investigating concentration of particle suspensions
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N23/00—Investigating or analysing materials by the use of wave or particle radiation, e.g. X-rays or neutrons, not covered by groups G01N3/00 – G01N17/00, G01N21/00 or G01N22/00
- G01N23/02—Investigating or analysing materials by the use of wave or particle radiation, e.g. X-rays or neutrons, not covered by groups G01N3/00 – G01N17/00, G01N21/00 or G01N22/00 by transmitting the radiation through the material
- G01N23/04—Investigating or analysing materials by the use of wave or particle radiation, e.g. X-rays or neutrons, not covered by groups G01N3/00 – G01N17/00, G01N21/00 or G01N22/00 by transmitting the radiation through the material and forming images of the material
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N30/00—Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
- G01N30/02—Column chromatography
- G01N30/04—Preparation or injection of sample to be analysed
- G01N30/06—Preparation
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N30/00—Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
- G01N30/02—Column chromatography
- G01N30/62—Detectors specially adapted therefor
- G01N30/72—Mass spectrometers
- G01N30/7233—Mass spectrometers interfaced to liquid or supercritical fluid chromatograph
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6887—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids from muscle, cartilage or connective tissue
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Immunology (AREA)
- Analytical Chemistry (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Pathology (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Hematology (AREA)
- Urology & Nephrology (AREA)
- Dispersion Chemistry (AREA)
- Cell Biology (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Food Science & Technology (AREA)
- Medicinal Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Biophysics (AREA)
- Investigating Or Analysing Biological Materials (AREA)
Abstract
Description
技术领域Technical Field
本发明涉及生物标志技术领域,尤其涉及用于AIS诊断的生物标志物及其应用。The present invention relates to the technical field of biomarkers, and in particular to biomarkers for AIS diagnosis and applications thereof.
背景技术Background Art
青少年特发性脊柱侧弯(adolescent idiopathic scoliosis,AIS)是是青春发育期最常见的三维结构畸形,全世界发病率为全球发病率为1%-4%。女孩的发病率高于男孩。AIS是脊柱多个椎体结构三维空间的异常生长发育,包括矢状面的平背畸形,冠状面的侧弯和轴位的旋转畸形。不同于病因明确的神经肌肉型脊柱侧弯、先天性脊柱侧弯等,AIS患者发病机制仍有待进一步明确,目前研究倾向于AIS是多种内在因素及外部环境改变的共同作用导致。为了制定更有效的预后和治疗方法,目前非常需要对尚未探索的领域进行研究,进一步明确AIS的病因学将为临床的诊断和治疗提供更为可靠的依据。Adolescent idiopathic scoliosis (AIS) is the most common three-dimensional structural deformity during adolescence, with a global incidence of 1%-4%. The incidence rate in girls is higher than that in boys. AIS is the abnormal growth and development of multiple vertebral structures in the three-dimensional space of the spine, including flat back deformity in the sagittal plane, scoliosis in the coronal plane, and rotational deformity in the axial position. Unlike neuromuscular scoliosis and congenital scoliosis with clear etiologies, the pathogenesis of AIS patients still needs to be further clarified. Current research tends to show that AIS is caused by the combined effects of multiple internal factors and external environmental changes. In order to develop more effective prognostic and treatment methods, it is currently very necessary to study unexplored areas. Further clarifying the etiology of AIS will provide a more reliable basis for clinical diagnosis and treatment.
为阐明AIS的发病机制,研究人员采取了多种的研究方法和技术来发掘AIS患者存在的异常,包括全基因组测序及家系基因遗传,激素水平测定,蛋白质组学,代谢组学,生物力学,有限元分析及动物模型等。而肌肉的纤维类型,细胞外基质以及肌肉结构的异常在AIS椎旁肌肉中已经得到证实。在近期的一项研究中,通过暂时性麻痹腰大肌治疗后,AIS患者的侧弯度数、椎体旋转度出现了不同程度的缓解。据报道,AIS患者的肌力弱于年龄和性别相似的健康人,肺功能研究显示,AIS患者的呼吸肌力量减弱,为脊柱提供动态稳定性的后脊柱旁肌肉(包括多肌和竖脊肌),其失衡被推测为导致AIS脊柱畸形的起因和/或进展。这提示在椎体附着的肌肉可能是参与AIS病因及侧弯进展严重程度的重要因素之一。目前关于AIS椎旁肌肉的研究主要包括肌肉基因表达水平异常,组织病理学改变,影像学改变和肌电图的改变。To elucidate the pathogenesis of AIS, researchers have adopted a variety of research methods and techniques to explore the abnormalities in AIS patients, including whole genome sequencing and family genetic inheritance, hormone level measurement, proteomics, metabolomics, biomechanics, finite element analysis and animal models. Abnormalities in muscle fiber type, extracellular matrix and muscle structure have been confirmed in the paraspinal muscles of AIS. In a recent study, after temporary paralysis of the psoas major muscle, the degree of scoliosis and vertebral rotation in AIS patients showed varying degrees of relief. It is reported that the muscle strength of AIS patients is weaker than that of healthy people of similar age and gender. Pulmonary function studies have shown that the respiratory muscle strength of AIS patients is weakened. The imbalance of the posterior paraspinal muscles (including multiple muscles and erector spinae) that provide dynamic stability to the spine is speculated to be the cause and/or progression of AIS spinal deformity. This suggests that muscles attached to the vertebral body may be one of the important factors involved in the etiology of AIS and the severity of scoliosis progression. At present, the research on the paraspinal muscles of AIS mainly includes abnormal muscle gene expression levels, histopathological changes, imaging changes and changes in electromyography.
目前对AIS脊柱旁肌肉的研究主要依赖于肌肉活检,而由于存在伦理方面的顾虑,健康对照组或轻中度AIS患者的肌肉活检样本非常难以获取,因此采用这些样本和重度AIS患者术中的肌肉病理样本进行比较,是开展相关机制和诊断方面工作的障碍。因此,最好能够采用侵入性较低的测试用于在较早的时间点揭示肌肉相关的变化,并用于纵向研究。而血液样本是系统表型研究的样本,同时可以帮助医生可以发现早期诊断的生物标志物。At present, the study of paraspinal muscles in AIS mainly relies on muscle biopsy. However, due to ethical concerns, muscle biopsy samples from healthy controls or patients with mild to moderate AIS are very difficult to obtain. Therefore, the use of these samples and intraoperative muscle pathology samples from patients with severe AIS for comparison is an obstacle to the development of related mechanisms and diagnostic work. Therefore, it is best to use less invasive tests to reveal muscle-related changes at an earlier time point and for longitudinal studies. Blood samples are samples for systematic phenotypic studies and can help doctors discover biomarkers for early diagnosis.
先进的代谢组学和蛋白质组学平台使研究人员能够量化循环中的大量小分子、肽和蛋白质、肽和蛋白质进行量化,从而发现大量与疾病相关的预测性和预后性指标及代谢通路。目前,"组学"研究的进步使研究人员能够以非靶向和无偏差的方式量化血液循环中的大量蛋白质/肽和代谢物,从而确定AIS相关的预测和预后生物标志物。因此,采用血浆蛋白质组学进行疾病早期生物标记物的筛选和疾病发病机制的研究非常重要。Advanced metabolomics and proteomics platforms enable researchers to quantify a large number of small molecules, peptides and proteins in the circulation, thereby discovering a large number of predictive and prognostic indicators and metabolic pathways related to the disease. At present, the progress of "omics" research has enabled researchers to quantify a large number of proteins/peptides and metabolites in the blood circulation in a non-targeted and unbiased manner, thereby identifying AIS-related predictive and prognostic biomarkers. Therefore, it is very important to use plasma proteomics to screen early biomarkers of the disease and study the pathogenesis of the disease.
近年来,细胞外囊泡(extracellular vesicles,Evs)已成为寻找不同疾病的生物标志物以及探索参与疾病发展相关致病因子的重要工具[17-20]。EVs存在于不同的生物液体中,如血液、尿液、唾液、胆汁和腹水等。EVs是由不同类型的细胞产生的直径为40至大于1000nm的脂质双分子层球体,包含特定的蛋白质、脂质、不同种类RNA、DNA和代谢产物。目前,最常见的关于EVs研究是外泌体,其粒径大小在40至300nm[21]。这些外泌体包含的组成部分中,可能携带着疾病的生物标记物,在人类疾病的病理生理过程中发挥着重要作用。外泌体具有膜状结构,因此其腔内的内容物不受胞外蛋白酶的降解,在储存条件下具有高度稳定[16]。In recent years, extracellular vesicles (EVs) have become an important tool for finding biomarkers for different diseases and exploring pathogenic factors involved in the development of diseases [17-20]. EVs exist in different biological fluids, such as blood, urine, saliva, bile and ascites. EVs are lipid bilayer spheres with a diameter of 40 to more than 1000 nm produced by different types of cells, containing specific proteins, lipids, different types of RNA, DNA and metabolites. At present, the most common EVs studied are exosomes, whose particle size ranges from 40 to 300 nm [21]. The components contained in these exosomes may carry biomarkers of diseases and play an important role in the pathophysiological process of human diseases. Exosomes have a membrane structure, so the contents in their cavity are not degraded by extracellular proteases and are highly stable under storage conditions [16].
基于外泌体的这种特性,基于质谱检测的外泌体蛋白受到血浆高丰度蛋白的影响会更少,更容易检测相对较多的低丰度蛋白种类。因此将外泌体用于AIS的生物标志物检测可能具有较大的潜力。生物体液(血液、尿液、唾液等)由于样本易于获取、收集简单,被广泛应用于临床病理学诊断和疾病的监测过程中。但目前尚无关于AIS患者血浆外泌体蛋白的相关研究,评估AIS患者最常用的临床血浆外泌体内的生物化学因子尚未被发现。Based on this characteristic of exosomes, the exosome proteins detected by mass spectrometry will be less affected by high-abundance plasma proteins and will be easier to detect relatively more low-abundance protein species. Therefore, the use of exosomes for biomarker detection of AIS may have great potential. Biological fluids (blood, urine, saliva, etc.) are widely used in clinical pathological diagnosis and disease monitoring because of the ease of sample acquisition and simple collection. However, there is currently no relevant research on plasma exosome proteins in AIS patients, and the most commonly used biochemical factors in clinical plasma exosomes for evaluating AIS patients have not yet been discovered.
发明内容Summary of the invention
本发明的目的是为了解决现有技术中存在的问题,而提出的用于AIS诊断的生物标志物及其应用。The purpose of the present invention is to solve the problems existing in the prior art and to propose biomarkers for AIS diagnosis and their applications.
为了实现上述目的,本发明采用了如下技术方案:In order to achieve the above object, the present invention adopts the following technical solutions:
用于AIS诊断的生物标志物,包括CILP-1和ACTC中的一种或多种。Biomarkers for the diagnosis of AIS include one or more of CILP-1 and ACTC.
蛋白质用于AIS诊断的生物标志物的用途,所述蛋白质包括CILP-1和ACTC中的一种或多种。Use of proteins as biomarkers for AIS diagnosis, wherein the proteins include one or more of CILP-1 and ACTC.
用于AIS诊断的生物标志物的筛选方法,所述筛选方法具体包括以下步骤:A method for screening biomarkers for AIS diagnosis, the screening method specifically comprising the following steps:
获取AIS患者血浆样本以及健康人群的血浆样本作为对比;Obtain plasma samples from AIS patients and healthy people for comparison;
获取AIS患者肌肉组织;Obtain muscle tissue from AIS patients;
对血浆外泌体进行富集;Enrichment of plasma exosomes;
对血浆外泌体形态、粒径大小进行鉴定;Identify the morphology and particle size of plasma exosomes;
对外泌体标志性蛋白进行鉴定;Identify exosome marker proteins;
提取外泌体蛋白;Extracting exosomal proteins;
将外泌体蛋白切割成肽段;Cleave exosomal proteins into peptide fragments;
液相分离,以及质谱检测技术定性及定量分析样品中的蛋白质;Liquid phase separation and mass spectrometry detection technology to qualitatively and quantitatively analyze proteins in samples;
基于质谱数据进行蛋白质数据库的搜库鉴定;Search and identify protein databases based on mass spectrometry data;
差异蛋白鉴定及诊断标记物的筛选。Identification of differentially expressed proteins and screening of diagnostic markers.
优选地,获取AIS患者肌肉组织的具体过程为:取术中AIS患者(n=10)侧弯顶椎双侧(凸侧和凹侧)深部多裂肌肌肉组织各一块,大小约1cm×1cm×1cm,将肌肉组织放入4%多聚甲醛中固定24h用做石蜡切片。Preferably, the specific process of obtaining muscle tissue of AIS patients is as follows: a piece of deep multifidus muscle tissue of each side (convex side and concave side) of the apical vertebra of scoliosis of intraoperative AIS patients (n=10) is obtained, the size of which is about 1 cm×1 cm×1 cm, and the muscle tissue is placed in 4% paraformaldehyde and fixed for 24 hours for paraffin sectioning.
优选地,所述对血浆外泌体进行富集的具体过程为:通过0.2μm滤膜去除700μL血浆中的细胞碎片、微囊泡和凋亡小体等;加TiO2微球10mg,4℃,振荡孵育5min;500μL PBS洗涤3次后,离心,去除上清并收集TiO2微球沉淀,其表面富集有血浆外泌体。Preferably, the specific process of enriching plasma exosomes is: removing cell debris, microvesicles and apoptotic bodies in 700 μL plasma through a 0.2 μm filter membrane; adding 10 mg of TiO2 microspheres, incubating at 4°C with shaking for 5 minutes; washing 3 times with 500 μL PBS, centrifuging, removing the supernatant and collecting the TiO2 microsphere precipitate, the surface of which is enriched with plasma exosomes.
优选地,所述对血浆外泌体形态、粒径大小进行鉴定的具体过程为:Preferably, the specific process of identifying the morphology and particle size of plasma exosomes is:
透射电镜鉴定外泌体的形态:将含有外泌体的PBS溶液(20μL)滴在铜网formvar碳支持膜上(200目),静置10min充分沉降外泌体。滴加饱和乙酸双氧铀乙醇溶液复染1min,室温干燥并用透射电镜观察外泌体形态;Transmission electron microscopy was used to identify the morphology of exosomes: PBS solution containing exosomes (20 μL) was dropped onto a copper mesh formvar carbon support film (200 mesh) and allowed to stand for 10 min to fully precipitate the exosomes. Saturated uranyl acetate ethanol solution was added for counterstaining for 1 min, dried at room temperature, and the morphology of exosomes was observed using a transmission electron microscope;
外泌体粒径和浓度检测:采用ZetaView纳米颗粒跟踪仪检测外泌体的颗粒大小和浓度。将富集得到的外泌体用PBS缓冲液进行5000倍比稀释,用粒径约为100nm的聚苯乙烯颗粒对仪器进行校准后清洗样本池,将样本进行上机检测(温度为23℃);Exosome particle size and concentration detection: ZetaView nanoparticle tracking instrument was used to detect the particle size and concentration of exosomes. The enriched exosomes were diluted 5000 times with PBS buffer, the instrument was calibrated with polystyrene particles with a particle size of about 100 nm, the sample pool was cleaned, and the samples were tested on the machine (temperature was 23°C);
外泌体标志性蛋白的鉴定(Western-blot):富集的外泌体中加入RPIA裂解液进行蛋白提取并进行Nanodrop2000定量总蛋白浓度,将10μg蛋白与loading buffer配制后煮沸变性,用于SDS-PAGE电泳,再转膜至PVDF膜,在室温摇床上封闭1h后分别与三种外泌体标志蛋白(CD9、CD63、TSG101)的一抗进行孵育,于4℃孵育过夜。TBST洗去一抗后在室温摇床上与二抗进行孵育1h,再次用TBST洗膜,使用显影液显色。Identification of exosome marker proteins (Western-blot): RPIA lysis buffer was added to the enriched exosomes for protein extraction and the total protein concentration was quantified by Nanodrop2000. 10 μg of protein was prepared with loading buffer and then boiled for denaturation for SDS-PAGE electrophoresis. The membrane was then transferred to a PVDF membrane and blocked on a shaker at room temperature for 1 hour. The membrane was incubated with primary antibodies of three exosome marker proteins (CD9, CD63, TSG101) and incubated overnight at 4°C. After washing the primary antibody with TBST, the membrane was incubated with the secondary antibody on a shaker at room temperature for 1 hour, and the membrane was washed again with TBST and developed with a developer.
优选地,所述将外泌体蛋白切割成肽段的具体过程为:Preferably, the specific process of cutting the exosome protein into peptide segments is:
取上述得到的沉淀,加入2%十二烷基磺酸钠(SDS)50ul裂解,涡旋后冰上超声30min;The precipitate obtained above was taken and lysed by adding 50ul of 2% sodium dodecyl sulfate (SDS), vortexed and ultrasonicated on ice for 30min;
采用滤膜辅助样本准备法(Filter-Assisted Sample Preparation,FASP)将外泌体蛋白切割成肽段;Filter-Assisted Sample Preparation (FASP) was used to cut exosome proteins into peptide fragments;
用400μL UA(8M)清洗新的30KD超滤管(14000g,常温离心10min);A new 30KD ultrafiltration tube was washed with 400 μL UA (8 M) (14000 g, centrifugation at room temperature for 10 min);
将裂解后的液体转入洗好的30KD超滤管中,14000g,常温离心15min;加200μL UA(8M),14000g,常温离心15min;The lysed liquid was transferred to a washed 30KD ultrafiltration tube and centrifuged at 14000g for 15 min at room temperature; 200 μL UA (8M) was added and centrifuged at 14000g for 15 min at room temperature;
加200μL DTT(10mM),置于37℃恒温箱中4h;14000g,常温离心15min;加200μL UA(8M),14000g,常温离心15min;Add 200 μL DTT (10 mM), place in a 37°C incubator for 4 h; centrifuge at 14,000 g for 15 min at room temperature; add 200 μL UA (8 M), centrifuge at 14,000 g for 15 min at room temperature;
加200μL IAA(50mM),室温避光40min;Add 200 μL IAA (50 mM) and incubate at room temperature in the dark for 40 min;
加200μL NH4HCO3(50mM),14000x g,常温离心10min,重复2次;Add 200 μL NH4HCO3 (50 mM), centrifuge at 14000 x g for 10 min at room temperature, repeat twice;
加200μL NH4HCO3(50mM),吹打混匀,加1μg蛋白酶,放入37℃恒温箱中;12h后再加1μg trypsin酶,放入37℃恒温箱中;4h后取出;Add 200 μL NH4HCO3 (50 mM), mix well by pipetting, add 1 μg protease, and place in a 37°C incubator; after 12 hours, add 1 μg trypsin enzyme and place in a 37°C incubator; take out after 4 hours;
蛋白酶解成肽段后14000g,常温离心10min(自此步开始需要留离心下来的液体);加200μL水,14000g,常温离心10min,重复1次;将离心后的液体转移至1.5mL离心管中,热干。用0.1% FA复溶,用于外泌体鉴定、肽段浓度检测及质谱分析。After protease hydrolysis into peptides, centrifuge at 14000g for 10 minutes at room temperature (from this step, the centrifuged liquid needs to be retained); add 200μL of water, centrifuge at 14000g for 10 minutes at room temperature, repeat once; transfer the centrifuged liquid to a 1.5mL centrifuge tube and heat dry. Redissolve with 0.1% FA for exosome identification, peptide concentration detection and mass spectrometry analysis.
优选地,所述液相分离,以及质谱检测技术定性及定量分析样品中的蛋白质的具体过程为:样品先经纳升级液相在线分离后再检测,数据的采集方式采用数据非依赖型扫描模式(DIA,Data independent acquisition),DIA技术是先利用常规Data dependentAcquisition(DDA)质谱检测技术分析并建立出来图谱库,再采用DIA的方法采集待测样品的质谱数据,在对比DIA数据与DDA的谱图库信息后进行定性及定量分析样品中的蛋白质;Preferably, the specific process of the liquid phase separation and the mass spectrometry detection technology for qualitative and quantitative analysis of proteins in the sample is: the sample is first separated online by nanoliter liquid phase and then detected, and the data is collected in a data independent acquisition (DIA) scanning mode. The DIA technology first uses conventional Data dependent Acquisition (DDA) mass spectrometry detection technology to analyze and establish a spectrum library, and then uses the DIA method to collect mass spectrum data of the sample to be tested, and after comparing the DIA data with the DDA spectrum library information, qualitative and quantitative analysis of the protein in the sample is performed;
EASY-nLC 1200液相系统预柱及分析柱规格如下:The specifications of the pre-column and analytical column of the EASY-nLC 1200 liquid phase system are as follows:
预柱:3μm粒径C18填料,2cm×100μm内径;Pre-column: 3 μm particle size C18 packing, 2 cm × 100 μm inner diameter;
分析柱:1.9μm粒径C18填料,30cm×150μm内径;Analytical column: 1.9 μm particle size C18 packing, 30 cm × 150 μm inner diameter;
流动相A:0.1%甲酸,流速为600nl/min;Mobile phase A: 0.1% formic acid, flow rate 600 nl/min;
流动相B:乙腈及0.1%甲酸(体积比为1000:1);Mobile phase B: acetonitrile and 0.1% formic acid (volume ratio 1000:1);
肽分离洗脱梯度如下:0-6min,7-12% B;6-72min,12-30% B;72-94min,30-42%B;94-95min,42-95% B and 95-100min,95% B;The elution gradient for peptide separation was as follows: 0-6 min, 7-12% B; 6-72 min, 12-30% B; 72-94 min, 30-42% B; 94-95 min, 42-95% B and 95-100 min, 95% B;
质谱采用Q-Exactive HFXTM系统(Thermo Fisher Scientific),谱图库建立采用Data dependent Acquisition(DDA)模式,条件如下:扫描方式为正离子扫描模式,一级质谱采用Orbitrap检测,分辨率m/z 120,000@200,最大注入时间为80ms,扫描范围400-1400;二级扫描分辨率设为15,000@m/z200,起始扫描质量(Fixed first mass)120m/z,最大离子注入时间为45ms,HCD相对碰撞能量为27%,数据采集时采用18s动态排除;The mass spectrometer was performed using the Q-Exactive HFX TM system (Thermo Fisher Scientific). The spectral library was established using the Data dependent Acquisition (DDA) mode with the following conditions: the scanning mode was positive ion scanning mode, the primary mass spectrometer was detected by Orbitrap, the resolution was m/z 120,000@200, the maximum injection time was 80 ms, and the scanning range was 400-1400; the secondary scanning resolution was set to 15,000@m/z200, the starting scanning mass (fixed first mass) was 120 m/z, the maximum ion injection time was 45 ms, the HCD relative collision energy was 27%, and 18 s dynamic exclusion was used during data acquisition;
样品质谱采集:血清/血浆质谱数据采用Data Independent Acquisition(DIA)模式,扫描方式为正离子扫描模式,一级质谱采用Orbitrap检测,分辨率m/z 60,000@200,最大注入时间为80ms,扫描范围400-1200;DIA扫描分辨率设为30,000@m/z 200,起始扫描质量(Fixed first mass)200m/z,最大离子注入时间为45ms;每次采集的DIA隔离窗口为25m/z,隔离窗口数目为32,HCD相对碰撞能量为27%。Sample mass spectrometry acquisition: Serum/plasma mass spectrometry data were acquired in Data Independent Acquisition (DIA) mode, with positive ion scanning mode, and Orbitrap detection for primary mass spectrometry, with a resolution of m/z 60,000@200, a maximum injection time of 80 ms, and a scanning range of 400-1200; the DIA scanning resolution was set to 30,000@m/z 200, the starting scanning mass (Fixed first mass) was 200 m/z, and the maximum ion injection time was 45 ms; the DIA isolation window for each acquisition was 25 m/z, the number of isolation windows was 32, and the HCD relative collision energy was 27%.
优选地,所述基于质谱数据进行蛋白质数据库的搜库鉴定的具体过程为:所得质谱数据利用Spectronaut 14.5 200813.47784搜索引擎进行谱图库建立及DIA数据搜索,在Spectronaut软件模板中对数据库搜索的各项参数进行设定:library谱图库建立,“Protein Database”选用Uniprot数据库的人蛋白质序列数据库;在“Enzymes/cleavagerule”中选取Trypsin/p;在“Digest Type”中选取Specific;在“Missed Cleavages”中选取2;在“Precursor Mass Tolerance”中填20ppm;在“Variable Modifications”中选取Acetyl(Protein N-term)、Oxidation(M);在“Fixed Modifications”中选取Carbamidomethyl(C),DIA数据搜索选取上述建立的library谱图库,参数设置选取“BGSFactory Settings(default)”。Preferably, the specific process of searching the protein database based on mass spectrometry data is as follows: the mass spectrometry data is obtained by using the Spectronaut 14.5 200813.47784 search engine to establish a spectrum library and perform DIA data search, and various parameters for database search are set in the Spectronaut software template: library spectrum library is established, the human protein sequence database of the Uniprot database is selected in "Protein Database"; Trypsin/p is selected in "Enzymes/cleavagerule"; Specific is selected in "Digest Type"; 2 is selected in "Missed Cleavages"; 20ppm is filled in "Precursor Mass Tolerance"; Acetyl (Protein N-term), Oxidation (M) are selected in "Variable Modifications"; Carbamidomethyl (C) is selected in "Fixed Modifications", the library spectrum library established above is selected for DIA data search, and "BGSFactory Settings (default)" is selected for parameter setting.
优选地,所述差异蛋白鉴定及诊断标记物的筛选包括:采用R语言进行机器学习分析,具体描述如下:Preferably, the differential protein identification and screening of diagnostic markers include: using R language to perform machine learning analysis, which is specifically described as follows:
蛋白生理范围的构建:由于不同蛋白的丰度、丰度跨度及其离散程度存在明显的差异,因此根据不同蛋白丰度的数据特征使用机器学习算法分别确定正常组蛋白特异性生理范围的上限和下限,建立正常血清蛋白的生理波动范围;Construction of protein physiological range: Since the abundance, abundance span and discreteness of different proteins are significantly different, the upper and lower limits of the normal histone-specific physiological range are determined using machine learning algorithms based on the data characteristics of different protein abundances, and the physiological fluctuation range of normal serum proteins is established;
个体及群体差异蛋白的鉴定:使用某一样本鉴定到的所有蛋白逐一与该蛋白的生理范围进行比较,若某一蛋白在样本中的丰度超过该蛋白生理范围的上限或低于生理范围的下限,则定义该蛋白在样本中为异常,以此方法检出样本中所有的异常蛋白。与蛋白生理范围进行比较,整合样本的异常蛋白信息构建群体的异常蛋白信息关联矩阵,定义在90%以上样本均异常为检出群体水平异常蛋白的标准。Identification of individual and group differential proteins: All proteins identified in a sample are compared with the physiological range of the protein one by one. If the abundance of a protein in a sample exceeds the upper limit of the physiological range of the protein or is lower than the lower limit of the physiological range, the protein is defined as abnormal in the sample. This method is used to detect all abnormal proteins in the sample. By comparing with the physiological range of the protein, the abnormal protein information of the sample is integrated to construct the abnormal protein information association matrix of the group, and more than 90% of the samples are defined as abnormal as the standard for detecting abnormal proteins at the group level.
与现有技术相比,本发明的有益效果是:Compared with the prior art, the present invention has the following beneficial effects:
本发明采用CILP-1和ACTC作为AIS患者血浆细胞外囊泡早期诊断的生物标记物,受到血浆高峰度蛋白的影响较小,解决了现有技术采用血浆蛋白质组学进行差异蛋白和生物标记物的筛选,而血浆蛋白来源复杂,且血浆中存在大量的高峰度蛋白,可能影响低丰度蛋白的检测质量的问题。The present invention uses CILP-1 and ACTC as biomarkers for early diagnosis of plasma extracellular vesicles in AIS patients, which are less affected by plasma high-abundance proteins. This solves the problem that the prior art uses plasma proteomics to screen differential proteins and biomarkers, but the source of plasma proteins is complex and there are a large number of high-abundance proteins in plasma, which may affect the detection quality of low-abundance proteins.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1为AIS组和对照组血浆外泌体蛋白质组鉴定的差异蛋白示意图;Figure 1 is a schematic diagram of differentially expressed proteins identified in the plasma exosome proteome between the AIS group and the control group;
图1中:A.差异蛋白的火山图,红色代表与对照组相比,AIS组显著升高1.5倍以上的蛋白,蓝色代表降低1.5倍以下的蛋白(p<0.05),横坐标为log2(AIS随访组/对照组),纵坐标为-log10(pvalue);Figure 1: A. Volcano plot of differentially expressed proteins, red represents proteins significantly increased by more than 1.5 times in the AIS group compared with the control group, blue represents proteins decreased by less than 1.5 times (p<0.05), the horizontal axis is log2 (AIS follow-up group/control group), and the vertical axis is -log10 (pvalue);
B.AIS组和对照组的二维主成分分析图,PC1=25.8%,PC2=15.9%;B. Two-dimensional principal component analysis of AIS group and control group, PC1 = 25.8%, PC2 = 15.9%;
C.AIS患者和对照组血浆外泌体的差异蛋白丰度的热图分析,红色程度越深代表蛋白丰度越高,蓝色程度越深代表蛋白丰度越低;C. Heat map analysis of differential protein abundance in plasma exosomes of AIS patients and control groups. Darker red represents higher protein abundance, and darker blue represents lower protein abundance.
NC:对照组,S:AIS组;NC: control group, S: AIS group;
图2为AIS组和对照组血浆外泌体差异表达蛋白的GO分析和KEGG分析示意图;Figure 2 is a schematic diagram of GO analysis and KEGG analysis of differentially expressed proteins in plasma exosomes between the AIS group and the control group;
图2中:In Figure 2:
A.AIS组血浆外泌体所有差异蛋白的GO分析;A. GO analysis of all differentially expressed proteins in plasma exosomes of the AIS group;
B.AIS组的血浆外泌体所所有差异蛋白的KEGG分析;B. KEGG analysis of all differentially expressed proteins in plasma exosomes of AIS group;
图3为多个模型分析AIS组和对照组之间的差异蛋白及其ROC诊断分析示意图;FIG3 is a schematic diagram of multiple models analyzing differential proteins between the AIS group and the control group and their ROC diagnostic analysis;
图3中:A.采用多个模型分析获得的25个差异蛋白,其中selectedfrequency代表被模型选择的频率(阈值),频率越高表示蛋白对于两组间的鉴别效果越好;In Figure 3: A. 25 differentially expressed proteins obtained by using multiple models, where selectedfrequency represents the frequency (threshold) selected by the model. The higher the frequency, the better the protein's ability to discriminate between the two groups.
B.ROC曲线对25个差异蛋白诊断AIS效率的分析;B. Analysis of ROC curve on the efficiency of 25 differentially expressed proteins in diagnosing AIS;
图4为AIS随访组和对照组血浆外泌体蛋白质组鉴定的差异蛋白示意图;Figure 4 is a schematic diagram of differentially expressed proteins identified in plasma exosome proteomes between the AIS follow-up group and the control group;
图4中:A.差异蛋白的火山图,红色代表与对照组相比,AIS随访组显著升高1.5倍以上的蛋白,蓝色代表降低1.5倍以下的蛋白(p<0.05),横坐标为log2(AIS随访组/对照组),纵坐标为-log10(pvalue);Figure 4: A. Volcano plot of differentially expressed proteins, red represents proteins significantly increased by more than 1.5 times in the AIS follow-up group compared with the control group, blue represents proteins decreased by less than 1.5 times (p<0.05), the horizontal axis is log2 (AIS follow-up group/control group), and the vertical axis is -log10 (pvalue);
B.AIS随访组和对照组的二维主成分分析图,NC:对照组,S1:AIS随访组,PC1=24.6%,PC2=16.3%;B. Two-dimensional principal component analysis of the AIS follow-up group and the control group, NC: control group, S1: AIS follow-up group, PC1 = 24.6%, PC2 = 16.3%;
C.AIS患者和对照组血浆外泌体的差异蛋白丰度的热图分析,红色程度越深代表蛋白丰度越高,蓝色程度越深代表蛋白丰度越低;NC:对照组,S1:AIS随访组;C. Heat map analysis of differential protein abundance in plasma exosomes of AIS patients and control group. Darker red represents higher protein abundance, and darker blue represents lower protein abundance; NC: control group, S1: AIS follow-up group;
图5为AIS随访组和对照组血浆外泌体差异表达蛋白的GO分析和KEGG分析示意图;Figure 5 is a schematic diagram of GO analysis and KEGG analysis of differentially expressed proteins in plasma exosomes between the AIS follow-up group and the control group;
图5中:A.AIS随访组血浆外泌体所有差异蛋白的GO分析;Figure 5: A. GO analysis of all differential proteins in plasma exosomes of the AIS follow-up group;
B.AIS随访组血浆外泌体所有差异蛋白的KEGG分析;B. KEGG analysis of all differential proteins in plasma exosomes of the AIS follow-up group;
图6为通过机器学习筛选的25个差异蛋白在对照组和AIS组的表达热图;Figure 6 is a heat map of the expression of 25 differentially expressed proteins in the control group and AIS group screened by machine learning;
图6中:A.25个差异蛋白在AIS随访组的表达情况,红色代表蛋白表达异常上调,蓝色代表蛋白表达异常下调;In Figure 6: A. Expression of 25 differentially expressed proteins in the AIS follow-up group, red represents abnormal up-regulation of protein expression, and blue represents abnormal down-regulation of protein expression;
B.25个差异蛋白在对照组、AIS随访组和AIS手术组表达的热图,红色代表表达上调,蓝色代表表达下调,右侧图注代表表达差异的倍数;B. Heat map of the expression of 25 differentially expressed proteins in the control group, AIS follow-up group, and AIS surgery group. Red represents up-regulated expression, blue represents down-regulated expression, and the legend on the right represents the multiple of expression difference;
图7为ACTC和CILP-1在AIS随访组和对照组的ROC曲线和表达水平示意图;FIG7 is a schematic diagram of the ROC curves and expression levels of ACTC and CILP-1 in the AIS follow-up group and the control group;
图7中:A.ACTC用于AIS随访组诊断的ROC曲线,曲线下面积为1;In Figure 7: A. ROC curve of ACTC for diagnosis of AIS follow-up group, the area under the curve is 1;
B.ACTC在对照组和AIS随访组的表达水平;B. Expression levels of ACTC in the control group and AIS follow-up group;
C.CILP-1用于AIS随访组诊断的ROC曲线,曲线下面积为0.85;C. ROC curve of CILP-1 for diagnosis of AIS follow-up group, with an area under the curve of 0.85;
D.CILP-1在对照组和AIS随访组的表达水平;D. Expression levels of CILP-1 in the control group and AIS follow-up group;
图8为ACTC和CILP-1在AIS患者(随访组和手术组)和对照组的ROC曲线和表达水平示意图;FIG8 is a schematic diagram of the ROC curves and expression levels of ACTC and CILP-1 in AIS patients (follow-up group and surgery group) and the control group;
图8中:A.ACTC用于AIS患者诊断的ROC曲线,曲线下面积为1;In Figure 8: A. ROC curve of ACTC for diagnosis of AIS patients, with an area under the curve of 1;
B.ACTC在对照组和AIS组的表达水平;B. Expression levels of ACTC in the control group and AIS group;
C.CILP-1用于AIS患者诊断的ROC曲线,曲线下面积为0.817;C. ROC curve of CILP-1 for the diagnosis of AIS patients, with an area under the curve of 0.817;
D.CILP-1在对照组和AIS组的表达水平。D. The expression level of CILP-1 in the control group and AIS group.
具体实施方式DETAILED DESCRIPTION
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。The technical solutions in the embodiments of the present invention will be described clearly and completely below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments are only part of the embodiments of the present invention, rather than all the embodiments.
用于AIS诊断的生物标志物,包括CILP-1和ACTC中的一种或多种。Biomarkers for the diagnosis of AIS include one or more of CILP-1 and ACTC.
蛋白质用于AIS诊断的生物标志物的用途,所述蛋白质包括CILP-1和ACTC中的一种或多种。Use of proteins as biomarkers for AIS diagnosis, wherein the proteins include one or more of CILP-1 and ACTC.
用于AIS诊断的生物标志物的筛选方法,所述筛选方法具体包括以下步骤:A method for screening biomarkers for AIS diagnosis, the screening method specifically comprising the following steps:
获取AIS患者血浆样本以及健康人群的血浆样本作为对比;Obtain plasma samples from AIS patients and healthy people for comparison;
获取AIS患者肌肉组织;Obtain muscle tissue from AIS patients;
对血浆外泌体进行富集;Enrichment of plasma exosomes;
对血浆外泌体形态、粒径大小进行鉴定;Identify the morphology and particle size of plasma exosomes;
对外泌体标志性蛋白进行鉴定;Identify exosome marker proteins;
提取外泌体蛋白;Extracting exosomal proteins;
将外泌体蛋白切割成肽段;Cleave exosomal proteins into peptide fragments;
液相分离,以及质谱检测技术定性及定量分析样品中的蛋白质;Liquid phase separation and mass spectrometry detection technology to qualitatively and quantitatively analyze proteins in samples;
基于质谱数据进行蛋白质数据库的搜库鉴定;Search and identify protein databases based on mass spectrometry data;
差异蛋白鉴定及诊断标记物的筛选。Identification of differentially expressed proteins and screening of diagnostic markers.
本实施例中,获取AIS患者肌肉组织的具体过程为:取术中AIS患者(n=10)侧弯顶椎双侧(凸侧和凹侧)深部多裂肌肌肉组织各一块,大小约1cm×1cm×1cm,将肌肉组织放入4%多聚甲醛中固定24h用做石蜡切片。In this embodiment, the specific process of obtaining muscle tissue of AIS patients is as follows: a piece of deep multifidus muscle tissue of each side (convex side and concave side) of the apical vertebra of scoliosis of intraoperative AIS patients (n=10) is obtained, the size of which is about 1 cm×1 cm×1 cm, and the muscle tissue is placed in 4% paraformaldehyde and fixed for 24 hours for paraffin sectioning.
本实施例中,所述对血浆外泌体进行富集的具体过程为:通过0.2μm滤膜去除700μL血浆中的细胞碎片、微囊泡和凋亡小体等;加TiO2微球10mg,4℃,振荡孵育5min;500μLPBS洗涤3次后,离心,去除上清并收集TiO2微球沉淀,其表面富集有血浆外泌体。In this embodiment, the specific process of enriching plasma exosomes is as follows: removing cell debris, microvesicles, apoptotic bodies, etc. in 700 μL of plasma through a 0.2 μm filter membrane; adding 10 mg of TiO2 microspheres, incubating at 4°C with shaking for 5 minutes; washing 3 times with 500 μL PBS, centrifuging, removing the supernatant and collecting the TiO2 microsphere precipitate, the surface of which is enriched with plasma exosomes.
本实施例中,所述对血浆外泌体形态、粒径大小进行鉴定的具体过程为:In this embodiment, the specific process of identifying the morphology and particle size of plasma exosomes is as follows:
透射电镜鉴定外泌体的形态:将含有外泌体的PBS溶液(20μL)滴在铜网formvar碳支持膜上(200目),静置10min充分沉降外泌体。滴加饱和乙酸双氧铀乙醇溶液复染1min,室温干燥并用透射电镜观察外泌体形态;Transmission electron microscopy was used to identify the morphology of exosomes: PBS solution containing exosomes (20 μL) was dropped onto a copper mesh formvar carbon support film (200 mesh) and allowed to stand for 10 min to fully precipitate the exosomes. Saturated uranyl acetate ethanol solution was added for counterstaining for 1 min, dried at room temperature, and the morphology of exosomes was observed using a transmission electron microscope;
外泌体粒径和浓度检测:采用ZetaView纳米颗粒跟踪仪检测外泌体的颗粒大小和浓度。将富集得到的外泌体用PBS缓冲液进行5000倍比稀释,用粒径约为100nm的聚苯乙烯颗粒对仪器进行校准后清洗样本池,将样本进行上机检测(温度为23℃);Exosome particle size and concentration detection: ZetaView nanoparticle tracking instrument was used to detect the particle size and concentration of exosomes. The enriched exosomes were diluted 5000 times with PBS buffer, the instrument was calibrated with polystyrene particles with a particle size of about 100 nm, the sample pool was cleaned, and the samples were tested on the machine (temperature was 23°C);
外泌体标志性蛋白的鉴定(Western-blot):富集的外泌体中加入RPIA裂解液进行蛋白提取并进行Nanodrop2000定量总蛋白浓度,将10μg蛋白与loading buffer配制后煮沸变性,用于SDS-PAGE电泳,再转膜至PVDF膜,在室温摇床上封闭1h后分别与三种外泌体标志蛋白(CD9、CD63、TSG101)的一抗进行孵育,于4℃孵育过夜。TBST洗去一抗后在室温摇床上与二抗进行孵育1h,再次用TBST洗膜,使用显影液显色。Identification of exosome marker proteins (Western-blot): RPIA lysis buffer was added to the enriched exosomes for protein extraction and the total protein concentration was quantified by Nanodrop2000. 10 μg of protein was prepared with loading buffer and then boiled for denaturation for SDS-PAGE electrophoresis. The membrane was then transferred to a PVDF membrane and blocked on a shaker at room temperature for 1 hour. The membrane was incubated with primary antibodies of three exosome marker proteins (CD9, CD63, TSG101) and incubated overnight at 4°C. After washing the primary antibody with TBST, the membrane was incubated with the secondary antibody on a shaker at room temperature for 1 hour, and the membrane was washed again with TBST and developed with a developer.
本实施例中,所述将外泌体蛋白切割成肽段的具体过程为:In this embodiment, the specific process of cutting the exosome protein into peptide segments is:
取上述得到的沉淀,加入2%十二烷基磺酸钠(SDS)50ul裂解,涡旋后冰上超声30min;The precipitate obtained above was taken and lysed by adding 50ul of 2% sodium dodecyl sulfate (SDS), vortexed and ultrasonicated on ice for 30min;
采用滤膜辅助样本准备法(Filter-Assisted Sample Preparation,FASP)将外泌体蛋白切割成肽段;Filter-Assisted Sample Preparation (FASP) was used to cut exosome proteins into peptide fragments;
用400μL UA(8M)清洗新的30KD超滤管(14000g,常温离心10min);A new 30KD ultrafiltration tube was washed with 400 μL UA (8 M) (14000 g, centrifugation at room temperature for 10 min);
将裂解后的液体转入洗好的30KD超滤管中,14000g,常温离心15min;加200μL UA(8M),14000g,常温离心15min;The lysed liquid was transferred to a washed 30KD ultrafiltration tube and centrifuged at 14000g for 15 min at room temperature; 200 μL UA (8M) was added and centrifuged at 14000g for 15 min at room temperature;
加200μL DTT(10mM),置于37℃恒温箱中4h;14000g,常温离心15min;加200μL UA(8M),14000g,常温离心15min;Add 200 μL DTT (10 mM), place in a 37°C incubator for 4 h; centrifuge at 14,000 g for 15 min at room temperature; add 200 μL UA (8 M), centrifuge at 14,000 g for 15 min at room temperature;
加200μL IAA(50mM),室温避光40min;Add 200 μL IAA (50 mM) and incubate at room temperature in the dark for 40 min;
加200μL NH4HCO3(50mM),14000x g,常温离心10min,重复2次;Add 200 μL NH4HCO3 (50 mM), centrifuge at 14000 x g for 10 min at room temperature, repeat twice;
加200μL NH4HCO3(50mM),吹打混匀,加1μg蛋白酶,放入37℃恒温箱中;12h后再加1μg trypsin酶,放入37℃恒温箱中;4h后取出;Add 200 μL NH4HCO3 (50 mM), mix well by pipetting, add 1 μg protease, and place in a 37°C incubator; after 12 hours, add 1 μg trypsin enzyme and place in a 37°C incubator; take out after 4 hours;
蛋白酶解成肽段后14000g,常温离心10min(自此步开始需要留离心下来的液体);加200μL水,14000g,常温离心10min,重复1次;将离心后的液体转移至1.5mL离心管中,热干。用0.1% FA复溶,用于外泌体鉴定、肽段浓度检测及质谱分析。After protease hydrolysis into peptides, centrifuge at 14000g for 10 minutes at room temperature (from this step, the centrifuged liquid needs to be retained); add 200μL of water, centrifuge at 14000g for 10 minutes at room temperature, repeat once; transfer the centrifuged liquid to a 1.5mL centrifuge tube and heat dry. Redissolve with 0.1% FA for exosome identification, peptide concentration detection and mass spectrometry analysis.
本实施例中,所述液相分离,以及质谱检测技术定性及定量分析样品中的蛋白质的具体过程为:样品先经纳升级液相在线分离后再检测,数据的采集方式采用数据非依赖型扫描模式(DIA,Data independent acquisition),DIA技术是先利用常规Datadependent Acquisition(DDA)质谱检测技术分析并建立出来图谱库,再采用DIA的方法采集待测样品的质谱数据,在对比DIA数据与DDA的谱图库信息后进行定性及定量分析样品中的蛋白质;In this embodiment, the specific process of the liquid phase separation and mass spectrometry detection technology for qualitative and quantitative analysis of proteins in the sample is as follows: the sample is first separated online by nanoliter liquid phase and then detected, and the data is collected in a data independent acquisition (DIA) scanning mode. The DIA technology first uses conventional Datadependent Acquisition (DDA) mass spectrometry detection technology to analyze and establish a spectrum library, and then uses the DIA method to collect mass spectrum data of the sample to be tested, and qualitatively and quantitatively analyzes the protein in the sample after comparing the DIA data with the spectrum library information of DDA;
EASY-nLC 1200液相系统预柱及分析柱规格如下:The specifications of the pre-column and analytical column of the EASY-nLC 1200 liquid phase system are as follows:
预柱:3μm粒径C18填料,2cm×100μm内径;Pre-column: 3 μm particle size C18 packing, 2 cm × 100 μm inner diameter;
分析柱:1.9μm粒径C18填料,30cm×150μm内径;Analytical column: 1.9 μm particle size C18 packing, 30 cm × 150 μm inner diameter;
流动相A:0.1%甲酸,流速为600nl/min;Mobile phase A: 0.1% formic acid, flow rate 600 nl/min;
流动相B:乙腈及0.1%甲酸(体积比为1000:1);Mobile phase B: acetonitrile and 0.1% formic acid (volume ratio 1000:1);
肽分离洗脱梯度如下:0-6min,7-12% B;6-72min,12-30% B;72-94min,30-42%B;94-95min,42-95% B and 95-100min,95% B;The elution gradient for peptide separation was as follows: 0-6 min, 7-12% B; 6-72 min, 12-30% B; 72-94 min, 30-42% B; 94-95 min, 42-95% B and 95-100 min, 95% B;
质谱采用Q-Exactive HFXTM系统(Thermo Fisher Scientific),谱图库建立采用Data dependent Acquisition(DDA)模式,条件如下:扫描方式为正离子扫描模式,一级质谱采用Orbitrap检测,分辨率m/z 120,000@200,最大注入时间为80ms,扫描范围400-1400;二级扫描分辨率设为15,000@m/z200,起始扫描质量(Fixed first mass)120m/z,最大离子注入时间为45ms,HCD相对碰撞能量为27%,数据采集时采用18s动态排除;The mass spectrometer was performed using the Q-Exactive HFXTM system (Thermo Fisher Scientific). The spectral library was established using the Data dependent Acquisition (DDA) mode with the following conditions: the scanning mode was positive ion scanning mode, the primary mass spectrometer was detected by Orbitrap, the resolution was m/z 120,000@200, the maximum injection time was 80 ms, and the scanning range was 400-1400; the secondary scanning resolution was set to 15,000@m/z200, the starting scanning mass (fixed first mass) was 120 m/z, the maximum ion injection time was 45 ms, the HCD relative collision energy was 27%, and 18 s dynamic exclusion was used during data acquisition;
样品质谱采集:血清/血浆质谱数据采用Data Independent Acquisition(DIA)模式,扫描方式为正离子扫描模式,一级质谱采用Orbitrap检测,分辨率m/z 60,000@200,最大注入时间为80ms,扫描范围400-1200;DIA扫描分辨率设为30,000@m/z 200,起始扫描质量(Fixed first mass)200m/z,最大离子注入时间为45ms;每次采集的DIA隔离窗口为25m/z,隔离窗口数目为32,HCD相对碰撞能量为27%。Sample mass spectrometry acquisition: Serum/plasma mass spectrometry data were acquired in Data Independent Acquisition (DIA) mode, with positive ion scanning mode, and Orbitrap detection for primary mass spectrometry, with a resolution of m/z 60,000@200, a maximum injection time of 80 ms, and a scanning range of 400-1200; the DIA scanning resolution was set to 30,000@m/z 200, the starting scanning mass (Fixed first mass) was 200 m/z, and the maximum ion injection time was 45 ms; the DIA isolation window for each acquisition was 25 m/z, the number of isolation windows was 32, and the HCD relative collision energy was 27%.
本实施例中,所述基于质谱数据进行蛋白质数据库的搜库鉴定的具体过程为:所得质谱数据利用Spectronaut 14.5 200813.47784搜索引擎进行谱图库建立及DIA数据搜索,在Spectronaut软件模板中对数据库搜索的各项参数进行设定:library谱图库建立,“Protein Database”选用Uniprot数据库的人蛋白质序列数据库;在“Enzymes/cleavagerule”中选取Trypsin/p;在“Digest Type”中选取Specific;在“Missed Cleavages”中选取2;在“Precursor Mass Tolerance”中填20ppm;在“Variable Modifications”中选取Acetyl(Protein N-term)、Oxidation(M);在“Fixed Modifications”中选取Carbamidomethyl(C),DIA数据搜索选取上述建立的library谱图库,参数设置选取“BGSFactory Settings(default)”。In this embodiment, the specific process of searching the protein database based on mass spectrometry data is as follows: the obtained mass spectrometry data is used to establish a spectrum library and perform DIA data search using the Spectronaut 14.5 200813.47784 search engine, and various parameters for database search are set in the Spectronaut software template: library spectrum library is established, the human protein sequence database of the Uniprot database is selected in "Protein Database"; Trypsin/p is selected in "Enzymes/cleavagerule"; Specific is selected in "Digest Type"; 2 is selected in "Missed Cleavages"; 20ppm is filled in "Precursor Mass Tolerance"; Acetyl (Protein N-term), Oxidation (M) are selected in "Variable Modifications"; Carbamidomethyl (C) is selected in "Fixed Modifications", the library spectrum library established above is selected for DIA data search, and "BGSFactory Settings (default)" is selected for parameter setting.
本实施例中,所述差异蛋白鉴定及诊断标记物的筛选包括:采用R语言进行机器学习分析,具体描述如下:In this embodiment, the differential protein identification and screening of diagnostic markers include: using R language to perform machine learning analysis, which is specifically described as follows:
蛋白生理范围的构建:由于不同蛋白的丰度、丰度跨度及其离散程度存在明显的差异,因此根据不同蛋白丰度的数据特征使用机器学习算法分别确定正常组蛋白特异性生理范围的上限和下限,建立正常血清蛋白的生理波动范围;Construction of protein physiological range: Since the abundance, abundance span and discreteness of different proteins are significantly different, the upper and lower limits of the normal histone-specific physiological range are determined using machine learning algorithms based on the data characteristics of different protein abundances, and the physiological fluctuation range of normal serum proteins is established;
个体及群体差异蛋白的鉴定:使用某一样本鉴定到的所有蛋白逐一与该蛋白的生理范围进行比较,若某一蛋白在样本中的丰度超过该蛋白生理范围的上限或低于生理范围的下限,则定义该蛋白在样本中为异常,以此方法检出样本中所有的异常蛋白。与蛋白生理范围进行比较,整合样本的异常蛋白信息构建群体的异常蛋白信息关联矩阵,定义在90%以上样本均异常为检出群体水平异常蛋白的标准。Identification of individual and group differential proteins: All proteins identified in a sample are compared with the physiological range of the protein one by one. If the abundance of a protein in a sample exceeds the upper limit of the physiological range of the protein or is lower than the lower limit of the physiological range, the protein is defined as abnormal in the sample. This method is used to detect all abnormal proteins in the sample. By comparing with the physiological range of the protein, the abnormal protein information of the sample is integrated to construct the abnormal protein information association matrix of the group, and more than 90% of the samples are defined as abnormal as the standard for detecting abnormal proteins at the group level.
各组样本外泌体蛋白质鉴定数量:Number of exosome proteins identified in each group of samples:
应用液相色谱-串联质谱(liquid chromatography-tandem mass spectrometry,LC-MS/MS)技术鉴定AIS患者和对照组个体血浆样本中外泌体蛋白,通过搜库后鉴定到各个组的血浆样本富集的外泌体的蛋白数量:对照组(n=20)、AIS随访组(n=11)和手术组(n=10)中鉴定到的蛋白数量平均值分别为1398.6±262.89、1112.09±313.85和1288.9±269.24,所有样本共鉴定到蛋白数量2437个;Liquid chromatography-tandem mass spectrometry (LC-MS/MS) was used to identify exosome proteins in plasma samples of AIS patients and the control group. After searching the database, the number of proteins enriched in the plasma samples of each group was identified: the average number of proteins identified in the control group (n=20), AIS follow-up group (n=11) and surgery group (n=10) was 1398.6±262.89, 1112.09±313.85 and 1288.9±269.24, respectively. A total of 2437 proteins were identified in all samples.
AIS组和对照组之间差异蛋白的生物信息学分析(采用PCA分析、聚类分析、GO分析和KEGG分析方法):Bioinformatics analysis of differentially expressed proteins between the AIS group and the control group (using PCA analysis, cluster analysis, GO analysis, and KEGG analysis methods):
对于鉴定到的2437种蛋白首先进行AIS组(n=21)和对照组(n=20)之间比较,以两组间蛋白表达量比值>1.5或<0.67作为筛选差异蛋白的标准。与对照组相比,AIS组共鉴定319个差异蛋白,其中254个蛋白表达下调,65个蛋白表达上调(:1A),通过这些差异蛋白进行PCA分析结果如图1B所示,PCA分析未能较好的区分两组。对AIS组和对照组的差异蛋白丰度的热图分析如图1C所示。For the 2437 proteins identified, the AIS group (n=21) and the control group (n=20) were first compared, and the protein expression ratio between the two groups was >1.5 or <0.67 as the standard for screening differential proteins. Compared with the control group, a total of 319 differential proteins were identified in the AIS group, of which 254 proteins were downregulated and 65 proteins were upregulated (:1A). The results of PCA analysis of these differential proteins are shown in Figure 1B. PCA analysis failed to distinguish the two groups well. The heat map analysis of the differential protein abundance between the AIS group and the control group is shown in Figure 1C.
对两组间差异表达的319个蛋白进行GO分析,共获得2349条GO功能注释,分别参与了1537种生物过程(biological process,BP)、381种细胞组成(cellular component,CC)和429种分子功能(molecular function,MF)。如图2A所示,这些差异蛋白参与的生物过程包括初始免疫反应、补体激活、血小板降解、受体介导内吞、细胞外基质的组织和免疫反应等;细胞组成主要为细胞外泌体、细胞膜、胞浆和细胞外区域等;分子功能主要包括ATP结合、金属离子结合、DNA结合、丝氨酸样内切酶活性、包括钙黏蛋白结合在内的细胞间黏附、抗原结合、钙离子结合、肌动蛋白结合和泛素蛋白连接酶结合等。KEGG通路分析表明,差异蛋白参与了16条信号通路,包括细胞外基质受体相互作用、补体和凝血通路等(图2B)。GO analysis was performed on 319 differentially expressed proteins between the two groups, and a total of 2349 GO functional annotations were obtained, which were involved in 1537 biological processes (BP), 381 cellular components (CC) and 429 molecular functions (MF). As shown in Figure 2A, the biological processes involved in these differentially expressed proteins include initial immune response, complement activation, platelet degradation, receptor-mediated endocytosis, extracellular matrix organization and immune response, etc.; the cellular components are mainly cell exosomes, cell membrane, cytoplasm and extracellular regions, etc.; the molecular functions mainly include ATP binding, metal ion binding, DNA binding, serine-like endonuclease activity, cell-to-cell adhesion including cadherin binding, antigen binding, calcium ion binding, actin binding and ubiquitin protein ligase binding, etc. KEGG pathway analysis showed that the differentially expressed proteins were involved in 16 signaling pathways, including extracellular matrix receptor interaction, complement and coagulation pathways, etc. (Figure 2B).
通过多模型分析AIS组和对照组的差异蛋白得到25个代表性蛋白:The differential proteins between the AIS group and the control group were analyzed by multi-model analysis to obtain 25 representative proteins:
对于鉴定到的2437个蛋白首先经过单维统计和多维统计分析,再采用偏最小二乘回归分析法(partial least-squares regression analysis,PLS-DA)、支持向量机模型(Support Vector Machine,SVM)和随机森林(Random forest,RF)模型进行分析,获得25个AIS组和对照组之间的差异蛋白,综合包括CILP-1等这25种差异蛋白进行ROC分析得到曲线下面积为0.969(图3);The 2437 proteins identified were first analyzed by unidimensional and multidimensional statistics, and then analyzed by partial least-squares regression analysis (PLS-DA), support vector machine model (SVM) and random forest (RF) model to obtain 25 differential proteins between the AIS group and the control group. The ROC analysis of these 25 differential proteins including CILP-1 showed an area under the curve of 0.969 (Figure 3).
AIS随访组和对照组之间差异蛋白的生物信息学分析(采用PCA分析、聚类分析、GO分析和KEGG分析方法):Bioinformatics analysis of differentially expressed proteins between the AIS follow-up group and the control group (using PCA analysis, cluster analysis, GO analysis, and KEGG analysis methods):
比较AIS随访组和对照组血浆外泌体鉴定到的蛋白,选择比值>1.5或<0.67作为标准。与对照组相比,AIS随访组共有398个差异蛋白,其中356个蛋白表达下调,42个蛋白表达上调(图4A),通过这些差异蛋白进行主成分分析(principal component analysis,PCA),结果如图4B所示,对AIS随访组和对照组的差异蛋白丰度进行聚类分析,热图结果如图4C所示;The proteins identified in plasma exosomes of the AIS follow-up group and the control group were compared, and the ratio >1.5 or <0.67 was selected as the standard. Compared with the control group, there were 398 differentially expressed proteins in the AIS follow-up group, of which 356 proteins were down-regulated and 42 proteins were up-regulated (Figure 4A). Principal component analysis (PCA) was performed on these differentially expressed proteins, and the results are shown in Figure 4B. Cluster analysis was performed on the differentially expressed protein abundance between the AIS follow-up group and the control group, and the heat map results are shown in Figure 4C;
对两组间差异表达的398个蛋白进行GO分析,共获得2662条GO功能注释(图5A),分别参与了1741种生物过程(biological process,BP)、430种细胞组成(cellularcomponent,CC)和491种分子功能(molecular function,MF)。这些差异蛋白参与的生物过程、细胞组成和分子功能与AIS组整体和对照组的差异蛋白结果存在较多相似。KEGG通路分析表明,差异蛋白参与了15条信号通路,与AIS组整体和对照组的差异也存在较多相似(图5B)。GO analysis was performed on 398 differentially expressed proteins between the two groups, and a total of 2662 GO functional annotations were obtained (Figure 5A), which were involved in 1741 biological processes (BP), 430 cellular components (CC), and 491 molecular functions (MF). The biological processes, cellular components, and molecular functions involved in these differentially expressed proteins were similar to the differentially expressed proteins in the AIS group as a whole and the control group. KEGG pathway analysis showed that the differentially expressed proteins were involved in 15 signaling pathways, which were also similar to the differences between the AIS group as a whole and the control group (Figure 5B).
机器学习筛选AIS随访组和对照组之间的差异蛋白:Machine learning screening of differentially expressed proteins between the AIS follow-up group and the control group:
为进一步探索AIS的发病机制以及AIS早期诊断的疾病标记物,我们采用机器学习的方法进行差异蛋白的筛选,从2437个蛋白筛选得到在AIS随访组和对照组在72%以上的样本中表达差异的25个蛋白。To further explore the pathogenesis of AIS and disease markers for early diagnosis of AIS, we used machine learning methods to screen differentially expressed proteins. From 2437 proteins, we obtained 25 proteins that were differentially expressed in more than 72% of the samples between the AIS follow-up group and the control group.
我们进一步对这25个蛋白在AIS随访组中的表达情况进行分析,发现这25个蛋白中有6个蛋白(VDCA1、PARVB、KV122、CO4A、GDIB和COPG1)在AIS随访组异常下调,其余19个蛋白在AIS随访组异常上调。值得注意的是,ACTC在11个AIS随访组的样本中均为高表达,而CILP-1在8个AIS随访组样本中高表达(图6A)。25个差异蛋白在对照组、AIS随访组和AIS手术组的热图如图6B所示,同样可见ACTC在21个AIS患者样本中表达水平均高于对照组,CILP-1在大部分AIS患者也均相对高表达。ACTC蛋白在AIS随访组进行ROC曲线下面积分析结果为1,CILP-1蛋白在AIS随访组ROC曲线下面积为0.85(图7)。ACTC蛋白作为所有AIS患者筛查蛋白的(随访组和手术组)ROC曲线下面积分析,结果仍为1,而CILP-1蛋白的ROC曲线下面积为0.817(图8)。We further analyzed the expression of these 25 proteins in the AIS follow-up group and found that 6 of the 25 proteins (VDCA1, PARVB, KV122, CO4A, GDIB and COPG1) were abnormally downregulated in the AIS follow-up group, and the remaining 19 proteins were abnormally upregulated in the AIS follow-up group. It is worth noting that ACTC was highly expressed in all 11 samples of the AIS follow-up group, while CILP-1 was highly expressed in 8 samples of the AIS follow-up group (Figure 6A). The heat map of the 25 differentially expressed proteins in the control group, AIS follow-up group and AIS surgery group is shown in Figure 6B. It can also be seen that the expression level of ACTC in 21 AIS patient samples was higher than that in the control group, and CILP-1 was also relatively highly expressed in most AIS patients. The area under the ROC curve analysis of ACTC protein in the AIS follow-up group was 1, and the area under the ROC curve of CILP-1 protein in the AIS follow-up group was 0.85 (Figure 7). The area under the ROC curve analysis of ACTC protein as a screening protein for all AIS patients (follow-up group and surgery group) was still 1, while the area under the ROC curve of CILP-1 protein was 0.817 (Figure 8).
以上所述,仅为本发明较佳的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,根据本发明的技术方案及其发明构思加以等同替换或改变,都应涵盖在本发明的保护范围之内。The above description is only a preferred specific implementation manner of the present invention, but the protection scope of the present invention is not limited thereto. Any technician familiar with the technical field can make equivalent replacements or changes according to the technical scheme and inventive concept of the present invention within the technical scope disclosed by the present invention, which should be covered by the protection scope of the present invention.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202410685032.3A CN118671353A (en) | 2024-05-30 | 2024-05-30 | Biomarker for AIS diagnosis and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202410685032.3A CN118671353A (en) | 2024-05-30 | 2024-05-30 | Biomarker for AIS diagnosis and application thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN118671353A true CN118671353A (en) | 2024-09-20 |
Family
ID=92717220
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202410685032.3A Pending CN118671353A (en) | 2024-05-30 | 2024-05-30 | Biomarker for AIS diagnosis and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN118671353A (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090263800A1 (en) * | 2001-06-08 | 2009-10-22 | Xdx, Inc. | Methods and compositions for diagnosing and monitoring transplant rejection |
US20210403557A1 (en) * | 2018-11-05 | 2021-12-30 | Merck Sharp & Dohme Corp. | Dosing regimen of anti-tigit antibody for treatment of cancer |
CN116113440A (en) * | 2020-06-17 | 2023-05-12 | 阿塞勒隆制药公司 | ACTRII-ALK4 antagonists and methods of treating heart failure |
-
2024
- 2024-05-30 CN CN202410685032.3A patent/CN118671353A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090263800A1 (en) * | 2001-06-08 | 2009-10-22 | Xdx, Inc. | Methods and compositions for diagnosing and monitoring transplant rejection |
US20210403557A1 (en) * | 2018-11-05 | 2021-12-30 | Merck Sharp & Dohme Corp. | Dosing regimen of anti-tigit antibody for treatment of cancer |
CN116113440A (en) * | 2020-06-17 | 2023-05-12 | 阿塞勒隆制药公司 | ACTRII-ALK4 antagonists and methods of treating heart failure |
Non-Patent Citations (1)
Title |
---|
CHI WANG 等: "Plasma exosome proteomics and CILP-1/TGF-β1 pathway in paraspinal muscle of adolescent idiopathic scoliosis", SSRN, 10 October 2022 (2022-10-10), pages 1 - 54 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111289736A (en) | Metabolomics-based markers for early diagnosis of chronic obstructive pulmonary disease and their applications | |
WO2011157655A1 (en) | Use of bile acids for prediction of an onset of sepsis | |
CN113156018B (en) | Method for establishing liver and gall disease diagnosis model and diagnosis system | |
CN108334752B (en) | Method for establishing quantitative reference range of healthy human urine proteome and healthy human urine proteome database | |
CN113008972B (en) | Serum metabolic marker for gestational diabetes diagnosis and application thereof | |
CN106645757B (en) | A kind of serum protein markers group and its application diagnosing early onset diabetes MODY | |
CN113009162A (en) | Serum metabolic marker for diagnosing gestational diabetes and application thereof | |
CN110057954A (en) | Blood plasma metabolic markers are in the application for diagnosing or monitoring HBV | |
CN115267214A (en) | Lipid metabolism marker, screening method thereof and application of lipid metabolism marker in intracranial aneurysm | |
JP2024529555A (en) | Biomarkers for predicting or monitoring recurrence of NMOSD and their uses | |
CN105308455B (en) | Methods and compositions for diagnosing preeclampsia | |
US20160018413A1 (en) | Methods of Prognosing Preeclampsia | |
CN102481319A (en) | Apolipoprotein ciii in pre- and type 2 diabetes | |
CN109946411A (en) | Biomarkers for the diagnosis of ossification of the ligamentum flavum of the thoracic spine and their screening methods | |
CN118671353A (en) | Biomarker for AIS diagnosis and application thereof | |
US20070184511A1 (en) | Method for Diagnosing a Person Having Sjogren's Syndrome | |
CN116754772B (en) | Peripheral blood protein markers for early diagnosis of Alzheimer's disease, their application and auxiliary diagnosis system | |
CN109975553B (en) | Urine protein marker of chronic obstructive pulmonary disease and application thereof in early diagnosis | |
CN108334747B (en) | Method for obtaining tumor urine protein marker and obtained tumor-related outlier urine protein library | |
JP2014032046A (en) | Marker for identification of diabetic renal diseases, and use thereof | |
JP2023159312A (en) | Method for preparing protein markers for detecting atopic dermatitis | |
US20070249000A1 (en) | Method for diagnosing a person having b-cell pathologies | |
CN109946467A (en) | A biomarker for the diagnosis of ossification of the ligamentum flavum of the thoracic spine | |
CN115575553A (en) | Application and detection system of serum metabolic markers as detection of EGFR mutation | |
CN114047263A (en) | Application of metabolic marker in preparation of detection reagent or detection object for diagnosing AIS (automatic identification system) and kit |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |