WO2022133281A1 - Compositions and methods for the treatment of human immunodeficiency virus - Google Patents
Compositions and methods for the treatment of human immunodeficiency virus Download PDFInfo
- Publication number
- WO2022133281A1 WO2022133281A1 PCT/US2021/064151 US2021064151W WO2022133281A1 WO 2022133281 A1 WO2022133281 A1 WO 2022133281A1 US 2021064151 W US2021064151 W US 2021064151W WO 2022133281 A1 WO2022133281 A1 WO 2022133281A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- optionally substituted
- conjugate
- amino acid
- acid sequence
- formula
- Prior art date
Links
- 239000000203 mixture Substances 0.000 title claims abstract description 119
- 238000000034 method Methods 0.000 title claims abstract description 110
- 238000011282 treatment Methods 0.000 title claims abstract description 21
- 241000725303 Human immunodeficiency virus Species 0.000 title claims description 42
- 108010088751 Albumins Proteins 0.000 claims abstract description 288
- 239000000178 monomer Substances 0.000 claims abstract description 279
- 230000027455 binding Effects 0.000 claims abstract description 211
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 195
- 208000036142 Viral infection Diseases 0.000 claims abstract description 38
- 230000009385 viral infection Effects 0.000 claims abstract description 38
- 208000031886 HIV Infections Diseases 0.000 claims abstract description 13
- 239000003112 inhibitor Substances 0.000 claims abstract description 10
- 125000005647 linker group Chemical group 0.000 claims description 227
- 150000003839 salts Chemical class 0.000 claims description 162
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 claims description 122
- 150000001875 compounds Chemical class 0.000 claims description 89
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 claims description 67
- 125000003118 aryl group Chemical group 0.000 claims description 66
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 claims description 57
- 125000004429 atom Chemical group 0.000 claims description 55
- 235000018417 cysteine Nutrition 0.000 claims description 54
- 125000001072 heteroaryl group Chemical group 0.000 claims description 52
- 239000000872 buffer Substances 0.000 claims description 50
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 46
- 229910052717 sulfur Inorganic materials 0.000 claims description 43
- 125000005549 heteroarylene group Chemical group 0.000 claims description 42
- 125000004450 alkenylene group Chemical group 0.000 claims description 41
- 239000004472 Lysine Substances 0.000 claims description 37
- 125000000732 arylene group Chemical group 0.000 claims description 37
- 150000001412 amines Chemical class 0.000 claims description 35
- 125000004419 alkynylene group Chemical group 0.000 claims description 33
- 229910052760 oxygen Inorganic materials 0.000 claims description 32
- 125000003837 (C1-C20) alkyl group Chemical group 0.000 claims description 31
- 125000000524 functional group Chemical group 0.000 claims description 31
- 229910052757 nitrogen Inorganic materials 0.000 claims description 26
- 125000004433 nitrogen atom Chemical group N* 0.000 claims description 26
- 150000001732 carboxylic acid derivatives Chemical class 0.000 claims description 24
- 125000004434 sulfur atom Chemical group 0.000 claims description 24
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 claims description 23
- 125000000472 sulfonyl group Chemical group *S(*)(=O)=O 0.000 claims description 22
- 125000003358 C2-C20 alkenyl group Chemical group 0.000 claims description 20
- 125000001841 imino group Chemical group [H]N=* 0.000 claims description 20
- 239000002904 solvent Substances 0.000 claims description 19
- 230000002194 synthesizing effect Effects 0.000 claims description 19
- 229910019142 PO4 Inorganic materials 0.000 claims description 18
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 claims description 18
- 239000010452 phosphate Substances 0.000 claims description 18
- LFGREXWGYUGZLY-UHFFFAOYSA-N phosphoryl Chemical group [P]=O LFGREXWGYUGZLY-UHFFFAOYSA-N 0.000 claims description 18
- 125000002813 thiocarbonyl group Chemical group *C(*)=S 0.000 claims description 18
- PEEHTFAAVSWFBL-UHFFFAOYSA-N Maleimide Chemical compound O=C1NC(=O)C=C1 PEEHTFAAVSWFBL-UHFFFAOYSA-N 0.000 claims description 17
- 125000002993 cycloalkylene group Chemical group 0.000 claims description 17
- 125000006735 (C1-C20) heteroalkyl group Chemical group 0.000 claims description 16
- 150000001345 alkine derivatives Chemical class 0.000 claims description 15
- 229910052799 carbon Inorganic materials 0.000 claims description 15
- 239000003443 antiviral agent Substances 0.000 claims description 14
- 125000004474 heteroalkylene group Chemical group 0.000 claims description 14
- 150000003573 thiols Chemical class 0.000 claims description 14
- 125000000449 nitro group Chemical group [O-][N+](*)=O 0.000 claims description 13
- 150000001540 azides Chemical class 0.000 claims description 12
- 125000004169 (C1-C6) alkyl group Chemical group 0.000 claims description 11
- 239000008194 pharmaceutical composition Substances 0.000 claims description 11
- 229940122313 Nucleoside reverse transcriptase inhibitor Drugs 0.000 claims description 10
- 125000004093 cyano group Chemical group *C#N 0.000 claims description 10
- 125000006588 heterocycloalkylene group Chemical group 0.000 claims description 10
- 239000012948 isocyanate Substances 0.000 claims description 9
- 150000002513 isocyanates Chemical class 0.000 claims description 9
- 239000003419 rna directed dna polymerase inhibitor Substances 0.000 claims description 9
- 125000006716 (C1-C6) heteroalkyl group Chemical group 0.000 claims description 8
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 claims description 8
- 150000001336 alkenes Chemical class 0.000 claims description 8
- 150000001408 amides Chemical class 0.000 claims description 8
- 239000003814 drug Substances 0.000 claims description 8
- 150000003460 sulfonic acids Chemical class 0.000 claims description 8
- 241000713772 Human immunodeficiency virus 1 Species 0.000 claims description 7
- 241000713340 Human immunodeficiency virus 2 Species 0.000 claims description 7
- DPOPAJRDYZGTIR-UHFFFAOYSA-N Tetrazine Chemical compound C1=CN=NN=N1 DPOPAJRDYZGTIR-UHFFFAOYSA-N 0.000 claims description 7
- 125000003161 (C1-C6) alkylene group Chemical group 0.000 claims description 6
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 claims description 6
- 229940124597 therapeutic agent Drugs 0.000 claims description 6
- QXYLYYZZWZQACI-UHFFFAOYSA-N 2,3,4,5-tetrafluorophenol Chemical compound OC1=CC(F)=C(F)C(F)=C1F QXYLYYZZWZQACI-UHFFFAOYSA-N 0.000 claims description 5
- IJGSULQFKYOYEU-UHFFFAOYSA-N 2,3,4-trifluorophenol Chemical compound OC1=CC=C(F)C(F)=C1F IJGSULQFKYOYEU-UHFFFAOYSA-N 0.000 claims description 5
- KXDHJXZQYSOELW-UHFFFAOYSA-M Carbamate Chemical compound NC([O-])=O KXDHJXZQYSOELW-UHFFFAOYSA-M 0.000 claims description 5
- 206010028980 Neoplasm Diseases 0.000 claims description 5
- ZCIGNRJZKPOIKD-CQXVEOKZSA-N cobicistat Chemical compound S1C(C(C)C)=NC(CN(C)C(=O)N[C@@H](CCN2CCOCC2)C(=O)N[C@H](CC[C@H](CC=2C=CC=CC=2)NC(=O)OCC=2SC=NC=2)CC=2C=CC=CC=2)=C1 ZCIGNRJZKPOIKD-CQXVEOKZSA-N 0.000 claims description 5
- 229960002402 cobicistat Drugs 0.000 claims description 5
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 5
- 229910052736 halogen Inorganic materials 0.000 claims description 5
- 150000002367 halogens Chemical class 0.000 claims description 5
- IDCLTMRSSAXUNY-UHFFFAOYSA-N 5-hydroxylansoprazole Chemical compound CC1=C(OCC(F)(F)F)C=CN=C1CS(=O)C1=NC2=CC(O)=CC=C2N1 IDCLTMRSSAXUNY-UHFFFAOYSA-N 0.000 claims description 4
- XQSPYNMVSIKCOC-NTSWFWBYSA-N Emtricitabine Chemical compound C1=C(F)C(N)=NC(=O)N1[C@H]1O[C@@H](CO)SC1 XQSPYNMVSIKCOC-NTSWFWBYSA-N 0.000 claims description 4
- 201000011510 cancer Diseases 0.000 claims description 4
- 239000004202 carbamide Substances 0.000 claims description 4
- WHBIGIKBNXZKFE-UHFFFAOYSA-N delavirdine Chemical compound CC(C)NC1=CC=CN=C1N1CCN(C(=O)C=2NC3=CC=C(NS(C)(=O)=O)C=C3C=2)CC1 WHBIGIKBNXZKFE-UHFFFAOYSA-N 0.000 claims description 4
- 201000010099 disease Diseases 0.000 claims description 4
- 229960000366 emtricitabine Drugs 0.000 claims description 4
- 229940124524 integrase inhibitor Drugs 0.000 claims description 4
- 239000002850 integrase inhibitor Substances 0.000 claims description 4
- NQDJXKOVJZTUJA-UHFFFAOYSA-N nevirapine Chemical compound C12=NC=CC=C2C(=O)NC=2C(C)=CC=NC=2N1C1CC1 NQDJXKOVJZTUJA-UHFFFAOYSA-N 0.000 claims description 4
- 239000000137 peptide hydrolase inhibitor Substances 0.000 claims description 4
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 4
- 125000003396 thiol group Chemical group [H]S* 0.000 claims description 4
- 125000003860 C1-C20 alkoxy group Chemical group 0.000 claims description 3
- 206010061598 Immunodeficiency Diseases 0.000 claims description 3
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 claims description 3
- 238000002347 injection Methods 0.000 claims description 3
- 239000007924 injection Substances 0.000 claims description 3
- 150000002825 nitriles Chemical class 0.000 claims description 3
- 229940042402 non-nucleoside reverse transcriptase inhibitor Drugs 0.000 claims description 3
- 239000002726 nonnucleoside reverse transcriptase inhibitor Substances 0.000 claims description 3
- 210000000056 organ Anatomy 0.000 claims description 3
- 230000007502 viral entry Effects 0.000 claims description 3
- 229960002555 zidovudine Drugs 0.000 claims description 3
- HBOMLICNUCNMMY-XLPZGREQSA-N zidovudine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](N=[N+]=[N-])C1 HBOMLICNUCNMMY-XLPZGREQSA-N 0.000 claims description 3
- 206010053622 Asplenia Diseases 0.000 claims description 2
- AXRYRYVKAWYZBR-UHFFFAOYSA-N Atazanavir Natural products C=1C=C(C=2N=CC=CC=2)C=CC=1CN(NC(=O)C(NC(=O)OC)C(C)(C)C)CC(O)C(NC(=O)C(NC(=O)OC)C(C)(C)C)CC1=CC=CC=C1 AXRYRYVKAWYZBR-UHFFFAOYSA-N 0.000 claims description 2
- 108010019625 Atazanavir Sulfate Proteins 0.000 claims description 2
- BXZVVICBKDXVGW-NKWVEPMBSA-N Didanosine Chemical compound O1[C@H](CO)CC[C@@H]1N1C(NC=NC2=O)=C2N=C1 BXZVVICBKDXVGW-NKWVEPMBSA-N 0.000 claims description 2
- XPOQHMRABVBWPR-UHFFFAOYSA-N Efavirenz Natural products O1C(=O)NC2=CC=C(Cl)C=C2C1(C(F)(F)F)C#CC1CC1 XPOQHMRABVBWPR-UHFFFAOYSA-N 0.000 claims description 2
- 208000028523 Hereditary Complement Deficiency disease Diseases 0.000 claims description 2
- 206010062016 Immunosuppression Diseases 0.000 claims description 2
- 206010025323 Lymphomas Diseases 0.000 claims description 2
- 208000034578 Multiple myelomas Diseases 0.000 claims description 2
- 206010035226 Plasma cell myeloma Diseases 0.000 claims description 2
- XNKLLVCARDGLGL-JGVFFNPUSA-N Stavudine Chemical compound O=C1NC(=O)C(C)=CN1[C@H]1C=C[C@@H](CO)O1 XNKLLVCARDGLGL-JGVFFNPUSA-N 0.000 claims description 2
- 201000001322 T cell deficiency Diseases 0.000 claims description 2
- MCGSCOLBFJQGHM-SCZZXKLOSA-N abacavir Chemical compound C=12N=CN([C@H]3C=C[C@@H](CO)C3)C2=NC(N)=NC=1NC1CC1 MCGSCOLBFJQGHM-SCZZXKLOSA-N 0.000 claims description 2
- 229960004748 abacavir Drugs 0.000 claims description 2
- 229960003277 atazanavir Drugs 0.000 claims description 2
- AXRYRYVKAWYZBR-GASGPIRDSA-N atazanavir Chemical group C([C@H](NC(=O)[C@@H](NC(=O)OC)C(C)(C)C)[C@@H](O)CN(CC=1C=CC(=CC=1)C=1N=CC=CC=1)NC(=O)[C@@H](NC(=O)OC)C(C)(C)C)C1=CC=CC=C1 AXRYRYVKAWYZBR-GASGPIRDSA-N 0.000 claims description 2
- 201000002388 complement deficiency Diseases 0.000 claims description 2
- 229960005107 darunavir Drugs 0.000 claims description 2
- CJBJHOAVZSMMDJ-HEXNFIEUSA-N darunavir Chemical compound C([C@@H]([C@H](O)CN(CC(C)C)S(=O)(=O)C=1C=CC(N)=CC=1)NC(=O)O[C@@H]1[C@@H]2CCO[C@@H]2OC1)C1=CC=CC=C1 CJBJHOAVZSMMDJ-HEXNFIEUSA-N 0.000 claims description 2
- 230000007812 deficiency Effects 0.000 claims description 2
- 229960005319 delavirdine Drugs 0.000 claims description 2
- 229960002656 didanosine Drugs 0.000 claims description 2
- 229960002542 dolutegravir Drugs 0.000 claims description 2
- RHWKPHLQXYSBKR-BMIGLBTASA-N dolutegravir Chemical group C([C@@H]1OCC[C@H](N1C(=O)C1=C(O)C2=O)C)N1C=C2C(=O)NCC1=CC=C(F)C=C1F RHWKPHLQXYSBKR-BMIGLBTASA-N 0.000 claims description 2
- 229960003804 efavirenz Drugs 0.000 claims description 2
- XPOQHMRABVBWPR-ZDUSSCGKSA-N efavirenz Chemical compound C([C@]1(C2=CC(Cl)=CC=C2NC(=O)O1)C(F)(F)F)#CC1CC1 XPOQHMRABVBWPR-ZDUSSCGKSA-N 0.000 claims description 2
- 229960003586 elvitegravir Drugs 0.000 claims description 2
- JUZYLCPPVHEVSV-LJQANCHMSA-N elvitegravir Chemical compound COC1=CC=2N([C@H](CO)C(C)C)C=C(C(O)=O)C(=O)C=2C=C1CC1=CC=CC(Cl)=C1F JUZYLCPPVHEVSV-LJQANCHMSA-N 0.000 claims description 2
- 229960002049 etravirine Drugs 0.000 claims description 2
- PYGWGZALEOIKDF-UHFFFAOYSA-N etravirine Chemical compound CC1=CC(C#N)=CC(C)=C1OC1=NC(NC=2C=CC(=CC=2)C#N)=NC(N)=C1Br PYGWGZALEOIKDF-UHFFFAOYSA-N 0.000 claims description 2
- 238000011134 hematopoietic stem cell transplantation Methods 0.000 claims description 2
- 230000001506 immunosuppresive effect Effects 0.000 claims description 2
- 238000002650 immunosuppressive therapy Methods 0.000 claims description 2
- 238000001802 infusion Methods 0.000 claims description 2
- 229960001627 lamivudine Drugs 0.000 claims description 2
- JTEGQNOMFQHVDC-NKWVEPMBSA-N lamivudine Chemical compound O=C1N=C(N)C=CN1[C@H]1O[C@@H](CO)SC1 JTEGQNOMFQHVDC-NKWVEPMBSA-N 0.000 claims description 2
- 208000032839 leukemia Diseases 0.000 claims description 2
- 208000004235 neutropenia Diseases 0.000 claims description 2
- 229960000689 nevirapine Drugs 0.000 claims description 2
- 238000011321 prophylaxis Methods 0.000 claims description 2
- 229960004742 raltegravir Drugs 0.000 claims description 2
- CZFFBEXEKNGXKS-UHFFFAOYSA-N raltegravir Chemical compound O1C(C)=NN=C1C(=O)NC(C)(C)C1=NC(C(=O)NCC=2C=CC(F)=CC=2)=C(O)C(=O)N1C CZFFBEXEKNGXKS-UHFFFAOYSA-N 0.000 claims description 2
- 229960002814 rilpivirine Drugs 0.000 claims description 2
- YIBOMRUWOWDFLG-ONEGZZNKSA-N rilpivirine Chemical compound CC1=CC(\C=C\C#N)=CC(C)=C1NC1=CC=NC(NC=2C=CC(=CC=2)C#N)=N1 YIBOMRUWOWDFLG-ONEGZZNKSA-N 0.000 claims description 2
- 229960001203 stavudine Drugs 0.000 claims description 2
- 229960004556 tenofovir Drugs 0.000 claims description 2
- VCMJCVGFSROFHV-WZGZYPNHSA-N tenofovir disoproxil fumarate Chemical compound OC(=O)\C=C\C(O)=O.N1=CN=C2N(C[C@@H](C)OCP(=O)(OCOC(=O)OC(C)C)OCOC(=O)OC(C)C)C=NC2=C1N VCMJCVGFSROFHV-WZGZYPNHSA-N 0.000 claims description 2
- 102100027211 Albumin Human genes 0.000 claims 2
- 208000009011 Cytochrome P-450 CYP3A Inhibitors Diseases 0.000 claims 2
- NCDNCNXCDXHOMX-UHFFFAOYSA-N Ritonavir Natural products C=1C=CC=CC=1CC(NC(=O)OCC=1SC=NC=1)C(O)CC(CC=1C=CC=CC=1)NC(=O)C(C(C)C)NC(=O)N(C)CC1=CSC(C(C)C)=N1 NCDNCNXCDXHOMX-UHFFFAOYSA-N 0.000 claims 2
- 239000003067 chemokine receptor CCR5 antagonist Substances 0.000 claims 2
- 125000001475 halogen functional group Chemical group 0.000 claims 2
- 229960000311 ritonavir Drugs 0.000 claims 2
- NCDNCNXCDXHOMX-XGKFQTDJSA-N ritonavir Chemical compound N([C@@H](C(C)C)C(=O)N[C@H](C[C@H](O)[C@H](CC=1C=CC=CC=1)NC(=O)OCC=1SC=NC=1)CC=1C=CC=CC=1)C(=O)N(C)CC1=CSC(C(C)C)=N1 NCDNCNXCDXHOMX-XGKFQTDJSA-N 0.000 claims 2
- QAGYKUNXZHXKMR-UHFFFAOYSA-N CPD000469186 Natural products CC1=C(O)C=CC=C1C(=O)NC(C(O)CN1C(CC2CCCCC2C1)C(=O)NC(C)(C)C)CSC1=CC=CC=C1 QAGYKUNXZHXKMR-UHFFFAOYSA-N 0.000 claims 1
- 108010032976 Enfuvirtide Proteins 0.000 claims 1
- KJHKTHWMRKYKJE-SUGCFTRWSA-N Kaletra Chemical compound N1([C@@H](C(C)C)C(=O)N[C@H](C[C@H](O)[C@H](CC=2C=CC=CC=2)NC(=O)COC=2C(=CC=CC=2C)C)CC=2C=CC=CC=2)CCCNC1=O KJHKTHWMRKYKJE-SUGCFTRWSA-N 0.000 claims 1
- SUJUHGSWHZTSEU-UHFFFAOYSA-N Tipranavir Natural products C1C(O)=C(C(CC)C=2C=C(NS(=O)(=O)C=3N=CC(=CC=3)C(F)(F)F)C=CC=2)C(=O)OC1(CCC)CCC1=CC=CC=C1 SUJUHGSWHZTSEU-UHFFFAOYSA-N 0.000 claims 1
- PEASPLKKXBYDKL-FXEVSJAOSA-N enfuvirtide Chemical group C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(C)=O)[C@@H](C)O)[C@@H](C)CC)C1=CN=CN1 PEASPLKKXBYDKL-FXEVSJAOSA-N 0.000 claims 1
- 229960002062 enfuvirtide Drugs 0.000 claims 1
- 229960003142 fosamprenavir Drugs 0.000 claims 1
- MLBVMOWEQCZNCC-OEMFJLHTSA-N fosamprenavir Chemical compound C([C@@H]([C@H](OP(O)(O)=O)CN(CC(C)C)S(=O)(=O)C=1C=CC(N)=CC=1)NC(=O)O[C@@H]1COCC1)C1=CC=CC=C1 MLBVMOWEQCZNCC-OEMFJLHTSA-N 0.000 claims 1
- 229960001936 indinavir Drugs 0.000 claims 1
- CBVCZFGXHXORBI-PXQQMZJSSA-N indinavir Chemical compound C([C@H](N(CC1)C[C@@H](O)C[C@@H](CC=2C=CC=CC=2)C(=O)N[C@H]2C3=CC=CC=C3C[C@H]2O)C(=O)NC(C)(C)C)N1CC1=CC=CN=C1 CBVCZFGXHXORBI-PXQQMZJSSA-N 0.000 claims 1
- 229960004525 lopinavir Drugs 0.000 claims 1
- 229960004710 maraviroc Drugs 0.000 claims 1
- GSNHKUDZZFZSJB-QYOOZWMWSA-N maraviroc Chemical group CC(C)C1=NN=C(C)N1[C@@H]1C[C@H](N2CC[C@H](NC(=O)C3CCC(F)(F)CC3)C=3C=CC=CC=3)CC[C@H]2C1 GSNHKUDZZFZSJB-QYOOZWMWSA-N 0.000 claims 1
- 229960000884 nelfinavir Drugs 0.000 claims 1
- QAGYKUNXZHXKMR-HKWSIXNMSA-N nelfinavir Chemical compound CC1=C(O)C=CC=C1C(=O)N[C@H]([C@H](O)CN1[C@@H](C[C@@H]2CCCC[C@@H]2C1)C(=O)NC(C)(C)C)CSC1=CC=CC=C1 QAGYKUNXZHXKMR-HKWSIXNMSA-N 0.000 claims 1
- 229960001852 saquinavir Drugs 0.000 claims 1
- QWAXKHKRTORLEM-UGJKXSETSA-N saquinavir Chemical compound C([C@@H]([C@H](O)CN1C[C@H]2CCCC[C@H]2C[C@H]1C(=O)NC(C)(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)C=1N=C2C=CC=CC2=CC=1)C1=CC=CC=C1 QWAXKHKRTORLEM-UGJKXSETSA-N 0.000 claims 1
- 229960000838 tipranavir Drugs 0.000 claims 1
- SUJUHGSWHZTSEU-FYBSXPHGSA-N tipranavir Chemical compound C([C@@]1(CCC)OC(=O)C([C@H](CC)C=2C=C(NS(=O)(=O)C=3N=CC(=CC=3)C(F)(F)F)C=CC=2)=C(O)C1)CC1=CC=CC=C1 SUJUHGSWHZTSEU-FYBSXPHGSA-N 0.000 claims 1
- 102000009027 Albumins Human genes 0.000 abstract description 286
- 230000003612 virological effect Effects 0.000 abstract description 44
- QRPZBKAMSFHVRW-UHFFFAOYSA-N 1-(4-benzoylpiperazin-1-yl)-2-[4-methoxy-7-(3-methyl-1,2,4-triazol-1-yl)-1h-pyrrolo[2,3-c]pyridin-3-yl]ethane-1,2-dione Chemical compound C1=2NC=C(C(=O)C(=O)N3CCN(CC3)C(=O)C=3C=CC=CC=3)C=2C(OC)=CN=C1N1C=NC(C)=N1 QRPZBKAMSFHVRW-UHFFFAOYSA-N 0.000 abstract description 13
- 229950008708 temsavir Drugs 0.000 abstract description 12
- 239000000562 conjugate Substances 0.000 description 605
- 125000003275 alpha amino acid group Chemical group 0.000 description 412
- 241000282414 Homo sapiens Species 0.000 description 154
- 239000011230 binding agent Substances 0.000 description 145
- 238000006467 substitution reaction Methods 0.000 description 121
- 125000000539 amino acid group Chemical group 0.000 description 114
- 230000035772 mutation Effects 0.000 description 89
- 235000001014 amino acid Nutrition 0.000 description 79
- 229940024606 amino acid Drugs 0.000 description 74
- 150000001413 amino acids Chemical class 0.000 description 72
- -1 trifluorophenyl ester Chemical class 0.000 description 69
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 65
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 65
- 230000021615 conjugation Effects 0.000 description 61
- 239000002202 Polyethylene glycol Substances 0.000 description 57
- 239000000539 dimer Substances 0.000 description 57
- 229920001223 polyethylene glycol Polymers 0.000 description 57
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 48
- 102000004196 processed proteins & peptides Human genes 0.000 description 43
- 210000004899 c-terminal region Anatomy 0.000 description 42
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 40
- 125000001424 substituent group Chemical group 0.000 description 37
- 108091006905 Human Serum Albumin Proteins 0.000 description 36
- 102000008100 Human Serum Albumin Human genes 0.000 description 36
- 108010076504 Protein Sorting Signals Proteins 0.000 description 36
- 235000018977 lysine Nutrition 0.000 description 35
- 125000002947 alkylene group Chemical group 0.000 description 30
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 28
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 25
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 23
- 102100026120 IgG receptor FcRn large subunit p51 Human genes 0.000 description 23
- 241000699666 Mus <mouse, genus> Species 0.000 description 23
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 21
- 125000002877 alkyl aryl group Chemical group 0.000 description 20
- 125000005842 heteroatom Chemical group 0.000 description 20
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 19
- 125000005843 halogen group Chemical group 0.000 description 19
- 101000961145 Homo sapiens Immunoglobulin heavy constant gamma 3 Proteins 0.000 description 18
- 102100039348 Immunoglobulin heavy constant gamma 3 Human genes 0.000 description 18
- 125000000217 alkyl group Chemical group 0.000 description 18
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 17
- 125000003342 alkenyl group Chemical group 0.000 description 17
- 125000000304 alkynyl group Chemical group 0.000 description 17
- 125000003277 amino group Chemical group 0.000 description 17
- 239000012634 fragment Substances 0.000 description 17
- 101000767160 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) Intracellular protein transport protein USO1 Proteins 0.000 description 16
- 125000000753 cycloalkyl group Chemical group 0.000 description 15
- 238000000746 purification Methods 0.000 description 15
- 125000000998 L-alanino group Chemical group [H]N([*])[C@](C([H])([H])[H])([H])C(=O)O[H] 0.000 description 14
- ODKSFYDXXFIFQN-BYPYZUCNSA-N L-arginine Chemical group OC(=O)[C@@H](N)CCCN=C(N)N ODKSFYDXXFIFQN-BYPYZUCNSA-N 0.000 description 14
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 14
- 150000002148 esters Chemical class 0.000 description 14
- 229940027941 immunoglobulin g Drugs 0.000 description 14
- 125000000773 L-serino group Chemical group [H]OC(=O)[C@@]([H])(N([H])*)C([H])([H])O[H] 0.000 description 13
- 238000006243 chemical reaction Methods 0.000 description 13
- 210000002865 immune cell Anatomy 0.000 description 13
- 229920001184 polypeptide Polymers 0.000 description 13
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 13
- 210000001519 tissue Anatomy 0.000 description 13
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 12
- 108090000623 proteins and genes Proteins 0.000 description 12
- 102000005962 receptors Human genes 0.000 description 12
- 108020003175 receptors Proteins 0.000 description 12
- 125000004122 cyclic group Chemical group 0.000 description 11
- 125000004185 ester group Chemical group 0.000 description 11
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 11
- 102000004169 proteins and genes Human genes 0.000 description 11
- 230000009467 reduction Effects 0.000 description 11
- 238000006722 reduction reaction Methods 0.000 description 11
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 10
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 10
- 125000002355 alkine group Chemical group 0.000 description 10
- 125000000852 azido group Chemical group *N=[N+]=[N-] 0.000 description 10
- 230000008901 benefit Effects 0.000 description 10
- 230000000120 cytopathologic effect Effects 0.000 description 10
- 230000007062 hydrolysis Effects 0.000 description 10
- 238000006460 hydrolysis reaction Methods 0.000 description 10
- 239000002245 particle Substances 0.000 description 10
- 235000018102 proteins Nutrition 0.000 description 10
- 208000030507 AIDS Diseases 0.000 description 9
- 230000000840 anti-viral effect Effects 0.000 description 9
- 230000003833 cell viability Effects 0.000 description 9
- 238000004108 freeze drying Methods 0.000 description 9
- 230000006870 function Effects 0.000 description 9
- 125000001188 haloalkyl group Chemical group 0.000 description 9
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 9
- 239000001990 protein-drug conjugate Substances 0.000 description 9
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 9
- 238000003786 synthesis reaction Methods 0.000 description 9
- 108010087819 Fc receptors Proteins 0.000 description 8
- 102000009109 Fc receptors Human genes 0.000 description 8
- 108010002350 Interleukin-2 Proteins 0.000 description 8
- 102000000588 Interleukin-2 Human genes 0.000 description 8
- 102100029204 Low affinity immunoglobulin gamma Fc region receptor II-a Human genes 0.000 description 8
- 241001529936 Murinae Species 0.000 description 8
- 102100036011 T-cell surface glycoprotein CD4 Human genes 0.000 description 8
- 238000007792 addition Methods 0.000 description 8
- 230000010056 antibody-dependent cellular cytotoxicity Effects 0.000 description 8
- 210000000227 basophil cell of anterior lobe of hypophysis Anatomy 0.000 description 8
- 210000004027 cell Anatomy 0.000 description 8
- 150000001945 cysteines Chemical class 0.000 description 8
- 238000012217 deletion Methods 0.000 description 8
- 230000037430 deletion Effects 0.000 description 8
- 125000000592 heterocycloalkyl group Chemical group 0.000 description 8
- 239000000543 intermediate Substances 0.000 description 8
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 8
- KXSKAZFMTGADIV-UHFFFAOYSA-N 2-[3-(2-hydroxyethoxy)propoxy]ethanol Chemical compound OCCOCCCOCCO KXSKAZFMTGADIV-UHFFFAOYSA-N 0.000 description 7
- 101000917839 Homo sapiens Low affinity immunoglobulin gamma Fc region receptor III-B Proteins 0.000 description 7
- 108060003951 Immunoglobulin Proteins 0.000 description 7
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 7
- 102100029193 Low affinity immunoglobulin gamma Fc region receptor III-A Human genes 0.000 description 7
- 102100029185 Low affinity immunoglobulin gamma Fc region receptor III-B Human genes 0.000 description 7
- 210000001744 T-lymphocyte Anatomy 0.000 description 7
- 241000700605 Viruses Species 0.000 description 7
- 239000003795 chemical substances by application Substances 0.000 description 7
- 125000000392 cycloalkenyl group Chemical group 0.000 description 7
- 125000004404 heteroalkyl group Chemical group 0.000 description 7
- 102000018358 immunoglobulin Human genes 0.000 description 7
- 238000003780 insertion Methods 0.000 description 7
- 230000037431 insertion Effects 0.000 description 7
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 7
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 6
- 108010021468 Fc gamma receptor IIA Proteins 0.000 description 6
- 101710099301 Low affinity immunoglobulin gamma Fc region receptor III-A Proteins 0.000 description 6
- JUJWROOIHBZHMG-UHFFFAOYSA-N Pyridine Chemical compound C1=CC=NC=C1 JUJWROOIHBZHMG-UHFFFAOYSA-N 0.000 description 6
- 102000007562 Serum Albumin Human genes 0.000 description 6
- 108010071390 Serum Albumin Proteins 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 239000011203 carbon fibre reinforced carbon Substances 0.000 description 6
- 125000002843 carboxylic acid group Chemical group 0.000 description 6
- 239000003153 chemical reaction reagent Substances 0.000 description 6
- 238000009826 distribution Methods 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 230000004927 fusion Effects 0.000 description 6
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 6
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 6
- 125000005439 maleimidyl group Chemical group C1(C=CC(N1*)=O)=O 0.000 description 6
- 108010068617 neonatal Fc receptor Proteins 0.000 description 6
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 6
- 229960001153 serine Drugs 0.000 description 6
- 230000001225 therapeutic effect Effects 0.000 description 6
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 5
- 108010021470 Fc gamma receptor IIC Proteins 0.000 description 5
- 108090000288 Glycoproteins Proteins 0.000 description 5
- 102000003886 Glycoproteins Human genes 0.000 description 5
- 101000961156 Homo sapiens Immunoglobulin heavy constant gamma 1 Proteins 0.000 description 5
- 102100039345 Immunoglobulin heavy constant gamma 1 Human genes 0.000 description 5
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 5
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 5
- 102100029206 Low affinity immunoglobulin gamma Fc region receptor II-c Human genes 0.000 description 5
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 5
- 101800001690 Transmembrane protein gp41 Proteins 0.000 description 5
- 239000000460 chlorine Substances 0.000 description 5
- 229910052801 chlorine Inorganic materials 0.000 description 5
- 230000001268 conjugating effect Effects 0.000 description 5
- 125000005724 cycloalkenylene group Chemical group 0.000 description 5
- 150000001991 dicarboxylic acids Chemical class 0.000 description 5
- 239000012636 effector Substances 0.000 description 5
- 230000012010 growth Effects 0.000 description 5
- 125000004366 heterocycloalkenyl group Chemical group 0.000 description 5
- 230000005764 inhibitory process Effects 0.000 description 5
- 125000000542 sulfonic acid group Chemical group 0.000 description 5
- QWENRTYMTSOGBR-UHFFFAOYSA-N 1H-1,2,3-Triazole Chemical compound C=1C=NNN=1 QWENRTYMTSOGBR-UHFFFAOYSA-N 0.000 description 4
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 4
- 241000283707 Capra Species 0.000 description 4
- 108010047041 Complementarity Determining Regions Proteins 0.000 description 4
- VMQMZMRVKUZKQL-UHFFFAOYSA-N Cu+ Chemical compound [Cu+] VMQMZMRVKUZKQL-UHFFFAOYSA-N 0.000 description 4
- 239000004471 Glycine Substances 0.000 description 4
- 208000037357 HIV infectious disease Diseases 0.000 description 4
- 101000961146 Homo sapiens Immunoglobulin heavy constant gamma 2 Proteins 0.000 description 4
- 108010073807 IgG Receptors Proteins 0.000 description 4
- 102000009490 IgG Receptors Human genes 0.000 description 4
- 102100039346 Immunoglobulin heavy constant gamma 2 Human genes 0.000 description 4
- 206010057249 Phagocytosis Diseases 0.000 description 4
- WYURNTSHIVDZCO-UHFFFAOYSA-N Tetrahydrofuran Chemical compound C1CCOC1 WYURNTSHIVDZCO-UHFFFAOYSA-N 0.000 description 4
- 230000003213 activating effect Effects 0.000 description 4
- 235000004279 alanine Nutrition 0.000 description 4
- 239000000427 antigen Substances 0.000 description 4
- 102000036639 antigens Human genes 0.000 description 4
- 108091007433 antigens Proteins 0.000 description 4
- IVRMZWNICZWHMI-UHFFFAOYSA-N azide group Chemical group [N-]=[N+]=[N-] IVRMZWNICZWHMI-UHFFFAOYSA-N 0.000 description 4
- 125000004432 carbon atom Chemical group C* 0.000 description 4
- 210000000170 cell membrane Anatomy 0.000 description 4
- 230000006378 damage Effects 0.000 description 4
- 239000003937 drug carrier Substances 0.000 description 4
- 230000002708 enhancing effect Effects 0.000 description 4
- 208000033519 human immunodeficiency virus infectious disease Diseases 0.000 description 4
- 229910052739 hydrogen Inorganic materials 0.000 description 4
- 230000003993 interaction Effects 0.000 description 4
- 210000004072 lung Anatomy 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000008782 phagocytosis Effects 0.000 description 4
- 229920000642 polymer Polymers 0.000 description 4
- FSYKKLYZXJSNPZ-UHFFFAOYSA-N sarcosine Chemical compound C[NH2+]CC([O-])=O FSYKKLYZXJSNPZ-UHFFFAOYSA-N 0.000 description 4
- 229920006395 saturated elastomer Polymers 0.000 description 4
- 150000003384 small molecules Chemical class 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 230000008685 targeting Effects 0.000 description 4
- 229960004441 tyrosine Drugs 0.000 description 4
- MRTPISKDZDHEQI-YFKPBYRVSA-N (2s)-2-(tert-butylamino)propanoic acid Chemical compound OC(=O)[C@H](C)NC(C)(C)C MRTPISKDZDHEQI-YFKPBYRVSA-N 0.000 description 3
- NPDBDJFLKKQMCM-SCSAIBSYSA-N (2s)-2-amino-3,3-dimethylbutanoic acid Chemical compound CC(C)(C)[C@H](N)C(O)=O NPDBDJFLKKQMCM-SCSAIBSYSA-N 0.000 description 3
- PECYZEOJVXMISF-UHFFFAOYSA-N 3-aminoalanine Chemical compound [NH3+]CC(N)C([O-])=O PECYZEOJVXMISF-UHFFFAOYSA-N 0.000 description 3
- 241000283690 Bos taurus Species 0.000 description 3
- 241000283073 Equus caballus Species 0.000 description 3
- 108010021472 Fc gamma receptor IIB Proteins 0.000 description 3
- 241000287828 Gallus gallus Species 0.000 description 3
- 101000961149 Homo sapiens Immunoglobulin heavy constant gamma 4 Proteins 0.000 description 3
- 102100039347 Immunoglobulin heavy constant gamma 4 Human genes 0.000 description 3
- 102100029205 Low affinity immunoglobulin gamma Fc region receptor II-b Human genes 0.000 description 3
- OFOBLEOULBTSOW-UHFFFAOYSA-N Malonic acid Chemical compound OC(=O)CC(O)=O OFOBLEOULBTSOW-UHFFFAOYSA-N 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 241001494479 Pecora Species 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 125000002252 acyl group Chemical group 0.000 description 3
- 230000010933 acylation Effects 0.000 description 3
- 238000005917 acylation reaction Methods 0.000 description 3
- 229940121357 antivirals Drugs 0.000 description 3
- XSCHRSMBECNVNS-UHFFFAOYSA-N benzopyrazine Natural products N1=CC=NC2=CC=CC=C21 XSCHRSMBECNVNS-UHFFFAOYSA-N 0.000 description 3
- 125000001797 benzyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])* 0.000 description 3
- 210000004369 blood Anatomy 0.000 description 3
- 239000008280 blood Substances 0.000 description 3
- 229910052794 bromium Inorganic materials 0.000 description 3
- 125000004976 cyclobutylene group Chemical group 0.000 description 3
- 125000004980 cyclopropylene group Chemical group 0.000 description 3
- 230000000447 dimerizing effect Effects 0.000 description 3
- 229910052731 fluorine Inorganic materials 0.000 description 3
- 102000037865 fusion proteins Human genes 0.000 description 3
- 108020001507 fusion proteins Proteins 0.000 description 3
- 210000002443 helper t lymphocyte Anatomy 0.000 description 3
- 229960002885 histidine Drugs 0.000 description 3
- 239000001257 hydrogen Substances 0.000 description 3
- 208000015181 infectious disease Diseases 0.000 description 3
- 125000002950 monocyclic group Chemical group 0.000 description 3
- 210000000440 neutrophil Anatomy 0.000 description 3
- 150000002923 oximes Chemical class 0.000 description 3
- 238000001273 protein sequence alignment Methods 0.000 description 3
- UMJSCPRVCHMLSP-UHFFFAOYSA-N pyridine Natural products COC1=CC=CN=C1 UMJSCPRVCHMLSP-UHFFFAOYSA-N 0.000 description 3
- 238000004064 recycling Methods 0.000 description 3
- 238000006268 reductive amination reaction Methods 0.000 description 3
- 238000010839 reverse transcription Methods 0.000 description 3
- 125000000547 substituted alkyl group Chemical group 0.000 description 3
- 125000002023 trifluoromethyl group Chemical group FC(F)(F)* 0.000 description 3
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 3
- VWTFNYVAFGYEKI-QMMMGPOBSA-N (2s)-2-azaniumyl-3-(3,4-dimethoxyphenyl)propanoate Chemical compound COC1=CC=C(C[C@H](N)C(O)=O)C=C1OC VWTFNYVAFGYEKI-QMMMGPOBSA-N 0.000 description 2
- AXDLCFOOGCNDST-VIFPVBQESA-N (2s)-3-(4-hydroxyphenyl)-2-(methylamino)propanoic acid Chemical compound CN[C@H](C(O)=O)CC1=CC=C(O)C=C1 AXDLCFOOGCNDST-VIFPVBQESA-N 0.000 description 2
- SMWADGDVGCZIGK-ZJUUUORDSA-N (2s,5r)-5-phenylpyrrolidin-1-ium-2-carboxylate Chemical compound N1[C@H](C(=O)O)CC[C@@H]1C1=CC=CC=C1 SMWADGDVGCZIGK-ZJUUUORDSA-N 0.000 description 2
- 125000000008 (C1-C10) alkyl group Chemical group 0.000 description 2
- 125000006833 (C1-C5) alkylene group Chemical group 0.000 description 2
- WOXWUZCRWJWTRT-UHFFFAOYSA-N 1-amino-1-cyclohexanecarboxylic acid Chemical compound OC(=O)C1(N)CCCCC1 WOXWUZCRWJWTRT-UHFFFAOYSA-N 0.000 description 2
- FCEHBMOGCRZNNI-UHFFFAOYSA-N 1-benzothiophene Chemical compound C1=CC=C2SC=CC2=C1 FCEHBMOGCRZNNI-UHFFFAOYSA-N 0.000 description 2
- YBYIRNPNPLQARY-UHFFFAOYSA-N 1H-indene Chemical compound C1=CC=C2CC=CC2=C1 YBYIRNPNPLQARY-UHFFFAOYSA-N 0.000 description 2
- OGNSCSPNOLGXSM-UHFFFAOYSA-N 2,4-diaminobutyric acid Chemical compound NCCC(N)C(O)=O OGNSCSPNOLGXSM-UHFFFAOYSA-N 0.000 description 2
- DICMQVOBSKLBBN-UHFFFAOYSA-N 2-(cyclodecylamino)acetic acid Chemical compound OC(=O)CNC1CCCCCCCCC1 DICMQVOBSKLBBN-UHFFFAOYSA-N 0.000 description 2
- NPLBBQAAYSJEMO-UHFFFAOYSA-N 2-(cycloheptylazaniumyl)acetate Chemical compound OC(=O)CNC1CCCCCC1 NPLBBQAAYSJEMO-UHFFFAOYSA-N 0.000 description 2
- FUOOLUPWFVMBKG-UHFFFAOYSA-N 2-Aminoisobutyric acid Chemical compound CC(C)(N)C(O)=O FUOOLUPWFVMBKG-UHFFFAOYSA-N 0.000 description 2
- YDBPFLZECVWPSH-UHFFFAOYSA-N 2-[3-(diaminomethylideneamino)propylamino]acetic acid Chemical compound NC(=N)NCCCNCC(O)=O YDBPFLZECVWPSH-UHFFFAOYSA-N 0.000 description 2
- SNDPXSYFESPGGJ-UHFFFAOYSA-N 2-aminopentanoic acid Chemical compound CCCC(N)C(O)=O SNDPXSYFESPGGJ-UHFFFAOYSA-N 0.000 description 2
- JWYOAMOZLZXDER-UHFFFAOYSA-N 2-azaniumylcyclopentane-1-carboxylate Chemical compound NC1CCCC1C(O)=O JWYOAMOZLZXDER-UHFFFAOYSA-N 0.000 description 2
- GAUBNQMYYJLWNF-UHFFFAOYSA-N 3-(Carboxymethylamino)propanoic acid Chemical compound OC(=O)CCNCC(O)=O GAUBNQMYYJLWNF-UHFFFAOYSA-N 0.000 description 2
- YYROPELSRYBVMQ-UHFFFAOYSA-N 4-toluenesulfonyl chloride Chemical compound CC1=CC=C(S(Cl)(=O)=O)C=C1 YYROPELSRYBVMQ-UHFFFAOYSA-N 0.000 description 2
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- UJOBWOGCFQCDNV-UHFFFAOYSA-N 9H-carbazole Chemical compound C1=CC=C2C3=CC=CC=C3NC2=C1 UJOBWOGCFQCDNV-UHFFFAOYSA-N 0.000 description 2
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- NOWKCMXCCJGMRR-UHFFFAOYSA-N Aziridine Chemical compound C1CN1 NOWKCMXCCJGMRR-UHFFFAOYSA-N 0.000 description 2
- BPYKTIZUTYGOLE-IFADSCNNSA-N Bilirubin Chemical compound N1C(=O)C(C)=C(C=C)\C1=C\C1=C(C)C(CCC(O)=O)=C(CC2=C(C(C)=C(\C=C/3C(=C(C=C)C(=O)N\3)C)N2)CCC(O)=O)N1 BPYKTIZUTYGOLE-IFADSCNNSA-N 0.000 description 2
- BTBUEUYNUDRHOZ-UHFFFAOYSA-N Borate Chemical compound [O-]B([O-])[O-] BTBUEUYNUDRHOZ-UHFFFAOYSA-N 0.000 description 2
- 108010041397 CD4 Antigens Proteins 0.000 description 2
- BVKZGUZCCUSVTD-UHFFFAOYSA-L Carbonate Chemical compound [O-]C([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-L 0.000 description 2
- 241000700199 Cavia porcellus Species 0.000 description 2
- 241000699800 Cricetinae Species 0.000 description 2
- 108010069514 Cyclic Peptides Proteins 0.000 description 2
- 102000001189 Cyclic Peptides Human genes 0.000 description 2
- 101710121417 Envelope glycoprotein Proteins 0.000 description 2
- 101150050927 Fcgrt gene Proteins 0.000 description 2
- 241000282326 Felis catus Species 0.000 description 2
- 239000004277 Ferrous carbonate Substances 0.000 description 2
- UGJMXCAKCUNAIE-UHFFFAOYSA-N Gabapentin Chemical compound OC(=O)CC1(CN)CCCCC1 UGJMXCAKCUNAIE-UHFFFAOYSA-N 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 101000878602 Homo sapiens Immunoglobulin alpha Fc receptor Proteins 0.000 description 2
- 101000917826 Homo sapiens Low affinity immunoglobulin gamma Fc region receptor II-a Proteins 0.000 description 2
- 101000917824 Homo sapiens Low affinity immunoglobulin gamma Fc region receptor II-b Proteins 0.000 description 2
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 2
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 2
- 108010073816 IgE Receptors Proteins 0.000 description 2
- 102000009438 IgE Receptors Human genes 0.000 description 2
- 102100038005 Immunoglobulin alpha Fc receptor Human genes 0.000 description 2
- SIKJAQJRHWYJAI-UHFFFAOYSA-N Indole Chemical compound C1=CC=C2NC=CC2=C1 SIKJAQJRHWYJAI-UHFFFAOYSA-N 0.000 description 2
- 102100034349 Integrase Human genes 0.000 description 2
- AHLPHDHHMVZTML-BYPYZUCNSA-N L-Ornithine Chemical compound NCCC[C@H](N)C(O)=O AHLPHDHHMVZTML-BYPYZUCNSA-N 0.000 description 2
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 2
- QWCKQJZIFLGMSD-VKHMYHEASA-N L-alpha-aminobutyric acid Chemical compound CC[C@H](N)C(O)=O QWCKQJZIFLGMSD-VKHMYHEASA-N 0.000 description 2
- ZGUNAGUHMKGQNY-ZETCQYMHSA-N L-alpha-phenylglycine zwitterion Chemical compound OC(=O)[C@@H](N)C1=CC=CC=C1 ZGUNAGUHMKGQNY-ZETCQYMHSA-N 0.000 description 2
- 150000008575 L-amino acids Chemical group 0.000 description 2
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 2
- MLTRLIITQPXHBJ-BQBZGAKWSA-N Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O MLTRLIITQPXHBJ-BQBZGAKWSA-N 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- 101100288142 Mus musculus Klkb1 gene Proteins 0.000 description 2
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 2
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 2
- KSPIYJQBLVDRRI-UHFFFAOYSA-N N-methylisoleucine Chemical compound CCC(C)C(NC)C(O)=O KSPIYJQBLVDRRI-UHFFFAOYSA-N 0.000 description 2
- 108091007491 NSP3 Papain-like protease domains Proteins 0.000 description 2
- RHGKLRLOHDJJDR-UHFFFAOYSA-N Ndelta-carbamoyl-DL-ornithine Natural products OC(=O)C(N)CCCNC(N)=O RHGKLRLOHDJJDR-UHFFFAOYSA-N 0.000 description 2
- 208000001388 Opportunistic Infections Diseases 0.000 description 2
- AHLPHDHHMVZTML-UHFFFAOYSA-N Orn-delta-NH2 Natural products NCCCC(N)C(O)=O AHLPHDHHMVZTML-UHFFFAOYSA-N 0.000 description 2
- UTJLXEIPEHZYQJ-UHFFFAOYSA-N Ornithine Natural products OC(=O)C(C)CCCN UTJLXEIPEHZYQJ-UHFFFAOYSA-N 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- 108010058846 Ovalbumin Proteins 0.000 description 2
- NQRYJNQNLNOLGT-UHFFFAOYSA-N Piperidine Chemical compound C1CCNCC1 NQRYJNQNLNOLGT-UHFFFAOYSA-N 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Chemical compound OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- SMWDFEZZVXVKRB-UHFFFAOYSA-N Quinoline Chemical compound N1=CC=CC2=CC=CC=C21 SMWDFEZZVXVKRB-UHFFFAOYSA-N 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- 108010077895 Sarcosine Proteins 0.000 description 2
- YTPLMLYBLZKORZ-UHFFFAOYSA-N Thiophene Chemical compound C=1C=CSC=1 YTPLMLYBLZKORZ-UHFFFAOYSA-N 0.000 description 2
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 2
- 229940118555 Viral entry inhibitor Drugs 0.000 description 2
- DZBUGLKDJFMEHC-UHFFFAOYSA-N acridine Chemical compound C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 description 2
- 239000004480 active ingredient Substances 0.000 description 2
- ORILYTVJVMAKLC-UHFFFAOYSA-N adamantane Chemical compound C1C(C2)CC3CC1CC2C3 ORILYTVJVMAKLC-UHFFFAOYSA-N 0.000 description 2
- 150000001299 aldehydes Chemical class 0.000 description 2
- DLAMVQGYEVKIRE-UHFFFAOYSA-N alpha-(methylamino)isobutyric acid Chemical compound CNC(C)(C)C(O)=O DLAMVQGYEVKIRE-UHFFFAOYSA-N 0.000 description 2
- 125000003368 amide group Chemical group 0.000 description 2
- RWZYAGGXGHYGMB-UHFFFAOYSA-N anthranilic acid Chemical compound NC1=CC=CC=C1C(O)=O RWZYAGGXGHYGMB-UHFFFAOYSA-N 0.000 description 2
- 238000011225 antiretroviral therapy Methods 0.000 description 2
- 235000009582 asparagine Nutrition 0.000 description 2
- 229960001230 asparagine Drugs 0.000 description 2
- IADUEWIQBXOCDZ-UHFFFAOYSA-N azetidine-2-carboxylic acid Chemical compound OC(=O)C1CCN1 IADUEWIQBXOCDZ-UHFFFAOYSA-N 0.000 description 2
- 239000002585 base Substances 0.000 description 2
- 125000003785 benzimidazolyl group Chemical group N1=C(NC2=C1C=CC=C2)* 0.000 description 2
- IOJUPLGTWVMSFF-UHFFFAOYSA-N benzothiazole Chemical compound C1=CC=C2SC=NC2=C1 IOJUPLGTWVMSFF-UHFFFAOYSA-N 0.000 description 2
- 125000001164 benzothiazolyl group Chemical group S1C(=NC2=C1C=CC=C2)* 0.000 description 2
- 125000002619 bicyclic group Chemical group 0.000 description 2
- 229940098773 bovine serum albumin Drugs 0.000 description 2
- 125000003917 carbamoyl group Chemical group [H]N([H])C(*)=O 0.000 description 2
- 150000001720 carbohydrates Chemical class 0.000 description 2
- 235000014633 carbohydrates Nutrition 0.000 description 2
- 229960002173 citrulline Drugs 0.000 description 2
- 235000013477 citrulline Nutrition 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 238000006352 cycloaddition reaction Methods 0.000 description 2
- XVOYSCVBGLVSOL-UHFFFAOYSA-N cysteic acid Chemical compound OC(=O)C(N)CS(O)(=O)=O XVOYSCVBGLVSOL-UHFFFAOYSA-N 0.000 description 2
- NNBZCPXTIHJBJL-UHFFFAOYSA-N decalin Chemical compound C1CCCC2CCCCC21 NNBZCPXTIHJBJL-UHFFFAOYSA-N 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- TXCDCPKCNAJMEE-UHFFFAOYSA-N dibenzofuran Chemical compound C1=CC=C2C3=CC=CC=C3OC2=C1 TXCDCPKCNAJMEE-UHFFFAOYSA-N 0.000 description 2
- 238000009792 diffusion process Methods 0.000 description 2
- 239000003085 diluting agent Substances 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- ZUOUZKKEUPVFJK-UHFFFAOYSA-N diphenyl Chemical compound C1=CC=CC=C1C1=CC=CC=C1 ZUOUZKKEUPVFJK-UHFFFAOYSA-N 0.000 description 2
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 238000009472 formulation Methods 0.000 description 2
- BTCSSZJGUNDROE-UHFFFAOYSA-N gamma-aminobutyric acid Chemical compound NCCCC(O)=O BTCSSZJGUNDROE-UHFFFAOYSA-N 0.000 description 2
- 125000003147 glycosyl group Chemical group 0.000 description 2
- 229960002591 hydroxyproline Drugs 0.000 description 2
- 125000002883 imidazolyl group Chemical group 0.000 description 2
- 230000001900 immune effect Effects 0.000 description 2
- 210000000987 immune system Anatomy 0.000 description 2
- 125000001041 indolyl group Chemical group 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 108091008042 inhibitory receptors Proteins 0.000 description 2
- IQPQWNKOIGAROB-UHFFFAOYSA-N isocyanate group Chemical group [N-]=C=O IQPQWNKOIGAROB-UHFFFAOYSA-N 0.000 description 2
- AWJUIBRHMBBTKR-UHFFFAOYSA-N isoquinoline Chemical compound C1=NC=CC2=CC=CC=C21 AWJUIBRHMBBTKR-UHFFFAOYSA-N 0.000 description 2
- 125000000842 isoxazolyl group Chemical group 0.000 description 2
- 150000002576 ketones Chemical class 0.000 description 2
- 230000002147 killing effect Effects 0.000 description 2
- 210000004698 lymphocyte Anatomy 0.000 description 2
- 210000002540 macrophage Anatomy 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 239000012038 nucleophile Substances 0.000 description 2
- 239000002777 nucleoside Substances 0.000 description 2
- 150000003833 nucleoside derivatives Chemical class 0.000 description 2
- 229960003104 ornithine Drugs 0.000 description 2
- 229940092253 ovalbumin Drugs 0.000 description 2
- 125000004430 oxygen atom Chemical group O* 0.000 description 2
- 230000001717 pathogenic effect Effects 0.000 description 2
- YNPNZTXNASCQKK-UHFFFAOYSA-N phenanthrene Chemical compound C1=CC=C2C3=CC=CC=C3C=CC2=C1 YNPNZTXNASCQKK-UHFFFAOYSA-N 0.000 description 2
- HXEACLLIILLPRG-UHFFFAOYSA-N pipecolic acid Chemical compound OC(=O)C1CCCCN1 HXEACLLIILLPRG-UHFFFAOYSA-N 0.000 description 2
- JSSXHAMIXJGYCS-UHFFFAOYSA-N piperazin-4-ium-2-carboxylate Chemical compound OC(=O)C1CNCCN1 JSSXHAMIXJGYCS-UHFFFAOYSA-N 0.000 description 2
- 102000054765 polymorphisms of proteins Human genes 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 230000035755 proliferation Effects 0.000 description 2
- 125000004076 pyridyl group Chemical group 0.000 description 2
- 125000000714 pyrimidinyl group Chemical group 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 108020001568 subdomains Proteins 0.000 description 2
- 229920001059 synthetic polymer Polymers 0.000 description 2
- YLQBMQCUIZJEEH-UHFFFAOYSA-N tetrahydrofuran Natural products C=1C=COC=1 YLQBMQCUIZJEEH-UHFFFAOYSA-N 0.000 description 2
- RWRDLPDLKQPQOW-UHFFFAOYSA-N tetrahydropyrrole Natural products C1CCNC1 RWRDLPDLKQPQOW-UHFFFAOYSA-N 0.000 description 2
- 125000000335 thiazolyl group Chemical group 0.000 description 2
- 150000003568 thioethers Chemical class 0.000 description 2
- UMGDCJDMYOKAJW-UHFFFAOYSA-N thiourea Chemical compound NC(N)=S UMGDCJDMYOKAJW-UHFFFAOYSA-N 0.000 description 2
- 230000001988 toxicity Effects 0.000 description 2
- 231100000419 toxicity Toxicity 0.000 description 2
- 125000005559 triazolylene group Chemical group 0.000 description 2
- 125000004360 trifluorophenyl group Chemical group 0.000 description 2
- 230000007501 viral attachment Effects 0.000 description 2
- 230000029812 viral genome replication Effects 0.000 description 2
- 229960004854 viral vaccine Drugs 0.000 description 2
- JPZXHKDZASGCLU-LBPRGKRZSA-N β-(2-naphthyl)-alanine Chemical compound C1=CC=CC2=CC(C[C@H](N)C(O)=O)=CC=C21 JPZXHKDZASGCLU-LBPRGKRZSA-N 0.000 description 2
- RSPOGBIHKNKRFJ-MSZQBOFLSA-N (2S)-2-amino-2,3-dimethylpentanoic acid Chemical compound C[C@@](C(=O)O)(C(CC)C)N RSPOGBIHKNKRFJ-MSZQBOFLSA-N 0.000 description 1
- CWLQUGTUXBXTLF-RXMQYKEDSA-N (2r)-1-methylpyrrolidine-2-carboxylic acid Chemical compound CN1CCC[C@@H]1C(O)=O CWLQUGTUXBXTLF-RXMQYKEDSA-N 0.000 description 1
- YAXAFCHJCYILRU-RXMQYKEDSA-N (2r)-2-(methylamino)-4-methylsulfanylbutanoic acid Chemical compound CN[C@@H](C(O)=O)CCSC YAXAFCHJCYILRU-RXMQYKEDSA-N 0.000 description 1
- XLBVNMSMFQMKEY-SCSAIBSYSA-N (2r)-2-(methylamino)pentanedioic acid Chemical compound CN[C@@H](C(O)=O)CCC(O)=O XLBVNMSMFQMKEY-SCSAIBSYSA-N 0.000 description 1
- GDFAOVXKHJXLEI-GSVOUGTGSA-N (2r)-2-(methylamino)propanoic acid Chemical compound CN[C@H](C)C(O)=O GDFAOVXKHJXLEI-GSVOUGTGSA-N 0.000 description 1
- SCIFESDRCALIIM-SECBINFHSA-N (2r)-2-(methylazaniumyl)-3-phenylpropanoate Chemical compound CN[C@@H](C(O)=O)CC1=CC=CC=C1 SCIFESDRCALIIM-SECBINFHSA-N 0.000 description 1
- NHTGHBARYWONDQ-SNVBAGLBSA-N (2r)-2-amino-3-(4-hydroxyphenyl)-2-methylpropanoic acid Chemical compound OC(=O)[C@@](N)(C)CC1=CC=C(O)C=C1 NHTGHBARYWONDQ-SNVBAGLBSA-N 0.000 description 1
- HYOWVAAEQCNGLE-SNVBAGLBSA-N (2r)-2-azaniumyl-2-methyl-3-phenylpropanoate Chemical compound [O-]C(=O)[C@@]([NH3+])(C)CC1=CC=CC=C1 HYOWVAAEQCNGLE-SNVBAGLBSA-N 0.000 description 1
- ZYVMPHJZWXIFDQ-ZCFIWIBFSA-N (2r)-2-azaniumyl-2-methyl-4-methylsulfanylbutanoate Chemical compound CSCC[C@@](C)(N)C(O)=O ZYVMPHJZWXIFDQ-ZCFIWIBFSA-N 0.000 description 1
- FDKWRPBBCBCIGA-REOHCLBHSA-N (2r)-2-azaniumyl-3-$l^{1}-selanylpropanoate Chemical compound [Se]C[C@H](N)C(O)=O FDKWRPBBCBCIGA-REOHCLBHSA-N 0.000 description 1
- WNNNWFKQCKFSDK-SCSAIBSYSA-N (2r)-2-azaniumylpent-4-enoate Chemical compound [O-]C(=O)[C@H]([NH3+])CC=C WNNNWFKQCKFSDK-SCSAIBSYSA-N 0.000 description 1
- LWHHAVWYGIBIEU-ZCFIWIBFSA-N (2r)-2-methylpyrrolidin-1-ium-2-carboxylate Chemical compound OC(=O)[C@@]1(C)CCCN1 LWHHAVWYGIBIEU-ZCFIWIBFSA-N 0.000 description 1
- CYZKJBZEIFWZSR-ZCFIWIBFSA-N (2r)-3-(1h-imidazol-5-yl)-2-(methylamino)propanoic acid Chemical compound CN[C@@H](C(O)=O)CC1=CN=CN1 CYZKJBZEIFWZSR-ZCFIWIBFSA-N 0.000 description 1
- CZCIKBSVHDNIDH-LLVKDONJSA-N (2r)-3-(1h-indol-3-yl)-2-(methylamino)propanoic acid Chemical compound C1=CC=C2C(C[C@@H](NC)C(O)=O)=CNC2=C1 CZCIKBSVHDNIDH-LLVKDONJSA-N 0.000 description 1
- BJBUEDPLEOHJGE-SRBOSORUSA-N (2r)-3-hydroxypyrrolidine-2-carboxylic acid Chemical compound OC1CCN[C@H]1C(O)=O BJBUEDPLEOHJGE-SRBOSORUSA-N 0.000 description 1
- AKCRVYNORCOYQT-RXMQYKEDSA-N (2r)-3-methyl-2-(methylazaniumyl)butanoate Chemical compound C[NH2+][C@H](C(C)C)C([O-])=O AKCRVYNORCOYQT-RXMQYKEDSA-N 0.000 description 1
- LNSMPSPTFDIWRQ-GSVOUGTGSA-N (2r)-4-amino-2-(methylamino)-4-oxobutanoic acid Chemical compound CN[C@@H](C(O)=O)CC(N)=O LNSMPSPTFDIWRQ-GSVOUGTGSA-N 0.000 description 1
- NTWVQPHTOUKMDI-RXMQYKEDSA-N (2r)-5-(diaminomethylideneamino)-2-(methylamino)pentanoic acid Chemical compound CN[C@@H](C(O)=O)CCCNC(N)=N NTWVQPHTOUKMDI-RXMQYKEDSA-N 0.000 description 1
- KSZFSNZOGAXEGH-SCSAIBSYSA-N (2r)-5-amino-2-(methylamino)-5-oxopentanoic acid Chemical compound CN[C@@H](C(O)=O)CCC(N)=O KSZFSNZOGAXEGH-SCSAIBSYSA-N 0.000 description 1
- OZRWQPFBXDVLAH-RXMQYKEDSA-N (2r)-5-amino-2-(methylamino)pentanoic acid Chemical compound CN[C@@H](C(O)=O)CCCN OZRWQPFBXDVLAH-RXMQYKEDSA-N 0.000 description 1
- KSPIYJQBLVDRRI-NTSWFWBYSA-N (2r,3s)-3-methyl-2-(methylazaniumyl)pentanoate Chemical compound CC[C@H](C)[C@@H](NC)C(O)=O KSPIYJQBLVDRRI-NTSWFWBYSA-N 0.000 description 1
- KRHNXNZBLHHEIU-CRCLSJGQSA-N (2r,4s)-4-hydroxypiperidin-1-ium-2-carboxylate Chemical compound O[C@H]1CCN[C@@H](C(O)=O)C1 KRHNXNZBLHHEIU-CRCLSJGQSA-N 0.000 description 1
- FQRURPFZTFUXEZ-MRVPVSSYSA-N (2s)-2,3,3,3-tetrafluoro-2-(n-fluoroanilino)propanoic acid Chemical compound OC(=O)[C@](F)(C(F)(F)F)N(F)C1=CC=CC=C1 FQRURPFZTFUXEZ-MRVPVSSYSA-N 0.000 description 1
- NMDDZEVVQDPECF-LURJTMIESA-N (2s)-2,7-diaminoheptanoic acid Chemical compound NCCCCC[C@H](N)C(O)=O NMDDZEVVQDPECF-LURJTMIESA-N 0.000 description 1
- YPJJGMCMOHDOFZ-ZETCQYMHSA-N (2s)-2-(1-benzothiophen-3-ylamino)propanoic acid Chemical compound C1=CC=C2C(N[C@@H](C)C(O)=O)=CSC2=C1 YPJJGMCMOHDOFZ-ZETCQYMHSA-N 0.000 description 1
- LDUWTIUXPVCEQF-LURJTMIESA-N (2s)-2-(cyclopentylamino)propanoic acid Chemical compound OC(=O)[C@H](C)NC1CCCC1 LDUWTIUXPVCEQF-LURJTMIESA-N 0.000 description 1
- NVXKJPGRZSDYPK-JTQLQIEISA-N (2s)-2-(methylamino)-4-phenylbutanoic acid Chemical compound CN[C@H](C(O)=O)CCC1=CC=CC=C1 NVXKJPGRZSDYPK-JTQLQIEISA-N 0.000 description 1
- HOKKHZGPKSLGJE-VKHMYHEASA-N (2s)-2-(methylamino)butanedioic acid Chemical compound CN[C@H](C(O)=O)CC(O)=O HOKKHZGPKSLGJE-VKHMYHEASA-N 0.000 description 1
- FPDYKABXINADKS-LURJTMIESA-N (2s)-2-(methylazaniumyl)hexanoate Chemical compound CCCC[C@H](NC)C(O)=O FPDYKABXINADKS-LURJTMIESA-N 0.000 description 1
- HCPKYUNZBPVCHC-YFKPBYRVSA-N (2s)-2-(methylazaniumyl)pentanoate Chemical compound CCC[C@H](NC)C(O)=O HCPKYUNZBPVCHC-YFKPBYRVSA-N 0.000 description 1
- WTDHSXGBDZBWAW-QMMMGPOBSA-N (2s)-2-[cyclohexyl(methyl)azaniumyl]propanoate Chemical compound OC(=O)[C@H](C)N(C)C1CCCCC1 WTDHSXGBDZBWAW-QMMMGPOBSA-N 0.000 description 1
- IUYZJPXOXGRNNE-ZETCQYMHSA-N (2s)-2-[cyclopentyl(methyl)amino]propanoic acid Chemical compound OC(=O)[C@H](C)N(C)C1CCCC1 IUYZJPXOXGRNNE-ZETCQYMHSA-N 0.000 description 1
- PECGVEGMRUZOML-AWEZNQCLSA-N (2s)-2-amino-3,3-diphenylpropanoic acid Chemical compound C=1C=CC=CC=1C([C@H](N)C(O)=O)C1=CC=CC=C1 PECGVEGMRUZOML-AWEZNQCLSA-N 0.000 description 1
- ZTTWHZHBPDYSQB-LBPRGKRZSA-N (2s)-2-amino-3-(1h-indol-3-yl)-2-methylpropanoic acid Chemical compound C1=CC=C2C(C[C@@](N)(C)C(O)=O)=CNC2=C1 ZTTWHZHBPDYSQB-LBPRGKRZSA-N 0.000 description 1
- DQLHSFUMICQIMB-VIFPVBQESA-N (2s)-2-amino-3-(4-methylphenyl)propanoic acid Chemical compound CC1=CC=C(C[C@H](N)C(O)=O)C=C1 DQLHSFUMICQIMB-VIFPVBQESA-N 0.000 description 1
- RXZQHZDTHUUJQJ-LURJTMIESA-N (2s)-2-amino-3-(furan-2-yl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=CO1 RXZQHZDTHUUJQJ-LURJTMIESA-N 0.000 description 1
- DFZVZEMNPGABKO-ZETCQYMHSA-N (2s)-2-amino-3-pyridin-3-ylpropanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=CN=C1 DFZVZEMNPGABKO-ZETCQYMHSA-N 0.000 description 1
- JADFNIWVWZWOFA-JTQLQIEISA-N (2s)-2-amino-4-(4-methoxyphenyl)butanoic acid Chemical compound COC1=CC=C(CC[C@H](N)C(O)=O)C=C1 JADFNIWVWZWOFA-JTQLQIEISA-N 0.000 description 1
- WNNNWFKQCKFSDK-BYPYZUCNSA-N (2s)-2-aminopent-4-enoic acid Chemical compound OC(=O)[C@@H](N)CC=C WNNNWFKQCKFSDK-BYPYZUCNSA-N 0.000 description 1
- GPYTYOMSQHBYTK-LURJTMIESA-N (2s)-2-azaniumyl-2,3-dimethylbutanoate Chemical compound CC(C)[C@](C)([NH3+])C([O-])=O GPYTYOMSQHBYTK-LURJTMIESA-N 0.000 description 1
- KFHRMMHGGBCRIV-BYPYZUCNSA-N (2s)-2-azaniumyl-4-methoxybutanoate Chemical compound COCC[C@H](N)C(O)=O KFHRMMHGGBCRIV-BYPYZUCNSA-N 0.000 description 1
- FMUMEWVNYMUECA-LURJTMIESA-N (2s)-2-azaniumyl-5-methylhexanoate Chemical compound CC(C)CC[C@H](N)C(O)=O FMUMEWVNYMUECA-LURJTMIESA-N 0.000 description 1
- LWHHAVWYGIBIEU-LURJTMIESA-N (2s)-2-methylpyrrolidin-1-ium-2-carboxylate Chemical compound [O-]C(=O)[C@]1(C)CCC[NH2+]1 LWHHAVWYGIBIEU-LURJTMIESA-N 0.000 description 1
- KWWFNGCKGYUCLC-RXMQYKEDSA-N (2s)-3,3-dimethyl-2-(methylamino)butanoic acid Chemical compound CN[C@H](C(O)=O)C(C)(C)C KWWFNGCKGYUCLC-RXMQYKEDSA-N 0.000 description 1
- LNSMPSPTFDIWRQ-VKHMYHEASA-N (2s)-4-amino-2-(methylamino)-4-oxobutanoic acid Chemical compound CN[C@H](C(O)=O)CC(N)=O LNSMPSPTFDIWRQ-VKHMYHEASA-N 0.000 description 1
- XJODGRWDFZVTKW-LURJTMIESA-N (2s)-4-methyl-2-(methylamino)pentanoic acid Chemical compound CN[C@H](C(O)=O)CC(C)C XJODGRWDFZVTKW-LURJTMIESA-N 0.000 description 1
- NWGZOALPWZDXNG-LURJTMIESA-N (2s)-5-(diaminomethylideneamino)-2-(dimethylamino)pentanoic acid Chemical compound CN(C)[C@H](C(O)=O)CCCNC(N)=N NWGZOALPWZDXNG-LURJTMIESA-N 0.000 description 1
- KSZFSNZOGAXEGH-BYPYZUCNSA-N (2s)-5-amino-2-(methylamino)-5-oxopentanoic acid Chemical compound CN[C@H](C(O)=O)CCC(N)=O KSZFSNZOGAXEGH-BYPYZUCNSA-N 0.000 description 1
- OZRWQPFBXDVLAH-YFKPBYRVSA-N (2s)-5-amino-2-(methylamino)pentanoic acid Chemical compound CN[C@H](C(O)=O)CCCN OZRWQPFBXDVLAH-YFKPBYRVSA-N 0.000 description 1
- LJRDOKAZOAKLDU-UDXJMMFXSA-N (2s,3s,4r,5r,6r)-5-amino-2-(aminomethyl)-6-[(2r,3s,4r,5s)-5-[(1r,2r,3s,5r,6s)-3,5-diamino-2-[(2s,3r,4r,5s,6r)-3-amino-4,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-6-hydroxycyclohexyl]oxy-4-hydroxy-2-(hydroxymethyl)oxolan-3-yl]oxyoxane-3,4-diol;sulfuric ac Chemical compound OS(O)(=O)=O.N[C@@H]1[C@@H](O)[C@H](O)[C@H](CN)O[C@@H]1O[C@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](N)C[C@@H](N)[C@@H]2O)O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)N)O[C@@H]1CO LJRDOKAZOAKLDU-UDXJMMFXSA-N 0.000 description 1
- KRHNXNZBLHHEIU-UHNVWZDZSA-N (2s,4r)-4-hydroxypiperidin-1-ium-2-carboxylate Chemical compound O[C@@H]1CCN[C@H](C(O)=O)C1 KRHNXNZBLHHEIU-UHNVWZDZSA-N 0.000 description 1
- RWSHAZYNUOFZMI-ZJUUUORDSA-N (2s,4r)-4-phenoxypyrrolidine-2-carboxylic acid Chemical compound C1N[C@H](C(=O)O)C[C@H]1OC1=CC=CC=C1 RWSHAZYNUOFZMI-ZJUUUORDSA-N 0.000 description 1
- GTODXOQKGULQFP-UWVGGRQHSA-N (2s,4s)-4-azaniumyl-1-benzoylpyrrolidine-2-carboxylate Chemical compound C1[C@@H](N)C[C@@H](C(O)=O)N1C(=O)C1=CC=CC=C1 GTODXOQKGULQFP-UWVGGRQHSA-N 0.000 description 1
- OYNANFOWNSGDJL-IMJSIDKUSA-N (2s,4s)-4-sulfanylpyrrolidin-1-ium-2-carboxylate Chemical compound OC(=O)[C@@H]1C[C@H](S)CN1 OYNANFOWNSGDJL-IMJSIDKUSA-N 0.000 description 1
- PJDINCOFOROBQW-LURJTMIESA-N (3S)-3,7-diaminoheptanoic acid Chemical compound NCCCC[C@H](N)CC(O)=O PJDINCOFOROBQW-LURJTMIESA-N 0.000 description 1
- FSNCEEGOMTYXKY-SNVBAGLBSA-N (3r)-2,3,4,9-tetrahydro-1h-pyrido[3,4-b]indole-3-carboxylic acid Chemical compound N1C2=CC=CC=C2C2=C1CN[C@@H](C(=O)O)C2 FSNCEEGOMTYXKY-SNVBAGLBSA-N 0.000 description 1
- FSNCEEGOMTYXKY-JTQLQIEISA-N (3s)-2,3,4,9-tetrahydro-1h-pyrido[3,4-b]indole-3-carboxylic acid Chemical compound N1C2=CC=CC=C2C2=C1CN[C@H](C(=O)O)C2 FSNCEEGOMTYXKY-JTQLQIEISA-N 0.000 description 1
- 125000006527 (C1-C5) alkyl group Chemical group 0.000 description 1
- XJLSEXAGTJCILF-RXMQYKEDSA-N (R)-nipecotic acid zwitterion Chemical compound OC(=O)[C@@H]1CCCNC1 XJLSEXAGTJCILF-RXMQYKEDSA-N 0.000 description 1
- 125000003088 (fluoren-9-ylmethoxy)carbonyl group Chemical group 0.000 description 1
- 125000001376 1,2,4-triazolyl group Chemical group N1N=C(N=C1)* 0.000 description 1
- CSNIZNHTOVFARY-UHFFFAOYSA-N 1,2-benzothiazole Chemical compound C1=CC=C2C=NSC2=C1 CSNIZNHTOVFARY-UHFFFAOYSA-N 0.000 description 1
- KTZQTRPPVKQPFO-UHFFFAOYSA-N 1,2-benzoxazole Chemical compound C1=CC=C2C=NOC2=C1 KTZQTRPPVKQPFO-UHFFFAOYSA-N 0.000 description 1
- JIHQDMXYYFUGFV-UHFFFAOYSA-N 1,3,5-triazine Chemical compound C1=NC=NC=N1 JIHQDMXYYFUGFV-UHFFFAOYSA-N 0.000 description 1
- BCMCBBGGLRIHSE-UHFFFAOYSA-N 1,3-benzoxazole Chemical compound C1=CC=C2OC=NC2=C1 BCMCBBGGLRIHSE-UHFFFAOYSA-N 0.000 description 1
- FUVZDXDCPRQZSQ-UHFFFAOYSA-N 1,5,6,7-tetrahydroindazol-4-one Chemical compound O=C1CCCC2=C1C=NN2 FUVZDXDCPRQZSQ-UHFFFAOYSA-N 0.000 description 1
- QITDFMFPFGJWJY-UHFFFAOYSA-N 1-(2,2-diphenylethylamino)cyclopropane-1-carboxylic acid Chemical compound C=1C=CC=CC=1C(C=1C=CC=CC=1)CNC1(C(=O)O)CC1 QITDFMFPFGJWJY-UHFFFAOYSA-N 0.000 description 1
- MEZJQXVOMGUAMP-UHFFFAOYSA-N 1-(2-methylnaphthalen-1-yl)pyrrole-2,5-dione Chemical group CC1=CC=C2C=CC=CC2=C1N1C(=O)C=CC1=O MEZJQXVOMGUAMP-UHFFFAOYSA-N 0.000 description 1
- NDMFETHQFUOIQX-UHFFFAOYSA-N 1-(3-chloropropyl)imidazolidin-2-one Chemical compound ClCCCN1CCNC1=O NDMFETHQFUOIQX-UHFFFAOYSA-N 0.000 description 1
- FVTVMQPGKVHSEY-UHFFFAOYSA-N 1-AMINOCYCLOBUTANE CARBOXYLIC ACID Chemical compound OC(=O)C1(N)CCC1 FVTVMQPGKVHSEY-UHFFFAOYSA-N 0.000 description 1
- NILQLFBWTXNUOE-UHFFFAOYSA-N 1-aminocyclopentanecarboxylic acid Chemical compound OC(=O)C1(N)CCCC1 NILQLFBWTXNUOE-UHFFFAOYSA-N 0.000 description 1
- PAJPWUMXBYXFCZ-UHFFFAOYSA-N 1-aminocyclopropanecarboxylic acid Chemical compound OC(=O)C1(N)CC1 PAJPWUMXBYXFCZ-UHFFFAOYSA-N 0.000 description 1
- HOURAEXAZWBEAW-UHFFFAOYSA-N 1-aminopropane-1,1,3-tricarboxylic acid Chemical compound OC(=O)C(C(O)=O)(N)CCC(O)=O HOURAEXAZWBEAW-UHFFFAOYSA-N 0.000 description 1
- HTTPGMNPPMMMOP-UHFFFAOYSA-N 1-azaniumyl-2,3-dihydroindene-1-carboxylate Chemical compound C1=CC=C2C(N)(C(O)=O)CCC2=C1 HTTPGMNPPMMMOP-UHFFFAOYSA-N 0.000 description 1
- WLKSPGHQGFFKGE-UHFFFAOYSA-N 1-chloropropan-2-yl n-(3-chlorophenyl)carbamate Chemical group ClCC(C)OC(=O)NC1=CC=CC(Cl)=C1 WLKSPGHQGFFKGE-UHFFFAOYSA-N 0.000 description 1
- KXZQYLBVMZGIKC-UHFFFAOYSA-N 1-pyridin-2-yl-n-(pyridin-2-ylmethyl)methanamine Chemical compound C=1C=CC=NC=1CNCC1=CC=CC=N1 KXZQYLBVMZGIKC-UHFFFAOYSA-N 0.000 description 1
- TZMSYXZUNZXBOL-UHFFFAOYSA-N 10H-phenoxazine Chemical compound C1=CC=C2NC3=CC=CC=C3OC2=C1 TZMSYXZUNZXBOL-UHFFFAOYSA-N 0.000 description 1
- HYZJCKYKOHLVJF-UHFFFAOYSA-N 1H-benzimidazole Chemical compound C1=CC=C2NC=NC2=C1 HYZJCKYKOHLVJF-UHFFFAOYSA-N 0.000 description 1
- BAXOFTOLAUCFNW-UHFFFAOYSA-N 1H-indazole Chemical compound C1=CC=C2C=NNC2=C1 BAXOFTOLAUCFNW-UHFFFAOYSA-N 0.000 description 1
- JGQPSDIWMGNAPE-UHFFFAOYSA-N 2,1-benzothiazole Chemical compound C1=CC=CC2=CSN=C21 JGQPSDIWMGNAPE-UHFFFAOYSA-N 0.000 description 1
- FZKCAHQKNJXICB-UHFFFAOYSA-N 2,1-benzoxazole Chemical compound C1=CC=CC2=CON=C21 FZKCAHQKNJXICB-UHFFFAOYSA-N 0.000 description 1
- UTQNKKSJPHTPBS-UHFFFAOYSA-N 2,2,2-trichloroethanone Chemical group ClC(Cl)(Cl)[C]=O UTQNKKSJPHTPBS-UHFFFAOYSA-N 0.000 description 1
- BLCJBICVQSYOIF-UHFFFAOYSA-N 2,2-diaminobutanoic acid Chemical compound CCC(N)(N)C(O)=O BLCJBICVQSYOIF-UHFFFAOYSA-N 0.000 description 1
- YQTCQNIPQMJNTI-UHFFFAOYSA-N 2,2-dimethylpropan-1-one Chemical group CC(C)(C)[C]=O YQTCQNIPQMJNTI-UHFFFAOYSA-N 0.000 description 1
- VEPOHXYIFQMVHW-XOZOLZJESA-N 2,3-dihydroxybutanedioic acid (2S,3S)-3,4-dimethyl-2-phenylmorpholine Chemical compound OC(C(O)C(O)=O)C(O)=O.C[C@H]1[C@@H](OCCN1C)c1ccccc1 VEPOHXYIFQMVHW-XOZOLZJESA-N 0.000 description 1
- JTBWDIQPPZCDHV-UHFFFAOYSA-N 2-(1-azaniumylcyclohexyl)acetate Chemical compound [O-]C(=O)CC1([NH3+])CCCCC1 JTBWDIQPPZCDHV-UHFFFAOYSA-N 0.000 description 1
- WAAJQPAIOASFSC-UHFFFAOYSA-N 2-(1-hydroxyethylamino)acetic acid Chemical compound CC(O)NCC(O)=O WAAJQPAIOASFSC-UHFFFAOYSA-N 0.000 description 1
- ITYQPPZZOYSACT-UHFFFAOYSA-N 2-(2,2-dimethylpropylamino)acetic acid Chemical compound CC(C)(C)CNCC(O)=O ITYQPPZZOYSACT-UHFFFAOYSA-N 0.000 description 1
- UEQSFWNXRZJTKB-UHFFFAOYSA-N 2-(2,2-diphenylethylamino)acetic acid Chemical compound C=1C=CC=CC=1C(CNCC(=O)O)C1=CC=CC=C1 UEQSFWNXRZJTKB-UHFFFAOYSA-N 0.000 description 1
- PIINGYXNCHTJTF-UHFFFAOYSA-N 2-(2-azaniumylethylamino)acetate Chemical compound NCCNCC(O)=O PIINGYXNCHTJTF-UHFFFAOYSA-N 0.000 description 1
- XCDGCRLSSSSBIA-UHFFFAOYSA-N 2-(2-methylsulfanylethylamino)acetic acid Chemical compound CSCCNCC(O)=O XCDGCRLSSSSBIA-UHFFFAOYSA-N 0.000 description 1
- IAICFWDJMWEXAO-UHFFFAOYSA-N 2-(2-sulfanylethylamino)acetic acid Chemical compound OC(=O)CNCCS IAICFWDJMWEXAO-UHFFFAOYSA-N 0.000 description 1
- STMXJQHRRCPJCJ-UHFFFAOYSA-N 2-(3,3-diphenylpropylamino)acetic acid Chemical compound C=1C=CC=CC=1C(CCNCC(=O)O)C1=CC=CC=C1 STMXJQHRRCPJCJ-UHFFFAOYSA-N 0.000 description 1
- DZNWPRKEUQEDSZ-UHFFFAOYSA-N 2-(3-amino-2-oxopyridin-1-yl)acetic acid Chemical compound NC1=CC=CN(CC(O)=O)C1=O DZNWPRKEUQEDSZ-UHFFFAOYSA-N 0.000 description 1
- DHGYLUFLENKZHH-UHFFFAOYSA-N 2-(3-aminopropylamino)acetic acid Chemical compound NCCCNCC(O)=O DHGYLUFLENKZHH-UHFFFAOYSA-N 0.000 description 1
- OGAULEBSQQMUKP-UHFFFAOYSA-N 2-(4-aminobutylamino)acetic acid Chemical compound NCCCCNCC(O)=O OGAULEBSQQMUKP-UHFFFAOYSA-N 0.000 description 1
- KGSVNOLLROCJQM-UHFFFAOYSA-N 2-(benzylamino)acetic acid Chemical compound OC(=O)CNCC1=CC=CC=C1 KGSVNOLLROCJQM-UHFFFAOYSA-N 0.000 description 1
- IVCQRTJVLJXKKJ-UHFFFAOYSA-N 2-(butan-2-ylazaniumyl)acetate Chemical compound CCC(C)NCC(O)=O IVCQRTJVLJXKKJ-UHFFFAOYSA-N 0.000 description 1
- KQLGGQARRCMYGD-UHFFFAOYSA-N 2-(cyclobutylamino)acetic acid Chemical compound OC(=O)CNC1CCC1 KQLGGQARRCMYGD-UHFFFAOYSA-N 0.000 description 1
- CTVIWLLGUFGSLY-UHFFFAOYSA-N 2-(cyclohexylazaniumyl)-2-methylpropanoate Chemical compound OC(=O)C(C)(C)NC1CCCCC1 CTVIWLLGUFGSLY-UHFFFAOYSA-N 0.000 description 1
- OQMYZVWIXPPDDE-UHFFFAOYSA-N 2-(cyclohexylazaniumyl)acetate Chemical compound OC(=O)CNC1CCCCC1 OQMYZVWIXPPDDE-UHFFFAOYSA-N 0.000 description 1
- PNKNDNFLQNMQJL-UHFFFAOYSA-N 2-(cyclooctylazaniumyl)acetate Chemical compound OC(=O)CNC1CCCCCCC1 PNKNDNFLQNMQJL-UHFFFAOYSA-N 0.000 description 1
- DXQCCQKRNWMECV-UHFFFAOYSA-N 2-(cyclopropylazaniumyl)acetate Chemical compound OC(=O)CNC1CC1 DXQCCQKRNWMECV-UHFFFAOYSA-N 0.000 description 1
- PRVOMNLNSHAUEI-UHFFFAOYSA-N 2-(cycloundecylamino)acetic acid Chemical compound OC(=O)CNC1CCCCCCCCCC1 PRVOMNLNSHAUEI-UHFFFAOYSA-N 0.000 description 1
- HEPOIJKOXBKKNJ-UHFFFAOYSA-N 2-(propan-2-ylazaniumyl)acetate Chemical compound CC(C)NCC(O)=O HEPOIJKOXBKKNJ-UHFFFAOYSA-N 0.000 description 1
- TXHAHOVNFDVCCC-UHFFFAOYSA-N 2-(tert-butylazaniumyl)acetate Chemical compound CC(C)(C)NCC(O)=O TXHAHOVNFDVCCC-UHFFFAOYSA-N 0.000 description 1
- QWCKQJZIFLGMSD-UHFFFAOYSA-N 2-Aminobutanoic acid Natural products CCC(N)C(O)=O QWCKQJZIFLGMSD-UHFFFAOYSA-N 0.000 description 1
- AWEZYTUWDZADKR-UHFFFAOYSA-N 2-[(2-amino-2-oxoethyl)azaniumyl]acetate Chemical compound NC(=O)CNCC(O)=O AWEZYTUWDZADKR-UHFFFAOYSA-N 0.000 description 1
- MNDBDVPDSHGIHR-UHFFFAOYSA-N 2-[(3-amino-3-oxopropyl)amino]acetic acid Chemical compound NC(=O)CCNCC(O)=O MNDBDVPDSHGIHR-UHFFFAOYSA-N 0.000 description 1
- IGVHXMVKIMYIRY-UHFFFAOYSA-N 2-[bis(3-aminopropyl)amino]acetic acid Chemical compound NCCCN(CC(O)=O)CCCN IGVHXMVKIMYIRY-UHFFFAOYSA-N 0.000 description 1
- UHQFXIWMAQOCAN-UHFFFAOYSA-N 2-amino-1,3-dihydroindene-2-carboxylic acid Chemical compound C1=CC=C2CC(N)(C(O)=O)CC2=C1 UHQFXIWMAQOCAN-UHFFFAOYSA-N 0.000 description 1
- AKVBCGQVQXPRLD-UHFFFAOYSA-N 2-aminooctanoic acid Chemical compound CCCCCCC(N)C(O)=O AKVBCGQVQXPRLD-UHFFFAOYSA-N 0.000 description 1
- WAMWSIDTKSNDCU-UHFFFAOYSA-N 2-azaniumyl-2-cyclohexylacetate Chemical compound OC(=O)C(N)C1CCCCC1 WAMWSIDTKSNDCU-UHFFFAOYSA-N 0.000 description 1
- CDULPPOISZOUTK-UHFFFAOYSA-N 2-azaniumyl-3,4-dihydro-1h-naphthalene-2-carboxylate Chemical compound C1=CC=C2CC(N)(C(O)=O)CCC2=C1 CDULPPOISZOUTK-UHFFFAOYSA-N 0.000 description 1
- KFHRMMHGGBCRIV-UHFFFAOYSA-N 2-azaniumyl-4-methoxybutanoate Chemical compound COCCC(N)C(O)=O KFHRMMHGGBCRIV-UHFFFAOYSA-N 0.000 description 1
- JTTHKOPSMAVJFE-UHFFFAOYSA-N 2-azaniumyl-4-phenylbutanoate Chemical compound OC(=O)C(N)CCC1=CC=CC=C1 JTTHKOPSMAVJFE-UHFFFAOYSA-N 0.000 description 1
- UXGVMFHEKMGWMA-UHFFFAOYSA-N 2-benzofuran Chemical compound C1=CC=CC2=COC=C21 UXGVMFHEKMGWMA-UHFFFAOYSA-N 0.000 description 1
- LYTMVABTDYMBQK-UHFFFAOYSA-N 2-benzothiophene Chemical compound C1=CC=CC2=CSC=C21 LYTMVABTDYMBQK-UHFFFAOYSA-N 0.000 description 1
- IZQAUUVBKYXMET-UHFFFAOYSA-N 2-bromoethanamine Chemical compound NCCBr IZQAUUVBKYXMET-UHFFFAOYSA-N 0.000 description 1
- WRFPVMFCRNYQNR-UHFFFAOYSA-N 2-hydroxyphenylalanine Chemical compound OC(=O)C(N)CC1=CC=CC=C1O WRFPVMFCRNYQNR-UHFFFAOYSA-N 0.000 description 1
- 125000000094 2-phenylethyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])C([H])([H])* 0.000 description 1
- PRNLNZMJMCUWNV-UHFFFAOYSA-N 2-piperidin-1-ium-2-ylacetate Chemical compound OC(=O)CC1CCCCN1 PRNLNZMJMCUWNV-UHFFFAOYSA-N 0.000 description 1
- 125000003903 2-propenyl group Chemical group [H]C([*])([H])C([H])=C([H])[H] 0.000 description 1
- VHMICKWLTGFITH-UHFFFAOYSA-N 2H-isoindole Chemical compound C1=CC=CC2=CNC=C21 VHMICKWLTGFITH-UHFFFAOYSA-N 0.000 description 1
- DFSFLZCLKYZYRD-UHFFFAOYSA-N 3,4-diethoxycyclobut-3-ene-1,2-dione Chemical compound CCOC1=C(OCC)C(=O)C1=O DFSFLZCLKYZYRD-UHFFFAOYSA-N 0.000 description 1
- UENRXLSRMCSUSN-UHFFFAOYSA-N 3,5-diaminobenzoic acid Chemical compound NC1=CC(N)=CC(C(O)=O)=C1 UENRXLSRMCSUSN-UHFFFAOYSA-N 0.000 description 1
- ZAQXSMCYFQJRCQ-UHFFFAOYSA-N 3-(1-adamantyl)-2-azaniumylpropanoate Chemical compound C1C(C2)CC3CC2CC1(CC(N)C(O)=O)C3 ZAQXSMCYFQJRCQ-UHFFFAOYSA-N 0.000 description 1
- AJHPGXZOIAYYDW-UHFFFAOYSA-N 3-(2-cyanophenyl)-2-[(2-methylpropan-2-yl)oxycarbonylamino]propanoic acid Chemical compound CC(C)(C)OC(=O)NC(C(O)=O)CC1=CC=CC=C1C#N AJHPGXZOIAYYDW-UHFFFAOYSA-N 0.000 description 1
- XFXOLBNQYFRSLQ-UHFFFAOYSA-N 3-amino-2-naphthoic acid Chemical compound C1=CC=C2C=C(C(O)=O)C(N)=CC2=C1 XFXOLBNQYFRSLQ-UHFFFAOYSA-N 0.000 description 1
- WBXOONOXOHMGQW-UHFFFAOYSA-N 3-aminobicyclo[2.2.1]heptane-4-carboxylic acid Chemical compound C1CC2(C(O)=O)C(N)CC1C2 WBXOONOXOHMGQW-UHFFFAOYSA-N 0.000 description 1
- CKTUXQBZPWBFDX-UHFFFAOYSA-N 3-azaniumylcyclohexane-1-carboxylate Chemical compound NC1CCCC(C(O)=O)C1 CKTUXQBZPWBFDX-UHFFFAOYSA-N 0.000 description 1
- 125000000474 3-butynyl group Chemical group [H]C#CC([H])([H])C([H])([H])* 0.000 description 1
- AOKCDAVWJLOAHG-UHFFFAOYSA-N 4-(methylamino)butyric acid Chemical compound C[NH2+]CCCC([O-])=O AOKCDAVWJLOAHG-UHFFFAOYSA-N 0.000 description 1
- ALYNCZNDIQEVRV-PZFLKRBQSA-N 4-amino-3,5-ditritiobenzoic acid Chemical compound [3H]c1cc(cc([3H])c1N)C(O)=O ALYNCZNDIQEVRV-PZFLKRBQSA-N 0.000 description 1
- SHINASQYHDCLEU-UHFFFAOYSA-N 4-aminopyrrolidine-2-carboxylic acid Chemical compound NC1CNC(C(O)=O)C1 SHINASQYHDCLEU-UHFFFAOYSA-N 0.000 description 1
- WIPRZHGCPZSPLJ-UHFFFAOYSA-N 4-aminotetrahydro-2H-thiopyran-4-carboxylic acid Chemical compound OC(=O)C1(N)CCSCC1 WIPRZHGCPZSPLJ-UHFFFAOYSA-N 0.000 description 1
- AEBRINKRALSWNY-UHFFFAOYSA-N 4-azaniumyl-2-methylbutanoate Chemical compound OC(=O)C(C)CCN AEBRINKRALSWNY-UHFFFAOYSA-N 0.000 description 1
- DRNGLYHKYPNTEA-UHFFFAOYSA-N 4-azaniumylcyclohexane-1-carboxylate Chemical compound NC1CCC(C(O)=O)CC1 DRNGLYHKYPNTEA-UHFFFAOYSA-N 0.000 description 1
- 125000002672 4-bromobenzoyl group Chemical group BrC1=CC=C(C(=O)*)C=C1 0.000 description 1
- 125000000242 4-chlorobenzoyl group Chemical group ClC1=CC=C(C(=O)*)C=C1 0.000 description 1
- PMQQFSDIECYOQV-UHFFFAOYSA-N 5,5-dimethyl-1,3-thiazolidin-3-ium-4-carboxylate Chemical compound CC1(C)SCNC1C(O)=O PMQQFSDIECYOQV-UHFFFAOYSA-N 0.000 description 1
- 229940117976 5-hydroxylysine Drugs 0.000 description 1
- ODHCTXKNWHHXJC-GSVOUGTGSA-N 5-oxo-D-proline Chemical compound OC(=O)[C@H]1CCC(=O)N1 ODHCTXKNWHHXJC-GSVOUGTGSA-N 0.000 description 1
- ODHCTXKNWHHXJC-VKHMYHEASA-N 5-oxo-L-proline Chemical compound OC(=O)[C@@H]1CCC(=O)N1 ODHCTXKNWHHXJC-VKHMYHEASA-N 0.000 description 1
- RYFOQDQDVYIEHN-UHFFFAOYSA-N 6-amino-2-(dimethylamino)hexanoic acid Chemical compound CN(C)C(C(O)=O)CCCCN RYFOQDQDVYIEHN-UHFFFAOYSA-N 0.000 description 1
- HBAQYPYDRFILMT-UHFFFAOYSA-N 8-[3-(1-cyclopropylpyrazol-4-yl)-1H-pyrazolo[4,3-d]pyrimidin-5-yl]-3-methyl-3,8-diazabicyclo[3.2.1]octan-2-one Chemical class C1(CC1)N1N=CC(=C1)C1=NNC2=C1N=C(N=C2)N1C2C(N(CC1CC2)C)=O HBAQYPYDRFILMT-UHFFFAOYSA-N 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 206010067484 Adverse reaction Diseases 0.000 description 1
- 108010077480 Albumin Receptors Proteins 0.000 description 1
- 244000303258 Annona diversifolia Species 0.000 description 1
- 235000002198 Annona diversifolia Nutrition 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- IADUEWIQBXOCDZ-VKHMYHEASA-N Azetidine-2-carboxylic acid Natural products OC(=O)[C@@H]1CCN1 IADUEWIQBXOCDZ-VKHMYHEASA-N 0.000 description 1
- 239000005711 Benzoic acid Substances 0.000 description 1
- 102000004506 Blood Proteins Human genes 0.000 description 1
- 108010017384 Blood Proteins Proteins 0.000 description 1
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 1
- 241000282836 Camelus dromedarius Species 0.000 description 1
- KXDHJXZQYSOELW-UHFFFAOYSA-N Carbamic acid Chemical group NC(O)=O KXDHJXZQYSOELW-UHFFFAOYSA-N 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- ZAMOUSCENKQFHK-UHFFFAOYSA-N Chlorine atom Chemical compound [Cl] ZAMOUSCENKQFHK-UHFFFAOYSA-N 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 241000272201 Columbiformes Species 0.000 description 1
- AHLPHDHHMVZTML-SCSAIBSYSA-N D-Ornithine Chemical compound NCCC[C@@H](N)C(O)=O AHLPHDHHMVZTML-SCSAIBSYSA-N 0.000 description 1
- FDKWRPBBCBCIGA-UWTATZPHSA-N D-Selenocysteine Natural products [Se]C[C@@H](N)C(O)=O FDKWRPBBCBCIGA-UWTATZPHSA-N 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- DZLNHFMRPBPULJ-GSVOUGTGSA-N D-thioproline Chemical compound OC(=O)[C@H]1CSCN1 DZLNHFMRPBPULJ-GSVOUGTGSA-N 0.000 description 1
- 108020004414 DNA Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- BRLQWZUYTZBJKN-UHFFFAOYSA-N Epichlorohydrin Chemical compound ClCC1CO1 BRLQWZUYTZBJKN-UHFFFAOYSA-N 0.000 description 1
- 241000283074 Equus asinus Species 0.000 description 1
- NIGWMJHCCYYCSF-UHFFFAOYSA-N Fenclonine Chemical compound OC(=O)C(N)CC1=CC=C(Cl)C=C1 NIGWMJHCCYYCSF-UHFFFAOYSA-N 0.000 description 1
- 101000609762 Gallus gallus Ovalbumin Proteins 0.000 description 1
- 108010078851 HIV Reverse Transcriptase Proteins 0.000 description 1
- 102100026122 High affinity immunoglobulin gamma Fc receptor I Human genes 0.000 description 1
- 101000913074 Homo sapiens High affinity immunoglobulin gamma Fc receptor I Proteins 0.000 description 1
- 101000917858 Homo sapiens Low affinity immunoglobulin gamma Fc region receptor III-A Proteins 0.000 description 1
- 208000029462 Immunodeficiency disease Diseases 0.000 description 1
- SNDPXSYFESPGGJ-BYPYZUCNSA-N L-2-aminopentanoic acid Chemical compound CCC[C@H](N)C(O)=O SNDPXSYFESPGGJ-BYPYZUCNSA-N 0.000 description 1
- QUOGESRFPZDMMT-UHFFFAOYSA-N L-Homoarginine Natural products OC(=O)C(N)CCCCNC(N)=N QUOGESRFPZDMMT-UHFFFAOYSA-N 0.000 description 1
- LOOZZTFGSTZNRX-VIFPVBQESA-N L-Homotyrosine Chemical compound OC(=O)[C@@H](N)CCC1=CC=C(O)C=C1 LOOZZTFGSTZNRX-VIFPVBQESA-N 0.000 description 1
- 235000019766 L-Lysine Nutrition 0.000 description 1
- GDFAOVXKHJXLEI-UHFFFAOYSA-N L-N-Boc-N-methylalanine Natural products CNC(C)C(O)=O GDFAOVXKHJXLEI-UHFFFAOYSA-N 0.000 description 1
- RHGKLRLOHDJJDR-BYPYZUCNSA-N L-citrulline Chemical compound NC(=O)NCCC[C@H]([NH3+])C([O-])=O RHGKLRLOHDJJDR-BYPYZUCNSA-N 0.000 description 1
- QUOGESRFPZDMMT-YFKPBYRVSA-N L-homoarginine Chemical compound OC(=O)[C@@H](N)CCCCNC(N)=N QUOGESRFPZDMMT-YFKPBYRVSA-N 0.000 description 1
- FFFHZYDWPBMWHY-VKHMYHEASA-N L-homocysteine Chemical compound OC(=O)[C@@H](N)CCS FFFHZYDWPBMWHY-VKHMYHEASA-N 0.000 description 1
- FMUMEWVNYMUECA-UHFFFAOYSA-N L-homoleucine Natural products CC(C)CCC(N)C(O)=O FMUMEWVNYMUECA-UHFFFAOYSA-N 0.000 description 1
- JTTHKOPSMAVJFE-VIFPVBQESA-N L-homophenylalanine Chemical compound OC(=O)[C@@H](N)CCC1=CC=CC=C1 JTTHKOPSMAVJFE-VIFPVBQESA-N 0.000 description 1
- JZKXXXDKRQWDET-QMMMGPOBSA-N L-m-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC(O)=C1 JZKXXXDKRQWDET-QMMMGPOBSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- QEFRNWWLZKMPFJ-ZXPFJRLXSA-N L-methionine (R)-S-oxide Chemical compound C[S@@](=O)CC[C@H]([NH3+])C([O-])=O QEFRNWWLZKMPFJ-ZXPFJRLXSA-N 0.000 description 1
- QEFRNWWLZKMPFJ-UHFFFAOYSA-N L-methionine sulphoxide Natural products CS(=O)CCC(N)C(O)=O QEFRNWWLZKMPFJ-UHFFFAOYSA-N 0.000 description 1
- 125000000393 L-methionino group Chemical group [H]OC(=O)[C@@]([H])(N([H])[*])C([H])([H])C(SC([H])([H])[H])([H])[H] 0.000 description 1
- 125000000174 L-prolyl group Chemical group [H]N1C([H])([H])C([H])([H])C([H])([H])[C@@]1([H])C(*)=O 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- 125000000510 L-tryptophano group Chemical group [H]C1=C([H])C([H])=C2N([H])C([H])=C(C([H])([H])[C@@]([H])(C(O[H])=O)N([H])[*])C2=C1[H] 0.000 description 1
- NHTGHBARYWONDQ-JTQLQIEISA-N L-α-methyl-Tyrosine Chemical compound OC(=O)[C@](N)(C)CC1=CC=C(O)C=C1 NHTGHBARYWONDQ-JTQLQIEISA-N 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 241000282560 Macaca mulatta Species 0.000 description 1
- CYZKJBZEIFWZSR-LURJTMIESA-N N(alpha)-methyl-L-histidine Chemical compound CN[C@H](C(O)=O)CC1=CNC=N1 CYZKJBZEIFWZSR-LURJTMIESA-N 0.000 description 1
- CZCIKBSVHDNIDH-NSHDSACASA-N N(alpha)-methyl-L-tryptophan Chemical compound C1=CC=C2C(C[C@H]([NH2+]C)C([O-])=O)=CNC2=C1 CZCIKBSVHDNIDH-NSHDSACASA-N 0.000 description 1
- YDGMGEXADBMOMJ-LURJTMIESA-N N(g)-dimethylarginine Chemical compound CN(C)C(\N)=N\CCC[C@H](N)C(O)=O YDGMGEXADBMOMJ-LURJTMIESA-N 0.000 description 1
- NRFJZTXWLKPZAV-UHFFFAOYSA-N N-(2-oxo-3-thiolanyl)acetamide Chemical compound CC(=O)NC1CCSC1=O NRFJZTXWLKPZAV-UHFFFAOYSA-N 0.000 description 1
- WRUZLCLJULHLEY-UHFFFAOYSA-N N-(p-hydroxyphenyl)glycine Chemical compound OC(=O)CNC1=CC=C(O)C=C1 WRUZLCLJULHLEY-UHFFFAOYSA-N 0.000 description 1
- VKZGJEWGVNFKPE-UHFFFAOYSA-N N-Isobutylglycine Chemical compound CC(C)CNCC(O)=O VKZGJEWGVNFKPE-UHFFFAOYSA-N 0.000 description 1
- SCIFESDRCALIIM-UHFFFAOYSA-N N-Me-Phenylalanine Natural products CNC(C(O)=O)CC1=CC=CC=C1 SCIFESDRCALIIM-UHFFFAOYSA-N 0.000 description 1
- HOKKHZGPKSLGJE-GSVOUGTGSA-N N-Methyl-D-aspartic acid Chemical compound CN[C@@H](C(O)=O)CC(O)=O HOKKHZGPKSLGJE-GSVOUGTGSA-N 0.000 description 1
- NTWVQPHTOUKMDI-YFKPBYRVSA-N N-Methyl-arginine Chemical compound CN[C@H](C(O)=O)CCCN=C(N)N NTWVQPHTOUKMDI-YFKPBYRVSA-N 0.000 description 1
- 230000004988 N-glycosylation Effects 0.000 description 1
- GDFAOVXKHJXLEI-VKHMYHEASA-N N-methyl-L-alanine Chemical compound C[NH2+][C@@H](C)C([O-])=O GDFAOVXKHJXLEI-VKHMYHEASA-N 0.000 description 1
- XLBVNMSMFQMKEY-BYPYZUCNSA-N N-methyl-L-glutamic acid Chemical compound CN[C@H](C(O)=O)CCC(O)=O XLBVNMSMFQMKEY-BYPYZUCNSA-N 0.000 description 1
- KSPIYJQBLVDRRI-WDSKDSINSA-N N-methyl-L-isoleucine Chemical compound CC[C@H](C)[C@H](NC)C(O)=O KSPIYJQBLVDRRI-WDSKDSINSA-N 0.000 description 1
- YAXAFCHJCYILRU-YFKPBYRVSA-N N-methyl-L-methionine Chemical compound C[NH2+][C@H](C([O-])=O)CCSC YAXAFCHJCYILRU-YFKPBYRVSA-N 0.000 description 1
- SCIFESDRCALIIM-VIFPVBQESA-N N-methyl-L-phenylalanine Chemical compound C[NH2+][C@H](C([O-])=O)CC1=CC=CC=C1 SCIFESDRCALIIM-VIFPVBQESA-N 0.000 description 1
- AKCRVYNORCOYQT-YFKPBYRVSA-N N-methyl-L-valine Chemical compound CN[C@@H](C(C)C)C(O)=O AKCRVYNORCOYQT-YFKPBYRVSA-N 0.000 description 1
- CWLQUGTUXBXTLF-YFKPBYRVSA-N N-methylproline Chemical compound CN1CCC[C@H]1C(O)=O CWLQUGTUXBXTLF-YFKPBYRVSA-N 0.000 description 1
- 125000000520 N-substituted aminocarbonyl group Chemical group [*]NC(=O)* 0.000 description 1
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 1
- 229940123424 Neuraminidase inhibitor Drugs 0.000 description 1
- 101100426732 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) cys-9 gene Proteins 0.000 description 1
- JCXJVPUVTGWSNB-UHFFFAOYSA-N Nitrogen dioxide Chemical compound O=[N]=O JCXJVPUVTGWSNB-UHFFFAOYSA-N 0.000 description 1
- GEYBMYRBIABFTA-VIFPVBQESA-N O-methyl-L-tyrosine Chemical compound COC1=CC=C(C[C@H](N)C(O)=O)C=C1 GEYBMYRBIABFTA-VIFPVBQESA-N 0.000 description 1
- KNTFCRCCPLEUQZ-VKHMYHEASA-N O-methylserine Chemical compound COC[C@H](N)C(O)=O KNTFCRCCPLEUQZ-VKHMYHEASA-N 0.000 description 1
- UUKJGZSUMOVVOS-VXKWHMMOSA-N OC(=O)[C@@H]1C[C@@H](CN1)SC(c1ccccc1)(c1ccccc1)c1ccccc1 Chemical compound OC(=O)[C@@H]1C[C@@H](CN1)SC(c1ccccc1)(c1ccccc1)c1ccccc1 UUKJGZSUMOVVOS-VXKWHMMOSA-N 0.000 description 1
- MMZUHXUJQSMMAV-YOEHRIQHSA-N OC(=O)[C@@H]1C[C@H](Cc2ccc(cc2)-c2ccccc2)CN1 Chemical compound OC(=O)[C@@H]1C[C@H](Cc2ccc(cc2)-c2ccccc2)CN1 MMZUHXUJQSMMAV-YOEHRIQHSA-N 0.000 description 1
- 102000038030 PI3Ks Human genes 0.000 description 1
- 108091007960 PI3Ks Proteins 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 241000009328 Perro Species 0.000 description 1
- PCNDJXKNXGMECE-UHFFFAOYSA-N Phenazine Natural products C1=CC=CC2=NC3=CC=CC=C3N=C21 PCNDJXKNXGMECE-UHFFFAOYSA-N 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 229940123066 Polymerase inhibitor Drugs 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- 241000282898 Sus scrofa Species 0.000 description 1
- 230000024932 T cell mediated immunity Effects 0.000 description 1
- DHXVGJBLRPWPCS-UHFFFAOYSA-N Tetrahydropyran Chemical compound C1CCOCC1 DHXVGJBLRPWPCS-UHFFFAOYSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Chemical compound CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 241001416177 Vicugna pacos Species 0.000 description 1
- 108020005202 Viral DNA Proteins 0.000 description 1
- 125000002777 acetyl group Chemical group [H]C([H])([H])C(*)=O 0.000 description 1
- 150000008065 acid anhydrides Chemical class 0.000 description 1
- ODHCTXKNWHHXJC-UHFFFAOYSA-N acide pyroglutamique Natural products OC(=O)C1CCC(=O)N1 ODHCTXKNWHHXJC-UHFFFAOYSA-N 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 125000005076 adamantyloxycarbonyl group Chemical group C12(CC3CC(CC(C1)C3)C2)OC(=O)* 0.000 description 1
- 230000006838 adverse reaction Effects 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 125000003545 alkoxy group Chemical group 0.000 description 1
- 150000001350 alkyl halides Chemical class 0.000 description 1
- 239000002168 alkylating agent Substances 0.000 description 1
- 229940100198 alkylating agent Drugs 0.000 description 1
- 230000002152 alkylating effect Effects 0.000 description 1
- 208000026935 allergic disease Diseases 0.000 description 1
- WNNNWFKQCKFSDK-UHFFFAOYSA-N allylglycine Chemical compound OC(=O)C(N)CC=C WNNNWFKQCKFSDK-UHFFFAOYSA-N 0.000 description 1
- 150000001370 alpha-amino acid derivatives Chemical class 0.000 description 1
- 235000008206 alpha-amino acids Nutrition 0.000 description 1
- HYOWVAAEQCNGLE-JTQLQIEISA-N alpha-methyl-L-phenylalanine Chemical compound OC(=O)[C@](N)(C)CC1=CC=CC=C1 HYOWVAAEQCNGLE-JTQLQIEISA-N 0.000 description 1
- ZYVMPHJZWXIFDQ-LURJTMIESA-N alpha-methylmethionine Chemical compound CSCC[C@](C)(N)C(O)=O ZYVMPHJZWXIFDQ-LURJTMIESA-N 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- UGJQDKYTAYNNBH-UHFFFAOYSA-N amino cyclopropanecarboxylate Chemical compound NOC(=O)C1CC1 UGJQDKYTAYNNBH-UHFFFAOYSA-N 0.000 description 1
- 238000013103 analytical ultracentrifugation Methods 0.000 description 1
- 150000008064 anhydrides Chemical class 0.000 description 1
- 239000002259 anti human immunodeficiency virus agent Substances 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 150000001502 aryl halides Chemical class 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- YDGMGEXADBMOMJ-UHFFFAOYSA-N asymmetrical dimethylarginine Natural products CN(C)C(N)=NCCCC(N)C(O)=O YDGMGEXADBMOMJ-UHFFFAOYSA-N 0.000 description 1
- GFZWHAAOIVMHOI-UHFFFAOYSA-N azetidine-3-carboxylic acid Chemical compound OC(=O)C1CNC1 GFZWHAAOIVMHOI-UHFFFAOYSA-N 0.000 description 1
- 210000003651 basophil Anatomy 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- RFRXIWQYSOIBDI-UHFFFAOYSA-N benzarone Chemical compound CCC=1OC2=CC=CC=C2C=1C(=O)C1=CC=C(O)C=C1 RFRXIWQYSOIBDI-UHFFFAOYSA-N 0.000 description 1
- 125000000499 benzofuranyl group Chemical group O1C(=CC2=C1C=CC=C2)* 0.000 description 1
- 125000003354 benzotriazolyl group Chemical group N1N=NC2=C1C=CC=C2* 0.000 description 1
- 125000004541 benzoxazolyl group Chemical group O1C(=NC2=C1C=CC=C2)* 0.000 description 1
- 125000003236 benzoyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C(*)=O 0.000 description 1
- 125000001584 benzyloxycarbonyl group Chemical group C(=O)(OCC1=CC=CC=C1)* 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000008275 binding mechanism Effects 0.000 description 1
- 235000010290 biphenyl Nutrition 0.000 description 1
- 239000004305 biphenyl Substances 0.000 description 1
- JCZLABDVDPYLRZ-AWEZNQCLSA-N biphenylalanine Chemical compound C1=CC(C[C@H](N)C(O)=O)=CC=C1C1=CC=CC=C1 JCZLABDVDPYLRZ-AWEZNQCLSA-N 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 230000017531 blood circulation Effects 0.000 description 1
- 230000008081 blood perfusion Effects 0.000 description 1
- 239000008366 buffered solution Substances 0.000 description 1
- 230000000981 bystander Effects 0.000 description 1
- 125000001951 carbamoylamino group Chemical group C(N)(=O)N* 0.000 description 1
- 125000001589 carboacyl group Chemical group 0.000 description 1
- 150000001718 carbodiimides Chemical class 0.000 description 1
- 150000001735 carboxylic acids Chemical class 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 125000003636 chemical group Chemical group 0.000 description 1
- 125000004965 chloroalkyl group Chemical group 0.000 description 1
- WCZVZNOTHYJIEI-UHFFFAOYSA-N cinnoline Chemical compound N1=NC=CC2=CC=CC=C21 WCZVZNOTHYJIEI-UHFFFAOYSA-N 0.000 description 1
- 229960004753 citiolone Drugs 0.000 description 1
- 238000002648 combination therapy Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 230000001010 compromised effect Effects 0.000 description 1
- 125000006165 cyclic alkyl group Chemical group 0.000 description 1
- 150000001923 cyclic compounds Chemical class 0.000 description 1
- 125000001995 cyclobutyl group Chemical group [H]C1([H])C([H])([H])C([H])(*)C1([H])[H] 0.000 description 1
- 125000001162 cycloheptenyl group Chemical group C1(=CCCCCC1)* 0.000 description 1
- 125000000582 cycloheptyl group Chemical group [H]C1([H])C([H])([H])C([H])([H])C([H])([H])C([H])(*)C([H])([H])C1([H])[H] 0.000 description 1
- 125000000596 cyclohexenyl group Chemical group C1(=CCCCC1)* 0.000 description 1
- ORQXBVXKBGUSBA-UHFFFAOYSA-N cyclohexyl D-alanine Natural products OC(=O)C(N)CC1CCCCC1 ORQXBVXKBGUSBA-UHFFFAOYSA-N 0.000 description 1
- 125000000113 cyclohexyl group Chemical group [H]C1([H])C([H])([H])C([H])([H])C([H])(*)C([H])([H])C1([H])[H] 0.000 description 1
- 125000002433 cyclopentenyl group Chemical group C1(=CCCC1)* 0.000 description 1
- 125000001511 cyclopentyl group Chemical group [H]C1([H])C([H])([H])C([H])([H])C([H])(*)C1([H])[H] 0.000 description 1
- 125000001559 cyclopropyl group Chemical group [H]C1([H])C([H])([H])C1([H])* 0.000 description 1
- 231100000433 cytotoxic Toxicity 0.000 description 1
- 230000001472 cytotoxic effect Effects 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- YSMODUONRAFBET-UHFFFAOYSA-N delta-DL-hydroxylysine Natural products NCC(O)CCC(N)C(O)=O YSMODUONRAFBET-UHFFFAOYSA-N 0.000 description 1
- 210000004443 dendritic cell Anatomy 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- 150000004985 diamines Chemical class 0.000 description 1
- 235000014113 dietary fatty acids Nutrition 0.000 description 1
- 125000000118 dimethyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 125000002147 dimethylamino group Chemical group [H]C([H])([H])N(*)C([H])([H])[H] 0.000 description 1
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 1
- 229940042399 direct acting antivirals protease inhibitors Drugs 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 238000010494 dissociation reaction Methods 0.000 description 1
- 230000005593 dissociations Effects 0.000 description 1
- 230000007783 downstream signaling Effects 0.000 description 1
- 230000008406 drug-drug interaction Effects 0.000 description 1
- 108700004025 env Genes Proteins 0.000 description 1
- 101150030339 env gene Proteins 0.000 description 1
- 210000003979 eosinophil Anatomy 0.000 description 1
- YSMODUONRAFBET-UHNVWZDZSA-N erythro-5-hydroxy-L-lysine Chemical compound NC[C@H](O)CC[C@H](N)C(O)=O YSMODUONRAFBET-UHNVWZDZSA-N 0.000 description 1
- RTZKZFJDLAIYFH-UHFFFAOYSA-N ether Substances CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 1
- 125000003754 ethoxycarbonyl group Chemical group C(=O)(OCC)* 0.000 description 1
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- 229930195729 fatty acid Natural products 0.000 description 1
- 239000000194 fatty acid Substances 0.000 description 1
- 150000004665 fatty acids Chemical class 0.000 description 1
- 239000012467 final product Substances 0.000 description 1
- RMBPEFMHABBEKP-UHFFFAOYSA-N fluorene Chemical compound C1=CC=C2C3=C[CH]C=CC3=CC2=C1 RMBPEFMHABBEKP-UHFFFAOYSA-N 0.000 description 1
- 125000003709 fluoroalkyl group Chemical group 0.000 description 1
- 210000000285 follicular dendritic cell Anatomy 0.000 description 1
- 125000002485 formyl group Chemical group [H]C(*)=O 0.000 description 1
- 239000012458 free base Substances 0.000 description 1
- 125000002541 furyl group Chemical group 0.000 description 1
- 229940125874 fusion protein inhibitor Drugs 0.000 description 1
- 229960003692 gamma aminobutyric acid Drugs 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 229960002743 glutamine Drugs 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 150000004820 halides Chemical class 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 125000003187 heptyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 210000003630 histaminocyte Anatomy 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 239000000710 homodimer Substances 0.000 description 1
- 150000002431 hydrogen Chemical class 0.000 description 1
- 150000002463 imidates Chemical class 0.000 description 1
- 125000004857 imidazopyridinyl group Chemical group N1C(=NC2=C1C=CC=N2)* 0.000 description 1
- NBZBKCUXIYYUSX-UHFFFAOYSA-N iminodiacetic acid Chemical compound OC(=O)CNCC(O)=O NBZBKCUXIYYUSX-UHFFFAOYSA-N 0.000 description 1
- 230000005934 immune activation Effects 0.000 description 1
- 230000005931 immune cell recruitment Effects 0.000 description 1
- 230000003832 immune regulation Effects 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 230000007813 immunodeficiency Effects 0.000 description 1
- 238000011065 in-situ storage Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- PZOUSPYUWWUPPK-UHFFFAOYSA-N indole Natural products CC1=CC=CC2=C1C=CN2 PZOUSPYUWWUPPK-UHFFFAOYSA-N 0.000 description 1
- RKJUIXBNRJVNHR-UHFFFAOYSA-N indolenine Natural products C1=CC=C2CC=NC2=C1 RKJUIXBNRJVNHR-UHFFFAOYSA-N 0.000 description 1
- QNRXNRGSOJZINA-UHFFFAOYSA-N indoline-2-carboxylic acid Chemical compound C1=CC=C2NC(C(=O)O)CC2=C1 QNRXNRGSOJZINA-UHFFFAOYSA-N 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 230000007794 irritation Effects 0.000 description 1
- 125000000959 isobutyl group Chemical group [H]C([H])([H])C([H])(C([H])([H])[H])C([H])([H])* 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 125000005928 isopropyloxycarbonyl group Chemical group [H]C([H])([H])C([H])(OC(*)=O)C([H])([H])[H] 0.000 description 1
- 125000005956 isoquinolyl group Chemical group 0.000 description 1
- 150000002540 isothiocyanates Chemical class 0.000 description 1
- GCHPUFAZSONQIV-UHFFFAOYSA-N isovaline Chemical compound CCC(C)(N)C(O)=O GCHPUFAZSONQIV-UHFFFAOYSA-N 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 208000017169 kidney disease Diseases 0.000 description 1
- 201000006370 kidney failure Diseases 0.000 description 1
- 230000003907 kidney function Effects 0.000 description 1
- HXEACLLIILLPRG-RXMQYKEDSA-N l-pipecolic acid Natural products OC(=O)[C@H]1CCCCN1 HXEACLLIILLPRG-RXMQYKEDSA-N 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 150000004668 long chain fatty acids Chemical class 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- JZKXXXDKRQWDET-UHFFFAOYSA-N meta-tyrosine Natural products OC(=O)C(N)CC1=CC=CC(O)=C1 JZKXXXDKRQWDET-UHFFFAOYSA-N 0.000 description 1
- YQCIWBXEVYWRCW-UHFFFAOYSA-N methane;sulfane Chemical compound C.S YQCIWBXEVYWRCW-UHFFFAOYSA-N 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 229960004452 methionine Drugs 0.000 description 1
- 125000000956 methoxy group Chemical group [H]C([H])([H])O* 0.000 description 1
- 125000001160 methoxycarbonyl group Chemical group [H]C([H])([H])OC(*)=O 0.000 description 1
- 229920012128 methyl methacrylate acrylonitrile butadiene styrene Polymers 0.000 description 1
- XJODGRWDFZVTKW-ZCFIWIBFSA-N n-methylleucine Chemical compound CN[C@@H](C(O)=O)CC(C)C XJODGRWDFZVTKW-ZCFIWIBFSA-N 0.000 description 1
- 125000001624 naphthyl group Chemical group 0.000 description 1
- 210000000822 natural killer cell Anatomy 0.000 description 1
- 230000003589 nefrotoxic effect Effects 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 231100000381 nephrotoxic Toxicity 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 229940042404 nucleoside and nucleotide reverse transcriptase inhibitor Drugs 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- NIHNNTQXNPWCJQ-UHFFFAOYSA-N o-biphenylenemethane Natural products C1=CC=C2CC3=CC=CC=C3C2=C1 NIHNNTQXNPWCJQ-UHFFFAOYSA-N 0.000 description 1
- CQYBNXGHMBNGCG-RNJXMRFFSA-N octahydroindole-2-carboxylic acid Chemical compound C1CCC[C@H]2N[C@H](C(=O)O)C[C@@H]21 CQYBNXGHMBNGCG-RNJXMRFFSA-N 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 125000001181 organosilyl group Chemical group [SiH3]* 0.000 description 1
- 125000002971 oxazolyl group Chemical group 0.000 description 1
- 150000002924 oxiranes Chemical class 0.000 description 1
- 125000004043 oxo group Chemical group O=* 0.000 description 1
- 125000003232 p-nitrobenzoyl group Chemical group [N+](=O)([O-])C1=CC=C(C(=O)*)C=C1 0.000 description 1
- 238000005192 partition Methods 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 230000035515 penetration Effects 0.000 description 1
- 229960001639 penicillamine Drugs 0.000 description 1
- 125000006340 pentafluoro ethyl group Chemical group FC(F)(F)C(F)(F)* 0.000 description 1
- 230000035699 permeability Effects 0.000 description 1
- 230000003285 pharmacodynamic effect Effects 0.000 description 1
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N phenol group Chemical group C1(=CC=CC=C1)O ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 1
- 125000006678 phenoxycarbonyl group Chemical group 0.000 description 1
- UYWQUFXKFGHYNT-UHFFFAOYSA-N phenylmethyl ester of formic acid Natural products O=COCC1=CC=CC=C1 UYWQUFXKFGHYNT-UHFFFAOYSA-N 0.000 description 1
- 125000003170 phenylsulfonyl group Chemical group C1(=CC=CC=C1)S(=O)(=O)* 0.000 description 1
- 239000002953 phosphate buffered saline Substances 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- LFSXCDWNBUNEEM-UHFFFAOYSA-N phthalazine Chemical compound C1=NN=CC2=CC=CC=C21 LFSXCDWNBUNEEM-UHFFFAOYSA-N 0.000 description 1
- 125000005544 phthalimido group Chemical group 0.000 description 1
- 125000000612 phthaloyl group Chemical group C(C=1C(C(=O)*)=CC=CC1)(=O)* 0.000 description 1
- 238000002962 plaque-reduction assay Methods 0.000 description 1
- 230000036470 plasma concentration Effects 0.000 description 1
- 102000040430 polynucleotide Human genes 0.000 description 1
- 108091033319 polynucleotide Proteins 0.000 description 1
- 239000002157 polynucleotide Substances 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 125000001501 propionyl group Chemical group O=C([*])C([H])([H])C([H])([H])[H] 0.000 description 1
- 125000006239 protecting group Chemical group 0.000 description 1
- 230000005180 public health Effects 0.000 description 1
- 125000003226 pyrazolyl group Chemical group 0.000 description 1
- 125000002098 pyridazinyl group Chemical group 0.000 description 1
- 230000006010 pyroptosis Effects 0.000 description 1
- 125000000719 pyrrolidinyl group Chemical group 0.000 description 1
- ADRDEXBBJTUCND-UHFFFAOYSA-N pyrrolizidine Chemical compound C1CCN2CCCC21 ADRDEXBBJTUCND-UHFFFAOYSA-N 0.000 description 1
- 125000000168 pyrrolyl group Chemical group 0.000 description 1
- JWVCLYRUEFBMGU-UHFFFAOYSA-N quinazoline Chemical compound N1=CN=CC2=CC=CC=C21 JWVCLYRUEFBMGU-UHFFFAOYSA-N 0.000 description 1
- 125000005493 quinolyl group Chemical group 0.000 description 1
- 238000009790 rate-determining step (RDS) Methods 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 108091006082 receptor inhibitors Proteins 0.000 description 1
- GHMLBKRAJCXXBS-UHFFFAOYSA-N resorcinol Chemical compound OC1=CC=CC(O)=C1 GHMLBKRAJCXXBS-UHFFFAOYSA-N 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 238000007142 ring opening reaction Methods 0.000 description 1
- AHTFMWCHTGEJHA-UHFFFAOYSA-N s-(2,5-dioxooxolan-3-yl) ethanethioate Chemical compound CC(=O)SC1CC(=O)OC1=O AHTFMWCHTGEJHA-UHFFFAOYSA-N 0.000 description 1
- 229940043230 sarcosine Drugs 0.000 description 1
- 125000002914 sec-butyl group Chemical group [H]C([H])([H])C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 229940055619 selenocysteine Drugs 0.000 description 1
- 235000016491 selenocysteine Nutrition 0.000 description 1
- ZKZBPNGNEQAJSX-UHFFFAOYSA-N selenocysteine Natural products [SeH]CC(N)C(O)=O ZKZBPNGNEQAJSX-UHFFFAOYSA-N 0.000 description 1
- 210000000582 semen Anatomy 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 239000002911 sialidase inhibitor Substances 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 125000003808 silyl group Chemical group [H][Si]([H])([H])[*] 0.000 description 1
- 230000007958 sleep Effects 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 102000009076 src-Family Kinases Human genes 0.000 description 1
- 108010087686 src-Family Kinases Proteins 0.000 description 1
- 125000005017 substituted alkenyl group Chemical group 0.000 description 1
- 125000004426 substituted alkynyl group Chemical group 0.000 description 1
- 125000003107 substituted aryl group Chemical group 0.000 description 1
- 125000005346 substituted cycloalkyl group Chemical group 0.000 description 1
- 125000005717 substituted cycloalkylene group Chemical group 0.000 description 1
- 229940124530 sulfonamide Drugs 0.000 description 1
- 150000003456 sulfonamides Chemical class 0.000 description 1
- YBBRCQOCSYXUOC-UHFFFAOYSA-N sulfuryl dichloride Chemical compound ClS(Cl)(=O)=O YBBRCQOCSYXUOC-UHFFFAOYSA-N 0.000 description 1
- 238000002198 surface plasmon resonance spectroscopy Methods 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 208000011580 syndromic disease Diseases 0.000 description 1
- 230000001839 systemic circulation Effects 0.000 description 1
- 125000000999 tert-butyl group Chemical group [H]C([H])([H])C(*)(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- RAOIDOHSFRTOEL-UHFFFAOYSA-N tetrahydrothiophene Chemical compound C1CCSC1 RAOIDOHSFRTOEL-UHFFFAOYSA-N 0.000 description 1
- 150000004655 tetrazenes Chemical class 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 125000001544 thienyl group Chemical group 0.000 description 1
- DUYAAUVXQSMXQP-UHFFFAOYSA-M thioacetate Chemical compound CC([S-])=O DUYAAUVXQSMXQP-UHFFFAOYSA-M 0.000 description 1
- CNHYKKNIIGEXAY-UHFFFAOYSA-N thiolan-2-imine Chemical compound N=C1CCCS1 CNHYKKNIIGEXAY-UHFFFAOYSA-N 0.000 description 1
- JOKIQGQOKXGHDV-UHFFFAOYSA-N thiomorpholine-3-carboxylic acid Chemical compound [O-]C(=O)C1CSCC[NH2+]1 JOKIQGQOKXGHDV-UHFFFAOYSA-N 0.000 description 1
- 229930192474 thiophene Natural products 0.000 description 1
- 229960002898 threonine Drugs 0.000 description 1
- FGMPLJWBKKVCDB-UHFFFAOYSA-N trans-L-hydroxy-proline Natural products ON1CCCC1C(O)=O FGMPLJWBKKVCDB-UHFFFAOYSA-N 0.000 description 1
- 238000005809 transesterification reaction Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 150000003852 triazoles Chemical class 0.000 description 1
- 125000001425 triazolyl group Chemical group 0.000 description 1
- 125000004044 trifluoroacetyl group Chemical group FC(C(=O)*)(F)F 0.000 description 1
- 125000000026 trimethylsilyl group Chemical group [H]C([H])([H])[Si]([*])(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 125000002221 trityl group Chemical group [H]C1=C([H])C([H])=C([H])C([H])=C1C([*])(C1=C(C(=C(C(=C1[H])[H])[H])[H])[H])C1=C([H])C([H])=C([H])C([H])=C1[H] 0.000 description 1
- 229960004799 tryptophan Drugs 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- PXXNTAGJWPJAGM-UHFFFAOYSA-N vertaline Natural products C1C2C=3C=C(OC)C(OC)=CC=3OC(C=C3)=CC=C3CCC(=O)OC1CC1N2CCCC1 PXXNTAGJWPJAGM-UHFFFAOYSA-N 0.000 description 1
- 238000002424 x-ray crystallography Methods 0.000 description 1
- ORQXBVXKBGUSBA-QMMMGPOBSA-N β-cyclohexyl-alanine Chemical compound OC(=O)[C@@H](N)CC1CCCCC1 ORQXBVXKBGUSBA-QMMMGPOBSA-N 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K47/00—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
- A61K47/50—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates
- A61K47/51—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent
- A61K47/68—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an antibody, an immunoglobulin or a fragment thereof, e.g. an Fc-fragment
- A61K47/6835—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an antibody, an immunoglobulin or a fragment thereof, e.g. an Fc-fragment the modifying agent being an antibody or an immunoglobulin bearing at least one antigen-binding site
- A61K47/6849—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an antibody, an immunoglobulin or a fragment thereof, e.g. an Fc-fragment the modifying agent being an antibody or an immunoglobulin bearing at least one antigen-binding site the antibody targeting a receptor, a cell surface antigen or a cell surface determinant
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K47/00—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
- A61K47/50—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates
- A61K47/51—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent
- A61K47/68—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an antibody, an immunoglobulin or a fragment thereof, e.g. an Fc-fragment
- A61K47/6801—Drug-antibody or immunoglobulin conjugates defined by the pharmacologically or therapeutically active agent
- A61K47/6803—Drugs conjugated to an antibody or immunoglobulin, e.g. cisplatin-antibody conjugates
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/14—Antivirals for RNA viruses
- A61P31/18—Antivirals for RNA viruses for HIV
Definitions
- HIV antiviral inhibitors come in many several classes targeting distinct steps of the HIV cycle.
- One class of antivirals nucleoside reverse transcriptase inhibitors (NRTIs) inhibit viral replication by chain termination after being incorporated into growing DNA strands by HIV reverse transcriptase.
- NRTIs nucleoside reverse transcriptase inhibitors
- NRTIs non-nucleoside reverse transcription inhibitors
- a different class of antivirals, integrase inhibitors inhibit viral DNA insertion into the host cellular genome.
- Protease inhibitors are agents that inhibit the protease enzyme, a key enzyme in the assembly of new virus particles.
- One class of antivirals, known as viral entry inhibitors contains agents that interfere in viral entry into the cell by binding to HIV envelope (Env) glycoprotein. In particular, viral entry inhibitors target the surface subunit gp120 receptor of the HIV virus.
- the disclosure relates to conjugates, compositions, and methods for inhibiting viral growth, methods for the treatment of viral infections, and methods of synthesizing conjugates.
- such conjugates contain monomers or dimers of a moiety that inhibits human immunodeficiency virus, for example by binding to the gp120 glycoprotein (e.g., a gp120 binder such as temsavir, BMS-818251, DMJ- II-121, BNM-IV-147, or analogs thereof), conjugated to Fc monomers, Fc domains, Fc-binding peptides, albumin proteins, or albumin protein-binding peptides.
- gp120 glycoprotein e.g., a gp120 binder such as temsavir, BMS-818251, DMJ- II-121, BNM-IV-147, or analogs thereof
- the HIV targeting moiety e.g., temsavir, BMS-818251, DMJ-II-121, BNM-IV-147, or analogs thereof
- the HIV targeting moiety targets a protein encoded by the HIV Env gene, in particular gp120 glycoprotein on the surface of the viral particle, thereby preventing viral attachment to the host CD4+ T cell and entry into the host immune cell.
- the Fc monomers or Fc domains in the conjugates bind to Fc ⁇ Rs (e.g., FcRn, Fc ⁇ RI, Fc ⁇ RIIa, Fc ⁇ RIIc, Fc ⁇ RIIIa, and Fc ⁇ RIIIb) on immune cells, e.g., neutrophils, to activate phagocytosis and effector functions, such as antibody-dependent cell-mediated cytotoxicity (ADCC), thus leading to the engulfment and destruction of viral particles by immune cells and further enhancing the antiviral activity of the conjugates.
- Fc ⁇ Rs e.g., FcRn, Fc ⁇ RI, Fc ⁇ RIIa, Fc ⁇ RIIc, Fc ⁇ RIIIa, and Fc ⁇ RIIIb
- immune cells e.g., neutrophils
- ADCC antibody-dependent cell-mediated cytotoxicity
- the albumin or albumin-binding peptide may extend the half-life of the conjugate, for example, by binding of albumin to the recycling neon
- compositions are useful in methods for the inhibition of viral growth and in methods for the treatment of viral infections, such as those caused by an HIV-1 and HIV-2.
- the disclosure features a conjugate described by any one of formulas (D-I), (M-I), (1), or (2): wherein each A 1 and each A 2 is independently described by formula (A-I) or (A-II): wherein Q is selected from the group consisting of: S is selected from the group consisting of: ; R 1 , R 2 , R 3 , are each independently selected from H, OH, halogen, nitrile, nitro, optionally substituted amine, optionally substituted sulfhydryl, optionally substituted carboxyl, optionally substituted C 1 -C 20 alkyl, optionally substituted C 3 -C 20 cycloalkyl, optionally substituted C 2 -C 20 alkenyl, optionally substituted C 3 -C 20 cycloalkenyl, optionally substituted C 2 -C
- each R 8 is independently selected from H, optionally substituted C 1 -C 20 alkyl, optionally substituted C 1 -C 20 alkylene, optionally substituted C 3 -C 20 cycloalkyl, optionally substituted C 2 -C 20 heterocycloalkyl, optionally substituted C 5 -C 15 aryl, and optionally substituted C 2 -C 15 heteroaryl; each R 9 is independently selected from optionally substituted C 1 -C 20 alkylene, optionally substituted C 3 -C 20 cycloalkyl, optionally substituted C 2 -C 20 heterocycloalkyl, optionally substituted C 5 -C 15 aryl, and optionally substituted C 2 -C 15 heteroaryl; x is 1 or 2; k is 0, 1, 2, 3, 4, or 5; Ar is selected from the group consisting of optionally substituted C 3 -C 20 cycloalkyl, optionally substituted C 2 -C 20 heterocycloalkyl, optionally substituted C 5 -C
- each A 1 -L or each A 1 -L-A 2 may be independently selected (e.g., independently selected from any of the A 1 -L or A 1 -L-A 2 structures described herein).
- n is 1 and each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide.
- n is 2 and each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153), wherein the Fc domain monomers dimerize to form and Fc domain.
- Fc domain monomer e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153
- the Fc domain monomers dimerize to form and Fc domain.
- x is 2.
- x is 2.
- each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153).
- each A 1 and each A 2 is independently described by any one of formulas (A-Ia)-(A-Ih): wherein each X is independently C or N; or a pharmaceutically acceptable salt thereof.
- each A 1 and each A 2 is independently described by any one of formulas (A-Ia-i)-(A-Ih-i): or a pharmaceutically acceptable salt thereof.
- each A 1 and each A 2 is independently described by any one of formulas (A-Ia-ii)-(A-Ih-ii):
- each A 1 and each A 2 is independently described by any one of formulas (A-Ic-i) and (A-Ic-ii): or a pharmaceutically acceptable salt thereof.
- each A 1 and each A 2 is independently described by any one of formulas (A-Ii)-(A-Ip): wherein each X is independently C or N; or a pharmaceutically acceptable salt thereof.
- each A 1 and each A 2 is independently described by any one of formulas (A-Iq)-(A-Ix): or a pharmaceutically acceptable salt thereof.
- each A 1 and each A 2 is independently described by any one of formulas (A-Iq-i)-(A-Ix-i): or a pharmaceutically acceptable salt thereof.
- each A 1 and each A 2 is independently described by any one of formulas (A-Iaa)-(A-Ihh): or a pharmaceutically acceptable salt thereof.
- each A 1 and each A 2 is independently described by any one of formulas (A-Iii)-(A-Ipp): wherein each X is independently C or N; or a pharmaceutically acceptable salt thereof.
- each A 1 and each A 2 is independently described by any one of formulas (A-Iii-i)-(A-Ipp-i): or a pharmaceutically acceptable salt thereof.
- R 1 is H.
- R 2 is H.
- R 2 is -OCH 3 .
- R 3 is H.
- R 4 is H.
- R 5 is H.
- R 7 is a carbonyl.
- X is N.
- X is C.
- each A 1 and each A 2 is independently described by any one of formulas (A-IIa)-(A-IId):
- each A 1 and each A 2 is independently described by any one of formulas (A-IIa-i)-(A-IId-i): wherein U5 is C 1 -C 10 alkyl; or a pharmaceutically acceptable salt thereof.
- each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153);
- L in each A 1 -L-A 2 is a linker covalently attached to a sulfur atom of a cysteine or a nitrogen atom of a lysine in E and to each of A 1 and A 2 ;
- n is 1 or 2 (e.g., when n is 2, the two Fc domain monomers dimerize to form and Fc domain);
- T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), and the squiggly line connected to the E indicates that each A 1 -L-A 2 is covalently attached (e.g., by way of a covalent bond or linker) to a sulfur atom of a hinge cysteine in E, or
- each A 1 -L-A 2 may be independently selected (e.g., independently selected from any of the A 1 -L-A 2 structures described herein).
- each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153);
- L in each L-A 1 is a linker covalently attached to a sulfur atom of a cysteine or a nitrogen atom of a lysine in E and to A 1 ;
- n is 1 or 2 (e.g., when n is 2, the two Fc domain monomers dimerize to form and Fc domain);
- T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20); and the squiggly line connected to E indicates that each L-A 1 is covalently attached (e.g., by way of a covalent bond or linker) to the sulfur atom of the hinge cysteine in E, or a pharmaceutically acceptable salt thereof.
- each A 1 may be independently selected from any structure described by formula (A-I).
- each E includes an Fc domain monomer.
- each E includes an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153.
- the conjugate forms a homodimer including an Fc domain.
- E homodimerizes with another E to form an Fc domain.
- each E includes an albumin protein having the sequence of any one of SEQ ID NOs: 96-98.
- T is 1 and L-A 1 is covalently attached to the sulfur atom corresponding to Cys34 of SEQ ID NO: 96.
- the disclosure features an intermediate (Int) of Table 1. These intermediates comprise one or more gp120 binders and a linker (e.g., a PEG 2 -PEG 20 linker) and may be used in the synthesis of a conjugate described herein.
- Intermediates of Table 1 may be conjugated to, for example, an Fc domain or Fc domain monomer, albumin protein, albumin protein-binding peptide, or Fc-binding peptide (e.g., by way of a linker) by any suitable methods known to those of skill in the art, including any of the methods described or exemplified herein.
- the conjugate (e.g., a conjugate described by any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) includes E, wherein E is an Fc domain monomer or an Fc domain (e.g., an Fc domain monomer or an Fc domain, each Fc domain monomer having, independently, the sequence of any one of SEQ ID NOs: 1-95 and 125-153).
- E is an Fc domain monomer or an Fc domain (e.g., an Fc domain monomer or an Fc domain, each Fc domain monomer having, independently, the sequence of any one of SEQ ID NOs: 1-95 and 125-153).
- one or more nitrogen atoms of one or more surface exposed lysine residues of E or one or more sulfur atoms of one or more surface exposed cysteines in E is covalently conjugated to a linker (e.g., a PEG 2 -PEG 20 linker).
- the linker conjugated to E may be functionalized such that it may react to form a covalent bond with any of the Ints described herein (e.g., an Int of Table 1).
- E is conjugated to a linker functionalized with an azido group and the Int (e.g., an Int of Table 1) is functionalized with an alkyne group.
- Conjugation (e.g., by click chemistry) of the linker-azido of E and linker-alkyne of the Int forms a conjugate of the invention, for example a conjugate described by any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII).
- E is conjugated to a linker functionalized with an alkyne group and the Int (e.g., an Int of Table 1) is functionalized with an azido group.
- Conjugation e.g., by click chemistry
- the linker-alkyne of E and the linker-azido of the Int forms a conjugate of the invention, for example a conjugate described by any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII).
- the Int e.g., an Int of Table 1
- a phenyl ester group e.g., a trifluorophenyl ester group or a tetrafluorophenyl ester group.
- Conjugation e.g., by acylation
- linker-phenyl ester e.g., trifluorophenyl ester or tetrafluorophenyl ester
- the linker-phenyl ester e.g., trifluorophenyl ester or tetrafluorophenyl ester
- M-I M-XVII
- the disclosure further features a composition (e.g., a pre-conjugation intermediate) having the structure of any one of Int-1 to Int-190 (e.g., Int-1 to Int-93, Int-1 to Int-27, Int-28 to Int-93, Int-94 to Int- 141, Int-142 to Int-190 or Int-28 to Int-190).
- a composition e.g., a pre-conjugation intermediate
- the disclosure features a composition (e.g., a pre-conjugation intermediate) having the structure of any one of Int-28 to Int-190 (e.g., Int-28 to Int-93, Int-94 to Int-141, Int-142 to Int-190 or Int-28 to Int-190).
- the disclosure provides a method of making an antiviral-Fc conjugate by conjugating (e.g., via a linker) any one of Int-1 to Int-190 (e.g., Int-1 to Int-93, Int-1 to Int-27, Int-28 to Int-93, Int-94 to Int-141, Int- 142 to Int-190, or Int-28 to Int-190) to an Fc domain monomer or an Fc domain.
- Int-1 to Int-190 e.g., Int-1 to Int-93, Int-1 to Int-27, Int-28 to Int-93, Int-94 to Int-141, Int- 142 to Int-190, or Int-28 to Int-190
- the disclosure features a conjugate, wherein the conjugate comprises a small molecule targeting agent, wherein the targeting agent is described by any one of Int-1 to Int-190 (e.g., Int-1 to Int-93, Int-1 to Int-27, Int-28 to Int-93, Int-94 to Int-141, Int-142 to Int-190, or Int-28 to Int-190), which is conjugated to an Fc (e.g., via a linker).
- Table 1 Intermediates
- the disclosure features a conjugate of Table 2.
- Each conjugate of Table 2 corresponds to a conjugate of either formula (M-I) or formula (D-I), as indicated.
- Conjugates of Table 2 include conjugates formed by the covalent reaction of an Int of Table 1 with a linker which is in turn conjugated to E (e.g., an Fc domain monomer, an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide).
- E e.g., an Fc domain monomer, an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide.
- the reactive moiety of the Int (e.g., the alkyne or azido group) reacts with a corresponding reactive group (e.g., an alkyne or azido) of a linker (represented by L’) covalently attached to E, such that an Int of Table 1 is covalently attached to E.
- a corresponding reactive group e.g., an alkyne or azido
- L linker
- the reactive moiety of the Int (e.g., the phenyl ester group, e.g., tetrafluorophenyl ester or trifluorophenyl ester group) reacts with a corresponding reactive group (e.g., a nitrogen or sulfur atom) of an amino acid side chain of E, such that an Int of Table 1 is covalently attached to E.
- a corresponding reactive group e.g., a nitrogen or sulfur atom
- L’ corresponds to the remainder of L as defined in (M-I) or (D-I) (e.g., L’ is a linker that covalently joins the Int and E).
- L’ may include a triazole (formed by the click chemistry reaction between the Int and a linker conjugated to E) and a linker (e.g., a PEG 2 -PEG 20 linker) which in turn is conjugated to an amino acid side chain of E.
- n is 1 or 2.
- each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide.
- each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153), and the Fc domain monomers dimerize to form and Fc domain.
- T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20).
- the disclosure also provides a population of any of the conjugates of Table 2 wherein the average value of T is 1 to 20 (e.g., the average value of T is 1 to 2, 1 to 3, 1 to 4, 1 to 5, 5 to 10, 10 to 15, or 15 to 20).
- the average value of T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments, the average T is 1 to 10 (e.g., 1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5, 8, 8.5, 9, 9.5, or 10).
- the average T is 1 to 5 (e.g., 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, or 5).
- the average T is 5 to 10 (e.g., 5, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10).
- the average T is 2.5 to 7.5 (e.g., 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7, 7.1, 7.2, 7.3, 7.4, or 7.5).
- the squiggly line in the conjugates of Table 2 indicates that each L’-Int is covalently attached to an amino acid side chain in E (e.g., the nitrogen atom of a surface exposed lysine or the sulfur atom of a surface exposed cysteine in E), or a pharmaceutically acceptable salt thereof.
- E amino acid side chain
- the disclosure further features a conjugate of Table 2, wherein the conjugate is produced by conjugation (e.g., via a linker) of any one of Int-1 to Int-190 to an Fc domain or an Fc domain monomer.
- Table 2 Conjugates Corresponding to Intermediates of Table 1
- each E includes an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153.
- the disclosure features a conjugate including (i) a first moiety, A 1 ; (ii) a second moiety, A 2 ; (iii) an Fc domain monomer or an Fc domain; and (iv) a linker covalently attached to A 1 and A 2 , and to the Fc domain monomer or the Fc domain; wherein each A 1 and each A 2 is independently selected from any structure described by formula (A-I) or (A-II).
- x is 2.
- the disclosure features a conjugate including (i) a first moiety, Int; (ii) an Fc domain monomer or an Fc domain; and (iv) a linker covalently attached to Int, and to the Fc domain monomer or the Fc domain; wherein each Int is independently selected from any one of the intermediates of Table 1.
- the disclosure features a conjugate including (i) a first moiety, A 1 ; (ii) a second moiety, A 2 ; (iii) an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide; and (iv) a linker covalently attached to A 1 and A 2 , and to the Fc domain monomer or the Fc domain; wherein each A 1 and each A 2 is independently selected from any structure described by formula (A-I) or (A-II). In a preferred embodiment of the above, x is 2.
- each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide;
- n is 1 or 2;
- T is an integer from 1 to 20 (e.g., T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20) (e.g., T is the total number of A 1 -L-A 2 moieties conjugated to (E)n); and L is a linker covalently attached to each of E, A 1 , and A 2 , or a pharmaceutically acceptable salt thereof.
- each A 1 -L-A 2 may be independently selected (e.g., independently selected from any of the A 1 -L-A 2 structures described herein).
- x is 2.
- the conjugate is described by formulas (e.g., (D-I)-(D-XVII)) found in WO 2020/252393.
- the conjugate is described by formula (D-IV-7): or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (D-IV-8): or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (D-IV-9): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (D-V-7): or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (D-V-8): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (D-V-9): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (D-VI-8): wherein L’ is the remainder of L, or a pharmaceutically acceptable salt thereof.
- L or L’ includes one or more optionally substituted C 1 -C 20 alkylene, optionally substituted C 1 -C 20 heteroalkylene, optionally substituted C 2 -C 20 alkenylene, optionally substituted C 2 -C 20 heteroalkenylene, optionally substituted C 2 -C 20 alkynylene, optionally substituted C 2 -C 20 heteroalkynylene, optionally substituted C 3 -C 20 cycloalkylene, optionally substituted C 2 -C 20 heterocycloalkylene, optionally substituted C 4 -C 20 cycloalkenylene, optionally substituted C 4 -C 20 heterocycloalkenylene, optionally substituted C 8 -C 20 cycloalkynylene, optionally substituted C 8 -C 20 heterocycloalkynylene, optionally substituted C 5 -C 15 arylene, optionally substituted C 3 -C 15 heteroarylene, O, S,
- the backbone of L or L’ consists of one or more optionally substituted C 1 -C 20 alkylene, optionally substituted C 1 -C 20 heteroalkylene, optionally substituted C 2 -C 20 alkenylene, optionally substituted C 2 -C 20 heteroalkenylene, optionally substituted C 2 - C 20 alkynylene, optionally substituted C 2 -C 20 heteroalkynylene, optionally substituted C 3 -C 20 cycloalkylene, optionally substituted C 2 -C 20 heterocycloalkylene, optionally substituted C 4 -C 20 cycloalkenylene, optionally substituted C 4 -C 20 heterocycloalkenylene, optionally substituted C 8 -C 20 cycloalkynylene, optionally substituted C 8 -C 20 heterocycloalkynylene, optionally substituted C 5 -C 15 arylene, optionally substituted C 3 -C 15 heteroarylene
- L or L’ is oxo substituted.
- the backbone of L or L’ includes no more than 250 atoms.
- the backbone of L or L’ includes between 1 and 250 atoms (e.g., between 5 and 250 atoms, between 10 and 250 atoms, between 50 and 250 atoms, between 10 and 200 atoms, between 10 and 100 atoms, between 10 and 50 atoms, between 5 and 100 atoms, between 5 and 50 atoms, between 5 and 30 atoms).
- L or L’ is capable of forming an amide, a carbamate, a sulfonyl, or a urea linkage.
- L or L’ is a bond. In some embodiments, L or L’ is an atom. In some embodiments of any of the aspects described herein, each L is described by formula (D- L-I): wherein L A is described by formula G A1 -(Z A1 )g1-(Y A1 )h1-(Z A2 )i1-(Y A2 )j1-(Z A3 )k1-(Y A3 )l1-(Z A4 )m1-(Y A4 )n1-(Z A5 )o1- G A2 ; L B is described by formula G B1 -(Z B1 )g2-(Y B1 )h2-(Z B2 )i2-(Y B2 )j2-(Z B3 )k2-(Y B3 )l2-(Z B4 )m2-(Y B4 )n2-(Z B5 )o2-G B2 ; L C is described by formula G C1 -(Z C
- optionally substituted includes substitution with a polyethylene glycol (PEG).
- PEG polyethylene glycol
- a PEG has a repeating unit structure (-CH 2 CH 2 O-)n, wherein n is an integer from 2 to 100.
- a polyethylene glycol may be selected any one of PEG 2 to PEG 100 (e.g., PEG 2 , PEG 3 , PEG 4 , PEG 5 , PEG 5 -PEG 10 , PEG 10 -PEG 20 , PEG 20 - PEG 30 , PEG 30 -PEG 40 , PEG 50 -PEG 60 , PEG 60 -PEG 70 , PEG 70 -PEG 80 , PEG 80 - PEG 90 , PEG 90 -PEG 100 ).
- L C may have two points of attachment to the Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide (e.g., two G C2 ).
- L includes a polyethylene glycol (PEG) linker.
- a PEG linker includes a linker having the repeating unit structure (-CH 2 CH 2 O-)n, wherein n is an integer from 2 to 100.
- a polyethylene glycol linker may covalently join a gp120 binder and E (e.g., in a conjugate of any one of formulas (M-I)-(M-XVII)).
- a polyethylene glycol linker may covalently join a first gp120 binder and a second gp120 binder (e.g., in a conjugate of any one of formulas (D-I)-(D-XVII)).
- a polyethylene glycol linker may covalently join a gp120 binder dimer and E (e.g., in a conjugate of any one of formulas (D-I)-(D-XVII)).
- a polyethylene glycol linker may be selected from any one of PEG 2 to PEG 100 (e.g., PEG 2 , PEG 3 , PEG 4 , PEG 5 , PEG 5 -PEG 10 , PEG 10 -PEG 20 , PEG 20 -PEG 30 , PEG 30 -PEG 40 , PEG 50 -PEG 60 , PEG 60 -PEG 70 , PEG 70 -PEG 80 , PEG 80 -PEG 90 , PEG 90 -PEG 100 ).
- L c includes a PEG linker, where L C is covalently attached to each of Q i and E.
- each A 1 is independently described by formula (A-I); each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide; n is 1 or 2; T is an integer from 1 to 20 (e.g., T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20) (e.g., T is the total number of A 1 -L moieties conjugated to (E)n); and L is a linker covalently attached to each of E and A 1 , or a pharmaceutically acceptable salt thereof.
- Fc domain monomer e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153
- each A 1 may be independently selected from any structure described by formula (A-I).
- x is 2.
- the conjugate is described by formula (M-II): wherein X is C, O, or N, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-III): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-III-1): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-III-2): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-III-3): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-III-4): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-III-5): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-III-6): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-IV): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-IV-1): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-IV-2): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-IV-3): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-IV-4): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-IV-5): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-IV-6): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-IV-7): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-IV-8): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-IV-9): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-V): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-V-1): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-V-2): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-V-3): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-V-4): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-V-5): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-V-6): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-V-7): or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-V-8): or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-V-9): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-VI): or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-VI-1): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-VI-2): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-VI-3): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-VI-4): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-VI-5): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-VI-6): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-VI-7): wherein L’ is the remainder of L, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-VI-8): wherein L’ is the remainder of L, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-VII): wherein X is C, O, or N, or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-VIII): or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-VIII-1): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-IX): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-IX-1): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-X): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-X-1): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XI): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XI-1): wherein L’ is the remainder of L, and y 1 is an integer from 1-20 (e.g., y 1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XII): wherein X is C, O, or N, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XII-1): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XII-2): or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XIII): or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XIII-1): or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XIII-2): or a pharmaceutically acceptable salt thereof.
- the disclosure features a conjugate described by formula (M-I): wherein each A 1 is independently described by formula (A-II); each E comprises an Fc domain monomer; the squiggly line connected to the E indicates that each A 1 -L-A 2 is covalently attached to E, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XIV): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XIV-1): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XIV-2): wherein L’ is the remainder of L and y 1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XIV-3): wherein L’ is the remainder of L; e 1 is an integer from 1-10; and y 1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XIV-4): wherein L’ is the remainder of L; e 1 and e 3 are each independently an integer from 1-10; and y 1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XIV-5): wherein L’ is the remainder of L; e 1 and e 3 are each independently an integer from 1-10; and y 1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XV): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XV-1): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XV-2): wherein L’ is the remainder of L and y 1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XV-3): wherein L’ is the remainder of L; e 1 is an integer from 1-10; and y 1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XV-4): wherein L’ is the remainder of L; e 1 and e 3 are each independently an integer from 1-10; and y 1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XV-5): wherein L’ is the remainder of L; e 1 and e 3 are each independently an integer from 1-10; and y 1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XVI): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XVI-1): wherein U5 is C 1 -C 10 alkyl, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XVI-2): wherein L’ is the remainder of L and y 1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XVI-3): wherein L’ is the remainder of L; e 1 is an integer from 1-10; and y 1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XVI-4): wherein L’ is the remainder of L; e 1 and e 3 are each independently an integer from 1-10; y 1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XVI-5): wherein L’ is the remainder of L; e 1 and e 3 are each independently an integer from 1-10; y 1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XVII): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XVII-1): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XVII-2): wherein L’ is the remainder of L and y 1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XVII-3): wherein L’ is the remainder of L; e 1 is an integer from 1-10; and y 1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XVII-4): wherein L’ is the remainder of L; e 1 and e 3 are each independently an integer from 1-10; and y 1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-XVII-5): wherein L’ is the remainder of L; e 1 and e 3 are each independently an integer from 1-10; and y 1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
- L or L’ includes one or more optionally substituted C 1 -C 20 alkylene, optionally substituted C 1 -C 20 heteroalkylene, optionally substituted C 2 -C 20 alkenylene, optionally substituted C 2 -C 20 heteroalkenylene, optionally substituted C 2 -C 20 alkynylene, optionally substituted C 2 -C 20 heteroalkynylene, optionally substituted C 3 -C 20 cycloalkylene, optionally substituted C 2 -C 20 heterocycloalkylene, optionally substituted C 4 -C 20 cycloalkenylene, optionally substituted C 4 -C 20 heterocycloalkenylene, optionally substituted C 8 -C 20 cycloalkynylene, optionally substituted C 8 -C 20 heterocycloalkynylene, optionally substituted C 5 -C 15 arylene, optionally substituted C 3 -C 15 heteroarylene, O, S,
- the backbone of L or L’ consists of one or more optionally substituted C 1 -C 20 alkylene, optionally substituted C 1 -C 20 heteroalkylene, optionally substituted C 2 -C 20 alkenylene, optionally substituted C 2 -C 20 heteroalkenylene, optionally substituted C 2 - C 20 alkynylene, optionally substituted C 2 -C 20 heteroalkynylene, optionally substituted C 3 -C 20 cycloalkylene, optionally substituted C 2 -C 20 heterocycloalkylene, optionally substituted C 4 -C 20 cycloalkenylene, optionally substituted C 4 -C 20 heterocycloalkenylene, optionally substituted C 8 -C 20 cycloalkynylene, optionally substituted C 8 -C 20 heterocycloalkynylene, optionally substituted C 5 -C 15 arylene, optionally substituted C 3 -C 15 heteroarylene
- L or L’ is oxo substituted.
- the backbone of L or L’ includes no more than 250 atoms.
- the backbone of L or L’ includes between 1 and 250 atoms (e.g., between 5 and 250 atoms, between 10 and 250 atoms, between 50 and 250 atoms, between 10 and 200 atoms, between 10 and 100 atoms, between 10 and 50 atoms, between 5 and 100 atoms, between 5 and 50 atoms, between 5 and 30 atoms).
- L or L’ is capable of forming an amide, a carbamate, a sulfonyl, or a urea linkage.
- L or L’ is a bond. In some embodiments, L or L’ is an atom. In some embodiments, L’ is a nitrogen atom. In some embodiments, each L is described by formula (M-L): J 1 -(Q 1 )g-(T 1 )h-(Q 2 )i-(T 2 )j-(Q 3 )k-(T 3 )l-(Q 4 )m-(T 4 )n-(Q 5 )o-J 2 wherein J 1 is a bond attached to A 1 ; J 2 is a bond attached to E or a functional group capable of reacting with a functional group conjugated to E (e.g., maleimide and cysteine, amine and activated carboxylic acid (e.g., carboxylic acid activated by tetrafluorophenol or trifluorophenol), thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and alkyn
- each of g, h, i, j, k, l, m, n, and o is, independently, 0 or 1.
- each L is described by formula (M-L-I): J 1 -(Q 1 )g-(T 1 )h-(Q 2 )i-(T 2 )j-(Q 3 )k-(T 3 )l-(Q 4 )m-(T 4 )n-(Q 5 )o-J 2 wherein: J 1 is a bond attached to A 1 ; J 2 is a bond attached to E or a functional group capable of reacting with a functional group conjugated to E (e.g., maleimide and cysteine, amine and activated carboxylic acid (e.g., carboxylic acid activated by tetrafluorophenol or trifluorophenol), thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and
- optionally substituted includes substitution with a polyethylene glycol (PEG).
- PEG polyethylene glycol
- a PEG has a repeating unit structure (-CH 2 CH 2 O-)n, wherein n is an integer from 2 to 100.
- a polyethylene glycol may be selected from any one of PEG 2 to PEG 100 (e.g., PEG 2 , PEG 3 , PEG 4 , PEG 5 , PEG 5 -PEG 10 , PEG 10 -PEG 20 , PEG 20 -PEG 30 , PEG 30 -PEG 40 , PEG 50 -PEG 60 , PEG 60 -PEG 70 , PEG 70 -PEG 80 , PEG 80 -PEG 90 , PEG 90 -PEG 100 ).
- J 2 may have two points of attachment to the Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide (e.g., two J 2 ).
- L is , wherein d is an integer from 1 to 20 (e.g., d is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20). In some embodiments, L is , , , ,
- g is 0 or 1; h is 0; i is 0 or 1; j is 0 or 1; k is 0 or 1; l is 0; m is 0 or 1; n is 0; and o is 0 or 1.
- g is 1; h is 0; i is 1; j is 0; k is 1; l is 0; m is 1; n is 0; and o is 1.
- g is 0; h is 0; i is 1; j is 0; k is 1; l is 0; m is 1; n is 0; and o is 1.
- g is 1; h is 0; i is 0; j is 0; k is 1; l is 0; m is 1; n is 0; and o is 1.
- g is 0; h is 0; i is 1; j is 1; k is 1; l is 0; m is 1; n is 0; and o is 1.
- L is a linker described by formula (M-L-Ia): J 1 -Q 1 -Q 2 -Q 3 -Q 4 -Q 5 -J 2 .
- L is a linker described by formula (M-L-Ib): J 1 -Q 2 -Q 3 -Q 4 -Q 5 -J 2 .
- L is a linker described by formula (M-L-Ic): J 1 -Q 1 -Q 3 -Q 4 -Q 5 -J 2 .
- L is a linker described by formula (M-L-Id): J 1 -Q 2 -T 2 -Q 3 -Q 4 -Q 5 -J 2 .
- Q 1 is optionally substituted C 1 -C 20 alkylene, optionally substituted C 1 -C 20 heteroalkylene, optionally substituted C 2 -C 20 alkenylene, optionally substituted C 2 -C 20 heteroalkenylene, optionally substituted C 2 -C 20 alkynylene, or optionally substituted C 2 -C 20 heteroalkynylene.
- Q 1 is optionally substituted C 1 -C 20 alkylene or optionally substituted C 1 -C 20 heteroalkylene.
- Q 1 is optionally substituted C 1 -C 20 alkylene.
- Q 1 is optionally substituted C 1 -C 20 heteroalkylene.
- Q 2 is optionally substituted C 3 -C 20 cycloalkylene, optionally substituted C 2 - C 20 heterocycloalkylene, optionally substituted C 4 -C 20 cycloalkenylene, optionally substituted C 4 -C 20 heterocycloalkenylene, optionally substituted C 8 -C 20 cycloalkynylene, optionally substituted C 8 -C 20 heterocycloalkynylene, optionally substituted C 5 -C 15 arylene, or optionally substituted C 2 -C 15 heteroarylene.
- Q 2 is optionally substituted C 3 -C 20 cycloalkylene, optionally substituted C 2 -C 20 heterocycloalkylene, optionally substituted C 4 -C 20 cycloalkenylene, optionally substituted C 4 -C 20 heterocycloalkenylene, optionally substituted C 8 -C 20 cycloalkynylene, or optionally substituted C 8 -C 20 heterocycloalkynylene.
- Q 2 is optionally substituted C 3 -C 20 cycloalkylene or optionally substituted C 2 -C 20 heterocycloalkylene.
- Q 2 is optionally substituted C 3 -C 20 cycloalkylene.
- Q 2 is optionally substituted C 2 -C 20 heterocycloalkylene.
- Q 2 is a five-membered C 2 -C 4 heterocycloalkylene or six- membered C 2 -C 5 heterocycloalkylene.
- Q 3 is optionally substituted C 1 -C 20 alkylene, optionally substituted C 1 -C 20 heteroalkylene, optionally substituted C 2 -C 20 alkenylene, optionally substituted C 2 -C 20 heteroalkenylene, optionally substituted C 2 -C 20 alkynylene, or optionally substituted C 2 -C 20 heteroalkynylene.
- Q 3 is optionally substituted C 1 -C 20 alkylene or optionally substituted C 1 -C 20 heteroalkylene. In some embodiments, Q 3 is optionally substituted C 1 -C 20 alkylene. In some embodiments, Q 3 is optionally substituted C 1 -C 20 heteroalkylene. In some embodiments, Q 3 is , where e4 is an integer from 1 to 10. In some embodiments, Q 4 is optionally substituted C 5 -C 15 arylene or optionally substituted C 2 -C 15 heteroarylene. In some embodiments, Q 4 is optionally substituted C 5 -C 15 arylene. In some embodiments, Q 4 is optionally substituted C 2 -C 15 heteroarylene.
- Q 4 is a five-membered optionally substituted C 2 -C 4 heteroarylene. In some embodiments, Q 4 is a five-membered C 2 -C 4 heteroarylene. In some embodiments, Q 4 is a five- membered C 2 heteroarylene. In some embodiments, Q 5 is optionally substituted C 1 -C 20 alkylene, optionally substituted C 1 -C 20 heteroalkylene, optionally substituted C 2 -C 20 alkenylene, optionally substituted C 2 -C 20 heteroalkenylene, optionally substituted C 2 -C 20 alkynylene, or optionally substituted C 2 -C 20 heteroalkynylene.
- Q 5 is optionally substituted C 1 -C 20 alkylene or optionally substituted C 1 -C 20 heteroalkylene. In some embodiments, Q 5 is optionally substituted C 1 -C 20 alkylene. In some embodiments, Q 5 is optionally substituted C 1 -C 20 heteroalkylene. In some embodiments, Q 5 is , where e5 is an integer from 1 to 8; and e6 is an integer from 1 to 16. In some embodiments of any of the aspects described herein, L includes (e.g., a portion of L that joins A 1 and E includes): , , ,
- L includes (e.g., a portion of L that joins A 1 and E includes):
- L is: , ,
- L includes a polyethylene glycol (PEG) linker.
- PEG linker includes a linker having the repeating unit structure (-CH 2 CH 2 O-) n , wherein n is an integer from 2 to 100.
- a polyethylene glycol linker may covalently join a gp120 binder and E (e.g., in a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)).
- a polyethylene glycol linker may covalently join a first gp120 binder and a second gp120 binder (e.g., in a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)).
- a polyethylene glycol linker may covalently join a gp120 binder dimer and E (e.g., in a conjugate of any one of formulas).
- a polyethylene glycol linker may be selected from any one of PEG 2 to PEG 100 (e.g., PEG 2 , PEG 3 , PEG 4 , PEG 5 , PEG 5 -PEG 10 , PEG 10 - PEG 20 , PEG 20 -PEG 30 , PEG 30 -PEG 40 , PEG 50 -PEG 60 , PEG 60 -PEG 70 , PEG 70 -PEG 80 , PEG 80 -PEG 90 , PEG 90 - PEG 100 ).
- L c includes a PEG linker, where L C is covalently attached to each of Q i and E.
- L is covalently attached to the nitrogen atom of a surface exposed lysine of E or L is covalently attached to the sulfur atom of a surface exposed cysteine of E.
- E is an Fc domain monomer.
- n is 2 and each E dimerizes to form an Fc domain.
- n 2
- each E is an Fc domain monomer, each E dimerizes to form an Fc domain, and the conjugate is described by formula (D-I-1): wherein J is an Fc domain; and T is an integer from 1 to 20 (e.g., T is 1 ,2 ,3 ,4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- each E is an Fc domain monomer, each E dimerizes to form an Fc domain, and the conjugate is described by formula (M-I-1): wherein J is an Fc domain; and T is an integer from 1 to 20 (e.g., T is 1 ,2 ,3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
- each E independently includes an amino acid sequence having at least 90% (e.g., at least 95%, 96%, 97%, 98%, or 99%) sequence identity with the amino acid sequence of any one of SEQ ID Nos: 1-95 and 125-153.
- each E includes the sequence of any one of SEQ ID NOs: 1-95 and 125-153.
- E is an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide.
- n is 1.
- n is 1, E is an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide and the conjugate is described by formula (D-I-2): wherein E is an albumin protein, an albumin protein-binding peptide, or Fc-binding peptide; and T is an integer from 1 to 20, or a pharmaceutically acceptable salt thereof.
- n 1, E is an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide, and the conjugate is described by formula (M-I-2): wherein E is an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide; and T is an integer from 1 to 20, or a pharmaceutically acceptable salt thereof.
- E is an albumin protein having the sequence of any one of SEQ ID NOs: 96-98.
- T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20.
- the disclosure provides a population of conjugates having the structure of any of the conjugates described herein (e.g., a population of conjugates having the formula of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)), wherein the average value of T is 1 to 20 (e.g., the average value of T is 1 to 2, 1 to 3, 1 to 4, 1 to 5, 5 to 10, 10 to 15, or 15 to 20). In some embodiments, the average value of T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20.
- the disclosure provides a pharmaceutical composition
- a pharmaceutical composition comprising any of the conjugates described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M- I)-(M-XVII)), or a pharmaceutically acceptable salt thereof, and a pharmaceutically acceptable excipient.
- the disclosure provides a method for the treatment of a subject having a viral infection or presumed to have a viral infection, the method comprising administering to the subject an effective amount of any of the conjugates or compositions described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)).
- an effective amount of any of the conjugates or compositions described herein e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)).
- the disclosure provides a method for the prophylactic treatment of a viral infection in a subject in need thereof, the method comprising administering to the subject an effective amount of any of the conjugates or compositions described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)).
- the viral infection is caused by HIV.
- the viral infection is HIV-1 or HIV-2.
- the subject is immunocompromised.
- the subject has been diagnosed with humoral immune deficiency, T cell deficiency, neutropenia, asplenia, or complement deficiency.
- the subject is being treated or is about to be treated with an immunosuppressive therapy.
- the subject has been diagnosed with a disease which causes immunosuppression.
- the disease is cancer or acquired immunodeficiency syndrome.
- the cancer is leukemia, lymphoma, or multiple myeloma.
- the subject has undergone or is about to undergo hematopoietic stem cell transplantation. In some embodiments, wherein the subject has undergone or is about to undergo an organ transplant.
- the conjugate of composition is administered intramuscularly, intravenously, intradermally, intraarterially, intraperitoneally, intralesionally, intracranially, intraarticularly, intraprostatically, intrapleurally, intratracheally, intranasally, intravitreally, intravaginally, intrarectally, topically, intratumorally, peritoneally, subcutaneously, subconjunctival, intravesicularlly, mucosally, intrapericardially, intraumbilically, intraocularally, orally, locally, by inhalation, by injection, or by infusion.
- the subject is treated with a second therapeutic agent.
- the second therapeutic agent is an antiviral agent.
- the second therapeutic agent is a viral vaccine.
- the viral vaccine elicits an immune response in the subject against HIV (e.g., HIV-1 or HIV-2).
- HIV e.g., HIV-1 or HIV-2.
- an Fc-domain-containing composition may be substituted for an Fc domain and an Fc-domain-monomer-containing composition may be substituted for an Fc domain monomer in any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII), when n is 1, E is an Fc- domain-monomer-containing composition.
- E is an Fc-domain-containing composition.
- the Fc-domain-containing composition is an antibody or an antibody fragment.
- An antibody may include any form of immunoglobulin, heavy chain antibody, light chain antibody, LRR-based antibody, or other protein scaffold with antibody-like properties, as well as any other immunological binding moiety known in the art, including antibody fragments (e.g., a Fab, Fab', Fab’2, F(ab')2, Fd, Fv, Feb, scFv, or SMIP).
- An antibody fragment may include a binding moiety that includes a portion derived from or having significant homology to an antibody, such as the antigen- determining region of an antibody.
- Exemplary antibody fragments include Fab, Fab', Fab’2, F(ab')2, Fd, Fv, Feb, scFv, and SMIP.
- the antibody or antibody fragment is a human, mouse, camelid (e.g., llama, alpaca, or camel), goat, sheep, rabbit, chicken, guinea pig, hamster, horse, or rat antibody or antibody fragment.
- the antibody is an IgG, IgA, IgD, IgE, IgM, or intrabody.
- the antibody fragment includes an scFv, sdAb, dAb, Fab, Fab', Fab'2, F(ab')2, Fd, Fv, Feb, or SMIP.
- the Fc-domain-containing composition e.g., an antibody or antibody fragment
- confers binding specificity to a one or more targets e.g., an antigen, such as an antigen associated with HIV.
- HIV-targeting antibodies are known in the art, for example, as described in Wibmer et al. Curr. Opin.
- the one or more targets (e.g., an antigen) bound by the Fc-domain- containing composition is a viral (e.g., HIV) protein such as gp41 or gp120 receptor.
- the antibody or antibody fragment recognizes a viral surface antigen.
- E e.g., each E includes the amino acid sequence of SEQ ID NO: 1.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 1.
- E includes the amino acid sequence of SEQ ID NO: 2.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 2.
- E includes the amino acid sequence of SEQ ID NO: 3.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 3.
- E includes the amino acid sequence of SEQ ID NO: 4.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 4.
- E includes the amino acid sequence of SEQ ID NO: 5.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 5.
- E e.g., each E
- E includes the amino acid sequence of SEQ ID NO: 6.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 6.
- E includes the amino acid sequence of SEQ ID NO: 7.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 7.
- E includes the amino acid sequence of SEQ ID NO: 8.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 8.
- E includes the amino acid sequence of SEQ ID NO: 9.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 9.
- E includes the amino acid sequence of SEQ ID NO: 10.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 10.
- E includes the amino acid sequence of SEQ ID NO: 11.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 11.
- E includes the amino acid sequence of SEQ ID NO: 12.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 12.
- E includes the amino acid sequence of SEQ ID NO: 13.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 13.
- E includes the amino acid sequence of SEQ ID NO: 14.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 14.
- E includes the amino acid sequence of SEQ ID NO: 15.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 15.
- E includes the amino acid sequence of SEQ ID NO: 16.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 16.
- E includes the amino acid sequence of SEQ ID NO: 17.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 17.
- E includes the amino acid sequence of SEQ ID NO: 18.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 18.
- E includes the amino acid sequence of SEQ ID NO: 19.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 19.
- E includes the amino acid sequence of SEQ ID NO: 20.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 20.
- E includes the amino acid sequence of SEQ ID NO: 21.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 21.
- E includes the amino acid sequence of SEQ ID NO: 22.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 22.
- E includes the amino acid sequence of SEQ ID NO: 23.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 23.
- E includes the amino acid sequence of SEQ ID NO: 24.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 24.
- E includes the amino acid sequence of SEQ ID NO: 25.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 25.
- E includes the amino acid sequence of SEQ ID NO: 26.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 26.
- E includes the amino acid sequence of SEQ ID NO: 27.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 27.
- E includes the amino acid sequence of SEQ ID NO: 28.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 28.
- E includes the amino acid sequence of SEQ ID NO: 29.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 29.
- E includes the amino acid sequence of SEQ ID NO: 30.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 30.
- E includes the amino acid sequence of SEQ ID NO: 31.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 31.
- E includes the amino acid sequence of SEQ ID NO: 32.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 32.
- E includes the amino acid sequence of SEQ ID NO: 33.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 33.
- E includes the amino acid sequence of SEQ ID NO: 34.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 34.
- E includes the amino acid sequence of SEQ ID NO: 35.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 35.
- E includes the amino acid sequence of SEQ ID NO: 36.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 36.
- E includes the amino acid sequence of SEQ ID NO: 37.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 37.
- E includes the amino acid sequence of SEQ ID NO: 38.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 38.
- E includes the amino acid sequence of SEQ ID NO: 39.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 39.
- E includes the amino acid sequence of SEQ ID NO: 40.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 40.
- E includes the amino acid sequence of SEQ ID NO: 41.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 41.
- E includes the amino acid sequence of SEQ ID NO: 42.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 42.
- E includes the amino acid sequence of SEQ ID NO: 43.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 43.
- E includes the amino acid sequence of SEQ ID NO: 44.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 45.
- E includes the amino acid sequence of SEQ ID NO: 46.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 46.
- E includes the amino acid sequence of SEQ ID NO: 47.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 47.
- E includes the amino acid sequence of SEQ ID NO: 48.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 48.
- E includes the amino acid sequence of SEQ ID NO: 49.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 49.
- E includes the amino acid sequence of SEQ ID NO: 50.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 50.
- E includes the amino acid sequence of SEQ ID NO: 51.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 51.
- E includes the amino acid sequence of SEQ ID NO: 52.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 52.
- E includes the amino acid sequence of SEQ ID NO: 53.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 53.
- E includes the amino acid sequence of SEQ ID NO: 54.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 54.
- E includes the amino acid sequence of SEQ ID NO: 55.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 55.
- E includes the amino acid sequence of SEQ ID NO: 56.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 56.
- E e.g., each E
- E includes the amino acid sequence of SEQ ID NO: 57.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 57.
- E includes the amino acid sequence of SEQ ID NO: 58.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 58.
- E includes the amino acid sequence of SEQ ID NO: 59.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 59.
- E includes the amino acid sequence of SEQ ID NO: 60.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 60.
- E includes the amino acid sequence of SEQ ID NO: 61.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 61.
- E e.g., each E
- E includes the amino acid sequence of SEQ ID NO: 62.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 62.
- E includes the amino acid sequence of SEQ ID NO: 63. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 63. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 64. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 64.
- E includes the amino acid sequence of SEQ ID NO: 65. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 65. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 66. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 66.
- E includes the amino acid sequence of SEQ ID NO: 67.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 67.
- E includes the amino acid sequence of SEQ ID NO: 68.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 68.
- E e.g., each E
- E includes the amino acid sequence of SEQ ID NO: 69.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 69.
- E includes the amino acid sequence of SEQ ID NO: 70. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 70. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 71. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 71.
- E includes the amino acid sequence of SEQ ID NO: 72. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 72. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 73. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 73.
- E includes the amino acid sequence of SEQ ID NO: 74. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 74. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 75. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 75.
- E includes the amino acid sequence of SEQ ID NO: 76.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 76.
- E includes the amino acid sequence of SEQ ID NO: 77.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 77.
- E e.g., each E
- E includes the amino acid sequence of SEQ ID NO: 78.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 78.
- E includes the amino acid sequence of SEQ ID NO: 79. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 79. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 80. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 80.
- E includes the amino acid sequence of SEQ ID NO: 81.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 81.
- E includes the amino acid sequence of SEQ ID NO: 82.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 82.
- E e.g., each E
- E includes the amino acid sequence of SEQ ID NO: 83.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 83.
- E includes the amino acid sequence of SEQ ID NO: 84. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 84. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 85. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 85.
- E includes the amino acid sequence of SEQ ID NO: 86.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 86.
- E includes the amino acid sequence of SEQ ID NO: 87.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 87.
- E e.g., each E
- E includes the amino acid sequence of SEQ ID NO: 88.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 88.
- E includes the amino acid sequence of SEQ ID NO: 89. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 89. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 90. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 90.
- E includes the amino acid sequence of SEQ ID NO: 91.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 91.
- E includes the amino acid sequence of SEQ ID NO: 92.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 92.
- E e.g., each E
- E includes the amino acid sequence of SEQ ID NO: 93.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 93.
- E includes the amino acid sequence of SEQ ID NO: 94. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 94. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 95. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 95.
- E includes the amino acid sequence of SEQ ID NO: 96.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 96.
- E includes the amino acid sequence of SEQ ID NO: 97.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 97.
- E e.g., each E
- E includes the amino acid sequence of SEQ ID NO: 98.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 98.
- E includes the amino acid sequence of SEQ ID NO: 125.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 125.
- E includes the amino acid sequence of SEQ ID NO: 126.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 126.
- E e.g., each E
- E includes the amino acid sequence of SEQ ID NO: 127.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 127.
- E includes the amino acid sequence of SEQ ID NO: 128.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 128.
- E includes the amino acid sequence of SEQ ID NO: 129.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 129.
- E includes the amino acid sequence of SEQ ID NO: 130.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 130.
- E includes the amino acid sequence of SEQ ID NO: 131.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 131.
- E e.g., each E
- E includes the amino acid sequence of SEQ ID NO: 132.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 132.
- E includes the amino acid sequence of SEQ ID NO: 133.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 133.
- E includes the amino acid sequence of SEQ ID NO: 134.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 134.
- E includes the amino acid sequence of SEQ ID NO: 135.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 135.
- E includes the amino acid sequence of SEQ ID NO: 136.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 136.
- E e.g., each E
- E includes the amino acid sequence of SEQ ID NO: 137.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 137.
- E includes the amino acid sequence of SEQ ID NO: 138.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 138.
- E includes the amino acid sequence of SEQ ID NO: 139.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 139.
- E includes the amino acid sequence of SEQ ID NO: 140.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 140.
- E includes the amino acid sequence of SEQ ID NO: 141.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 141.
- E e.g., each E
- E includes the amino acid sequence of SEQ ID NO: 142.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 142.
- E includes the amino acid sequence of SEQ ID NO: 143.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 143.
- E includes the amino acid sequence of SEQ ID NO: 144.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 144.
- E e.g., each E
- E includes the amino acid sequence of SEQ ID NO: 145.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 145.
- E includes the amino acid sequence of SEQ ID NO: 146.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 146.
- E includes the amino acid sequence of SEQ ID NO: 147.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 147.
- E e.g., each E
- E includes the amino acid sequence of SEQ ID NO: 148.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 148.
- E includes the amino acid sequence of SEQ ID NO: 149. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 149. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 150. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 150.
- E includes the amino acid sequence of SEQ ID NO: 151.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 151.
- E includes the amino acid sequence of SEQ ID NO: 152.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 152.
- E e.g., each E
- E includes the amino acid sequence of SEQ ID NO: 153.
- E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 153.
- the Fc domain monomer (e.g., the Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153) includes a triple mutation corresponding to M252Y/S254T/T256E (YTE).
- YTE M252Y/S254T/T256E
- an amino acid “corresponding to” a particular amino acid residue should be understood to include any amino acid residue that one of skill in the art would understand to align to the particular residue (e.g., of the particular sequence).
- any one of SEQ ID NOs: 1-95 and 125-153 may be mutated to include a YTE mutation.
- the Fc domain monomer (e.g., the Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153) includes a double mutant corresponding to M428L/N434S (LS).
- an amino acid “corresponding to” a particular amino acid residue e.g., or a particular SEQ ID NO.
- any one of SEQ ID NOs: 1-95 and 125-153 may be mutated to include a LS mutation.
- the Fc domain monomer (e.g., the Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153) includes a mutant corresponding to N434H.
- an amino acid “corresponding to” a particular amino acid residue e.g., of a particular SEQ ID NO.
- any one of SEQ ID NOs: 1-95 and 125- 153 may be mutated to include an N434H mutation.
- the Fc domain monomer (e.g., the Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153) includes a mutant corresponding to C220S.
- an amino acid “corresponding to” a particular amino acid residue e.g., or a particular SEQ ID NO.
- any one of SEQ ID NOs: 1-95 and 125- 153 may be mutated to include a C220S mutation.
- the Fc domain monomer (e.g., the Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-138 or 168-197) includes a mutant corresponding to K246X, where X is not Lys (e.g., K246X is K246S, K246G, K246A, K246T, K246N, K246Q, K246R, K246H, K246E, or K246D).
- X is not Lys
- an amino acid “corresponding to” a particular amino acid residue should be understood to include any amino acid residue that one of skill in the art would understand to align to the particular residue (e.g., of the particular sequence).
- any one of SEQ ID NOs: 1-95 or 125-153 may be mutated to at position 246 such that position 246 is not a Lys.
- any one of SEQ ID NOs: 1-95 or 125-153 may be mutated to include a K246S mutation.
- the Fc domain monomer (e.g., the Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153) is a fragment of the Fc domain monomer (e.g., a fragment of at least 25 (e.g., 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 or more), at least 50 (e.g., 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75 or more), at least 75 (e.g., 75, 76, 77, 78, 79, 80, 81, 82, 83, 84,
- the Fc domain monomer is a fragment of the Fc domain monomer (e.g
- one or more nitrogen atoms of one or more surface exposed lysine residues of E or one or more sulfur atoms of one or more surface exposed cysteines in E is covalently conjugated to a linker (e.g., a PEG 2 -PEG 20 linker).
- the linker conjugated to E may be functionalized such that it may react to form a covalent bond with the L of any A 1 -L or any A 2 -L-A 1 described herein.
- E is conjugated to a linker functionalized with an azido group and the L of A 1 -L or any A 2 -L-A 1 is functionalized with an alkyne group.
- Conjugation e.g., by click chemistry
- the linker-azido of E and the linker-alkyne of A 1 -L or A 2 -L-A 1 forms a conjugate of the invention, for example a conjugate described by any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)- (M-XVII).
- E is conjugated to a linker functionalized with an alkyne group and L of an A 1 -L or of any A 2 -L-A 1 is functionalized with an azido group.
- Conjugation e.g., by click chemistry
- linker-alkyne of E and linker-azido of A 1 -L or of any A 2 -L-A 1 forms a conjugate of the invention, for example a conjugate described by any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII).
- the squiggly line of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII) represents a covalent bond between the L of A 1 -L or A 2 -L-A 1 or L’ of A 1 -L’ or A 1 -L’-A 2 .
- the squiggly line of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII) represents that one or more amino acid side chains of E (e.g., one or more nitrogen atoms of one or more surface exposed lysine residues of E or one or more sulfur atoms of one or more surface exposed cysteines in E) have been conjugated to a linker (e.g., a PEG 2 -PEG 20 linker) wherein the linker has been functionalized with a reactive moiety, such that the reactive moiety forms a covalent bond with the L of any A 1 -L or any A 2 -L-A 1 described herein (e.g., by click chemistry between an azido functionalized linker and an alkyne functionalized linker, as described above).
- a linker e.g., a PEG 2 -PEG 20 linker
- a 1 and/or A 2 have the structure described by (A-I): In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by: . In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by: . In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by: . In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by: .
- a 1 and/or A 2 have the structure described by: . In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by: . In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by: . In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by: . In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by: . In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by: .
- a 1 and/or A 2 have the structure described by: . In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by: . In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by: . In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by: . In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by: . In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by: .
- a 1 and/or A 2 have the structure described by: . In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by: . In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by: . In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by: .
- a 1 and/or A 2 have the structure described by (A-II): In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by (A-IIaa): In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by (A-IIbb): In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by (A-IIcc): In some embodiments of any of the aspects described herein, A 1 and/or A 2 have the structure described by (A-IIdd): In another aspect, the disclosure features a conjugate selected from any one of conjugates 1-76 (e.g., Conjugates 1-4, 5a, 5b, 6-8, 9a, 9b, 10, 11, 12a, 12b, 13a, 13b, 14a, 14b, 15-28, 29a, 29b, 30a, 30b, 31-36
- the disclosure features a conjugate selected from any one of conjugates 1-48 (e.g., Conjugates 1-4, 5a, 5b, 6-8, 9a, 9b, 10, 11, 12a, 12b, 13a, 13b, 14a, 14b, 15-28, 29a, 29b, 30a, 30b, 31-36, 37a, 37b, and 38-48).
- the conjugate is selected from any one of conjugates 5b, 9b, 12b, 13b, 14b, and 15-48 (e.g., Conjugates 15-28, 29a, 29b, and 30-48).
- the conjugate is selected from any one of conjugates 5b, 29a, 29b, 30a, and 37a.
- the conjugate is conjugate 1, or any regioisomer thereof, and the drug-to- antibody ratio (DAR) (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 2, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 3, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 4, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 5a, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 6, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 7, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 8, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 9a, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 10, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 11, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 12a, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 13a, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 14a, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 5b, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 9b, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 12b, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 13b, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 14b, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 29b, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 15, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 16, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 17, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 18, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 19, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 20, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 21, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 22, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 23, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 24, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 25, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 26, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 27, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 28, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 29a, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 29b, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 30a, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 31, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 32, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 33, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 34, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 35, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 36, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 37a, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 38, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 39, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 40, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 41, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 42, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 43, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 44, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 45, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 46, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 47, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 48, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 30b, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 37b, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 49, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 50, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 51, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 52, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 53, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 54, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 55, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 56, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 57, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 58, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 59, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 60, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 61, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 62, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 63, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 64, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 65, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 66, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 67, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 68, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 69, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 70, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 71, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 72, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 73, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 74, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 75, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 76, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 77, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 78, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 79, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 80, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 81, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 82, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 83, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 84, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 85, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 86, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 87, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 88, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 89, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 90, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 91, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 92, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 93, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 94, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 95, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 96, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 97, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 98, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 99, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 100, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 101, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 102, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 103, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 104, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 105, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 106, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 107, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 108, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 109, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 110, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 111, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 112, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 113, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 114, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 115, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 116, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 117, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 118, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 119, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 120, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- the conjugate is conjugate 121, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
- a population of conjugates described herein has a DAR (e.g., T) of between 1 and 2, 2 and 4, 4 and 6, 6 and 8, 8 and 10, 1 and 10, 1 and 20, 1 and 5, 3 and 7, 5 and 10, or 10 and 20.
- DAR e.g., T
- the DAR (e.g., T) is between 0.5 and 10.0, e.g., 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8.0, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.
- the Fc domain monomer includes less than about 300 amino acid residues (e.g., less than about 300, less than about 295, less than about 290, less than about 285, less than about 280, less than about 275, less than about 270, less than about 265, less than about 260, less than about 255, less than about 250, less than about 245, less than about 240, less than about 235, less than about 230, less than about 225, or less than about 220 amino acid residues).
- the Fc domain monomer is less than about 40 kDa (e.g., less than about 35kDa, less than about 30kDa, less than about 25kDa).
- the Fc domain monomer includes at least 200 amino acid residues (e.g., at least 210, at least 220, at least 230, at least 240, at least 250, at least 260, at least 270, at least 280, at least 290, or at least 300 amino residues). In some embodiments, the Fc domain monomer is at least 20 kDa (e.g., at least 25 kDa, at least 30 kDa, or at least 35 kDa). In some embodiments, the Fc domain monomer includes 200 to 400 amino acid residues (e.g., 200 to 250, 250 to 300, 300 to 350, 350 to 400, 200 to 300, 250 to 350, or 300 to 400 amino acid residues).
- the Fc domain monomer is 20 to 40 kDa (e.g., 20 to 25 kDa, 25 to 30 kDa, 35 to 40 kDa, 20 to 30 kDa, 25 to 35 kDa, or 30 to 40 KDa).
- the Fc domain monomer includes an amino acid sequence at least 90% identical (e.g., at least 95%, at least 98%) to the sequence of any one of SEQ ID NOs: 1-95 and 125-153, or a region thereof.
- the Fc domain monomer includes the amino acid sequence of any one of SEQ ID NOs: 1-95 and 125-153, or a region thereof.
- the Fc domain monomer includes a region of any one of SEQ ID NOs: 1- 95 and 125-153, wherein the region includes positions 220, 252, 254, and 256.
- the region includes at least 40 amino acid residues, at least 50 amino acid residues, at least 60 amino acid residues, at least 70 amino acids residues, at least 80 amino acids residues, at least 90 amino acid residues, at least 100 amino acid residues, at least 110 amino acid residues, at least 120 amino residues, at least 130 amino acid residues, at least 140 amino acid residues, at least 150 amino acid residues, at least 160 amino acid residues, at least 170 amino acid residues, at least 180 amino acid residues, at least 190 amino acid residues, or at least 200 amino acid residues.
- the disclosure features a method of synthesizing a conjugate of formula (D-I): wherein each A 1 and each A 2 is independently selected from any one of formulas (A-I)-(A-VI) as described herein; n is 1 or 2; each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 or 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide; L is a linker covalently attached to E and to Y of each of A 1 and A 2 ; T is an integer from 1 to 20; and each squiggly line in formula (D-I) indicates that L is covalently attached (e.g., by way of a covalent bond or linker) to each E, the method including the steps of: (a) providing an Fc
- the disclosure features a method of synthesizing a conjugate of formula (M-I): wherein each A 1 is selected from any one of formulas (A-I)-(A-VI) as described herein; n is 1 or 2; each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 or 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide; L is a linker covalently attached to E and to Y of A 1 ; T is an integer from 1 to 20; and each squiggly line in formula (M-I) indicates that L is covalently attached (e.g., by way of a covalent bond or linker) to each E, the method including the steps of: (a) providing a first composition including E; (b)
- L’ includes G, wherein G is optionally substituted C 1 -C 6 alkylene, optionally substituted C 1 -C 6 heteroalkylene, optionally substituted C 2 -C 6 alkenylene, optionally substituted C 2 -C 6 heteroalkenylene, optionally substituted C 2 -C 6 alkynylene, optionally substituted C 2 -C 6 heteroalkynylene, optionally substituted C 3 -C 10 cycloalkylene, optionally substituted C 2 -C 10 heterocycloalkylene, optionally substituted C 6 -C 10 arylene, or optionally substituted C 2 -C 10 heteroarylene.
- a compound of formula (MF-I) or salt thereof has the structure of any one of Int-1 to Int-140 (e.g., Int-56, Int-57, and Int-74). In some embodiments, a compound of formula (MF-I) or salt thereof includes the structure of any one of Int-1 to Int-140 (e.g., Int-56, Int-57, and Int-74).
- a compound of formula (MF-I) or salt thereof is synthesized from the structure of any one of Int-1 to Int-140 (e.g., Int-56, Int-57, and Int-74).
- a compound of formula (DF-I) or (MF-I), where each R is halo (e.g., F) provides technical advantages (e.g., increased stability) in methods of synthesizing protein-drug conjugates (e.g., the methods described herein).
- the increased stability allows for purification by reverse phase chromatography.
- the increased stability allows for lyophilization with minimal hydrolysis of the activated ester.
- a compound of formula (DF-I) or (MF-I), where m is 3, provides technical advantages (e.g., increased stability) in methods of synthesizing protein-drug conjugates (e.g., the methods described herein).
- the increased stability allows for purification by reverse phase chromatography.
- the increased stability allows for lyophilization with minimal hydrolysis of the activated ester.
- a compound of formula (DF-I) or (MF-I), where m is 3 and each R is halo (e.g., F) provides technical advantages (e.g., increased stability) in methods of synthesizing protein-drug conjugates (e.g., the methods described herein).
- the increased stability allows for purification by reverse phase chromatography. In some embodiments, the increased stability allows for lyophilization with minimal hydrolysis of the activated ester.
- the disclosure features a method of synthesizing a conjugate of formula (D-I): wherein each A 1 and each A 2 is independently selected from any one of formulas (A-I)-(A-VI) as described herein; n is 1 or 2; each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 or 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide; L is a linker covalently attached to E and to Y of each of A 1 and A 2 ; T is an integer from 1 to 20; and each squiggly line in formula (D-I)
- G is optionally substituted C 1 -C 6 heteroalkylene or optionally substituted C 2 -C 10 heteroarylene. In some embodiments, G is optionally substituted C 1 -C 6 heteroalkylene. In some embodiments, G is , where R a is H, optionally substituted C 1 - C 20 alkylene (e.g., optionally substituted C 1 -C 6 alkylene), or optionally substituted C 1 -C 20 heteroalkylene (e.g., optionally substituted C 1 -C 6 heteroalkylene). In some embodiments, G is optionally substituted C 2 -C 10 heteroarylene. In some embodiments, G is optionally substituted C 2 -C 5 heteroarylene.
- G is a 5-membered or 6- membered optionally substituted C 2 -C 5 heteroarylene. In some embodiments, G is a triazolylene.
- the conjugate of formula (D-I) has the structure of: , and the method includes the steps of: (a) providing a first composition including E; (b) providing a second composition including a compound of formula (DF-II-A) or salt thereof: and (c) combining the first composition, the second composition, and a buffer to form a mixture.
- the synthesis of compound of formula (DF-II-A) includes: (d) providing a third composition including formula (D-G1-A) or salt thereof: (e) providing a fourth composition including formula (D-G1-B) or salt thereof: and (f) combining the third composition and the fourth composition to form a mixture.
- the conjugate of formula (D-I) has the structure of: , and the method includes the steps of: (a) providing a first composition including E; (b) providing a second composition including a compound of formula (DF-II-B) or salt thereof: and (c) combining the first composition, the second composition, and a buffer to form a mixture.
- the synthesis of compound of formula (DF-II-B) includes: (d) providing a third composition including formula (D-G2-A) or salt thereof: (e) providing a fourth composition including formula (D-G2-B) or salt thereof: and (f) combining the third composition and the fourth composition to form a mixture.
- step (f) includes the use of a Cu(I) source.
- the disclosure features a method of synthesizing a conjugate of formula (M-I): wherein each A 1 is selected from any one of formulas (A-I)-(A-VI) as described herein; n is 1 or 2; each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 or 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide; L is a linker covalently attached to E and to Y of A 1 ; T is an integer from 1 to 20; and each squiggly line in formula (M-I) indicates that L is covalently attached (e.g., by way of a covalent bond or linker) to each E, the method including the steps of: (a) providing a first composition including E; (b)
- G is optionally substituted C 1 -C 6 heteroalkylene or optionally substituted C 2 -C 10 heteroarylene. In some embodiments, G is optionally substituted C 1 -C 6 heteroalkylene. In some embodiments, G is , where R a is H, optionally substituted C 1 - C 20 alkylene (e.g., optionally substituted C 1 -C 6 alkylene), or optionally substituted C 1 -C 20 heteroalkylene (e.g., optionally substituted C 1 -C 6 heteroalkylene). In some embodiments, G is optionally substituted C 2 -C 10 heteroarylene. In some embodiments, G is optionally substituted C 2 -C 5 heteroarylene.
- G is a 5-membered or 6- membered optionally substituted C 2 -C 5 heteroarylene.
- G is a triazolylene.
- the conjugate of formula (M-I) has the structure of: , and the method including the steps of: (a) providing a first composition including E; (b) providing a second composition including a compound of formula (MF-II-A) or salt thereof: and (c) combining the first composition, the second composition, and a buffer to form a mixture.
- the synthesis of compound of formula (MF-II-A) includes: (d) providing a third composition including formula (M-G1-A) or salt thereof: ( ) (e) providing a fourth composition including formula (M-G1-B) or salt thereof: and (f) combining the third composition and the fourth composition to form a mixture.
- the conjugate of formula (M-I) has the structure of: , and the method includes the steps of: (a) providing a first composition including E; (b) providing a second composition including a compound of formula (MF-II-B) or salt thereof: and (c) combining the first composition, the second composition, and a buffer to form a mixture.
- the synthesis of compound of formula (MF-II-B) includes: (d) providing a third composition including formula (M-G2-A) or salt thereof: (e) providing a fourth composition including formula (M-G2-B) or salt thereof: and (f) combining the third composition and the fourth composition to form a mixture.
- step (f) includes the use of a Cu(I) source.
- the disclosure features a method of synthesizing a conjugate of formula (D-I): , wherein each A 1 and each A 2 is independently selected from any one of formulas (A-I)-(A-VI) as described herein; n is 1 or 2; each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 or 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptidee; L is a linker covalently attached to E and to Y of each of A 1 and A 2 ; T is an integer from 1 to 20; and each squiggly line in formula (D-I) indicates that L is covalently attached (e.g., by way of a covalent bond or linker) to each E, the method including the steps of: (
- step (c) includes the use of a Cu(I) source.
- the method further includes: (d) providing a third composition including E; and (e) combining the third composition, the first mixture, and a buffer to form a second mixture.
- the disclosure features a method of synthesizing a conjugate of formula (M-I): wherein each A 1 is selected from any one of formulas (A-I)-(A-VI) as described herein; n is 1 or 2; each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 or 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide; L is a linker covalently attached to E and to Y of A 1 ; T is an integer from 1 to 20; T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), where when T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16,
- step (c) includes the use of a Cu(I) source.
- the method further includes: (d) providing a third composition including E; and (e) combining the third composition, the first mixture, and a buffer to form a second mixture.
- G a includes optionally substituted amino.
- G b includes a carbonyl.
- G a includes a carbonyl.
- G b includes optionally substituted amino.
- G a includes an azido group.
- G b includes an alknyl group.
- G a includes an alkynyl group.
- G b includes an azido group.
- a compound of formula (MF-II) or salt thereof has the structure of any one of Int-1 to Int-140 (e.g., Int-56, Int-57, and Int-74)
- a compound of formula (MF-II) or salt thereof includes the structure of any one of Int-1 to Int-140 (e.g., Int-56, Int-57, and Int-74).
- a compound of formula (MF-II) or salt thereof is synthesized from the structure of any one of Int-1 to Int-140 (e.g., Int 110, Int-133, or Int- 148).
- a compound of formula (DF-II) e.g., a compound of formula (DF-II-A) or (DF-II-B) and/or a compound of formula (D-G1-A) or (D-G2-A), where each R is halo (e.g., F)
- a compound of formula (DF-II) e.g., a compound of formula (DF-II-A) or (DF-II-B) and/or a compound of formula (D-G1-A) or (D-G2-A), where each R is halo (e.g., F)
- the increased stability allows for purification by reverse phase chromatography. In some embodiments, the increased stability allows for lyophilization with minimal hydrolysis of the activated ester.
- a compound of formula (DF-II) e.g., a compound of formula (DF-II-A) or (DF-II-B) and/or a compound of formula (D-G1-A) or (D-G2-A), where m is 3, provides technical advantages (e.g., increased stability) in methods of synthesizing protein-drug conjugates (e.g., the methods described herein).
- the increased stability allows for purification by reverse phase chromatography.
- the increased stability allows for lyophilization with minimal hydrolysis of the activated ester.
- a compound of formula (DF-II) e.g., a compound of formula (DF-II-A) or (DF-II-B) and/or a compound of formula (D-G1-A) or (D-G2-A), where m is 3 and each R is halo (e.g., F)
- DF-II a compound of formula (DF-II)
- D-G1-A) or (DF-G2-A) where m is 3 and each R is halo (e.g., F)
- the increased stability allows for purification by reverse phase chromatography.
- the increased stability allows for lyophilization with minimal hydrolysis of the activated ester.
- a compound of formula (MF-II) e.g., a compound of formula (MF-II-A) or (MF-II-B) and/or a compound of formula (M-G1-A) or (M-G2-A), where each R is halo (e.g., F)
- MF-II a compound of formula
- M-G1-A e.g., F
- the increased stability allows for purification by reverse phase chromatography.
- the increased stability allows for lyophilization with minimal hydrolysis of the activated ester.
- a compound of formula (MF-II) (e.g., a compound of formula (MF-II-A) or (MF-II-B) and/or a compound of formula (M-G1-A) or (M-G2-A), where m is 3, provides technical advantages (e.g., increased stability) in methods of synthesizing protein-drug conjugates (e.g., the methods described herein).
- the increased stability allows for purification by reverse phase chromatography.
- the increased stability allows for lyophilization with minimal hydrolysis of the activated ester.
- a compound of formula (MF-II) (e.g., a compound of formula (MF-II-A) or (MF-II-B) and/or a compound of formula (M-G1-A) or (M-G2-A), where m is 3 and each R is halo (e.g., F), provides technical advantages (e.g., increased stability) in methods of synthesizing protein-drug conjugates (e.g., the methods described herein).
- the increased stability allows for purification by reverse phase chromatography.
- the increased stability allows for lyophilization with minimal hydrolysis of the activated ester.
- the first composition including E is an Fc domain (e.g., n is 2, each E is an Fc domain monomer, and the Fc domain monomers dimerize to form an Fc domain).
- E includes at least one lysine residue.
- the squiggly line in formula (D-I) or (M-I) is covalently bound to a lysine residue of each E.
- E includes at least one cysteine residue.
- the squiggly line in formula (D-I) or (M-I) is covalently bound to a cysteine residue of each E.
- each R is, independently, halo, cyano, nitro, haloalkyl, or , where R z is optionally substituted C 1 -C 5 alkyl group or optionally substituted C 1 -C 5 heteroalkyl group.
- each R is, independently, halo, cyano, nitro, or haloalkyl.
- each R is, independently, F, Cl, Br, or I.
- each R is F.
- m is 3 or 4.
- m is 3. In some embodiments, m is 4.
- the buffer includes borate or carbonate. In some embodiments, the buffer includes borate. In some embodiments, the buffer includes carbonate. In some embodiments, the buffer has a pH of about 7.0 to 10.0 (e.g., about 7.0 to 7.5, 7.5 to 8.0, 8.0 to 8.5, 8.5 to 9.0, 9.0 to 9.5, 9.5 to 10.0, 7.0 to 8.0, 7.5 to 8.5, 8.0 to 9.0, 8.5 to 9.5, 9.0 to 10.0, 7.0 to 9.0, 7.5 to 9.5, or 8.0 to 10.0).
- 7.0 to 10.0 e.g., about 7.0 to 7.5, 7.5 to 8.0, 8.0 to 8.5, 8.5 to 9.0, 9.0 to 9.5, 9.5 to 10.0, 7.0 to 8.0, 7.5 to 8.5, 9.5, 9.0 to 10.0, 7.0 to 9.0, 7.5 to 9.5, or 8.0 to 10.0.
- the buffer has a pH of about 7.0. In some embodiments, the buffer has a pH of about 7.1. In some embodiments, the buffer has a pH of about 7.2. In some embodiments, the buffer has a pH of about 7.3. In some embodiments, the buffer has a pH of about 7.4. In some embodiments, the buffer has a pH of about 7.5. In some embodiments, the buffer has a pH of about 7.6. In some embodiments, the buffer has a pH of about 7.7. In some embodiments, the buffer has a pH of about 7.8. In some embodiments, the buffer has a pH of about 7.9. In some embodiments, the buffer has a pH of about 8.0. In some embodiments, the buffer has a pH of about 8.1.
- the buffer has a pH of about 8.2. In some embodiments, the buffer has a pH of about 8.3. In some embodiments, the buffer has a pH of about 8.4. In some embodiments, the buffer has a pH of about 8.5. In some embodiments, the buffer has a pH of about 8.6. In some embodiments, the buffer has a pH of about 8.7. In some embodiments, the buffer has a pH of about 8.8. In some embodiments, the buffer has a pH of about 8.9. In some embodiments, the buffer has a pH of about 9.0. In some embodiments, the buffer has a pH of about 9.5. In some embodiments, the buffer has a pH of about 9.6. In some embodiments, the buffer has a pH of about 9.7.
- the buffer has a pH of about 9.8. In some embodiments, the buffer has a pH of about 9.9. In some embodiments, the buffer has a pH of about 10.0. In some embodiments of any of the aspects described herein, step (c) or step (e) is conducted at a temperature of 5 to 50 °C, such as 20 to 30 °C (e.g., 20 to 25, 21 to 26, 22 to 27, 23 to 28, 24 to 29, or 25 to 30 °C). In some embodiments, step (c) or step (e) is conducted at a temperature of about 25 °C.
- step (c) or step (e) is conducted for about 1 to 24 hours, such as 1 to 12 hours (e.g., 1 to 2, 1 to 5, 2 to 3, 2 to 5, 2 to 10, 2 to 12, 3 to 4, 4 to 5, 1 to 3, 2 to 4, or 3 to 5 hours).
- step (c) or step (e) is conducted for about 2 hours.
- step (c) or step (e) is conducted for about 3 hours.
- step (c) or step (e) is conducted for about 4 hours.
- step (c) or step (e) is conducted for about 5 hours.
- step (c) or step (e) is conducted for about 6 hours.
- step (c) or step (e) is conducted for about 7 hours. In some embodiments, step (c) or step (e) is conducted for about 8 hours. In some embodiments, step (c) or step (e) is conducted for about 9 hours. In some embodiments, step (c) or step (e) is conducted for about 10 hours. In some embodiments, step (c) or step (e) is conducted for about 11 hours. In some embodiments, step (c) or step (e) is conducted for about 12 hours.
- the first composition or third composition includes phosphate-buffered saline buffer.
- the buffer has a pH of about 7.0 to 8.0 (e.g., about 7.0 to 7.5, 7.5 to 8.0, 7.0 to 7.2, 7.2 to 7.4, 7.4 to 7.6, 7.6 to 7.8, or 7.8 to 8.0). In some embodiments, the buffer has a pH of about 7.5. In some embodiments, the second composition or the first mixture includes DMF. In some embodiments, the method further includes a purification step. In some embodiments, the purification step includes dialysis in arginine buffer. In some embodiments, the purification step includes a buffer exchange. In some embodiments, T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20).
- the average value of T is 1 to 20 (e.g., the average value of T is 1 to 2, 1 to 3, 1 to 4, 1 to 5, 5 to 10, 10 to 15, or 15 to 20). In some embodiments, the average value of T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments, the average T is 1 to 10 (e.g., 1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5, 8, 8.5, 9, 9.5, or 10).
- the average T is 1 to 5 (e.g., 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, or 5).
- the average T is 5 to 10 (e.g., 5, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10).
- the average T is 2.5 to 7.5 (e.g., 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7, 7.1, 7.2, 7.3, 7.4, or 7.5). Definitions To facilitate the understanding of this invention, a number of terms are defined below.
- viral infection is meant the pathogenic growth of a virus (e.g., the human immunodeficiency virus) in a host organism (e.g., a human subject).
- a viral infection can be any situation in which the presence of a viral population(s) is damaging to a host body.
- Fc domain monomer refers to a polypeptide chain that includes at least a hinge domain and second and third antibody constant domains (CH 2 and CH3) or functional fragments thereof (e.g., fragments that that capable of (i) dimerizing with another Fc domain monomer to form an Fc domain, and (ii) binding to an Fc receptor.
- the Fc domain monomer can be any immunoglobulin antibody isotype, including IgG, IgE, IgM, IgA, or IgD (e.g., IgG). Additionally, the Fc domain monomer can be an IgG subtype (e.g., IgG1, IgG2a, IgG2b, IgG3, or IgG4) (e.g., IgG1). An Fc domain monomer does not include any portion of an immunoglobulin that is capable of acting as an antigen-recognition region, e.g., a variable domain or a complementarity determining region (CDR).
- CDR complementarity determining region
- Fc domain monomers in the conjugates as described herein can contain one or more changes from a wild- type Fc domain monomer sequence (e.g., 1-10, 1-8, 1-6, 1-4 amino acid substitutions, additions, or deletions) that alter the interaction between an Fc domain and an Fc receptor. Examples of suitable changes are known in the art.
- a human Fc domain monomer e.g., an IgG heavy chain, such as IgG1
- IgG heavy chain such as IgG1
- a human Fc domain monomer includes a region that extends from any of Asn208, Glu216, Asp221, Lys222, or Cys226 to the carboxyl-terminus of the heavy chain at Lys447.
- C-terminal Lys447 of the Fc region may or may not be present, without affecting the structure or stability of the Fc region.
- numbering of amino acid residues in the IgG or Fc domain monomer is according to the EU numbering system for antibodies, also called the Kabat EU index, as described, for example, in Kabat et al., Sequences of Proteins of Immunological Interest, 5th Ed. Public Health Service, National Institutes of Health, Bethesda, MD, 1991.
- the term “Fc domain” refers to a dimer of two Fc domain monomers that is capable of binding an Fc receptor.
- the two Fc domain monomers dimerize by the interaction between the two CH3 antibody constant domains, in some embodiments, one or more disulfide bonds form between the hinge domains of the two dimerizing Fc domain monomers.
- covalently attached refers to two parts of a conjugate that are linked to each other by a covalent bond formed between two atoms in the two parts of the conjugate.
- Fc-binding peptide refers to a polypeptide having an amino acid sequence of 5 to 50 (e.g., 5 to 40, 5 to 30, 5 to 20, 5 to 15, 5 to 10, 10 to 50, 10 to 30, or 10 to 20) amino acid residues that has affinity for and functions to bind an Fc domain, such as any of the Fc domain described herein.
- An Fc-binding peptide can be of different origins, e.g., synthetic, human, mouse, or rat.
- Fc-binding peptides of the invention include Fc-binding peptides which have been engineered to include one or more (e.g., two, three, four, or five) solvent-exposed cysteine or lysine residues, which may provide a site for conjugation to a compound of the invention (e.g., conjugation to a gp120 binder monomer or dimer, including by way of a linker). Most preferably, the Fc-binding peptide will contain a single solvent-exposed cysteine or lysine, thus enabling site-specific conjugation of a compound of the invention. Fc-binding peptides may include only naturally occurring amino acid residues, or may include one or more non-naturally occurring amino acid residues.
- a non-naturally occurring amino acid residue e.g., the side chain of a non-naturally occurring amino acid residue
- a compound of the invention e.g., a gp120 binder monomer or dimer, including by way of a linker
- Fc-binding peptides of the invention may be linear or cyclic.
- Fc-binding peptides of the invention include any Fc-binding peptides known to one of skill in the art.
- albumin protein refers to a polypeptide comprising an amino acid sequence corresponding to a naturally-occurring albumin protein (e.g., human serum albumin) or a variant thereof, such as an engineered variant of a naturally-occurring albumin protein.
- Variants of albumin proteins include polymorphisms, fragments such as domains and sub-domains, and fusion proteins (e.g., an albumin protein having a C-terminal or N-terminal fusion, such as a polypeptide linker).
- the albumin protein has the amino acid sequence of human serum albumin (HSA) or a variant or fragment thereof, most preferably a functional variant or fragment thereof.
- Albumin proteins of the invention include proteins having at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to any one of SEQ ID NOs: 96-98.
- Albumin proteins of the invention include albumin proteins which have been engineered to include one or more (e.g., two, three, four, or five) solvent-exposed cysteine or lysine residues, which may provide a site for conjugation to a compound of the invention (e.g., conjugation to a gp120 binder monomer or dimer, including by way of a linker).
- albumin protein will contain a single solvent-exposed cysteine or lysine, thus enabling site- specific conjugation of a compound of the invention.
- Albumin proteins may include only naturally occurring amino acid residues, or may include one or more non-naturally occurring amino acid residues. Where included, a non-naturally occurring amino acid residue (e.g., the side chain of a non-naturally occurring amino acid residue) may be used as the point of attachment for a compound of the invention (e.g., a gp120 binder monomer or dimer, including by way of a linker).
- albumin protein-binding peptide refers to a polypeptide having an amino acid sequence of 5 to 50 (e.g., 5 to 40, 5 to 30, 5 to 20, 5 to 15, 5 to 10, 10 to 50, 10 to 30, or 10 to 20) amino acid residues that has affinity for and functions to bind an albumin protein, such as any of the albumin proteins described herein.
- the albumin protein-binding peptide binds to a naturally- occurring serum albumin, most preferably human serum albumin.
- An albumin protein-binding peptide can be of different origins, e.g., synthetic, human, mouse, or rat.
- Albumin protein-binding peptides of the invention include albumin protein-binding peptides which have been engineered to include one or more (e.g., two, three, four, or five) solvent-exposed cysteine or lysine residues, which may provide a site for conjugation to a compound of the invention (e.g., conjugation to a gp120 binder monomer or dimer, including by way of a linker). Most preferably, the albumin protein-binding peptide will contain a single solvent-exposed cysteine or lysine, thus enabling site-specific conjugation of a compound of the invention.
- Albumin protein-binding peptides may include only naturally occurring amino acid residues, or may include one or more non-naturally occurring amino acid residues. Where included, a non-naturally occurring amino acid residue (e.g., the side chain of a non-naturally occurring amino acid residue) may be used as the point of attachment for a compound of the invention (e.g., a gp120 binder monomer or dimer, including by way of a linker). Albumin protein-binding peptides of the invention may be linear or cyclic. Albumin protein-binding peptide of the invention include any albumin protein-binding peptides known to one of skill in the art, examples of which, are provided herein.
- albumin protein-binding peptides are provided in U.S. Patent Application No.2005/0287153, which is incorporated herein by reference in its entirety.
- a “surface exposed amino acid” or “solvent-exposed amino acid,” such as a surface exposed cysteine or a surface exposed lysine refers to an amino acid that is accessible to the solvent surrounding the protein.
- a surface exposed amino acid may be a naturally-occurring or an engineered variant (e.g., a substitution or insertion) of the protein.
- a surface exposed amino acid is an amino acid that when substituted does not substantially change the three- dimensional structure of the protein.
- linker refers to a covalent linkage or connection between two or more components in a conjugate (e.g., between two gp120 binders in a conjugate described herein, between a gp120 binder and an Fc domain or albumin protein in a conjugate described herein, and between a dimer of two gp120 binders and an Fc domain or an albumin protein in a conjugate described herein).
- a conjugate described herein may contain a linker that has a trivalent structure (e.g., a trivalent linker).
- a trivalent linker has three arms, in which each arm is covalently linked to a component of the conjugate (e.g., a first arm conjugated to a first gp120 binder, a second arm conjugated to a second gp120 binder, and a third arm conjugated to an Fc domain or an albumin protein).
- a component of the conjugate e.g., a first arm conjugated to a first gp120 binder, a second arm conjugated to a second gp120 binder, and a third arm conjugated to an Fc domain or an albumin protein.
- Molecules that may be used as linkers include at least two functional groups, which may be the same or different, e.g., two carboxylic acid groups, two amine groups, two sulfonic acid groups, a carboxylic acid group and a maleimide group, a carboxylic acid group and an alkyne group, a carboxylic acid group and an amine group, a carboxylic acid group and a sulfonic acid group, an amine group and a maleimide group, an amine group and an alkyne group, or an amine group and a sulfonic acid group.
- two functional groups which may be the same or different, e.g., two carboxylic acid groups, two amine groups, two sulfonic acid groups, a carboxylic acid group and a maleimide group, a carboxylic acid group and an alkyne group, a carboxylic acid group and an amine group, a carboxylic acid group and a sulfonic acid
- the first functional group may form a covalent linkage with a first component in the conjugate and the second functional group may form a covalent linkage with the second component in the conjugate.
- two arms of a linker may contain two dicarboxylic acids, in which the first carboxylic acid may form a covalent linkage with the first gp120 binder in the conjugate and the second carboxylic acid may form a covalent linkage with the second gp120 binder in the conjugate, and the third arm of the linker may for a covalent linkage with an Fc domain or albumin protein in the conjugate. Examples of dicarboxylic acids are described further herein.
- a molecule containing one or more maleimide groups may be used as a linker, in which the maleimide group may form a carbon-sulfur linkage with a cysteine in a component (e.g., an Fc domain monomer, an Fc domain, or an albumin protein) in the conjugate.
- a molecule containing one or more alkyne groups may be used as a linker, in which the alkyne group may form a 1,2,3-triazole linkage with an azide in a component (e.g., an Fc domain monomer, an Fc domain, or an albumin protein) in the conjugate.
- a molecule containing one or more azide groups may be used as a linker, in which the azide group may form a 1,2,3-triazole linkage with an alkyne in a component (e.g., an Fc domain monomer, an Fc domain, or an albumin protein) in the conjugate.
- a molecule containing one or more bis-sulfone groups may be used as a linker, in which the bis-sulfone group may form a linkage with an amine group a component (e.g., an Fc domain monomer, an Fc domain, or an albumin protein) in the conjugate.
- a molecule containing one or more sulfonic acid groups may be used as a linker, in which the sulfonic acid group may form a sulfonamide linkage with a component in the conjugate.
- a molecule containing one or more isocyanate groups may be used as a linker, in which the isocyanate group may form a urea linkage with a component in the conjugate.
- a molecule containing one or more haloalkyl groups may be used as a linker, in which the haloalkyl group may form a covalent linkage, e.g., C-N and C-O linkages, with a component in the conjugate.
- a molecule containing one or more phenyl ester groups may be used as a linker, in which the phenyl ester group (e.g., triflurophenyl ester group or tetrafluorophenyl ester group) may form an amide with an amine in a component (e.g., a fusion protein) in the conjugate.
- a linker provides space, rigidity, and/or flexibility between the two or more components.
- a linker may be a bond, e.g., a covalent bond.
- a linker refers to a chemical bond, e.g., an amide bond, a disulfide bond, a C-O bond, a C-N bond, a N-N bond, a C-S bond, or any kind of bond created from a chemical reaction, e.g., chemical conjugation.
- a linker includes no more than 250 atoms. In some embodiments, a linker includes no more than 250 non-hydrogen atoms. In some embodiments, the backbone of a linker includes no more than 250 atoms.
- the “backbone” of a linker refers to the atoms in the linker that together form the shortest path from one part of a conjugate to another part of the conjugate (e.g., the shortest path linking a first gp120 binder and a second gp120 binder).
- the atoms in the backbone of the linker are directly involved in linking one part of a conjugate to another part of the conjugate (e.g., linking a first gp120 binder and a second gp120 binder).
- hydrogen atoms attached to carbons in the backbone of the linker are not considered as directly involved in linking one part of the conjugate to another part of the conjugate.
- a linker may comprise a synthetic group derived from, e.g., a synthetic polymer (e.g., a polyethylene glycol (PEG) polymer).
- a linker may comprise one or more amino acid residues, such as D- or L-amino acid residues.
- a linker may be a residue of an amino acid sequence (e.g., a 1-25 amino acid, 1-10 amino acid, 1-9 amino acid, 1-8 amino acid, 1-7 amino acid, 1-6 amino acid, 1-5 amino acid, 1-4 amino acid, 1-3 amino acid, 1-2 amino acid, or 1 amino acid sequence).
- a linker may comprise one or more, e.g., 1-100, 1-50, 1-25, 1-10, 1-5, or 1-3, optionally substituted alkylene, optionally substituted heteroalkylene (e.g., a PEG unit), optionally substituted alkenylene, optionally substituted heteroalkenylene, optionally substituted alkynylene, optionally substituted heteroalkynylene, optionally substituted cycloalkylene, optionally substituted heterocycloalkylene, optionally substituted cycloalkenylene, optionally substituted heterocycloalkenylene, optionally substituted cycloalkynylene, optionally substituted heterocycloalkynylene, optionally substituted arylene, optionally substituted heteroarylene (e.g., pyridine), O, S, NR i , , (each R i is, independently, H, optionally substituted alkyl, optionally substituted heteroalkyl, optionally substituted alkenyl,
- a linker may comprise one or more optionally substituted C 1 -C 20 alkylene, optionally substituted C 1 -C 20 heteroalkylene (e.g., a PEG unit), optionally substituted C 2 -C 20 alkenylene (e.g., C 2 alkenylene), optionally substituted C 2 -C 20 heteroalkenylene, optionally substituted C 2 -C 20 alkynylene, optionally substituted C 2 -C 20 heteroalkynylene, optionally substituted C 3 -C 20 cycloalkylene (e.g., cyclopropylene, cyclobutylene), optionally substituted C 2 -C 20 heterocycloalkylene, optionally substituted C 4 -C 20 cycloalkenylene, optionally substituted C 4 -C 20 heterocycloalkenylene, optionally substituted C 8 -C 20 cycloalkynylene, optionally substituted C 8 -C 20 heterocycloalkynylene,
- alkyl straight-chain and branched- chain monovalent substituents, as well as combinations of these, containing only C and H when unsubstituted.
- alkyl group includes at least one carbon-carbon double bond or carbon-carbon triple bond, the alkyl group can be referred to as an “alkenyl” or “alkynyl” group respectively.
- alkenyl or alkynyl group respectively.
- the monovalency of an alkyl, alkenyl, or alkynyl group does not include the optional substituents on the alkyl, alkenyl, or alkynyl group.
- alkyl, alkenyl, or alkynyl group is attached to a compound
- monovalency of the alkyl, alkenyl, or alkynyl group refers to its attachment to the compound and does not include any additional substituents that may be present on the alkyl, alkenyl, or alkynyl group.
- the alkyl or heteroalkyl group may contain, e.g., 1-20.1-18, 1-16, 1-14, 1-12, 1-10, 1-8, 1- 6, 1-4, or 1-2 carbon atoms (e.g., C 1 -C 20 , C 1 -C 18 , C 1 -C 16 , C 1 -C 14 , C 1 -C 12 , C 1 -C 10 , C 1 -C 8 , C 1 -C 6 , C 1 -C 4 , or C 1 -C 2 ).
- 1-20.1-18, 1-16, 1-14, 1-12, 1-10, 1-8, 1- 6, 1-4, or 1-2 carbon atoms e.g., C 1 -C 20 , C 1 -C 18 , C 1 -C 16 , C 1 -C 14 , C 1 -C 12 , C 1 -C 10 , C 1 -C 8 , C 1 -C 6 , C 1 -C 4 , or C 1
- the alkenyl, heteroalkenyl, alkynyl, or heteroalkynyl group may contain, e.g., 2-20, 2-18, 2-16, 2-14, 2-12, 2-10, 2-8, 2-6, or 2-4 carbon atoms (e.g., C 2 -C 20 , C 2 -C 18 , C 2 -C 16 , C 2 -C 14 , C 2 -C 12 , C 2 -C 10 , C 2 -C 8 , C 2 -C 6 , or C 2 -C 4 ).
- carbon atoms e.g., C 2 -C 20 , C 2 -C 18 , C 2 -C 16 , C 2 -C 14 , C 2 -C 12 , C 2 -C 10 , C 2 -C 8 , C 2 -C 6 , or C 2 -C 4 ).
- cycloalkyl represents a monovalent saturated or unsaturated non- aromatic cyclic alkyl group.
- a cycloalkyl may have, e.g., three to twenty carbons (e.g., a C 3 -C 7 , C 3 -C 8 , C 3 - C 9 , C 3 -C 10 , C 3 -C 11 , C 3 -C 12 , C 3 -C 14 , C 3 -C 16 , C 3 -C 18 , or C 3 -C 20 cycloalkyl).
- Examples of cycloalkyls include, but are not limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, and cycloheptyl.
- the cycloalkyl group When the cycloalkyl group includes at least one carbon-carbon double bond, the cycloalkyl group can be referred to as a “cycloalkenyl” group.
- a cycloalkenyl may have, e.g., four to twenty carbons (e.g., a C 4 -C 7 , C 4 -C 8 , C 4 - C 9 , C 4 -C 10 , C 4 -C 11 , C 4 -C 12 , C 4 -C 14 , C 4 -C 16 , C 4 -C 18 , or C 4 -C 20 cycloalkenyl).
- Exemplary cycloalkenyl groups include, but are not limited to, cyclopentenyl, cyclohexenyl, and cycloheptenyl.
- the cycloalkyl group includes at least one carbon-carbon triple bond, the cycloalkyl group can be referred to as a “cycloalkynyl” group.
- a cycloalkynyl may have, e.g., eight to twenty carbons (e.g., a C 8 -C 9 , C 8 -C 10 , C 8 -C 11 , C 8 -C 12 , C 8 - C 14 , C 8 -C 16 , C 8 -C 18 , or C 8 -C 20 cycloalkynyl).
- the term “cycloalkyl” also includes a cyclic compound having a bridged multicyclic structure in which one or more carbons bridges two non-adjacent members of a monocyclic ring, e.g., bicyclo[2.2.1.]heptyl and adamantane.
- cycloalkyl also includes bicyclic, tricyclic, and tetracyclic fused ring structures, e.g., decalin and spiro cyclic compounds.
- a “heterocycloalkyl,” “heterocycloalkenyl,” or “heterocycloalkynyl” group refers to a cycloalkyl, cycloalkenyl, or cycloalkynyl group having one or more rings (e.g., 1, 2, 3, 4 or more rings) that has one or more heteroatoms independently selected from, e.g., N, O, and S.
- heterocycloalkyl groups include pyrrolidine, thiophene, thiolane, tetrahydrofuran, piperidine, tetrahydropyran, quinoline, isoquinoline, cinnoline, phthalazine, quinazoline, quinoxaline, indole, benzothiophene, benzofuran, isoindole, benzo[c]thiophene, isobenzofuran, benzimidazole, benzoxazole, benzothiazole, 1H-indazole, 1,2,benzisoxazole, 1,2-benzisothiazole, 2,1-benzisothiazole, 2,1-benzisoxazole, purine, pyrrolizidine, indene, fluorene, carbazole, dibenzofuran, acridine, phenazine, and phenoxazine.
- aryl refers to any monocyclic or fused ring bicyclic or tricyclic system which has the characteristics of aromaticity in terms of electron distribution throughout the ring system, e.g., phenyl, naphthyl, or phenanthrene.
- a ring system contains 5-15 ring member atoms or 5-10 ring member atoms.
- An aryl group may have, e.g., five to fifteen carbons (e.g., a C 5 -C 6 , C 5 -C 7 , C 5 -C 8 , C 5 -C 9 , C 5 -C 10 , C 5 -C 11 , C 5 -C 12 , C 5 -C 13 , C 5 - C 14 , or C 5 -C 15 aryl).
- the term “heteroaryl” also refers to such monocyclic or fused bicyclic ring systems containing one or more, e.g., 1-4, 1-3, 1, 2, 3, or 4, heteroatoms selected from O, S and N.
- a heteroaryl group may have, e.g., two to fifteen ring member atoms (e.g., a C 2 -C 3 , C 2 -C 4 , C 2 -C 5 , C 2 -C 6 , C 2 -C 7 , C 2 -C 8 , C 2 -C 9 , C 2 -C 10 , C 2 -C 11 , C 2 -C 12 , C 2 -C 13 , C 2 -C 14 , or C 3 -C 15 heteroaryl).
- the inclusion of a heteroatom permits inclusion of 5-membered rings to be considered aromatic as well as 6-membered rings.
- heteroaryl systems include, e.g., pyridyl, pyrimidyl, indolyl, benzimidazolyl, benzotriazolyl, isoquinolyl, quinolyl, benzothiazolyl, benzofuranyl, thienyl, furyl, pyrrolyl, thiazolyl, triazolyl (e.g., 1,2,3- or 1,2,4-triazolyl) oxazolyl, isoxazolyl, benzoxazolyl, benzoisoxazolyl, and imidazolyl. Because tautomers are possible, a group such as phthalimido is also considered heteroaryl.
- the aryl or heteroaryl group is a 5- or 6-membered aromatic rings system optionally containing 1-2 nitrogen atoms.
- the aryl or heteroaryl group is an optionally substituted phenyl, pyridyl, indolyl, pyrimidyl, pyridazinyl, benzothiazolyl, benzimidazolyl, pyrazolyl, imidazolyl, isoxazolyl, thiazolyl, or imidazopyridinyl.
- the aryl group is phenyl.
- an aryl group may be optionally substituted with a substituent such an aryl substituent, e.g., biphenyl.
- aryl substituent e.g., biphenyl.
- alkaryl refers to an aryl group that is connected to an alkylene, alkenylene, or alkynylene group. In general, if a compound is attached to an alkaryl group, the alkylene, alkenylene, or alkynylene portion of the alkaryl is attached to the compound.
- an alkaryl is C 6 -C 3 5 alkaryl (e.g., C 6 -C 16 , C 6 -C 14 , C 6 -C 12 , C 6 -C 10 , C 6 -C 9 , C 6 -C 8 , C 7 , or C 6 alkaryl), in which the number of carbons indicates the total number of carbons in both the aryl portion and the alkylene, alkenylene, or alkynylene portion of the alkaryl.
- C 6 -C 3 5 alkaryl e.g., C 6 -C 16 , C 6 -C 14 , C 6 -C 12 , C 6 -C 10 , C 6 -C 9 , C 6 -C 8 , C 7 , or C 6 alkaryl
- alkaryls include, but are not limited to, (C 1 -C 8 )alkylene(C 6 - C 12 )aryl, (C 2 -C 8 )alkenylene(C 6 -C 12 )aryl, or (C 2 -C 8 )alkynylene(C 6 -C 12 )aryl.
- an alkaryl is benzyl or phenethyl.
- one or more heteroatoms selected from N, O, and S may be present in the alkylene, alkenylene, or alkynylene portion of the alkaryl group and/or may be present in the aryl portion of the alkaryl group.
- the substituent may be present on the alkylene, alkenylene, or alkynylene portion of the alkaryl group and/or may be present on the aryl portion of the alkaryl group.
- the term “amino,” as used herein, represents –N(R x ) 2 or –N + (R x ) 3 , where each R x is, independently, H, alkyl, alkenyl, alkynyl, aryl, alkaryl, cycloalkyl, or two R x combine to form a heterocycloalkyl.
- the amino group is -NH 2 .
- alkamino refers to an amino group, described herein, that is attached to an alkylene (e.g., C 1 -C 5 alkylene), alkenylene (e.g., C 2 -C 5 alkenylene), or alkynylene group (e.g., C 2 -C 5 alkenylene).
- alkylene e.g., C 1 -C 5 alkylene
- alkenylene e.g., C 2 -C 5 alkenylene
- alkynylene group e.g., C 2 -C 5 alkenylene
- the amino portion of an alkamino refers to –N(R x ) 2 or –N + (R x ) 3 , where each R x is, independently, H, alkyl, alkenyl, alkynyl, aryl, alkaryl, cycloalkyl, or two R x combine to form a heterocycloalkyl.
- the amino portion of an alkamino is -NH 2 .
- An example of an alkamino group is C 1 -C 5 alkamino, e.g., C 2 alkamino (e.g., CH 2 CH 2 NH 2 or CH 2 CH 2 N(CH 3 ) 2 ).
- heteroalkamino group one or more, e.g., 1-4, 1-3, 1, 2, 3, or 4, heteroatoms selected from N, O, and S may be present in the alkylene, alkenylene, or alkynylene portion of the heteroalkamino group.
- an alkamino group may be optionally substituted.
- the substituent may be present on the alkylene, alkenylene, or alkynylene portion of the alkamino group and/or may be present on the amino portion of the alkamino group.
- alkamide refers to an amide group that is attached to an alkylene (e.g., C 1 -C 5 alkylene), alkenylene (e.g., C 2 -C 5 alkenylene), or alkynylene (e.g., C 2 -C 5 alkenylene) group.
- alkylene e.g., C 1 -C 5 alkylene
- alkenylene e.g., C 2 -C 5 alkenylene
- alkynylene e.g., C 2 -C 5 alkenylene
- the amide portion of an alkamide refers to – C(O)-N(R x ) 2 , where each R x is, independently, H, alkyl, alkenyl, alkynyl, aryl, alkaryl, cycloalkyl, or two R x combine to form a heterocycloalkyl.
- the amide portion of an alkamide is -C(O)NH 2 .
- An alkamide group may be -(CH 2 ) 2 -C(O)NH 2 or -CH 2 -C(O)NH 2 .
- heteroalkamide group one or more, e.g., 1-4, 1-3, 1, 2, 3, or 4, heteroatoms selected from N, O, and S may be present in the alkylene, alkenylene, or alkynylene portion of the heteroalkamide group.
- an alkamide group may be optionally substituted.
- the substituent may be present on the alkylene, alkenylene, or alkynylene portion of the alkamide group and/or may be present on the amide portion of the alkamide group.
- alkylene alkenylene
- alkynylene refer to divalent groups having a specified size.
- an alkylene may contain, e.g., 1-20, 1-18, 1-16, 1-14, 1- 12, 1-10, 1-8, 1-6, 1-4, or 1-2 carbon atoms (e.g., C 1 -C 20 , C 1 -C 18 , C 1 -C 16 , C 1 -C 14 , C 1 -C 12 , C 1 -C 10 , C 1 -C 8 , C 1 -C 6 , C 1 -C 4 , or C 1 -C 2 ).
- carbon atoms e.g., C 1 -C 20 , C 1 -C 18 , C 1 -C 16 , C 1 -C 14 , C 1 -C 12 , C 1 -C 10 , C 1 -C 8 , C 1 -C 6 , C 1 -C 4 , or C 1 -C 2 ).
- an alkenylene or alkynylene may contain, e.g., 2-20, 2- 18, 2-16, 2-14, 2-12, 2-10, 2-8, 2-6, or 2-4 carbon atoms (e.g., C 2 -C 20 , C 2 -C 18 , C 2 -C 16 , C 2 -C 14 , C 2 -C 12 , C 2 - C 10 , C 2 -C 8 , C 2 -C 6 , or C 2 -C 4 ).
- Alkylene, alkenylene, and/or alkynylene includes straight-chain and branched-chain forms, as well as combinations of these.
- the divalency of an alkylene, alkenylene, or alkynylene group does not include the optional substituents on the alkylene, alkenylene, or alkynylene group.
- two gp120 binders may be attached to each other by way of a linker that includes alkylene, alkenylene, and/or alkynylene, or combinations thereof.
- Each of the alkylene, alkenylene, and/or alkynylene groups in the linker is considered divalent with respect to the two attachments on either end of alkylene, alkenylene, and/or alkynylene group.
- a linker includes -(optionally substituted alkylene)-(optionally substituted alkenylene)-(optionally substituted alkylene)-
- the alkenylene is considered divalent with respect to its attachments to the two alkylenes at the ends of the linker.
- the optional substituents on the alkenylene are not included in the divalency of the alkenylene.
- the divalent nature of an alkylene, alkenylene, or alkynylene group refers to both of the ends of the group and does not include optional substituents that may be present in an alkylene, alkenylene, or alkynylene group.
- Alkylene, alkenylene, and/or alkynylene groups can be substituted by the groups typically suitable as substituents for alkyl, alkenyl and alkynyl groups as set forth herein.
- -HCR-C ⁇ C- may be considered as an optionally substituted alkynylene and is considered a divalent group even though it has an optional substituent, R.
- Heteroalkylene, heteroalkenylene, and/or heteroalkynylene groups refer to alkylene, alkenylene, and/or alkynylene groups including one or more, e.g., 1-4, 1-3, 1, 2, 3, or 4, heteroatoms, e.g., N, O, and S.
- a polyethylene glycol (PEG) polymer or a PEG unit -(CH 2 ) 2 -O- in a PEG polymer is considered a heteroalkylene containing one or more oxygen atoms.
- PEG polyethylene glycol
- cycloalkylene refers to a divalent cyclic group linking together two parts of a compound.
- one carbon within the cycloalkylene group may be linked to one part of the compound, while another carbon within the cycloalkylene group may be linked to another part of the compound.
- a cycloalkylene group may include saturated or unsaturated non-aromatic cyclic groups.
- a cycloalkylene may have, e.g., three to twenty carbons in the cyclic portion of the cycloalkylene (e.g., a C 3 -C 7 , C 3 -C 8 , C 3 -C 9 , C 3 -C 10 , C 3 -C 11 , C 3 -C 12 , C 3 -C 14 , C 3 -C 16 , C 3 -C 18 , or C 3 -C 20 cycloalkylene).
- the cycloalkylene group includes at least one carbon-carbon double bond
- the cycloalkylene group can be referred to as a “cycloalkenylene” group.
- a cycloalkenylene may have, e.g., four to twenty carbons in the cyclic portion of the cycloalkenylene (e.g., a C 4 -C 7 , C 4 -C 8 , C 4 -C 9 . C 4 -C 10 , C 4 -C 11 , C 4 -C 12 , C 4 -C 14 , C 4 -C 16 , C 4 -C 18 , or C 4 -C 20 cycloalkenylene).
- the cycloalkylene group includes at least one carbon-carbon triple bond
- the cycloalkylene group can be referred to as a “cycloalkynylene” group.
- a cycloalkynylene may have, e.g., four to twenty carbons in the cyclic portion of the cycloalkynylene (e.g., a C 4 -C 7 , C 4 -C 8 , C 4 -C 9 , C 4 -C 10 , C 4 -C 11 , C 4 -C 12 , C 4 -C 14 , C 4 -C 16 , C 4 -C 18 , or C 8 -C 20 cycloalkynylene).
- a cycloalkylene group can be substituted by the groups typically suitable as substituents for alkyl, alkenyl and alkynyl groups as set forth herein.
- Heterocycloalkylene refers to a cycloalkylene group including one or more, e.g., 1-4, 1-3, 1, 2, 3, or 4, heteroatoms, e.g., N, O, and S.
- Examples of cycloalkylenes include, but are not limited to, cyclopropylene and cyclobutylene.
- a tetrahydrofuran may be considered as a heterocycloalkylene.
- arylene refers to a multivalent (e.g., divalent or trivalent) aryl group linking together multiple (e.g., two or three) parts of a compound.
- one carbon within the arylene group may be linked to one part of the compound, while another carbon within the arylene group may be linked to another part of the compound.
- An arylene may have, e.g., five to fifteen carbons in the aryl portion of the arylene (e.g., a C 5 -C 6 , C 5 -C 7 , C 5 -C 8 , C 5 -C 9 , C 5 -C 10 , C 5 -C 11 , C 5 -C 12 , C 5 -C 13 , C 5 -C 14 , or C 5 - C 15 arylene).
- An arylene group can be substituted by the groups typically suitable as substituents for alkyl, alkenyl and alkynyl groups as set forth herein.
- Heteroarylene refers to an aromatic group including one or more, e.g., 1-4, 1-3, 1, 2, 3, or 4, heteroatoms, e.g., N, O, and S.
- a heteroarylene group may have, e.g., two to fifteen carbons (e.g., a C 2 -C 3 , C 2 -C 4 , C 2 -C 5 , C 2 -C 6 , C 2 -C 7 , C 2 -C 8 , C 2 -C 9 .
- Substituents include, but are not limited to, alkyl, alkenyl, alkynyl, aryl, alkaryl, acyl, heteroaryl, heteroalkyl, heteroalkenyl, heteroalkynyl, heteroalkaryl, halogen, oxo, cyano, nitro, amino, alkamino, hydroxy, alkoxy, alkanoyl, carbonyl, carbamoyl, guanidinyl, ureido, amidinyl, any of the groups or moieties described above, and hetero versions of any of the groups or moieties described above.
- Substituents include, but are not limited to, F, Cl, methyl, phenyl, benzyl, OR, NR 2 , SR, SOR, SO 2 R, OCOR, NRCOR, NRCONR 2 , NRCOOR, OCONR 2 , RCO, COOR, alkyl-OOCR, SO 3 R, CONR 2 , SO 2 NR 2 , NRSO 2 NR 2 , CN, CF 3 , O CF 3 , SiR 3 , and NO 2 , wherein each R is, independently, H, alkyl, alkenyl, aryl, heteroalkyl, heteroalkenyl, or heteroaryl, and wherein two of the optional substituents on the same or adjacent atoms can be joined to form a fused, optionally substituted aromatic or nonaromatic, saturated or unsaturated ring which contains 3–8 members, or two of the optional substituents on the same atom can be joined to form an optionally substituted aromatic or non
- an optionally substituted group or moiety refers to a group or moiety (e.g., any one of the groups or moieties described above) in which one of the atoms (e.g., a hydrogen atom) is optionally replaced with another substituent.
- an optionally substituted alkyl may be an optionally substituted methyl, in which a hydrogen atom of the methyl group is replaced by, e.g., OH.
- a substituent on a heteroalkyl or its divalent counterpart, heteroalkylene may replace a hydrogen on a carbon or a hydrogen on a heteroatom such as N.
- the hydrogen atom in the group -R-NH-R- may be substituted with an alkamide substituent, e.g., -R-N[(CH 2 C(O)N(CH 3 ) 2 ]-R.
- an optional substituent is a noninterfering substituent.
- a “noninterfering substituent” refers to a substituent that leaves the ability of the conjugates described herein (e.g., conjugates of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) to either bind to viral gp41 or gp120 receptor or to inhibit the proliferation of HIV.
- the substituent may alter the degree of such activity.
- the conjugate retains the ability to bind to viral gp41 or gp120 receptor or to inhibit HIV proliferation, the substituent will be classified as “noninterfering.”
- the noninterfering substituent would leave the ability of the compound to provide antiviral efficacy based on an IC 5 0 value of 10 ⁇ M or less in a viral plaque reduction assay.
- the substituent may alter the degree of inhibition based on plaque reduction or gp120 receptor inhibition.
- hetero when used to describe a chemical group or moiety, refers to having at least one heteroatom that is not a carbon or a hydrogen, e.g., N, O, and S. Any one of the groups or moieties described above may be referred to as hetero if it contains at least one heteroatom.
- a heterocycloalkyl, heterocycloalkenyl, or heterocycloalkynyl group refers to a cycloalkyl, cycloalkenyl, or cycloalkynyl group that has one or more heteroatoms independently selected from, e.g., N, O, and S.
- An example of a heterocycloalkenyl group is a maleimido.
- a heteroaryl group refers to an aromatic group that has one or more heteroatoms independently selected from, e.g., N, O, and S.
- One or more heteroatoms may also be included in a substituent that replaced a hydrogen atom in a group or moiety as described herein.
- the substituent may also contain one or more heteroatoms (e.g., methanol).
- acyl refers to a group having the structure: , wherein R z is an optionally substituted alkyl, alkenyl, alkynyl, cycloalkyl, cycloalkenyl, cycloalkynyl, aryl, alkaryl, alkamino, heteroalkyl, heteroalkenyl, heteroalkynyl, heterocycloalkyl, heterocycloalkenyl, heterocycloalkynyl, heteroaryl, heteroalkaryl, or heteroalkamino.
- halo or “halogen,” as used herein, refers to any halogen atom, e.g., F, Cl, Br, or I.
- halo moiety if it contains at least one halogen atom, such as haloalkyl.
- haloalkyl refers to an alkyl group substituted with one or more (e.g., one, two, three, four, five, six, or more) halo groups.
- Haloalkyl groups include, but are not limited to, fluoroalkyl (e.g., trifluoromethyl and pentafluoroethyl) and chloroalkyl.
- hydroxyl represents an -OH group.
- carbonyl refers to a group having the structure:
- thiocarbonyl refers to a group having the structure: .
- phosphate represents the group having the structure:
- phosphoryl represents the group having the structure:
- sulfonyl represents the group having the structure:
- amino represents the group having the structure: , wherein R is an optional substituent.
- N-protecting group represents those groups intended to protect an amino group against undesirable reactions during synthetic procedures. Commonly used N-protecting groups are disclosed in Greene, “Protective Groups in Organic Synthesis,” 5th Edition (John Wiley & Sons, New York, 2014), which is incorporated herein by reference.
- N-protecting groups include, e.g., acyl, aryloyl, and carbamyl groups such as formyl, acetyl, propionyl, pivaloyl, t-butylacetyl, 2-chloroacetyl, 2-bromoacetyl, trifluoroacetyl, trichloroacetyl, phthaloyl, o-nitrophenoxyacetyl, ⁇ -chlorobutyryl, benzoyl, carboxybenzyl (CBz), 4-chlorobenzoyl, 4-bromobenzoyl, 4-nitrobenzoyl, and chiral auxiliaries such as protected or unprotected D, L or D, L-amino acid residues such as alanine, leucine, phenylalanine; sulfonyl-containing groups such as benzenesulfonyl and p-toluenesulfonyl; carba
- amino acid means naturally occurring amino acids and non-naturally occurring amino acids.
- naturally occurring amino acids means amino acids including Ala, Arg, Asn, Asp, Cys, Gln, Glu, Gly, His, Ile, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr, and Val.
- non-naturally occurring amino acid means an alpha amino acid that is not naturally produced or found in a mammal.
- non-naturally occurring amino acids include D-amino acids; an amino acid having an acetylaminomethyl group attached to a sulfur atom of a cysteine; a pegylated amino acid; the omega amino acids of the formula NH 2 (CH 2 )nCOOH where n is 2-6, neutral nonpolar amino acids, such as sarcosine, t-butyl alanine, t-butyl glycine, N-methyl isoleucine, and norleucine; oxymethionine; phenylglycine; citrulline; methionine sulfoxide; cysteic acid; ornithine; diaminobutyric acid; 3-aminoalanine; 3-hydroxy-D-proline; 2,4-diaminobutyric acid; 2-aminopentanoic acid; 2-aminooctanoic acid, 2-carboxy piperazine; piperazine-2-carboxylic acid, 2-amino
- amino acids are ⁇ -aminobutyric acid, ⁇ -amino- ⁇ - methylbutyrate, aminocyclopropane-carboxylate, aminoisobutyric acid, aminonorbornyl-carboxylate, L- cyclohexylalanine, cyclopentylalanine, L-N-methylleucine, L-N-methylmethionine, L-N-methylnorvaline, L- N-methylphenylalanine, L-N-methylproline, L-N-methylserine, L-N-methyltryptophan, D-ornithine, L-N- methylethylglycine, L-norleucine, ⁇ -methyl-aminoisobutyrate, ⁇ -methylcyclohexylalanine, D- ⁇ - methylalanine, D- ⁇ -methylarginine, D- ⁇ -methylasparagine, D- ⁇ -methylaspartate, D- ⁇ -methylcysteine
- amino acid residues may be charged or polar.
- Charged amino acids include alanine, lysine, aspartic acid, or glutamic acid, or non-naturally occurring analogs thereof.
- Polar amino acids include glutamine, asparagine, histidine, serine, threonine, tyrosine, methionine, or tryptophan, or non-naturally occurring analogs thereof. It is specifically contemplated that in some embodiments, a terminal amino group in the amino acid may be an amido group or a carbamate group.
- percent (%) identity refers to the percentage of amino acid residues of a candidate sequence, e.g., an Fc-IgG, or fragment thereof, that are identical to the amino acid residues of a reference sequence after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent identity (i.e., gaps can be introduced in one or both of the candidate and reference sequences for optimal alignment and non-homologous sequences can be disregarded for comparison purposes). Alignment for purposes of determining percent identity can be achieved in various ways that are within the skill in the art, for instance, using publicly available computer software such as BLAST, ALIGN, or Megalign (DNASTAR) software.
- the percent amino acid sequence identity of a given candidate sequence to, with, or against a given reference sequence is calculated as follows: 100 x (fraction of A/B) where A is the number of amino acid residues scored as identical in the alignment of the candidate sequence and the reference sequence, and where B is the total number of amino acid residues in the reference sequence.
- the percent amino acid sequence identity of the candidate sequence to the reference sequence would not equal to the percent amino acid sequence identity of the reference sequence to the candidate sequence.
- Two polynucleotide or polypeptide sequences are said to be “identical” if the sequence of nucleotides or amino acids in the two sequences is the same when aligned for maximum correspondence as described above. Comparisons between two sequences are typically performed by comparing the sequences over a comparison window to identify and compare local regions of sequence similarity.
- a “comparison window” as used herein refers to a segment of at least about 15 contiguous positions, about 20 contiguous positions, about 25 contiguous positions, or more (e.g., about 30 to about 75 contiguous positions, or about 40 to about 50 contiguous positions), in which a sequence may be compared to a reference sequence of the same number of contiguous positions after the two sequences are optimally aligned.
- the term “treating” or “to treat,” as used herein, refers to a therapeutic treatment of a viral infection (e.g., a viral infection such as an HIV infection) in a subject. In some embodiments, a therapeutic treatment may slow the progression of the viral infection, improve the subject’s outcome, and/or eliminate the infection.
- a therapeutic treatment of a viral infection in a subject may alleviate or ameliorate of one or more symptoms or conditions associated with the viral infection, diminish the extent of the viral, stabilize (i.e., not worsening) the state of the viral infection, prevent the spread of the viral infection, and/or delay or slow the progress of the viral infection, as compare the state and/or the condition of the viral infection in the absence of the therapeutic treatment.
- the average number of monomers of gp120 binder or dimers of gp120 binders conjugated to an Fc domain monomer may be from 1 to 20 (e.g., the average value of T is 1 to 2, 1 to 3, 1 to 4, 1 to 5, 5 to 10, 10 to 15, or 15 to 20). In some embodiments, the average value of T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20.
- the term “subject,” as used herein, can be a human or non-human primate, or other mammal, such as but not limited to dog, cat, horse, cow, pig, turkey, goat, fish, monkey, chicken, rat, mouse, or sheep.
- terapéuticaally effective amount refers to an amount, e.g., pharmaceutical dose, effective in inducing a desired effect in a subject or in treating a subject having a condition or disorder described herein (e.g., a viral infection, such as an HIV infection). It is also to be understood herein that a “therapeutically effective amount” may be interpreted as an amount giving a desired therapeutic and/or preventative effect, taken in one or more doses or in any dosage or route, and/or taken alone or in combination with other therapeutic agents (e.g., an antiviral agent described herein).
- an effective amount of a conjugate is, for example, an amount sufficient to prevent, slow down, or reverse the progression of the viral infection as compared to the response obtained without administration of the conjugate.
- the term “pharmaceutical composition” refers to a medicinal or pharmaceutical formulation that contains at least one active ingredient (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) as well as one or more excipients and diluents to enable the active ingredient suitable for the method of administration.
- the pharmaceutical composition of the present disclosure includes pharmaceutically acceptable components that are compatible with a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)).
- a pharmaceutically acceptable carrier refers to an excipient or diluent in a pharmaceutical composition.
- a pharmaceutically acceptable carrier may be a vehicle capable of suspending or dissolving the active conjugate (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)).
- the pharmaceutically acceptable carrier must be compatible with the other ingredients of the formulation and not deleterious to the recipient.
- the pharmaceutically acceptable carrier must provide adequate pharmaceutical stability to a conjugate described herein. The nature of the carrier differs with the mode of administration.
- a solid carrier for oral administration, a solid carrier is preferred; for intravenous administration, an aqueous solution carrier (e.g., WFI, and/or a buffered solution) is generally used.
- a pharmaceutically acceptable salt represents salts of the conjugates described herein (e.g., conjugates of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) that are, within the scope of sound medical judgment, suitable for use in methods described herein without undue toxicity, irritation, and/or allergic response.
- Pharmaceutically acceptable salts are well known in the art.
- gp120 binder refers to a moiety, such as a small molecule (e.g., temsavir, BMS-818251, DMJ-II-121, BNM-IV-147 or analogs thereof) that binds to the HIV gp120 glycoprotein.
- Gp120 binders of the invention include compounds described by formula (A-I), preferably temsavir, BMS-818251, DMJ-II-121, BNM-IV-147, or an analog thereof.
- the term “about,” as used herein, indicates a deviation of ⁇ 5%. For example, about 10% refers to from 9.5% to 10.5%. Any values provided in a range of values include both the upper and lower bounds, and any values contained within the upper and lower bounds.
- the term “(1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)”, as used herein, represents the formulas of any one of (D-IV-7), (D-IV-8), (D-V-6), (D-V-7), (D-V-8), (D-V-9), (D-VI-8), (M-I), (M-II), (M-III), (M-III-1), (M-III-2), (M-III-3), (M-III-4), (M-III-5), (M-III-6), (M-IV), (M-IV-1), (M-IV-2), (M-IV-3), (M-IV-4), (M-IV-5), (M- IV-6), (M-IV-7), (M-IV-8), (M-IV-9), (M-V), (M-V-1), (M-V-2), (M-V-3), (M-V-4), (M-V-5), (M-V-6), (M-V-7), (M-
- FIG.1 is an image depicting exemplary methods of conjugating a gp120 receptor inhibitor monomer or dimer, e.g., by way of a linker, to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide.
- FIG.2 is an image depicting a method of conjugating a gp120 binder monomer or dimer, e.g., by way of a linker, to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide by oxime conjugation to an amino acid residue, e.g., a nitrogen atom of a surface exposed lysine.
- an amino acid residue e.g., a nitrogen atom of a surface exposed lysine.
- FIG.3 is an image depicting a method of conjugating a gp120 binder monomer or dimer, e.g., by way of a linker, to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide by thioether conjugation to an amino acid residue, e.g., a nitrogen atom of a surface exposed lysine.
- FIG.4 is an image depicting a method of conjugating a gp120 binder monomer or dimer, e.g., by way of a linker, to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide by rebridged cysteine conjugation, e.g., rebridged cysteine conjugation to a pair of sulfur atoms of two hinge cysteines in an Fc domain monomer or Fc domain.
- FIG.5 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 1.
- FIG.6 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 3.
- FIG.7 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 5.
- FIG.8 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 7.
- FIG.9 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 9.
- FIG.10 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 12.
- FIG.11 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 14.
- FIG.12 is a graph showing the binding of conjugates containing gp120 binders to the gp120 protein compared to a polyclonal goat anti-gp120 HRP (PA 1 -73097, Invitrogen) positive control and an unconjugated Fc molecule negative control.
- PA 1 -73097 polyclonal goat anti-gp120 HRP
- FIG.13 is a graph showing plasma levels of a conjugate including an Fc domain having a C220S mutation (SEQ ID NO: 64) (2 mpk IV) compared to a conjugate including an Fc domain having a C220S mutation and a YTE triple mutation (SEQ ID NO: 67) (2 mpk IV) in non-human primate PK studies determined by Fc capture. This study was performed as described in Example 40.
- FIG.14 is an image depicting exemplary conjugates including a gp120 binder monomer or dimer and an Fc domain monomer or an Fc domain.
- T is representative of the drug-to-antibody ratio (DAR) and depicts that multiple monomers or dimers can be conjugated to an Fc domain monomer or an Fc domain.
- FIG.15 is a graph showing the 7-day mouse PK profiles of Conjugate 5b and Fc control (SEQ ID NO:73).
- FIG.16 is a graph showing the percent reduction in viral cytopathic effect (CPE) of HIV-1IIIB and cell viability of CEM-SS cells by AZT at a concentration of 2,000 pM, 5,000 pM, 20,000 pM, 50,000 pM, 160,000 pM, and 500,000 pM.
- CPE viral cytopathic effect
- FIG.17 is a graph showing the percent reduction in viral CPE of HIV-1IIIB and cell viability of CEM- SS cells by the temsivir buffer at a dilution of 0, 00000002, and 0.000001.
- FIG.18 is a graph showing the percent reduction in viral CPE of HIV-1IIIB and cell viability of CEM- SS cells by temsavir at a concentration of 0.64 pM, 3.2 pM, 16 pM, 80 pM, 400 pM, and 2,000 pM.
- FIG.19 is a graph showing the percent reduction in viral CPE of HIV-1IIIB and cell viability of CEM- SS cells by the conjugate buffer at a dilution of 0, 0.0000001, 0.0000004, 0.000002, 0.00001, and 0.00005.
- FIG.20 is a graph showing the percent reduction in viral CPE of HIV-1IIIB and cell viability of CEM- SS cells by Conjugate 5b at a concentration of 0.64 pM, 3.2 pM, 16 pM, 80 pM, 400 pM, and 2,000 pM.
- FIG.21 is a graph showing the percent reduction in viral CPE of HIV-1IIIB and cell viability of CEM- SS cells by Conjugate 29a at a concentration of 0.64 pM, 3.2 pM, 16 pM, 80 pM, 400 pM, and 2,000 pM.
- FIG.22 is a graph showing the percent reduction in viral CPE of HIV-1IIIB and cell viability of CEM- SS cells by Conjugate 29b at a concentration of 0.64 pM, 3.2 pM, 16 pM, 80 pM, 400 pM, and 2,000 pM.
- FIG.23 is a graph showing the percent reduction in viral CPE of HIV-1IIIB and cell viability of CEM- SS cells by Conjugate 30a at a concentration of 0.64 pM, 3.2 pM, 16 pM, 80 pM, 400 pM, and 2,000 pM.
- FIG.24 is a graph showing the percent reduction in viral CPE of HIV-1IIIB and cell viability of CEM- SS cells by Conjugate 37a at a concentration of 0.64 pM, 3.2 pM, 16 pM, 80 pM, 400 pM, and 2,000 pM.
- FIG.25 is a graph showing the 7-day mouse PK profiles of Conjugate 5b, Conjugate 29a, and Fc control (SEQ ID NO: 72).
- the disclosure features conjugates, compositions, methods for the treatment of viral infections (e.g., human immunodeficiency viral infections), and methods of synthesizing conjugates.
- the conjugates disclosed herein include monomers or dimers of viral gp120 binders (e.g., temsavir, BMS- 818251, DMJ-II-121, BNM-IV-147, or analogs thereof) conjugated to Fc monomers, Fc domains, Fc- binding peptides, albumin proteins, or albumin protein-binding peptides.
- the gp120 binder e.g., temsavir, BMS-818251, DMJ-II-121, BNM-IV-147, or analogs thereof
- the gp120 binder targets the gp120 receptor on the surface of the viral particle.
- the Fc monomers or Fc domains in the conjugates bind to Fc ⁇ Rs (e.g., FcRn, Fc ⁇ RI, Fc ⁇ RIIa, Fc ⁇ RIIc, Fc ⁇ RIIIa, and Fc ⁇ RIIIb) on immune cells, e.g., neutrophils, to activate phagocytosis and effector functions, such as antibody-dependent cell-mediated cytotoxicity (ADCC), thus leading to the engulfment and destruction of viral particles by immune cells and further enhancing the antiviral activity of the conjugates.
- Fc ⁇ Rs e.g., FcRn, Fc ⁇ RI, Fc ⁇ RIIa, Fc ⁇ RIIc, Fc ⁇ RIIIa, and Fc ⁇ RIIIb
- immune cells e.g., neutrophils
- ADCC antibody-dependent cell-mediated cytotoxicity
- the albumin or albumin-binding peptide may extend the half-life of the conjugate, for example, by binding of albumin to the recycling neon
- compositions are useful in methods for the inhibition of viral growth and in methods for the treatment of viral infections, such as those caused by an HIV-1 or HIV-2.
- Viral Infections The compounds and pharmaceutical compositions described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) can be used to treat a viral infection (e.g., an HIV-1 or HIV-2 viral infection).
- a viral infection e.g., an HIV-1 or HIV-2 viral infection.
- Viral infection refers to the pathogenic growth of a virus (e.g., the human immunodeficiency virus) in a host organism (e.g., a human subject).
- a viral infection can be any situation in which the presence of a viral population(s) is damaging to a host body.
- a subject is suffering from a viral infection when an excessive amount of a viral population is present in or on the subject’s body, or when the presence of a viral population(s) is damaging the cells or other tissue of the subject.
- the human immunodeficiency viruses are two species of Lentivirus (a subgroup of retrovirus) that causes HIV infection and over time acquired immunodeficiency syndrome (AIDS).
- AIDS is a condition in humans in which progressive failure of the immune system allows life-threatening opportunistic infections and cancers to thrive. Without treatment, average survival time after infection with HIV is estimated to be 9 to 11 years, depending on the HIV subtype.
- HIV is a sexually transmitted infection and occurs by contact with or transfer of blood, pre-ejaculate, semen, and vaginal fluids.
- Two types of HIV have been characterized: HIV-1 and HIV-2. HIV infects vital cells in the human immune system, such as helper T cells (specifically CD4+ T cells), macrophages, and dendritic cells. HIV infection leads to low levels of CD4+ T cells through a number of mechanisms, including pyroptosis of abortively infected T cells, apoptosis of uninfected bystander cells, direct viral killing of infected cells, and killing of infected CD4+ T cells by CD8+ cytotoxic lymphocytes that recognize infected cells.
- helper T cells specifically CD4+ T cells
- macrophages specifically dendritic cells
- dendritic cells dendritic cells. HIV infection leads to low levels of CD4+ T cells through a number of mechanisms, including pyroptosis of abortively infected T cells
- conjugates of the Disclosure Provided herein are synthetic conjugates useful in the treatment of viral infections (e.g., HIV infections).
- the conjugates disclosed herein include an Fc domain monomer, an Fc domain, or an albumin protein conjugated to one or more monomers gp120 binders or one or more dimers of two gp120 binders (e.g., gp120 binders selected from temsavir, BMS-818251, DMJ-II-121, BNM-IV-147, or analogs thereof).
- the dimers of two gp120 binders include a gp120 binder (e.g., a first gp120 binder of formula (A-I) or (A-II)) and a second gp120 binder (e.g., a second gp120 binder of formula(A-I) or (A-II)).
- the first and second gp120 binders are linked to each other by way of a linker.
- conjugates described herein bind to the surface of a viral particle (e.g., bind to viral gp120 receptor on the surface on an human immunodeficiency virus particle) through the interactions between the gp120 binder moieties in the conjugates and proteins on the surface of the viral particle.
- the gp120 binder disrupts gp120, an envelope glycoprotein that binds with the CD4 receptor, particularly on helper T cells. Binding to CD4 initiates a cascade of conformational changes in gp120 and gp41 that lead to the fusion of the viral membrane with the host cell membrane, allowing the spread of the virus.
- Conjugates of the invention include gp120 binder monomers and dimers conjugated to an Fc domain, Fc monomer, or Fc-binding peptide.
- the Fc domain in the conjugates described herein binds to the Fc ⁇ Rs (e.g., FcRn, Fc ⁇ RI, Fc ⁇ RIIa, Fc ⁇ RIIc, Fc ⁇ RIIIa, and Fc ⁇ RIIIb) on immune cells.
- Conjugates of the invention include gp120 binder monomers and dimers conjugated to an albumin protein or an albumin protein-binding peptide.
- the albumin protein or albumin protein-binding peptide may extend the half-life of the conjugate, for example, by binding of albumin to the recycling neonatal Fc receptor.
- Conjugates provided herein are described by any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII).
- the conjugates described herein include one or more monomers of gp120 binders conjugated to an Fc domain or an albumin protein.
- the conjugates described herein include one or more dimers of gp120 binders conjugated to an Fc domain monomer, an Fc domain, or an albumin protein.
- n is 2
- E an Fc domain monomer
- Conjugates described herein may be synthesized using available chemical synthesis techniques in the art.
- a molecule may be derivatized using conventional chemical synthesis techniques that are well known in the art.
- the conjugates described herein contain one or more chiral centers.
- the conjugates include each of the isolated stereoisomeric forms as well as mixtures of stereoisomers in varying degrees of chiral purity, including racemic mixtures. It also encompasses the various diastereomers, enantiomers, and tautomers that can be formed.
- Gp120 binders A component of the conjugates described herein is an HIV gp120 binder moiety. The gp120 binder disrupts gp120, an envelope glycoprotein that binds with the CD4 receptor, particularly on helper T-Cells.
- Binding to CD4 initiates a cascade of conformational changes in gp120 and gp41 that lead to the fusion of the viral membrane with the host cell membrane, allowing the spread of the virus.
- gp120 binders include temsavir, BMS-818251, DMJ-II-121, and BNM-IV-147.
- derivatives of temsavir, BMS-818251, DMJ-II-121, and BNM-IV-147 such as those found in the literature, have gp120 binder activity and are useful as gp120 inhibitor moieties of the compounds herein (see, for example, Lu et al. Curr. Top. Med. Chem.16(10): 1074-1090).
- Conjugates described herein are separated into two types: (1) one or more dimers of gp120 binders conjugated to an Fc domain monomer, an Fc domain, or an albumin protein and (2) one or more monomers of gp120 binders conjugated to an Fc domain monomer, an Fc domain, or an albumin protein.
- the dimers of gp120 binders are linked to each other by way of a linker, such as the linkers described herein.
- Viral gp120 binders of the invention include temsavir, BMS-818251, DMJ-II-121, BNM-IV-147, and analogs thereof, such as the viral gp120 binders of formula (A-I) and (A-II): wherein Q is selected from the group consisting of: , S is selected from the group consisting of: R 1 , R 2 , R 3 , are each independently selected from H, OH, halogen, nitrile, nitro, optionally substituted amine, optionally substituted sulfhydryl, optionally substituted carboxyl, optionally substituted C 1 -C 20 alkyl, optionally substituted C 3 -C 20 cycloalkyl, optionally substituted C 2 -C 20 alkenyl, optionally substituted C 3 -C 20 cycloalkenyl, optionally substituted C 2 -C 20 alkynyl, optionally substituted C 5 -C 20 aryl, optionally substitute
- each R 8 is independently selected from H, optionally substituted C 1 -C 20 alkyl, optionally substituted C 1 -C 20 alkylene, optionally substituted C 3 -C 20 cycloalkyl, optionally substituted C 2 -C 20 heterocycloalkyl, optionally substituted C 5 -C 15 aryl, and optionally substituted C 2 -C 15 heteroaryl; each R 9 is independently selected from optionally substituted C 1 -C 20 alkylene, optionally substituted C 3 -C 20 cycloalkyl, optionally substituted C 2 -C 20 heterocycloalkyl, optionally substituted C 5 -C 15 aryl, and optionally substituted C 2 -C 15 heteroaryl; x is 1 or 2; k is 0, 1, 2, 3, 4, or 5; Ar is selected from the group consisting of optionally substituted C 3 -C 20 cycloalkyl, optionally substituted C 2 -C 20 heterocycloalkyl, optionally substituted C 5 -C
- x is 2.
- the gp120 inhibitor is selected from temsavir, BMS-818251, DMJ-II-121, or BNM-IV- 147: Conjugates of dimers of gp120 binders linked to an Fc domain or an albumin protein
- the conjugates described herein include an Fc domain monomer, an Fc domain, an Fc-binding peptide, and albumin protein, or an albumin protein-binding peptide covalently linked to one or more dimers of gp120 binders.
- the dimers of two gp120 binders include a first gp120 binder (e.g., a first viral gp120 binder of formula (A-I) or (A-II)) and a second gp120 binder (e.g., a second viral gp120 binder of formula (A-I) or (A-II)).
- the first and second gp120 binders are linked to each other by way of a linker, such as a linker described herein.
- the first and second gp120 binders are the same.
- the first and second gp120 binders are different.
- each A 1 -L-A 2 may be independently selected (e.g., independently selected from any of the A 1 -L-A 2 structures described herein).
- E may be conjugated to 2, 3, 4, 5, 6, 7, 8, 9, 10, or more different A 1 -L-A 2 moieties.
- E is conjugated to a first A 1 -L-A 2 moiety, and a second A 1 -L-A 2 , moiety.
- each of A 1 and A 2 of the first A 1 -L-A 2 moiety and of the second A 1 -L-A 2 moiety are independently selected from any structure described by formula (A-I) and (A-II):
- x is 2.
- the first A 1 -L-A 2 moiety is conjugated specifically to lysine residues of E (e.g., the nitrogen atoms of surface exposed lysine residues of E), and the second A 1 -L-A 2 moiety is conjugated specifically to cysteine residues of E (e.g., the sulfur atoms of surface exposed cysteine residues of E).
- the first A 1 -L-A 2 moiety is conjugated specifically to cysteine residues of E (e.g., the sulfur atoms of surface exposed cysteine residues of E), and the second A 1 -L-A 2 moiety is conjugated specifically to lysine residues of E (e.g., the nitrogen atoms of surface exposed lysine residues of E).
- the disclosure provides a conjugate, or a pharmaceutically acceptable salt thereof, described by any one any one of formulas (D-I), (D-II), (D-III), (D-III-1), (D-III-2), (D-III-3), (D-III- 4), (D-III-5), (D-III-6), (D-IV), (D-IV-1), (D-IV-2), (D-IV-3), (D-IV-4), (D-IV-5), (D-IV-6), (D-IV-7), (D-IV-8), (D-IV-9), (D-V), (D-V-1), (D-V-2), (D-V-3), (D-V-4), (D-V-5), (D-V-6), (D-V-7), (D-V-8), (D-V-9), (D-VI), (D- VI-1), (D-VI-2), (D-VI-3), (D-VI-4), (D-VI-5), (D-VI-6), (D-VI-7), (D-VI-8), (D-V
- the squiggly line connected to E indicates that one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20) dimers of gp120 binders may be attached to an Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein- binding peptide.
- one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10) dimers of gp120 binders may be attached to an Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide.
- one or more dimers of gp120 binders may be attached to an Fc domain.
- the squiggly line in the conjugates described herein is not to be construed as a single bond between one or more dimers of gp120 binders and an atom in the Fc domain monomer, Fc domain, or albumin protein.
- T when T is 1, one dimer of gp120 binders may be attached to an atom in the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide.
- two dimers of gp120 binders may be attached to an atom in the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide.
- a linker in a conjugate described herein e.g., L or L’
- a linker in a conjugate described herein e.g., L or L’
- the linker when the linker has three arms, two of the arms may be attached to the first and second gp120 binders and the third arm may be attached to the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide.
- conjugates having an Fc domain covalently linked to one or more dimers of gp120 binders as represented by the formulae above, when n is 2, two Fc domain monomers (each Fc domain monomer is represented by E) dimerize to form an Fc domain.
- Conjugates of monomers of gp120 binders linked to an Fc domain monomer, an Fc domain, or an albumin protein include an Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide covalently linked to one or more monomers of gp120 binders.
- Conjugates of an Fc domain monomer or albumin protein and one or more monomers of gp120 binders may be formed by linking the Fc domain monomer, Fc domain, or albumin protein to each of the monomers of gp120 binders through a linker, such as any of the linkers described herein.
- the squiggly line connected to E indicates that one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20) monomers of gp120 binders may be attached to an Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide.
- n is 1, one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10) monomers of gp120 binders may be attached to an Fc domain monomer, Fc domain, or an albumin protein.
- one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20) monomers of gp120 binders may be attached to an Fc domain.
- the squiggly line in the conjugates described herein is not to be construed as a single bond between one or more monomers of gp120 binders and an atom in the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide.
- T when T is 1, one monomer of gp120 binder may be attached to an atom in the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide. In some embodiments, when T is 2, two monomers of gp120 binders may be attached to an atom in the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide.
- each A 1 -L may be independently selected (e.g., independently selected from any of the A 1 -L structures described herein).
- E may be conjugated to 2, 3, 4, 5, 6, 7, 8, 9, 10, or more different A 1 -L moieties.
- E is conjugated to a first A 1 -L moiety, and a second A 1 -L, moiety.
- a 1 of each of the first A 1 -L moiety and of the second A 1 -L moiety is independently selected from any structure described by formula (A-I) or (A-II):
- x is 2.
- the first A 1 -L moiety is conjugated specifically to lysine residues of E (e.g., the nitrogen atoms of surface exposed lysine residues of E), and the second A 1 -L moiety is conjugated specifically to cysteine residues of E (e.g., the sulfur atoms of surface exposed cysteine residues of E).
- the first A 1 -L moiety is conjugated specifically to cysteine residues of E (e.g., the sulfur atoms of surface exposed cysteine residues of E), and the second A 1 -L moiety is conjugated specifically to lysine residues of E (e.g., the nitrogen atoms of surface exposed lysine residues of E).
- a linker in a conjugate having an Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide covalently linked to one or more monomers of the gp120 binders described herein (e.g., L or L’) may be a divalent structure having two arms.
- One arm in a divalent linker may be attached to the monomer of the gp120 binder and the other arm may be attached to the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide.
- a conjugate containing an Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide covalently linked to one or more monomers of gp120 binders provided herein is described by any one of formulas (M-I), (M-II), (M-III), (M-III-1), (M-III- 2), (M-III-3), (M-III-4), (M-III-5), (M-III-6), (M-IV), (M-IV-1), (M-IV-2), (M-IV-3), (M-IV-4), (M-IV-5), (M-IV-6), (M-IV-7), (M-IV-8), (M-IV-9), (M-V), (M-V-1), (M-V-2), (M-V-3), (M-V-4), (M-V-5), (M-V-6), (M-V-7), (M-V- 8), (M-V-9), (M-VI), (M-VI), (M-
- Fc domain monomers and Fc domains An Fc domain monomer includes a hinge domain, a C H 2 antibody constant domain, and a C H 3 antibody constant domain.
- the Fc domain monomer can be of immunoglobulin antibody isotype IgG, IgE, IgM, IgA, or IgD.
- the Fc domain monomer can also be of any immunoglobulin antibody isotype (e.g., IgG1, IgG2a, IgG2b, IgG3, or IgG4).
- the Fc domain monomer can be of any immunoglobulin antibody allotype (e.g., IGHG1*01 (i.e., G1m(za)), IGHG1*07 (i.e., G1m(zax)), IGHG1*04 (i.e., G1m(zav)), IGHG1*03 (G1m(f)), IGHG1*08 (i.e., G1m(fa)), IGHG2*01, IGHG2*06, IGHG2*02, IGHG3*01, IGHG3*05, IGHG3*10, IGHG3*04, IGHG3*09, IGHG3*11, IGHG3*12, IGHG3*06, IGHG3*07,
- the Fc domain monomer can also be of any species, e.g., human, murine, or mouse.
- a dimer of Fc domain monomers is an Fc domain that can bind to an Fc receptor, which is a receptor located on the surface of leukocytes.
- an Fc domain monomer in the conjugates described herein may contain one or more amino acid substitutions, additions, and/or deletion relative to an Fc domain monomer having a sequence of any one of SEQ ID NOs: 1-95 and 125-153.
- an Asn in an Fc domain monomer in the conjugates as described herein may be replaced by Ala in order to prevent N- linked glycosylation (see, e.g., SEQ ID NOs: 12-15, where Asn to Ala substitution is labeled with *).
- an Fc domain monomer in the conjugates described herein may also containing additional Cys additions (see, e.g., SEQ ID NOs: 9, 10, and 11, where Cys additions are labeled with *).
- an Fc domain monomer in the conjugates as described herein includes an additional moiety, e.g., an albumin-binding peptide, a purification peptide (e.g., a hexa-histidine peptide (HHHHHH (SEQ ID NO: 99)), or a signal sequence (e.g., IL2 signal sequence MYRMQLLSCIALSLALVTNS (SEQ ID NO: 100)) attached to the N- or C-terminus of the Fc domain monomer.
- an additional moiety e.g., an albumin-binding peptide, a purification peptide (e.g., a hexa-histidine peptide (HHHHHH (SEQ ID NO: 99)), or a signal sequence (e.g., IL2 signal sequence MYRMQLLSCIALSLALVTNS (SEQ ID NO: 100)) attached to the N- or C-terminus of the Fc domain monomer.
- an Fc domain monomer in the conjugate does not contain any type of antibody variable region, e.g., VH, VL, a complementarity determining region (CDR), or a hypervariable region (HVR).
- an Fc domain monomer in the conjugates as described herein may have a sequence that is at least 95% identical (e.g., 97%, 99%, or 99.5% identical) to the sequence of any one of SEQ ID NOs: 1-95 and 125-153 shown below.
- an Fc domain monomer in the conjugates as described herein may have a sequence of any one of SEQ ID NOs: 1-95 and 125-153 shown below.
- SEQ ID NO: 1 murine Fc-IgG2a with IL2 signal sequence at the N-terminus (bold) MYRMQLLSCIALSLALVTNSPRGPTIKPCPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPIVTCVVVDVS EDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTI SKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYF MYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGK SEQ ID NO: 2: mature murine Fc-IgG2a PRGPTIKPCPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVEVHTA QTQTHREDYNSTLRVVSA
- the variant Fc domain monomer includes at least the following mutations K246X, M252Y, S254T, and T256E, where X is not Lys. In some embodiments, the variant Fc domain monomer includes at least the following mutations K246X, V309D, Q311H, and N434S, where X is not Lys. In some embodiments, the variant Fc domain monomer includes at least the following mutations K246X, M428L, and N434S, where X is not Lys. In some embodiments, the variant Fc domain further includes a mutation of position 220, e.g., a C220S mutation.
- a variant Fc domain monomer includes a sequence that is at least 95% identical (e.g., 97%, 99%, or 99.5% identical) to the sequence of any one of SEQ ID NOs: 125-153 shown below.
- a variant Fc domain monomer includes the sequence of any one of SEQ ID NOs: 125-153 shown below.
- a variant Fc domain monomer includes at least the following mutations K246X, M252Y, S254T, and T256E, where X is not Lys.
- a variant Fc domain monomer includes at least the following mutations K246X, V309D, Q311H, and N434S, where X is not Lys. In some embodiments, a variant Fc domain monomer includes at least the following mutations K246X, M428L, and N434S, where X is not Lys. In some embodiments, the substitution at K246X is selected from Ser, Gly, Ala, Thr, Asn, Gln, Arg, His, Glu, or Asp. In some embodiments, the substitution at K246X is Ser.
- SEQ ID NO: 125 mature human IgG1 Fc; X 1 (position 201) is Asn or absent; X 2 (position 220) is Cys or Ser; X 3 (position 246) is Ser, Gly, Ala, Thr, Asn, Gln, Arg, His, Glu, or Asp; X 4 (position 252) is Met or Tyr; X 5 (position 254) is Ser or Thr; X 6 (position 256) is Thr or Glu; X 7 (position 297) is Asn or Ala; X 8 (position 309) is Leu or Asp; X 9 (position 311) is Gln or His; X 10 (position 356) is Asp or Glu; and X 11 (position 358) is Leu or Met; X 12 (position 428) is Met or Leu; X 13 (position 434) is Asn or Ser; X 14 (position 447) is Lys or absent; N-terminal Fab residues are underlined; hinge residues are italicized X 1
- An Fc domain forms the minimum structure that binds to an Fc receptor, e.g., Fc-gamma receptors (i.e., Fc ⁇ receptors (Fc ⁇ R)), Fc-alpha receptors (i.e., Fc ⁇ receptors (Fc ⁇ R)), Fc-epsilon receptors (i.e., Fc ⁇ receptors (Fc ⁇ R)), and/or the neonatal Fc receptor (FcRn).
- Fc-gamma receptors i.e., Fc ⁇ receptors (Fc ⁇ R)
- Fc-alpha receptors i.e., Fc ⁇ receptors (Fc ⁇ R)
- Fc-epsilon receptors i.e., Fc ⁇ receptors (Fc ⁇ R)
- FcRn neonatal Fc receptor
- an Fc domain of the present invention binds to an Fc ⁇ receptor (e.g., FcRn, Fc ⁇ RI (CD64), Fc ⁇ RIIa (CD32), Fc ⁇ RIIb (CD32), Fc ⁇ RIIIa (CD16a), Fc ⁇ RIIIb (CD16b)), and/or Fc ⁇ RIV and/or the neonatal Fc receptor (FcRn).
- FcRn Fc ⁇ receptor
- FcRn Fc ⁇ RI
- Fc ⁇ RI CD64
- Fc ⁇ RIIa CD32
- Fc ⁇ RIIb CD32
- Fc ⁇ RIIIa CD16a
- Fc ⁇ RIIIb CD16b
- FcRn neonatal Fc receptor
- the Fc domain monomer or Fc domain of the invention is an aglycosylated Fc domain monomer or Fc domain (e.g., an Fc domain monomer or an Fc domain that maintains engagement to an Fc receptor (e.g., FcRn
- the Fc domain is an aglycosylated IgG1 variants that maintains engagement to an Fc receptor (e.g., an IgG1 having an amino acid substitution at N297 and/or T299 of the glycosylation motif).
- an Fc receptor e.g., an IgG1 having an amino acid substitution at N297 and/or T299 of the glycosylation motif.
- Exemplary aglycosylated Fc domains and methods for making aglycosylated Fc domains are known in the art, for example, as described in Sazinsky S.L. et al., Aglycosylated immunoglobulin G1 variants productively engage activating Fc receptors, PNAS, 2008, 105(51):20167-20172, which is incorporated herein in its entirety.
- the Fc domain or Fc domain monomer of the invention is engineered to enhance binding to the neonatal Fc receptor (FcRn).
- the Fc domain may include the triple mutation corresponding to M252Y/S254T/T256E (YTE) (e.g., an IgG1, such as a human or humanized IgG1 having a YTE mutation, for example SEQ ID NO: 33, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 56, or SEQ ID NO: 57).
- YTE M252Y/S254T/T256E
- the Fc domain may include the double mutant corresponding to M428L/N434S (LS) (e.g., an IgG1, such as a human or humanized IgG1 having an LS mutation, such as SEQ ID NO: 37, SEQ ID NO: 39, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 54, SEQ ID NO: 55, or SEQ ID NO: 59).
- the Fc domain may include the single mutant corresponding to N434H (e.g., an IgG1, such as a human or humanized IgG1 having an N434H mutation).
- the Fc domain may include the single mutant corresponding to C220S (e.g., and IgG1, such as a human or humanized IgG1 having a C220S mutation, such as SEQ ID NO: 34, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, SEQ ID NO: 65, SEQ ID NO: 66, SEQ ID NO: 67, or SEQ ID NO: 68).
- C220S e.g., and IgG1, such as a human or humanized IgG1 having a C220S mutation, such as SEQ ID NO: 34, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, S
- the Fc domain may include a combination of one or more of the above-described mutations that enhance binding to the FcRn.
- Enhanced binding to the FcRn may increase the half-life Fc domain-containing conjugate.
- incorporation of one or more amino acid mutations that increase binding to the FcRn e.g., a YTE mutation, an LS mutation, or an N434H mutation
- Exemplary Fc domains with enhanced binding to the FcRN and methods for making Fc domains having enhanced binding to the FcRN are known in the art, for example, as described in Maeda, A. et al., Identification of human IgG1 variant with enhanced FcRn binding and without increased binding to rheumatoid factor autoantibody, MABS, 2017, 9(5):844-853, which is incorporated herein in its entirety.
- an amino acid “corresponding to” a particular amino acid residue e.g., of a particular SEQ ID NO.
- any one of SEQ ID NOs: 1-95 and 125- 153 may be mutated to include a YTE mutation, an LS mutation, and/or an N434H mutation by mutating the “corresponding residues” of the amino acid sequence.
- a sulfur atom “corresponding to” a particular cysteine residue of a particular SEQ ID NO. should be understood to include the sulfur atom of any cysteine residue that one of skill in the art would understand to align to the particular cysteine of the particular sequence.
- the protein sequence alignment of human IgG1 (UniProtKB: P01857; SEQ ID NO: 121), human IgG2 (UniProtKB: P01859; SEQ ID NO: 122), human IgG3 (UniProtKB: P01860; SEQ ID NO: 123), and human IgG4 (UniProtKB: P01861; SEQ ID NO: 124) is provided below (aligned with Clustal Omega Multiple Pairwise Alignment).
- the alignment indicates cysteine residues (e.g., sulfur atoms of cysteine residues) that “correspond to” one another (in boxes and indicated by the • symbol).
- Cys10 of SEQ ID NO: 10 corresponds to, for example, Cys109 of IgG1, Cys106 of IgG2, Cys156 of IgG3, Cys29 of SEQ ID NO: 1, Cys9 of SEQ ID NO: 2, Cys30 of SEQ ID NO: 3, or Cys10 of SEQ ID NO: 10.
- the Fc domain or Fc domain monomer of the invention has the sequence of any one of SEQ ID NOs: 39-95 may further include additional amino acids at the N-terminus (Xaa)x and/or additional amino acids at the C-terminus (Xaa)z, wherein Xaa is any amino acid and x and z are a whole number greater than or equal to zero, generally less than 100, preferably less than 10 and more preferably 0, 1, 2, 3, 4, or 5.
- the additional amino acids are least 70% (e.g., 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100%) identical to one or more consecutive amino acids of SEQ ID NO: 103.
- the additional amino acids may be a single amino acid on the C- terminus corresponding to Lys330 of IgG1 (SEQ ID NO: 121).
- a nitrogen atom “corresponding to” a particular lysine residue of a particular SEQ ID NO. should be understood to include the nitrogen atom of any lysine residue that one of skill in the art would understand to align to the particular lysine of the particular sequence.
- human IgG1 UniProtKB: P01857; SEQ ID NO: 121
- human IgG2 UniProtKB: P01859; SEQ ID NO: 122
- human IgG3 UniProtKB: P01860; SEQ ID NO: 123
- human IgG4 UniProtKB: P01861; SEQ ID NO: 1244
- the alignment indicates lysine residues (e.g., nitrogen atoms of lysine residues) that “correspond to” one another (in boxes and indicated by the * symbol).
- Lys35 of SEQ ID NO: 10 corresponds to, for example, Lys129 of IgG1, Lys126 of IgG2, Lys176 of IgG3, Lys51 of SEQ ID NO: 1, Lys31 of SEQ ID NO: 2, Lys50 of SEQ ID NO: 3, or Lys30 of SEQ ID NO: 10.
- the Fc domain monomer includes less than about 300 amino acid residues (e.g., less than about 300, less than about 295, less than about 290, less than about 285, less than about 280, less than about 275, less than about 270, less than about 265, less than about 260, less than about 255, less than about 250, less than about 245, less than about 240, less than about 235, less than about 230, less than about 225, or less than about 220 amino acid residues).
- amino acid residues e.g., less than about 300, less than about 295, less than about 290, less than about 285, less than about 280, less than about 275, less than about 270, less than about 265, less than about 260, less than about 255, less than about 250, less than about 245, less than about 240, less than about 235, less than about 230, less than about 225, or less than about 220 amino acid residues.
- the Fc domain monomer is less than about 40 kDa (e.g., less than about 35kDa, less than about 30kDa, less than about 25kDa).
- the Fc domain monomer includes at least 200 amino acid residues (e.g., at least 210, at least 220, at least 230, at least 240, at least 250, at least 260, at least 270, at least 280, at least 290, or at least 300 amino residues).
- the Fc domain monomer is at least 20 kDa (e.g., at least 25 kDa, at least 30 kDa, or at least 35 kDa).
- the Fc domain monomer includes 200 to 400 amino acid residues (e.g., 200 to 250, 250 to 300, 300 to 350, 350 to 400, 200 to 300, 250 to 350, or 300 to 400 amino acid residues).
- the Fc domain monomer is 20 to 40 kDa (e.g., 20 to 25 kDa, 25 to 30 kDa, 35 to 40 kDa, 20 to 30 kDa, 25 to 35 kDa, or 30 to 40 KDa).
- the Fc domain monomer includes an amino acid sequence at least 90% identical (e.g., at least 95%, at least 98%) to the sequence of any one of SEQ ID NOs: 1-95 and 125-153, or a region thereof. In some embodiments, the Fc domain monomer includes the amino acid sequence of any one of SEQ ID NOs: 1-95 and 125-153, or a region thereof. In some embodiments, the Fc domain monomer includes a region of any one of SEQ ID NOs: 1- 95 and 125-153, wherein the region includes positions 220, 252, 254, and 256.
- the region includes at least 40 amino acid residues, at least 50 amino acid residues, at least 60 amino acid residues, at least 70 amino acids residues, at least 80 amino acids residues, at least 90 amino acid residues, at least 100 amino acid residues, at least 110 amino acid residues, at least 120 amino residues, at least 130 amino acid residues, at least 140 amino acid residues, at least 150 amino acid residues, at least 160 amino acid residues, at least 170 amino acid residues, at least 180 amino acid residues, at least 190 amino acid residues, or at least 200 amino acid residues.
- Fc-gamma receptors bind the Fc portion of immunoglobulin G (IgG) and play important roles in immune activation and regulation.
- IgG immunoglobulin G
- ICs immune complexes
- the human Fc ⁇ R family contains several activating receptors (Fc ⁇ RI, Fc ⁇ RIIa, Fc ⁇ RIIc, Fc ⁇ RIIIa, and Fc ⁇ RIIIb) and one inhibitory receptor (Fc ⁇ RIIb).
- Fc ⁇ R signaling is mediated by intracellular domains that contain immune tyrosine activating motifs (ITAMs) for activating Fc ⁇ Rs and immune tyrosine inhibitory motifs (ITIM) for inhibitory receptor Fc ⁇ RIIb.
- ITAMs immune tyrosine activating motifs
- ITIM immune tyrosine inhibitory motifs
- Fc ⁇ R binding by Fc domains results in ITAM phosphorylation by Src family kinases; this activates Syk family kinases and induces downstream signaling networks, which include PI3K and Ras pathways.
- the portion of the conjugates including monomers or dimers of gp120 binders bind to and inhibits viral gp120 receptor leading to inhibition of viral replication, while the Fc domain portion of the conjugates bind to Fc ⁇ Rs (e.g., FcRn, Fc ⁇ RI, Fc ⁇ RIIa, Fc ⁇ RIIc, Fc ⁇ RIIIa, and Fc ⁇ RIIIb) on immune cells and activate phagocytosis and effector functions, such as antibody-dependent cell-mediated cytotoxicity (ADCC), thus leading to the engulfment and destruction of viral particles by immune cells and further enhancing the antiviral activity of the conjugates.
- Fc ⁇ Rs e.g., FcRn, Fc ⁇ RI, Fc ⁇ RIIa, Fc ⁇ RIIc, Fc ⁇ RIIIa, and Fc ⁇ RIIIb
- ADCC antibody-dependent cell-mediated cytotoxicity
- immune cells that may be activated by the conjugates described herein include, but are not limited to, macrophages, neutrophils, eosinophils, basophils, lymphocytes, follicular dendritic cells, natural killer cells, and mast cells.
- Tissue distribution After a therapeutic enters the systemic circulation, it is distributed to the body’s tissues. Distribution is generally uneven because of different in blood perfusion, tissue binding, regional pH, and permeability of cell membranes. The entry rate of a drug into a tissue depends on the rate of blood flow to the tissue, tissue mass, and partition characteristics between blood and tissue.
- the conjugates described herein may be optimized to distribute to lung tissue.
- the conjugates have a concentration ratio of distribution in epithelial lining fluid of at least 30% the concentration of the conjugate in plasma within 2 hours after administration.
- ratio of the concentration is at least 45% within 2 hours after administration.
- the ratio of concentration is at least 55% within 2 hours after administration.
- the ratio of concentration is at least 60% within 2 hours after administration.
- a conjugate having an Fc domain (SEQ ID NO: 64) decorated with one or more small molecule antiviral inhibitors ELF levels are surprisingly ⁇ 60% of plasma exposure levels as measured by AUC across the rest of the time course indicating nearly immediate partitioning of the conjugate from plasma to the ELF in the lung.
- An albumin protein of the invention may be a naturally-occurring albumin or a variant thereof, such as an engineered variant of a naturally-occurring albumin protein.
- Variants include polymorphisms, fragments such as domains and sub-domains, and fusion proteins.
- An albumin protein may include the sequence of an albumin protein obtained from any source. Preferably the source is mammalian, such as human or bovine. Most preferably, the albumin protein is human serum albumin (HSA), or a variant thereof. Human serum albumins include any albumin protein having an amino acid sequence naturally occurring in humans, and variants thereof.
- An albumin protein coding sequence is obtainable by methods know to those of skill in the art for isolating and sequencing cDNA corresponding to human genes.
- An albumin protein of the invention may include the amino acid sequence of human serum albumin (HSA), provided in SEQ ID NO: 96 or SEQ ID NO: 97, or the amino acid sequence of mouse serum albumin (MSA), provided in SEQ ID NO: 98, or a variant or fragment thereof, preferably a functional variant or fragment thereof.
- a fragment or variant may or may not be functional, or may retain the function of albumin to some degree.
- a fragment or variant may retain the ability to bind to an albumin receptor, such as HSA or MSA, by at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, or 105% of the ability of the parent albumin (e.g., the parent albumin from which the fragment or variant is derived).
- the albumin protein may be a naturally-occurring polymorphic variant of an albumin protein, such as human serum albumin. Generally, variants or fragments of human serum albumin will have at least 5%, 10%, 15%, 20%, 30%, 40%, 50%, 60%, or 70%, and preferably 80%, 90%, 95%, 100%, or 105% or more of human serum albumin or mouse serum albumin’s ligand binding activity.
- the albumin protein may include the amino acid sequence of bovine serum albumin.
- Bovine serum albumin proteins include any albumin having an amino acid sequence naturally occurring in cows, for example, as described by Swissprot accession number P02769, and variants thereof as defined herein.
- Bovine serum albumin proteins also includes fragments of full-length bovine serum albumin or variants thereof, as defined herein.
- the albumin protein may comprise the sequence of an albumin derived from one of serum albumin from dog (e.g., Swissprot accession number P49822-1), pig (e.g., Swissprot accession number P08835-1), goat (e.g., Sigma product no.
- cat e.g., Swissprot accession number P49064-1
- chicken e.g., Swissprot accession number P19121-1
- ovalbumin e.g., chicken ovalbumin
- turkey ovalbumin e.g., Swissprot accession number O73860-1
- donkey e.g., Swissprot accession number Q5XLE4-1
- guinea pig e.g., Swissprot accession number Q6WDN9-1
- hamster e.g., as described in DeMarco et al.
- horse e.g., Swissprot accession number P35747-1
- rhesus monkey e.g., Swissprot accession number Q28522-1
- mouse e.g., Swissprot accession number P07724-1
- pigeon e.g., as defined by Khan et al. Int. J. Biol. Macromol.30(3-4),171-8 (2002)
- rabbit e.g., Swissprot accession number P49065-1
- rat e.g., Swissprot accession number P02770-1
- sheep e.g., Swissprot accession number P14639-1
- albumin proteins of the invention include variants of naturally-occurring albumin proteins.
- a variant albumin refers to an albumin protein having at least one amino acid mutation, such as an amino acid mutation generated by an insertion, deletion, or substitution, either conservative or non-conservative, provided that such changes result in an albumin protein for which at least one basic property has not been significantly altered (e.g., has not been altered by more than 5%, 10%, 15%, 20%, 25%, 30%, 35%, or 40%).
- Exemplary properties which may define the activity of an albumin protein include binding activity (e.g., including binding specificity or affinity to bilirubin, or a fatty acid such as a long-chain fatty acid), osmolarity, or behavior in a certain pH-range.
- an albumin protein variant will have at least 40%, at least 50%, at least 60%, and preferably at least 70%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% amino acid sequence identity with a naturally-occurring albumin protein, such as the albumin protein of any one of SEQ ID NOs: 96-98.
- Methods for the production and purification of recombinant human albumins are well-established (Sleep et al. Biotechnology, 8(1):42-6 (1990)), and include the production of recombinant human albumin for pharmaceutical applications (Bosse et al. J Clin Pharmacol 45(1):57-67 (2005)).
- HSA three-dimensional structure of HSA has been elucidated by X-ray crystallography (Carter et al. Science. 244(4909): 1195-8(1998)); Sugio et al. Protein Eng.12(6):439-46 (1999)).
- the HSA polypeptide chain has 35 cysteine residues, which form 17 disulfide bonds, and one unpaired (e.g., free) cysteine at position 34 of the mature protein. Cys-34 of HSA has been used for conjugation of molecules to albumin (Leger et al. Bioorg Med Chem Lett 14(17):4395-8 (2004); Thibaudeau et al. Bioconjug Chem 16(4):1000-8 (2005)), and provides a site for site-specific conjugation.
- SEQ ID NO: 96 Human serum albumin (HSA), variant 1) DAHKSEVAHRFKDLGEENFKALVLIAFAQYLQQCPFEDHVKLVNEVTEFAKTCVADESAENCDKSLHTLF GDKLCTVATLRETYGEMADCCAKQEPERNECFLQHKDDNPNLPRLVRPEVDVMCTAFHDNEETFLKKYL YEIARRHPYFYAPELLFFAKRYKAAFTECCQAADKAACLLPKLDELRDEGKASSAKQRLKCASLQKFGER AFKAWAVARLSQRFPKAEFAEVSKLVTDLTKVHTECCHGDLLECADDRADLAKYICENQDSISSKLKECC EKPLLEKSHCIAEVENDEMPADLPSLAADFVESKDVCKNYAEAKDVFLGMFLYEYARRHPDYSVVLLLRL AKTYETTLEKCCAAADPHECYAKVFDEFKPLVEEPQNLIKQNCELFEQLGEYKFQNALLVRY
- the albumin protein may be conjugated to any compound of the invention by any method well-known to those of skill in the art for producing small-molecule-protein conjugates. This may include covalent conjugation to a solvent-exposed amino acid, such as a solvent exposed cysteine or lysine.
- human serum albumin may be conjugated to a compound of the invention by covalent linkage to the sulfur atom corresponding to Cys34 of SEQ ID NO: 96 or Cys40 of SEQ ID NO: 97.
- An albumin protein of the invention may be conjugated to any compound of the invention by way of an amino acid located within 10 amino acid residues of the C-terminal or N-terminal end of the albumin protein.
- An albumin protein may include a C-terminal or N-terminal polypeptide fusion of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, or 20 or more amino acid.
- the C-terminal or N-terminal polypeptide fusion may include one or more solvent-exposed cysteine or lysine residues, which may be used for covalent conjugation of a compound of the invention (e.g., conjugation to a gp120 binder monomer or dimer, including by way of a linker).
- Albumin proteins of the invention include any albumin protein which has been engineered to include one or more solvent-exposed cysteine or lysine residues, which may provide a site for conjugation to a compound of the invention (e.g., conjugation to a gp120 binder monomer or dimer, including by way of a linker). Most preferably, the albumin protein will contain a single solvent-exposed cysteine or lysine, thus enabling site-specific conjugation of a compound of the invention. Exemplary methods for the production of engineered variants of albumin proteins that include one or more conjugation-competent cysteine residues are provided in U.S. Patent Application No. 2017/0081389, which is incorporated herein by reference in its entirety.
- albumin protein variants are those comprising a single, solvent-exposed, unpaired (e.g., free) cysteine residue, thus enabling site-specific conjugation of a linker to the cysteine residue.
- Albumin proteins which have been engineered to enable chemical conjugation to a solvent- exposed, unpaired cysteine residue include the following albumin protein variants: (a) an albumin protein having a substitution of a non-cysteine amino acid residue with a cysteine at an amino acid residue corresponding to any of L585, D1, A2, D562, A364, A504, E505, T79, E86, D129, D549, A581, D121, E82, S270, Q397, and A578 of SEQ ID NO: 96; (b) an albumin protein having an insertion of a cysteine at a position adjacent the N- or C-terminal side of an amino acid residue corresponding to any of L585, D1, A2, D562, A364, A504, E505, T79,
- the net result of the substitution, deletion, addition, or insertion events of (a), (b), (c) and/or (d) is that the number of conjugation competent cysteine residues of the polypeptide sequence is increased relative to the parent albumin sequence.
- the net result of the substitution, deletion, addition, or insertion events of (a), (b), (c) and/or (d) is that the number of conjugation competent-cysteine residues of the polypeptide sequence is one, thus enabling site-specific conjugation.
- Preferred albumin protein variants also include albumin proteins having a single solvent-exposed lysine residue, thus enabling site-specific conjugation of a linker to the lysine residue.
- Such variants may be generated by engineering an albumin protein, including any of the methods previously described (e.g., insertion, deletion, substitution, or C-terminal or N-terminal fusion).
- Albumin protein-binding peptides Conjugation of a biologically-active compound to an albumin protein-binding peptide can alter the pharmacodynamics of the biologically-active compound, including the alteration of tissue uptake, penetration, and diffusion.
- conjugation of an albumin protein-binding peptide to a compound of the invention e.g., a gp120 binder monomer or dimer, by way of a linker
- Albumin protein-binding peptides of the invention include any polypeptide having an amino acid sequence of 5 to 50 (e.g., 5 to 40, 5 to 30, 5 to 20, 5 to 15, 5 to 10, 10 to 50, 10 to 30, or 10 to 20) amino acid residues that has affinity for and functions to bind an albumin protein, such as any of the albumin proteins described herein.
- the albumin protein-binding peptide binds to a naturally occurring serum albumin, most preferably human serum albumin.
- An albumin protein-binding peptide can be of different origins, e.g., synthetic, human, mouse, or rat.
- Albumin protein-binding peptides of the invention include albumin protein-binding peptides which have been engineered to include one or more (e.g., two, three, four, or five) solvent-exposed cysteine or lysine residues, which may provide a site for conjugation to a compound of the invention (e.g., conjugation to a gp120 binder monomer or dimer, including by way of a linker). Most preferably, the albumin protein-binding peptide will contain a single solvent-exposed cysteine or lysine, thus enabling site-specific conjugation of a compound of the invention.
- Albumin protein-binding peptides may include only naturally occurring amino acid residues, or may include one or more non-naturally occurring amino acid residues. Where included, a non-naturally occurring amino acid residue (e.g., the side chain of a non-naturally occurring amino acid residue) may be used as the point of attachment for a compound of the invention (e.g., a gp120 binder monomer or dimer, including by way of a linker). Albumin protein-binding peptides of the invention may be linear or cyclic. Albumin protein- binding peptides of the invention include any albumin protein-binding peptides known to one of skill in the art, examples of which, are provided herein.
- Albumin protein-binding peptide and conjugates including an albumin protein-binding peptide, preferably bind an albumin protein (e.g., human serum albumin) with an affinity characterized by a dissociation constant, Kd, that is less than about 100 ⁇ M, preferably less than about 100 nM, and most preferably do not substantially bind other plasma proteins.
- an albumin protein e.g., human serum albumin
- Kd dissociation constant
- Specific examples of such compounds are linear or cyclic peptides, preferably between about 10 and 20 amino acid residues in length, optionally modified at the N-terminus or C-terminus or both.
- Albumin protein-binding peptides include linear and cyclic peptides comprising the following general formulae, wherein Xaa is any amino acid: SEQ ID NO: 101 Xaa-Xaa-Cys-Xaa-Xaa-Xaa-Xaa-Cys-Xaa-Xaa-Phe-Cys-Xaa-Asp-Trp-Pro-Xaa-Xaa-Xaa-Ser-Cys SEQ ID NO: 102 Val-Cys-Tyr-Xaa-Xaa-Xaa-Ile-Cys-Phe SEQ ID NO: 103 Cys-Tyr-Xaa-Pro-Gly-Xaa-Cys SEQ ID NO: 104 Asp-Xaa-Cys-Leu-Pro-Xaa-Trp-Gly-Cys-Leu-Trp SEQ ID NO: 105 Trp-Cys-
- albumin protein-binding peptides are provided in U.S. Patent Application No. 2005/0287153, which is incorporated herein by reference in its entirety.
- Conjugation of albumin protein-binding peptides An albumin protein-binding peptide of the invention may be conjugated to (e.g., by way of a covalent bond) to any compound of the invention (e.g., by way of the linker portion of a gp120 binder monomer or dimer).
- the albumin protein-binding peptide may be conjugated to any compound of the invention by any method known to those of skill in the art for producing peptide-small molecule conjugates.
- This may include covalent conjugation to the side chain group of an amino acid residue, such as a cysteine, a lysine, or a non-natural amino acid.
- covalent conjugation may occur at the C-terminus (e.g., to the C-terminal carboxylic acid, or to the side chain group of the C-terminal residue) or at the N-terminus (e.g., to the N-terminal amino group, or to the side chain group of the N- terminal amino acid).
- Linkers A linker refers to a linkage or connection between two or more components in a conjugate described herein (e.g., between two gp120 binders in a conjugate described herein, between a gp120 binder and an Fc domain monomer, an Fc domain, or an albumin protein in a conjugate described herein, and between a dimer of two gp120 binders and an Fc domain monomer, an Fc domain or an albumin protein in a conjugate described herein).
- Linkers in conjugates having an Fc domain monomer, an Fc domain, or an albumin protein covalently linked to dimers of gp120 binders In a conjugate containing an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide covalently linked to one or more dimers of gp120 binders as described herein, a linker in the conjugate (e.g., L or L’) may be a branched structure.
- a linker in a conjugate described herein may be a multivalent structure, e.g., a divalent or trivalent structure having two or three arms, respectively.
- the linker when the linker has three arms, two of the arms may be attached to the first and second gp120 binders and the third arm may be attached to an Fc domain monomer, an Fc domain, an Fc- binding peptide, an albumin protein, or an albumin protein-binding peptide.
- one arm may be attached to an Fc domain monomer, an Fc domain, or an albumin protein and the other arm may be attached to one of the two gp120 binders.
- a linker with three arms may be used to attach the two gp120 binders on a conjugate containing an Fc domain monomer, an Fc domain, or albumin protein covalently linked to one or more dimers of gp120 binders.
- a linker in a conjugate having an Fc domain monomer, an Fc domain, or an albumin protein covalently linked to one or more dimers of gp120 binders is described by formula (D-L- I): wherein L A is described by formula G A1 -(Z A1 )g1-(Y A1 )h1-(Z A2 )i1-(Y A2 )j1-(Z A3 )k1-(Y A3 )l1-(Z A4 )m1-(Y A4 )n1-(Z A5 )o1- G A2 ; L B is described by formula G B1 -(Z B1 ) g2 -(Y B1 ) h2 -(Z B2 ) i2 -(Y B2 ) j2 -(Z B3 ) k2 -(Y B3 ) l2 -(Z B4 ) m2 -(Y B4 )
- optionally substituted includes substitution with a polyethylene glycol (PEG).
- PEG polyethylene glycol
- a PEG has a repeating unit structure (-CH 2 CH 2 O-)n, wherein n is an integer from 2 to 100.
- a polyethylene glycol may be selected from any one of PEG 2 to PEG 100 (e.g., PEG 2 , PEG 3 , PEG 4 , PEG 5 , PEG 5 -PEG 10 , PEG 10 -PEG 20 , PEG 20 -PEG 30 , PEG 30 -PEG 40 , PEG 50 -PEG 60 , PEG 60 -PEG 70 , PEG 70 -PEG 80 , PEG 80 -PEG 90 , PEG 90 -PEG 100 ).
- L C may have two points of attachment to the Fc domain (e.g., two G C2 ).
- L includes a polyethylene glycol (PEG) linker.
- a PEG linker includes a linker having the repeating unit structure (-CH 2 CH 2 O-)n, where n is an integer from 2 to 100.
- a polyethylene glycol linker may covalently join a gp120 binder and E (e.g., in a conjugate of any one of formulas (M-I)-(M-X)).
- a polyethylene glycol linker may covalently join a first gp120 binder and a second gp120 binder (e.g., in a conjugate of any one of formulas (D-I)-(D-X)).
- a polyethylene glycol linker may covalently join a gp120 binder dimer and E (e.g., in a conjugate of any one of formulas (D-I)-(D-X)).
- a polyethylene glycol linker may be selected from any one of PEG 2 to PEG 100 (e.g., PEG 2 , PEG 3 , PEG 4 , PEG 5 , PEG 5 -PEG 10 , PEG 10 -PEG 20 , PEG 20 -PEG 30 , PEG 30 -PEG 40 , PEG 50 -PEG 60 , PEG 60 -PEG 70 , PEG 70 - PEG 80 , PEG 80 -PEG 90 , PEG 90 -PEG 100 ).
- L c includes a PEG linker, where L C is covalently attached to each of Q i and E.
- Linkers in conjugates having an Fc domain monomer, an Fc domain, or an albumin protein covalently linked to monomers of gp120 binders In a conjugate containing an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide covalently linked to one or more monomers of gp120 binders as described herein, a linker in the conjugate (e.g., L, or L’) may be a divalent structure having two arms.
- One arm in a divalent linker may be attached to the monomer of gp120 binder and the other arm may be attached to the Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide.
- the one or more monomers of gp120 binders in the conjugates described herein may each be, independently, connected to an atom in the Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein- binding peptide.
- each L is described by formula (M-L): J 1 -(Q 1 )g-(T 1 )h-(Q 2 )i-(T 2 )j-(Q 3 )k-(T 3 )l-(Q 4 )m-(T 4 )n-(Q 5 )o-J 2 wherein J 1 is a bond attached to A 1 ; J 2 is a bond attached to E or a functional group capable of reacting with a functional group conjugated to E (e.g., maleimide and cysteine, amine and activated carboxylic acid (e.g., carboxylic acid activated by tetrafluorophenol or trifluorophenol), thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and alkyne, and alkene and tetrazine); each of Q 1 , Q 2 , Q 3 , Q 4 , and Q 5 is, independently, optionally substitute
- each of g, h, i, j, k, l, m, n, and o is, independently, 0 or 1.
- a linker is described by formula (M-L-I): J 1 -(Q 1 )g-(T 1 )h-(Q 2 )i-(T 2 )j-(Q 3 )k-(T 3 )l-(Q 4 )m-(T 4 )n-(Q 5 )o-J 2 wherein J 1 is a bond attached to a gp120 binder; J 2 is a bond attached to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide, or a functional group capable of reacting with a functional group conjugated to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide (e
- optionally substituted includes substitution with a polyethylene glycol (PEG).
- PEG polyethylene glycol
- a PEG has a repeating unit structure (-CH 2 CH 2 O-) n , wherein n is an integer from 2 to 100.
- a polyethylene glycol may be selected from any one of PEG 2 to PEG 100 (e.g., PEG 2 , PEG 3 , PEG 4 , PEG 5 , PEG 5 -PEG 10 , PEG 10 -PEG 20 , PEG 20 -PEG 30 , PEG 30 -PEG 40 , PEG 50 -PEG 60 , PEG 60 -PEG 70 , PEG 70 -PEG 80 , PEG 80 -PEG 90 , PEG 90 -PEG 100 ).
- J 2 may have two points of attachment to the Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide (e.g., two J 2 ).
- Linking groups In some embodiments, a linker provides space, rigidity, and/or flexibility between the gp120 binders and the Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide in the conjugates described here or between two gp120 binders in the conjugates described herein.
- a linker may be a bond, e.g., a covalent bond, e.g., an amide bond, a disulfide bond, a C-O bond, a C-N bond, a N-N bond, a C-S bond, or any kind of bond created from a chemical reaction, e.g., chemical conjugation.
- a covalent bond e.g., an amide bond, a disulfide bond, a C-O bond, a C-N bond, a N-N bond, a C-S bond, or any kind of bond created from a chemical reaction, e.g., chemical conjugation.
- a linker (L or L’ as shown in any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII) includes no more than 250 atoms (e.g., 1-2, 1-4, 1-6, 1-8, 1-10, 1-12, 1-14, 1-16, 1-18, 1-20, 1-25, 1-30, 1-35, 1-40, 1-45, 1-50, 1-55, 1-60, 1-65, 1-70, 1-75, 1-80, 1-85, 1-90, 1-95, 1-100, 1-110, 1-120, 1-130, 1-140, 1-150, 1-160, 1-170, 1- 180, 1-190, 1-200, 1-210, 1-220, 1-230, 1-240, or 1-250 atom(s); 250, 240, 230, 220, 210, 200, 190, 180, 170, 160, 150, 140, 130, 120, 110, 100, 95, 90, 85, 80, 75, 70, 65, 60
- a linker includes no more than 250 non-hydrogen atoms (e.g., 1-2, 1-4, 1-6, 1-8, 1-10, 1-12, 1-14, 1-16, 1-18, 1- 20, 1-25, 1-30, 1-35, 1-40, 1-45, 1-50, 1-55, 1-60, 1-65, 1-70, 1-75, 1-80, 1-85, 1-90, 1-95, 1-100, 1-110, 1-120, 1-130, 1-140, 1-150, 1-160, 1-170, 1-180, 1-190, 1-200, 1-210, 1-220, 1-230, 1-240, or 1-250 non- hydrogen atom(s); 250, 240, 230, 220, 210, 200, 190, 180, 170, 160, 150, 140, 130, 120, 110, 100, 95, 90, 85, 80, 75, 70, 65, 60, 55, 50, 45, 40, 35, 30, 28, 26, 24, 22, 20, 18, 16, 14, 12, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 non-hydrogen
- the backbone of a linker includes no more than 250 atoms (e.g., 1-2, 1-4, 1-6, 1-8, 1-10, 1-12, 1-14, 1-16, 1-18, 1-20, 1-25, 1-30, 1-35, 1-40, 1-45, 1- 50, 1-55, 1-60, 1-65, 1-70, 1-75, 1-80, 1-85, 1-90, 1-95, 1-100, 1-110, 1-120, 1-130, 1-140, 1-150, 1-160, 1-170, 1-180, 1-190, 1-200, 1-210, 1-220, 1-230, 1-240, or 1-250 atom(s); 250, 240, 230, 220, 210, 200, 190, 180, 170, 160, 150, 140, 130, 120, 110, 100, 95, 90, 85, 80, 75, 70, 65, 60, 55, 50, 45, 40, 35, 30, 28, 26, 24, 22, 20, 18, 16, 14, 12, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 atom(s)).
- the “backbone” of a linker refers to the atoms in the linker that together form the shortest path from one part of the conjugate to another part of the conjugate.
- the atoms in the backbone of the linker are directly involved in linking one part of the conjugate to another part of the conjugate.
- hydrogen atoms attached to carbons in the backbone of the linker are not considered as directly involved in linking one part of the conjugate to another part of the conjugate.
- Molecules that may be used to make linkers (L or L’) include at least two functional groups, e.g., two carboxylic acid groups.
- two arms of a linker may contain two dicarboxylic acids, in which the first carboxylic acid may form a covalent linkage with the first gp120 binder in the conjugate and the second carboxylic acid may form a covalent linkage with the second gp120 binder in the conjugate, and the third arm of the linker may for a covalent linkage (e.g., a C-O bond) with an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide in the conjugate.
- a covalent linkage e.g., a C-O bond
- the divalent linker may contain two carboxylic acids, in which the first carboxylic acid may form a covalent linkage with one component (e.g., a gp120 binder) in the conjugate and the second carboxylic acid may form a covalent linkage (e.g., a C-S bond or a C-N bond) with another component (e.g., an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide) in the conjugate.
- dicarboxylic acid molecules may be used as linkers (e.g., a dicarboxylic acid linker).
- the first carboxylic acid in a dicarboxylic acid molecule may form a covalent linkage with a hydroxyl or amine group of the first gp120 binder and the second carboxylic acid may form a covalent linkage with a hydroxyl or amine group of the second gp120 binder.
- dicarboxylic acids molecules that may be used to linkers are found in WO 2020/252393, hereby incorporated by reference.
- dicarboxylic acid molecules such as the ones described herein, may be further functionalized to contain one or more additional functional groups.
- Dicarboxylic acids may be further functionalized, for example, to provide an attachment point to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide (e.g., by way of a linker, such as a PEG linker).
- the linking group when the gp120 binder is attached to Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide, the linking group may comprise a moiety comprising a carboxylic acid moiety and an amino moiety that are spaced by from 1 to 25 atoms. Examples of such linking groups re found in WO 2020/252393, hereby incorporated by reference.
- a linking group may include a moiety including a carboxylic acid moiety and an amino moiety, such as the ones described herein, may be further functionalized to contain one or more additional functional groups.
- linking groups may be further functionalized, for example, to provide an attachment point to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide (e.g., by way of a linker, such as a PEG linker).
- a linker such as a PEG linker
- the linking group may comprise a moiety comprising two or amino moieties (e.g., a diamino moiety) that are spaced by from 1 to 25 atoms.
- a linking group may include a diamino moiety, such as the ones described herein, may be further functionalized to contain one or more additional functional groups.
- Such diamino linking groups may be further functionalized, for example, to provide an attachment point to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide (e.g., by way of a linker, such as a PEG linker).
- a molecule containing an azide group may be used to form a linker, in which the azide group may undergo cycloaddition with an alkyne to form a 1,2,3-triazole linkage.
- a molecule containing an alkyne group may be used to form a linker, in which the alkyne group may undergo cycloaddition with an azide to form a 1,2,3-triazole linkage.
- a molecule containing a maleimide group may be used to form a linker, in which the maleimide group may react with a cysteine to form a C-S linkage.
- a molecule containing one or more haloalkyl groups may be used to form a linker, in which the haloalkyl group may form a covalent linkage, e.g., C-N and C-O linkages, with a gp120 binder.
- a linker (L or L’) may comprise a synthetic group derived from, e.g., a synthetic polymer (e.g., a polyethylene glycol (PEG) polymer).
- a linker may comprise one or more amino acid residues.
- a linker may be an amino acid sequence (e.g., a 1-25 amino acid, 1-10 amino acid, 1-9 amino acid, 1-8 amino acid, 1-7 amino acid, 1-6 amino acid, 1-5 amino acid, 1-4 amino acid, 1-3 amino acid, 1-2 amino acid, or 1 amino acid sequence).
- a linker may include one or more optionally substituted C 1 -C 20 alkylene, optionally substituted C 1 -C 20 heteroalkylene (e.g., a PEG unit), optionally substituted C 2 -C 20 alkenylene (e.g., C 2 alkenylene), optionally substituted C 2 -C 20 heteroalkenylene, optionally substituted C 2 -C 20 alkynylene, optionally substituted C 2 -C 20 heteroalkynylene, optionally substituted C 3 -C 20 cycloalkylene (e.g., cyclopropylene, cyclobutylene), optionally substituted C 2 -C 20 heterocycloalkylene, optionally substituted C 4 -C 20 cycloalkenylene, optionally substituted C 4 -C 20 heterocycloalkenylene, optionally substituted C 8 -C 20 cycloalkynylene, optionally substituted C 8 -C 20 heteroalkynylene, optionally substituted
- Conjugation chemistries Gp120 binder monomers or dimers (e.g., in a conjugate of any one of formulas (1), (2), (D-I), (D- IV)-(D-VI), or (M-I)-(M-XVII)) may be conjugated to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide, e.g., by way of a linker, by any standard conjugation chemistries known to those of skill in the art.
- conjugation chemistries are specifically contemplated, e.g., for conjugation of a PEG linker (e.g., a functionalized PEG linker) to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein- binding peptide.
- a PEG linker e.g., a functionalized PEG linker
- Covalent conjugation of two or more components in a conjugate using a linker may be accomplished using well-known organic chemical synthesis techniques and methods. Complementary functional groups on two components may react with each other to form a covalent bond.
- complementary reactive functional groups include, but are not limited to, e.g., maleimide and cysteine, amine and activated carboxylic acid, thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and alkyne, and alkene and tetrazine.
- Site-specific conjugation to a polypeptide e.g., an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein- binding peptide
- Exemplary techniques for site- specific conjugation of a small molecule to an Fc domain are provided in Agarwall. P., et al.
- amino-reactive acylating groups include, e.g., (i) an isocyanate and an isothiocyanate; (ii) a sulfonyl chloride; (iii) an acid halide; (iv) an active ester, e.g., a nitrophenylester or N- hydroxysuccinimidyl ester; (v) an acid anhydride, e.g., a mixed, symmetrical, or N-carboxyanhydride; (vi) an acylazide; and (vii) an imidoester. Aldehydes and ketones may be reacted with amines to form Schiff’s bases, which may be stabilized through reductive amination.
- a linker of the invention e.g., L or L’, such as L C of D-L-I
- E e.g., an Fc domain monomer, an Fc domain, or albumin protein
- a linker e.g., an active ester, e.g., a nitrophenylester or N- hydroxysuccinimidyl ester, or derivatives thereof (e.g., a functionalized PEG linker (e.g., azido-PEG 2 - PEG 40 -NHS ester)
- a T of e.g., DAR
- 0.5 and 10.0 e.g., about 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3,
- the E-(PEG 2 -PEG 40 )-azide can react with an Int having a terminal alkyne linker (e.g., L, or L’, such as L C of D-L-I) through click conjugation.
- an Int having a terminal alkyne linker e.g., L, or L’, such as L C of D-L-I
- the linker conjugated to E is a terminal alkyne and is conjugated to an Int having a terminal azide.
- Exemplary preparations of preparations of E-(PEG 2 -PEG 40 )-azide are described in Examples 2, 3, and 12.
- One of skill in the art would readily understand the final product from a click chemistry conjugation.
- Exemplary linking strategies e.g., methods for linking a monomer or a dimer of a neuraminidase inhibitor to E, such as, by way of a linker are further depicted in FIGS.1-4 and 14. VI.
- one or more antiviral agents may be administered in combination (e.g., administered substantially simultaneously (e.g., in the same pharmaceutical composition or in separate pharmaceutical compositions) or administered separately at different times) with a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)).
- a conjugate described herein e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)
- the antiviral agent is an antiviral agent for the treatment of HIV.
- the antiviral agent may be a nucleoside/nucleotide reverse transcriptase inhibitor, a gp120 inhibitor, a polymerase inhibitor, or a fusion protein inhibitor.
- the antiviral agent may target either the virus or the host subject.
- the antiviral agent for the treatment of HIV used in combination with a conjugate described herein may be selected from an integrase inhibitor (e.g., dolutegravir, elvitegravir, or raltegravir), a nucleoside reverse transcriptase inhibitor (NRTI) (e.g., abacavir, lamivudine, zidovudine, emtricitabine, tenofovir, emtricitabine, didanosine, or stavudine), a non-nucleoside reverse transcriptase inhibitor (NNRTI) (e.g., efavirenz, etravirine, nevirapine, rilpivirine, or delavirdine), a protease inhibitor (e.g., a nucleoside reverse transcriptase inhibitor (NRTI) (e.g., abacavir, lamivudine, zidovudine, emtricita
- any one of conjugates described herein e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)
- an antiviral vaccine e.g., a composition that elicits an immune response in a subject directed against a virus.
- the antiviral vaccine may be administered substantially simultaneously (e.g., in the same pharmaceutical composition or in separate pharmaceutical compositions) as the conjugates, or may be administered prior to or following the conjugates (e.g., within a period of 1 day, 2, days, 5, days, 1 week, 2 weeks, 3 weeks, 1 month, 2 months, 6 months, or 12 months, or more).
- the viral vaccine includes an immunogen that elicits an immune response in the subject against HIV-1 or HIV-2.
- the vaccine is administered as a nasal spray. VII. Methods Methods described herein include, e.g., methods of protecting against or treating a viral infection (e.g., an HIV infection) in a subject and methods of preventing, stabilizing, or inhibiting the growth of viral particles.
- a method of treating a viral infection (e.g., an HIV infection) in a subject includes administering to the subject a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)- (D-VI), or (M-I)-(M-XVII)) or a pharmaceutical composition thereof.
- the viral infection is cause by the human immunodeficiency virus (e.g., HIV-1 or HIV-2).
- the viral infection is caused by a resistant strain of virus.
- a method of preventing, stabilizing, or inhibiting the growth of viral particles or preventing the replication and spread of the virus includes contacting the virus or a site susceptible to viral growth with a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) or a pharmaceutical composition thereof.
- methods described herein also include methods of protecting against or treating viral infection in a subject by administering to the subject a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)).
- the method further includes administering to the subject an antiviral agent or an antiviral vaccine.
- Methods described herein also include methods of protecting against or treating a viral infection in a subject by administering to said subject (1) a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) and (2) an antiviral agent or an antiviral vaccine.
- Methods described herein also include methods of preventing, stabilizing, or inhibiting the growth of viral particles or preventing the replication or spread of a virus, by contacting the virus or a site susceptible to viral growth with (1) a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) and (2) an antiviral agent or an antiviral vaccine.
- a conjugate described herein e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)
- the conjugate described herein e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)
- the conjugate described herein is administered first, followed by administering of the antiviral agent or antiviral vaccine alone.
- the antiviral agent or antiviral vaccine is administered first, followed by administering of the conjugate described herein alone.
- the conjugate described herein and the antiviral agent or antiviral vaccine are administered substantially simultaneously (e.g., in the same pharmaceutical composition or in separate pharmaceutical compositions).
- the conjugate described herein or the antiviral agent or antiviral vaccine is administered first, followed by administering of the conjugate described herein and the antiviral agent or antiviral vaccine substantially simultaneously (e.g., in the same pharmaceutical composition or in separate pharmaceutical compositions). In some embodiments, the conjugate described herein and the antiviral agent or antiviral vaccine are administered first substantially simultaneously (e.g., in the same pharmaceutical composition or in separate pharmaceutical compositions), followed by administering of the conjugate described herein or the antiviral agent or antiviral vaccine alone.
- a conjugate described herein e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)
- an antiviral agent or antiviral vaccine when administered together (e.g., substantially simultaneously in the same or separate pharmaceutical compositions, or separately in the same treatment regimen), inhibition of viral replication of each of the conjugate and the antiviral agent or antiviral vaccine may be greater (e.g., occur at a lower concentration) than inhibition of viral replication of each of the conjugate and the antiviral agent or antiviral vaccine when each is used alone in a treatment regimen.
- a conjugate described herein may be formulated in a pharmaceutical composition for use in the methods described herein.
- a conjugate described herein may be formulated in a pharmaceutical composition alone.
- a conjugate described herein may be formulated in combination with an antiviral agent or antiviral vaccine in a pharmaceutical composition.
- the pharmaceutical composition includes a conjugate described herein (e.g., a conjugate described by any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) and pharmaceutically acceptable carriers and excipients.
- Acceptable carriers and excipients in the pharmaceutical compositions are nontoxic to recipients at the dosages and concentrations employed.
- Acceptable carriers and excipients may include buffers such as phosphate, citrate, HEPES, and TAE, antioxidants such as ascorbic acid and methionine, preservatives such as hexamethonium chloride, octadecyldimethylbenzyl ammonium chloride, resorcinol, and benzalkonium chloride, proteins such as human serum albumin, gelatin, dextran, and immunoglobulins, hydrophilic polymers such as polyvinylpyrrolidone, amino acid residues such as glycine, glutamine, histidine, and lysine, and carbohydrates such as glucose, mannose, sucrose, and sorbitol.
- buffers such as phosphate, citrate, HEPES, and TAE
- antioxidants such as ascorbic acid and methionine
- preservatives such as hex
- excipients examples include, but are not limited to, antiadherents, binders, coatings, compression aids, disintegrants, dyes, emollients, emulsifiers, fillers (diluents), film formers or coatings, flavors, fragrances, glidants (flow enhancers), lubricants, sorbents, suspensing or dispersing agents, or sweeteners.
- excipients include, but are not limited to: butylated hydroxytoluene (BHT), calcium carbonate, calcium phosphate (dibasic), calcium stearate, croscarmellose, crosslinked polyvinyl pyrrolidone, citric acid, crospovidone, cysteine, ethylcellulose, gelatin, hydroxypropyl cellulose, hydroxypropyl methylcellulose, lactose, magnesium stearate, maltitol, mannitol, methionine, methylcellulose, methyl paraben, microcrystalline cellulose, polyethylene glycol, povidone, pregelatinized starch, propyl paraben, retinyl palmitate, shellac, silicon dioxide, sodium carboxymethyl cellulose, sodium citrate, sodium starch glycolate, sorbitol, starch (corn), stearic acid, stearic acid, sucrose, talc, titanium dioxide, vitamin A, vitamin E, vitamin C, and xylit
- the conjugates herein may have ionizable groups so as to be capable of preparation as pharmaceutically acceptable salts.
- These salts may be acid addition salts involving inorganic or organic acids or the salts may, in the case of acidic forms of the conjugates herein be prepared from inorganic or organic bases.
- the conjugates are prepared or used as pharmaceutically acceptable salts prepared as addition products of pharmaceutically acceptable acids or bases.
- Suitable pharmaceutically acceptable acids and bases are well-known in the art, such as hydrochloric, sulphuric, hydrobromic, acetic, lactic, citric, or tartaric acids for forming acid addition salts, and potassium hydroxide, sodium hydroxide, ammonium hydroxide, caffeine, various amines, and the like for forming basic salts.
- Representative acid addition salts include, but are not limited to, acetate, adipate, alginate, ascorbate, aspartate, benzenesulfonate, benzoate, bisulfate, borate, butyrate, camphorate, camphorsulfonate, citrate, cyclopentanepropionate, digluconate, dodecylsulfate, ethanesulfonate, fumarate, glucoheptonate, glycerophosphate, hemisulfate, heptonate, hexanoate, hydrobromide, hydrochloride, hydroiodide, 2-hydroxy-ethanesulfonate, lactobionate, lactate, laurate, lauryl sulfate, malate, maleate, malonate, methanesulfonate, 2-naphthalenesulfonate, nicotinate, n
- alkali or alkaline earth metal salts include, but are not limited to, sodium, lithium, potassium, calcium, and magnesium, as well as nontoxic ammonium, quaternary ammonium, and amine cations, including, but not limited to ammonium, tetramethylammonium, tetraethylammonium, methylamine, dimethylamine, trimethylamine, triethylamine, and ethylamine.
- a conjugate herein or a pharmaceutical composition thereof used in the methods described herein will be formulated into suitable pharmaceutical compositions to permit facile delivery.
- a conjugate e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)
- a pharmaceutical composition thereof may be formulated to be administered intramuscularly, intravenously (e.g., as a sterile solution and in a solvent system suitable for intravenous use), intradermally, intraarterially, intraperitoneally, intralesionally, intracranially, intraarticularly, intraprostatically, intrapleurally, intratracheally, intranasally, intravitreally, intravaginally, intrarectally, topically, intratumorally, peritoneally, subcutaneously, subconjunctival, intravesicularlly, mucosally, intrapericardially, intraumbilically, intraocularally, orally (e.g., a tablet, capsule, caplet, gelcap, or syrup), topically (e.g., as a cream, gel,
- a conjugate herein or a pharmaceutical composition thereof may be in the form of, e.g., tablets, capsules, pills, powders, granulates, suspensions, emulsions, solutions, gels including hydrogels, pastes, ointments, creams, plasters, drenches, osmotic delivery devices, suppositories, enemas, injectables, implants, sprays, preparations suitable for iontophoretic delivery, or aerosols.
- the compositions may be formulated according to conventional pharmaceutical practice.
- a conjugate described herein may be formulated in a variety of ways that are known in the art.
- a conjugate described herein can be formulated as pharmaceutical or veterinary compositions.
- a conjugate described herein is formulated in ways consonant with these parameters.
- a summary of such techniques is found in Remington: The Science and Practice of Pharmacy, 22nd Edition, Lippincott Williams & Wilkins (2012); and Encyclopedia of Pharmaceutical Technology, 4th Edition, J. Swarbrick and J. C. Boylan, Marcel Dekker, New York (2013), each of which is incorporated herein by reference.
- Formulations may be prepared in a manner suitable for systemic administration or topical or local administration.
- Systemic formulations include those designed for injection (e.g., intramuscular, intravenous or subcutaneous injection) or may be prepared for transdermal, transmucosal, or oral administration.
- the formulation will generally include a diluent as well as, in some cases, adjuvants, buffers, and preservatives.
- the conjugates can be administered also in liposomal compositions or as microemulsions.
- Systemic administration may also include relatively noninvasive methods such as the use of suppositories, transdermal patches, transmucosal delivery and intranasal administration.
- Oral administration is also suitable for conjugates herein. Suitable forms include syrups, capsules, and tablets, as is understood in the art.
- compositions can be administered parenterally in the form of an injectable formulation.
- Pharmaceutical compositions for injection can be formulated using a sterile solution or any pharmaceutically acceptable liquid as a vehicle.
- Formulations may be prepared as solid forms suitable for solution or suspension in liquid prior to injection or as emulsions.
- Pharmaceutically acceptable vehicles include, but are not limited to, sterile water, physiological saline, and cell culture media (e.g., Dulbecco’s Modified Eagle Medium (DMEM), ⁇ -Modified Eagles Medium ( ⁇ -MEM), F-12 medium).
- Such injectable compositions may also contain amounts of nontoxic auxiliary substances such as wetting or emulsifying agents, pH buffering agents, such as sodium acetate and sorbitan monolaurate.
- Formulation methods are known in the art, see e.g., Pharmaceutical Preformulation and Formulation, 2nd Edition, M. Gibson, Taylor & Francis Group, CRC Press (2009).
- the pharmaceutical compositions can be prepared in the form of an oral formulation.
- Formulations for oral use include tablets containing the active ingredient(s) in a mixture with non-toxic pharmaceutically acceptable excipients.
- excipients may be, for example, inert diluents or fillers (e.g., sucrose, sorbitol, sugar, mannitol, microcrystalline cellulose, starches including potato starch, calcium carbonate, sodium chloride, lactose, calcium phosphate, calcium sulfate, or sodium phosphate); granulating and disintegrating agents (e.g., cellulose derivatives including microcrystalline cellulose, starches including potato starch, croscarmellose sodium, alginates, or alginic acid); binding agents (e.g., sucrose, glucose, sorbitol, acacia, alginic acid, sodium alginate, gelatin, starch, pregelatinized starch, microcrystalline cellulose, magnesium aluminum silicate, carboxymethylcellulose sodium, methylcellulose, hydroxypropyl methylcellulose, ethylcellulose, polyvinylpyrrolidone, or polyethylene glycol); and lubricating agents, glidants, and antiad
- Formulations for oral use may also be provided as chewable tablets, or as hard gelatin capsules wherein the active ingredient is mixed with an inert solid diluent (e.g., potato starch, lactose, microcrystalline cellulose, calcium carbonate, calcium phosphate or kaolin), or as soft gelatin capsules wherein the active ingredient is mixed with water or an oil medium, for example, peanut oil, liquid paraffin, or olive oil.
- an inert solid diluent e.g., potato starch, lactose, microcrystalline cellulose, calcium carbonate, calcium phosphate or kaolin
- water or an oil medium for example, peanut oil, liquid paraffin, or olive oil.
- Powders, granulates, and pellets may be prepared using the ingredients mentioned above under tablets and capsules in a conventional manner using, e.g., a mixer, a fluid bed apparatus, or a spray drying equipment.
- Formulations for oral use may also be provided as chewable tablets, or as hard gelatin capsules wherein the active ingredient is mixed with an inert solid diluent (e.g., potato starch, lactose, microcrystalline cellulose, calcium carbonate, calcium phosphate or kaolin), or as soft gelatin capsules wherein the active ingredient is mixed with water or an oil medium, for example, peanut oil, liquid paraffin, or olive oil.
- an inert solid diluent e.g., potato starch, lactose, microcrystalline cellulose, calcium carbonate, calcium phosphate or kaolin
- an oil medium for example, peanut oil, liquid paraffin, or olive oil.
- Powders, granulates, and pellets may be prepared using the ingredients mentioned above under tablets and capsules in a conventional manner using, e.g., a mixer, a fluid bed apparatus or a spray drying equipment.
- Dissolution or diffusion controlled release of a conjugate described herein e.g., a conjugate of any one of (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)
- a pharmaceutical composition thereof can be achieved by appropriate coating of a tablet, capsule, pellet, or granulate formulation of the conjugate, or by incorporating the conjugate into an appropriate matrix.
- a controlled release coating may include one or more of the coating substances mentioned above and/or, e.g., shellac, beeswax, glycowax, castor wax, carnauba wax, stearyl alcohol, glyceryl monostearate, glyceryl distearate, glycerol palmitostearate, ethylcellulose, acrylic resins, dl-polylactic acid, cellulose acetate butyrate, polyvinyl chloride, polyvinyl acetate, vinyl pyrrolidone, polyethylene, polymethacrylate, methylmethacrylate, 2-hydroxymethacrylate, methacrylate hydrogels, 1,3 butylene glycol, ethylene glycol methacrylate, and/or polyethylene glycols.
- shellac beeswax, glycowax, castor wax, carnauba wax, stearyl alcohol, glyceryl monostearate, glyceryl distearate, glyce
- the matrix material may also include, e.g., hydrated methylcellulose, carnauba wax and stearyl alcohol, carbopol 934, silicone, glyceryl tristearate, methyl acrylate-methyl methacrylate, polyvinyl chloride, polyethylene, and/or halogenated fluorocarbon.
- the pharmaceutical composition may be formed in a unit dose form as needed.
- the amount of active component, e.g., a conjugate described herein e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)
- a suitable dose within the designated range is provided (e.g., a dose within the range of 0.01-100 mg/kg of body weight).
- conjugates herein may be administered by any appropriate route for treating or protecting against a viral infection (e.g., an HIV infection), or for preventing, stabilizing, or inhibiting the proliferation or spread of a virus (e.g., an HIV virus).
- Conjugates described herein may be administered to humans, domestic pets, livestock, or other animals with a pharmaceutically acceptable diluent, carrier, or excipient.
- administering includes administration of any of the conjugates described herein (e.g., conjugates of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) or compositions intramuscularly, intravenously (e.g., as a sterile solution and in a solvent system suitable for intravenous use), intradermally, intraarterially, intraperitoneally, intralesionally, intracranially, intraarticularly, intraprostatically, intrapleurally, intratracheally, intranasally, intravitreally, intravaginally, intrarectally, topically, intratumorally, peritoneally, subcutaneously, subconjunctival, intravesicularlly, mucosally, intrapericardially, intraumbilically, intra
- an antiviral agent if an antiviral agent is also administered in addition to a conjugate described herein, the antiviral agent or a pharmaceutical composition thereof may also be administered in any of the routes of administration described herein.
- the dosage of a conjugate described herein e.g., a conjugate of any one of formulas (1), (2), (D- I), (D-IV)-(D-VI), or (M-I)-(M-XVII)
- pharmaceutical compositions thereof depends on factors including the route of administration, the disease to be treated (e.g., the extent and/or condition of the viral infection), and physical characteristics, e.g., age, weight, general health, of the subject.
- the amount of the conjugate or the pharmaceutical composition thereof contained within a single dose may be an amount that effectively prevents, delays, or treats the viral infection without inducing significant toxicity.
- a pharmaceutical composition may include a dosage of a conjugate described herein ranging from 0.01 to 500 mg/kg (e.g., 0.01, 0.1, 0.2, 0.3, 0.4, 0.5, 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 100, 150, 200, 250, 300, 350, 400, 450, or 500 mg/kg) and, in a more specific embodiment, about 0.1 to about 30 mg/kg and, in a more specific embodiment, about 1 to about 30 mg/kg.
- a conjugate described herein e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)
- an antiviral agent or antiviral vaccine are administered in combination (e.g., substantially simultaneously in the same or separate pharmaceutical compositions, or separately in the same treatment regimen)
- the dosage needed of the conjugate described herein may be lower than the dosage needed of the conjugate if the conjugate was used alone in a treatment regimen.
- a conjugate described herein e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D- VI), or (M-I)-(M-XVII)
- a pharmaceutical composition thereof may be administered to a subject in need thereof, for example, one or more times (e.g., 1-10 times or more; 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 times) daily, weekly, monthly, biannually, annually, or as medically necessary. Dosages may be provided in either a single or multiple dosage regimens. The timing between administrations may decrease as the medical condition improves or increase as the health of the patient declines.
- Example 1 Preparation of Fc constructs Reverse translations of the amino acids comprising the protein constructs (SEQ ID NOs: 1, 3, 5, 7, 9, 12, and 14) were synthesized by solid-phase synthesis.
- the oligonucleotide templates were cloned into pcDNA3.1 (Life Technologies, Carlsbad, CA, USA) at the cloning sites BamHI and XhoI (New England Biolabs, Ipswich, MA, USA) and included signal sequences derived from the human Interleukin-2 or human albumin.
- the pcDNA3.1 plasmids were transformed into Top10 E. coli cells (LifeTech). DNA was amplified, extracted, and purified using the PURELINK® HiPure Plasmid Filter Maxiprep Kit (LifeTech). The plasmid DNA is delivered, using the EXPIFECTAMINETM 293 Transfection Kit (LifeTech), into HEK-293 cells per the manufacturer’s protocol.
- FIGs.5-11 show non-reducing and reducing SDS-PAGE of an Fc domain formed from Fc domain monomers having the sequences of SEQ ID NOs: 1, 3, 5, 7, 9, 12, and 14, respectively.
- Example 2
- PEG4-azido Fc 0.05M PEG4-azidoNHS ester PBS buffer solution (9.88 mL, 494.0 ⁇ mol, 9.5 equivalents) was added to a solution of h-IgG1 Fc (SEQ ID NO: 4) (3027 mg in 213.0 mL of pH 7.4 PBS, MW ⁇ 58,200 Da, 16.5 ⁇ mol) and the mixture was shaken gently for 2 hours at ambient temperature. The solution was concentrated by using 10 centrifugal concentrators (30,000 MWCO, 15 mL) to a volume of ⁇ 1.5 mL. The crude mixture was diluted 1:10 in PBS pH 7.4, and concentrated again. This wash procedure was repeated for total of three times.
- the small molecule reagent was removed with this wash procedure.
- the concentrated Fc-PEG4-azide (SEQ ID NO: 4) was diluted to 213.0 mL with pH 7.4 PBS 1x buffer and ready for Click conjugation.
- the purified material was quantified using a NANODROPTM UV visible spectrophotometer (using a calculated extinction coefficient based on the amino acid sequence of h-IgG1). Yield is quantitative after purification.
- the Fc-PEG4-azide (SEQ ID NO: 35) was prepared analogously.
- Conjugates and properties *The terminal Lys residue of the Fc domain may be cleaved upon expression and purification, e.g., SEQ ID NO: 64 coverts to SEQ ID NO: 73
- Example 9 General procedure for purification of conjugates. The crude mixture was diluted 1:10 in PBS pH 7.4, and purified using MabSelect Sure Resin (GE Healthcare, Chicago, IL, USA), followed by size exclusion chromatography. (HiLoad 26/600 Superdex200 pg, GE Healthcare, Chicago, IL, USA). Fractions containing purified conjugate were pooled and concentrated to approximately 20 mg/mL using a centrifugal concentrator (30,000 MWCO).
- Purified material was quantified using a NANODROPTM UV visible spectrophotometer using a calculated extinction coefficient based on the amino acid sequence of hIgG1 Fc(myc).
- Purified molecules were analyzed using 4-12% Bis Tris SDS PAGE gels by loading 1 ⁇ g of each molecule into the gel, and staining using Instant Blue (Expedeon, San Diego, CA, USA). Each gel included a molecular weight ladder with the indicated molecular weight standards. Yields were calculated and purity determined by Agilent Analytical HPLC. Product peak and MW were found by MALDI MS and a final DAR calculated.
- Example 10 Example 10
- gp120 glycoprotein binding assay Nunc MaxiSorp flat-bottom 96-well plates (12-565-136, Fisher Scientific) were coated with recombinant HIV-1 GP120 (SAE0071, Sigma) at 2 ⁇ g/mL in PBS (pH 7.4) (10-010-049, Fisher Scientific) overnight at 4°C (100 ⁇ L, 0.2 ⁇ g/well). Plates were washed (5 x 300 ⁇ L) with wash buffer (PBS 0.05% Tween 20) and blocked with 1% BSA (A5611-10G, Sigma; 200 ⁇ L/well) in wash buffer for 1 h at room temp on an orbital microplate shaker at 500 rpm (BT908, BT LabSystems).
- the blocking agent was removed and wells incubated with 3-fold serial dilutions of conjugate in sample diluent (0.5% BSA in PBS 0.025% Tween 20) starting at 1 ⁇ M for 1 h with shaking at room temp. After 5 x 300 ⁇ L washes, the plates were incubated with HRP conjugated donkey anti-human IgG Fc F(ab’)2 (709-036-098, Jackson ImmunoResearch) secondary antibody diluted 1:1,000 in sample diluent for 1 h with shaking at room temp. Plates were then washed (8 x 300 ⁇ L) and developed with TMB substrate (BD555214, Fisher Scientific) for 3-5 minutes at room temp.
- sample diluent (0.5% BSA in PBS 0.025% Tween 20
- HRP conjugated donkey anti-human IgG Fc F(ab’)2 709-036-098, Jackson ImmunoResearch
- Activity of pre-conjugation intermediate (Int) compounds in an in vitro cell fusion assay Activity of HIV compounds was determined in an assay designed to measure the inhibition of cell- cell fusion which is an important step in the HIV infection process. Briefly, this assay measures the fusion of two cell lines, HeLa-CD4-LTR- ⁇ -Gal (catalog #1294) and HL2/3 cells (catalog #1294), obtained from the AIDS Research Reagent and Reference Program (Rockville, MD).
- HeLa-CD4-LTR- ⁇ -Gal cells were plated at a density of 5 x 10 3 cells per well in a volume of 50 ⁇ L with 50 ⁇ L of nine serial logarithmic dilutions of compound in triplicate for one hour at 37oC/5% CO 2 . Following the incubation, 100 ⁇ L of HL2/3 cells were added to the plates. The cultures were incubated for an additional 48 hours at 37oC/5% CO 2 . Following the incubation, efficacy plates were evaluated for ⁇ -galactosidase production using a chemiluminescent substrate and toxicity plates were stained with XTT to evaluate cell viability. In these studies, cytotoxicity was also evaluated (TC 50 ).
- Test materials were derived by measuring the reduction of the tetrazolium dye XTT (2,3-bis(2-methoxy-4-nitro-5-sulfophenyl)-5- [(phenylamino)carbonyl]-2H-tetrazolium hydroxide).
- XTT in metabolically active cells is metabolized by the mitochondrial enzyme NADPH oxidase to a soluble formazan product.
- XTT solution was prepared daily as a stock of 1 mg/mL in RPMI-1640 without additives.
- Phenazine methosulfate (PMS) solution was prepared at 0.15 mg/mL in DPBS and stored in the dark at -20°C.
- XTT/PMS stock was prepared immediately before use by adding 40 ⁇ L of PMS per mL of XTT solution.50 ⁇ L of XTT/PMS was added to each well of the plate and the plate incubated for 4 hours at 37°C. The 4 hour incubation has been empirically determined to be within the linear response range for XTT dye reduction with the indicated numbers of cells for each assay.
- the plates were sealed and inverted several times to mix the soluble formazan product and the plate was read at 450 nm (650 nm reference wavelength) with a Molecular Devices SpectraMax Plus 38496 well plate format spectrophotometer. This assay identified several compounds with EC 50 values within 10-fold of the benchmark compound (Temsavir) (Table 5).
- This solution was used for preparing other PEG4-azido Fc with a variety of DAR values by adjusting the equivalents of this PEG4-azido NHS ester PBS solution.
- the concentrated Fc-PEG4-azide was diluted to 8.80 mL with pH 7.4 PBS buffer and ready for Click conjugation.
- the purified material was quantified using a NANODROPTM UV visible spectrophotometer (using a calculated extinction coefficient based on the amino acid sequence of h-IgG1). Yield was quantitative after purification.
- Example 13 Synthesis of Int-12 Int-12 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference.
- Example 14 Synthesis of Int-13 Int-13 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference.
- HeLa-CD4-LTR- ⁇ -Gal cells were plated at a density of 5 x 10 3 cells per well in a volume of 50 ⁇ L, with 50 ⁇ L of nine serial half-logarithmic dilutions of compound in triplicate for one hour at 37oC/5% CO 2 . Following the incubation, 100 ⁇ L of HL2/3 cells were added to the plates. The cultures were incubated for an additional 48 hours at 37oC/5% CO 2 . Following the incubation, efficacy plates were evaluated for ⁇ -galactosidase production using a chemiluminescent substrate and toxicity plates were stained with XTT to evaluate cell viability. In these studies, cytotoxicity was also evaluated (TC 50 ).
- Test materials were derived by measuring the reduction of the tetrazolium dye XTT (2,3-bis(2-methoxy-4-nitro-5-sulfophenyl)-5- [(phenylamino)carbonyl]-2H-tetrazolium hydroxide).
- XTT in metabolically active cells is metabolized by the mitochondrial enzyme NADPH oxidase to a soluble formazan product.
- XTT solution was prepared daily as a stock of 1 mg/mL in RPMI-1640 without additives.
- Phenazine methosulfate (PMS) solution was prepared at 0.15 mg/mL in DPBS and stored in the dark at -20°C.
- XTT/PMS stock was prepared immediately before use by adding 40 ⁇ L of PMS per mL of XTT solution. Fifty ⁇ L (50 ⁇ L) of XTT/PMS was added to each well of the plate and the plate incubated for 4 hours at 37°C. The 4 hour incubation has been empirically determined to be within the linear response range for XTT dye reduction with the indicated numbers of cells for each assay. The plates were sealed and inverted several times to mix the soluble formazan product and the plate was read at 450 nm (650 nm reference wavelength) with a Molecular Devices SpectraMax Plus 38496 well plate format spectrophotometer.
- This assay identified four compounds with EC 50 values approximately equal to the benchmark compound (Temsavir) (Table 6). These compounds were highly potent at inhibiting cell fusion with EC 50 values of less than 0.9 nM. One of these compounds, Int-17, also demonstrated no apparent loss of activity upon conjugation to an hIgG1 Fc (conjugate 5); this was an important finding. Lastly, no compounds showed cytotoxicity at the concentrations tested in this study. Therefore, for the most active compounds the difference between EC 50 and cytotoxicity is greater than 10,000-fold. A prior fusion inhibition study also identified several highly active compounds (Int-2 and Int-4). However, both compounds lost significant potency upon conjugation (conjugates 2 and 3, respectively), further emphasizing the significance of conjugate 4. Table 6.
- Example 20 Synthesis of DMJ-II-121 DMJ-II-12 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference.
- Example 21 Synthesis of Conjugate 8 A solution of azido functionalized Fc (50 mg, 28.43 mL, 0.862 ⁇ mol, 1.76 mg/mL; SEQ ID NO: 64, Example 2) was added to a 50 mL centrifuge tube following by addition of alkyne derivatized small molecule (15.83 mg, 0.012 mmol, Int-15, Example 16) in EPPES at pH 8.5, and a solution of copper (II) sulfate (1.1 mg, 0.0043 mmol) in water mixed with THTPA (0.43 mL, 0.0216 mmol, 50nM in water), aminoguanidine HCl (2.16 mL, 100 mM in water), and sodium ascorbate (2.16 mL, 100 mM in water).
- THTPA 0.
- Example 22 Synthesis of Int-18 Int-18 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference.
- Example 23 Synthesis of Int-19 Int-19 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference.
- Example 24 Synthesis of Int-20 Int-20 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference.
- Example 25 Synthesis of Int-20 Int-20 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference.
- Example 28 Synthesis of Int-23 Int-23 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference.
- Example 29 Synthesis of Int-24 Int-24 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference.
- Example 30 Synthesis of Int-25 Int-25 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference.
- Example 31 Synthesis of Int-25 Int-25 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference.
- Example 33 Synthesis of Int-53 To a -15 °C stirring solution of N ⁇ -Boc-N ⁇ -Cbz-L-ornithine (1.00 g, 2.729 mmol) and N- methylmorpholine (300 uL, 2.729 mmol) in THF (10.0 mL), it was added isobutylchlorofromate (355 uL, 2.729 mmol). After stirring for 5 minutes, a freshly prepared solution of sodium borohydride (310 mg, 8.188 mmol) in water (4.0 mL) was added. Upon reaction completion, water (10 mL) was added and the temperature raised to ambient, while stirring continued for 1 h.
- sodium borohydride 310 mg, 8.188 mmol
- Step b Under hydrogen atmosphere, a suspension of the product from step a (2.729 mmol, theoretical) and 20% palladium hydroxide on carbon (500 mg) in MeOH (20 mL), was stirred until full consumption of the starting material. The mixture was filtered and the filtrate concentrated per rotatory evaporation. Residual volatiles were evaporated under high vacuum. This material was used in the next step without further purification.
- Step c To a 0 °C stirring solution of step b product (150 mg, 0.687 mmol), propargyl-PEG4-acid (179 mg, 0.687 mmol), and DIPEA (0.359 mL, 2.061 mmol) in DMF (4.0 mL) and DCM (0.5 mL), was added HATU (266 mg, 0.701 mmol). The temperature was raised to ambient and stirring was continued until complete as determined by LCMS. All the volatiles were removed per rotatory evaporation.
- Step e To a 0 °C stirring solution of step d product (31 mg, 0.078 mmol), example XYZ step XYYY (40 mg, 0.078 mmol), HOBt hydrate (36 mg, 0.235 mmol, ⁇ 80%) and DIPEA (0.082 mL, 0.469 mmol) in DMF (3.0 mL) and DCM (0.5 mL), was added HATU (89 mg, 0.235 mmol). The temperature was raised to ambient and stirring was continued until complete as determined by LCMS. All the volatiles were removed per rotatory evaporation.
- step b To a 0 °C stirring solution of step b product (150 mg, 0.687 mmol), propargyl-PEG4-acid (179 mg, 0.687 mmol), and DIPEA (0.359 mL, 2.061 mmol) in DMF (4.0 mL) and DCM (0.5 mL), was added HATU (266 mg, 0.701 mmol). The temperature was raised to ambient and stirring was continued until complete as determined by LCMS. All the volatiles were removed per rotatory evaporation. The residue was purified by RP-C18 column using an Isco CombiFlash liquid chromatography eluted with 0% to 100% water and methanol, no modifier. Yield 0.186 g, 59%.
- step d product 31 mg, 0.078 mmol
- the triazole acid described in Example 5 40 mg, 0.078 mmol
- HOBt hydrate 36 mg, 0.235 mmol, ⁇ 80%
- DIPEA 0.082 mL, 0.469 mmol
- HATU 89 mg, 0.235 mmol
- Step b To a solution of the step-a product (1.56 g, 4.57 mmol) in anhydrous DMF (5 ml) was added K 2 CO 3 (691 mg, 5 mmol) and 2,2-dimethyl-1,3-dioxan-5-amine (779.5 mg, 5.94 mmol). The mixture was heated at 70°C overnight, then cooled to room temperature.
- Step c To a solution of the step-b product (760.9 mg, 1.94 mmol) in anhydrous DMF (3 ml) was added DIPEA (501.5 mg, 3.88 mmol), 3(-(Boc)-aminopropyl bromide (695.3 mg, 2.92 mmol) and NaI (75 mg, 0.5 mmol). The mixture was stirred at 50°C for 3 days.
- Step d The step-c product (679.6 mg, 1.24 mmol) was dissolved in MeOH (15 ml) and added with AcOH (74.3 mg, 1.24 mmol)) and Pd(OH)2 (20% on carbon, 350 mg). The resulting mixture was stirred under hydrogen atmosphere for 3 hours. Pd(OH) 2 /C was filtered off, and the filtrate was concentrated by rotary evaporation and further dried under high vacuum. Yield 588.5 mg, quantitative yield.
- Step e To a solution of the step-d product (588.5 mg, 1.24 mmol) in anhydrous DMF (2 ml) and THF (2 ml) was added K 2 CO 3 (514 mg 3.72 mmol) and propargyl-PEG4-mesyl ester (465.5 mg, 1.5 mmol). The resulting mixture was heated at 70°C for 1 day. The salt was filtered off, and the filtrate was concentrated and purified by HPLC (0% to 70% acetonitrile and water). Yield 253.2 mg, 32.5%.
- Step f The step-e product (253.2 mg, 0.403 mmol) was dissolved in acetonitrile/water (1:1, 4 ml) and added with 6N HCl aqueous solution (0.5 ml, 3 mmol). The reaction was stirred at 60°C for 5 hours, then cooled to room temperature. After the pH was adjusted to about 8 with 1M KOH (3.5 ml), the solution was lyophilized. The residue was then re-dissolved in MeOH and KCl was filtered off. The filtrate was concentrated by rotary evaporation and further dried under high vacuum.
- TFA 15 ⁇ l was added, followed by the addition of Azido-PEG4-trifluorophenyl ester ( 12.2 mg, 0.029 mmol), a pre-mixed solution of THPTA (3.5 mg, 0.008 mmol) and sodium ascorbate (40 mg, 0.202 mmol) in water (0.5 ml) and Cu 2 SO 4 (1.6 mg, 0.01 mmol).
- the ice-water bath was removed, and the reaction mixture was stirred for 30 minutes. It was then directly purified by RPLC (50 g, 5% to 80% acetonitrile and water, using 0.1% TFA as modifier). Yield 30.7 mg, 68.8%.
- HIV lead compounds in an in vitro cell fusion assay
- Activity of HIV compounds was determined in an assay designed to measure the inhibition of cell- cell fusion mediated by gp120 and CD4 interaction which is an important step in the HIV infection process. Briefly, this assay measures the fusion of two cell lines, HeLa-CD4-LTR- ⁇ -Gal (catalog #1294) and HL2/3 cells (catalog #1299), obtained from the AIDS Research Reagent and Reference Program (Rockville, MD).
- HeLa-CD4-LTR- ⁇ -Gal cells were plated at a density of 5 x 10 3 cells per well in a volume of 50 ⁇ L, with 50 ⁇ L of nine serial half-logarithmic dilutions of compound in triplicate for one hour at 37 oC/5% CO 2 . Following the incubation, 100 ⁇ L of HL2/3 cells were added to the plates. The cultures were incubated for an additional 48 hours at 37 oC/5% CO 2 . Following the incubation, efficacy plates were evaluated for ⁇ -galactosidase production using a chemiluminescent substrate and toxicity plates were stained with XTT to evaluate cell viability. In these studies cytotoxicity was also evaluated (TC 50 ).
- Test materials were derived by measuring the reduction of the tetrazolium dye XTT (2,3-bis(2-methoxy-4-nitro-5-sulfophenyl)-5- [(phenylamino)carbonyl]-2H-tetrazolium hydroxide).
- XTT in metabolically active cells is metabolized by the mitochondrial enzyme NADPH oxidase to a soluble formazan product.
- XTT solution was prepared daily as a stock of 1 mg/mL in RPMI-1640 without additives.
- Phenazine methosulfate (PMS) solution was prepared at 0.15 mg/mL in DPBS and stored in the dark at -20°C.
- XTT/PMS stock was prepared immediately before use by adding 40 ⁇ L of PMS per mL of XTT solution. Fifty ⁇ L (50 ⁇ L) of XTT/PMS was added to each well of the plate and the plate incubated for 4 hours at 37°C. The 4 hour incubation has been empirically determined to be within the linear response range for XTT dye reduction with the indicated numbers of cells for each assay. The plates were sealed and inverted several times to mix the soluble formazan product and the plate was read at 450 nm (650 nm reference wavelength) with a Molecular Devices SpectraMax Plus 38496 well plate format spectrophotometer.
- Example 40 30-day comparative non-human primate PK study following IV administration of a conjugate including an Fc domain having a C220S/YTE quadruple mutant
- a conjugate including an Fc domain having a C220S mutation and a YTE mutation was synthesized as described in Example 39.
- a non-human primate PK study was performed to compare IV administration of the C220S/YTE Fc conjugate (SEQ ID NO: 67) to a conjugate including an Fc domain having a C220S mutation alone (SEQ ID NO: 64).
- Non-human primate (NHP) PK studies were performed by BTS Research (San Diego, CA) using male and female cynomolgus monkeys 5-9 years old with body weights ranging from 3.5-8.5 kg. NHPs were injected IV with 2 mg/kg of test article (0.4 mL/kg dose volume). Animals were housed under standard IACUC approved housing conditions. At appropriate times animals were non-terminally bled (via femoral or cephalic veins) with blood collected in K 2 EDTA tubes to prevent coagulation. Collected blood was centrifuged (2,000 x g, for 10 minutes) and plasma withdrawn for analysis of test article concentrations over time.
- the plasma concentrations for the C220S/YTE Fc conjugate and the C220S conjugate at each time point were measured by sandwich ELISA. Briefly, test articles were captured on Fc-coated plates and then detected using a HRP-conjugated anti-human IgG-Fc antibody. Protein concentrations were calculated in GraphPad Prism using 4PL non-linear regression of the C220S/YTE Fc conjugate or C220S conjugate standard curves. A more detailed method description is provided above. The corresponding curves are shown in FIG.13.
- the C220S/YTE Fc conjugate demonstrates a significantly improved terminal half-life of ⁇ 45 days compared with ⁇ 10 days for the C220S Fc conjugate. AUCs for the C220S/YTE Fc conjugate are 2X greater than the AUCs for The C220S conjugate (Table 8).
- HeLa-CD4-LTR- ⁇ -Gal cells were plated at a density of 5 x 10 3 cells per well in a volume of 50 ⁇ L, with 50 ⁇ L of nine serial half-logarithmic dilutions of compound in triplicate for one hour at 37 oC/5% CO 2 . Following the incubation, 100 ⁇ L of HL2/3 cells were added to the plates. The cultures were incubated for an additional 48 hours at 37 °C/5% CO 2 . Following the incubation, efficacy plates were evaluated for ⁇ -galactosidase production using a chemiluminescent substrate and toxicity plates were stained with XTT to evaluate cell viability. In these studies, cytotoxicity was also evaluated (TC50).
- Test materials were derived by measuring the reduction of the tetrazolium dye XTT (2,3-bis(2-methoxy-4-nitro-5-sulfophenyl)-5- [(phenylamino)carbonyl]-2H-tetrazolium hydroxide).
- XTT in metabolically active cells is metabolized by the mitochondrial enzyme NADPH oxidase to a soluble formazan product.
- XTT solution was prepared daily as a stock of 1 mg/mL in RPMI-1640 without additives.
- Phenazine methosulfate (PMS) solution was prepared at 0.15 mg/mL in DPBS and stored in the dark at -20°C.
- XTT/PMS stock was prepared immediately before use by adding 40 ⁇ L of PMS per mL of XTT solution. Fifty ⁇ L (50 ⁇ L) of XTT/PMS was added to each well of the plate and the plate incubated for 4 hours at 37 °C. The 4 hour incubation has been empirically determined to be within the linear response range for XTT dye reduction with the indicated numbers of cells for each assay. The plates were sealed and inverted several times to mix the soluble formazan product and the plate was read at 450 nm (650 nm reference wavelength) with a Molecular Devices SpectraMax Plus 38496 well plate format spectrophotometer.
- Example 42 Activity of lead compounds in a cell fusion assay (EC50) and cytotoxicity (TC50)
- DAR is determined by Maldi TOF of the purified conjugates and is typically 2 to 5.
- the yields and properties for conjugates synthesized using this general procedure are listed in Table 12 below. Table 12.
- this assay measures the fusion of two cell lines, HeLa-CD4-LTR- ⁇ -Gal (catalog #1470) and HL2/3 cells (catalog #1294), obtained from the NIH AIDS Research Reagent and Reference Program (Rockville, MD).
- HeLa-CD4-LTR- ⁇ -Gal cells were plated at a density of 5 x 10 3 cells per well in a volume of 50 ⁇ L, with 50 ⁇ L of nine serial half-logarithmic dilutions of compound in triplicate for 1 h at 37oC/5% CO 2 . Following the incubation, 100 ⁇ L of HL2/3 cells were added to the plates. The cultures were incubated for an additional 48 h at 37oC/5% CO 2 .
- the concentration required to induce 50% toxicity was derived by measuring the reduction of the tetrazolium dye XTT (2,3-bis(2-methoxy-4- nitro-5-sulfophenyl)-5-[(phenylamino)carbonyl]-2H-tetrazolium hydroxide).
- XTT in metabolically active cells is metabolized by the mitochondrial enzyme NADPH oxidase to a soluble formazan product.
- XTT solution was prepared daily as a stock of 1 mg/mL in RPMI-1640 without additives.
- Phenazine methosulfate (PMS) solution was prepared at 0.15 mg/mL in DPBS and stored in the dark at -20°C.
- XTT/PMS stock was prepared immediately before use by adding 40 ⁇ L of PMS per mL of XTT solution. Fifty ⁇ L (50 ⁇ L) of XTT/PMS was added to each well of the plate and the plate incubated for 4 h at 37°C. The 4 h incubation has been empirically determined to be within the linear response range for XTT dye reduction with the indicated numbers of cells for each assay. The plates were sealed and inverted several times to mix the soluble formazan product and the plate was read at 450 nm (650 nm reference wavelength) with a Molecular Devices SpectraMax Plus 38496 well plate format spectrophotometer.
- Compounds Conjugate 29a, Conjugate 30a, and Conjugate 34 were the most active with EC50 values of 7.06 nM, 10.5 nM and 14.9 nM, respectively.
- Compounds Conjugate 5b, Conjugate 32, and Conjugate 36 were slightly less active than the most active compounds with EC50 values ranging from 22.8 nM to 43.6 nM. The remaining compounds had EC50 values ranging from 144 nM to 915 nM. All of the compounds were non-toxic up to the highest concentration evaluated (2000 nM with the exception of 1800 nM for compound Conjugate 15) (Table 13). Table 13. Evaluation of inhibition of cell to cell fusion in HeLa-CD4-LTR- ⁇ -Gal and HL2/3 cells.
- this assay measures the fusion of two cell lines, HeLa-CD4-LTR- ⁇ -Gal (catalog #1470) and HL2/3 cells (catalog #1294), obtained from the NIH AIDS Research Reagent and Reference Program (Rockville, MD).
- HL2/3 cells were plated at a density of 2 x 10 4 cells per well in a volume of 50 ⁇ L, with 50 ⁇ L of nine serial half-logarithmic dilutions of compound in triplicate for 1 h at 37oC/5% CO 2 . Following the incubation, 100 ⁇ L of HeLa-CD4-LTR- ⁇ -Gal cells were added to the plates. The cultures were incubated for an additional 48 h at 37oC/5% CO 2 .
- CSB Chicago Sky Blue
- Temsavir Temsavir
- conjugates were evaluated for the ability to inhibit HIV-1-induced cell to cell fusion.
- the average EC50 values defined in the assays ranged from 1.9E-01 to 6.0E+03 nM (Table 15).
- Conjugate 14b was the most potent conjugate with an EC 50 value of 1.99E-01 nM (Table 15).
- Several conjugates had single-digit nM potency, including Conjugate 27, Conjugate 29a, Conjugate 12b, Conjugate 37a, Conjugate 5b, and Conjugate 30a.
- the remaining compounds had EC 50 values ranging from 1.14E+01 nM to 6.02E+03 nM (Table 15). Table 15.
- mice were non-terminally bled (retro-orbital, cheek, or by tail vein) with blood collected in K2EDTA tubes to prevent coagulation. Collected blood was centrifuged (2,000 x g, for 10 min) and plasma withdrawn for analysis of test article concentrations over time. The Fc plasma concentrations at each time point were measured by Fc-capture sandwich ELISA as follows.
- Nunc Maxisorp 96-well plates (cat no.12-565-136, Fisher Scientific) were coated overnight at 4 ⁇ C with 0.1 ⁇ g/100 ⁇ L/well of goat anti-human IgG (Fc ⁇ fragment specific; cat no.109-005-098, Jackson Immunoresearch) in carbonate buffer (cat no. C3041, MilliporeSigma). Plates were washed 5x with 300 ⁇ L/well PBST and blocked with 200 ⁇ L/well 5% non-fat dry milk (cat no.9999S, Cell Signaling) in PBST for 1 h at room temperature with shaking.
- sample diluent 2.5% non-fat dry milk in PBS 0.025% Tween 20 + na ⁇ ve mouse plasma final concentration of 1:900.
- Compound standard curves ranging from 0.03 to 55 ng/mL in duplicate, were run on each plate. Following the 2 h incubation, plates were washed 5x with 300 ⁇ L/well PBST.
- Test article in plasma samples was interpolated using GraphPad Prism Version 8 following nonlinear regression analysis (Sigmoidal, 4PL analysis) of the standard curves. The resulting mean plasma concentrations were then used to calculate the total AUC for each plasma concentration-time profile.
- the 7-day mouse PK profiles of Conjugate 5b and Fc control (SEQ ID NO:73) are shown in FIG. 15. Following IV administration of compound at 5 mg/kg, the average plasma exposure levels of Conjugate 5b were noninferior to SEQ ID NO:73, with AUCs of 2922 and 1766, respectively (FIG.15).
- Step e A solution of the step-c product (35.8 mg, 0.0409 mmol) in DMF (1 ml) was mixed with TFA (20 ⁇ l) and cooled in an ice-water bath. It was then mixed with the step-d product (29.3 mg, 0.0491 mmol) and a premixed solution of THPTA (3.5 mg, 0.008 mmol) and sodium ascorbate (29 mg, 0.146 mmol) in water (0.5 ml), followed by Cu 2 SO 4 (1.6 mg, 0.01 mmol). The ice-water bath was removed, and the resulting mixture was stirred for 40 minutes.
- Example 52 Synthesis of Int-115 To a cold solution of dimethyl 2-(3-((tert-butoxycarbonyl)amino)propyl)malonate (500 mg, 1.72 mmol, prepared as described in Angewandte Chemie, International Edition (2016), 57(22), 6527-6531) in DMF (6 ml) was added sodium hydride (72.5 mg, 1.81 mmol), followed N-(bromomethyl)phthalimide (0.45 mg, 1.9 mmol). The resulting suspension was stirred for 2.5 h at 0°C, and partitioned between water (6 ml) and dichloromethane (40 ml). The organic layer was separated, washed with water and sat. aq.
- Step b To a solution of step-a product (300 mg, 0.66 mmol) in methanol (3 ml) was added hydrazine hydrate (0.18 ml, 3.34 mmol). The solution was stirred under reflux for 4 hrs. It was then cooled, filtered, concentrated and purified by RPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier) to afford product. Yield 160 mg, 75 %.
- step-c product 142 mg, 0.25 mmol
- THF 2.5 ml
- LiBH4 23.2 mg, 1.0 mmol
- Step e The step-d product (91 mg, 0.17 mmol) wascooled and dissolved in HCl (2 ml, 4N in dioxane) and reaction was stirred for 2 hrs.
- Step f To a solution of triazole-carboxylic acid (77.4 mg, 0.151 mmol) in DMF (1 mL) was added HSPyU (124 mg, 0.30 mmol) and DIPEA (0.15 ml, 0.90 mmol) and the mixture was stirred for 30 min. Step-e product (72 mg, 0.181 mmol) was then added to above solution and LCMS after 30min shows complete consumption of the triazole-carboxylic acid.
- Step g A solution of step-f product (39 mg, 0.043 mmol) and azido-Peg4-trifluorophenol ester (23.8 mg, 0.056 mmol) in DMF (0.6 mL), was treated with a solution of copper(II) sulfate (1.73 mg, 0.010 mmol), sodium ascorbate (8.6 mg, 0.043 mmol), and THPTA (9.43 mg, 0.021 mmol) in water (0.3 mL).
- Step f A solution of product from the previous step (0.030 g, 0.033 mmol), and azido-PEG4- trifluorophenol ester (0.018 g, 0.043 mmol) dissolved in DMF (1.0 mL), was treated with a solution of copper(II) sulfate (0.001 g, 0.008 mmol), and sodium ascorbate (0.006 g, 0.033 mmol), dissolved in water (1.0 mL), and stirred at room temperature for 30 min.
- Example 54 Synthesis of Int-119 Step a. A mixture of t-butyl(5-formyl-2,2-dimethyl-1,3-dioxan-5-yl) carbamate (1.9 g, 7.3 mmol) and ammonium acetate (5.6 g, 73.2 mmoL)in methanol (30 mL) were stirred at ambient temperature for 1 hour. Sodium cyanoborohydride (1.2 g, 18.3 mmol) was added and the reaction was stirred for 12 hours then concentrated to a volume of approx.5 mL.
- the amine was taken up in acetonitrile (25 mL) and DIPEA (2.5 mL, 14.7 mmol) and Fmoc-OSu (2.7 g, 8.1 mmoL was added and the mixture was stirred for 1 hour, diluted with water (50 mL) and extracted into ethyl acetate (3x, 30 mL0. The combined organic extracts were dried over sodium sulfate and concentrated.
- the amine-HCl salt was dissolved in methanol (30 mL) and CBZ-amino- propanal (227 mg, 1.1 mmol) was added and the mixture was stirred for 30 minutes at which point sodium cyanoborohydride (140 mg, 2.2 mmol) was added and the mixture was stirred for 12 hours at ambient temperature.
- the intermediate was stirred in methanol in the presence of 5% Pd/C (75 mg) for 2 hours.
- Step c HATU (67 mg, 0.18 mmol) was added to a stirring mixture of the intermediate from step b. of this example (76 mg, 0.19 mmol), and the triazole carboxylic acid (75 mg, 0.15 mmol described in Example 5 of Int- 2), and DIPEA (0.10 uL, 0.58 mmol) in DMF (2 mL). The reaction was stirred for 45 minutes and then purified by reversed phase HPLC (0-95% ACN in DI water, 0.1% TFA modifier, 30 minute gradient).
- Step d DBU (60 uL, 0.41 mmol) was added to a stirring mixture of the intermediate described in step c. of this example (75 mg, 0.084 mmol) in DMF (2 mL) and the mixture was stirred for 20 minutes. Propargyl-PEG4-carboxylic acid (33 mg, 0.13 mmol) was added followed by HATU (42 mg 0.11 mmol) and the reaction was stirred for 45 minutes.
- Copper sulfate (0.5 mg, 0.003 mmol), sodium ascorbate (17 mg, 0.089 mmol), and BTTA (2.5 mg, 0.006 mmol), were pre-mixed in DI water (0.5 mL) and then added to a mixture of the alkyne intermediate (27 mg, 0.029mmol), and azido-peg4-trifluorophenol-ester (12 mg, 0.29 mmol), in DMF/H 2 O (3:1, 1 mL) , cooled to 0°C via and ice water bath.
- Step b To a solution of the step-a crude product in DCM (20 ml) was added 3- [benzyloxycarbonyl)amino]-1-propanal (2.49 g, 12 mmol) and acetic acid (900 mg, 15 mmol). The resulting mixture was stirred at room temperature for 30 minutes, then sodium triacetoxyborohydride (3.15 g, 15 mmol) was added in portions.
- Step c To a solution of the step-b product (1.05 g, 1.93 mmol) in MeOH (25 ml) was added Pd(OH) 2 on carbon (850 mg).
- Step d To a solution of the step-c product (1.25 g, 3.33 mmol) in anhydrous DMF (4 ml) was added benzyl bis(2-chloroethyl)carbamate (1.38 g, 5 mmol), triethylamine (12 ml) and NaI (150 mg, 1 mmol).
- Step f To a solution of the step-e product (245 mg, 0.55 mmol) in anhydrous DMF (1 ml) was added propargyl-PEG4-mesyl ester (223.5 mg, 0.72 mmol) and DIEPA (129.2 mg, 1 mmol). The resulting mixture was heated at 600C for 1 day, then directly purified by RPLC (50 g, 5 to 70% acetonitrile and water). Yield 129 mg, 35.6%.
- step-c product 101.4 mg, 0.28 mmol
- anhydrous DMF 0.5 ml
- propargyl-PEG4-NHS ester 121.3 mg, 0.34 mmol
- DIPEA 65 mg, 0.5 mmol
- Step e The step-d product (192 mg, 0.268 mmol) was dissolved in acetonitrile (1 ml) and water (0.5 ml).
- Step f A mixture of the triazole acid described herein (51.2 mg, 0.1 mmol) and the step-e product (53.8 mg, 0.1 mmol) was dissolved in anhydrous DMF (1 ml) and Et 3 N (606 mg, 6 mmol) by gently heated with a heat gun.
- Step b To a solution of the step-a product (2.55 g, 4.95 mmol) in acetonitrile (5 ml) was added water (5 ml) and 6N HCl aqueous solution (4 ml). The reaction mixture was heated at 600C for 2 hours, then concentrated by rotary evaporation. Excess HCl was further removed by azeotrope/evaporation with acetonitrile. The residue was further dried under high vacuum.
- step-c product 505 mg, 1.1 mmol
- water 2 ml
- 6N HCl aqueous solution 1 ml
- Step h A solution of the step-g product (35.4 mg, 0.025 mmol) in DMF (1 ml) was treated with 40 ⁇ l of TFA, and cooled in an ice-water bath. It was then mixed with azido-PEG4-trifluorophenyl ester (12.7 mg, 0.3 mmol) and a premixed solution of THPTA (3.5 mg, 0.008 mmol) and sodium ascorbate (29 mg, 0.146 mmol) in water (0.5 ml), followed by Cu 2 SO 4 (1.6 mg, 0.01 mmol).
- Step c HATU (72 mg, 0.19 mmol) was added to a stirring mixture of the intermediate from step b. of this example (75 mg, 0.16 mmol), and the triazole carboxylic acid, (described in example 5 of Int- 2, 81 mg, 0.16 mmol), and DIPEA (0.11 uL, 0.64 mmol) in DMF (2 mL). The reaction was stirred for 45 minutes and purified directly by reversed phase HPLC (0-95% ACN in DI water, 0.1% TFA modifier, 30 minute gradient).
- Step d DBU (73 uL, 0.52 mmol) was added to a stirring mixture of the intermediate described in step c. of this example (100 mg, 0.10 mmol) in DMF (2 mL) and the mixture was stirred for 20 minutes. Propargyl-peg4-carboxylic acid (41 mg, 0.16 mmol) was added followed by HATU (47 mg 0.12 mmol) and the reaction was stirred for 45 minutes. The mixture was purified by reversed phase HPLC (0-95% ACN in DI water, 0.1% TFA modifier, 30 minute gradient).
- Step h A solution of product from the previous step (0.095g, 0.147 mmol), and triazole-carboxylic acid (0.050g, 0.098 mmol, described herein) dissolved in DMF (2 mL), was treated with DIEA (0.170 mL, 0.98 mmol), HOBT (0.026 g, 0.196), and HATU (0.074 g, 0.196 mmol).
- LCMS after 30min shows complete consumption of the triazole-carboxylic acid.
- the reaction was concentrated and purified by semi- preparative HPLC (5% to 100% ACN/water with 0.1% TFA). Yield 0.060g, 46%.
- Step b To a solution of the step-a product (944.7 mg, 2.77 mmol) in anhydrous DMF (3 ml) was added K 2 CO 3 (459 mg, 3.32 mmol) and ethanolamine (338.5 mg, 5.54 mmol). The mixture was heated at 70°C for 3 hours, then cooled to room temperature.
- Step c To a solution of the step-b product (708 mg, 2.19 mmol) in anhydrous DMF (2.5 ml) was added DIPEA (568.7 mg, 4.4 mmol), 3-(Boc)-aminopropyl bromide (785.7 mg, 3.3 mmol) and NaI (105 mg, 0.7 mmol).
- Step d To a solution of the step-c product (793 mg, 1.66 mmol) in MeOH (15 ml) was added acetic acid (99.4 mg, 1.66 mmol)) and Pd(OH) 2 (20% on carbon, 350 mg). The resulting mixture was stirred under hydrogen atmosphere for 3 hours. Pd(OH) 2 /C was filtered off, and the filtrate was concentrated by rotary evaporation.
- Step e To a solution of the step-d (618 mg, 1.53 mmol) in anhydrous THF (3 ml) and DMF (1 ml) was added K 2 CO 3 (624.3 mg 4.59 mmol), propargyl-PEG4-mesyl ester (552.5 mg, 1.78 mmol) and NaI (45 mg, 0.3 mmol). The resulting mixture was heated at 70°C overnight.
- Step f The step-e product (262.4 mg, 0.47 mmol) was dissolved in acetonitrile/water (1:1, 4 ml) and mixed with 6N HCl aqueous solution (0.4 ml, 2.4 mmol). The reaction was heated at 50°C for 6 hours, then cooled to room temperature. After the pH was adjusted to about 8 with 1M KOH (2.4 ml), the solution was concentrated by rotary evaporation.
- a solution of the step-g product (20 mg, 0.0142 mmol) in DMF:MeOH (1:1, 1 ml) was mixed with TFA (20 ⁇ l) and cooled in an ice-water bath. It was then mixed with azido-PEG4-trifluorophenyl ester (7.2 mg, 0.017 mmol) and a premixed solution of THPTA (3.5 mg, 0.008 mmol) and sodium ascorbate (29 mg, 0.146 mmol) in water (0.5 ml), followed by Cu 2 SO 4 (1.6 mg, 0.01 mmol). The ice-water bath was removed, and the resulting mixture was stirred for 40 minutes.
- step-c product 100 mg, 0.15 mmol
- THF 1.5 ml
- LiBH4 14.5 mg, 0.63 mmol
- Step e The step-d product (42 mg, 0.052 mmol) was dissolved in HCl (4N in dioxane).
- Step f To a solution of step-e product (35 mg, 0.052 mmol) in DMF (0.5 ml) was added triazole- carboxylic acid (32.1 mg, 0.062 mmol), DIPEA (0.089 mL, 0.51 mmol), HOBT (14.1 mg, 0.10 mmol), and HATU (39.8 mg, 0.10 mmol). LCMS after 30min shows complete consumption of the triazole-carboxylic acid. The reaction was concentrated and purified by semi-preparative HPLC (5% to 100% ACN/water).
- Step g A solution of step-f product (12 mg, 0.011 mmol), and azido-Peg4-trifluorophenol ester (6 mg, 0.014 mmol) in DMF (0.6 mL), was treated with a solution of copper (II) sulfate (0.4 mg, 0.00277 mmol), sodium ascorbate (2.19 mg, 0.011 mmol), and THPTA (2.41 mg, 0.0055 mmol) in water (0.3 mL), and stirred at room temperature for 30 min. Reaction progress was monitored by LCMS. The product was purified by semi-preparative HPLC (5% to 100% ACN/water).
- step-c product 169 mg, 0.34 mmol
- DMF dimethyl methoxyethyl-N-(2-aminoethyl)
- step-c product 169 mg, 0.34 mmol
- Et3N 0.05 ml, 0.34 mmol
- Step f Step-e product (31 mg, 0.06 mmol) was dissolved in a cool solution of HCl (1.5 ml, 4N in dioxane) and reaction was stirred for 2 hr.
- Example 74 Activity against HIV-1 IIIB Strain (HIV-1IIIB) in a cytopathic effects assay using CEM-SS Cells Materials. This study determined the activity of conjugates, comparators, and buffers against HIV-1IIIB in a CPE assay with a T-cell lymphoblastic cell line obtained from the AIDS Research and Reference Program (Rockville, MD).
- Controls consisted of AZT (Sigma-Aldrich), Temsavir (Astatech, PA), conjugate buffer (200 mM Arginine, 120 mM NaCl, 1% Sucrose, pH 6.0), and Temsavir buffer (DMSO/PEG300/Tween-80/PBS).
- Methods – Evaluation of Inhibition of HIV-1 IIIB and Cellular Toxicity in CEM-SS Cells Fifty microliters (50 ⁇ L) of CEM-SS cells at a density of 2.5 x10 3 cells/well in 10% complete RPMI-1640 (10% FBS with 1% L-glutamine and 1% Penicillin/Streptomycin) media were plated in a 96-well round bottom plate.
- One-hundred microliters (100 ⁇ L) of each compound at 6 concentrations were added in triplicate followed by 50 ⁇ L of HIV-1IIIB at a pre-determined titer. The cultures were incubated for 6 days at 37°C/5% CO 2 . Following the incubation, the cells were stained with XTT for evaluation of compound efficacy and cellular toxicity, as described below. AZT was evaluated in parallel as an assay positive control compound. Data Analysis and Evaluation. Microsoft Excel 2010 in combination with XLfit4 was used to analyze and graph data.
- EC 25 , EC 50 and EC 95 (25%, 50% and 95% inhibition of virus replication), TC 25 , TC 50 and TC 95 (25%, 50%, and 95% reduction in cell viability) and a therapeutic index (TI, TC 25 /EC 25 , TC 50 /EC 50 , TC 95 / EC 95 ) were calculated.
- TI TC 25 /EC 25 , TC 50 /EC 50 , TC 95 / EC 95
- AZT was evaluated in parallel as a positive control compound and was active at the expected concentration.
- Example 75 Activity of Conjugate 29a and Conjugate 29b against 5 HIV-1 strains in a cell-based assay with human peripheral blood mononuclear cells (PMBCs) Materials. This study determined the activity of Conjugate 29a and Conjugate 29b against wild type HIV-1 isolates covering 3 different clades and co-receptor specificities obtained from the AIDS Research and Reference Reagent Program (Rockville, MD) (Table 18).
- Controls consisted of AZT (Sigma-Aldrich), Temsavir (Astatech, PA), and compound buffer (200 mM Arginine, 120 mM NaCl, 1% Sucrose, pH 6.0)
- Human PBMCs were isolated from whole blood from three donors. The leukophoresed blood was diluted 1:1 with DPBS and was layered over 15 mL of Ficoll-Hypaque density gradient. These tubes were centrifuged for 30 min at 1800 rpm. Banded PBMCs were gently aspirated from the resulting interface and subsequently washed three times with DPBS by low-speed centrifugation.
- tissue culture medium RPMI 1640 with 15% FBS, 2 mmol/L L-glutamine, 100 U/mL penicillin, 100 ⁇ g/mL streptomycin and 3.6 ng/mL recombinant human IL-2).
- the cultures were then maintained until use by half culture volume change with fresh IL-2 containing tissue culture medium every 3 to 4 days. Assay conditions. Assays were initiated with PBMCs that had been induced to proliferate with PHA-P for 72 hours.
- PHA-P stimulated PBMCs from three donors were pooled together to minimize the variability that occurs when cells from individual donors are used, resuspended in fresh tissue culture medium at 1 x 10 6 cells/mL and plated in the interior wells of a 96-well round bottom microtiter plate at 50 ⁇ L/well.
- One-hundred microliters (100 ⁇ L) of each two times the concentration of compound in assay media was added to designated wells of the round-bottom 96-well plate containing the cells, in triplicate.
- 50 ⁇ L of a pre- determined dilution of virus was added.
- HIV-1 replication was quantified by the measurement of cell-free HIV-1 RT activity in the tissue culture supernatant. Cytotoxicity was evaluated using the tetrazolium dye XTT following well known standard methods. Quantification of reverse transcriptase (RT) activity. Virus production by untreated and treated cells was quantified by measuring reverse transcriptase in cell-free supernatants using a standard radioactive incorporation polymerization assay. Tritiated thymidine triphosphate (TTP) was purchased at 1 Ci/mL and 1 ⁇ L was used per enzyme reaction.
- TTP Tritiated thymidine triphosphate
- Poly rA and oligo dT were prepared at concentrations of 0.5 mg/mL and 1.7 Units/mL, respectively, from a stock solution which was kept at -20°C.
- the RT reaction buffer was prepared fresh and consists of 125 ⁇ L of 1 M EGTA, 125 ⁇ L of dH 2 O, 125 ⁇ L of 20% Triton X-100, 50 ⁇ L of 1 M Tris (pH 7.4), 50 ⁇ L of 1 M DTT, and 40 ⁇ L of 1 M MgCl 2 .
- 1 ⁇ L of TTP, 4 ⁇ L of dH 2 O, 2.5 ⁇ L of rAdT and 2.5 ⁇ L of reaction buffer were mixed.
- Conjugate 29a and Conjugate 29b were not cytotoxicity with Conjugate 29a and Conjugate 29b against this, or any strain, in the panel (TC 50 > 250 nM).
- Conjugate demonstrated single digit potency. Potency against HIV-1HT/92/594 was 0.49 nM for Conjugate 1.
- Conjugate 1 also displayed potent activity against the Subtype C strain (HIV-1ZA/97/009). Likewise, Conjugate 1 demonstrated significant activity against the A Subtype (HIV-1UG/92/037). Significantly, Conjugate 1 was more potent than the Temsavir comparator against the A Subtype.
- Conjugate 29a and Conjugate 29b were highly active against 5 clinically relevant HIV-1 strains with EC 50 values in the single digit (or better) nM range.
- the HIV-1 panel tested in this study also included 3 different Subtypes with 3 different co-receptor tropisms indicating broad coverage by Conjugate 29a and Conjugate 29b.
- this assay was conducted with actual human PBMCs isolated from 3 different donors, making the assay, and results, all the more meaningful.
- Example 76 Screening of HIV antiviral small molecules and conjugates in an in vitro cell fusion inhibition assay Activity of compounds was determined by using an assay designed to measure the inhibition of cell-cell fusion mediated by gp120 and CD4, a key step in the HIV infection process. Briefly, this assay measures the fusion of two cell lines, HeLa-CD4-LTR- ⁇ -Gal (catalog #1470) and HL2/3 cells (catalog #1294), obtained from the NIH AIDS Research Reagent and Reference Program (Rockville, MD).
- HL2/3 cells were plated at a density of 2 x 10 4 cells per well in a volume of 50 ⁇ L, with 50 ⁇ L of nine serial half- logarithmic dilutions of compound in triplicate for 1 h at 37oC/5% CO 2 .
- 100 ⁇ L of HeLa-CD4-LTR- ⁇ -Gal cells were added to the plates.
- the cultures were incubated for an additional 48 h at 37oC/5% CO 2 .
- plates were evaluated for ⁇ -galactosidase production using a chemiluminescent substrate and buffer (ThermoFisher). Briefly, all media was removed from the efficacy plates and replaced with 100 ⁇ L of DPBS.
- Table 20 Evaluation of inhibition of cell to cell fusion in HeLa-CD4-LTR- ⁇ -Gal and HL2/3 cells. All compounds inhibited HIV-induced cell to cell fusion with EC50 values ranging from 1.1E-05 to 3.1E+03 nM.
- the conjugates were also evaluated for the ability to inhibit HIV-1-induced cell to cell fusion.
- the EC 50 values defined in the assays are shown below in Table 21.
- DAR drug-to-antibody ratio
- Example 77.7-day mouse PK study comparing IV administration of Conjugate 5b, Conjugate 29a and Fc control (SEQ ID NO: 72) at 5 mg/kg
- mice were injected intravenously (IV) via the tail vein with 5 mg/kg of test article (5 mL/kg dose volume). Animals were housed under standard IACUC approved housing conditions. At indicated times (0.25, 1, 2, 4, 24, 48, 120, 144, and 168 h), animals were non-terminally bled (retro-orbital, cheek, or by tail vein) with blood collected in K2EDTA tubes to prevent coagulation. Collected blood was centrifuged (2,000 x g, for 10 min) and plasma withdrawn for analysis of test article concentrations over time. The Fc plasma concentrations at each time point were measured by Fc-capture sandwich ELISA as described above.
- PBMCs Banded PBMCs were gently aspirated from the resulting interface and subsequently washed three times with DPBS by low-speed centrifugation. After the final wash, cells were enumerated by Trypan Blue dye exclusion and re-suspended at 1 x 10 6 cells/mL in RPMI 1640 with 15% Fetal Bovine Serum (FBS), 2 mmol/L L-glutamine, 2 ⁇ g/mL PHA-P, 100 Units/mL penicillin and 100 ⁇ g/mL streptomycin were incubated for 72 hours at 37 o C/5% CO 2 .
- FBS Fetal Bovine Serum
- tissue culture medium RPMI 1640 with 15% FBS, 2 mmol/L L-glutamine, 100 U/mL penicillin, 100 ⁇ g/mL streptomycin and 3.6 ng/mL recombinant human IL-2).
- the cultures were then maintained until use by half culture volume change with fresh IL-2 containing tissue culture medium every 3 to 4 days. Assay conditions. Assays were initiated with PBMCs that had been induced to proliferate with PHA-P for 72 hours.
- PHA-P stimulated PBMCs from three donors were pooled together to minimize the variability that occurs when cells from individual donors are used, resuspended in fresh tissue culture medium at 1 x 10 6 cells/mL and plated in the interior wells of a 96-well round bottom microtiter plate at 50 ⁇ L/well.
- One-hundred microliters (100 ⁇ L) of each two times the concentration of compound in assay media was added to designated wells of the round-bottom 96-well plate containing the cells, in triplicate.
- 50 ⁇ L of a pre- determined dilution of virus was added.
- HIV-1 replication was quantified by the measurement of cell-free HIV-1 RT activity in the tissue culture supernatant. Cytotoxicity was evaluated using the tetrazolium dye XTT following well known standard methods. Quantification of reverse transcriptase (RT) activity. Virus production by untreated and treated cells was quantified by measuring reverse transcriptase in cell-free supernatants using a standard radioactive incorporation polymerization assay. Tritiated thymidine triphosphate (TTP) was purchased at 1 Ci/mL and 1 ⁇ L was used per enzyme reaction.
- TTP Tritiated thymidine triphosphate
- Poly rA and oligo dT were prepared at concentrations of 0.5 mg/mL and 1.7 Units/mL, respectively, from a stock solution which was kept at -20°C.
- the RT reaction buffer was prepared fresh and consists of 125 ⁇ L of 1 M EGTA, 125 ⁇ L of dH 2 O, 125 ⁇ L of 20% Triton X-100, 50 ⁇ L of 1 M Tris (pH 7.4), 50 ⁇ L of 1 M DTT, and 40 ⁇ L of 1 M MgCl 2 .
- 1 ⁇ L of TTP, 4 ⁇ L of dH 2 O, 2.5 ⁇ L of rAdT and 2.5 ⁇ L of reaction buffer were mixed.
- Conjugate 71 is a conjugate with single small molecule (SM) targeting moieties (TM), which generated EC 50 values ranging from 0.00828 to >100 (Table 26 and Table 27).
- SM single small molecule
- step-a product 500 mg, 0.75 mmol
- DMF dimethyl methoxysulfoxide
- 4-imidazolecarboxylic acid 85mg, 0.75mmol
- HATU hydroxybenzyl
- step-b product 500 mg, 0.94 mmol
- 20% Pd(OH) 2 100 mg
- MeOH 10 ml
- step d A mixture of step-c product (298 mg, 0.75 mmol), propargyl-Peg4-mesylate (257 mg, 0.83 mmol), and potassium carbonate (310 mg, 2.26 mmol) in acetonitrile (2 mL) were heated at 65 o C for 12 h.
- Step A mixture of product-f (40 mg, 0.032 mmol), and azido-Peg4-trifluorophenol ester (18 mg, 0.042 mmol) in DMF (0.6 mL) was treated with a solution of copper(II) sulfate (1.2 mg, 0.008 mmol), sodium ascorbate (6.4 mg, 0.032 mmol), and THPTA (7 mg, 16.2 mmol) in water (0.6 mL), and stirred room temperature for 30 min. Reaction progress was monitored by LCMS. The product was purified by semi- preparative HPLC (5% to 100% ACN/water). Yield mg, %. LCMS: [(M + 2H)/2] + 712.7. Example 81.
- step-a product 160 mg, 0.71 mmol
- 20% Pd(OH) 2 32 mg
- MeOH 5 ml
- step c A mixture of step-b product (115 mg, 0.25 mmol), propargyl-Peg4-mesylate (95 mg, 0.3 mmol), and potassium carbonate (213 mg, 1.5 mmol) were dissolved in acetonitrile (2 mL) and heated at 65 o C for 12h.
- NUMBERED EMBODIMENTS 1.
- R 8 are each independently selected from H, optionally substituted C 1 -C 20 alkyl, optionally substituted C 1 -C 20 alkylene, optionally substituted C 3 -C 20 cycloalkyl, optionally substituted C 2 -C 20 heterocycloalkyl, optionally substituted C 5 -C 15 aryl, and optionally substituted C 2 -C 15 heteroaryl;
- R 9 are each independently selected from optionally substituted C 1 -C 20 alkylene, optionally substituted C 3 -C 20 cycloalkyl, optionally substituted C 2 -C 20 heterocycloalkyl, optionally substituted C 5 -C 15 aryl, and optionally substituted C 2 -C 15 heteroaryl;
- x is 1 or 2;
- k is 0, 1, 2, 3, 4, or 5;
- Ar is selected from the group consisting of optionally substituted C 3 -C 20 cycloalkyl, optionally substituted C 2 -C 20 heterocycloalkyl, optional
- each X is independently C or N; or a pharmaceutically acceptable salt thereof.
- each A 1 and each A 2 is independently described by any one of formulas (A-Iq)-(A-Ix): or a pharmaceutically acceptable salt thereof.
- the conjugate is described by formula (M-I): wherein each A 1 is independently described by formula (A-I); each E comprises an Fc domain monomer, and the squiggly line connected to the E indicates that each A 1 -L is covalently attached to E; or a pharmaceutically acceptable salt thereof.
- the conjugate of embodiment 18, wherein L’ is a nitrogen atom.
- 20. The conjugate of embodiment 9, wherein the conjugate is described by formula (M-IV): or a pharmaceutically acceptable salt thereof.
- 21. The conjugate of embodiment 20, wherein the conjugate is described by formula (M-IV-1): or a pharmaceutically acceptable salt thereof.
- 22. The conjugate of embodiment 21, wherein the conjugate is described by formula (M-IV-2): wherein L’ is the remainder of L, and y 1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
- 23. The conjugate of embodiment 22, wherein L’ is a nitrogen atom.
- 24. The conjugate of embodiment 20, wherein the conjugate is described by formula (M-IV-3): or a pharmaceutically acceptable salt thereof. 25.
- the conjugate of embodiment 48, wherein L’ is a nitrogen atom.
- the conjugate of embodiment 46, wherein the conjugate is described by formula (M-VI-3): or a pharmaceutically acceptable salt thereof.
- the conjugate of embodiment 46, wherein the conjugate is described by formula (M-VI-4): wherein L’ is the remainder of L, and y 1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
- 52. The conjugate of embodiment 51, wherein L’ is a nitrogen atom.
- the conjugate of embodiment 46, wherein the conjugate is described by formula (M-VI-5): or a pharmaceutically acceptable salt thereof. 54.
- L or L’ comprises one or more optionally substituted C 1 -C 20 alkylene, optionally substituted C 1 -C 20 heteroalkylene, optionally substituted C 2 -C 20 alkenylene, optionally substituted C 2 -C 20 heteroalkenylene, optionally substituted C 2 -C 20 alkynylene, optionally substituted C 2 -C 20 heteroalkynylene, optionally substituted C 3 -C 20 cycloalkylene, optionally substituted C 2 -C 20 heterocycloalkylene, optionally substituted C 4 -C 20 cycloalkenylene, optionally substituted C 4 -C 20 heterocycloalkenylene, optionally substituted C 8 -C 20 cycloalkynylene, optionally substituted C 8 -C 20 heterocycloalkynylene, optionally substituted C 5 -C 15 arylene, optionally substituted C 3 -C 15 heteroarylene, O, S,
- each L is described by formula (M- L): J 1 -(Q 1 )g-(T 1 )h-(Q 2 )i-(T 2 )j-(Q 3 )k-(T 3 )l-(Q 4 )m-(T 4 )n-(Q 5 )o-J 2 wherein J 1 is a bond attached to A 1 ; J 2 is a bond attached to E or a functional group capable of reacting with a functional group conjugated to E (e.g., maleimide and cysteine, amine and activated carboxylic acid (e.g., carboxylic acid activated by tetrafluorophenol or trifluorophenol), thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and alkyne, and alkene and tetrazine); each of Q 1 , Q 2 , Q 3 , Q 4 , and
- each E dimerizes to form an Fc domain.
- 92 The conjugate of any one of embodiments 1-91, wherein each E has the sequence of any one of SEQ ID NOs: 1-95 and 125-153.
- 93 The conjugate of embodiment 92, wherein each E comprises the sequence of SEQ ID NO: 72 or SEQ ID NO: 73.
- 94 A conjugate selected from any one of conjugates 1-121.
- 95 The conjugate of embodiment 94, wherein the conjugate is selected from any one of conjugates 5b, 9b, 12b, 13b, 14b, 15-28, 29a, 29b, 30a, 31-36, 37a, and 38-48.
- 96 The conjugate of embodiment 90, wherein n is 2, and each E dimerizes to form an Fc domain.
- the conjugate of embodiment 94, wherein the conjugate is selected from any one of conjugates 30b, 37b, and 49-76. 97.
- the conjugate of embodiment 94, wherein the conjugate is selected from any one of conjugates 77-121. 98.
- the conjugate of any one of embodiments 1-98, wherein T is 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.
- 100. A population of conjugates of any one of embodiments 1-98, wherein the average value of T is 1 to 10.
- 101. A population of conjugates of embodiment 100, wherein the average value of T is 1 to 5. 102.
- a pharmaceutical composition comprising a conjugate of any one of embodiments 1-99, or a population of conjugates of embodiments 100 or 101, or a pharmaceutically acceptable salt thereof, and a pharmaceutically acceptable excipient.
- a method for the treatment of a subject having a viral infection or presumed to have a viral infection comprising administering to the subject an effective amount of a conjugate of any one of embodiments 1-99, a population of conjugates of embodiments 100 or 101, or a composition of embodiment 102. 104.
- a method for the prophylactic treatment of a viral infection in a subject in need thereof comprising administering to the subject an effective amount of a conjugate of any one of embodiments 1-99, a population of conjugates of embodiments 100 or 101, or a composition of embodiment 102.
- 105. The method of embodiment 103 or 104, wherein the viral infection is caused by human immunodeficiency virus (HIV).
- HIV human immunodeficiency virus
- 106 The method of embodiment 105, wherein the HIV is HIV-1 or HIV-2.
- 107. The method of any one of embodiments 103-106, wherein the subject is immunocompromised. 108.
- the method of embodiment 110, wherein the disease is cancer.
- the method of embodiment 111, wherein the cancer is leukemia, lymphoma, or multiple myeloma. 113.
- any one of embodiments 103-114 wherein the conjugate of composition is administered intramuscularly, intravenously, intradermally, intraarterially, intraperitoneally, intralesionally, intracranially, intraarticularly, intraprostatically, intrapleurally, intratracheally, intranasally, intravitreally, intravaginally, intrarectally, topically, intratumorally, peritoneally, subcutaneously, subconjunctival, intravesicularlly, mucosally, intrapericardially, intraumbilically, intraocularally, orally, locally, by inhalation, by injection, or by infusion. 116.
- the antiviral agent is selected from an integrase inhibitor, a nucleoside reverse transcriptase inhibitor (NRTI), a non-nucleoside reverse transcriptase inhibitor (NNRTI), a protease inhibitor, an inhibitor of viral entry, a CCR5 antagonist, or a CYP3A inhibitor.
- NRTI nucleoside reverse transcriptase inhibitor
- NRTI non-nucleoside reverse transcriptase inhibitor
- protease inhibitor an inhibitor of viral entry, a CCR5 antagonist, or a CYP3A inhibitor.
- the integrase inhibitor is selected from dolutegravir, elvitegravir, or raltegravir. 120.
- nucleoside reverse transcriptase inhibitor is selected from abacavir, lamivudine, zidovudine, emtricitabine, tenofovir, emtricitabine, didanosine, or stavudine.
- NRTI non-nucleoside reverse transcriptase inhibitor
- the method of embodiment 118, wherein the protease inhibitor is selected from atazanavir, cobicistat, darunavir, cobicistat, lopinavir, ritonavir, fosamprenavir, tipranavir, nelfinavir, indinavir, or saquinavir. 123.
- the method of embodiment 118, wherein the inhibitor of viral entry is enfuvirtide.
- the CCR5 antagonist is maraviroc.
- the method of embodiment 118, wherein the CYP3A inhibitor is cobicistat or ritonavir. 126.
- a method of synthesizing a conjugate of any one of embodiments 1-99, a population of conjugates of any one of embodiments 100 or 101, or pharmaceutical composition of embodiment 102 comprising: (a) providing a first composition including E; (b) providing a second composition including a compound of formula (DF-I) or salt thereof: wherein L’ is the remainder of L; m is 0, 1, 2, 3, or 4; and each R is, independently, halo, cyano, nitro, optionally substituted C 1 -C 6 alkyl group, or optionally substituted C 1 -C 6 heteroalkyl group; and (c) combining the first composition, the second composition, and a buffer to form a mixture.
- DF-I formula
- a method of synthesizing a conjugate of any one of embodiments 1-99, a population of conjugates of any one of embodiments 100 or 101, or pharmaceutical composition of embodiment 102 comprising: (a) providing a first composition including E; (b) providing a second composition including a compound of formula (MF-I) or salt thereof: wherein L’ is the remainder of L; m is 0, 1, 2, 3, or 4; and each R is, independently, halo, cyano, nitro, optionally substituted C 1 -C 6 alkyl group, or optionally substituted C 1 -C 6 heteroalkyl group; and (c) combining the first composition, the second composition, and a buffer to form a mixture. 128.
- a method of synthesizing a conjugate of any one of embodiments 1-99, a population of conjugates of any one of embodiments 100 or 101, or pharmaceutical composition of embodiment 102 comprising: (a) providing a first composition including E; (b) providing a second composition including a compound of formula (DF-II) or salt thereof: wherein G is optionally substituted C 1 -C 6 alkylene, optionally substituted C 1 -C 6 heteroalkylene, optionally substituted C 2 -C 6 alkenylene, optionally substituted C 2 -C 6 heteroalkenylene, optionally substituted C 2 -C 6 alkynylene, optionally substituted C 2 -C 6 heteroalkynylene, optionally substituted C 3 -C 10 cycloalkylene, optionally substituted C 2 -C 10 heterocycloalkylene, optionally substituted C 6 -C 10 arylene, or optionally substituted C 2 -C 10 heteroarylene; L’-G-L’’ is the remainder
- a method of synthesizing a conjugate of any one of embodiments 1-99, a population of conjugates of any one of embodiments 100 or 101, or pharmaceutical composition of embodiment 102 comprising: (a) providing a first composition including formula (D-G3-A) or a salt thereof: where G a is a functional group that reacts with G b to form G; (b) providing a second composition including formula (D-G3-B) or a salt thereof: where G b is a functional group that reacts with G a to form G; and (c) combining the first composition and the second composition to form a first mixture, where m is 0, 1, 2, 3, or 4; and each R is, independently, halo, cyano, nitro, optionally substituted C 1 -C 6 alkyl group, or optionally substituted C 1 -C 6 heteroalkyl group.
- a method of synthesizing a conjugate of any one of embodiments 1-99, a population of conjugates of any one of embodiments 100 or 101, or pharmaceutical composition of embodiment 102 comprising: (a) providing a first composition including formula (M-G3-A) or a salt thereof: where G a is a functional group that reacts with G b to form G; (b) providing a second composition including formula (M-G3-B) or a salt thereof: where G b is a functional group that reacts with G a to form G; and (c) combining the first composition and the second composition to form a first mixture.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Engineering & Computer Science (AREA)
- Veterinary Medicine (AREA)
- Medicinal Chemistry (AREA)
- Pharmacology & Pharmacy (AREA)
- Chemical & Material Sciences (AREA)
- Animal Behavior & Ethology (AREA)
- General Health & Medical Sciences (AREA)
- Public Health (AREA)
- Epidemiology (AREA)
- Immunology (AREA)
- Cell Biology (AREA)
- Virology (AREA)
- Tropical Medicine & Parasitology (AREA)
- Molecular Biology (AREA)
- AIDS & HIV (AREA)
- Communicable Diseases (AREA)
- Oncology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Organic Chemistry (AREA)
- Medicinal Preparation (AREA)
- Peptides Or Proteins (AREA)
Abstract
Compositions and methods for the treatment of viral infections include conjugates containing inhibitors of viral gp120 receptor (e.g., temsavir, BMS-818251, DMJ-ll-121, BNM-IV-147, or analogs thereof) linked to an Fc monomer, an Fc domain, and Fc-binding peptide, an albumin protein, or albumin- binding peptide. In particular, conjugates can be used in the treatment of viral infections (e.g., HIV infections).
Description
COMPOSITIONS AND METHODS FOR THE TREATMENT OF HUMAN IMMUNODEFICIENCY VIRUS Sequence Listing The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on December 16, 2021, is named 50945-069WO5_Sequence_Listing_12_16_21_ST25 and is 323,625 bytes in size. Background The need for novel antiviral treatments for human immunodeficiency virus (HIV) is significant and especially critical in the medical field. Since the first case of HIV was identified over 30 years ago, 78 million people have become infected with HIV and 35 million have died from acquired immune deficiency syndrome (AIDS)-related illnesses. Currently, 36.9 million people worldwide are living with HIV or AIDS, including 1.8 million children under the age of 15. An estimated 1.8 million individuals worldwide became newly infected with HIV in 2017. The development of antiviral treatments for HIV has been a continuing challenge. Since the first U.S. FDA approved anti-HIV drugs in 1987, a series of antiretroviral therapies have been developed. However, drug-resistant strains have emerged limiting the number of patients that can use these anti- retroviral therapies. HIV antiviral inhibitors come in many several classes targeting distinct steps of the HIV cycle. One class of antivirals, nucleoside reverse transcriptase inhibitors (NRTIs) inhibit viral replication by chain termination after being incorporated into growing DNA strands by HIV reverse transcriptase. Another class, non-nucleoside reverse transcription inhibitors (NNRTIs), similarly target reverse transcription, however at a different site than nucleoside reverse transcription inhibitors. A different class of antivirals, integrase inhibitors, inhibit viral DNA insertion into the host cellular genome. Protease inhibitors are agents that inhibit the protease enzyme, a key enzyme in the assembly of new virus particles. One class of antivirals, known as viral entry inhibitors, contains agents that interfere in viral entry into the cell by binding to HIV envelope (Env) glycoprotein. In particular, viral entry inhibitors target the surface subunit gp120 receptor of the HIV virus. However, many of these agents are secreted or cleared by the kidney, requiring dose adjustments in those with compromised kidney function, and they have drug-drug interactions that may increase the effect of adverse reactions, particularly in HIV positive individuals undergoing organ transplant. Furthermore, many of these agents have been shown to be directly nephrotoxic, inducing a variety of kidney disorders. New, more effective therapies for treating HIV are needed. Summary The disclosure relates to conjugates, compositions, and methods for inhibiting viral growth, methods for the treatment of viral infections, and methods of synthesizing conjugates. In particular, such conjugates contain monomers or dimers of a moiety that inhibits human immunodeficiency virus, for example by binding to the gp120 glycoprotein (e.g., a gp120 binder such as temsavir, BMS-818251, DMJ-
II-121, BNM-IV-147, or analogs thereof), conjugated to Fc monomers, Fc domains, Fc-binding peptides, albumin proteins, or albumin protein-binding peptides. In preferred embodiments, the HIV targeting moiety (e.g., temsavir, BMS-818251, DMJ-II-121, BNM-IV-147, or analogs thereof) in the conjugate targets a protein encoded by the HIV Env gene, in particular gp120 glycoprotein on the surface of the viral particle, thereby preventing viral attachment to the host CD4+ T cell and entry into the host immune cell. The Fc monomers or Fc domains in the conjugates bind to FcγRs (e.g., FcRn, FcγRI, FcγRIIa, FcγRIIc, FcγRIIIa, and FcγRIIIb) on immune cells, e.g., neutrophils, to activate phagocytosis and effector functions, such as antibody-dependent cell-mediated cytotoxicity (ADCC), thus leading to the engulfment and destruction of viral particles by immune cells and further enhancing the antiviral activity of the conjugates. The albumin or albumin-binding peptide may extend the half-life of the conjugate, for example, by binding of albumin to the recycling neonatal Fc receptor. Such compositions are useful in methods for the inhibition of viral growth and in methods for the treatment of viral infections, such as those caused by an HIV-1 and HIV-2. In a first aspect, the disclosure features a conjugate described by any one of formulas (D-I), (M-I), (1), or (2):
wherein each A1 and each A2 is independently described by formula (A-I) or (A-II):
wherein Q is selected from the group consisting of:
S is selected from the group consisting of:
; R1, R2, R3, are each independently selected from H, OH, halogen, nitrile, nitro, optionally substituted amine, optionally substituted sulfhydryl, optionally substituted carboxyl, optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C3-C20 cycloalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C5-C20 aryl, optionally substituted C3-C15 heteroaryl, and optionally substituted C1-C20 alkoxy; R4 is selected from optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, optionally substituted C3- C15 heteroaryl, and a bond; R5 is selected from H or optionally substituted C1-C6 alkyl; R6 is selected from optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C3-C15 heteroaryl; R7 and Y are each independently selected from
each R8 is independently selected from H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 alkylene, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C2-C15 heteroaryl; each R9 is independently selected from optionally substituted C1-C20 alkylene, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C2-C15 heteroaryl; x is 1 or 2; k is 0, 1, 2, 3, 4, or 5; Ar is selected from the group consisting of optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C3-C15 heteroaryl; n is 1 or 2; each E comprises an Fc domain monomer, an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide; L is a linker covalently attached to E and to each Y of each A1 and/or A2; T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20) (e.g., T is the total number of A1-L-A2 or A1-L moieties conjugated to (E)n); and
each squiggly line in formulas (D-I), (M-I), (1), or (2) indicates that L is covalently attached to E, or a pharmaceutically acceptable salt thereof. When T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1-L or each A1-L-A2 may be independently selected (e.g., independently selected from any of the A1-L or A1-L-A2 structures described herein). In some embodiments, n is 1 and each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide. In some embodiments, n is 2 and each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153), wherein the Fc domain monomers dimerize to form and Fc domain. In preferred embodiments of any of the aspects described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)), when A1 and/or A2 are selected from a structure described by (A-I), x is 2. In preferred embodiments of any of the aspects described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)), when A1 and/or A2 are selected from a structure described by (A-II), x is 2. In preferred embodiments of any of the aspects described herein, n is 2 and each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153). In a conjugate having two Fc domain monomers (e.g., a conjugate of formula (1), formula (2), formula (D-I) where n equals 2, or (M-I) where n equals 2), the Fc domain monomers dimerize to form an Fc domain. In certain embodiments, each A1 and each A2 is independently described by any one of formulas (A-Ia)-(A-Ih):
wherein each X is independently C or N; or a pharmaceutically acceptable salt thereof. In further embodiments, each A1 and each A2 is independently described by any one of formulas (A-Ia-i)-(A-Ih-i):
or a pharmaceutically acceptable salt thereof. In further embodiments, each A1 and each A2 is independently described by any one of formulas (A-Ia-ii)-(A-Ih-ii):
or a pharmaceutically acceptable salt thereof. In some embodiments, each A1 and each A2 is independently described by any one of formulas (A-Ic-i) and (A-Ic-ii):
or a pharmaceutically acceptable salt thereof. In some embodiments, each A1 and each A2 is independently described by any one of formulas (A-Ii)-(A-Ip):
wherein each X is independently C or N;
or a pharmaceutically acceptable salt thereof. In another embodiment, each A1 and each A2 is independently described by any one of formulas (A-Iq)-(A-Ix):
or a pharmaceutically acceptable salt thereof. In some embodiments, each A1 and each A2 is independently described by any one of formulas (A-Iq-i)-(A-Ix-i):
or a pharmaceutically acceptable salt thereof. In yet another embodiment, each A1 and each A2 is independently described by any one of formulas (A-Iaa)-(A-Ihh):
or a pharmaceutically acceptable salt thereof. In some embodiments, each A1 and each A2 is independently described by any one of formulas (A-Iii)-(A-Ipp):
wherein each X is independently C or N; or a pharmaceutically acceptable salt thereof. In another embodiment, each A1 and each A2 is independently described by any one of formulas (A-Iii-i)-(A-Ipp-i):
or a pharmaceutically acceptable salt thereof. In some embodiments, R1 is H. In certain embodiments, R2 is H. In preferred embodiments, R2 is -OCH3. In particular embodiments, R3 is H. In another embodiment, R4 is H. In some embodiments, R5 is H. In particular embodiments, R7 is a carbonyl. In certain embodiments, X is N. In other embodiments, X is C. In certain embodiments, each A1 and each A2 is independently described by any one of formulas (A-IIa)-(A-IId):
In another embodiment, each A1 and each A2 is independently described by any one of formulas (A-IIa-i)-(A-IId-i):
wherein U5 is C1-C10 alkyl; or a pharmaceutically acceptable salt thereof. In another aspect, the disclosure features a conjugate described by formula (D-I):
wherein each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153); L in each A1-L-A2 is a linker covalently attached to a sulfur atom of a cysteine or a nitrogen atom of a lysine in E and to each of A1 and A2; n is 1 or 2 (e.g., when n is 2, the two Fc domain monomers dimerize to form and Fc domain); T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), and the squiggly line connected to the E indicates that each A1-L-A2 is covalently attached (e.g., by way of a covalent bond or linker) to a sulfur atom of a hinge cysteine in E, or a pharmaceutically acceptable salt thereof. When T is greater than 1 (e.g., T is
2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1-L-A2 may be independently selected (e.g., independently selected from any of the A1-L-A2 structures described herein). In another aspect, the disclosure features a conjugate described by formula (M-I):
wherein each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153); L in each L-A1 is a linker covalently attached to a sulfur atom of a cysteine or a nitrogen atom of a lysine in E and to A1; n is 1 or 2 (e.g., when n is 2, the two Fc domain monomers dimerize to form and Fc domain); T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20); and the squiggly line connected to E indicates that each L-A1 is covalently attached (e.g., by way of a covalent bond or linker) to the sulfur atom of the hinge cysteine in E, or a pharmaceutically acceptable salt thereof. When T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1 may be independently selected from any structure described by formula (A-I). In some embodiments of any of the aspects described herein each E includes an Fc domain monomer. In some embodiments of any of the aspects described herein, each E includes an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153. In some embodiments of any of the conjugates described herein, the conjugate forms a homodimer including an Fc domain. In some embodiments of the conjugates described herein, E homodimerizes with another E to form an Fc domain. In some embodiments, each E includes an albumin protein having the sequence of any one of SEQ ID NOs: 96-98. In some embodiments, T is 1 and L-A1 is covalently attached to the sulfur atom corresponding to Cys34 of SEQ ID NO: 96. In another aspect, the disclosure features an intermediate (Int) of Table 1. These intermediates comprise one or more gp120 binders and a linker (e.g., a PEG2-PEG20 linker) and may be used in the synthesis of a conjugate described herein. Intermediates of Table 1 may be conjugated to, for example, an Fc domain or Fc domain monomer, albumin protein, albumin protein-binding peptide, or Fc-binding peptide (e.g., by way of a linker) by any suitable methods known to those of skill in the art, including any of the methods described or exemplified herein. In some embodiments, the conjugate (e.g., a conjugate described by any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) includes E, wherein E is an Fc domain monomer or an Fc domain (e.g., an Fc domain monomer or an Fc domain, each Fc domain monomer having, independently, the sequence of any one of SEQ ID NOs: 1-95 and 125-153). In preferred embodiments, one or more nitrogen atoms of one or more surface exposed lysine residues of E or one or more sulfur atoms of one or more surface exposed cysteines in E is covalently conjugated to a linker (e.g., a PEG2-PEG20 linker). The linker conjugated to E may be functionalized such that it may react to form a covalent bond with any of the Ints described herein (e.g., an Int of Table 1). In preferred
embodiments, E is conjugated to a linker functionalized with an azido group and the Int (e.g., an Int of Table 1) is functionalized with an alkyne group. Conjugation (e.g., by click chemistry) of the linker-azido of E and linker-alkyne of the Int forms a conjugate of the invention, for example a conjugate described by any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII). In yet other embodiments, E is conjugated to a linker functionalized with an alkyne group and the Int (e.g., an Int of Table 1) is functionalized with an azido group. Conjugation (e.g., by click chemistry) of the linker-alkyne of E and the linker-azido of the Int forms a conjugate of the invention, for example a conjugate described by any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII). In yet other embodiments, the Int (e.g., an Int of Table 1) is functionalized with a phenyl ester group (e.g., a trifluorophenyl ester group or a tetrafluorophenyl ester group). Conjugation (e.g., by acylation) of E and the linker-phenyl ester (e.g., trifluorophenyl ester or tetrafluorophenyl ester) of the Int forms a conjugate of the invention, for example a conjugate described by any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII). The disclosure further features a composition (e.g., a pre-conjugation intermediate) having the structure of any one of Int-1 to Int-190 (e.g., Int-1 to Int-93, Int-1 to Int-27, Int-28 to Int-93, Int-94 to Int- 141, Int-142 to Int-190 or Int-28 to Int-190). In some embodiments, the disclosure features a composition (e.g., a pre-conjugation intermediate) having the structure of any one of Int-28 to Int-190 (e.g., Int-28 to Int-93, Int-94 to Int-141, Int-142 to Int-190 or Int-28 to Int-190). The disclosure provides a method of making an antiviral-Fc conjugate by conjugating (e.g., via a linker) any one of Int-1 to Int-190 (e.g., Int-1 to Int-93, Int-1 to Int-27, Int-28 to Int-93, Int-94 to Int-141, Int- 142 to Int-190, or Int-28 to Int-190) to an Fc domain monomer or an Fc domain. In some embodiments, the disclosure features a conjugate, wherein the conjugate comprises a small molecule targeting agent, wherein the targeting agent is described by any one of Int-1 to Int-190 (e.g., Int-1 to Int-93, Int-1 to Int-27, Int-28 to Int-93, Int-94 to Int-141, Int-142 to Int-190, or Int-28 to Int-190), which is conjugated to an Fc (e.g., via a linker). Table 1: Intermediates
In another aspect, the disclosure features a conjugate of Table 2. Each conjugate of Table 2 corresponds to a conjugate of either formula (M-I) or formula (D-I), as indicated. Conjugates of Table 2 include conjugates formed by the covalent reaction of an Int of Table 1 with a linker which is in turn conjugated to E (e.g., an Fc domain monomer, an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide). In some embodiments, the reactive moiety of the Int (e.g., the alkyne or azido group) reacts with a corresponding reactive group (e.g., an alkyne or azido) of a linker (represented by L’) covalently attached to E, such that an Int of Table 1 is covalently attached to E. In some embodiments, the reactive moiety of the Int (e.g., the phenyl ester group, e.g., tetrafluorophenyl ester or trifluorophenyl ester group) reacts with a corresponding reactive group (e.g., a nitrogen or sulfur atom) of an amino acid side chain of E, such that an Int of Table 1 is covalently attached to E. As represented in Table 2, L’ corresponds to the remainder of L as defined in (M-I) or (D-I) (e.g., L’ is a linker that covalently joins the Int and E). For example, L’ may include a triazole (formed by the click chemistry reaction between the Int and a linker conjugated to E) and a linker (e.g., a PEG2-PEG20 linker) which in turn is conjugated to an amino acid side chain of E. In some embodiments in any conjugate of Table 2, n is 1 or 2. When n is 1, each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide. When n is 2, each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153), and the Fc domain monomers dimerize to form and Fc domain. In some embodiments in any conjugate of Table 2, T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20). The disclosure also provides a population of any of the conjugates of Table 2 wherein the average value of T is 1 to 20 (e.g., the average value of T is 1 to 2, 1 to 3, 1 to 4, 1 to 5, 5 to 10, 10 to 15, or 15 to 20). In some embodiments, the average value of T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments, the average T is 1 to 10 (e.g., 1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5, 8, 8.5, 9, 9.5, or 10). In certain embodiments, the average T is 1 to 5 (e.g., 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, or 5). In some embodiment, the average T is 5 to 10 (e.g., 5, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5,
8.6, 8.7, 8.8, 8.9, 9, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10). In some embodiments, the average T is 2.5 to 7.5 (e.g., 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7, 7.1, 7.2, 7.3, 7.4, or 7.5). The squiggly line in the conjugates of Table 2 indicates that each L’-Int is covalently attached to an amino acid side chain in E (e.g., the nitrogen atom of a surface exposed lysine or the sulfur atom of a surface exposed cysteine in E), or a pharmaceutically acceptable salt thereof. The disclosure further features a conjugate of Table 2, wherein the conjugate is produced by conjugation (e.g., via a linker) of any one of Int-1 to Int-190 to an Fc domain or an Fc domain monomer. Table 2: Conjugates Corresponding to Intermediates of Table 1
In some embodiments, each E includes an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153. In another aspect, the disclosure features a conjugate including (i) a first moiety, A1; (ii) a second moiety, A2; (iii) an Fc domain monomer or an Fc domain; and (iv) a linker covalently attached to A1 and A2, and to the Fc domain monomer or the Fc domain; wherein each A1 and each A2 is independently selected from any structure described by formula (A-I) or (A-II). In a preferred embodiment of the above, x is 2.
In another aspect, the disclosure features a conjugate including (i) a first moiety, Int; (ii) an Fc domain monomer or an Fc domain; and (iv) a linker covalently attached to Int, and to the Fc domain monomer or the Fc domain; wherein each Int is independently selected from any one of the intermediates of Table 1. In another aspect, the disclosure features a conjugate including (i) a first moiety, A1; (ii) a second moiety, A2; (iii) an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide; and (iv) a linker covalently attached to A1 and A2, and to the Fc domain monomer or the Fc domain; wherein each A1 and each A2 is independently selected from any structure described by formula (A-I) or (A-II). In a preferred embodiment of the above, x is 2. In another aspect, the disclosure features a conjugate described by formula (D-I):
( ) wherein each A1 and each A2 is independently described by formula (A-I) or (A-II); each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide; n is 1 or 2; T is an integer from 1 to 20 (e.g., T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20) (e.g., T is the total number of A1-L-A2 moieties conjugated to (E)n); and L is a linker covalently attached to each of E, A1, and A2, or a pharmaceutically acceptable salt thereof. When T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1-L-A2 may be independently selected (e.g., independently selected from any of the A1-L-A2 structures described herein). In a preferred embodiment of the above, x is 2. In some embodiments, the conjugate is described by formulas (e.g., (D-I)-(D-XVII)) found in WO 2020/252393.
In some embodiments, the conjugate is described by formula (D-IV-7):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (D-IV-8):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (D-IV-9):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (D-V-7):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (D-V-8):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (D-V-9):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (D-VI-8):
wherein L’ is the remainder of L, or a pharmaceutically acceptable salt thereof. In some embodiments of any of the aspects described herein, L or L’ includes one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, optionally substituted C3-C15 heteroarylene, O, S, NRi,
, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino, wherein each Ri is, independently, H, optionally substituted C1-C20 alkyl, optionally substituted C1- C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3- C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl. In some embodiments of any of the aspects described herein, the backbone of L or L’ consists of one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2- C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, optionally substituted
C3-C15 heteroarylene, O, S, NRi,
, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino, wherein each Ri is, independently, H, optionally substituted C1-C20 alkyl, optionally substituted C1- C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3- C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl. In some embodiments of any of the aspects described herein, L or L’ is oxo substituted. In some embodiments, the backbone of L or L’ includes no more than 250 atoms. In some embodiments, the backbone of L or L’ includes between 1 and 250 atoms (e.g., between 5 and 250 atoms, between 10 and 250 atoms, between 50 and 250 atoms, between 10 and 200 atoms, between 10 and 100 atoms, between 10 and 50 atoms, between 5 and 100 atoms, between 5 and 50 atoms, between 5 and 30 atoms). In some embodiments, L or L’ is capable of forming an amide, a carbamate, a sulfonyl, or a urea linkage. In some embodiments L or L’ is a bond. In some embodiments, L or L’ is an atom. In some embodiments of any of the aspects described herein, each L is described by formula (D- L-I):
wherein LA is described by formula GA1-(ZA1)g1-(YA1)h1-(ZA2)i1-(YA2)j1-(ZA3)k1-(YA3)l1-(ZA4)m1-(YA4)n1-(ZA5)o1- GA2; LB is described by formula GB1-(ZB1)g2-(YB1)h2-(ZB2)i2-(YB2)j2-(ZB3)k2-(YB3)l2-(ZB4)m2-(YB4)n2-(ZB5)o2-GB2; LC is described by formula GC1-(ZC1)g3-(YC1)h3-(ZC2)i3-(YC2)j3-(ZC3)k3-(YC3)l3-(ZC4)m3-(YC4)n3-(ZC5)o3-GC2; GA1 is a bond attached to Qi; GA2 is a bond attached to A1; GB1 is a bond attached to Qi); GB2 is a bond attached to A2; GC1 is a bond attached to Qi; GC2 is a bond attached to E or a functional group capable of reacting with a functional group conjugated to E (e.g., maleimide and cysteine, amine and activated carboxylic acid, thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and alkyne, and alkene and tetrazine); each of ZA1, ZA2, ZA3, ZA4, ZA5, ZB1, ZB2, ZB3, ZB4, ZB5, ZC1, ZC2, ZC3, ZC4, and ZC5 is, independently, optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C3-C15 heteroarylene; each of YA1, YA2, YA3, YA4, YB1, YB2, YB3, YB4, YC1, YC2, YC3, and YC4 is, independently, O, S, NRi,
, P, carbonyl, thiocarbonyl, sulfonyl, phosphate,
phosphoryl, or imino; each Ri is, independently, H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl; each of g1, h1, i1, j1, k1, l1, m1, n1, o1, g2, h2, i2, j2, k2, l2, m2, n2, o2, g3, h3, i3, j3, k3, l3, m3, n3, and o3 is, independently, 0 or 1; Qi is a nitrogen atom, optionally substituted C1- C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2- C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C3-C15 heteroarylene. In some embodiments, optionally substituted includes substitution with a polyethylene glycol (PEG). A PEG has a repeating unit structure (-CH2CH2O-)n, wherein n is an integer from 2 to 100. A polyethylene glycol may be selected any one of PEG2 to PEG100 (e.g., PEG2, PEG3, PEG4, PEG5, PEG5-PEG10, PEG10-PEG20, PEG20- PEG30, PEG30-PEG40, PEG50-PEG60, PEG60-PEG70, PEG70-PEG80, PEG80- PEG90, PEG90-PEG100). In some embodiments, LC may have two points of attachment to the Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide (e.g., two GC2). In some embodiments of any of the aspects described herein, L includes a polyethylene glycol (PEG) linker. A PEG linker includes a linker having the repeating unit structure (-CH2CH2O-)n, wherein n is an integer from 2 to 100. A polyethylene glycol linker may covalently join a gp120 binder and E (e.g., in a conjugate of any one of formulas (M-I)-(M-XVII)). A polyethylene glycol linker may covalently join a first gp120 binder and a second gp120 binder (e.g., in a conjugate of any one of formulas (D-I)-(D-XVII)). A polyethylene glycol linker may covalently join a gp120 binder dimer and E (e.g., in a conjugate of any one of formulas (D-I)-(D-XVII)). A polyethylene glycol linker may be selected from any one of PEG2 to PEG100 (e.g., PEG2, PEG3, PEG4, PEG5, PEG5-PEG10, PEG10-PEG20, PEG20-PEG30, PEG30-PEG40, PEG50-PEG60, PEG60-PEG70, PEG70-PEG80, PEG80-PEG90, PEG90-PEG100). In some embodiments, Lc includes a PEG linker, where LC is covalently attached to each of Qi and E. In another aspect, the disclosure features a conjugate described by formula (M-I):
wherein each A1 is independently described by formula (A-I); each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153), an
albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide; n is 1 or 2; T is an integer from 1 to 20 (e.g., T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20) (e.g., T is the total number of A1-L moieties conjugated to (E)n); and L is a linker covalently attached to each of E and A1, or a pharmaceutically acceptable salt thereof. When T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1 may be independently selected from any structure described by formula (A-I). In a preferred embodiment of the above, x is 2. In some embodiments, the conjugate is described by formula (M-II):
wherein X is C, O, or N, or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-III):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-III-1):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-III-2):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-III-3):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-III-4):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-III-5):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-III-6):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-IV):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-IV-1):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-IV-2):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-IV-3):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-IV-4):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-IV-5):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-IV-6):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-IV-7):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-IV-8):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-IV-9):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-V):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-V-1):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-V-2):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-V-3):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-V-4):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-V-5):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-V-6):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-V-7):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-V-8):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-V-9):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-VI):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-VI-1):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-VI-2):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-VI-3):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-VI-4):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-VI-5):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-VI-6):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-VI-7):
wherein L’ is the remainder of L, or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-VI-8):
wherein L’ is the remainder of L, or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-VII):
wherein X is C, O, or N, or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-VIII):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-VIII-1):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-IX):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-IX-1):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-X):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-X-1):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-XI):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XI-1):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 (e.g., y1 is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-XII):
wherein X is C, O, or N, or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XII-1):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-XII-2):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XIII):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XIII-1):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-XIII-2):
or a pharmaceutically acceptable salt thereof. In some embodiments, the disclosure features a conjugate described by formula (M-I):
wherein each A1 is independently described by formula (A-II); each E comprises an Fc domain monomer; the squiggly line connected to the E indicates that each A1-L-A2 is covalently attached to E, or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XIV):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-XIV-1):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XIV-2):
wherein L’ is the remainder of L and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-XIV-3):
wherein L’ is the remainder of L; e1 is an integer from 1-10; and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XIV-4):
wherein L’ is the remainder of L; e1 and e3 are each independently an integer from 1-10; and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-XIV-5):
wherein L’ is the remainder of L; e1 and e3 are each independently an integer from 1-10; and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XV):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-XV-1):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XV-2):
wherein L’ is the remainder of L and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-XV-3):
wherein L’ is the remainder of L; e1 is an integer from 1-10; and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XV-4):
wherein L’ is the remainder of L; e1 and e3 are each independently an integer from 1-10; and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-XV-5):
wherein L’ is the remainder of L; e1 and e3 are each independently an integer from 1-10; and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XVI):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-XVI-1):
wherein U5 is C1-C10 alkyl, or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XVI-2):
wherein L’ is the remainder of L and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-XVI-3):
wherein L’ is the remainder of L; e1 is an integer from 1-10; and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XVI-4):
wherein L’ is the remainder of L; e1 and e3 are each independently an integer from 1-10; y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-XVI-5):
wherein L’ is the remainder of L; e1 and e3 are each independently an integer from 1-10; y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XVII):
or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-XVII-1):
or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XVII-2):
wherein L’ is the remainder of L and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-XVII-3):
wherein L’ is the remainder of L; e1 is an integer from 1-10; and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. In some embodiments, the conjugate is described by formula (M-XVII-4):
wherein L’ is the remainder of L; e1 and e3 are each independently an integer from 1-10; and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
In some embodiments, the conjugate is described by formula (M-XVII-5):
wherein L’ is the remainder of L; e1 and e3 are each independently an integer from 1-10; and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. In some embodiments of any of the aspects described herein, L or L’ includes one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, optionally substituted C3-C15 heteroarylene, O, S, NRi,
, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino, wherein each Ri is, independently, H, optionally substituted C1-C20 alkyl, optionally substituted C1- C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3- C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl. In some embodiments of any of the aspects described herein, the backbone of L or L’ consists of one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally
substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2- C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, optionally substituted C3-C15 heteroarylene, O, S, NRi,
, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino, wherein each Ri is, independently, H, optionally substituted C1-C20 alkyl, optionally substituted C1- C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3- C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl. In some embodiments of any of the aspects described herein, L or L’ is oxo substituted. In some embodiments, the backbone of L or L’ includes no more than 250 atoms. In some embodiments, the backbone of L or L’ includes between 1 and 250 atoms (e.g., between 5 and 250 atoms, between 10 and 250 atoms, between 50 and 250 atoms, between 10 and 200 atoms, between 10 and 100 atoms, between 10 and 50 atoms, between 5 and 100 atoms, between 5 and 50 atoms, between 5 and 30 atoms). In some embodiments, L or L’ is capable of forming an amide, a carbamate, a sulfonyl, or a urea linkage. In some embodiments, L or L’ is a bond. In some embodiments, L or L’ is an atom. In some embodiments, L’ is a nitrogen atom. In some embodiments, each L is described by formula (M-L): J1-(Q1)g-(T1)h-(Q2)i-(T2)j-(Q3)k-(T3)l-(Q4)m-(T4)n-(Q5)o-J2 wherein J1 is a bond attached to A1; J2 is a bond attached to E or a functional group capable of reacting with a functional group conjugated to E (e.g., maleimide and cysteine, amine and activated carboxylic acid (e.g., carboxylic acid activated by tetrafluorophenol or trifluorophenol), thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and alkyne, and alkene and tetrazine); each of Q1, Q2, Q3, Q4, and Q5 is, independently, optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C2-C15 heteroarylene; each of T1, T2, T3, T4 is, independently, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino; Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-
C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C2-C15 heteroaryl; and each of g, h, i, j, k, l, m, n, and o is, independently, 0, 1, or 2; or a pharmaceutically acceptable salt thereof. In some embodiments, each of g, h, i, j, k, l, m, n, and o is, independently, 0 or 1. In some embodiments, each L is described by formula (M-L-I): J1-(Q1)g-(T1)h-(Q2)i-(T2)j-(Q3)k-(T3)l-(Q4)m-(T4)n-(Q5)o-J2 wherein: J1 is a bond attached to A1; J2 is a bond attached to E or a functional group capable of reacting with a functional group conjugated to E (e.g., maleimide and cysteine, amine and activated carboxylic acid (e.g., carboxylic acid activated by tetrafluorophenol or trifluorophenol), thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and alkyne, and alkene and tetrazine); each of Q1, Q2, Q3, Q4, and Q5 is, independently, optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C2-C15 heteroarylene; each of T1, T2, T3, T4 is, independently, O, S, NRi,
, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino; each Ri is, independently, H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2- C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C2-C15 heteroaryl; and each of g, h, i, j, k, l, m, n, and o is, independently, 0 or 1, or a pharmaceutically acceptable salt thereof. In some embodiments, optionally substituted includes substitution with a polyethylene glycol (PEG). A PEG has a repeating unit structure (-CH2CH2O-)n, wherein n is an integer from 2 to 100. A polyethylene glycol may be selected from any one of PEG2 to PEG100 (e.g., PEG2, PEG3, PEG4, PEG5, PEG5-PEG10, PEG10-PEG20, PEG20-PEG30, PEG30-PEG40, PEG50-PEG60, PEG60-PEG70, PEG70-PEG80, PEG80-PEG90, PEG90-PEG100). In some embodiments, J2 may have two points of attachment to the Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide (e.g., two J2). In some embodiments, L is
,
wherein d is an integer from 1 to 20 (e.g., d is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20). In some embodiments, L is
, , ,
, wherein each Y is independently selected from (-O-), (-S-), (-R8-), (-O(C=O)NR8-), (-O(C=S)NR8-), (-O(C=O)O-), (-O(C=O)-), (-NH(C=O)O-), (-NH(C=O)-), (-NH(C=NH)-), (-NH(C=O)NR8-), (-NH(C=NH)NR8-), (-NH(C=S)NR8-), (-NH(C=S)-), (-OCH2(C=O)NR8-), (-NH(SO2)-), (-NH(SO2)NR8-), (-OR9-), (-NR9-), (-SR9-), (-R9NH(C=O)-), (-R9OR9C(=O)NH-), (-CH2NH(C=O)-), (-CH2OCH2(C=O)NH-), (-(C=NR8)NH-), (-NH(SO2)-), (-(C=O)NH-), (-C(=O)-), (-C(NR8)-), or (-R9C(=O)-); each R8 is independently selected from H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 alkylene, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C2-C15 heteroaryl; each R9 is independently selected from optionally substituted C1-C20 alkylene, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C2-C15 heteroaryl; and each of d, e, y1, and x1 is, independently, an integer from 1 to 26 (e.g., d is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or 26). In some embodiments, g is 0 or 1; h is 0; i is 0 or 1; j is 0 or 1; k is 0 or 1; l is 0; m is 0 or 1; n is 0; and o is 0 or 1. In some embodiments, g is 1; h is 0; i is 1; j is 0; k is 1; l is 0; m is 1; n is 0; and o is 1. In some embodiments, g is 0; h is 0; i is 1; j is 0; k is 1; l is 0; m is 1; n is 0; and o is 1. In some embodiments, g is 1; h is 0; i is 0; j is 0; k is 1; l is 0; m is 1; n is 0; and o is 1. In some embodiments, g is 0; h is 0; i is 1; j is 1; k is 1; l is 0; m is 1; n is 0; and o is 1. In some embodiments, L is a linker described by formula (M-L-Ia): J1-Q1-Q2-Q3-Q4-Q5-J2. In some embodiments, L is a linker described by formula (M-L-Ib): J1-Q2-Q3-Q4-Q5-J2. In some embodiments, L is a linker described by formula (M-L-Ic): J1-Q1-Q3-Q4-Q5-J2. In some embodiments, L is a linker described by formula (M-L-Id): J1-Q2-T2-Q3-Q4-Q5-J2.
In some embodiments, Q1 is optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, or optionally substituted C2-C20 heteroalkynylene. In some embodiments, Q1 is optionally substituted C1-C20 alkylene or optionally substituted C1-C20 heteroalkylene. In some embodiments, Q1 is optionally substituted C1-C20 alkylene. In some embodiments, Q1 is optionally substituted C1-C20 heteroalkylene. In some embodiments, Q2 is optionally substituted C3-C20 cycloalkylene, optionally substituted C2- C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C2-C15 heteroarylene. In some embodiments, Q2 is optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, or optionally substituted C8-C20 heterocycloalkynylene. In some embodiments, Q2 is optionally substituted C3-C20 cycloalkylene or optionally substituted C2-C20 heterocycloalkylene. In some embodiments, Q2 is optionally substituted C3-C20 cycloalkylene. In some embodiments, Q2 is optionally substituted C2-C20 heterocycloalkylene. In some embodiments, Q2 is a five-membered C2-C4 heterocycloalkylene or six- membered C2-C5 heterocycloalkylene. In some embodiments, Q3 is optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, or optionally substituted C2-C20 heteroalkynylene. In some embodiments, Q3 is optionally substituted C1-C20 alkylene or optionally substituted C1-C20 heteroalkylene. In some embodiments, Q3 is optionally substituted C1-C20 alkylene. In some embodiments, Q3 is optionally substituted C1-C20 heteroalkylene. In some embodiments, Q3 is
, where e4 is an integer from 1 to 10. In some embodiments, Q4 is optionally substituted C5-C15 arylene or optionally substituted C2-C15 heteroarylene. In some embodiments, Q4 is optionally substituted C5-C15 arylene. In some embodiments, Q4 is optionally substituted C2-C15 heteroarylene. In some embodiments, Q4 is a five-membered optionally substituted C2-C4 heteroarylene. In some embodiments, Q4 is a five-membered C2-C4 heteroarylene. In some embodiments, Q4 is a five- membered C2 heteroarylene. In some embodiments, Q5 is optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, or optionally substituted C2-C20 heteroalkynylene. In some embodiments, Q5 is optionally substituted C1-C20 alkylene or optionally substituted C1-C20 heteroalkylene. In some embodiments, Q5 is optionally substituted C1-C20 alkylene. In some embodiments, Q5 is optionally substituted C1-C20 heteroalkylene.
In some embodiments, Q5 is , where e5 is an integer from 1 to 8;
and e6 is an integer from 1 to 16. In some embodiments of any of the aspects described herein, L includes (e.g., a portion of L that joins A1 and E includes):
, ,
, where each e is, independently, an integer from 1 to 20. In some embodiments of any of the aspects described herein, L includes (e.g., a portion of L that joins A1 and E includes):
, where each e is, independently, an integer from 1 to 20. In some embodiments of any of the aspects described herein, L is: ,
,
,
,
where each e is, independently, an integer from 1 to 20. In some embodiments of any of the aspects described herein, L includes a polyethylene glycol (PEG) linker. A PEG linker includes a linker having the repeating unit structure (-CH2CH2O-)n, wherein n is an integer from 2 to 100. A polyethylene glycol linker may covalently join a gp120 binder and E (e.g., in a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)). A polyethylene glycol linker may covalently join a first gp120 binder and a second gp120 binder (e.g., in a conjugate of any one
of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)). A polyethylene glycol linker may covalently join a gp120 binder dimer and E (e.g., in a conjugate of any one of formulas). A polyethylene glycol linker may be selected from any one of PEG2 to PEG100 (e.g., PEG2, PEG3, PEG4, PEG5, PEG5-PEG10, PEG10- PEG20, PEG20-PEG30, PEG30-PEG40, PEG50-PEG60, PEG60-PEG70, PEG70-PEG80, PEG80-PEG90, PEG90- PEG100). In some embodiments, Lc includes a PEG linker, where LC is covalently attached to each of Qi and E. In some embodiments of any of the aspects described herein, L is covalently attached to the nitrogen atom of a surface exposed lysine of E or L is covalently attached to the sulfur atom of a surface exposed cysteine of E. In some embodiments of any of the aspects described herein, E is an Fc domain monomer. In some embodiments, n is 2 and each E dimerizes to form an Fc domain. In some embodiments, n is 2, each E is an Fc domain monomer, each E dimerizes to form an Fc domain, and the conjugate is described by formula (D-I-1):
wherein J is an Fc domain; and T is an integer from 1 to 20 (e.g., T is 1 ,2 ,3 ,4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments, n is 2, each E is an Fc domain monomer, each E dimerizes to form an Fc domain, and the conjugate is described by formula (M-I-1):
wherein J is an Fc domain; and T is an integer from 1 to 20 (e.g., T is 1 ,2 ,3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), or a pharmaceutically acceptable salt thereof. In some embodiments of any of the aspects described herein, each E independently includes an amino acid sequence having at least 90% (e.g., at least 95%, 96%, 97%, 98%, or 99%) sequence identity with the amino acid sequence of any one of SEQ ID Nos: 1-95 and 125-153. In some embodiments, each E includes the sequence of any one of SEQ ID NOs: 1-95 and 125-153. In some embodiments of any of the aspects described herein, E is an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide. In some embodiments, where E is an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide, n is 1. In some embodiments, n is 1, E is an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide and the conjugate is described by formula (D-I-2):
wherein E is an albumin protein, an albumin protein-binding peptide, or Fc-binding peptide; and T is an integer from 1 to 20, or a pharmaceutically acceptable salt thereof. In some embodiments, n is 1, E is an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide, and the conjugate is described by formula (M-I-2):
wherein E is an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide; and T is an integer from 1 to 20, or a pharmaceutically acceptable salt thereof. In some embodiments of any of the aspects described herein, E is an albumin protein having the sequence of any one of SEQ ID NOs: 96-98. In some embodiments of any of the aspects described herein, T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20. In another aspect, the disclosure provides a population of conjugates having the structure of any of the conjugates described herein (e.g., a population of conjugates having the formula of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)), wherein the average value of T is 1 to 20 (e.g., the average value of T is 1 to 2, 1 to 3, 1 to 4, 1 to 5, 5 to 10, 10 to 15, or 15 to 20). In some embodiments, the average value of T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20. In another aspect, the disclosure provides a pharmaceutical composition comprising any of the conjugates described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M- I)-(M-XVII)), or a pharmaceutically acceptable salt thereof, and a pharmaceutically acceptable excipient. In another aspect, the disclosure provides a method for the treatment of a subject having a viral infection or presumed to have a viral infection, the method comprising administering to the subject an effective amount of any of the conjugates or compositions described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)). In another aspect, the disclosure provides a method for the prophylactic treatment of a viral infection in a subject in need thereof, the method comprising administering to the subject an effective amount of any of the conjugates or compositions described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)). In some embodiments, the viral infection is caused by HIV. In some embodiments, the viral infection is HIV-1 or HIV-2. In some embodiments, the subject is immunocompromised. In some embodiments, the subject has been diagnosed with humoral immune deficiency, T cell deficiency, neutropenia, asplenia, or complement deficiency. In some embodiments, the subject is being treated or is about to be treated with an immunosuppressive therapy.
In some embodiments, the subject has been diagnosed with a disease which causes immunosuppression. In some embodiments, the disease is cancer or acquired immunodeficiency syndrome. In some embodiments, the cancer is leukemia, lymphoma, or multiple myeloma. In some embodiments, the subject has undergone or is about to undergo hematopoietic stem cell transplantation. In some embodiments, wherein the subject has undergone or is about to undergo an organ transplant. In some embodiments, the conjugate of composition is administered intramuscularly, intravenously, intradermally, intraarterially, intraperitoneally, intralesionally, intracranially, intraarticularly, intraprostatically, intrapleurally, intratracheally, intranasally, intravitreally, intravaginally, intrarectally, topically, intratumorally, peritoneally, subcutaneously, subconjunctival, intravesicularlly, mucosally, intrapericardially, intraumbilically, intraocularally, orally, locally, by inhalation, by injection, or by infusion. In some embodiments, the subject is treated with a second therapeutic agent. In some embodiments, the second therapeutic agent is an antiviral agent. In some embodiments, the second therapeutic agent is a viral vaccine. In some embodiments, the viral vaccine elicits an immune response in the subject against HIV (e.g., HIV-1 or HIV-2). In some embodiments, an Fc-domain-containing composition may be substituted for an Fc domain and an Fc-domain-monomer-containing composition may be substituted for an Fc domain monomer in any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII), when n is 1, E is an Fc- domain-monomer-containing composition. In any of the formulas described herein (e.g., any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)), when n is 2, E is an Fc-domain-containing composition. In certain embodiments, the Fc-domain-containing composition is an antibody or an antibody fragment. An antibody may include any form of immunoglobulin, heavy chain antibody, light chain antibody, LRR-based antibody, or other protein scaffold with antibody-like properties, as well as any other immunological binding moiety known in the art, including antibody fragments (e.g., a Fab, Fab', Fab’2, F(ab')2, Fd, Fv, Feb, scFv, or SMIP). The subunit structures and three-dimensional configurations of different classes of antibodies are known in the art. An antibody fragment may include a binding moiety that includes a portion derived from or having significant homology to an antibody, such as the antigen- determining region of an antibody. Exemplary antibody fragments include Fab, Fab', Fab’2, F(ab')2, Fd, Fv, Feb, scFv, and SMIP. In particular embodiments, the antibody or antibody fragment is a human, mouse, camelid (e.g., llama, alpaca, or camel), goat, sheep, rabbit, chicken, guinea pig, hamster, horse, or rat antibody or antibody fragment. In specific embodiments, the antibody is an IgG, IgA, IgD, IgE, IgM, or intrabody. In certain embodiments, the antibody fragment includes an scFv, sdAb, dAb, Fab, Fab', Fab'2, F(ab')2, Fd, Fv, Feb, or SMIP. In some embodiments, the Fc-domain-containing composition (e.g., an antibody or antibody fragment) confers binding specificity to a one or more targets (e.g., an antigen, such as an antigen associated with HIV). HIV-targeting antibodies are known in the art, for example, as described in Wibmer
et al. Curr. Opin. HIV AIDS, 10(3): 135-143 (2015), which is incorporated herein by reference in its entirety. In some embodiments, the one or more targets (e.g., an antigen) bound by the Fc-domain- containing composition (e.g., an antibody or antibody fragment) is a viral (e.g., HIV) protein such as gp41 or gp120 receptor. In some embodiments, the antibody or antibody fragment recognizes a viral surface antigen. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 1. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 1. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 2. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 2. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 3. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 3. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 4. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 4. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 5. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 5. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 6. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 6. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 7. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 7. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 8. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 8. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 9. In some embodiments, E includes an amino acid sequence that is at
least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 9. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 10. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 10. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 11. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 11. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 12. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 12. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 13. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 13. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 14. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 14. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 15. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 15. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 16. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 16. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 17. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 17. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 18. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 18. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 19. In some embodiments, E includes an amino acid sequence that is at
least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 19. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 20. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 20. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 21. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 21. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 22. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 22. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 23. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 23. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 24. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 24. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 25. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 25. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 26. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 26. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 27. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 27. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 28. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 28. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 29. In some embodiments, E includes an amino acid sequence that is at
least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 29. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 30. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 30. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 31. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 31. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 32. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 32. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 33. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 33. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 34. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 34. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 35. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 35. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 36. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 36. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 37. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 37. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 38. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 38. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 39. In some embodiments, E includes an amino acid sequence that is at
least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 39. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 40. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 40. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 41. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 41. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 42. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 42. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 43. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 43. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 44. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 45. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 46. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 46. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 47. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 47. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 48. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 48. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 49. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 49. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 50. In some embodiments, E includes an amino acid sequence that is at
least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 50. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 51. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 51. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 52. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 52. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 53. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 53. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 54. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 54. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 55. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 55. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 56. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 56. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 57. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 57. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 58. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 58. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 59. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 59. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 60. In some embodiments, E includes an amino acid sequence that is at
least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 60. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 61. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 61. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 62. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 62. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 63. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 63. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 64. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 64. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 65. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 65. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 66. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 66. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 67. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 67. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 68. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 68. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 69. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 69. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 70. In some embodiments, E includes an amino acid sequence that is at
least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 70. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 71. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 71. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 72. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 72. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 73. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 73. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 74. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 74. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 75. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 75. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 76. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 76. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 77. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 77. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 78. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 78. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 79. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 79. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 80. In some embodiments, E includes an amino acid sequence that is at
least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 80. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 81. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 81. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 82. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 82. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 83. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 83. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 84. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 84. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 85. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 85. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 86. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 86. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 87. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 87. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 88. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 88. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 89. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 89. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 90. In some embodiments, E includes an amino acid sequence that is at
least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 90. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 91. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 91. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 92. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 92. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 93. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 93. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 94. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 94. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 95. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 95. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 96. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 96. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 97. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 97. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 98. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 98. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 125. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 125. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 126. In some embodiments, E includes an amino acid sequence that is at
least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 126. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 127. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 127. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 128. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 128. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 129. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 129. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 130. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 130. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 131. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 131. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 132. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 132. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 133. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 133. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 134. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 134. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 135. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 135. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 136. In some embodiments, E includes an amino acid sequence that is at
least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 136. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 137. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 137. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 138. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 138. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 139. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 139. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 140. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 140. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 141. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 141. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 142. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 142. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 143. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 143. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 144. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 144. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 145. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 145. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 146. In some embodiments, E includes an amino acid sequence that is at
least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 146. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 147. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 147. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 148. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 148. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 149. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 149. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 150. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 150. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 151. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 151. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 152. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 152. In some embodiments of any of the aspects described herein, E (e.g., each E) includes the amino acid sequence of SEQ ID NO: 153. In some embodiments, E includes an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 153. In some embodiments of any of the aspects described herein, wherein E includes an Fc domain monomer, the Fc domain monomer (e.g., the Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153) includes a triple mutation corresponding to M252Y/S254T/T256E (YTE). As used herein, an amino acid “corresponding to” a particular amino acid residue (e.g., of a particular SEQ ID NO.) should be understood to include any amino acid residue that one of skill in the art would understand to align to the particular residue (e.g., of the particular sequence). For example, any one of SEQ ID NOs: 1-95 and 125-153 may be mutated to include a YTE mutation. In some embodiments of any of the aspects described herein, wherein E includes an Fc domain monomer, the Fc domain monomer (e.g., the Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153) includes a double mutant corresponding to M428L/N434S (LS). As used
herein, an amino acid “corresponding to” a particular amino acid residue (e.g., or a particular SEQ ID NO.) should be understood to include any amino acid residue that one of skill in the art would understand to align to the particular residue (e.g., of the particular sequence). For example, any one of SEQ ID NOs: 1-95 and 125-153 may be mutated to include a LS mutation. In some embodiments of any of the aspects described herein, wherein E includes an Fc domain monomer, the Fc domain monomer (e.g., the Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153) includes a mutant corresponding to N434H. As used herein, an amino acid “corresponding to” a particular amino acid residue (e.g., of a particular SEQ ID NO.) should be understood to include any amino acid residue that one of skill in the art would understand to align to the particular residue (e.g., of the particular sequence). For example, any one of SEQ ID NOs: 1-95 and 125- 153 may be mutated to include an N434H mutation. In some embodiments of any of the aspects described herein, wherein E includes an Fc domain monomer, the Fc domain monomer (e.g., the Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153) includes a mutant corresponding to C220S. As used herein, an amino acid “corresponding to” a particular amino acid residue (e.g., or a particular SEQ ID NO.) should be understood to include any amino acid residue that one of skill in the art would understand to align to the particular residue (e.g., of the particular sequence). For example, any one of SEQ ID NOs: 1-95 and 125- 153 may be mutated to include a C220S mutation. In some embodiments of any of the aspects described herein, the Fc domain monomer (e.g., the Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-138 or 168-197) includes a mutant corresponding to K246X, where X is not Lys (e.g., K246X is K246S, K246G, K246A, K246T, K246N, K246Q, K246R, K246H, K246E, or K246D). As used herein, an amino acid “corresponding to” a particular amino acid residue (e.g., or a particular SEQ ID NO.) should be understood to include any amino acid residue that one of skill in the art would understand to align to the particular residue (e.g., of the particular sequence). For example, any one of SEQ ID NOs: 1-95 or 125-153 may be mutated to at position 246 such that position 246 is not a Lys. In some embodiments, any one of SEQ ID NOs: 1-95 or 125-153 may be mutated to include a K246S mutation. In some embodiments of any of the aspects described herein, wherein E includes an Fc domain monomer, the Fc domain monomer (e.g., the Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 and 125-153) is a fragment of the Fc domain monomer (e.g., a fragment of at least 25 (e.g., 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 or more), at least 50 (e.g., 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75 or more), at least 75 (e.g., 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 or more) consecutive amino acids in length from SEQ ID NOs: 1-95 and 125-153. In some embodiments of any of the aspects described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)), one or more nitrogen atoms of one or more surface exposed lysine residues of E or one or more sulfur atoms of one or more surface exposed cysteines in E is covalently conjugated to a linker (e.g., a PEG2-PEG20 linker). The linker conjugated to E may be functionalized such that it may react to form a covalent bond with the L of any A1-L or any A2-L-A1
described herein. In preferred embodiments, E is conjugated to a linker functionalized with an azido group and the L of A1-L or any A2-L-A1 is functionalized with an alkyne group. Conjugation (e.g., by click chemistry) of the linker-azido of E and the linker-alkyne of A1-L or A2-L-A1 forms a conjugate of the invention, for example a conjugate described by any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)- (M-XVII). In yet other embodiments, E is conjugated to a linker functionalized with an alkyne group and L of an A1-L or of any A2-L-A1 is functionalized with an azido group. Conjugation (e.g., by click chemistry) of the linker-alkyne of E and linker-azido of A1-L or of any A2-L-A1 forms a conjugate of the invention, for example a conjugate described by any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII). In some embodiments of any of the aspects described herein, the squiggly line of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII) represents a covalent bond between the L of A1-L or A2-L-A1 or L’ of A1-L’ or A1-L’-A2. In some embodiments of any of the aspects described herein, the squiggly line of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII) represents that one or more amino acid side chains of E (e.g., one or more nitrogen atoms of one or more surface exposed lysine residues of E or one or more sulfur atoms of one or more surface exposed cysteines in E) have been conjugated to a linker (e.g., a PEG2-PEG20 linker) wherein the linker has been functionalized with a reactive moiety, such that the reactive moiety forms a covalent bond with the L of any A1-L or any A2-L-A1 described herein (e.g., by click chemistry between an azido functionalized linker and an alkyne functionalized linker, as described above). In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by (A-I):
In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
.
In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
. In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
. In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
.
In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
. In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
. In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
. In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
.
In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
. In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
. In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
. In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
.
In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
. In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
. In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
. In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
.
In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
. In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
. In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
. In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by:
.
In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by (A-II):
In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by (A-IIaa):
In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by (A-IIbb):
In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by (A-IIcc):
In some embodiments of any of the aspects described herein, A1 and/or A2 have the structure described by (A-IIdd):
In another aspect, the disclosure features a conjugate selected from any one of conjugates 1-76 (e.g., Conjugates 1-4, 5a, 5b, 6-8, 9a, 9b, 10, 11, 12a, 12b, 13a, 13b, 14a, 14b, 15-28, 29a, 29b, 30a, 30b, 31-36, 37a, 37b, and 38-76). In another aspect, the disclosure features a conjugate selected from any one of conjugates 1-48 (e.g., Conjugates 1-4, 5a, 5b, 6-8, 9a, 9b, 10, 11, 12a, 12b, 13a, 13b, 14a, 14b, 15-28, 29a, 29b, 30a, 30b, 31-36, 37a, 37b, and 38-48). In some embodiments, the conjugate is selected from any one of conjugates 5b, 9b, 12b, 13b, 14b, and 15-48 (e.g., Conjugates 15-28, 29a, 29b, and 30-48). In some embodiments, the conjugate is selected from any one of conjugates 5b, 29a, 29b, 30a, and 37a. In some embodiments, the conjugate is conjugate 1, or any regioisomer thereof, and the drug-to- antibody ratio (DAR) (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 2, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 3, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 4, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 5a, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 6, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
In some embodiments, the conjugate is conjugate 7, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 8, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 9a, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 10, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 11, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 12a, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 13a, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 14a, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 5b, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 9b, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 12b, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 13b, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 14b, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
In some embodiments, the conjugate is conjugate 29b, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 15, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 16, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 17, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 18, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 19, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 20, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 21, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 22, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 23, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 24, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 25, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 26, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
In some embodiments, the conjugate is conjugate 27, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 28, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 29a, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 29b, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 30a, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 31, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 32, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 33, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 34, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 35, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 36, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 37a, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 38, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
In some embodiments, the conjugate is conjugate 39, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 40, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 41, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 42, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 43, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 44, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 45, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 46, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 47, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 48, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 30b, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 37b, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 49, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
In some embodiments, the conjugate is conjugate 50, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 51, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 52, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 53, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 54, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 55, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 56, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 57, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 58, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 59, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 60, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 61, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 62, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
In some embodiments, the conjugate is conjugate 63, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 64, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 65, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 66, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 67, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 68, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 69, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 70, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 71, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 72, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 73, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 74, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 75, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
In some embodiments, the conjugate is conjugate 76, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 77, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 78, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 79, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 80, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 81, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 82, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 83, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 84, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 85, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 86, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 87, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 88, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
In some embodiments, the conjugate is conjugate 89, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 90, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 91, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 92, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 93, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 94, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 95, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 96, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 97, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 98, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 99, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 100, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 101, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
In some embodiments, the conjugate is conjugate 102, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 103, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 104, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 105, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 106, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 107, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 108, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 109, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 110, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 111, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 112, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 113, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 114, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0.
In some embodiments, the conjugate is conjugate 115, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 116, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 117, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 118, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 119, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 120, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, the conjugate is conjugate 121, or any regioisomer thereof, and the DAR (e.g., T) is between 0.5 and 10.0, e.g., between 0.5 and 2.0, between 2.0 and 4.0, between 4.0 and 6.0 between 6.0 and 8.0, or between 8.0 and 10.0. In some embodiments, a population of conjugates described herein has a DAR (e.g., T) of between 1 and 2, 2 and 4, 4 and 6, 6 and 8, 8 and 10, 1 and 10, 1 and 20, 1 and 5, 3 and 7, 5 and 10, or 10 and 20. In some embodiments, the DAR (e.g., T) is between 0.5 and 10.0, e.g., 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8.0, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10.0. In some embodiments, the Fc domain monomer includes less than about 300 amino acid residues (e.g., less than about 300, less than about 295, less than about 290, less than about 285, less than about 280, less than about 275, less than about 270, less than about 265, less than about 260, less than about 255, less than about 250, less than about 245, less than about 240, less than about 235, less than about 230, less than about 225, or less than about 220 amino acid residues). In some embodiments, the Fc domain monomer is less than about 40 kDa (e.g., less than about 35kDa, less than about 30kDa, less than about 25kDa). In some embodiments, the Fc domain monomer includes at least 200 amino acid residues (e.g., at least 210, at least 220, at least 230, at least 240, at least 250, at least 260, at least 270, at least 280, at least 290, or at least 300 amino residues). In some embodiments, the Fc domain monomer is at least 20 kDa (e.g., at least 25 kDa, at least 30 kDa, or at least 35 kDa).
In some embodiments, the Fc domain monomer includes 200 to 400 amino acid residues (e.g., 200 to 250, 250 to 300, 300 to 350, 350 to 400, 200 to 300, 250 to 350, or 300 to 400 amino acid residues). In some embodiments, the Fc domain monomer is 20 to 40 kDa (e.g., 20 to 25 kDa, 25 to 30 kDa, 35 to 40 kDa, 20 to 30 kDa, 25 to 35 kDa, or 30 to 40 KDa). In some embodiments, the Fc domain monomer includes an amino acid sequence at least 90% identical (e.g., at least 95%, at least 98%) to the sequence of any one of SEQ ID NOs: 1-95 and 125-153, or a region thereof. In some embodiments, the Fc domain monomer includes the amino acid sequence of any one of SEQ ID NOs: 1-95 and 125-153, or a region thereof. In some embodiments, the Fc domain monomer includes a region of any one of SEQ ID NOs: 1- 95 and 125-153, wherein the region includes positions 220, 252, 254, and 256. In some embodiments, the region includes at least 40 amino acid residues, at least 50 amino acid residues, at least 60 amino acid residues, at least 70 amino acids residues, at least 80 amino acids residues, at least 90 amino acid residues, at least 100 amino acid residues, at least 110 amino acid residues, at least 120 amino residues, at least 130 amino acid residues, at least 140 amino acid residues, at least 150 amino acid residues, at least 160 amino acid residues, at least 170 amino acid residues, at least 180 amino acid residues, at least 190 amino acid residues, or at least 200 amino acid residues. In an aspect, the disclosure features a method of synthesizing a conjugate of formula (D-I):
wherein each A1 and each A2 is independently selected from any one of formulas (A-I)-(A-VI) as described herein; n is 1 or 2; each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 or 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide; L is a linker covalently attached to E and to Y of each of A1 and A2; T is an integer from 1 to 20; and each squiggly line in formula (D-I) indicates that L is covalently attached (e.g., by way of a covalent bond or linker) to each E, the method including the steps of: (a) providing a first composition including E;
(b) providing a second composition including a compound of formula (DF-I) or salt thereof:
wherein L’ is the remainder of L; m is 0, 1, 2, 3, or 4; and each R is, independently, halo, cyano, nitro, optionally substituted C1-C6 alkyl group, or optionally substituted C1-C6 heteroalkyl group; and (c) combining the first composition, the second composition, and a buffer to form a mixture. In another aspect, the disclosure features a method of synthesizing a conjugate of formula (M-I):
wherein each A1 is selected from any one of formulas (A-I)-(A-VI) as described herein; n is 1 or 2; each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 or 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide; L is a linker covalently attached to E and to Y of A1; T is an integer from 1 to 20; and each squiggly line in formula (M-I) indicates that L is covalently attached (e.g., by way of a covalent bond or linker) to each E, the method including the steps of: (a) providing a first composition including E;
(b) providing a second composition including a compound of formula (MF-I) or salt thereof:
wherein L’ is the remainder of L; m is 0, 1, 2, 3, or 4; and each R is, independently, halo, cyano, nitro, optionally substituted C1-C6 alkyl group, or optionally substituted C1-C6 heteroalkyl group; and (c) combining the first composition, the second composition, and a buffer to form a mixture. In some embodiments, L’ includes G, wherein G is optionally substituted C1-C6 alkylene, optionally substituted C1-C6 heteroalkylene, optionally substituted C2-C6 alkenylene, optionally substituted C2-C6 heteroalkenylene, optionally substituted C2-C6 alkynylene, optionally substituted C2-C6 heteroalkynylene, optionally substituted C3-C10 cycloalkylene, optionally substituted C2-C10 heterocycloalkylene, optionally substituted C6-C10 arylene, or optionally substituted C2-C10 heteroarylene. In some embodiments, a compound of formula (MF-I) or salt thereof has the structure of any one of Int-1 to Int-140 (e.g., Int-56, Int-57, and Int-74). In some embodiments, a compound of formula (MF-I) or salt thereof includes the structure of any one of Int-1 to Int-140 (e.g., Int-56, Int-57, and Int-74). In some embodiments, a compound of formula (MF-I) or salt thereof is synthesized from the structure of any one of Int-1 to Int-140 (e.g., Int-56, Int-57, and Int-74).In some embodiments, a compound of formula (DF-I) or (MF-I), where each R is halo (e.g., F), provides technical advantages (e.g., increased stability) in methods of synthesizing protein-drug conjugates (e.g., the methods described herein). In some embodiments, the increased stability allows for purification by reverse phase chromatography. In some embodiments, the increased stability allows for lyophilization with minimal hydrolysis of the activated ester. In some embodiments, a compound of formula (DF-I) or (MF-I), where m is 3, provides technical advantages (e.g., increased stability) in methods of synthesizing protein-drug conjugates (e.g., the methods described herein). In some embodiments, the increased stability allows for purification by reverse phase chromatography. In some embodiments, the increased stability allows for lyophilization with minimal hydrolysis of the activated ester. In some embodiments, a compound of formula (DF-I) or (MF-I), where m is 3 and each R is halo (e.g., F), provides technical advantages (e.g., increased stability) in methods of synthesizing protein-drug conjugates (e.g., the methods described herein). In some embodiments, the increased stability allows for purification by reverse phase chromatography. In some embodiments, the increased stability allows for lyophilization with minimal hydrolysis of the activated ester.
In an aspect, the disclosure features a method of synthesizing a conjugate of formula (D-I):
wherein each A1 and each A2 is independently selected from any one of formulas (A-I)-(A-VI) as described herein; n is 1 or 2; each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 or 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide; L is a linker covalently attached to E and to Y of each of A1 and A2; T is an integer from 1 to 20; and each squiggly line in formula (D-I) indicates that L is covalently attached (e.g., by way of a covalent bond or linker) to each E, the method including the steps of: (a) providing a first composition including E; (b) providing a second composition including a compound of formula (DF-II) or salt thereof:
wherein G is optionally substituted C1-C6 alkylene, optionally substituted C1-C6 heteroalkylene, optionally substituted C2-C6 alkenylene, optionally substituted C2-C6 heteroalkenylene, optionally substituted C2-C6 alkynylene, optionally substituted C2-C6 heteroalkynylene, optionally substituted C3-C10 cycloalkylene, optionally substituted C2-C10 heterocycloalkylene, optionally substituted C6-C10 arylene, or optionally substituted C2-C10 heteroarylene; L’-G-L’’ is the remainder of L; m is 0, 1, 2, 3, or 4; and each R is, independently, halo, cyano, nitro, optionally substituted C1-C6 alkyl group, or optionally substituted C1-C6 heteroalkyl group; and (c) combining the first composition, the second composition, and a buffer to form a mixture. In some embodiments, G is optionally substituted C1-C6 heteroalkylene or optionally substituted C2-C10 heteroarylene. In some embodiments, G is optionally substituted C1-C6 heteroalkylene.
In some embodiments, G is
, where Ra is H, optionally substituted C1- C20 alkylene (e.g., optionally substituted C1-C6 alkylene), or optionally substituted C1-C20 heteroalkylene (e.g., optionally substituted C1-C6 heteroalkylene). In some embodiments, G is optionally substituted C2-C10 heteroarylene. In some embodiments, G is optionally substituted C2-C5 heteroarylene. In some embodiments, G is a 5-membered or 6- membered optionally substituted C2-C5 heteroarylene. In some embodiments, G is a triazolylene. In some embodiments, the conjugate of formula (D-I) has the structure of:
, and the method includes the steps of: (a) providing a first composition including E; (b) providing a second composition including a compound of formula (DF-II-A) or salt thereof:
and (c) combining the first composition, the second composition, and a buffer to form a mixture. In some embodiments, the synthesis of compound of formula (DF-II-A) includes: (d) providing a third composition including formula (D-G1-A) or salt thereof:
(e) providing a fourth composition including formula (D-G1-B) or salt thereof:
and (f) combining the third composition and the fourth composition to form a mixture. In some embodiments, the conjugate of formula (D-I) has the structure of:
, and the method includes the steps of: (a) providing a first composition including E; (b) providing a second composition including a compound of formula (DF-II-B) or salt thereof:
and (c) combining the first composition, the second composition, and a buffer to form a mixture. In some embodiments, the synthesis of compound of formula (DF-II-B) includes: (d) providing a third composition including formula (D-G2-A) or salt thereof:
(e) providing a fourth composition including formula (D-G2-B) or salt thereof:
and (f) combining the third composition and the fourth composition to form a mixture. In some embodiments, step (f) includes the use of a Cu(I) source. In another aspect, the disclosure features a method of synthesizing a conjugate of formula (M-I):
wherein each A1 is selected from any one of formulas (A-I)-(A-VI) as described herein; n is 1 or 2; each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 or 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide; L is a linker covalently attached to E and to Y of A1; T is an integer from 1 to 20; and each squiggly line in formula (M-I) indicates that L is covalently attached (e.g., by way of a covalent bond or linker) to each E, the method including the steps of: (a) providing a first composition including E; (b) providing a second composition including a compound of formula (MF-II) or salt thereof:
wherein G is optionally substituted C1-C6 alkylene, optionally substituted C1-C6 heteroalkylene, optionally substituted C2-C6 alkenylene, optionally substituted C2-C6 heteroalkenylene, optionally substituted C2-C6 alkynylene, optionally substituted C2-C6 heteroalkynylene, optionally substituted C3-C10 cycloalkylene, optionally substituted C2-C10 heterocycloalkylene, optionally substituted C6-C10 arylene, or optionally substituted C2-C10 heteroarylene; L’-G-L’’ is the remainder of L; m is 0, 1, 2, 3, or 4; and each R is, independently, halo, cyano, nitro, optionally substituted C1-C6 alkyl group, or optionally substituted C1-C6 heteroalkyl group;
and (c) combining the first composition, the second composition, and a buffer to form a mixture. In some embodiments, G is optionally substituted C1-C6 heteroalkylene or optionally substituted C2-C10 heteroarylene. In some embodiments, G is optionally substituted C1-C6 heteroalkylene. In some embodiments, G is
, where Ra is H, optionally substituted C1- C20 alkylene (e.g., optionally substituted C1-C6 alkylene), or optionally substituted C1-C20 heteroalkylene (e.g., optionally substituted C1-C6 heteroalkylene). In some embodiments, G is optionally substituted C2-C10 heteroarylene. In some embodiments, G is optionally substituted C2-C5 heteroarylene. In some embodiments, G is a 5-membered or 6- membered optionally substituted C2-C5 heteroarylene. In some embodiments, G is a triazolylene. In some embodiments, the conjugate of formula (M-I) has the structure of:
, and the method including the steps of: (a) providing a first composition including E; (b) providing a second composition including a compound of formula (MF-II-A) or salt thereof:
and (c) combining the first composition, the second composition, and a buffer to form a mixture. In some embodiments, the synthesis of compound of formula (MF-II-A) includes:
(d) providing a third composition including formula (M-G1-A) or salt thereof:
( ) (e) providing a fourth composition including formula (M-G1-B) or salt thereof:
and (f) combining the third composition and the fourth composition to form a mixture. In some embodiments, the conjugate of formula (M-I) has the structure of:
, and the method includes the steps of: (a) providing a first composition including E; (b) providing a second composition including a compound of formula (MF-II-B) or salt thereof:
and (c) combining the first composition, the second composition, and a buffer to form a mixture. In some embodiments, the synthesis of compound of formula (MF-II-B) includes:
(d) providing a third composition including formula (M-G2-A) or salt thereof:
(e) providing a fourth composition including formula (M-G2-B) or salt thereof:
and (f) combining the third composition and the fourth composition to form a mixture. In some embodiments, step (f) includes the use of a Cu(I) source. In another aspect, the disclosure features a method of synthesizing a conjugate of formula (D-I): ,
wherein each A1 and each A2 is independently selected from any one of formulas (A-I)-(A-VI) as described herein; n is 1 or 2; each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 or 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptidee; L is a linker covalently attached to E and to Y of each of A1 and A2; T is an integer from 1 to 20; and each squiggly line in formula (D-I) indicates that L is covalently attached (e.g., by way of a covalent bond or linker) to each E, the method including the steps of: (a) providing a first composition including formula (D-G3-A) or a salt thereof:
where Ga is a functional group that reacts with Gb to form G;
(b) providing a second composition including formula (D-G3-B) or a salt thereof:
where Gb is a functional group that reacts with Ga to form G; and (c) combining the first composition and the second composition to form a first mixture, where m is 0, 1, 2, 3, or 4; and each R is, independently, halo, cyano, nitro, optionally substituted C1-C6 alkyl group, or optionally substituted C1-C6 heteroalkyl group. In some embodiments, step (c) includes the use of a Cu(I) source. In some embodiments, the method further includes: (d) providing a third composition including E; and (e) combining the third composition, the first mixture, and a buffer to form a second mixture. In another aspect, the disclosure features a method of synthesizing a conjugate of formula (M-I):
wherein each A1 is selected from any one of formulas (A-I)-(A-VI) as described herein; n is 1 or 2; each E includes an Fc domain monomer (e.g., an Fc domain monomer having the sequence of any one of SEQ ID NOs: 1-95 or 125-153), an albumin protein (e.g., an albumin protein having the sequence of any one of SEQ ID NOs: 96-98), an albumin protein-binding peptide, or an Fc-binding peptide; L is a linker covalently attached to E and to Y of A1; T is an integer from 1 to 20; T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), where when T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1-L may be independently selected (e.g., independently selected from any of the A1-L structures described herein); and each squiggly line in formula (M-I) indicates that L is covalently attached (e.g., by way of a covalent bond or linker) to each E, the method including the steps of:
(a) providing a first composition including formula (M-G3-A) or a salt thereof:
where Ga is a functional group that reacts with Gb to form G; (b) providing a second composition including formula (M-G3-B) or a salt thereof:
where Gb is a functional group that reacts with Ga to form G; and (c) combining the first composition and the second composition to form a first mixture, where m is 0, 1, 2, 3, or 4; and each R is, independently, halo, cyano, nitro, optionally substituted C1-C6 alkyl group, or optionally substituted C1-C6 heteroalkyl group. In some embodiments, step (c) includes the use of a Cu(I) source. In some embodiments, the method further includes: (d) providing a third composition including E; and (e) combining the third composition, the first mixture, and a buffer to form a second mixture. In some embodiments, Ga includes optionally substituted amino. In some embodiments, Gb includes a carbonyl. In some embodiments, Ga includes a carbonyl. In some embodiments, Gb includes optionally substituted amino. In some embodiments, Ga includes an azido group. In some embodiments, Gb includes an alknyl group. In some embodiments, Ga includes an alkynyl group. In some embodiments, Gb includes an azido group. In some embodiments of any of the aspects described herein, a compound of formula (MF-II) or salt thereof has the structure of any one of Int-1 to Int-140 (e.g., Int-56, Int-57, and Int-74) In some embodiments of any of the aspects described herein, a compound of formula (MF-II) or salt thereof includes the structure of any one of Int-1 to Int-140 (e.g., Int-56, Int-57, and Int-74). In some embodiments of any of the aspects described herein, a compound of formula (MF-II) or salt thereof is synthesized from the structure of any one of Int-1 to Int-140 (e.g., Int 110, Int-133, or Int- 148). In some embodiments, a compound of formula (DF-II) (e.g., a compound of formula (DF-II-A) or (DF-II-B) and/or a compound of formula (D-G1-A) or (D-G2-A), where each R is halo (e.g., F), provides technical advantages (e.g., increased stability) in methods of synthesizing protein-drug conjugates (e.g., the methods described herein). In some embodiments, the increased stability allows for purification by reverse phase chromatography. In some embodiments, the increased stability allows for lyophilization with minimal hydrolysis of the activated ester.
In some embodiments, a compound of formula (DF-II) (e.g., a compound of formula (DF-II-A) or (DF-II-B) and/or a compound of formula (D-G1-A) or (D-G2-A), where m is 3, provides technical advantages (e.g., increased stability) in methods of synthesizing protein-drug conjugates (e.g., the methods described herein). In some embodiments, the increased stability allows for purification by reverse phase chromatography. In some embodiments, the increased stability allows for lyophilization with minimal hydrolysis of the activated ester. In some embodiments, a compound of formula (DF-II) (e.g., a compound of formula (DF-II-A) or (DF-II-B) and/or a compound of formula (D-G1-A) or (D-G2-A), where m is 3 and each R is halo (e.g., F), provides technical advantages (e.g., increased stability) in methods of synthesizing protein-drug conjugates (e.g., the methods described herein). In some embodiments, the increased stability allows for purification by reverse phase chromatography. In some embodiments, the increased stability allows for lyophilization with minimal hydrolysis of the activated ester. In some embodiments, a compound of formula (MF-II) (e.g., a compound of formula (MF-II-A) or (MF-II-B) and/or a compound of formula (M-G1-A) or (M-G2-A), where each R is halo (e.g., F), provides technical advantages (e.g., increased stability) in methods of synthesizing protein-drug conjugates (e.g., the methods described herein). In some embodiments, the increased stability allows for purification by reverse phase chromatography. In some embodiments, the increased stability allows for lyophilization with minimal hydrolysis of the activated ester. In some embodiments, a compound of formula (MF-II) (e.g., a compound of formula (MF-II-A) or (MF-II-B) and/or a compound of formula (M-G1-A) or (M-G2-A), where m is 3, provides technical advantages (e.g., increased stability) in methods of synthesizing protein-drug conjugates (e.g., the methods described herein). In some embodiments, the increased stability allows for purification by reverse phase chromatography. In some embodiments, the increased stability allows for lyophilization with minimal hydrolysis of the activated ester. In some embodiments, a compound of formula (MF-II) (e.g., a compound of formula (MF-II-A) or (MF-II-B) and/or a compound of formula (M-G1-A) or (M-G2-A), where m is 3 and each R is halo (e.g., F), provides technical advantages (e.g., increased stability) in methods of synthesizing protein-drug conjugates (e.g., the methods described herein). In some embodiments, the increased stability allows for purification by reverse phase chromatography. In some embodiments, the increased stability allows for lyophilization with minimal hydrolysis of the activated ester. In some embodiments, the first composition including E is an Fc domain (e.g., n is 2, each E is an Fc domain monomer, and the Fc domain monomers dimerize to form an Fc domain). In some embodiments of any of the aspects described herein, E includes at least one lysine residue. In some embodiments, the squiggly line in formula (D-I) or (M-I) is covalently bound to a lysine residue of each E. In some embodiments of any of the aspects described herein, E includes at least one cysteine residue. In some embodiments, the squiggly line in formula (D-I) or (M-I) is covalently bound to a cysteine residue of each E.
In some embodiments of any of the aspects described herein, each R is, independently, halo, cyano, nitro, haloalkyl, or
, where Rz is optionally substituted C1-C5 alkyl group or optionally substituted C1-C5 heteroalkyl group. In some embodiments, each R is, independently, halo, cyano, nitro, or haloalkyl. In some embodiments, each R is, independently, F, Cl, Br, or I. In some embodiments, each R is F. In some embodiments, m is 3 or 4. In some embodiments, m is 3. In some embodiments, m is 4. In some embodiments,
In some embodiments, In some embodiments,
In some embodiments,
In some embodiments,
In some embodiments,
In some embodiments of any of the aspects described herein, the buffer includes borate or carbonate. In some embodiments, the buffer includes borate. In some embodiments, the buffer includes carbonate. In some embodiments, the buffer has a pH of about 7.0 to 10.0 (e.g., about 7.0 to 7.5, 7.5 to 8.0, 8.0 to 8.5, 8.5 to 9.0, 9.0 to 9.5, 9.5 to 10.0, 7.0 to 8.0, 7.5 to 8.5, 8.0 to 9.0, 8.5 to 9.5, 9.0 to 10.0, 7.0 to 9.0, 7.5 to 9.5, or 8.0 to 10.0). In some embodiments, the buffer has a pH of about 7.0. In some embodiments, the buffer has a pH of about 7.1. In some embodiments, the buffer has a pH of about 7.2. In some embodiments, the buffer has a pH of about 7.3. In some embodiments, the buffer has a pH of about 7.4. In some embodiments, the buffer has a pH of about 7.5. In some embodiments, the buffer has a pH of about 7.6. In some embodiments, the buffer has a pH of about 7.7. In some embodiments, the buffer has a pH of about 7.8. In some embodiments, the buffer has a pH of about 7.9. In some embodiments, the buffer has a pH of about 8.0. In some embodiments, the buffer has a pH of about 8.1. In some embodiments, the buffer has a pH of about 8.2. In some embodiments, the buffer has a pH of about 8.3. In some embodiments, the buffer has a pH of about 8.4. In some embodiments, the buffer has a pH of about 8.5. In some embodiments, the buffer has a pH of about 8.6. In some embodiments, the buffer has a pH of about 8.7. In some embodiments, the buffer has a pH of about 8.8. In some embodiments, the buffer has a pH of about 8.9. In some embodiments, the buffer has a pH of about 9.0. In some embodiments, the buffer has a pH of about 9.5. In some embodiments, the buffer has a pH of about 9.6. In some embodiments, the buffer has a pH of about 9.7. In some embodiments, the buffer has a pH of about 9.8. In some embodiments, the buffer has a pH of about 9.9. In some embodiments, the buffer has a pH of about 10.0. In some embodiments of any of the aspects described herein, step (c) or step (e) is conducted at a temperature of 5 to 50 °C, such as 20 to 30 °C (e.g., 20 to 25, 21 to 26, 22 to 27, 23 to 28, 24 to 29, or 25 to 30 °C). In some embodiments, step (c) or step (e) is conducted at a temperature of about 25 °C. In some embodiments, step (c) or step (e) is conducted for about 1 to 24 hours, such as 1 to 12 hours (e.g., 1 to 2, 1 to 5, 2 to 3, 2 to 5, 2 to 10, 2 to 12, 3 to 4, 4 to 5, 1 to 3, 2 to 4, or 3 to 5 hours). In some embodiments, step (c) or step (e) is conducted for about 2 hours. In some embodiments, step (c) or step (e) is conducted for about 3 hours. In some embodiments, step (c) or step (e) is conducted for about 4 hours. In some embodiments, step (c) or step (e) is conducted for about 5 hours. In some embodiments, step (c) or step (e) is conducted for about 6 hours. In some embodiments, step (c) or step (e) is conducted for about 7 hours. In some embodiments, step (c) or step (e) is conducted for about 8 hours. In some embodiments, step (c) or step (e) is conducted for about 9 hours. In some embodiments, step (c) or step (e) is conducted for about 10 hours. In some embodiments, step
(c) or step (e) is conducted for about 11 hours. In some embodiments, step (c) or step (e) is conducted for about 12 hours. In some embodiments, the first composition or third composition includes phosphate-buffered saline buffer. In some embodiments, the buffer has a pH of about 7.0 to 8.0 (e.g., about 7.0 to 7.5, 7.5 to 8.0, 7.0 to 7.2, 7.2 to 7.4, 7.4 to 7.6, 7.6 to 7.8, or 7.8 to 8.0). In some embodiments, the buffer has a pH of about 7.5. In some embodiments, the second composition or the first mixture includes DMF. In some embodiments, the method further includes a purification step. In some embodiments, the purification step includes dialysis in arginine buffer. In some embodiments, the purification step includes a buffer exchange. In some embodiments, T is an integer from 1 to 20 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20). In some embodiments, the average value of T is 1 to 20 (e.g., the average value of T is 1 to 2, 1 to 3, 1 to 4, 1 to 5, 5 to 10, 10 to 15, or 15 to 20). In some embodiments, the average value of T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20. In certain embodiments, the average T is 1 to 10 (e.g., 1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5, 8, 8.5, 9, 9.5, or 10). In certain embodiments, the average T is 1 to 5 (e.g., 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, or 5). In some embodiment, the average T is 5 to 10 (e.g., 5, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10). In some embodiments, the average T is 2.5 to 7.5 (e.g., 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7, 7.1, 7.2, 7.3, 7.4, or 7.5). Definitions To facilitate the understanding of this invention, a number of terms are defined below. Terms defined herein have meanings as commonly understood by a person of ordinary skill in the areas relevant to the present invention. Terms such as "a", "an," and "the" are not intended to refer to only a singular entity, but include the general class of which a specific example may be used for illustration. The terminology herein is used to describe specific embodiments of the invention, but their usage does not delimit the invention, except as outlined in the claims. By “viral infection” is meant the pathogenic growth of a virus (e.g., the human immunodeficiency virus) in a host organism (e.g., a human subject). A viral infection can be any situation in which the presence of a viral population(s) is damaging to a host body. Thus, a subject is “suffering” from a viral infection when an excessive amount of a viral population is present in or on the subject’s body, or when the presence of a viral population(s) is damaging the cells or other tissue of the subject. As used herein, the term “Fc domain monomer” refers to a polypeptide chain that includes at least a hinge domain and second and third antibody constant domains (CH2 and CH3) or functional fragments thereof (e.g., fragments that that capable of (i) dimerizing with another Fc domain monomer to
form an Fc domain, and (ii) binding to an Fc receptor. The Fc domain monomer can be any immunoglobulin antibody isotype, including IgG, IgE, IgM, IgA, or IgD (e.g., IgG). Additionally, the Fc domain monomer can be an IgG subtype (e.g., IgG1, IgG2a, IgG2b, IgG3, or IgG4) (e.g., IgG1). An Fc domain monomer does not include any portion of an immunoglobulin that is capable of acting as an antigen-recognition region, e.g., a variable domain or a complementarity determining region (CDR). Fc domain monomers in the conjugates as described herein can contain one or more changes from a wild- type Fc domain monomer sequence (e.g., 1-10, 1-8, 1-6, 1-4 amino acid substitutions, additions, or deletions) that alter the interaction between an Fc domain and an Fc receptor. Examples of suitable changes are known in the art. In certain embodiments, a human Fc domain monomer (e.g., an IgG heavy chain, such as IgG1) includes a region that extends from any of Asn208, Glu216, Asp221, Lys222, or Cys226 to the carboxyl-terminus of the heavy chain at Lys447. C-terminal Lys447 of the Fc region may or may not be present, without affecting the structure or stability of the Fc region. Unless otherwise specified herein, numbering of amino acid residues in the IgG or Fc domain monomer is according to the EU numbering system for antibodies, also called the Kabat EU index, as described, for example, in Kabat et al., Sequences of Proteins of Immunological Interest, 5th Ed. Public Health Service, National Institutes of Health, Bethesda, MD, 1991. As used herein, the term “Fc domain” refers to a dimer of two Fc domain monomers that is capable of binding an Fc receptor. In the wild-type Fc domain, the two Fc domain monomers dimerize by the interaction between the two CH3 antibody constant domains, in some embodiments, one or more disulfide bonds form between the hinge domains of the two dimerizing Fc domain monomers. The term “covalently attached” refers to two parts of a conjugate that are linked to each other by a covalent bond formed between two atoms in the two parts of the conjugate. As used herein, the term “Fc-binding peptide” refers to refers to a polypeptide having an amino acid sequence of 5 to 50 (e.g., 5 to 40, 5 to 30, 5 to 20, 5 to 15, 5 to 10, 10 to 50, 10 to 30, or 10 to 20) amino acid residues that has affinity for and functions to bind an Fc domain, such as any of the Fc domain described herein. An Fc-binding peptide can be of different origins, e.g., synthetic, human, mouse, or rat. Fc-binding peptides of the invention include Fc-binding peptides which have been engineered to include one or more (e.g., two, three, four, or five) solvent-exposed cysteine or lysine residues, which may provide a site for conjugation to a compound of the invention (e.g., conjugation to a gp120 binder monomer or dimer, including by way of a linker). Most preferably, the Fc-binding peptide will contain a single solvent-exposed cysteine or lysine, thus enabling site-specific conjugation of a compound of the invention. Fc-binding peptides may include only naturally occurring amino acid residues, or may include one or more non-naturally occurring amino acid residues. Where included, a non-naturally occurring amino acid residue (e.g., the side chain of a non-naturally occurring amino acid residue) may be used as the point of attachment for a compound of the invention (e.g., a gp120 binder monomer or dimer, including by way of a linker). Fc-binding peptides of the invention may be linear or cyclic. Fc-binding peptides of the invention include any Fc-binding peptides known to one of skill in the art. As used here, the term “albumin protein” refers to a polypeptide comprising an amino acid sequence corresponding to a naturally-occurring albumin protein (e.g., human serum albumin) or a
variant thereof, such as an engineered variant of a naturally-occurring albumin protein. Variants of albumin proteins include polymorphisms, fragments such as domains and sub-domains, and fusion proteins (e.g., an albumin protein having a C-terminal or N-terminal fusion, such as a polypeptide linker). Preferably the albumin protein has the amino acid sequence of human serum albumin (HSA) or a variant or fragment thereof, most preferably a functional variant or fragment thereof. Albumin proteins of the invention include proteins having at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to any one of SEQ ID NOs: 96-98. Albumin proteins of the invention include albumin proteins which have been engineered to include one or more (e.g., two, three, four, or five) solvent-exposed cysteine or lysine residues, which may provide a site for conjugation to a compound of the invention (e.g., conjugation to a gp120 binder monomer or dimer, including by way of a linker). Most preferably, the albumin protein will contain a single solvent-exposed cysteine or lysine, thus enabling site- specific conjugation of a compound of the invention. Albumin proteins may include only naturally occurring amino acid residues, or may include one or more non-naturally occurring amino acid residues. Where included, a non-naturally occurring amino acid residue (e.g., the side chain of a non-naturally occurring amino acid residue) may be used as the point of attachment for a compound of the invention (e.g., a gp120 binder monomer or dimer, including by way of a linker). As used herein, the term “albumin protein-binding peptide” refers to a polypeptide having an amino acid sequence of 5 to 50 (e.g., 5 to 40, 5 to 30, 5 to 20, 5 to 15, 5 to 10, 10 to 50, 10 to 30, or 10 to 20) amino acid residues that has affinity for and functions to bind an albumin protein, such as any of the albumin proteins described herein. Preferably, the albumin protein-binding peptide binds to a naturally- occurring serum albumin, most preferably human serum albumin. An albumin protein-binding peptide can be of different origins, e.g., synthetic, human, mouse, or rat. Albumin protein-binding peptides of the invention include albumin protein-binding peptides which have been engineered to include one or more (e.g., two, three, four, or five) solvent-exposed cysteine or lysine residues, which may provide a site for conjugation to a compound of the invention (e.g., conjugation to a gp120 binder monomer or dimer, including by way of a linker). Most preferably, the albumin protein-binding peptide will contain a single solvent-exposed cysteine or lysine, thus enabling site-specific conjugation of a compound of the invention. Albumin protein-binding peptides may include only naturally occurring amino acid residues, or may include one or more non-naturally occurring amino acid residues. Where included, a non-naturally occurring amino acid residue (e.g., the side chain of a non-naturally occurring amino acid residue) may be used as the point of attachment for a compound of the invention (e.g., a gp120 binder monomer or dimer, including by way of a linker). Albumin protein-binding peptides of the invention may be linear or cyclic. Albumin protein-binding peptide of the invention include any albumin protein-binding peptides known to one of skill in the art, examples of which, are provided herein. Further exemplary albumin protein-binding peptides are provided in U.S. Patent Application No.2005/0287153, which is incorporated herein by reference in its entirety. As used-herein, a “surface exposed amino acid” or “solvent-exposed amino acid,” such as a surface exposed cysteine or a surface exposed lysine refers to an amino acid that is accessible to the solvent surrounding the protein. A surface exposed amino acid may be a naturally-occurring or an engineered variant (e.g., a substitution or insertion) of the protein. In some embodiments, a surface
exposed amino acid is an amino acid that when substituted does not substantially change the three- dimensional structure of the protein. The terms “linker,” “L,” and “L’ ,” as used herein, refer to a covalent linkage or connection between two or more components in a conjugate (e.g., between two gp120 binders in a conjugate described herein, between a gp120 binder and an Fc domain or albumin protein in a conjugate described herein, and between a dimer of two gp120 binders and an Fc domain or an albumin protein in a conjugate described herein). In some embodiments, a conjugate described herein may contain a linker that has a trivalent structure (e.g., a trivalent linker). A trivalent linker has three arms, in which each arm is covalently linked to a component of the conjugate (e.g., a first arm conjugated to a first gp120 binder, a second arm conjugated to a second gp120 binder, and a third arm conjugated to an Fc domain or an albumin protein). Molecules that may be used as linkers include at least two functional groups, which may be the same or different, e.g., two carboxylic acid groups, two amine groups, two sulfonic acid groups, a carboxylic acid group and a maleimide group, a carboxylic acid group and an alkyne group, a carboxylic acid group and an amine group, a carboxylic acid group and a sulfonic acid group, an amine group and a maleimide group, an amine group and an alkyne group, or an amine group and a sulfonic acid group. The first functional group may form a covalent linkage with a first component in the conjugate and the second functional group may form a covalent linkage with the second component in the conjugate. In some embodiments of a trivalent linker, two arms of a linker may contain two dicarboxylic acids, in which the first carboxylic acid may form a covalent linkage with the first gp120 binder in the conjugate and the second carboxylic acid may form a covalent linkage with the second gp120 binder in the conjugate, and the third arm of the linker may for a covalent linkage with an Fc domain or albumin protein in the conjugate. Examples of dicarboxylic acids are described further herein. In some embodiments, a molecule containing one or more maleimide groups may be used as a linker, in which the maleimide group may form a carbon-sulfur linkage with a cysteine in a component (e.g., an Fc domain monomer, an Fc domain, or an albumin protein) in the conjugate. In some embodiments, a molecule containing one or more alkyne groups may be used as a linker, in which the alkyne group may form a 1,2,3-triazole linkage with an azide in a component (e.g., an Fc domain monomer, an Fc domain, or an albumin protein) in the conjugate. In some embodiments, a molecule containing one or more azide groups may be used as a linker, in which the azide group may form a 1,2,3-triazole linkage with an alkyne in a component (e.g., an Fc domain monomer, an Fc domain, or an albumin protein) in the conjugate. In some embodiments, a molecule containing one or more bis-sulfone groups may be used as a linker, in which the bis-sulfone group may form a linkage with an amine group a component (e.g., an Fc domain monomer, an Fc domain, or an albumin protein) in the conjugate. In some embodiments, a molecule containing one or more sulfonic acid groups may be used as a linker, in which the sulfonic acid group may form a sulfonamide linkage with a component in the conjugate. In some embodiments, a molecule containing one or more isocyanate groups may be used as a linker, in which the isocyanate group may form a urea linkage with a component in the conjugate. In some embodiments, a molecule containing one or more haloalkyl groups may be used as a linker, in which the haloalkyl group may form a covalent linkage, e.g., C-N and C-O linkages, with a component in the conjugate. In some embodiments, a molecule containing
one or more phenyl ester groups (e.g., triflurophenyl ester groups or tetrafluorophenyl ester groups) may be used as a linker, in which the phenyl ester group (e.g., triflurophenyl ester group or tetrafluorophenyl ester group) may form an amide with an amine in a component (e.g., a fusion protein) in the conjugate. In some embodiments, a linker provides space, rigidity, and/or flexibility between the two or more components. In some embodiments, a linker may be a bond, e.g., a covalent bond. The term “bond” refers to a chemical bond, e.g., an amide bond, a disulfide bond, a C-O bond, a C-N bond, a N-N bond, a C-S bond, or any kind of bond created from a chemical reaction, e.g., chemical conjugation. In some embodiments, a linker includes no more than 250 atoms. In some embodiments, a linker includes no more than 250 non-hydrogen atoms. In some embodiments, the backbone of a linker includes no more than 250 atoms. The “backbone” of a linker refers to the atoms in the linker that together form the shortest path from one part of a conjugate to another part of the conjugate (e.g., the shortest path linking a first gp120 binder and a second gp120 binder). The atoms in the backbone of the linker are directly involved in linking one part of a conjugate to another part of the conjugate (e.g., linking a first gp120 binder and a second gp120 binder). For examples, hydrogen atoms attached to carbons in the backbone of the linker are not considered as directly involved in linking one part of the conjugate to another part of the conjugate. In some embodiments, a linker may comprise a synthetic group derived from, e.g., a synthetic polymer (e.g., a polyethylene glycol (PEG) polymer). In some embodiments, a linker may comprise one or more amino acid residues, such as D- or L-amino acid residues. In some embodiments, a linker may be a residue of an amino acid sequence (e.g., a 1-25 amino acid, 1-10 amino acid, 1-9 amino acid, 1-8 amino acid, 1-7 amino acid, 1-6 amino acid, 1-5 amino acid, 1-4 amino acid, 1-3 amino acid, 1-2 amino acid, or 1 amino acid sequence). In some embodiments, a linker may comprise one or more, e.g., 1-100, 1-50, 1-25, 1-10, 1-5, or 1-3, optionally substituted alkylene, optionally substituted heteroalkylene (e.g., a PEG unit), optionally substituted alkenylene, optionally substituted heteroalkenylene, optionally substituted alkynylene, optionally substituted heteroalkynylene, optionally substituted cycloalkylene, optionally substituted heterocycloalkylene, optionally substituted cycloalkenylene, optionally substituted heterocycloalkenylene, optionally substituted cycloalkynylene, optionally substituted heterocycloalkynylene, optionally substituted arylene, optionally substituted heteroarylene (e.g., pyridine), O, S, NRi, , (each Ri is, independently, H, optionally substituted alkyl, optionally substituted heteroalkyl, optionally substituted alkenyl, optionally substituted heteroalkenyl, optionally substituted alkynyl, optionally substituted heteroalkynyl, optionally substituted cycloalkyl, optionally substituted heterocycloalkyl, optionally substituted cycloalkenyl, optionally substituted heterocycloalkenyl, optionally substituted cycloalkynyl, optionally substituted heterocycloalkynyl, optionally substituted aryl, or optionally substituted heteroaryl), P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino. For example, a linker may comprise one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene (e.g., a PEG unit), optionally substituted C2-C20 alkenylene (e.g., C2 alkenylene), optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene (e.g., cyclopropylene, cyclobutylene), optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20
cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene (e.g., C6 arylene), optionally substituted C3-C15 heteroarylene (e.g., imidazole, pyridine), O, S, NRi, , (each Ri is, independently, H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl), P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino. The terms “alkyl,” “alkenyl,” and “alkynyl,” as used herein, include straight-chain and branched- chain monovalent substituents, as well as combinations of these, containing only C and H when unsubstituted. When the alkyl group includes at least one carbon-carbon double bond or carbon-carbon triple bond, the alkyl group can be referred to as an “alkenyl” or “alkynyl” group respectively. The monovalency of an alkyl, alkenyl, or alkynyl group does not include the optional substituents on the alkyl, alkenyl, or alkynyl group. For example, if an alkyl, alkenyl, or alkynyl group is attached to a compound, monovalency of the alkyl, alkenyl, or alkynyl group refers to its attachment to the compound and does not include any additional substituents that may be present on the alkyl, alkenyl, or alkynyl group. In some embodiments, the alkyl or heteroalkyl group may contain, e.g., 1-20.1-18, 1-16, 1-14, 1-12, 1-10, 1-8, 1- 6, 1-4, or 1-2 carbon atoms (e.g., C1-C20, C1-C18, C1-C16, C1-C14, C1-C12, C1-C10, C1-C8, C1-C6, C1-C4, or C1-C2). In some embodiments, the alkenyl, heteroalkenyl, alkynyl, or heteroalkynyl group may contain, e.g., 2-20, 2-18, 2-16, 2-14, 2-12, 2-10, 2-8, 2-6, or 2-4 carbon atoms (e.g., C2-C20, C2-C18, C2-C16, C2-C14, C2-C12, C2-C10, C2-C8, C2-C6, or C2-C4). Examples include, but are not limited to, methyl, ethyl, isobutyl, sec-butyl, tert-butyl, 2-propenyl, and 3-butynyl. The term “cycloalkyl,” as used herein, represents a monovalent saturated or unsaturated non- aromatic cyclic alkyl group. A cycloalkyl may have, e.g., three to twenty carbons (e.g., a C3-C7, C3-C8, C3- C9, C3-C10, C3-C11, C3-C12, C3-C14, C3-C16, C3-C18, or C3-C20 cycloalkyl). Examples of cycloalkyls include, but are not limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, and cycloheptyl. When the cycloalkyl group includes at least one carbon-carbon double bond, the cycloalkyl group can be referred to as a “cycloalkenyl” group. A cycloalkenyl may have, e.g., four to twenty carbons (e.g., a C4-C7, C4-C8, C4- C9, C4-C10, C4-C11, C4-C12, C4-C14, C4-C16, C4-C18, or C4-C20 cycloalkenyl). Exemplary cycloalkenyl groups include, but are not limited to, cyclopentenyl, cyclohexenyl, and cycloheptenyl. When the cycloalkyl group includes at least one carbon-carbon triple bond, the cycloalkyl group can be referred to as a “cycloalkynyl” group. A cycloalkynyl may have, e.g., eight to twenty carbons (e.g., a C8-C9, C8-C10, C8-C11, C8-C12, C8- C14, C8-C16, C8-C18, or C8-C20 cycloalkynyl). The term “cycloalkyl” also includes a cyclic compound having a bridged multicyclic structure in which one or more carbons bridges two non-adjacent members of a monocyclic ring, e.g., bicyclo[2.2.1.]heptyl and adamantane. The term “cycloalkyl” also includes bicyclic, tricyclic, and tetracyclic fused ring structures, e.g., decalin and spiro cyclic compounds. A “heterocycloalkyl,” “heterocycloalkenyl,” or “heterocycloalkynyl” group refers to a cycloalkyl, cycloalkenyl,
or cycloalkynyl group having one or more rings (e.g., 1, 2, 3, 4 or more rings) that has one or more heteroatoms independently selected from, e.g., N, O, and S. Exemplary heterocycloalkyl groups include pyrrolidine, thiophene, thiolane, tetrahydrofuran, piperidine, tetrahydropyran, quinoline, isoquinoline, cinnoline, phthalazine, quinazoline, quinoxaline, indole, benzothiophene, benzofuran, isoindole, benzo[c]thiophene, isobenzofuran, benzimidazole, benzoxazole, benzothiazole, 1H-indazole, 1,2,benzisoxazole, 1,2-benzisothiazole, 2,1-benzisothiazole, 2,1-benzisoxazole, purine, pyrrolizidine, indene, fluorene, carbazole, dibenzofuran, acridine, phenazine, and phenoxazine. The term “aryl,” as used herein, refers to any monocyclic or fused ring bicyclic or tricyclic system which has the characteristics of aromaticity in terms of electron distribution throughout the ring system, e.g., phenyl, naphthyl, or phenanthrene. In some embodiments, a ring system contains 5-15 ring member atoms or 5-10 ring member atoms. An aryl group may have, e.g., five to fifteen carbons (e.g., a C5-C6, C5-C7, C5-C8, C5-C9, C5-C10, C5-C11, C5-C12, C5-C13, C5- C14, or C5-C15 aryl). The term “heteroaryl” also refers to such monocyclic or fused bicyclic ring systems containing one or more, e.g., 1-4, 1-3, 1, 2, 3, or 4, heteroatoms selected from O, S and N. A heteroaryl group may have, e.g., two to fifteen ring member atoms (e.g., a C2-C3, C2-C4, C2-C5, C2-C6, C2-C7, C2-C8, C2-C9, C2-C10, C2-C11, C2-C12, C2-C13, C2-C14, or C3-C15 heteroaryl). The inclusion of a heteroatom permits inclusion of 5-membered rings to be considered aromatic as well as 6-membered rings. Thus, typical heteroaryl systems include, e.g., pyridyl, pyrimidyl, indolyl, benzimidazolyl, benzotriazolyl, isoquinolyl, quinolyl, benzothiazolyl, benzofuranyl, thienyl, furyl, pyrrolyl, thiazolyl, triazolyl (e.g., 1,2,3- or 1,2,4-triazolyl) oxazolyl, isoxazolyl, benzoxazolyl, benzoisoxazolyl, and imidazolyl. Because tautomers are possible, a group such as phthalimido is also considered heteroaryl. In some embodiments, the aryl or heteroaryl group is a 5- or 6-membered aromatic rings system optionally containing 1-2 nitrogen atoms. In some embodiments, the aryl or heteroaryl group is an optionally substituted phenyl, pyridyl, indolyl, pyrimidyl, pyridazinyl, benzothiazolyl, benzimidazolyl, pyrazolyl, imidazolyl, isoxazolyl, thiazolyl, or imidazopyridinyl. In some embodiments, the aryl group is phenyl. In some embodiments, an aryl group may be optionally substituted with a substituent such an aryl substituent, e.g., biphenyl. The term “alkaryl,” refers to an aryl group that is connected to an alkylene, alkenylene, or alkynylene group. In general, if a compound is attached to an alkaryl group, the alkylene, alkenylene, or alkynylene portion of the alkaryl is attached to the compound. In some embodiments, an alkaryl is C6-C35 alkaryl (e.g., C6-C16, C6-C14, C6-C12, C6-C10, C6-C9, C6-C8, C7, or C6 alkaryl), in which the number of carbons indicates the total number of carbons in both the aryl portion and the alkylene, alkenylene, or alkynylene portion of the alkaryl. Examples of alkaryls include, but are not limited to, (C1-C8)alkylene(C6- C12)aryl, (C2-C8)alkenylene(C6-C12)aryl, or (C2-C8)alkynylene(C6-C12)aryl. In some embodiments, an alkaryl is benzyl or phenethyl. In a heteroalkaryl, one or more heteroatoms selected from N, O, and S may be present in the alkylene, alkenylene, or alkynylene portion of the alkaryl group and/or may be present in the aryl portion of the alkaryl group. In an optionally substituted alkaryl, the substituent may be present on the alkylene, alkenylene, or alkynylene portion of the alkaryl group and/or may be present on the aryl portion of the alkaryl group.
The term “amino,” as used herein, represents –N(Rx)2 or –N+(Rx)3, where each Rx is, independently, H, alkyl, alkenyl, alkynyl, aryl, alkaryl, cycloalkyl, or two Rx combine to form a heterocycloalkyl. In some embodiment, the amino group is -NH2. The term “alkamino,” as used herein, refers to an amino group, described herein, that is attached to an alkylene (e.g., C1-C5 alkylene), alkenylene (e.g., C2-C5 alkenylene), or alkynylene group (e.g., C2-C5 alkenylene). In general, if a compound is attached to an alkamino group, the alkylene, alkenylene, or alkynylene portion of the alkamino is attached to the compound. The amino portion of an alkamino refers to –N(Rx)2 or –N+(Rx)3, where each Rx is, independently, H, alkyl, alkenyl, alkynyl, aryl, alkaryl, cycloalkyl, or two Rx combine to form a heterocycloalkyl. In some embodiment, the amino portion of an alkamino is -NH2. An example of an alkamino group is C1-C5 alkamino, e.g., C2 alkamino (e.g., CH2CH2NH2 or CH2CH2N(CH3)2). In a heteroalkamino group, one or more, e.g., 1-4, 1-3, 1, 2, 3, or 4, heteroatoms selected from N, O, and S may be present in the alkylene, alkenylene, or alkynylene portion of the heteroalkamino group. In some embodiments, an alkamino group may be optionally substituted. In a substituted alkamino group, the substituent may be present on the alkylene, alkenylene, or alkynylene portion of the alkamino group and/or may be present on the amino portion of the alkamino group. The term “alkamide,” as used herein, refers to an amide group that is attached to an alkylene (e.g., C1-C5 alkylene), alkenylene (e.g., C2-C5 alkenylene), or alkynylene (e.g., C2-C5 alkenylene) group. In general, if a compound is attached to an alkamide group, the alkylene, alkenylene, or alkynylene portion of the alkamide is attached to the compound. The amide portion of an alkamide refers to – C(O)-N(Rx)2, where each Rx is, independently, H, alkyl, alkenyl, alkynyl, aryl, alkaryl, cycloalkyl, or two Rx combine to form a heterocycloalkyl. In some embodiment, the amide portion of an alkamide is -C(O)NH2. An alkamide group may be -(CH2)2-C(O)NH2 or -CH2-C(O)NH2. In a heteroalkamide group, one or more, e.g., 1-4, 1-3, 1, 2, 3, or 4, heteroatoms selected from N, O, and S may be present in the alkylene, alkenylene, or alkynylene portion of the heteroalkamide group. In some embodiments, an alkamide group may be optionally substituted. In a substituted alkamide group, the substituent may be present on the alkylene, alkenylene, or alkynylene portion of the alkamide group and/or may be present on the amide portion of the alkamide group. The terms “alkylene,” “alkenylene,” and “alkynylene,” as used herein, refer to divalent groups having a specified size. In some embodiments, an alkylene may contain, e.g., 1-20, 1-18, 1-16, 1-14, 1- 12, 1-10, 1-8, 1-6, 1-4, or 1-2 carbon atoms (e.g., C1-C20, C1-C18, C1-C16, C1-C14, C1-C12, C1-C10, C1-C8, C1-C6, C1-C4, or C1-C2). In some embodiments, an alkenylene or alkynylene may contain, e.g., 2-20, 2- 18, 2-16, 2-14, 2-12, 2-10, 2-8, 2-6, or 2-4 carbon atoms (e.g., C2-C20, C2-C18, C2-C16, C2-C14, C2-C12, C2- C10, C2-C8, C2-C6, or C2-C4). Alkylene, alkenylene, and/or alkynylene includes straight-chain and branched-chain forms, as well as combinations of these. The divalency of an alkylene, alkenylene, or alkynylene group does not include the optional substituents on the alkylene, alkenylene, or alkynylene group. For example, two gp120 binders may be attached to each other by way of a linker that includes alkylene, alkenylene, and/or alkynylene, or combinations thereof. Each of the alkylene, alkenylene, and/or alkynylene groups in the linker is considered divalent with respect to the two attachments on either end of alkylene, alkenylene, and/or alkynylene group. For example, if a linker includes -(optionally substituted alkylene)-(optionally substituted alkenylene)-(optionally substituted alkylene)-, the alkenylene
is considered divalent with respect to its attachments to the two alkylenes at the ends of the linker. The optional substituents on the alkenylene are not included in the divalency of the alkenylene. The divalent nature of an alkylene, alkenylene, or alkynylene group (e.g., an alkylene, alkenylene, or alkynylene group in a linker) refers to both of the ends of the group and does not include optional substituents that may be present in an alkylene, alkenylene, or alkynylene group. Because they are divalent, they can link together multiple (e.g., two) parts of a conjugate, e.g., a first gp120 binder and a second gp120 binder. Alkylene, alkenylene, and/or alkynylene groups can be substituted by the groups typically suitable as substituents for alkyl, alkenyl and alkynyl groups as set forth herein. For example, C=O is a C1 alkylene that is substituted by an oxo (=O). For example, -HCR-C≡C- may be considered as an optionally substituted alkynylene and is considered a divalent group even though it has an optional substituent, R. Heteroalkylene, heteroalkenylene, and/or heteroalkynylene groups refer to alkylene, alkenylene, and/or alkynylene groups including one or more, e.g., 1-4, 1-3, 1, 2, 3, or 4, heteroatoms, e.g., N, O, and S. For example, a polyethylene glycol (PEG) polymer or a PEG unit -(CH2)2-O- in a PEG polymer is considered a heteroalkylene containing one or more oxygen atoms. The term “cycloalkylene,” as used herein, refers to a divalent cyclic group linking together two parts of a compound. For example, one carbon within the cycloalkylene group may be linked to one part of the compound, while another carbon within the cycloalkylene group may be linked to another part of the compound. A cycloalkylene group may include saturated or unsaturated non-aromatic cyclic groups. A cycloalkylene may have, e.g., three to twenty carbons in the cyclic portion of the cycloalkylene (e.g., a C3-C7, C3-C8, C3-C9, C3-C10, C3-C11, C3-C12, C3-C14, C3-C16, C3-C18, or C3-C20 cycloalkylene). When the cycloalkylene group includes at least one carbon-carbon double bond, the cycloalkylene group can be referred to as a “cycloalkenylene” group. A cycloalkenylene may have, e.g., four to twenty carbons in the cyclic portion of the cycloalkenylene (e.g., a C4-C7, C4-C8, C4-C9. C4-C10, C4-C11, C4-C12, C4-C14, C4-C16, C4-C18, or C4-C20 cycloalkenylene). When the cycloalkylene group includes at least one carbon-carbon triple bond, the cycloalkylene group can be referred to as a “cycloalkynylene” group. A cycloalkynylene may have, e.g., four to twenty carbons in the cyclic portion of the cycloalkynylene (e.g., a C4-C7, C4-C8, C4-C9, C4-C10, C4-C11, C4-C12, C4-C14, C4-C16, C4-C18, or C8-C20 cycloalkynylene). A cycloalkylene group can be substituted by the groups typically suitable as substituents for alkyl, alkenyl and alkynyl groups as set forth herein. Heterocycloalkylene refers to a cycloalkylene group including one or more, e.g., 1-4, 1-3, 1, 2, 3, or 4, heteroatoms, e.g., N, O, and S. Examples of cycloalkylenes include, but are not limited to, cyclopropylene and cyclobutylene. A tetrahydrofuran may be considered as a heterocycloalkylene. The term “arylene,” as used herein, refers to a multivalent (e.g., divalent or trivalent) aryl group linking together multiple (e.g., two or three) parts of a compound. For example, one carbon within the arylene group may be linked to one part of the compound, while another carbon within the arylene group may be linked to another part of the compound. An arylene may have, e.g., five to fifteen carbons in the aryl portion of the arylene (e.g., a C5-C6, C5-C7, C5-C8, C5-C9, C5-C10, C5-C11, C5-C12, C5-C13, C5-C14, or C5- C15 arylene). An arylene group can be substituted by the groups typically suitable as substituents for alkyl, alkenyl and alkynyl groups as set forth herein. Heteroarylene refers to an aromatic group including one or more, e.g., 1-4, 1-3, 1, 2, 3, or 4, heteroatoms, e.g., N, O, and S. A heteroarylene group may
have, e.g., two to fifteen carbons (e.g., a C2-C3, C2-C4, C2-C5, C2-C6, C2-C7, C2-C8, C2-C9. C2-C10, C2-C11, C2-C12, C2-C13, C2-C14, or C3-C15 heteroarylene). The term “optionally substituted,” as used herein, refers to having 0, 1, or more substituents, such as 0-25, 0-20, 0-10 or 0-5 substituents. Substituents include, but are not limited to, alkyl, alkenyl, alkynyl, aryl, alkaryl, acyl, heteroaryl, heteroalkyl, heteroalkenyl, heteroalkynyl, heteroalkaryl, halogen, oxo, cyano, nitro, amino, alkamino, hydroxy, alkoxy, alkanoyl, carbonyl, carbamoyl, guanidinyl, ureido, amidinyl, any of the groups or moieties described above, and hetero versions of any of the groups or moieties described above. Substituents include, but are not limited to, F, Cl, methyl, phenyl, benzyl, OR, NR2, SR, SOR, SO2R, OCOR, NRCOR, NRCONR2, NRCOOR, OCONR2, RCO, COOR, alkyl-OOCR, SO3R, CONR2, SO2NR2, NRSO2NR2, CN, CF3, O CF3, SiR3, and NO2, wherein each R is, independently, H, alkyl, alkenyl, aryl, heteroalkyl, heteroalkenyl, or heteroaryl, and wherein two of the optional substituents on the same or adjacent atoms can be joined to form a fused, optionally substituted aromatic or nonaromatic, saturated or unsaturated ring which contains 3–8 members, or two of the optional substituents on the same atom can be joined to form an optionally substituted aromatic or nonaromatic, saturated or unsaturated ring which contains 3–8 members. An optionally substituted group or moiety refers to a group or moiety (e.g., any one of the groups or moieties described above) in which one of the atoms (e.g., a hydrogen atom) is optionally replaced with another substituent. For example, an optionally substituted alkyl may be an optionally substituted methyl, in which a hydrogen atom of the methyl group is replaced by, e.g., OH. As another example, a substituent on a heteroalkyl or its divalent counterpart, heteroalkylene, may replace a hydrogen on a carbon or a hydrogen on a heteroatom such as N. For example, the hydrogen atom in the group -R-NH-R- may be substituted with an alkamide substituent, e.g., -R-N[(CH2C(O)N(CH3)2]-R. Generally, an optional substituent is a noninterfering substituent. A “noninterfering substituent” refers to a substituent that leaves the ability of the conjugates described herein (e.g., conjugates of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) to either bind to viral gp41 or gp120 receptor or to inhibit the proliferation of HIV. Thus, in some embodiments, the substituent may alter the degree of such activity. However, as long as the conjugate retains the ability to bind to viral gp41 or gp120 receptor or to inhibit HIV proliferation, the substituent will be classified as “noninterfering.” For example, the noninterfering substituent would leave the ability of the compound to provide antiviral efficacy based on an IC50 value of 10 μM or less in a viral plaque reduction assay. Thus, the substituent may alter the degree of inhibition based on plaque reduction or gp120 receptor inhibition. However, as long as the compound described herein, such as any compound of formula (A-I), retains the ability to inhibit gp120 receptor, the substituent will be classified as "noninterfering." A number of assays for determining viral plaque reduction or the ability of any compound to inhibit gp120 receptor are available in the art, and some are exemplified in the Examples below. The term “hetero,” when used to describe a chemical group or moiety, refers to having at least one heteroatom that is not a carbon or a hydrogen, e.g., N, O, and S. Any one of the groups or moieties described above may be referred to as hetero if it contains at least one heteroatom. For example, a heterocycloalkyl, heterocycloalkenyl, or heterocycloalkynyl group refers to a cycloalkyl, cycloalkenyl, or cycloalkynyl group that has one or more heteroatoms independently selected from, e.g., N, O, and S. An
example of a heterocycloalkenyl group is a maleimido. For example, a heteroaryl group refers to an aromatic group that has one or more heteroatoms independently selected from, e.g., N, O, and S. One or more heteroatoms may also be included in a substituent that replaced a hydrogen atom in a group or moiety as described herein. For example, in an optionally substituted heteroaryl group, if one of the hydrogen atoms in the heteroaryl group is replaced with a substituent (e.g., methyl), the substituent may also contain one or more heteroatoms (e.g., methanol). The term “acyl,” as used herein, refers to a group having the structure:
, wherein Rz is an optionally substituted alkyl, alkenyl, alkynyl, cycloalkyl, cycloalkenyl, cycloalkynyl, aryl, alkaryl, alkamino, heteroalkyl, heteroalkenyl, heteroalkynyl, heterocycloalkyl, heterocycloalkenyl, heterocycloalkynyl, heteroaryl, heteroalkaryl, or heteroalkamino. The term “halo” or “halogen,” as used herein, refers to any halogen atom, e.g., F, Cl, Br, or I. Any one of the groups or moieties described herein may be referred to as a “halo moiety” if it contains at least one halogen atom, such as haloalkyl. The term “haloalkyl,” as used herein, refers to an alkyl group substituted with one or more (e.g., one, two, three, four, five, six, or more) halo groups. Haloalkyl groups include, but are not limited to, fluoroalkyl (e.g., trifluoromethyl and pentafluoroethyl) and chloroalkyl. The term “hydroxyl,” as used herein, represents an -OH group. The term “oxo,” as used herein, refers to a substituent having the structure =O, where there is a double bond between an atom and an oxygen atom. The term “carbonyl,” as used herein, refers to a group having the structure:
The term “thiocarbonyl,” as used herein, refers to a group having the structure:
. The term “phosphate,” as used herein, represents the group having the structure:
The term “phosphoryl,” as used herein, represents the group having the structure:
The term “sulfonyl,” as used herein, represents the group having the structure:
The term “imino,” as used herein, represents the group having the structure:
, wherein R is an optional substituent.
The term “N-protecting group,” as used herein, represents those groups intended to protect an amino group against undesirable reactions during synthetic procedures. Commonly used N-protecting groups are disclosed in Greene, “Protective Groups in Organic Synthesis,” 5th Edition (John Wiley & Sons, New York, 2014), which is incorporated herein by reference. N-protecting groups include, e.g., acyl, aryloyl, and carbamyl groups such as formyl, acetyl, propionyl, pivaloyl, t-butylacetyl, 2-chloroacetyl, 2-bromoacetyl, trifluoroacetyl, trichloroacetyl, phthaloyl, o-nitrophenoxyacetyl, α-chlorobutyryl, benzoyl, carboxybenzyl (CBz), 4-chlorobenzoyl, 4-bromobenzoyl, 4-nitrobenzoyl, and chiral auxiliaries such as protected or unprotected D, L or D, L-amino acid residues such as alanine, leucine, phenylalanine; sulfonyl-containing groups such as benzenesulfonyl and p-toluenesulfonyl; carbamate forming groups such as benzyloxycarbonyl, p-chlorobenzyloxycarbonyl, p-methoxybenzyloxycarbonyl, p- nitrobenzyloxycarbonyl, 2-nitrobenzyloxycarbonyl, p-bromobenzyloxycarbonyl, 3,4- dimethoxybenzyloxycarbonyl, 3,5-dimethoxybenzyl oxycarbonyl, 2,4-dimethoxybenzyloxycarbonyl, 4-methoxybenzyloxycarbonyl, 2-nitro-4,5-dimethoxybenzyloxycarbonyl, 3,4,5-trimethoxybenzyloxycarbonyl, 1-(p-biphenylyl)-1-methylethoxycarbonyl, α,α-dimethyl- 3,5-dimethoxybenzyloxycarbonyl, benzhydryloxy carbonyl, t-butyloxycarbonyl (BOC), diisopropylmethoxycarbonyl, isopropyloxycarbonyl, ethoxycarbonyl, methoxycarbonyl, allyloxycarbonyl, 2,2,2,-trichloroethoxycarbonyl, phenoxycarbonyl, 4-nitrophenoxy carbonyl, fluorenyl-9-methoxycarbonyl (Fmoc), cyclopentyloxycarbonyl, adamantyloxycarbonyl, cyclohexyloxycarbonyl, and phenylthiocarbonyl; alkaryl groups such as benzyl, triphenylmethyl, and benzyloxymethyl; and silyl groups such as trimethylsilyl. The term “amino acid,” as used herein, means naturally occurring amino acids and non-naturally occurring amino acids. The term “naturally occurring amino acids,” as used herein, means amino acids including Ala, Arg, Asn, Asp, Cys, Gln, Glu, Gly, His, Ile, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr, and Val. The term “non-naturally occurring amino acid,” as used herein, means an alpha amino acid that is not naturally produced or found in a mammal. Examples of non-naturally occurring amino acids include D-amino acids; an amino acid having an acetylaminomethyl group attached to a sulfur atom of a cysteine; a pegylated amino acid; the omega amino acids of the formula NH2(CH2)nCOOH where n is 2-6, neutral nonpolar amino acids, such as sarcosine, t-butyl alanine, t-butyl glycine, N-methyl isoleucine, and norleucine; oxymethionine; phenylglycine; citrulline; methionine sulfoxide; cysteic acid; ornithine; diaminobutyric acid; 3-aminoalanine; 3-hydroxy-D-proline; 2,4-diaminobutyric acid; 2-aminopentanoic acid; 2-aminooctanoic acid, 2-carboxy piperazine; piperazine-2-carboxylic acid, 2-amino-4-phenylbutanoic acid; 3-(2-naphthyl)alanine, and hydroxyproline. Other amino acids are α-aminobutyric acid, α-amino-α- methylbutyrate, aminocyclopropane-carboxylate, aminoisobutyric acid, aminonorbornyl-carboxylate, L- cyclohexylalanine, cyclopentylalanine, L-N-methylleucine, L-N-methylmethionine, L-N-methylnorvaline, L- N-methylphenylalanine, L-N-methylproline, L-N-methylserine, L-N-methyltryptophan, D-ornithine, L-N- methylethylglycine, L-norleucine, α-methyl-aminoisobutyrate, α-methylcyclohexylalanine, D-α- methylalanine, D-α-methylarginine, D-α-methylasparagine, D-α-methylaspartate, D-α-methylcysteine, D- α-methylglutamine, D-α-methylhistidine, D-α-methylisoleucine, D-α-methylleucine, D-α-methyllysine, D-α- methylmethionine, D-α-methylornithine, D-α-methylphenylalanine, D-α-methylproline, D-α-methylserine,
D-N-methylserine, D-α-methylthreonine, D-α-methyltryptophan, D-α-methyltyrosine, D-α-methylvaline, D- N-methylalanine, D-N-methylarginine, D-N-methylasparagine, D-N-methylaspartate, D-N-methylcysteine, D-N-methylglutamine, D-N-methylglutamate, D-N-methylhistidine, D-N-methylisoleucine, D-N- methylleucine, D-N-methyllysine, N-methylcyclohexylalanine, D-N-methylornithine, N-methylglycine, N- methylaminoisobutyrate, N-(1-methylpropyl)glycine, N-(2-methylpropyl)glycine, D-N-methyltryptophan, D- N-methyltyrosine, D-N-methylvaline, γ-aminobutyric acid, L-t-butylglycine, L-ethylglycine, L- homophenylalanine, L-α-methylarginine, L-α-methylaspartate, L-α-methylcysteine, L-α-methylglutamine, L-α-methylhistidine, L-α-methylisoleucine, L-α-methylleucine, L-α-methylmethionine, L-α-methylnorvaline, L-α-methylphenylalanine, L-α-methylserine, L-α-methyltryptophan, L-α-methylvaline, N-(N-(2,2- diphenylethyl) carbamylmethylglycine, 1-carboxy-1-(2,2-diphenyl-ethylamino) cyclopropane, 4- hydroxyproline, ornithine, 2-aminobenzoyl (anthraniloyl), D-cyclohexylalanine, 4-phenyl-phenylalanine, L- citrulline, α-cyclohexylglycine, L-1,2,3,4-tetrahydroisoquinoline-3-carboxylic acid, L-thiazolidine-4- carboxylic acid, L-homotyrosine, L-2-furylalanine, L-histidine (3-methyl), N-(3-guanidinopropyl)glycine, O- methyl-L-tyrosine, O-glycan-serine, meta-tyrosine, nor-tyrosine, L-N,N′,N″-trimethyllysine, homolysine, norlysine, N-glycan asparagine, 7-hydroxy-1,2,3,4-tetrahydro-4-fluorophenylalanine, 4- methylphenylalanine, bis-(2-picolyl)amine, pentafluorophenylalanine, indoline-2-carboxylic acid, 2- aminobenzoic acid, 3-amino-2-naphthoic acid, asymmetric dimethylarginine, L-tetrahydroisoquinoline-1- carboxylic acid, D-tetrahydroisoquinoline-1-carboxylic acid, 1-amino-cyclohexane acetic acid, D/L- allylglycine, 4-aminobenzoic acid, 1-amino-cyclobutane carboxylic acid, 2 or 3 or 4-aminocyclohexane carboxylic acid, 1-amino-1-cyclopentane carboxylic acid, 1-aminoindane-1-carboxylic acid, 4-amino- pyrrolidine-2-carboxylic acid, 2-aminotetraline-2-carboxylic acid, azetidine-3-carboxylic acid, 4-benzyl- pyrolidine-2-carboxylic acid, tert-butylglycine, b-(benzothiazolyl-2-yl)-alanine, b-cyclopropyl alanine, 5,5- dimethyl-1,3-thiazolidine-4-carboxylic acid, (2R,4S)4-hydroxypiperidine-2-carboxylic acid, (2S,4S) and (2S,4R)-4-(2-naphthylmethoxy)-pyrolidine-2-carboxylic acid, (2S,4S) and (2S,4R)4-phenoxy-pyrrolidine-2- carboxylic acid, (2R,5S)and(2S,5R)-5-phenyl-pyrrolidine-2-carboxylic acid, (2S,4S)-4-amino-1-benzoyl- pyrrolidine-2-carboxylic acid, t-butylalanine, (2S,5R)-5-phenyl-pyrrolidine-2-carboxylic acid, 1- aminomethyl-cyclohexane-acetic acid, 3,5-bis-(2-amino)ethoxy-benzoic acid, 3,5-diamino-benzoic acid, 2- methylamino-benzoic acid, N-methylanthranylic acid, L-N-methylalanine, L-N-methylarginine, L-N- methylasparagine, L-N-methylaspartic acid, L-N-methylcysteine, L-N-methylglutamine, L-N- methylglutamic acid, L-N-methylhistidine, L-N-methylisoleucine, L-N-methyllysine, L-N-methylnorleucine, L-N-methylornithine, L-N-methylthreonine, L-N-methyltyrosine, L-N-methylvaline, L-N-methyl-t- butylglycine, L-norvaline, α-methyl-γ-aminobutyrate, 4,4′-biphenylalanine, α-methylcylcopentylalanine, α- methyl-α-napthylalanine, α-methylpenicillamine, N-(4-aminobutyl)glycine, N-(2-aminoethyl)glycine, N-(3- aminopropyl)glycine, N-amino-α-methylbutyrate, α-napthylalanine, N-benzylglycine, N-(2- carbamylethyl)glycine, N-(carbamylmethyl)glycine, N-(2-carboxyethyl)glycine, N-(carboxymethyl)glycine, N-cyclobutylglycine, N-cyclodecylglycine, N-cycloheptylglycine, N-cyclohexylglycine, N-cyclodecylglycine, N-cylcododecylglycine, N-cyclooctylglycine, N-cyclopropylglycine, N-cycloundecylglycine, N-(2,2- diphenylethyl)glycine, N-(3,3-diphenylpropyl)glycine, N-(3-guanidinopropyl)glycine, N-(1- hydroxyethyl)glycine, N-(hydroxyethyl))glycine, N-(imidazolylethyl))glycine, N-(3-indolylyethyl)glycine, N- methyl-γ-aminobutyrate, D-N-methylmethionine, N-methylcyclopentylalanine, D-N-methylphenylalanine,
D-N-methylproline, D-N-methylthreonine, N-(1-methylethyl)glycine, N-methyl-napthylalanine, N- methylpenicillamine, N-(p-hydroxyphenyl)glycine, N-(thiomethyl)glycine, penicillamine, L-α-methylalanine, L-α-methylasparagine, L-α-methyl-t-butylglycine, L-methylethylglycine, L-α-methylglutamate, L-α- methylhomophenylalanine, N-(2-methylthioethyl)glycine, L-α-methyllysine, L-α-methylnorleucine, L-α- methylornithine, L-α-methylproline, L-α-methylthreonine, L-α-methyltyrosine, L-N-methyl- homophenylalanine, N-(N-(3,3-diphenylpropyl) carbamylmethylglycine, L-pyroglutamic acid, D- pyroglutamic acid, O-methyl-L-serine, O-methyl-L-homoserine, 5-hydroxylysine, α-carboxyglutamate, phenylglycine, L-pipecolic acid (homoproline), L-homoleucine, L-lysine (dimethyl), L-2-naphthylalanine, L- dimethyldopa or L-dimethoxy-phenylalanine, L-3-pyridylalanine, L-histidine (benzoyloxymethyl), N- cycloheptylglycine, L-diphenylalanine, O-methyl-L-homotyrosine, L-β-homolysine, O-glycan-threoine, Ortho-tyrosine, L-N,N′-dimethyllysine, L-homoarginine, neotryptophan, 3-benzothienylalanine, isoquinoline-3-carboxylic acid, diaminopropionic acid, homocysteine, 3,4-dimethoxyphenylalanine, 4- chlorophenylalanine, L-1,2,3,4-tetrahydronorharman-3-carboxylic acid, adamantylalanine, symmetrical dimethylarginine, 3-carboxythiomorpholine, D-1,2,3,4-tetrahydronorharman-3-carboxylic acid, 3- aminobenzoic acid, 3-amino-1-carboxymethyl-pyridin-2-one, 1-amino-1-cyclohexane carboxylic acid, 2- aminocyclopentane carboxylic acid, 1-amino-1-cyclopropane carboxylic acid, 2-aminoindane-2-carboxylic acid, 4-amino-tetrahydrothiopyran-4-carboxylic acid, azetidine-2-carboxylic acid, b-(benzothiazol-2-yl)- alanine, neopentylglycine, 2-carboxymethyl piperidine, b-cyclobutyl alanine, allylglycine, diaminopropionic acid, homo-cyclohexyl alanine, (2S,4R)- 4-hydroxypiperidine-2-carboxylic acid, octahydroindole-2- carboxylic acid, (2S,4R) and (2S,4R)-4-(2-naphthyl), pyrrolidine-2-carboxylic acid, nipecotic acid, (2S,4R)and (2S,4S)-4-(4-phenylbenzyl) pyrrolidine-2-carboxylic acid, (3S)-1-pyrrolidine-3-carboxylic acid, (2S,4S)-4-tritylmercapto-pyrrolidine-2-carboxylic acid, (2S,4S)-4-mercaptoproline, t-butylglycine, N,N- bis(3-aminopropyl)glycine, 1-amino-cyclohexane-1-carboxylic acid, N-mercaptoethylglycine, and selenocysteine. In some embodiments, amino acid residues may be charged or polar. Charged amino acids include alanine, lysine, aspartic acid, or glutamic acid, or non-naturally occurring analogs thereof. Polar amino acids include glutamine, asparagine, histidine, serine, threonine, tyrosine, methionine, or tryptophan, or non-naturally occurring analogs thereof. It is specifically contemplated that in some embodiments, a terminal amino group in the amino acid may be an amido group or a carbamate group. As used herein, the term “percent (%) identity” refers to the percentage of amino acid residues of a candidate sequence, e.g., an Fc-IgG, or fragment thereof, that are identical to the amino acid residues of a reference sequence after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent identity (i.e., gaps can be introduced in one or both of the candidate and reference sequences for optimal alignment and non-homologous sequences can be disregarded for comparison purposes). Alignment for purposes of determining percent identity can be achieved in various ways that are within the skill in the art, for instance, using publicly available computer software such as BLAST, ALIGN, or Megalign (DNASTAR) software. Those skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared. In some embodiments, the percent amino acid sequence identity of a given candidate sequence to, with, or against a given reference sequence (which can
alternatively be phrased as a given candidate sequence that has or includes a certain percent amino acid sequence identity to, with, or against a given reference sequence) is calculated as follows: 100 x (fraction of A/B) where A is the number of amino acid residues scored as identical in the alignment of the candidate sequence and the reference sequence, and where B is the total number of amino acid residues in the reference sequence. In some embodiments where the length of the candidate sequence does not equal to the length of the reference sequence, the percent amino acid sequence identity of the candidate sequence to the reference sequence would not equal to the percent amino acid sequence identity of the reference sequence to the candidate sequence. Two polynucleotide or polypeptide sequences are said to be “identical” if the sequence of nucleotides or amino acids in the two sequences is the same when aligned for maximum correspondence as described above. Comparisons between two sequences are typically performed by comparing the sequences over a comparison window to identify and compare local regions of sequence similarity. A “comparison window” as used herein, refers to a segment of at least about 15 contiguous positions, about 20 contiguous positions, about 25 contiguous positions, or more (e.g., about 30 to about 75 contiguous positions, or about 40 to about 50 contiguous positions), in which a sequence may be compared to a reference sequence of the same number of contiguous positions after the two sequences are optimally aligned. The term “treating” or “to treat,” as used herein, refers to a therapeutic treatment of a viral infection (e.g., a viral infection such as an HIV infection) in a subject. In some embodiments, a therapeutic treatment may slow the progression of the viral infection, improve the subject’s outcome, and/or eliminate the infection. In some embodiments, a therapeutic treatment of a viral infection in a subject may alleviate or ameliorate of one or more symptoms or conditions associated with the viral infection, diminish the extent of the viral, stabilize (i.e., not worsening) the state of the viral infection, prevent the spread of the viral infection, and/or delay or slow the progress of the viral infection, as compare the state and/or the condition of the viral infection in the absence of the therapeutic treatment. The term “average value of T,” as used herein, refers to the mean number of monomers of gp120 binder or dimers of gp120 binders conjugated to an Fc domain or an albumin protein within a population of conjugates. In some embodiments, within a population of conjugates, the average number of monomers of gp120 binder or dimers of gp120 binders conjugated to an Fc domain monomer may be from 1 to 20 (e.g., the average value of T is 1 to 2, 1 to 3, 1 to 4, 1 to 5, 5 to 10, 10 to 15, or 15 to 20). In some embodiments, the average value of T is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20. The term “subject,” as used herein, can be a human or non-human primate, or other mammal, such as but not limited to dog, cat, horse, cow, pig, turkey, goat, fish, monkey, chicken, rat, mouse, or sheep. The term “therapeutically effective amount,” as used herein, refers to an amount, e.g., pharmaceutical dose, effective in inducing a desired effect in a subject or in treating a subject having a condition or disorder described herein (e.g., a viral infection, such as an HIV infection). It is also to be understood herein that a “therapeutically effective amount” may be interpreted as an amount giving a
desired therapeutic and/or preventative effect, taken in one or more doses or in any dosage or route, and/or taken alone or in combination with other therapeutic agents (e.g., an antiviral agent described herein). For example, in the context of administering a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) that is used for the treatment of a viral infection, an effective amount of a conjugate is, for example, an amount sufficient to prevent, slow down, or reverse the progression of the viral infection as compared to the response obtained without administration of the conjugate. As used herein, the term “pharmaceutical composition” refers to a medicinal or pharmaceutical formulation that contains at least one active ingredient (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) as well as one or more excipients and diluents to enable the active ingredient suitable for the method of administration. The pharmaceutical composition of the present disclosure includes pharmaceutically acceptable components that are compatible with a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)). As used herein, the term “pharmaceutically acceptable carrier” refers to an excipient or diluent in a pharmaceutical composition. For example, a pharmaceutically acceptable carrier may be a vehicle capable of suspending or dissolving the active conjugate (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)). The pharmaceutically acceptable carrier must be compatible with the other ingredients of the formulation and not deleterious to the recipient. In the present disclosure, the pharmaceutically acceptable carrier must provide adequate pharmaceutical stability to a conjugate described herein. The nature of the carrier differs with the mode of administration. For example, for oral administration, a solid carrier is preferred; for intravenous administration, an aqueous solution carrier (e.g., WFI, and/or a buffered solution) is generally used. The term “pharmaceutically acceptable salt,” as used herein, represents salts of the conjugates described herein (e.g., conjugates of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) that are, within the scope of sound medical judgment, suitable for use in methods described herein without undue toxicity, irritation, and/or allergic response. Pharmaceutically acceptable salts are well known in the art. For example, pharmaceutically acceptable salts are described in: Pharmaceutical Salts: Properties, Selection, and Use (Eds. P.H. Stahl and C.G. Wermuth), Wiley-VCH, 2008. The salts can be prepared in situ during the final isolation and purification of the conjugates described herein or separately by reacting the free base group with a suitable organic acid. The term “gp120 binder,” as used herein, refers to a moiety, such as a small molecule (e.g., temsavir, BMS-818251, DMJ-II-121, BNM-IV-147 or analogs thereof) that binds to the HIV gp120 glycoprotein. By blocking the gp120 glycoprotein of the virus, a gp120 binder prevents viral attachment to the host CD4+ T cell and entry into the host immune cell. Gp120 binders of the invention include compounds described by formula (A-I), preferably temsavir, BMS-818251, DMJ-II-121, BNM-IV-147, or an analog thereof. The term “about,” as used herein, indicates a deviation of ±5%. For example, about 10% refers to from 9.5% to 10.5%. Any values provided in a range of values include both the upper and lower bounds, and any values contained within the upper and lower bounds.
The term “(1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)”, as used herein, represents the formulas of any one of (D-IV-7), (D-IV-8), (D-V-6), (D-V-7), (D-V-8), (D-V-9), (D-VI-8), (M-I), (M-II), (M-III), (M-III-1), (M-III-2), (M-III-3), (M-III-4), (M-III-5), (M-III-6), (M-IV), (M-IV-1), (M-IV-2), (M-IV-3), (M-IV-4), (M-IV-5), (M- IV-6), (M-IV-7), (M-IV-8), (M-IV-9), (M-V), (M-V-1), (M-V-2), (M-V-3), (M-V-4), (M-V-5), (M-V-6), (M-V-7), (M-V-8), (M-V-9), (M-VI), (M-VI-1), (M-VI-2), (M-VI-3), (M-VI-4), (M-VI-5), (M-VI-6), (M-VI-7), (M-VI-8), (M- VII), (M-VIII), (M-VIII-1), (M-IX), (M-IX-1), (M-X), (M-X-1), (M-XI), or (M-XI-1), (M-XII), (M-XII-1), (M-XII-2), (M-XIII), (M-XIII-1), (M-XIII-2), (M-XIV), (M-XIV-1), (M-XIV-2), (M-XIV-3), (M-XIV-4), (M-XIV-5), (M-XV), (M-XV-1), (M-XV-2), (M-XV-3), (M-XV-4), (M-XV-5), (M-XVI), (M-XVI-1), (M-XVI-2), (M-XVI-3), (M-XVI-4), (M-XVI-5), (M-XV), (M-XVII-1), (M-XVII-2), (M-XVII-3), (M-XVII-4), (M-XVII-5). Other features and advantages of the conjugates described herein will be apparent from the following Detailed Description and the claims. Description of the Drawings FIG.1 is an image depicting exemplary methods of conjugating a gp120 receptor inhibitor monomer or dimer, e.g., by way of a linker, to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide. FIG.2 is an image depicting a method of conjugating a gp120 binder monomer or dimer, e.g., by way of a linker, to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide by oxime conjugation to an amino acid residue, e.g., a nitrogen atom of a surface exposed lysine. FIG.3 is an image depicting a method of conjugating a gp120 binder monomer or dimer, e.g., by way of a linker, to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide by thioether conjugation to an amino acid residue, e.g., a nitrogen atom of a surface exposed lysine. FIG.4 is an image depicting a method of conjugating a gp120 binder monomer or dimer, e.g., by way of a linker, to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide by rebridged cysteine conjugation, e.g., rebridged cysteine conjugation to a pair of sulfur atoms of two hinge cysteines in an Fc domain monomer or Fc domain. FIG.5 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 1. FIG.6 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 3. FIG.7 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 5. FIG.8 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 7. FIG.9 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 9. FIG.10 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 12.
FIG.11 shows non-reducing and reducing SDS-PAGE and a schematic illustration of an Fc domain formed from Fc domain monomers having the sequence of SEQ ID NO: 14. FIG.12 is a graph showing the binding of conjugates containing gp120 binders to the gp120 protein compared to a polyclonal goat anti-gp120 HRP (PA1-73097, Invitrogen) positive control and an unconjugated Fc molecule negative control. FIG.13 is a graph showing plasma levels of a conjugate including an Fc domain having a C220S mutation (SEQ ID NO: 64) (2 mpk IV) compared to a conjugate including an Fc domain having a C220S mutation and a YTE triple mutation (SEQ ID NO: 67) (2 mpk IV) in non-human primate PK studies determined by Fc capture. This study was performed as described in Example 40. FIG.14 is an image depicting exemplary conjugates including a gp120 binder monomer or dimer and an Fc domain monomer or an Fc domain. “T” is representative of the drug-to-antibody ratio (DAR) and depicts that multiple monomers or dimers can be conjugated to an Fc domain monomer or an Fc domain. FIG.15 is a graph showing the 7-day mouse PK profiles of Conjugate 5b and Fc control (SEQ ID NO:73). FIG.16 is a graph showing the percent reduction in viral cytopathic effect (CPE) of HIV-1IIIB and cell viability of CEM-SS cells by AZT at a concentration of 2,000 pM, 5,000 pM, 20,000 pM, 50,000 pM, 160,000 pM, and 500,000 pM. FIG.17 is a graph showing the percent reduction in viral CPE of HIV-1IIIB and cell viability of CEM- SS cells by the temsivir buffer at a dilution of 0, 00000002, and 0.000001. FIG.18 is a graph showing the percent reduction in viral CPE of HIV-1IIIB and cell viability of CEM- SS cells by temsavir at a concentration of 0.64 pM, 3.2 pM, 16 pM, 80 pM, 400 pM, and 2,000 pM. FIG.19 is a graph showing the percent reduction in viral CPE of HIV-1IIIB and cell viability of CEM- SS cells by the conjugate buffer at a dilution of 0, 0.0000001, 0.0000004, 0.000002, 0.00001, and 0.00005. FIG.20 is a graph showing the percent reduction in viral CPE of HIV-1IIIB and cell viability of CEM- SS cells by Conjugate 5b at a concentration of 0.64 pM, 3.2 pM, 16 pM, 80 pM, 400 pM, and 2,000 pM. FIG.21 is a graph showing the percent reduction in viral CPE of HIV-1IIIB and cell viability of CEM- SS cells by Conjugate 29a at a concentration of 0.64 pM, 3.2 pM, 16 pM, 80 pM, 400 pM, and 2,000 pM. FIG.22 is a graph showing the percent reduction in viral CPE of HIV-1IIIB and cell viability of CEM- SS cells by Conjugate 29b at a concentration of 0.64 pM, 3.2 pM, 16 pM, 80 pM, 400 pM, and 2,000 pM. FIG.23 is a graph showing the percent reduction in viral CPE of HIV-1IIIB and cell viability of CEM- SS cells by Conjugate 30a at a concentration of 0.64 pM, 3.2 pM, 16 pM, 80 pM, 400 pM, and 2,000 pM. FIG.24 is a graph showing the percent reduction in viral CPE of HIV-1IIIB and cell viability of CEM- SS cells by Conjugate 37a at a concentration of 0.64 pM, 3.2 pM, 16 pM, 80 pM, 400 pM, and 2,000 pM. FIG.25 is a graph showing the 7-day mouse PK profiles of Conjugate 5b, Conjugate 29a, and Fc control (SEQ ID NO: 72).
Detailed Description The disclosure features conjugates, compositions, methods for the treatment of viral infections (e.g., human immunodeficiency viral infections), and methods of synthesizing conjugates. The conjugates disclosed herein include monomers or dimers of viral gp120 binders (e.g., temsavir, BMS- 818251, DMJ-II-121, BNM-IV-147, or analogs thereof) conjugated to Fc monomers, Fc domains, Fc- binding peptides, albumin proteins, or albumin protein-binding peptides. The gp120 binder (e.g., temsavir, BMS-818251, DMJ-II-121, BNM-IV-147, or analogs thereof) in the conjugates targets the gp120 receptor on the surface of the viral particle. The Fc monomers or Fc domains in the conjugates bind to FcγRs (e.g., FcRn, FcγRI, FcγRIIa, FcγRIIc, FcγRIIIa, and FcγRIIIb) on immune cells, e.g., neutrophils, to activate phagocytosis and effector functions, such as antibody-dependent cell-mediated cytotoxicity (ADCC), thus leading to the engulfment and destruction of viral particles by immune cells and further enhancing the antiviral activity of the conjugates. The albumin or albumin-binding peptide may extend the half-life of the conjugate, for example, by binding of albumin to the recycling neonatal Fc receptor. Such compositions are useful in methods for the inhibition of viral growth and in methods for the treatment of viral infections, such as those caused by an HIV-1 or HIV-2. I. Viral Infections The compounds and pharmaceutical compositions described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) can be used to treat a viral infection (e.g., an HIV-1 or HIV-2 viral infection). Viral infection refers to the pathogenic growth of a virus (e.g., the human immunodeficiency virus) in a host organism (e.g., a human subject). A viral infection can be any situation in which the presence of a viral population(s) is damaging to a host body. Thus, a subject is suffering from a viral infection when an excessive amount of a viral population is present in or on the subject’s body, or when the presence of a viral population(s) is damaging the cells or other tissue of the subject. The human immunodeficiency viruses (HIV) are two species of Lentivirus (a subgroup of retrovirus) that causes HIV infection and over time acquired immunodeficiency syndrome (AIDS). AIDS is a condition in humans in which progressive failure of the immune system allows life-threatening opportunistic infections and cancers to thrive. Without treatment, average survival time after infection with HIV is estimated to be 9 to 11 years, depending on the HIV subtype. In most cases, HIV is a sexually transmitted infection and occurs by contact with or transfer of blood, pre-ejaculate, semen, and vaginal fluids. Two types of HIV have been characterized: HIV-1 and HIV-2. HIV infects vital cells in the human immune system, such as helper T cells (specifically CD4+ T cells), macrophages, and dendritic cells. HIV infection leads to low levels of CD4+ T cells through a number of mechanisms, including pyroptosis of abortively infected T cells, apoptosis of uninfected bystander cells, direct viral killing of infected cells, and killing of infected CD4+ T cells by CD8+ cytotoxic lymphocytes that recognize infected cells. When CD4+ T cell numbers decline below a critical level, cell-mediated immunity is lost, and the body becomes progressively more susceptible to opportunistic infections, leading to the development of AIDS.
II. Conjugates of the Disclosure Provided herein are synthetic conjugates useful in the treatment of viral infections (e.g., HIV infections). The conjugates disclosed herein include an Fc domain monomer, an Fc domain, or an albumin protein conjugated to one or more monomers gp120 binders or one or more dimers of two gp120 binders (e.g., gp120 binders selected from temsavir, BMS-818251, DMJ-II-121, BNM-IV-147, or analogs thereof). The dimers of two gp120 binders include a gp120 binder (e.g., a first gp120 binder of formula (A-I) or (A-II)) and a second gp120 binder (e.g., a second gp120 binder of formula(A-I) or (A-II)). The first and second gp120 binders are linked to each other by way of a linker. Without being bound by theory, in some aspects, conjugates described herein bind to the surface of a viral particle (e.g., bind to viral gp120 receptor on the surface on an human immunodeficiency virus particle) through the interactions between the gp120 binder moieties in the conjugates and proteins on the surface of the viral particle. The gp120 binder disrupts gp120, an envelope glycoprotein that binds with the CD4 receptor, particularly on helper T cells. Binding to CD4 initiates a cascade of conformational changes in gp120 and gp41 that lead to the fusion of the viral membrane with the host cell membrane, allowing the spread of the virus. Conjugates of the invention include gp120 binder monomers and dimers conjugated to an Fc domain, Fc monomer, or Fc-binding peptide. The Fc domain in the conjugates described herein binds to the FcγRs (e.g., FcRn, FcγRI, FcγRIIa, FcγRIIc, FcγRIIIa, and FcγRIIIb) on immune cells. The binding of the Fc domain in the conjugates described herein to the FcγRs on immune cells activates phagocytosis and effector functions, such as antibody-dependent cell-mediated cytotoxicity (ADCC), thus leading to the engulfment and destruction of viral particles by immune cells and further enhancing the antiviral activity of the conjugates. Conjugates of the invention include gp120 binder monomers and dimers conjugated to an albumin protein or an albumin protein-binding peptide. The albumin protein or albumin protein-binding peptide may extend the half-life of the conjugate, for example, by binding of albumin to the recycling neonatal Fc receptor. Conjugates provided herein are described by any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII). In some embodiments, the conjugates described herein include one or more monomers of gp120 binders conjugated to an Fc domain or an albumin protein. In some embodiments, the conjugates described herein include one or more dimers of gp120 binders conjugated to an Fc domain monomer, an Fc domain, or an albumin protein. In some embodiments, when n is 2, E (an Fc domain monomer) dimerizes to form an Fc domain. Conjugates described herein may be synthesized using available chemical synthesis techniques in the art. In cases where a functional group is not available for conjugation, a molecule may be derivatized using conventional chemical synthesis techniques that are well known in the art. In some embodiments, the conjugates described herein contain one or more chiral centers. The conjugates include each of the isolated stereoisomeric forms as well as mixtures of stereoisomers in varying degrees of chiral purity, including racemic mixtures. It also encompasses the various diastereomers, enantiomers, and tautomers that can be formed.
Gp120 binders A component of the conjugates described herein is an HIV gp120 binder moiety. The gp120 binder disrupts gp120, an envelope glycoprotein that binds with the CD4 receptor, particularly on helper T-Cells. Binding to CD4 initiates a cascade of conformational changes in gp120 and gp41 that lead to the fusion of the viral membrane with the host cell membrane, allowing the spread of the virus. Examples of gp120 binders include temsavir, BMS-818251, DMJ-II-121, and BNM-IV-147. In addition, derivatives of temsavir, BMS-818251, DMJ-II-121, and BNM-IV-147, such as those found in the literature, have gp120 binder activity and are useful as gp120 inhibitor moieties of the compounds herein (see, for example, Lu et al. Curr. Top. Med. Chem.16(10): 1074-1090). Conjugates described herein are separated into two types: (1) one or more dimers of gp120 binders conjugated to an Fc domain monomer, an Fc domain, or an albumin protein and (2) one or more monomers of gp120 binders conjugated to an Fc domain monomer, an Fc domain, or an albumin protein. The dimers of gp120 binders are linked to each other by way of a linker, such as the linkers described herein. Viral gp120 binders of the invention include temsavir, BMS-818251, DMJ-II-121, BNM-IV-147, and analogs thereof, such as the viral gp120 binders of formula (A-I) and (A-II):
wherein Q is selected from the group consisting of:
,
S is selected from the group consisting of:
R1, R2, R3, are each independently selected from H, OH, halogen, nitrile, nitro, optionally substituted amine, optionally substituted sulfhydryl, optionally substituted carboxyl, optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C3-C20 cycloalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C5-C20 aryl, optionally substituted C3-C15 heteroaryl, and optionally substituted C1-C20 alkoxy; R4 is selected from optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, optionally substituted C3- C15 heteroaryl, and a bond; R5 is selected from H or optionally substituted C1-C6 alkyl; R6 is selected from optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C3-C15 heteroaryl; R7 and Y are each independently selected from
each R8 is independently selected from H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 alkylene, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C2-C15 heteroaryl; each R9 is independently selected from optionally substituted C1-C20 alkylene, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C2-C15 heteroaryl; x is 1 or 2; k is 0, 1, 2, 3, 4, or 5; Ar is selected from the group consisting of optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C3-C15 heteroaryl. In a preferred embodiment of the above, x is 2. Preferably the gp120 inhibitor is selected from temsavir, BMS-818251, DMJ-II-121, or BNM-IV- 147:
Conjugates of dimers of gp120 binders linked to an Fc domain or an albumin protein The conjugates described herein include an Fc domain monomer, an Fc domain, an Fc-binding peptide, and albumin protein, or an albumin protein-binding peptide covalently linked to one or more dimers of gp120 binders. The dimers of two gp120 binders include a first gp120 binder (e.g., a first viral gp120 binder of formula (A-I) or (A-II)) and a second gp120 binder (e.g., a second viral gp120 binder of formula (A-I) or (A-II)). The first and second gp120 binders are linked to each other by way of a linker, such as a linker described herein. In some embodiments of the dimers of gp120 binders, the first and second gp120 binders are the same. In some embodiments, the first and second gp120 binders are different. In some embodiments, when T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1-L-A2 may be independently selected (e.g., independently selected from any of the A1-L-A2 structures described herein). In some embodiments, E may be conjugated to 2, 3, 4, 5, 6, 7, 8, 9, 10, or more different A1-L-A2 moieties. In some embodiments, E is conjugated to a first A1-L-A2 moiety, and a second A1-L-A2, moiety. In some embodiments, each of A1 and A2 of the first A1-L-A2 moiety and of the second A1-L-A2 moiety are independently selected from any structure described by formula (A-I) and (A-II):
In a preferred embodiment of the above, x is 2. In some embodiments, the first A1-L-A2 moiety is conjugated specifically to lysine residues of E (e.g., the nitrogen atoms of surface exposed lysine residues of E), and the second A1-L-A2 moiety is conjugated specifically to cysteine residues of E (e.g., the sulfur atoms of surface exposed cysteine residues of E). In some embodiments, the first A1-L-A2 moiety is conjugated specifically to cysteine residues of E (e.g., the sulfur atoms of surface exposed cysteine residues of E), and the second A1-L-A2 moiety is conjugated specifically to lysine residues of E (e.g., the nitrogen atoms of surface exposed lysine residues of E). In some embodiments, the disclosure provides a conjugate, or a pharmaceutically acceptable salt thereof, described by any one any one of formulas (D-I), (D-II), (D-III), (D-III-1), (D-III-2), (D-III-3), (D-III- 4), (D-III-5), (D-III-6), (D-IV), (D-IV-1), (D-IV-2), (D-IV-3), (D-IV-4), (D-IV-5), (D-IV-6), (D-IV-7), (D-IV-8), (D-IV-9), (D-V), (D-V-1), (D-V-2), (D-V-3), (D-V-4), (D-V-5), (D-V-6), (D-V-7), (D-V-8), (D-V-9), (D-VI), (D- VI-1), (D-VI-2), (D-VI-3), (D-VI-4), (D-VI-5), (D-VI-6), (D-VI-7), (D-VI-8), (D-VII), (D-VIII), (D-VIII-1), (D-IX), (D-IX-1), (D-X), (D-X-1), (D-XI), (D-XI-1), (D-XII), (D-XII-1), (D-XII-2), (D-XIII), (D-XIII-1), (D-XIII-2), (D- XIV), (D-XIV-1), (D-XIV-2), (D-XIV-3), (D-XIV-4), (D-XIV-5), (D-XV), (D-XV-1), (D-XV-2), (D-XV-3), (D-XV- 4), (D-XV-5), (D-XVI), (D-XVI-1), (D-XVI-2), (D-XVI-3), (D-XVI-4), (D-XVI-5), (D-XV), (D-XVII-1), (D-XVII- 2), (D-XVII-3), (D-XVII-4), (D-XVII-5), or a pharmaceutically acceptable salt thereof. In the conjugates described herein, the squiggly line connected to E indicates that one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20) dimers of gp120 binders may be attached to an Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-
binding peptide. In some embodiments, when n is 1, one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10) dimers of gp120 binders may be attached to an Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide. In some embodiments, when n is 2, one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20) dimers of gp120 binders may be attached to an Fc domain. The squiggly line in the conjugates described herein is not to be construed as a single bond between one or more dimers of gp120 binders and an atom in the Fc domain monomer, Fc domain, or albumin protein. In some embodiments, when T is 1, one dimer of gp120 binders may be attached to an atom in the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide. In some embodiments, when T is 2, two dimers of gp120 binders may be attached to an atom in the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide. As described further herein, a linker in a conjugate described herein (e.g., L or L’) may be a branched structure. As described further herein, a linker in a conjugate described herein (e.g., L or L’) may be a multivalent structure, e.g., a divalent or trivalent structure having two or three arms, respectively. In some embodiments when the linker has three arms, two of the arms may be attached to the first and second gp120 binders and the third arm may be attached to the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide. In conjugates having an Fc domain covalently linked to one or more dimers of gp120 binders, as represented by the formulae above, when n is 2, two Fc domain monomers (each Fc domain monomer is represented by E) dimerize to form an Fc domain. Conjugates of monomers of gp120 binders linked to an Fc domain monomer, an Fc domain, or an albumin protein In some embodiments, the conjugates described herein include an Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide covalently linked to one or more monomers of gp120 binders. Conjugates of an Fc domain monomer or albumin protein and one or more monomers of gp120 binders may be formed by linking the Fc domain monomer, Fc domain, or albumin protein to each of the monomers of gp120 binders through a linker, such as any of the linkers described herein. In the conjugates having an Fc domain or albumin protein covalently linked to one or more monomers of gp120 binders described herein, the squiggly line connected to E indicates that one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20) monomers of gp120 binders may be attached to an Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide. In some embodiments, when n is 1, one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10) monomers of gp120 binders may be attached to an Fc domain monomer, Fc domain, or an albumin protein. In some embodiments, when n is 2, one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20) monomers of gp120 binders may be attached to an Fc domain. The squiggly line in the conjugates described herein is not to be construed as a single bond between one or more monomers of gp120 binders and an atom in the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide. In some embodiments, when T is 1, one monomer of
gp120 binder may be attached to an atom in the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide. In some embodiments, when T is 2, two monomers of gp120 binders may be attached to an atom in the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide. In some embodiments, when T is greater than 1 (e.g., T is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20), each A1-L may be independently selected (e.g., independently selected from any of the A1-L structures described herein). In some embodiments, E may be conjugated to 2, 3, 4, 5, 6, 7, 8, 9, 10, or more different A1-L moieties. In some embodiments, E is conjugated to a first A1-L moiety, and a second A1-L, moiety. In some embodiments, A1 of each of the first A1-L moiety and of the second A1-L moiety is independently selected from any structure described by formula (A-I) or (A-II):
In a preferred embodiment, x is 2. In some embodiments, the first A1-L moiety is conjugated specifically to lysine residues of E (e.g., the nitrogen atoms of surface exposed lysine residues of E), and the second A1-L moiety is conjugated specifically to cysteine residues of E (e.g., the sulfur atoms of surface exposed cysteine residues of E). In some embodiments, the first A1-L moiety is conjugated specifically to cysteine residues of E (e.g., the sulfur atoms of surface exposed cysteine residues of E), and the second A1-L moiety is conjugated specifically to lysine residues of E (e.g., the nitrogen atoms of surface exposed lysine residues of E). As described further herein, a linker in a conjugate having an Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide covalently linked to one or more monomers of the gp120 binders described herein (e.g., L or L’) may be a divalent structure having two arms. One arm in a divalent linker may be attached to the monomer of the gp120 binder and the other arm may be attached to the Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide. In some embodiments, a conjugate containing an Fc domain monomer, Fc domain, Fc-binding peptide, albumin protein, or albumin protein-binding peptide covalently linked to one or more monomers of gp120 binders provided herein is described by any one of formulas (M-I), (M-II), (M-III), (M-III-1), (M-III- 2), (M-III-3), (M-III-4), (M-III-5), (M-III-6), (M-IV), (M-IV-1), (M-IV-2), (M-IV-3), (M-IV-4), (M-IV-5), (M-IV-6), (M-IV-7), (M-IV-8), (M-IV-9), (M-V), (M-V-1), (M-V-2), (M-V-3), (M-V-4), (M-V-5), (M-V-6), (M-V-7), (M-V- 8), (M-V-9), (M-VI), (M-VI-1), (M-VI-2), (M-VI-3), (M-VI-4), (M-VI-5), (M-VI-6), (M-VI-7), (M-VI-8), (M-VII), (M-VIII), (M-VIII-1), (M-IX), (M-IX-1), (M-X), (M-X-1), (M-XI), or (M-XI-1), (M-XII), (M-XII-1), (M-XII-2), (M- XIII), (M-XIII-1), (M-XIII-2), (M-XIV), (M-XIV-1), (M-XIV-2), (M-XIV-3), (M-XIV-4), (M-XIV-5), (M-XV), (M- XV-1), (M-XV-2), (M-XV-3), (M-XV-4), (M-XV-5), (M-XVI), (M-XVI-1), (M-XVI-2), (M-XVI-3), (M-XVI-4), (M- XVI-5), (M-XV), (M-XVII-1), (M-XVII-2), (M-XVII-3), (M-XVII-4), (M-XVII-5), or a pharmaceutically acceptable salt thereof.
In conjugates having an Fc domain covalently linked to one or more monomers of gp120 binders, as represented by the formulae above, when n is 2, two Fc domain monomers (each Fc domain monomer is represented by E) dimerize to form an Fc domain. III. Fc domain monomers and Fc domains An Fc domain monomer includes a hinge domain, a CH2 antibody constant domain, and a CH3 antibody constant domain. The Fc domain monomer can be of immunoglobulin antibody isotype IgG, IgE, IgM, IgA, or IgD. The Fc domain monomer can also be of any immunoglobulin antibody isotype (e.g., IgG1, IgG2a, IgG2b, IgG3, or IgG4). The Fc domain monomer can be of any immunoglobulin antibody allotype (e.g., IGHG1*01 (i.e., G1m(za)), IGHG1*07 (i.e., G1m(zax)), IGHG1*04 (i.e., G1m(zav)), IGHG1*03 (G1m(f)), IGHG1*08 (i.e., G1m(fa)), IGHG2*01, IGHG2*06, IGHG2*02, IGHG3*01, IGHG3*05, IGHG3*10, IGHG3*04, IGHG3*09, IGHG3*11, IGHG3*12, IGHG3*06, IGHG3*07, IGHG3*08, IGHG3*13, IGHG3*03, IGHG3*14, IGHG3*15, IGHG3*16, IGHG3*17, IGHG3*18, IGHG3*19, IGHG2*04, IGHG4*01, IGHG4*03, or IGHG4*02) (as described in, for example, in Vidarsson et al. IgG subclasses and allotypes: from structure to effector function. Frontiers in Immunology.5(520):1-17 (2014)). The Fc domain monomer can also be of any species, e.g., human, murine, or mouse. A dimer of Fc domain monomers is an Fc domain that can bind to an Fc receptor, which is a receptor located on the surface of leukocytes. In some embodiments, an Fc domain monomer in the conjugates described herein may contain one or more amino acid substitutions, additions, and/or deletion relative to an Fc domain monomer having a sequence of any one of SEQ ID NOs: 1-95 and 125-153. In some embodiments, an Asn in an Fc domain monomer in the conjugates as described herein may be replaced by Ala in order to prevent N- linked glycosylation (see, e.g., SEQ ID NOs: 12-15, where Asn to Ala substitution is labeled with *). In some embodiments, an Fc domain monomer in the conjugates described herein may also containing additional Cys additions (see, e.g., SEQ ID NOs: 9, 10, and 11, where Cys additions are labeled with *). In some embodiments, an Fc domain monomer in the conjugates as described herein includes an additional moiety, e.g., an albumin-binding peptide, a purification peptide (e.g., a hexa-histidine peptide (HHHHHH (SEQ ID NO: 99)), or a signal sequence (e.g., IL2 signal sequence MYRMQLLSCIALSLALVTNS (SEQ ID NO: 100)) attached to the N- or C-terminus of the Fc domain monomer. In some embodiments, an Fc domain monomer in the conjugate does not contain any type of antibody variable region, e.g., VH, VL, a complementarity determining region (CDR), or a hypervariable region (HVR). In some embodiments, an Fc domain monomer in the conjugates as described herein may have a sequence that is at least 95% identical (e.g., 97%, 99%, or 99.5% identical) to the sequence of any one of SEQ ID NOs: 1-95 and 125-153 shown below. In some embodiments, an Fc domain monomer in the conjugates as described herein may have a sequence of any one of SEQ ID NOs: 1-95 and 125-153 shown below. SEQ ID NO: 1: murine Fc-IgG2a with IL2 signal sequence at the N-terminus (bold) MYRMQLLSCIALSLALVTNSPRGPTIKPCPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPIVTCVVVDVS EDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTI
SKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYF MYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGK SEQ ID NO: 2: mature murine Fc-IgG2a PRGPTIKPCPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVEVHTA QTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEEE MTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYSC SVVHEGLHNHHTTKSFSRTPGK SEQ ID NO: 3: human Fc-IgG1 with IL2 signal sequence at the N-terminus (bold) and N-terminal MVRS amino acid residues added (underlined) MYRMQLLSCIALSLALVTNSMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKT ISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 4: mature human Fc-IgG1 with N-terminal MVRS amino acid residues added (underlined) MVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVH NAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSR EEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVF SCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 5: murine Fc-IgG2a with IL2 signal sequence (bold) at the N-terminus and hexa-histidine peptide (italicized) at the C-terminus MYRMQLLSCIALSLALVTNSPRGPTIKPCPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPIVTCVVVDVS EDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTI SKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYF MYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGKHHHHHH SEQ ID NO: 6: mature murine Fc-IgG2a with hexa-histidine peptide (italicized) at the C-terminus PRGPTIKPCPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVEVHTA QTQTHREDYNSTLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEEE MTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYSC SVVHEGLHNHHTTKSFSRTPGKHHHHHH SEQ ID NO: 7: human Fc-IgG1 with IL2 signal sequence (bold) at the N-terminus, N-terminal MVRS amino acid residues added (underlined), and hexa-histidine peptide (italicized) at the C-terminus MYRMQLLSCIALSLALVTNSMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKT
ISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGKHHHHHH SEQ ID NO: 8: mature human Fc-IgG1 with hexa-histidine peptide (italicized) at the C-terminus and N- terminal MVRS amino acid residues added (underlined) MVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVH NAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSR EEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVF SCSVMHEALHNHYTQKSLSLSPGKHHHHHH SEQ ID NO: 9: human Fc-IgG1 with IL2 signal sequence (bold) at the N-terminus, N-terminal MVRS amino acid residues added (underlined), two additional cysteines in the hinge region (*), and hexa- histidine peptide (italicized) at the C-terminus MYRMQLLSCIALSLALVTNSMVRSDKTHTCPPCPPC*KC*PAPELLGGPSVFLFPPKPKDTLMISRTPEVT CVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKAL PAPIEKTISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVL DSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGKHHHHHH SEQ ID NO: 10: mature human Fc-IgG1 with N-terminal MVRS amino acid residues added (underlined), two additional cysteines in the hinge region (*), and hexa-histidine peptide (italicized) at the C-terminus MVRSDKTHTCPPCPPC*KC*PAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQV YTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRW QQGNVFSCSVMHEALHNHYTQKSLSLSPGKHHHHHH SEQ ID NO: 11: mature human Fc-IgG1 with N-terminal MVRS amino acid residues added (underlined) and two additional cysteines in the hinge region (*) MVRSDKTHTCPPCPPC*KC*PAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYV DGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQV YTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRW QQGNVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 12: murine Fc-IgG2a with IL2 signal sequence (bold) at the N-terminus, Asn to Ala substitution (*), and hexa-histidine peptide (italicized) at the C-terminus MYRMQLLSCIALSLALVTNSPRGPTIKPCPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPIVTCVVVDVS EDDPDVQISWFVNNVEVHTAQTQTHREDYA*STLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTI SKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYF MYSKLRVEKKNWVERNSYSCSVVHEGLHNHHTTKSFSRTPGKHHHHHH
SEQ ID NO: 13: mature murine Fc-IgG2a with Asn to Ala substitution (*) and hexa-histidine peptide (italicized) at the C-terminus PRGPTIKPCPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPIVTCVVVDVSEDDPDVQISWFVNNVEVHTA QTQTHREDYA*STLRVVSALPIQHQDWMSGKEFKCKVNNKDLPAPIERTISKPKGSVRAPQVYVLPPPEE EMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKNTEPVLDSDGSYFMYSKLRVEKKNWVERNSYS CSVVHEGLHNHHTTKSFSRTPGKHHHHHH SEQ ID NO: 14: human Fc-IgG1 with IL2 signal sequence (bold) at the N-terminus, N-terminal MVRS amino acid residues added (underlined), Asn to Ala substitution (*), and hexa-histidine peptide (italicized) at the C-terminus MYRMQLLSCIALSLALVTNSMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYA*STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK TISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGKHHHHHH SEQ ID NO: 15: mature human Fc-IgG1 with Asn to Ala substitution (*), N-terminal MVRS amino acid residues added (underlined), and hexa-histidine peptide (italicized) at the C-terminus MVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVH NAKTKPREEQYA*STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPS REEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNV FSCSVMHEALHNHYTQKSLSLSPGKHHHHHH SEQ ID NO: 16: human IgG1 Fc with Human Serum Albumin Signal Sequence (bold) at the N-terminus and N-terminal ISAMVRS amino acid residues added (underlined) MKWVTFISLLFLFSSAYSISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKT ISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 17: human IgG1 Fc with Human Serum Albumin Signal Sequence (bold) at the N-terminus, N-terminal ISAMVRS amino acid residues added (underlined), C-terminal G4S linker (italicized), and C- terminal c-Myc tag (underlined, italicized) MKWVTFISLLFLFSSAYSISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKT ISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQKLISEEDL
SEQ ID NO: 18: mature human IgG1 Fc with N-terminal ISAMVRS amino acid residues added (underlined), C-terminal G4S linker (italicized), and C-terminal c-Myc tag (underlined, italicized) ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVE VHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPP SREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGN VFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQKLISEEDL SEQ ID NO: 19: human IgG1 Fc with Human Serum Albumin Signal Sequence (bold), N-terminal ISAMVRS amino acid residues added (underlined), and lysine to serine modification (*) to prevent lysine conjugation at this site MKWVTFISLLFLFSSAYSISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPS*DTLMISRTPEVTCVVVD VSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK TISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 20: mature human IgG1 Fc with N-terminal ISAMVRS amino acid residues added (underlined) and lysine to serine modification (*) to prevent lysine conjugation at this site ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPS*DTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGV EVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLP PSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQG NVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 21: human IgG1 Fc with Human Serum Albumin Signal Sequence (bold) at the N-terminus, N-terminal ISAMVRS amino acid residues added (underlined), lysine to serine modification (*) to prevent lysine conjugation at this site, C-terminal G4S linker (italicized), and C-terminal C-Myc tag (underlined, italicized) MKWVTFISLLFLFSSAYSISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPS(*)DTLMISRTPEVTCVVV DVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSD GSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQKLISEEDL SEQ ID NO: 22: mature human IgG1 Fc with N-terminal ISAMVRS amino acid residues added (underlined), lysine to serine modification (*) to prevent lysine conjugation at this site, C-terminal G4S linker (italicized), and C-terminal C-Myc tag (underlined, italicized) ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPS(*)DTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDG VEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTL PPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQ GNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQKLISEEDL
SEQ ID NO: 23: human IgG1 Fc with Human Serum Albumin Signal Sequence (bold) at the N-terminus, N-terminal ISAMVRS amino acid residues added (underlined), Asn to Ala substitution (*), C-terminal G4S linker (italicized), and C-terminal C-myc tag (underlined, italicized) MKWVTFISLLFLFSSAYSISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYA(*)STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIE KTISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDG SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQKLISEEDL SEQ ID NO: 24: mature human IgG1 Fc with N-terminal ISAMVRS amino acid residues added (underlined), Asn to Ala substitution (*), C-terminal G4S linker (italicized), and C-terminal C-myc tag (underlined, italicized) ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVE VHNAKTKPREEQYA(*)STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTL PPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQ GNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQKLISEEDL SEQ ID NO: 25: human IgG1 Fc with Human Serum Albumin Signal Sequence (bold) at the N-terminus, N-terminal ISAMVRS amino acid residues added (underlined), H310A (*) and H435A (*) mutations to impede FcRn binding, C-terminal G4S (italicized), and C-terminal C-myc tag (underlined, italicized) MKWVTFISLLFLFSSAYSISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLA(*)QDWLNGKEYKCKVSNKALPAPIE KTISKA(*)KGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSD GSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNAYTQKSLSLSPGGGGGSEQKLISEEDL SEQ ID NO: 26: mature human IgG1 Fc with Human Serum Albumin Signal Sequence (bold) at the N- terminus, N-terminal ISAMVRS amino acid residues added (underlined), with H310A (*) and H435A (*) mutations to impede FcRn binding, C-terminal G4S (italicized), and C-terminal C-myc tag (underlined, italicized) ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVE VHNAKTKPREEQYNSTYRVVSVLTVLA(*)QDWLNGKEYKCKVSNKALPAPIEKTISKA(*)KGQPREPQVY TLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQ QGNVFSCSVMHEALHNAYTQKSLSLSPGGGGGSEQKLISEEDL SEQ ID NO: 27: human IgG1 Fc with Human Serum Albumin Signal Sequence (bold) at the N-terminus, N-terminal ISAMVRS amino acid residues added (underlined), C-terminal G4S linker (italicized), and C- terminal mutated (lysine to phenylalanine, bold) C-myc tag (underlined, italicized) MKWVTFISLLFLFSSAYSISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKT ISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQFLISEEDL
SEQ ID NO: 28: mature human IgG1 Fc with N-terminal ISAMVRS amino acid residues added (underlined), C-terminal G4S linker (italicized), and C-terminal mutated (lysine to phenylalanine, bold) C- myc tag (underlined, italicized) ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVE VHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPP SREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGN VFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQFLISEEDL SEQ ID NO: 29: human IgG1 Fc with Human Serum Albumin Signal Sequence (bold) at the N-terminus, N-terminal ISAMVRS amino acid residues added (underlined), Asn to Ala substitution (*), C-terminal G4S linker (italicized), and C-terminal mutated (lysine to phenylalanine, bold) C-myc tag (underlined, italicized) MKWVTFISLLFLFSSAYSISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYA(*)STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIE KTISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDG SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQFLISEEDL SEQ ID NO: 30: mature human IgG1 Fc with N-terminal MVRS amino acid residues added (underlined), Asn to Ala substitution (*), C-terminal G4S linker (italicized), and C-terminal mutated (lysine to phenylalanine, bold) C-myc tag (underlined, italicized) ISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVE VHNAKTKPREEQYA(*)STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTL PPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQ GNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQFLISEEDL SEQ ID NO: 31: human IgG1 Fc with Human Serum Albumin Signal Sequence (bold) at the N-terminus, allotype G1m(fa) (bold italics), C-terminal G4S linker (italicized), and C-terminal mutated (lysine to phenylalanine, bold) C-myc tag (underlined) MKWVTFISLLFLFSSAYSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPE VKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKG QPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLT VDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQFLISEEDL SEQ ID NO: 32: human IgG1 Fc with Human Serum Albumin Signal Sequence (bold) at the N-terminus, allotype G1m(fa) (bold italics) MKWVTFISLLFLFSSAYSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPE VKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKG QPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLT VDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK
SEQ ID NO: 33: mature human IgG1 Fc with a YTE triple mutation (bold and underlined) with N-terminal MVRS amino acid residues added (underlined) MVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNWYVDGVEVH NAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSR EEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVF SCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 34: human IgG1 Fc with Human Serum Albumin Signal Sequence (bold) at the N-terminus, contains residues EPKSS including the full hinge region on the N-terminus of mature human IgG1 Fc (underlined), Cys to Ser substitution (#), allotype G1m(fa) (bold italics) MKWVTFISLLFLFSSAYSEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKT ISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 35: human IgG1 Fc with murine IgG signal sequence (bold) at the N-terminus, with removal of EPKSSD hinge residues from the N-terminus of the mature human IgG1 Fc, allotype G1m(fa) (bold italics) MGWSCIILFLVATATGVHSKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPE VKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKG QPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLT VDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 36: mature human IgG1 Fc with a YTE triple mutation (bold and underlined), with removal of EPKSSD hinge residues from the N-terminus of the mature human IgG1 Fc, allotype G1m(fa) (bold italics) KTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKP REEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDE LTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCS VMHEALHNHYTQKSLSLSPGK SEQ ID NO: 37: mature human IgG1 Fc with an LS double mutation (bold and underlined), with removal of EPKSSD hinge residues from the N-terminus of the mature human IgG1 Fc, allotype G1m(fa) (bold italics) KTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTK PREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDE LTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCS VLHEALHSHYTQKSLSLSPGK
SEQ ID NO: 38: mature human IgG1 Fc with Human Serum Albumin Signal Sequence (bold) at the N- terminus, a YTE triple mutation (bold and underlined), allotype G1m(fa) (bold italics), C-terminal G4S linker (italicized), and C-terminal C-myc tag (underlined) MKWVTFISLLFLFSSAYSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEV KFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQ PREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV DKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQKLISEEDL SEQ ID NO: 39: mature human Fc IgG1, wherein X1 is Met or Trp, X2 is Ser or Thr, X3 is Thr or Glu, X4 is Asp or Glu, and X5 is Leu or Met, X6 is Met or Leu, and X7 is Asn or Ser DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLX1IX2RX3PEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAK TKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRX4E X5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSC SVX6HEALHX7HYTQKSLSLSPG SEQ ID NO: 40: mature human Fc IgG1 wherein X4 is Asp or Glu, and X5 is Leu or Met DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKT KPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRX4EX5 TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSV MHEALHNHYTQKSLSLSPG SEQ ID NO: 41: mature human Fc IgG1 with a YTE triple mutation (bold and underlined), and wherein X4 is Asp or Glu, and X5 is Leu or Met DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTK PREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRX4EX5T KNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSV MHEALHNHYTQKSLSLSPG SEQ ID NO: 42: mature human Fc IgG1 with a YTE triple mutation (bold and underlined), allotype G1m(fa) (bold italics) DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTK PREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTK NQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVM HEALHNHYTQKSLSLSPG SEQ ID NO: 43: mature human Fc IgG1 with a YTE triple mutation (bold and underlined), allotype G1m(f) (bold italics) DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTK PREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSREEMTK
NQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVM HEALHNHYTQKSLSLSPG SEQ ID NO: 44: mature human Fc IgG1 with a LS double mutation (bold and underlined), and wherein X4 is Asp or Glu, and X5 is Leu or Met DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKT KPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRX4EX5 TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSV LHEALHSHYTQKSLSLSPG SEQ ID NO: 45: mature human Fc IgG1 with a LS double mutation (bold and underlined), allotype G1m(fa) (bold italics) DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKT KPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELT KNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVL HEALHSHYTQKSLSLSPG SEQ ID NO: 46: mature human Fc IgG1 with a LS double mutation (bold and underlined), allotype G1m(f) (bold italics) DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKT KPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSREEMT KNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVL HEALHSHYTQKSLSLSPG SEQ ID NO: 47: mature human Fc IgG1 with mouse heavy chain MIgG Vh signal sequence (bold), deletion of Asp ([D]) Cys to Ser substitution (#), and wherein X1 is Met or Trp, X2 is Ser or Thr, X3 is Thr or Glu, X4 is Asp or Glu, and X5 is Leu or Met, X6 is Met or Leu, and X7 is Asn or Ser MGWSCIILFLVATATGVHSNVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKD TLX1IX2RX3PEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNG KEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNG QPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVX6HEALHX7HYTQKSLSLSPG SEQ ID NO: 48: mature human IgG1 Fc with mouse heavy chain MIgG Vh signal sequence (bold), Cys to Ser substitution (#), allotype G1m(fa) (bold italics) MGWSCIILFLVATATGVHSNVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKD TLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGK EYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQP ENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK
SEQ ID NO: 49: mature human IgG1 Fc with mouse heavy chain MIgG Vh signal sequence (bold), Cys to Ser substitution (#), allotype G1m(f) (bold italics) MGWSCIILFLVATATGVHSNVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKD TLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGK EYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQP ENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 50: mature human IgG1 Fc with mouse heavy chain MIgG Vh signal sequence (bold), Cys to Ser substitution (#), M428L, N434S mutations (Bold/Underlined), allotype G1m(fa) (bold italics) MGWSCIILFLVATATGVHSNVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKD TLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGK EYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQP ENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPGK SEQ ID NO: 51: mature human IgG1 Fc with mouse heavy chain MIgG Vh signal sequence (bold), Cys to Ser substitution (#), M428L, N434S mutations (Bold/Underlined), allotype G1m(f) (bold italics) MGWSCIILFLVATATGVHSNVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKD TLMISRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGK EYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQP ENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPGK SEQ ID NO: 52: mature human IgG1 Fc with mouse heavy chain MIgG Vh signal sequence (bold), Cys to Ser substitution (#), YTE triple mutation (bold and underlined), allotype G1m(fa) (bold italics) MGWSCIILFLVATATGVHSNVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKD TLYITREPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKE YKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPE NNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 53: mature human IgG1 Fc with mouse heavy chain MIgG Vh signal sequence (bold), Cys to Ser substitution (#), YTE triple mutation (bold and underlined), allotype G1m(f) (bold italics) MGWSCIILFLVATATGVHSNVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKD TLYITREPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKE YKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPE NNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 54: mature human IgG1 Fc with mouse heavy chain MIgG Vh signal sequence (bold), N- terminal ISAMVRS amino acid residues added (italicized), M428L, N434S mutations (bold/underlined), G4S linker (italicized), and C-terminal C-myc-tag (underlined), allotype G1m(f) (bold italics) MGWSCIILFLVATATGVHSISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVD VSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK
TISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS FFLYSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPGGGGGSEQKLISEEDL SEQ ID NO: 55: mature human IgG1 Fc with mouse heavy chain MIgG Vh signal sequence (bold), N- terminal ISAMVRS amino acid residues added (italicized), M428L, N434S mutations (bold/underlined), G4S linker (italicized), C-terminal C-myc-tag (underlined), allotype G1m(fa) (bold italics) MGWSCIILFLVATATGVHSISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVD VSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK TISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPGGGGGSEQKLISEEDL SEQ ID NO: 56: mature human IgG1 Fc with mouse heavy chain MIgG Vh signal sequence (bold), N- terminal ISAMVRS amino acid residues added (italicized), YTE triple mutant (bold/underlined), G4S linker (italicized), and C-terminal C-myc-tag (underlined), allotype G1m(f) (bold italics) MGWSCIILFLVATATGVHSISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVD VSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK TISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQKLISEEDL SEQ ID NO: 57: mature human IgG1 Fc with mouse heavy chain MIgG Vh signal sequence (bold), N- terminal ISAMVRS amino acid residues added (italicized), YTE triple mutant (bold/underlined), G4S linker (italicized), C-terminal C-myc-tag (underlined), allotype G1m(fa) (bold italics) MGWSCIILFLVATATGVHSISAMVRSDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVD VSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK TISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSEQKLISEEDL SEQ ID NO: 58: mature human IgG1 with mouse heavy chain MIgG1 signal sequence (bold), Cys to Ser substitution (#), C-terminal G4S (italics), and C-terminal IgA peptide (underline), allotype G1m(fa) (bold italics) MGWSCIILFLVATATGVHSEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVD VSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK TISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGGGGGSQRNPRLRLIRRHPTLRIPPI SEQ ID NO: 59: mature human IgG1 with mouse heavy chain MIgG1 signal sequence (bold), Cys to Ser substitution (#), M428L, N434S mutations (bold/underlined), C-terminal G4S (italics), and C-terminal IgA peptide (underline), allotype G1m(fa) (bold italics) MGWSCIILFLVATATGVHSEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVD VSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK
TISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPGGGGGSQRNPRLRLIRRHPTLRIPPI SEQ ID NO: 60: mature human Fc IgG1, Z1 is Cys or Ser, and wherein X1 is Met or Tyr, X2 is Ser or Thr, X3 is Thr or Glu, X4 is Asp or Glu, and X5 is Leu or Met, X6 is Met or Leu, and X7 is Asn or Ser NVNHKPSNTKVDKKVEPKSZ1DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLX1IX2RX3PEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS KAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF LYSKLTVDKSRWQQGNVFSCSVX6HEALHX7HYTQKSLSLSPGK SEQ ID NO: 61: mature human Fc IgG1, Cys to Ser substitution (#), and wherein X1 is Met or Tyr, X2 is Ser or Thr, X3 is Thr or Glu, X4 is Asp or Glu, and X5 is Leu or Met, X6 is Met or Leu, and X7 is Asn or Ser NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLX1IX2RX3PEVTCVVVDVS HEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTI SKAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVX6HEALHX7HYTQKSLSLSPGK SEQ ID NO: 62: mature human IgG1 Fc, Cys to Ser substitution (#), X4 is Asp or Glu, and X5 is Leu or Met NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS KAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 63: mature human IgG1 Fc, Cys to Ser substitution (#), allotype G1m(f) (bold italics) NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS KAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 64: mature human IgG1 Fc, Cys to Ser substitution (#), allotype G1m(fa) (bold italics) NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS KAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 65: mature human IgG1 Fc, Cys to Ser substitution (#), M428L, N434S mutations (Bold/Underlined), allotype G1m(fa) (bold italics) NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS
KAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL YSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPGK SEQ ID NO: 66: mature human IgG1 Fc, Cys to Ser substitution (#), M428L, N434S mutations (Bold/Underlined), allotype G1m(f) (bold italics) NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS KAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL YSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPGK SEQ ID NO: 67: mature human IgG1 Fc, Cys to Ser substitution (#), YTE triple mutation (bold and underlined), allotype G1m(fa) (bold italics) NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS KAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 68: mature human IgG1 Fc, Cys to Ser substitution (#), YTE triple mutation (bold and underlined), allotype G1m(f) (bold italics) NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS KAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 69: mature human Fc IgG1, Z1 is Cys or Ser, and wherein X1 is Met or Tyr, X2 is Ser or Thr, X3 is Thr or Glu, X4 is Asp or Glu, and X5 is Leu or Met, X6 is Met or Leu, and X7 is Asn or Ser NVNHKPSNTKVDKKVEPKSZ1DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLX1IX2RX3PEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS KAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF LYSKLTVDKSRWQQGNVFSCSVX6HEALHX7HYTQKSLSLSPG SEQ ID NO: 70: mature human Fc IgG1, Cys to Ser substitution (#), and wherein X1 is Met or Tyr, X2 is Ser or Thr, X3 is Thr or Glu, X4 is Asp or Glu, and X5 is Leu or Met, X6 is Met or Leu, and X7 is Asn or Ser NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLX1IX2RX3PEVTCVVVDVS HEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTI SKAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVX6HEALHX7HYTQKSLSLSPG
SEQ ID NO: 71: mature human IgG1 Fc, Cys to Ser substitution (#), X4 is Asp or Glu, and X5 is Leu or Met NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS KAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG SEQ ID NO: 72: mature human IgG1 Fc, Cys to Ser substitution (#), allotype G1m(f) (bold italics) NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS KAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG SEQ ID NO: 73: mature human IgG1 Fc, Cys to Ser substitution (#), allotype G1m(fa) (bold italics) NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS KAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG SEQ ID NO: 74: mature human IgG1 Fc, Cys to Ser substitution (#), M428L, N434S mutations (Bold/Underlined), allotype G1m(fa) (bold italics) NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS KAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL YSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPG SEQ ID NO: 75: mature human IgG1 Fc, Cys to Ser substitution (#), M428L, N434S mutations (Bold/Underlined), allotype G1m(f) (bold italics) NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS KAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL YSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPG SEQ ID NO: 76: mature human IgG1 Fc, Cys to Ser substitution (#), YTE triple mutation (bold and underlined), allotype G1m(fa) (bold italics) NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS KAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG
SEQ ID NO: 77: mature human IgG1 Fc, Cys to Ser substitution (#), YTE triple mutation (bold and underlined), allotype G1m(f) (bold italics) NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS KAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG SEQ ID NO: 78: mature human Fc IgG1, Z1 is Cys or Ser, and wherein X1 is Met or Tyr, X2 is Ser or Thr, X3 is Thr or Glu, X4 is Asp or Glu, and X5 is Leu or Met, X6 is Met or Leu, and X7 is Asn or Ser VNHKPSNTKVDKKVEPKSZ1DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLX1IX2RX3PEVTCVVVDVSHE DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK AKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL YSKLTVDKSRWQQGNVFSCSVX6HEALHX7HYTQKSLSLSPGK SEQ ID NO: 79: mature human Fc IgG1, Cys to Ser substitution (#), and wherein X1 is Met or Tyr, X2 is Ser or Thr, X3 is Thr or Glu, X4 is Asp or Glu, and X5 is Leu or Met, X6 is Met or Leu, and X7 is Asn or Ser VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLX1IX2RX3PEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS KAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF LYSKLTVDKSRWQQGNVFSCSVX6HEALHX7HYTQKSLSLSPGK SEQ ID NO: 80: mature human IgG1 Fc, Cys to Ser substitution (#), X4 is Asp or Glu, and X5 is Leu or Met VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK AKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 81: mature human IgG1 Fc, Cys to Ser substitution (#), allotype G1m(f) (bold italics) VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK AKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 82: mature human IgG1 Fc, Cys to Ser substitution (#), allotype G1m(fa) (bold italics) VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK AKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK
SEQ ID NO: 83: mature human IgG1 Fc, Cys to Ser substitution (#), M428L, N434S mutations (Bold/Underlined), allotype G1m(fa) (bold italics) VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK AKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY SKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPGK SEQ ID NO: 84: mature human IgG1 Fc, Cys to Ser substitution (#), M428L, N434S mutations (Bold/Underlined), allotype G1m(f) (bold italics) VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK AKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY SKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPGK SEQ ID NO: 85: mature human IgG1 Fc, Cys to Ser substitution (#), YTE triple mutation (bold and underlined), allotype G1m(fa) (bold italics) VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHE DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK AKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 86: mature human IgG1 Fc, Cys to Ser substitution (#), YTE triple mutation (bold and underlined), allotype G1m(f) (bold italics) VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHE DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK AKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK SEQ ID NO: 87: mature human Fc IgG1, Z1 is Cys or Ser, and wherein X1 is Met or Tyr, X2 is Ser or Thr, X3 is Thr or Glu, X4 is Asp or Glu, and X5 is Leu or Met, X6 is Met or Leu, and X7 is Asn or Ser VNHKPSNTKVDKKVEPKSZ1DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLX1IX2RX3PEVTCVVVDVSHE DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK AKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL YSKLTVDKSRWQQGNVFSCSVX6HEALHX7HYTQKSLSLSPG SEQ ID NO: 88: mature human Fc IgG1, Cys to Ser substitution (#), and wherein X1 is Met or Tyr, X2 is Ser or Thr, X3 is Thr or Glu, X4 is Asp or Glu, and X5 is Leu or Met, X6 is Met or Leu, and X7 is Asn or Ser VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLX1IX2RX3PEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS
KAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF LYSKLTVDKSRWQQGNVFSCSVX6HEALHX7HYTQKSLSLSPG SEQ ID NO: 89: mature human IgG1 Fc, Cys to Ser substitution (#), X4 is Asp or Glu, and X5 is Leu or Met VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK AKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFL YSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG SEQ ID NO: 90: mature human IgG1 Fc, Cys to Ser substitution (#), allotype G1m(f) (bold italics) VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK AKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG SEQ ID NO: 91: mature human IgG1 Fc, Cys to Ser substitution (#), allotype G1m(fa) (bold italics) VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK AKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG SEQ ID NO: 92: mature human IgG1 Fc, Cys to Ser substitution (#), M428L, N434S mutations (Bold/Underlined), allotype G1m(fa) (bold italics) VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK AKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY SKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPG SEQ ID NO: 93: mature human IgG1 Fc, Cys to Ser substitution (#), M428L, N434S mutations (Bold/Underlined), allotype G1m(f) (bold italics) VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHE DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK AKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY SKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPG SEQ ID NO: 94: mature human IgG1 Fc, Cys to Ser substitution (#), YTE triple mutation (bold and underlined), allotype G1m(fa) (bold italics) VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHE DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK
AKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG SEQ ID NO: 95: mature human IgG1 Fc, Cys to Ser substitution (#), YTE triple mutation (bold and underlined), allotype G1m(f) (bold italics) VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHE DPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISK AKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG In some embodiments, the variant Fc domain includes an amino acid substitution at position 246 (e.g., K246X where X is any amino acid that is not Lys, such as K246S, K246G, K246A, K246T, K246N, K246Q, K246R, K246H, K246E, or K246DC220S). In some embodiments, the variant Fc domain monomer includes at least the following mutations K246X, M252Y, S254T, and T256E, where X is not Lys. In some embodiments, the variant Fc domain monomer includes at least the following mutations K246X, V309D, Q311H, and N434S, where X is not Lys. In some embodiments, the variant Fc domain monomer includes at least the following mutations K246X, M428L, and N434S, where X is not Lys. In some embodiments, the variant Fc domain further includes a mutation of position 220, e.g., a C220S mutation. Amino acid substitutions are relative to a wild-type Fc monomer amino acid sequence, e.g., wild-type human IgG1 or IgG2. In some embodiments, a variant Fc domain monomer includes a sequence that is at least 95% identical (e.g., 97%, 99%, or 99.5% identical) to the sequence of any one of SEQ ID NOs: 125-153 shown below. In some embodiments, a variant Fc domain monomer includes the sequence of any one of SEQ ID NOs: 125-153 shown below. In some embodiments, a variant Fc domain monomer includes at least the following mutations K246X, M252Y, S254T, and T256E, where X is not Lys. In some embodiments, a variant Fc domain monomer includes at least the following mutations K246X, V309D, Q311H, and N434S, where X is not Lys. In some embodiments, a variant Fc domain monomer includes at least the following mutations K246X, M428L, and N434S, where X is not Lys. In some embodiments, the substitution at K246X is selected from Ser, Gly, Ala, Thr, Asn, Gln, Arg, His, Glu, or Asp. In some embodiments, the substitution at K246X is Ser. SEQ ID NO: 125: mature human IgG1 Fc; X1 (position 201) is Asn or absent; X2 (position 220) is Cys or Ser; X3 (position 246) is Ser, Gly, Ala, Thr, Asn, Gln, Arg, His, Glu, or Asp; X4 (position 252) is Met or Tyr; X5 (position 254) is Ser or Thr; X6 (position 256) is Thr or Glu; X7 (position 297) is Asn or Ala; X8 (position 309) is Leu or Asp; X9 (position 311) is Gln or His; X10 (position 356) is Asp or Glu; and X11 (position 358) is Leu or Met; X12 (position 428) is Met or Leu; X13 (position 434) is Asn or Ser; X14 (position 447) is Lys or absent; N-terminal Fab residues are underlined; hinge residues are italicized
X1VNHKPSNTKVDKKVEPKSX2DKTHTCPPCPAPELLGGPSVFLFPPX3PKDTLX4IX5RX6PEVTCVVVDVS HEDPEVKFNWYVDGVEVHNAKTKPREEQYX7STYRVVSVLTVX8HX9DWLNGKEYKCKVSNKALPAPIEK TISKAKGQPREPQVYTLPPSRX10EX11TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSD GSFFLYSKLTVDKSRWQQGNVFSCSVX12HEALH X13HYTQKSLSLSPGX14 SEQ ID NO: 126: mature human IgG1 Fc; Cys to Ser substitution (#); X1 is Asn or absent; X2 is Ser, Gly, Ala, Thr, Asn, Gln, Arg, His, Glu, or Asp; X3 is Asn or Ala; X4 is Asp or Glu; and X5 is Leu or Met; X6 is Lys or absent; N-terminal Fab residues are underlined; hinge residues are italicized X1VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPX2PKDTLMISRTPEVTCVVVDVS HEDPEVKFNWYVDGVEVHNAKTKPREEQYX3STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTI SKAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGX6 SEQ ID NO: 127: mature human IgG1 Fc; Cys to Ser substitution (#); X1 is Ser, Gly, Ala, Thr, Asn, Gln, Arg, His, Glu, or Asp; X2 is Asn or Ala; X3 is Asp or Glu; and X4 is Leu or Met; N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPX1PKDTLMISRTPEVTCVVVDVS HEDPEVKFNWYVDGVEVHNAKTKPREEQYX2STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTI SKAKGQPREPQVYTLPPSRX3EX4TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG SEQ ID NO: 128: mature human IgG1 Fc; Cys to Ser substitution (#); Lys to Ser substitution(*); X1 is Asp or Glu; and X2 is Leu or Met; N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPS(*)PKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKT ISKAKGQPREPQVYTLPPSRX1EX2TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS FFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG SEQ ID NO: 129: mature human IgG1 Fc; Cys to Ser substitution (#); Lys to Ser substitution(*); allotype G1m(fa) (bold italics); N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPS(*)PKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKT ISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG SEQ ID NO: 130: mature human IgG1 Fc; Cys to Ser substitution (#); Lys to Ser substitution(*); allotype G1m(f) (bold italics); N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPS(*)PKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKT
ISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG SEQ ID NO: 131: mature human IgG1 Fc; Cys to Ser substitution (#); Asn to Ala substitution (^); X1 is Ser, Gly, Ala, Thr, Asn, Gln, Arg, His, Glu, or Asp; X2 is Asp or Glu; and X3 is Leu or Met; N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPX1PKDTLMISRTPEVTCVVVDVS HEDPEVKFNWYVDGVEVHNAKTKPREEQYA(^)STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK TISKAKGQPREPQVYTLPPSRX2EX3TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDG SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG SEQ ID NO: 132: mature human IgG1 Fc; Cys to Ser substitution (#); Lys to Ser substitution(*); Asn to Ala substitution (^); X1 is Asp or Glu; and X2 is Leu or Met; N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPS(*)PKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYA(^)STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIE KTISKAKGQPREPQVYTLPPSRX1EX2TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSD GSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG SEQ ID NO: 133: mature human IgG1 Fc; Cys to Ser substitution (#); YTE triple mutation (bold and underlined); X1 is Asn or absent; X2 is Ser, Gly, Ala, Thr, Asn, Gln, Arg, His, Glu, or Asp; X3 is Asn or Ala; X4 is Asp or Glu; and X5 is Leu or Met; X6 is Lys or absent; N-terminal Fab residues are underlined; hinge residues are italicized X1VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPX2PKDTLYITREPEVTCVVVDVS HEDPEVKFNWYVDGVEVHNAKTKPREEQYX3STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTI SKAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGX6 SEQ ID NO: 134: mature human IgG1 Fc; Cys to Ser substitution (#); YTE triple mutation (bold and underlined); X1 is Ser, Gly, Ala, Thr, Asn, Gln, Arg, His, Glu, or Asp; X2 is Asn or Ala; X3 is Asp or Glu; and X4 is Leu or Met; N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPX1PKDTLYITREPEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYX2STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS KAKGQPREPQVYTLPPSRX3EX4TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG
SEQ ID NO: 135: mature human IgG1 Fc; Cys to Ser substitution (#); Lys to Ser substitution(*); YTE triple mutation (bold and underlined); X1 is Asp or Glu; and X2 is Leu or Met; N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPS(*)PKDTLYITREPEVTCVVVDVS HEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTI SKAKGQPREPQVYTLPPSRX1EX2TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG SEQ ID NO: 136: mature human IgG1 Fc; Cys to Ser substitution (#); Lys to Ser substitution(*); YTE triple mutation (bold and underlined); allotype G1m(fa) (bold italics); N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPS(*)PKDTLYITREPEVTCVVVDVS HEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTI SKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG SEQ ID NO: 137: mature human IgG1 Fc; Cys to Ser substitution (#); Lys to Ser substitution(*); YTE triple mutation (bold and underlined); allotype G1m(f) (bold italics); N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPS(*)PKDTLYITREPEVTCVVVDVS HEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTI SKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF LYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG SEQ ID NO: 138: mature human IgG1 Fc; Cys to Ser substitution (#); Asn to Ala substitution (^); YTE triple mutation (bold and underlined); X1 is Ser, Gly, Ala, Thr, Asn, Gln, Arg, His, Glu, or Asp; X2 is Asp or Glu; and X3 is Leu or Met; N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPX1PKDTLYITREPEVTCVVVDVSH EDPEVKFNWYVDGVEVHNAKTKPREEQYA(^)STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTI SKAKGQPREPQVYTLPPSRX2EX3TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG SEQ ID NO: 139: mature human IgG1 Fc; Cys to Ser substitution (#); Lys to Ser substitution(*); Asn to Ala substitution (^); YTE triple mutation (bold and underlined); X1 is Asp or Glu; and X2 is Leu or Met; N- terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPS(*)PKDTLYITREPEVTCVVVDVS HEDPEVKFNWYVDGVEVHNAKTKPREEQYA(^)STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK TISKAKGQPREPQVYTLPPSRX1EX2TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDG SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG
SEQ ID NO: 140: mature human IgG1 Fc; Cys to Ser substitution (#); DHS triple mutation (bold and underlined); X1 is Asn or absent; X2 is Ser, Gly, Ala, Thr, Asn, Gln, Arg, His, Glu, or Asp; X3 is Asn or Ala; X4 is Asp or Glu; and X5 is Leu or Met; X6 is Lys or absent; N-terminal Fab residues are underlined; hinge residues are italicized X1VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPX2PKDTLMISRTPEVTCVVVDVS HEDPEVKFNWYVDGVEVHNAKTKPREEQYX3STYRVVSVLTVDHHDWLNGKEYKCKVSNKALPAPIEKTI SKAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVMHEALHSHYTQKSLSLSPGX6 SEQ ID NO: 141: mature human IgG1 Fc; Cys to Ser substitution (#); DHS triple mutation (bold and underlined); X1 is Ser, Gly, Ala, Thr, Asn, Gln, Arg, His, Glu, or Asp; X2 is Asn or Ala; X3 is Asp or Glu; and X4 is Leu or Met; N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPX1PKDTLMISRTPEVTCVVVDVS HEDPEVKFNWYVDGVEVHNAKTKPREEQYX2STYRVVSVLTVDHHDWLNGKEYKCKVSNKALPAPIEKTI SKAKGQPREPQVYTLPPSRX3EX4TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVMHEALHSHYTQKSLSLSPG SEQ ID NO: 142: mature human IgG1 Fc; Cys to Ser substitution (#); Lys to Ser substitution(*); DHS triple mutation (bold and underlined); X1 is Asp or Glu; and X2 is Leu or Met; N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPS(*)PKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVDHHDWLNGKEYKCKVSNKALPAPIEK TISKAKGQPREPQVYTLPPSRX1EX2TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDG SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHSHYTQKSLSLSPG SEQ ID NO: 143: mature human IgG1 Fc; Cys to Ser substitution (#); Lys to Ser substitution(*); DHS triple mutation (bold and underlined); allotype G1m(fa) (bold italics); N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPS(*)PKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVDHHDWLNGKEYKCKVSNKALPAPIEK TISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVMHEALHSHYTQKSLSLSPG SEQ ID NO: 144: mature human IgG1 Fc; Cys to Ser substitution (#); Lys to Ser substitution(*); DHS triple mutation (bold and underlined); allotype G1m(f) (bold italics); N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPS(*)PKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVDHHDWLNGKEYKCKVSNKALPAPIEK TISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS FFLYSKLTVDKSRWQQGNVFSCSVMHEALHSHYTQKSLSLSPG
SEQ ID NO: 145: mature human IgG1 Fc; Cys to Ser substitution (#); Asn to Ala substitution (^); DHS triple mutation (bold and underlined); X1 is Ser, Gly, Ala, Thr, Asn, Gln, Arg, His, Glu, or Asp; X2 is Asp or Glu; and X3 is Leu or Met; N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPX1PKDTLMISRTPEVTCVVVDVS HEDPEVKFNWYVDGVEVHNAKTKPREEQYA(^)STYRVVSVLTVDHHDWLNGKEYKCKVSNKALPAPIEK TISKAKGQPREPQVYTLPPSRX2EX3TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDG SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHSHYTQKSLSLSPG SEQ ID NO: 146: mature human IgG1 Fc; Cys to Ser substitution (#); Lys to Ser substitution(*); Asn to Ala substitution (^); DHS triple mutation (bold and underlined); X1 is Asp or Glu; and X2 is Leu or Met; N- terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPS(*)PKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYA(^)STYRVVSVLTVDHHDWLNGKEYKCKVSNKALPAPIE KTISKAKGQPREPQVYTLPPSRX1EX2TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSD GSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHSHYTQKSLSLSPG SEQ ID NO: 147: mature human IgG1 Fc; Cys to Ser substitution (#); LS double mutation (bold and underlined); X1 is Asn or absent; X2 is Ser, Gly, Ala, Thr, Asn, Gln, Arg, His, Glu, or Asp; X3 is Asn or Ala; X4 is Asp or Glu; and X5 is Leu or Met; X6 is Lys or absent; N-terminal Fab residues are underlined; hinge residues are italicized X1VNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPX2PKDTLMISRTPEVTCVVVDVS HEDPEVKFNWYVDGVEVHNAKTKPREEQYX3STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTI SKAKGQPREPQVYTLPPSRX4EX5TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPGX6 SEQ ID NO: 148: mature human IgG1 Fc; Cys to Ser substitution (#); LS double mutation (bold and underlined); X1 is Ser, Gly, Ala, Thr, Asn, Gln, Arg, His, Glu, or Asp; X2 is Asn or Ala; X3 is Asp or Glu; and X4 is Leu or Met; N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPX1PKDTLMISRTPEVTCVVVDVS HEDPEVKFNWYVDGVEVHNAKTKPREEQYX2STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTI SKAKGQPREPQVYTLPPSRX3EX4TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPG SEQ ID NO: 149: mature human IgG1 Fc; Cys to Ser substitution (#); Lys to Ser substitution(*); LS double mutation (bold and underlined); X1 is Asp or Glu; and X2 is Leu or Met; N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPS(*)PKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKT
ISKAKGQPREPQVYTLPPSRX1EX2TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGS FFLYSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPG SEQ ID NO: 150: mature human IgG1 Fc; Cys to Ser substitution (#); Lys to Ser substitution(*); LS double mutation (bold and underlined); allotype G1m(fa) (bold italics); N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPS(*)PKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKT ISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFF LYSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPG SEQ ID NO: 151: mature human IgG1 Fc; Cys to Ser substitution (#); Lys to Ser substitution(*); LS double mutation (bold and underlined); allotype G1m(f) (bold italics); N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPS(*)PKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKT ISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF FLYSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPG SEQ ID NO: 152: mature human IgG1 Fc; Cys to Ser substitution (#); Asn to Ala substitution (^); LS double mutation (bold and underlined); X1 is Ser, Gly, Ala, Thr, Asn, Gln, Arg, His, Glu, or Asp; X2 is Asp or Glu; and X3 is Leu or Met; N-terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPX1PKDTLMISRTPEVTCVVVDVS HEDPEVKFNWYVDGVEVHNAKTKPREEQYA(^)STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEK TISKAKGQPREPQVYTLPPSRX2EX3TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDG SFFLYSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPG SEQ ID NO: 153: mature human IgG1 Fc; Cys to Ser substitution (#); Lys to Ser substitution(*); Asn to Ala substitution (^); LS double mutation (bold and underlined); X1 is Asp or Glu; and X2 is Leu or Met; N- terminal Fab residues are underlined; hinge residues are italicized NVNHKPSNTKVDKKVEPKSS(#)DKTHTCPPCPAPELLGGPSVFLFPPS(*)PKDTLMISRTPEVTCVVVDV SHEDPEVKFNWYVDGVEVHNAKTKPREEQYA(^)STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIE KTISKAKGQPREPQVYTLPPSRX1EX2TKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSD GSFFLYSKLTVDKSRWQQGNVFSCSVLHEALHSHYTQKSLSLSPG As defined herein, an Fc domain includes two Fc domain monomers that are dimerized by the interaction between the CH3 antibody constant domains, as well as one or more disulfide bonds that form between the hinge domains of the two dimerizing Fc domain monomers. An Fc domain forms the minimum structure that binds to an Fc receptor, e.g., Fc-gamma receptors (i.e., Fcγ receptors (FcγR)), Fc-alpha receptors (i.e., Fcα receptors (FcαR)), Fc-epsilon receptors (i.e., Fcε receptors (FcεR)), and/or
the neonatal Fc receptor (FcRn). In some embodiments, an Fc domain of the present invention binds to an Fcγ receptor (e.g., FcRn, FcγRI (CD64), FcγRIIa (CD32), FcγRIIb (CD32), FcγRIIIa (CD16a), FcγRIIIb (CD16b)), and/or FcγRIV and/or the neonatal Fc receptor (FcRn). In some embodiments, the Fc domain monomer or Fc domain of the invention is an aglycosylated Fc domain monomer or Fc domain (e.g., an Fc domain monomer or an Fc domain that maintains engagement to an Fc receptor (e.g., FcRn). For example, the Fc domain is an aglycosylated IgG1 variants that maintains engagement to an Fc receptor (e.g., an IgG1 having an amino acid substitution at N297 and/or T299 of the glycosylation motif). Exemplary aglycosylated Fc domains and methods for making aglycosylated Fc domains are known in the art, for example, as described in Sazinsky S.L. et al., Aglycosylated immunoglobulin G1 variants productively engage activating Fc receptors, PNAS, 2008, 105(51):20167-20172, which is incorporated herein in its entirety. In some embodiments, the Fc domain or Fc domain monomer of the invention is engineered to enhance binding to the neonatal Fc receptor (FcRn). For example, the Fc domain may include the triple mutation corresponding to M252Y/S254T/T256E (YTE) (e.g., an IgG1, such as a human or humanized IgG1 having a YTE mutation, for example SEQ ID NO: 33, SEQ ID NO: 36, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 41, SEQ ID NO: 42, SEQ ID NO: 43, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 56, or SEQ ID NO: 57). The Fc domain may include the double mutant corresponding to M428L/N434S (LS) (e.g., an IgG1, such as a human or humanized IgG1 having an LS mutation, such as SEQ ID NO: 37, SEQ ID NO: 39, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 54, SEQ ID NO: 55, or SEQ ID NO: 59). The Fc domain may include the single mutant corresponding to N434H (e.g., an IgG1, such as a human or humanized IgG1 having an N434H mutation). The Fc domain may include the single mutant corresponding to C220S (e.g., and IgG1, such as a human or humanized IgG1 having a C220S mutation, such as SEQ ID NO: 34, SEQ ID NO: 47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, SEQ ID NO: 65, SEQ ID NO: 66, SEQ ID NO: 67, or SEQ ID NO: 68). The Fc domain may include a combination of one or more of the above-described mutations that enhance binding to the FcRn. Enhanced binding to the FcRn may increase the half-life Fc domain-containing conjugate. For example, incorporation of one or more amino acid mutations that increase binding to the FcRn (e.g., a YTE mutation, an LS mutation, or an N434H mutation) may increase the half-life of the conjugate by 5%, 10%, 15%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%.100%, 200%, 300%, 400%, 500% or more relative to a conjugate having the corresponding Fc domain without the mutation that enhances FcRn binding. Exemplary Fc domains with enhanced binding to the FcRN and methods for making Fc domains having enhanced binding to the FcRN are known in the art, for example, as described in Maeda, A. et al., Identification of human IgG1 variant with enhanced FcRn binding and without increased binding to rheumatoid factor autoantibody, MABS, 2017, 9(5):844-853, which is incorporated herein in its entirety. As used herein, an amino acid “corresponding to” a particular amino acid residue (e.g., of a particular SEQ ID NO.) should be understood to include any amino acid residue that one of skill in the art would understand to align to the particular residue (e.g., of the particular sequence). For example, any one of SEQ ID NOs: 1-95 and 125-
153 may be mutated to include a YTE mutation, an LS mutation, and/or an N434H mutation by mutating the “corresponding residues” of the amino acid sequence. As used herein, a sulfur atom “corresponding to” a particular cysteine residue of a particular SEQ ID NO. should be understood to include the sulfur atom of any cysteine residue that one of skill in the art would understand to align to the particular cysteine of the particular sequence. The protein sequence alignment of human IgG1 (UniProtKB: P01857; SEQ ID NO: 121), human IgG2 (UniProtKB: P01859; SEQ ID NO: 122), human IgG3 (UniProtKB: P01860; SEQ ID NO: 123), and human IgG4 (UniProtKB: P01861; SEQ ID NO: 124) is provided below (aligned with Clustal Omega Multiple Pairwise Alignment). The alignment indicates cysteine residues (e.g., sulfur atoms of cysteine residues) that “correspond to” one another (in boxes and indicated by the • symbol). One of skill in the art would readily be able to perform such an alignment with any IgG variant of the invention to determine the sulfur atom of a cysteine that corresponds to any sulfur atom of a particular cysteine of a particular SEQ ID NO. described herein (e.g., any one of SEQ ID NOs: 1-95 and 125-153). For example, one of skill in the art would readily be able to determine that Cys10 of SEQ ID NO: 10 (the first cysteine of the conserved CPPC motif of the hinge region of the Fc domain) corresponds to, for example, Cys109 of IgG1, Cys106 of IgG2, Cys156 of IgG3, Cys29 of SEQ ID NO: 1, Cys9 of SEQ ID NO: 2, Cys30 of SEQ ID NO: 3, or Cys10 of SEQ ID NO: 10. In some embodiments, the Fc domain or Fc domain monomer of the invention has the sequence of any one of SEQ ID NOs: 39-95 may further include additional amino acids at the N-terminus (Xaa)x and/or additional amino acids at the C-terminus (Xaa)z, wherein Xaa is any amino acid and x and z are a whole number greater than or equal to zero, generally less than 100, preferably less than 10 and more preferably 0, 1, 2, 3, 4, or 5. In some embodiments, the additional amino acids are least 70% (e.g., 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100%) identical to one or more consecutive amino acids of SEQ ID NO: 103. For example, the additional amino acids may be a single amino acid on the C- terminus corresponding to Lys330 of IgG1 (SEQ ID NO: 121). As used herein, a nitrogen atom “corresponding to” a particular lysine residue of a particular SEQ ID NO. should be understood to include the nitrogen atom of any lysine residue that one of skill in the art would understand to align to the particular lysine of the particular sequence. The protein sequence alignment of human IgG1 (UniProtKB: P01857; SEQ ID NO: 121), human IgG2 (UniProtKB: P01859; SEQ ID NO: 122), human IgG3 (UniProtKB: P01860; SEQ ID NO: 123), and human IgG4 (UniProtKB: P01861; SEQ ID NO: 124) is provided below (aligned with Clustal Omega Multiple Pairwise Alignment). The alignment indicates lysine residues (e.g., nitrogen atoms of lysine residues) that “correspond to” one another (in boxes and indicated by the * symbol). One of skill in the art would readily be able to perform such an alignment with any IgG variant of the invention to determine the nitrogen atom of a lysine that corresponds to any nitrogen atom of a particular lysine of a particular SEQ ID NO. described herein (e.g., any one of SEQ ID NOs: 1-95 and 125-153). For example, one of skill in the art would readily be able to determine that Lys35 of SEQ ID NO: 10 corresponds to, for example, Lys129 of IgG1, Lys126 of IgG2, Lys176 of IgG3, Lys51 of SEQ ID NO: 1, Lys31 of SEQ ID NO: 2, Lys50 of SEQ ID NO: 3, or Lys30 of SEQ ID NO: 10.
Protein sequence alignment of IgG1 (SEQ ID NO: 121), IgG2 (SEQ ID NO: 122), IgG3 (SEQ ID NO: 123), and IgG4 (SEQ ID NO: 124)
In some embodiments, the Fc domain monomer includes less than about 300 amino acid residues (e.g., less than about 300, less than about 295, less than about 290, less than about 285, less than about 280, less than about 275, less than about 270, less than about 265, less than about 260, less than about 255, less than about 250, less than about 245, less than about 240, less than about 235, less than about 230, less than about 225, or less than about 220 amino acid residues). In some
embodiments, the Fc domain monomer is less than about 40 kDa (e.g., less than about 35kDa, less than about 30kDa, less than about 25kDa). In some embodiments, the Fc domain monomer includes at least 200 amino acid residues (e.g., at least 210, at least 220, at least 230, at least 240, at least 250, at least 260, at least 270, at least 280, at least 290, or at least 300 amino residues). In some embodiments, the Fc domain monomer is at least 20 kDa (e.g., at least 25 kDa, at least 30 kDa, or at least 35 kDa). In some embodiments, the Fc domain monomer includes 200 to 400 amino acid residues (e.g., 200 to 250, 250 to 300, 300 to 350, 350 to 400, 200 to 300, 250 to 350, or 300 to 400 amino acid residues). In some embodiments, the Fc domain monomer is 20 to 40 kDa (e.g., 20 to 25 kDa, 25 to 30 kDa, 35 to 40 kDa, 20 to 30 kDa, 25 to 35 kDa, or 30 to 40 KDa). In some embodiments, the Fc domain monomer includes an amino acid sequence at least 90% identical (e.g., at least 95%, at least 98%) to the sequence of any one of SEQ ID NOs: 1-95 and 125-153, or a region thereof. In some embodiments, the Fc domain monomer includes the amino acid sequence of any one of SEQ ID NOs: 1-95 and 125-153, or a region thereof. In some embodiments, the Fc domain monomer includes a region of any one of SEQ ID NOs: 1- 95 and 125-153, wherein the region includes positions 220, 252, 254, and 256. In some embodiments, the region includes at least 40 amino acid residues, at least 50 amino acid residues, at least 60 amino acid residues, at least 70 amino acids residues, at least 80 amino acids residues, at least 90 amino acid residues, at least 100 amino acid residues, at least 110 amino acid residues, at least 120 amino residues, at least 130 amino acid residues, at least 140 amino acid residues, at least 150 amino acid residues, at least 160 amino acid residues, at least 170 amino acid residues, at least 180 amino acid residues, at least 190 amino acid residues, or at least 200 amino acid residues. Activation of Immune Cells Fc-gamma receptors (FcγRs) bind the Fc portion of immunoglobulin G (IgG) and play important roles in immune activation and regulation. For example, the IgG Fc domains in immune complexes (ICs) engage FcγRs with high avidity, thus triggering signaling cascades that regulate immune cell activation. The human FcγR family contains several activating receptors (FcγRI, FcγRIIa, FcγRIIc, FcγRIIIa, and FcγRIIIb) and one inhibitory receptor (FcγRIIb). FcγR signaling is mediated by intracellular domains that contain immune tyrosine activating motifs (ITAMs) for activating FcγRs and immune tyrosine inhibitory motifs (ITIM) for inhibitory receptor FcγRIIb. In some embodiments, FcγR binding by Fc domains results in ITAM phosphorylation by Src family kinases; this activates Syk family kinases and induces downstream signaling networks, which include PI3K and Ras pathways. In the conjugates described herein, the portion of the conjugates including monomers or dimers of gp120 binders bind to and inhibits viral gp120 receptor leading to inhibition of viral replication, while the Fc domain portion of the conjugates bind to FcγRs (e.g., FcRn, FcγRI, FcγRIIa, FcγRIIc, FcγRIIIa, and FcγRIIIb) on immune cells and activate phagocytosis and effector functions, such as antibody-dependent cell-mediated cytotoxicity (ADCC), thus leading to the engulfment and destruction of viral particles by immune cells and further enhancing the antiviral activity of the conjugates. Examples of immune cells that may be activated by the conjugates described herein include, but are not limited to, macrophages,
neutrophils, eosinophils, basophils, lymphocytes, follicular dendritic cells, natural killer cells, and mast cells. Tissue distribution After a therapeutic enters the systemic circulation, it is distributed to the body’s tissues. Distribution is generally uneven because of different in blood perfusion, tissue binding, regional pH, and permeability of cell membranes. The entry rate of a drug into a tissue depends on the rate of blood flow to the tissue, tissue mass, and partition characteristics between blood and tissue. Distribution equilibrium (when the entry and exit rates are the same) between blood and tissue is reached more rapidly in richly vascularized areas, unless diffusion across cell membranes is the rate-limiting step. The size, shape, charge, target binding, FcRn and target binding mechanisms, route of administration, and formulation affect tissue distribution. In some instances, the conjugates described herein may be optimized to distribute to lung tissue. In some instances, the conjugates have a concentration ratio of distribution in epithelial lining fluid of at least 30% the concentration of the conjugate in plasma within 2 hours after administration. In certain embodiments, ratio of the concentration is at least 45% within 2 hours after administration. In some embodiments, the ratio of concentration is at least 55% within 2 hours after administration. In particular, the ratio of concentration is at least 60% within 2 hours after administration. As shown in Example 35 and FIG.13, by 2 hours post injection, a conjugate having an Fc domain (SEQ ID NO: 64) decorated with one or more small molecule antiviral inhibitors ELF levels are surprisingly ~60% of plasma exposure levels as measured by AUC across the rest of the time course indicating nearly immediate partitioning of the conjugate from plasma to the ELF in the lung. This demonstrates that an Fc containing conjugate rapidly distributes to lung, and maintains high concentrations in lung relative to levels in plasma. IV. Albumin proteins and albumin protein-binding peptides Albumin proteins An albumin protein of the invention may be a naturally-occurring albumin or a variant thereof, such as an engineered variant of a naturally-occurring albumin protein. Variants include polymorphisms, fragments such as domains and sub-domains, and fusion proteins. An albumin protein may include the sequence of an albumin protein obtained from any source. Preferably the source is mammalian, such as human or bovine. Most preferably, the albumin protein is human serum albumin (HSA), or a variant thereof. Human serum albumins include any albumin protein having an amino acid sequence naturally occurring in humans, and variants thereof. An albumin protein coding sequence is obtainable by methods know to those of skill in the art for isolating and sequencing cDNA corresponding to human genes. An albumin protein of the invention may include the amino acid sequence of human serum albumin (HSA), provided in SEQ ID NO: 96 or SEQ ID NO: 97, or the amino acid sequence of mouse serum albumin (MSA), provided in SEQ ID NO: 98, or a variant or fragment thereof, preferably a functional variant or fragment thereof. A fragment or variant may or may not be functional, or may retain the function of albumin to some degree. For example, a fragment or variant may retain the ability to bind to an albumin receptor, such as HSA or MSA, by at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, or
105% of the ability of the parent albumin (e.g., the parent albumin from which the fragment or variant is derived). Relative binding ability may be determined by methods known in the art, such as by surface plasmon resonance. The albumin protein may be a naturally-occurring polymorphic variant of an albumin protein, such as human serum albumin. Generally, variants or fragments of human serum albumin will have at least 5%, 10%, 15%, 20%, 30%, 40%, 50%, 60%, or 70%, and preferably 80%, 90%, 95%, 100%, or 105% or more of human serum albumin or mouse serum albumin’s ligand binding activity. The albumin protein may include the amino acid sequence of bovine serum albumin. Bovine serum albumin proteins include any albumin having an amino acid sequence naturally occurring in cows, for example, as described by Swissprot accession number P02769, and variants thereof as defined herein. Bovine serum albumin proteins also includes fragments of full-length bovine serum albumin or variants thereof, as defined herein. The albumin protein may comprise the sequence of an albumin derived from one of serum albumin from dog (e.g., Swissprot accession number P49822-1), pig (e.g., Swissprot accession number P08835-1), goat (e.g., Sigma product no. A2514 or A4164), cat (e.g., Swissprot accession number P49064-1), chicken (e.g., Swissprot accession number P19121-1), ovalbumin (e.g., chicken ovalbumin) (e.g., Swissprot accession number P01012-1), turkey ovalbumin (e.g., Swissprot accession number O73860-1), donkey (e.g., Swissprot accession number Q5XLE4-1), guinea pig (e.g., Swissprot accession number Q6WDN9-1), hamster (e.g., as described in DeMarco et al. International Journal for Parasitology 37(11): 1201-1208 (2007)), horse (e.g., Swissprot accession number P35747-1), rhesus monkey (e.g., Swissprot accession number Q28522-1), mouse (e.g., Swissprot accession number P07724-1), pigeon (e.g., as defined by Khan et al. Int. J. Biol. Macromol.30(3-4),171-8 (2002)), rabbit (e.g., Swissprot accession number P49065-1), rat (e.g., Swissprot accession number P02770-1) or sheep (e.g., Swissprot accession number P14639-1), and includes variants and fragments thereof as defined herein. Many naturally-occurring mutant forms of albumin are known to those skilled in the art. Naturally- occurring mutant forms of albumin are described in, for example, Peters, et al. All About Albumin: Biochemistry, Genetics and Medical Applications, Academic Press, Inc., San Diego, Calif., p.170-181 (1996). Albumin proteins of the invention include variants of naturally-occurring albumin proteins. A variant albumin refers to an albumin protein having at least one amino acid mutation, such as an amino acid mutation generated by an insertion, deletion, or substitution, either conservative or non-conservative, provided that such changes result in an albumin protein for which at least one basic property has not been significantly altered (e.g., has not been altered by more than 5%, 10%, 15%, 20%, 25%, 30%, 35%, or 40%). Exemplary properties which may define the activity of an albumin protein include binding activity (e.g., including binding specificity or affinity to bilirubin, or a fatty acid such as a long-chain fatty acid), osmolarity, or behavior in a certain pH-range. Typically an albumin protein variant will have at least 40%, at least 50%, at least 60%, and preferably at least 70%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% amino acid sequence identity with a naturally-occurring albumin protein, such as the albumin protein of any one of SEQ ID NOs: 96-98.
Methods for the production and purification of recombinant human albumins are well-established (Sleep et al. Biotechnology, 8(1):42-6 (1990)), and include the production of recombinant human albumin for pharmaceutical applications (Bosse et al. J Clin Pharmacol 45(1):57-67 (2005)). The three- dimensional structure of HSA has been elucidated by X-ray crystallography (Carter et al. Science. 244(4909): 1195-8(1998)); Sugio et al. Protein Eng.12(6):439-46 (1999)). The HSA polypeptide chain has 35 cysteine residues, which form 17 disulfide bonds, and one unpaired (e.g., free) cysteine at position 34 of the mature protein. Cys-34 of HSA has been used for conjugation of molecules to albumin (Leger et al. Bioorg Med Chem Lett 14(17):4395-8 (2004); Thibaudeau et al. Bioconjug Chem 16(4):1000-8 (2005)), and provides a site for site-specific conjugation. SEQ ID NO: 96 (Human serum albumin (HSA), variant 1) DAHKSEVAHRFKDLGEENFKALVLIAFAQYLQQCPFEDHVKLVNEVTEFAKTCVADESAENCDKSLHTLF GDKLCTVATLRETYGEMADCCAKQEPERNECFLQHKDDNPNLPRLVRPEVDVMCTAFHDNEETFLKKYL YEIARRHPYFYAPELLFFAKRYKAAFTECCQAADKAACLLPKLDELRDEGKASSAKQRLKCASLQKFGER AFKAWAVARLSQRFPKAEFAEVSKLVTDLTKVHTECCHGDLLECADDRADLAKYICENQDSISSKLKECC EKPLLEKSHCIAEVENDEMPADLPSLAADFVESKDVCKNYAEAKDVFLGMFLYEYARRHPDYSVVLLLRL AKTYETTLEKCCAAADPHECYAKVFDEFKPLVEEPQNLIKQNCELFEQLGEYKFQNALLVRYTKKVPQVS TPTLVEVSRNLGKVGSKCCKHPEAKRMPCAEDYLSVVLNQLCVLHEKTPVSDRVTKCCTESLVNRRPCF SALEVDETYVPKEFNAETFTFHADICTLSEKERQIKKQTALVELVKHKPKATKEQLKAVMDDFAAFVEKCC KADDKETCFAEEGKKLVAASQAALGL SEQ ID NO: 97 (Human serum albumin (HSA), variant 2) RGVFRRDAHKSEVAHRFKDLGEENFKALVLIAFAQYLQQCPFEDHVKLVNEVTEFAKTCVADESAENCD KSLHTLFGDKLCTVATLRETYGEMADCCAKQEPERNECFLQHKDDNPNLPRLVRPEVDVMCTAFHDNE ETFLKKYLYEIARRHPYFYAPELLFFAKRYKAAFTECCQAADKAACLLPKLDELRDEGKASSAKQRLKCAS LQKFGERAFKAWAVARLSQRFPKAEFAEVSKLVTDLTKVHTECCHGDLLECADDRADLAKYICENQDSIS SKLKECCEKPLLEKSHCIAEVENDEMPADLPSLAADFVESKDVCKNYAEAKDVFLGMFLYEYARRHPDYS VVLLLRLAKTYETTLEKCCAAADPHECYAKVFDEFKPLVEEPQNLIKQNCELFEQLGEYKFQNALLVRYTK KVPQVSTPTLVEVSRNLGKVGSKCCKHPEAKRMPCAEDYLSVVLNQLCVLHEKTPVSDRVTKCCTESLV NRRPCFSALEVDETYVPKEFNAETFTFHADICTLSEKERQIKKQTALVELVKHKPKATKEQLKAVMDDFAA FVEKCCKADDKETCFAEEGKKLVAASQAALGL SEQ ID NO: 98 (Mouse serum albumin (MSA)) RGVFRREAHKSEIAHRYNDLGEQHFKGLVLIAFSQYLQKCSYDEHAKLVQEVTDFAKTCVADESAANCD KSLHTLFGDKLCAIPNLRENYGELADCCTKQEPERNECFLQHKDDNPSLPPFERPEAEAMCTSFKENPTT FMGHYLHEVARRHPYFYAPELLYYAEQYNEILTQCCAEADKESCLTPKLDGVKEKALVSSVRQRMKCSS MQKFGERAFKAWAVARLSQTFPNADFAEITKLATDLTKVNKECCHGDLLECADDRAELAKYMCENQATI SSKLQTCCDKPLLKKAHCLSEVEHDTMPADLPAIAADFVEDQEVCKNYAEAKDVFLGTFLYEYSRRHPDY SVSLLLRLAKKYEATLEKCCAEANPPACYGTVLAEFQPLVEEPKNLVKTNCDLYEKLGEYGFQNAILVRYT QKAPQVSTPTLVEAARNLGRVGTKCCTLPEDQRLPCVEDYLSAILNRVCLLHEKTPVSEHVTKCCSGSLV
ERRPCFSALTVDETYVPKEFKAETFTFHSDICTLPEKEKQIKKQTALAELVKHKPKATAEQLKTVMDDFAQ FLDTCCKAADKDTCFSTEGPNLVTRCKDALA Conjugation of albumin proteins An albumin protein of the invention may be conjugated to (e.g., by way of a covalent bond) to any compound of the invention (e.g., by way of the linker portion of a gp120 binder monomer or dimer). The albumin protein may be conjugated to any compound of the invention by any method well-known to those of skill in the art for producing small-molecule-protein conjugates. This may include covalent conjugation to a solvent-exposed amino acid, such as a solvent exposed cysteine or lysine. For example, human serum albumin may be conjugated to a compound of the invention by covalent linkage to the sulfur atom corresponding to Cys34 of SEQ ID NO: 96 or Cys40 of SEQ ID NO: 97. An albumin protein of the invention may be conjugated to any compound of the invention by way of an amino acid located within 10 amino acid residues of the C-terminal or N-terminal end of the albumin protein. An albumin protein may include a C-terminal or N-terminal polypeptide fusion of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, or 20 or more amino acid. The C-terminal or N-terminal polypeptide fusion may include one or more solvent-exposed cysteine or lysine residues, which may be used for covalent conjugation of a compound of the invention (e.g., conjugation to a gp120 binder monomer or dimer, including by way of a linker). Albumin proteins of the invention include any albumin protein which has been engineered to include one or more solvent-exposed cysteine or lysine residues, which may provide a site for conjugation to a compound of the invention (e.g., conjugation to a gp120 binder monomer or dimer, including by way of a linker). Most preferably, the albumin protein will contain a single solvent-exposed cysteine or lysine, thus enabling site-specific conjugation of a compound of the invention. Exemplary methods for the production of engineered variants of albumin proteins that include one or more conjugation-competent cysteine residues are provided in U.S. Patent Application No. 2017/0081389, which is incorporated herein by reference in its entirety. Briefly, preferred albumin protein variants are those comprising a single, solvent-exposed, unpaired (e.g., free) cysteine residue, thus enabling site-specific conjugation of a linker to the cysteine residue. Albumin proteins which have been engineered to enable chemical conjugation to a solvent- exposed, unpaired cysteine residue include the following albumin protein variants: (a) an albumin protein having a substitution of a non-cysteine amino acid residue with a cysteine at an amino acid residue corresponding to any of L585, D1, A2, D562, A364, A504, E505, T79, E86, D129, D549, A581, D121, E82, S270, Q397, and A578 of SEQ ID NO: 96; (b) an albumin protein having an insertion of a cysteine at a position adjacent the N- or C-terminal side of an amino acid residue corresponding to any of L585, D1, A2, D562, A364, A504, E505, T79, E86, D129, D549, A581, D121, E82, S270, Q397, and A578 of SEQ ID NO: 96; (c) an albumin protein engineered to have an unpaired cysteine having a free thiol group at a residue corresponding to any of C369, C361, C91, C177, C567, C316, C75, C169, C124, or C558 of SEQ ID NO: 96, and which may or may not be generated by deletion or substitution of a
residue corresponding to C360, C316, C75, C168, C558, C361, C91, C124, C169, or C567 of SEQ ID NO: 96; and/or (d) addition of a cysteine to the N- or C-terminus of an albumin protein. In some embodiments of the invention, the net result of the substitution, deletion, addition, or insertion events of (a), (b), (c) and/or (d) is that the number of conjugation competent cysteine residues of the polypeptide sequence is increased relative to the parent albumin sequence. In some embodiments of the invention, the net result of the substitution, deletion, addition, or insertion events of (a), (b), (c) and/or (d) is that the number of conjugation competent-cysteine residues of the polypeptide sequence is one, thus enabling site-specific conjugation. Preferred albumin protein variants also include albumin proteins having a single solvent-exposed lysine residue, thus enabling site-specific conjugation of a linker to the lysine residue. Such variants may be generated by engineering an albumin protein, including any of the methods previously described (e.g., insertion, deletion, substitution, or C-terminal or N-terminal fusion). Albumin protein-binding peptides Conjugation of a biologically-active compound to an albumin protein-binding peptide can alter the pharmacodynamics of the biologically-active compound, including the alteration of tissue uptake, penetration, and diffusion. In a preferred embodiment, conjugation of an albumin protein-binding peptide to a compound of the invention (e.g., a gp120 binder monomer or dimer, by way of a linker) increases the efficacy or decreases the toxicity of the compound, as compared to the compound alone. Albumin protein-binding peptides of the invention include any polypeptide having an amino acid sequence of 5 to 50 (e.g., 5 to 40, 5 to 30, 5 to 20, 5 to 15, 5 to 10, 10 to 50, 10 to 30, or 10 to 20) amino acid residues that has affinity for and functions to bind an albumin protein, such as any of the albumin proteins described herein. Preferably, the albumin protein-binding peptide binds to a naturally occurring serum albumin, most preferably human serum albumin. An albumin protein-binding peptide can be of different origins, e.g., synthetic, human, mouse, or rat. Albumin protein-binding peptides of the invention include albumin protein-binding peptides which have been engineered to include one or more (e.g., two, three, four, or five) solvent-exposed cysteine or lysine residues, which may provide a site for conjugation to a compound of the invention (e.g., conjugation to a gp120 binder monomer or dimer, including by way of a linker). Most preferably, the albumin protein-binding peptide will contain a single solvent-exposed cysteine or lysine, thus enabling site-specific conjugation of a compound of the invention. Albumin protein-binding peptides may include only naturally occurring amino acid residues, or may include one or more non-naturally occurring amino acid residues. Where included, a non-naturally occurring amino acid residue (e.g., the side chain of a non-naturally occurring amino acid residue) may be used as the point of attachment for a compound of the invention (e.g., a gp120 binder monomer or dimer, including by way of a linker). Albumin protein-binding peptides of the invention may be linear or cyclic. Albumin protein- binding peptides of the invention include any albumin protein-binding peptides known to one of skill in the art, examples of which, are provided herein. Albumin protein-binding peptide, and conjugates including an albumin protein-binding peptide, preferably bind an albumin protein (e.g., human serum albumin) with an affinity characterized by a
dissociation constant, Kd, that is less than about 100 μM, preferably less than about 100 nM, and most preferably do not substantially bind other plasma proteins. Specific examples of such compounds are linear or cyclic peptides, preferably between about 10 and 20 amino acid residues in length, optionally modified at the N-terminus or C-terminus or both. Albumin protein-binding peptides include linear and cyclic peptides comprising the following general formulae, wherein Xaa is any amino acid: SEQ ID NO: 101 Xaa-Xaa-Cys-Xaa-Xaa-Xaa-Xaa-Xaa-Cys-Xaa-Xaa-Phe-Cys-Xaa-Asp-Trp-Pro-Xaa-Xaa-Xaa-Ser-Cys SEQ ID NO: 102 Val-Cys-Tyr-Xaa-Xaa-Xaa-Ile-Cys-Phe SEQ ID NO: 103 Cys-Tyr-Xaa-Pro-Gly-Xaa-Cys SEQ ID NO: 104 Asp-Xaa-Cys-Leu-Pro-Xaa-Trp-Gly-Cys-Leu-Trp SEQ ID NO: 105 Trp-Cys-Asp-Xaa-Xaa-Leu-Xaa-Ala-Xaa-Asp-Leu-Cys SEQ ID NO: 106 Asp-Leu-Val-Xaa-Leu-Gly-Leu-Glu-Cys-Trp Albumin protein-binding peptides of the invention further include any of the following peptide sequences, which may be linear or cyclic: SEQ ID NO: 107 DLCLRDWGCLW SEQ ID NO: 108 DICLPRWGCLW SEQ ID NO: 109 MEDICLPRWGCLWGD SEQ ID NO: 110 QRLMEDICLPRWGCLWEDDE SEQ ID NO: 111 QGLIGDICLPRWGCLWGRSV SEQ ID NO: 112 QGLIGDICLPRWGCLWGRSVK SEQ ID NO: 113 EDICLPRWGCLWEDD SEQ ID NO: 114 RLMEDICLPRWGCLWEDD SEQ ID NO: 115 MEDICLPRWGCLWEDD SEQ ID NO: 116 MEDICLPRWGCLWED SEQ ID NO: 117 RLMEDICLARWGCLWEDD SEQ ID NO: 118 EVRSFCTRWPAEKSCKPLRG SEQ ID NO: 119 RAPESFVCYWETICFERSEQ
SEQ ID NO: 120 EMCYFPGICWM Albumin protein-binding peptides of SEQ ID NOs: 101-120 may further include additional amino acids at the N-terminus (Xaa)x and/or additional amino acids at the C-terminus (Xaa)z, wherein Xaa is any amino acid and x and z are a whole number greater or equal to zero, generally less than 100, preferably less than 10, and more preferably 0, 1, 2, 3, 4 or 5. Further exemplary albumin protein-binding peptides are provided in U.S. Patent Application No. 2005/0287153, which is incorporated herein by reference in its entirety. Conjugation of albumin protein-binding peptides An albumin protein-binding peptide of the invention may be conjugated to (e.g., by way of a covalent bond) to any compound of the invention (e.g., by way of the linker portion of a gp120 binder monomer or dimer). The albumin protein-binding peptide may be conjugated to any compound of the invention by any method known to those of skill in the art for producing peptide-small molecule conjugates. This may include covalent conjugation to the side chain group of an amino acid residue, such as a cysteine, a lysine, or a non-natural amino acid. Alternately, covalent conjugation may occur at the C-terminus (e.g., to the C-terminal carboxylic acid, or to the side chain group of the C-terminal residue) or at the N-terminus (e.g., to the N-terminal amino group, or to the side chain group of the N- terminal amino acid). V. Linkers A linker refers to a linkage or connection between two or more components in a conjugate described herein (e.g., between two gp120 binders in a conjugate described herein, between a gp120 binder and an Fc domain monomer, an Fc domain, or an albumin protein in a conjugate described herein, and between a dimer of two gp120 binders and an Fc domain monomer, an Fc domain or an albumin protein in a conjugate described herein). Linkers in conjugates having an Fc domain monomer, an Fc domain, or an albumin protein covalently linked to dimers of gp120 binders In a conjugate containing an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide covalently linked to one or more dimers of gp120 binders as described herein, a linker in the conjugate (e.g., L or L’) may be a branched structure. As described further herein, a linker in a conjugate described herein (e.g., L or L’) may be a multivalent structure, e.g., a divalent or trivalent structure having two or three arms, respectively. In some embodiments when the linker has three arms, two of the arms may be attached to the first and second gp120 binders and the third arm may be attached to an Fc domain monomer, an Fc domain, an Fc- binding peptide, an albumin protein, or an albumin protein-binding peptide. In some embodiments when the linker has two arms, one arm may be attached to an Fc domain monomer, an Fc domain, or an albumin protein and the other arm may be attached to one of the two gp120 binders. In other embodiments, a linker with three arms may be used to attach the two gp120 binders on a conjugate
containing an Fc domain monomer, an Fc domain, or albumin protein covalently linked to one or more dimers of gp120 binders. In some embodiments, a linker in a conjugate having an Fc domain monomer, an Fc domain, or an albumin protein covalently linked to one or more dimers of gp120 binders is described by formula (D-L- I):
wherein LA is described by formula GA1-(ZA1)g1-(YA1)h1-(ZA2)i1-(YA2)j1-(ZA3)k1-(YA3)l1-(ZA4)m1-(YA4)n1-(ZA5)o1- GA2; LB is described by formula GB1-(ZB1)g2-(YB1)h2-(ZB2)i2-(YB2)j2-(ZB3)k2-(YB3)l2-(ZB4)m2-(YB4)n2-(ZB5)o2-GB2; LC is described by formula GC1-(ZC1)g3-(YC1)h3-(ZC2)i3-(YC2)j3-(ZC3)k3-(YC3)l3-(ZC4)m3-(YC4)n3-(ZC5)o3-GC2; GA1 is a bond attached to Qi in formula (D-L-I); GA2 is a bond attached to the first gp120 binder (e.g., A1); GB1 is a bond attached to Qi in formula (D-L-I); GB2 is a bond attached to the second gp120 binder (e.g., A2); GC1 is a bond attached to Qi in formula (D-L-I); GC2 is a bond attached to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide or a functional group capable of reacting with a functional group conjugated to E (e.g., maleimide and cysteine, amine and activated carboxylic acid, thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and alkyne, and alkene and tetrazine); each of ZA1, ZA2, ZA3, ZA4, ZA5, ZB1, ZB2, ZB3, ZB4, ZB5, ZC1, ZC2, ZC3, ZC4, and ZC5 is, independently, optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C3-C15 heteroarylene; each of YA1, YA2, YA3, YA4, YB1, YB2, YB3, YB4, YC1, YC2, YC3, and YC4 is, independently, O, S, NRi,
, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino; each R i is, independently, H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4- C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl; each of g1, h1, i1, j1, k1, l1, m1, n1, o1, g2, h2, i2, j2, k2, l2, m2, n2, o2, g3, h3, i3, j3, k3, l3, m3, n3, and o3 is, independently, 0 or 1; Qi is a nitrogen atom, optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20
heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C3-C15 heteroarylene. In some embodiments, optionally substituted includes substitution with a polyethylene glycol (PEG). A PEG has a repeating unit structure (-CH2CH2O-)n, wherein n is an integer from 2 to 100. A polyethylene glycol may be selected from any one of PEG2 to PEG100 (e.g., PEG2, PEG3, PEG4, PEG5, PEG5-PEG10, PEG10-PEG20, PEG20-PEG30, PEG30-PEG40, PEG50-PEG60, PEG60-PEG70, PEG70-PEG80, PEG80-PEG90, PEG90-PEG100). In some embodiments, LC may have two points of attachment to the Fc domain (e.g., two GC2). In some embodiments, L includes a polyethylene glycol (PEG) linker. A PEG linker includes a linker having the repeating unit structure (-CH2CH2O-)n, where n is an integer from 2 to 100. A polyethylene glycol linker may covalently join a gp120 binder and E (e.g., in a conjugate of any one of formulas (M-I)-(M-X)). A polyethylene glycol linker may covalently join a first gp120 binder and a second gp120 binder (e.g., in a conjugate of any one of formulas (D-I)-(D-X)). A polyethylene glycol linker may covalently join a gp120 binder dimer and E (e.g., in a conjugate of any one of formulas (D-I)-(D-X)). A polyethylene glycol linker may be selected from any one of PEG2 to PEG100 (e.g., PEG2, PEG3, PEG4, PEG5, PEG5-PEG10, PEG10-PEG20, PEG20-PEG30, PEG30-PEG40, PEG50-PEG60, PEG60-PEG70, PEG70- PEG80, PEG80-PEG90, PEG90-PEG100). In some embodiments, Lc includes a PEG linker, where LC is covalently attached to each of Qi and E. Linkers in conjugates having an Fc domain monomer, an Fc domain, or an albumin protein covalently linked to monomers of gp120 binders In a conjugate containing an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide covalently linked to one or more monomers of gp120 binders as described herein, a linker in the conjugate (e.g., L, or L’) may be a divalent structure having two arms. One arm in a divalent linker may be attached to the monomer of gp120 binder and the other arm may be attached to the Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide. In some embodiments, the one or more monomers of gp120 binders in the conjugates described herein may each be, independently, connected to an atom in the Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein- binding peptide. In some embodiments, each L is described by formula (M-L): J1-(Q1)g-(T1)h-(Q2)i-(T2)j-(Q3)k-(T3)l-(Q4)m-(T4)n-(Q5)o-J2 wherein J1 is a bond attached to A1; J2 is a bond attached to E or a functional group capable of reacting with a functional group conjugated to E (e.g., maleimide and cysteine, amine and activated carboxylic acid (e.g., carboxylic acid activated by tetrafluorophenol or trifluorophenol), thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and alkyne, and alkene and tetrazine); each of Q1, Q2, Q3, Q4, and Q5 is, independently, optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene,
optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C2-C15 heteroarylene; each of T1, T2, T3, T4 is, independently, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino; Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8- C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C2-C15 heteroaryl; and each of g, h, i, j, k, l, m, n, and o is, independently, 0, 1, or 2; or a pharmaceutically acceptable salt thereof. In some embodiments, each of g, h, i, j, k, l, m, n, and o is, independently, 0 or 1. In some embodiments, a linker is described by formula (M-L-I): J1-(Q1)g-(T1)h-(Q2)i-(T2)j-(Q3)k-(T3)l-(Q4)m-(T4)n-(Q5)o-J2 wherein J1 is a bond attached to a gp120 binder; J2 is a bond attached to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide, or a functional group capable of reacting with a functional group conjugated to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide (e.g., maleimide and cysteine, amine and activated carboxylic acid (e.g., carboxylic acid activated by tetrafluorophenol or trifluorophenol), thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and alkyne, and alkene and tetrazene); each of Q1, Q2, Q3, Q4, and Q5 is, independently, optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2- C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C2-C15 heteroarylene; each of T1, T2, T3, T4 is, independently, O, S, NRi, , P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino; each Ri is, independently, H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2- C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8- C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C2-C15 heteroaryl; and each of g, h, i, j, k, l, m, n, and o is, independently, 0 or 1. In some embodiments, optionally substituted includes substitution with a polyethylene glycol (PEG). A PEG has a repeating unit structure (-CH2CH2O-)n, wherein n is an integer from 2 to 100. A
polyethylene glycol may be selected from any one of PEG2 to PEG100 (e.g., PEG2, PEG3, PEG4, PEG5, PEG5-PEG10, PEG10-PEG20, PEG20-PEG30, PEG30-PEG40, PEG50-PEG60, PEG60-PEG70, PEG70-PEG80, PEG80-PEG90, PEG90-PEG100). In some embodiments, J2 may have two points of attachment to the Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide (e.g., two J2). Linking groups In some embodiments, a linker provides space, rigidity, and/or flexibility between the gp120 binders and the Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide in the conjugates described here or between two gp120 binders in the conjugates described herein. In some embodiments, a linker may be a bond, e.g., a covalent bond, e.g., an amide bond, a disulfide bond, a C-O bond, a C-N bond, a N-N bond, a C-S bond, or any kind of bond created from a chemical reaction, e.g., chemical conjugation. In some embodiments, a linker (L or L’ as shown in any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII) includes no more than 250 atoms (e.g., 1-2, 1-4, 1-6, 1-8, 1-10, 1-12, 1-14, 1-16, 1-18, 1-20, 1-25, 1-30, 1-35, 1-40, 1-45, 1-50, 1-55, 1-60, 1-65, 1-70, 1-75, 1-80, 1-85, 1-90, 1-95, 1-100, 1-110, 1-120, 1-130, 1-140, 1-150, 1-160, 1-170, 1- 180, 1-190, 1-200, 1-210, 1-220, 1-230, 1-240, or 1-250 atom(s); 250, 240, 230, 220, 210, 200, 190, 180, 170, 160, 150, 140, 130, 120, 110, 100, 95, 90, 85, 80, 75, 70, 65, 60, 55, 50, 45, 40, 35, 30, 28, 26, 24, 22, 20, 18, 16, 14, 12, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 atom(s)). In some embodiments, a linker (L or L) includes no more than 250 non-hydrogen atoms (e.g., 1-2, 1-4, 1-6, 1-8, 1-10, 1-12, 1-14, 1-16, 1-18, 1- 20, 1-25, 1-30, 1-35, 1-40, 1-45, 1-50, 1-55, 1-60, 1-65, 1-70, 1-75, 1-80, 1-85, 1-90, 1-95, 1-100, 1-110, 1-120, 1-130, 1-140, 1-150, 1-160, 1-170, 1-180, 1-190, 1-200, 1-210, 1-220, 1-230, 1-240, or 1-250 non- hydrogen atom(s); 250, 240, 230, 220, 210, 200, 190, 180, 170, 160, 150, 140, 130, 120, 110, 100, 95, 90, 85, 80, 75, 70, 65, 60, 55, 50, 45, 40, 35, 30, 28, 26, 24, 22, 20, 18, 16, 14, 12, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 non-hydrogen atom(s)). In some embodiments, the backbone of a linker (L or L) includes no more than 250 atoms (e.g., 1-2, 1-4, 1-6, 1-8, 1-10, 1-12, 1-14, 1-16, 1-18, 1-20, 1-25, 1-30, 1-35, 1-40, 1-45, 1- 50, 1-55, 1-60, 1-65, 1-70, 1-75, 1-80, 1-85, 1-90, 1-95, 1-100, 1-110, 1-120, 1-130, 1-140, 1-150, 1-160, 1-170, 1-180, 1-190, 1-200, 1-210, 1-220, 1-230, 1-240, or 1-250 atom(s); 250, 240, 230, 220, 210, 200, 190, 180, 170, 160, 150, 140, 130, 120, 110, 100, 95, 90, 85, 80, 75, 70, 65, 60, 55, 50, 45, 40, 35, 30, 28, 26, 24, 22, 20, 18, 16, 14, 12, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 atom(s)). The “backbone” of a linker refers to the atoms in the linker that together form the shortest path from one part of the conjugate to another part of the conjugate. The atoms in the backbone of the linker are directly involved in linking one part of the conjugate to another part of the conjugate. For examples, hydrogen atoms attached to carbons in the backbone of the linker are not considered as directly involved in linking one part of the conjugate to another part of the conjugate. Molecules that may be used to make linkers (L or L’) include at least two functional groups, e.g., two carboxylic acid groups. In some embodiments of a trivalent linker, two arms of a linker may contain two dicarboxylic acids, in which the first carboxylic acid may form a covalent linkage with the first gp120 binder in the conjugate and the second carboxylic acid may form a covalent linkage with the second gp120 binder in the conjugate, and the third arm of the linker may for a covalent linkage (e.g., a C-O
bond) with an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide in the conjugate. In some embodiments of a divalent linker, the divalent linker may contain two carboxylic acids, in which the first carboxylic acid may form a covalent linkage with one component (e.g., a gp120 binder) in the conjugate and the second carboxylic acid may form a covalent linkage (e.g., a C-S bond or a C-N bond) with another component (e.g., an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide) in the conjugate. In some embodiments, dicarboxylic acid molecules may be used as linkers (e.g., a dicarboxylic acid linker). For example, in a conjugate containing an Fc domain monomer, an Fc domain, an Fc- binding peptide, an albumin protein, or an albumin protein-binding peptide covalently linked to one or more dimers of gp120 binders, the first carboxylic acid in a dicarboxylic acid molecule may form a covalent linkage with a hydroxyl or amine group of the first gp120 binder and the second carboxylic acid may form a covalent linkage with a hydroxyl or amine group of the second gp120 binder. Examples of dicarboxylic acids molecules that may be used to linkers are found in WO 2020/252393, hereby incorporated by reference. In some embodiments, dicarboxylic acid molecules, such as the ones described herein, may be further functionalized to contain one or more additional functional groups. Dicarboxylic acids may be further functionalized, for example, to provide an attachment point to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide (e.g., by way of a linker, such as a PEG linker). In some embodiments, when the gp120 binder is attached to Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide, the linking group may comprise a moiety comprising a carboxylic acid moiety and an amino moiety that are spaced by from 1 to 25 atoms. Examples of such linking groups re found in WO 2020/252393, hereby incorporated by reference. In some embodiments, a linking group may include a moiety including a carboxylic acid moiety and an amino moiety, such as the ones described herein, may be further functionalized to contain one or more additional functional groups. Such linking groups may be further functionalized, for example, to provide an attachment point to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide (e.g., by way of a linker, such as a PEG linker). In some embodiments, when the gp120 binder is attached to Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide, the linking group may comprise a moiety comprising two or amino moieties (e.g., a diamino moiety) that are spaced by from 1 to 25 atoms. Examples of such linking groups are found in WO 2020/252393, hereby incorporated by reference. In some embodiments, a linking group may include a diamino moiety, such as the ones described herein, may be further functionalized to contain one or more additional functional groups. Such diamino linking groups may be further functionalized, for example, to provide an attachment point to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide (e.g., by way of a linker, such as a PEG linker).
In some embodiments, a molecule containing an azide group may be used to form a linker, in which the azide group may undergo cycloaddition with an alkyne to form a 1,2,3-triazole linkage. In some embodiments, a molecule containing an alkyne group may be used to form a linker, in which the alkyne group may undergo cycloaddition with an azide to form a 1,2,3-triazole linkage. In some embodiments, a molecule containing a maleimide group may be used to form a linker, in which the maleimide group may react with a cysteine to form a C-S linkage. In some embodiments, a molecule containing one or more haloalkyl groups may be used to form a linker, in which the haloalkyl group may form a covalent linkage, e.g., C-N and C-O linkages, with a gp120 binder. In some embodiments, a linker (L or L’) may comprise a synthetic group derived from, e.g., a synthetic polymer (e.g., a polyethylene glycol (PEG) polymer). In some embodiments, a linker may comprise one or more amino acid residues. In some embodiments, a linker may be an amino acid sequence (e.g., a 1-25 amino acid, 1-10 amino acid, 1-9 amino acid, 1-8 amino acid, 1-7 amino acid, 1-6 amino acid, 1-5 amino acid, 1-4 amino acid, 1-3 amino acid, 1-2 amino acid, or 1 amino acid sequence). In some embodiments, a linker (L or L’) may include one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene (e.g., a PEG unit), optionally substituted C2-C20 alkenylene (e.g., C2 alkenylene), optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene (e.g., cyclopropylene, cyclobutylene), optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene (e.g., C6 arylene), optionally substituted C3-C15 heteroarylene (e.g., imidazole, pyridine), O, S, NRi,
, (each Ri is, independently, H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl), P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino. Conjugation chemistries Gp120 binder monomers or dimers (e.g., in a conjugate of any one of formulas (1), (2), (D-I), (D- IV)-(D-VI), or (M-I)-(M-XVII)) may be conjugated to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein-binding peptide, e.g., by way of a linker, by any standard conjugation chemistries known to those of skill in the art. The following conjugation chemistries are specifically contemplated, e.g., for conjugation of a PEG linker (e.g., a functionalized PEG linker) to an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein- binding peptide. Covalent conjugation of two or more components in a conjugate using a linker may be accomplished using well-known organic chemical synthesis techniques and methods. Complementary
functional groups on two components may react with each other to form a covalent bond. Examples of complementary reactive functional groups include, but are not limited to, e.g., maleimide and cysteine, amine and activated carboxylic acid, thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and alkyne, and alkene and tetrazine. Site-specific conjugation to a polypeptide (e.g., an Fc domain monomer, an Fc domain, an Fc-binding peptide, an albumin protein, or an albumin protein- binding peptide) may accomplished using techniques known in the art. Exemplary techniques for site- specific conjugation of a small molecule to an Fc domain are provided in Agarwall. P., et al. Bioconjugate Chem.26:176-192 (2015). Other examples of functional groups capable of reacting with amino groups include, e.g., alkylating and acylating agents. Representative alkylating agents include: (i) an α-haloacetyl group, e.g., XCH2CO- (where X=Br, Cl, or I); (ii) a N-maleimide group, which may react with amino groups either through a Michael type reaction or through acylation by addition to the ring carbonyl group; (iii) an aryl halide, e.g., a nitrohaloaromatic group; (iv) an alkyl halide; (v) an aldehyde or ketone capable of Schiff’s base formation with amino groups; (vi) an epoxide, e.g., an epichlorohydrin and a bisoxirane, which may react with amino, sulfhydryl, or phenolic hydroxyl groups; (vii) a chlorine-containing of s-triazine, which is reactive towards nucleophiles such as amino, sulfhydryl, and hydroxyl groups; (viii) an aziridine, which is reactive towards nucleophiles such as amino groups by ring opening; (ix) a squaric acid diethyl ester; and (x) an α-haloalkyl ether. Examples of amino-reactive acylating groups include, e.g., (i) an isocyanate and an isothiocyanate; (ii) a sulfonyl chloride; (iii) an acid halide; (iv) an active ester, e.g., a nitrophenylester or N- hydroxysuccinimidyl ester; (v) an acid anhydride, e.g., a mixed, symmetrical, or N-carboxyanhydride; (vi) an acylazide; and (vii) an imidoester. Aldehydes and ketones may be reacted with amines to form Schiff’s bases, which may be stabilized through reductive amination. It will be appreciated that certain functional groups may be converted to other functional groups prior to reaction, for example, to confer additional reactivity or selectivity. Examples of methods useful for this purpose include conversion of amines to carboxyls using reagents such as dicarboxylic anhydrides; conversion of amines to thiols using reagents such as N-acetylhomocysteine thiolactone, S- acetylmercaptosuccinic anhydride, 2-iminothiolane, or thiol-containing succinimidyl derivatives; conversion of thiols to carboxyls using reagents such as α -haloacetates; conversion of thiols to amines using reagents such as ethylenimine or 2-bromoethylamine; conversion of carboxyls to amines using reagents such as carbodiimides followed by diamines; and conversion of alcohols to thiols using reagents such as tosyl chloride followed by transesterification with thioacetate and hydrolysis to the thiol with sodium acetate. In some embodiments, a linker of the invention (e.g., L or L’, such as LC of D-L-I), is conjugated (e.g., by any of the methods described herein) to E (e.g., an Fc domain monomer, an Fc domain, or albumin protein). In preferred embodiments of the invention, the linker is conjugated by way of: (a) a thiourea linkage (i.e., -NH(C=S)NH-) to a lysine of E; (b) a carbamate linkage (i.e., -NH(C=O)-O) to a lysine of E; (c) an amine linkage by reductive amination (i.e., -NHCH2) between a lysine and E; (d) an amide (i.e., -NH-(C=O)CH2) to a lysine of E; (e) a cysteine-maleimide conjugate between a maleimide of the linker to a cysteine of E; (f) an amine linkage by reductive amination (i.e., -NHCH2) between the linker
and a carbohydrate of E (e.g., a glycosyl group of an Fc domain monomer or an Fc domain); (g) a rebridged cysteine conjugate, wherein the linker is conjugated to two cysteines of E; (h) an oxime linkage between the linker and a carbohydrate of E (e.g., a glycosyl group of an Fc domain monomer or an Fc domain); (i) an oxime linkage between the linker and an amino acid residue of E; (j) an azido linkage between the linker and E; (k) direct acylation of a linker to E; or (l) a thioether linkage between the linker and E. In some embodiments, a linker is conjugated to E, wherein the linkage includes the structure -NH(C=NH)X-, wherein X is O, HN, or a bond. In some embodiments, a linker is conjugated to E, wherein the linkage between the remainder of the linker and E includes the structure -NH(C=O)NH-. In some embodiments, a linker is conjugated to E, wherein the linkage includes the structure -R9OR9C(=O)NH-, wherein R9 is H, optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, or optionally substituted C2-C15 heteroaryl. In some embodiments, the linker is conjugated to E, wherein the linkage between the remainder of the linker and E includes the structure –CH2OCH2C(=O)NH-. Exemplary linking strategies (e.g., methods for linking a monomer or a dimer of a gp120 binder to E, such as, by way of a linker) are further depicted in FIGS.1-4 and 14. In some embodiments, a linker (e.g., an active ester, e.g., a nitrophenylester or N- hydroxysuccinimidyl ester, or derivatives thereof (e.g., a functionalized PEG linker (e.g., azido-PEG2- PEG40-NHS ester), is conjugated to E, with a T of (e.g., DAR) of between 0.5 and 10.0, e.g., about 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3, 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8.0, 7.9, 8, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.1, 9.2, 9.3, 9.4, 9.5, 9.6, 9.7, 9.8, 9.9, or 10.0. In these instances, the E-(PEG2-PEG40)-azide can react with an Int having a terminal alkyne linker (e.g., L, or L’, such as LC of D-L-I) through click conjugation. During click conjugation, the copper-catalyzed reaction of the an azide (e.g., the Fc-(PEG2-PEG40)-azide) with the alkyne (e.g., the Int having a terminal alkyne linker (e.g., L or L’, such as LC of D-L-I) forming a 5-membered heteroatom ring. In some embodiments, the linker conjugated to E is a terminal alkyne and is conjugated to an Int having a terminal azide. Exemplary preparations of preparations of E-(PEG2-PEG40)-azide are described in Examples 2, 3, and 12. One of skill in the art would readily understand the final product from a click chemistry conjugation. Exemplary linking strategies (e.g., methods for linking a monomer or a dimer of a neuraminidase inhibitor to E, such as, by way of a linker) are further depicted in FIGS.1-4 and 14. VI. Combination therapies Antiviral Agents In some embodiments, one or more antiviral agents may be administered in combination (e.g., administered substantially simultaneously (e.g., in the same pharmaceutical composition or in separate pharmaceutical compositions) or administered separately at different times) with a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)).
In some embodiments, the antiviral agent is an antiviral agent for the treatment of HIV. For example, the antiviral agent may be a nucleoside/nucleotide reverse transcriptase inhibitor, a gp120 inhibitor, a polymerase inhibitor, or a fusion protein inhibitor. The antiviral agent may target either the virus or the host subject. The antiviral agent for the treatment of HIV used in combination with a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)- (M-XVII)) may be selected from an integrase inhibitor (e.g., dolutegravir, elvitegravir, or raltegravir), a nucleoside reverse transcriptase inhibitor (NRTI) (e.g., abacavir, lamivudine, zidovudine, emtricitabine, tenofovir, emtricitabine, didanosine, or stavudine), a non-nucleoside reverse transcriptase inhibitor (NNRTI) (e.g., efavirenz, etravirine, nevirapine, rilpivirine, or delavirdine), a protease inhibitor (e.g., atazanavir, cobicistat, darunavir, cobicistat, lopinavir, ritonavir, fosamprenavir, tipranavir, nelfinavir, indinavir, or saquinavir), an inhibitor of viral entry (e.g., enfuviritide), a CCR5 antagonist (e.g., maraviroc), or a CYP3A inhibitor (e.g., cobicistat or ritonavir), or an siRNA targeting a host or viral gene, or prodrugs thereof, or pharmaceutically acceptable salts thereof. Antiviral vaccines In some embodiments, any one of conjugates described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) is administered in combination with an antiviral vaccine (e.g., a composition that elicits an immune response in a subject directed against a virus). The antiviral vaccine may be administered substantially simultaneously (e.g., in the same pharmaceutical composition or in separate pharmaceutical compositions) as the conjugates, or may be administered prior to or following the conjugates (e.g., within a period of 1 day, 2, days, 5, days, 1 week, 2 weeks, 3 weeks, 1 month, 2 months, 6 months, or 12 months, or more). In some embodiments the viral vaccine includes an immunogen that elicits an immune response in the subject against HIV-1 or HIV-2. In some embodiments the vaccine is administered as a nasal spray. VII. Methods Methods described herein include, e.g., methods of protecting against or treating a viral infection (e.g., an HIV infection) in a subject and methods of preventing, stabilizing, or inhibiting the growth of viral particles. A method of treating a viral infection (e.g., an HIV infection) in a subject includes administering to the subject a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)- (D-VI), or (M-I)-(M-XVII)) or a pharmaceutical composition thereof. In some embodiments, the viral infection is cause by the human immunodeficiency virus (e.g., HIV-1 or HIV-2). In some embodiments, the viral infection is caused by a resistant strain of virus. A method of preventing, stabilizing, or inhibiting the growth of viral particles or preventing the replication and spread of the virus includes contacting the virus or a site susceptible to viral growth with a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) or a pharmaceutical composition thereof. Moreover, methods described herein also include methods of protecting against or treating viral infection in a subject by administering to the subject a conjugate described herein (e.g., a conjugate of
any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)). In some embodiments, the method further includes administering to the subject an antiviral agent or an antiviral vaccine. Methods described herein also include methods of protecting against or treating a viral infection in a subject by administering to said subject (1) a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) and (2) an antiviral agent or an antiviral vaccine. Methods described herein also include methods of preventing, stabilizing, or inhibiting the growth of viral particles or preventing the replication or spread of a virus, by contacting the virus or a site susceptible to viral growth with (1) a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) and (2) an antiviral agent or an antiviral vaccine. In some embodiments, the conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) is administered first, followed by administering of the antiviral agent or antiviral vaccine alone. In some embodiments, the antiviral agent or antiviral vaccine is administered first, followed by administering of the conjugate described herein alone. In some embodiments, the conjugate described herein and the antiviral agent or antiviral vaccine are administered substantially simultaneously (e.g., in the same pharmaceutical composition or in separate pharmaceutical compositions). In some embodiments, the conjugate described herein or the antiviral agent or antiviral vaccine is administered first, followed by administering of the conjugate described herein and the antiviral agent or antiviral vaccine substantially simultaneously (e.g., in the same pharmaceutical composition or in separate pharmaceutical compositions). In some embodiments, the conjugate described herein and the antiviral agent or antiviral vaccine are administered first substantially simultaneously (e.g., in the same pharmaceutical composition or in separate pharmaceutical compositions), followed by administering of the conjugate described herein or the antiviral agent or antiviral vaccine alone. In some embodiments, when a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) and an antiviral agent or antiviral vaccine are administered together (e.g., substantially simultaneously in the same or separate pharmaceutical compositions, or separately in the same treatment regimen), inhibition of viral replication of each of the conjugate and the antiviral agent or antiviral vaccine may be greater (e.g., occur at a lower concentration) than inhibition of viral replication of each of the conjugate and the antiviral agent or antiviral vaccine when each is used alone in a treatment regimen. VIII. Pharmaceutical Compositions and Preparations A conjugate described herein may be formulated in a pharmaceutical composition for use in the methods described herein. In some embodiments, a conjugate described herein may be formulated in a pharmaceutical composition alone. In some embodiments, a conjugate described herein may be formulated in combination with an antiviral agent or antiviral vaccine in a pharmaceutical composition. In some embodiments, the pharmaceutical composition includes a conjugate described herein (e.g., a conjugate described by any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) and pharmaceutically acceptable carriers and excipients. Acceptable carriers and excipients in the pharmaceutical compositions are nontoxic to recipients at the dosages and concentrations employed. Acceptable carriers and excipients may include buffers
such as phosphate, citrate, HEPES, and TAE, antioxidants such as ascorbic acid and methionine, preservatives such as hexamethonium chloride, octadecyldimethylbenzyl ammonium chloride, resorcinol, and benzalkonium chloride, proteins such as human serum albumin, gelatin, dextran, and immunoglobulins, hydrophilic polymers such as polyvinylpyrrolidone, amino acid residues such as glycine, glutamine, histidine, and lysine, and carbohydrates such as glucose, mannose, sucrose, and sorbitol. Examples of other excipients include, but are not limited to, antiadherents, binders, coatings, compression aids, disintegrants, dyes, emollients, emulsifiers, fillers (diluents), film formers or coatings, flavors, fragrances, glidants (flow enhancers), lubricants, sorbents, suspensing or dispersing agents, or sweeteners. Exemplary excipients include, but are not limited to: butylated hydroxytoluene (BHT), calcium carbonate, calcium phosphate (dibasic), calcium stearate, croscarmellose, crosslinked polyvinyl pyrrolidone, citric acid, crospovidone, cysteine, ethylcellulose, gelatin, hydroxypropyl cellulose, hydroxypropyl methylcellulose, lactose, magnesium stearate, maltitol, mannitol, methionine, methylcellulose, methyl paraben, microcrystalline cellulose, polyethylene glycol, povidone, pregelatinized starch, propyl paraben, retinyl palmitate, shellac, silicon dioxide, sodium carboxymethyl cellulose, sodium citrate, sodium starch glycolate, sorbitol, starch (corn), stearic acid, stearic acid, sucrose, talc, titanium dioxide, vitamin A, vitamin E, vitamin C, and xylitol. The conjugates herein may have ionizable groups so as to be capable of preparation as pharmaceutically acceptable salts. These salts may be acid addition salts involving inorganic or organic acids or the salts may, in the case of acidic forms of the conjugates herein be prepared from inorganic or organic bases. Frequently, the conjugates are prepared or used as pharmaceutically acceptable salts prepared as addition products of pharmaceutically acceptable acids or bases. Suitable pharmaceutically acceptable acids and bases are well-known in the art, such as hydrochloric, sulphuric, hydrobromic, acetic, lactic, citric, or tartaric acids for forming acid addition salts, and potassium hydroxide, sodium hydroxide, ammonium hydroxide, caffeine, various amines, and the like for forming basic salts. Methods for preparation of the appropriate salts are well-established in the art. Representative acid addition salts include, but are not limited to, acetate, adipate, alginate, ascorbate, aspartate, benzenesulfonate, benzoate, bisulfate, borate, butyrate, camphorate, camphorsulfonate, citrate, cyclopentanepropionate, digluconate, dodecylsulfate, ethanesulfonate, fumarate, glucoheptonate, glycerophosphate, hemisulfate, heptonate, hexanoate, hydrobromide, hydrochloride, hydroiodide, 2-hydroxy-ethanesulfonate, lactobionate, lactate, laurate, lauryl sulfate, malate, maleate, malonate, methanesulfonate, 2-naphthalenesulfonate, nicotinate, nitrate, oleate, oxalate, palmitate, pamoate, pectinate, persulfate, 3-phenylpropionate, phosphate, picrate, pivalate, propionate, stearate, succinate, sulfate, tartrate, thiocyanate, toluenesulfonate, undecanoate, and valerate salts. Representative alkali or alkaline earth metal salts include, but are not limited to, sodium, lithium, potassium, calcium, and magnesium, as well as nontoxic ammonium, quaternary ammonium, and amine cations, including, but not limited to ammonium, tetramethylammonium, tetraethylammonium, methylamine, dimethylamine, trimethylamine, triethylamine, and ethylamine. Depending on the route of administration and the dosage, a conjugate herein or a pharmaceutical composition thereof used in the methods described herein will be formulated into suitable pharmaceutical
compositions to permit facile delivery. A conjugate (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) or a pharmaceutical composition thereof may be formulated to be administered intramuscularly, intravenously (e.g., as a sterile solution and in a solvent system suitable for intravenous use), intradermally, intraarterially, intraperitoneally, intralesionally, intracranially, intraarticularly, intraprostatically, intrapleurally, intratracheally, intranasally, intravitreally, intravaginally, intrarectally, topically, intratumorally, peritoneally, subcutaneously, subconjunctival, intravesicularlly, mucosally, intrapericardially, intraumbilically, intraocularally, orally (e.g., a tablet, capsule, caplet, gelcap, or syrup), topically (e.g., as a cream, gel, lotion, or ointment), locally, by inhalation, by injection, or by infusion (e.g., continuous infusion, localized perfusion bathing target cells directly, catheter, lavage, in cremes, or lipid compositions). Depending on the route of administration, a conjugate herein or a pharmaceutical composition thereof may be in the form of, e.g., tablets, capsules, pills, powders, granulates, suspensions, emulsions, solutions, gels including hydrogels, pastes, ointments, creams, plasters, drenches, osmotic delivery devices, suppositories, enemas, injectables, implants, sprays, preparations suitable for iontophoretic delivery, or aerosols. The compositions may be formulated according to conventional pharmaceutical practice. A conjugate described herein may be formulated in a variety of ways that are known in the art. For use as treatment of human and animal subjects, a conjugate described herein can be formulated as pharmaceutical or veterinary compositions. Depending on the subject (e.g., a human) to be treated, the mode of administration, and the type of treatment desired, e.g., prophylaxis or therapy, a conjugate described herein is formulated in ways consonant with these parameters. A summary of such techniques is found in Remington: The Science and Practice of Pharmacy, 22nd Edition, Lippincott Williams & Wilkins (2012); and Encyclopedia of Pharmaceutical Technology, 4th Edition, J. Swarbrick and J. C. Boylan, Marcel Dekker, New York (2013), each of which is incorporated herein by reference. Formulations may be prepared in a manner suitable for systemic administration or topical or local administration. Systemic formulations include those designed for injection (e.g., intramuscular, intravenous or subcutaneous injection) or may be prepared for transdermal, transmucosal, or oral administration. The formulation will generally include a diluent as well as, in some cases, adjuvants, buffers, and preservatives. The conjugates can be administered also in liposomal compositions or as microemulsions. Systemic administration may also include relatively noninvasive methods such as the use of suppositories, transdermal patches, transmucosal delivery and intranasal administration. Oral administration is also suitable for conjugates herein. Suitable forms include syrups, capsules, and tablets, as is understood in the art. The pharmaceutical compositions can be administered parenterally in the form of an injectable formulation. Pharmaceutical compositions for injection can be formulated using a sterile solution or any pharmaceutically acceptable liquid as a vehicle. Formulations may be prepared as solid forms suitable for solution or suspension in liquid prior to injection or as emulsions. Pharmaceutically acceptable vehicles include, but are not limited to, sterile water, physiological saline, and cell culture media (e.g., Dulbecco’s Modified Eagle Medium (DMEM), α-Modified Eagles Medium (α-MEM), F-12 medium). Such injectable compositions may also contain amounts of nontoxic auxiliary substances such as wetting or emulsifying agents, pH buffering agents, such as sodium acetate and sorbitan monolaurate. Formulation
methods are known in the art, see e.g., Pharmaceutical Preformulation and Formulation, 2nd Edition, M. Gibson, Taylor & Francis Group, CRC Press (2009). The pharmaceutical compositions can be prepared in the form of an oral formulation. Formulations for oral use include tablets containing the active ingredient(s) in a mixture with non-toxic pharmaceutically acceptable excipients. These excipients may be, for example, inert diluents or fillers (e.g., sucrose, sorbitol, sugar, mannitol, microcrystalline cellulose, starches including potato starch, calcium carbonate, sodium chloride, lactose, calcium phosphate, calcium sulfate, or sodium phosphate); granulating and disintegrating agents (e.g., cellulose derivatives including microcrystalline cellulose, starches including potato starch, croscarmellose sodium, alginates, or alginic acid); binding agents (e.g., sucrose, glucose, sorbitol, acacia, alginic acid, sodium alginate, gelatin, starch, pregelatinized starch, microcrystalline cellulose, magnesium aluminum silicate, carboxymethylcellulose sodium, methylcellulose, hydroxypropyl methylcellulose, ethylcellulose, polyvinylpyrrolidone, or polyethylene glycol); and lubricating agents, glidants, and antiadhesives (e.g., magnesium stearate, zinc stearate, stearic acid, silicas, hydrogenated vegetable oils, or talc). Formulations for oral use may also be provided as chewable tablets, or as hard gelatin capsules wherein the active ingredient is mixed with an inert solid diluent (e.g., potato starch, lactose, microcrystalline cellulose, calcium carbonate, calcium phosphate or kaolin), or as soft gelatin capsules wherein the active ingredient is mixed with water or an oil medium, for example, peanut oil, liquid paraffin, or olive oil. Powders, granulates, and pellets may be prepared using the ingredients mentioned above under tablets and capsules in a conventional manner using, e.g., a mixer, a fluid bed apparatus, or a spray drying equipment. Other pharmaceutically acceptable excipients for oral formulations include, but are not limited to, colorants, flavoring agents, plasticizers, humectants, and buffering agents. Formulations for oral use may also be provided as chewable tablets, or as hard gelatin capsules wherein the active ingredient is mixed with an inert solid diluent (e.g., potato starch, lactose, microcrystalline cellulose, calcium carbonate, calcium phosphate or kaolin), or as soft gelatin capsules wherein the active ingredient is mixed with water or an oil medium, for example, peanut oil, liquid paraffin, or olive oil. Powders, granulates, and pellets may be prepared using the ingredients mentioned above under tablets and capsules in a conventional manner using, e.g., a mixer, a fluid bed apparatus or a spray drying equipment. Dissolution or diffusion controlled release of a conjugate described herein (e.g., a conjugate of any one of (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) or a pharmaceutical composition thereof can be achieved by appropriate coating of a tablet, capsule, pellet, or granulate formulation of the conjugate, or by incorporating the conjugate into an appropriate matrix. A controlled release coating may include one or more of the coating substances mentioned above and/or, e.g., shellac, beeswax, glycowax, castor wax, carnauba wax, stearyl alcohol, glyceryl monostearate, glyceryl distearate, glycerol palmitostearate, ethylcellulose, acrylic resins, dl-polylactic acid, cellulose acetate butyrate, polyvinyl chloride, polyvinyl acetate, vinyl pyrrolidone, polyethylene, polymethacrylate, methylmethacrylate, 2-hydroxymethacrylate, methacrylate hydrogels, 1,3 butylene glycol, ethylene glycol methacrylate, and/or polyethylene glycols. In a controlled release matrix formulation, the matrix material may also include, e.g., hydrated methylcellulose, carnauba wax and stearyl alcohol, carbopol 934, silicone, glyceryl tristearate, methyl acrylate-methyl methacrylate, polyvinyl chloride, polyethylene, and/or halogenated fluorocarbon.
The pharmaceutical composition may be formed in a unit dose form as needed. The amount of active component, e.g., a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)), included in the pharmaceutical compositions are such that a suitable dose within the designated range is provided (e.g., a dose within the range of 0.01-100 mg/kg of body weight). IX. Routes of Administration and Dosages In any of the methods described herein, conjugates herein may be administered by any appropriate route for treating or protecting against a viral infection (e.g., an HIV infection), or for preventing, stabilizing, or inhibiting the proliferation or spread of a virus (e.g., an HIV virus). Conjugates described herein may be administered to humans, domestic pets, livestock, or other animals with a pharmaceutically acceptable diluent, carrier, or excipient. In some embodiments, administering includes administration of any of the conjugates described herein (e.g., conjugates of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) or compositions intramuscularly, intravenously (e.g., as a sterile solution and in a solvent system suitable for intravenous use), intradermally, intraarterially, intraperitoneally, intralesionally, intracranially, intraarticularly, intraprostatically, intrapleurally, intratracheally, intranasally, intravitreally, intravaginally, intrarectally, topically, intratumorally, peritoneally, subcutaneously, subconjunctival, intravesicularlly, mucosally, intrapericardially, intraumbilically, intraocularally, orally (e.g., a tablet, capsule, caplet, gelcap, or syrup), topically (e.g., as a cream, gel, lotion, or ointment), locally, by inhalation, by injection, or by infusion (e.g., continuous infusion, localized perfusion bathing target cells directly, catheter, lavage, in cremes, or lipid compositions). In some embodiments, if an antiviral agent is also administered in addition to a conjugate described herein, the antiviral agent or a pharmaceutical composition thereof may also be administered in any of the routes of administration described herein. The dosage of a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D- I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) or pharmaceutical compositions thereof depends on factors including the route of administration, the disease to be treated (e.g., the extent and/or condition of the viral infection), and physical characteristics, e.g., age, weight, general health, of the subject. Typically, the amount of the conjugate or the pharmaceutical composition thereof contained within a single dose may be an amount that effectively prevents, delays, or treats the viral infection without inducing significant toxicity. A pharmaceutical composition may include a dosage of a conjugate described herein ranging from 0.01 to 500 mg/kg (e.g., 0.01, 0.1, 0.2, 0.3, 0.4, 0.5, 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 100, 150, 200, 250, 300, 350, 400, 450, or 500 mg/kg) and, in a more specific embodiment, about 0.1 to about 30 mg/kg and, in a more specific embodiment, about 1 to about 30 mg/kg. In some embodiments, when a conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D-VI), or (M-I)-(M-XVII)) and an antiviral agent or antiviral vaccine are administered in combination (e.g., substantially simultaneously in the same or separate pharmaceutical compositions, or separately in the same treatment regimen), the dosage needed of the conjugate described herein may be lower than the dosage needed of the conjugate if the conjugate was used alone in a treatment regimen.
A conjugate described herein (e.g., a conjugate of any one of formulas (1), (2), (D-I), (D-IV)-(D- VI), or (M-I)-(M-XVII)) or a pharmaceutical composition thereof may be administered to a subject in need thereof, for example, one or more times (e.g., 1-10 times or more; 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 times) daily, weekly, monthly, biannually, annually, or as medically necessary. Dosages may be provided in either a single or multiple dosage regimens. The timing between administrations may decrease as the medical condition improves or increase as the health of the patient declines. The dosage and frequency of administration may be adapted by the physician in accordance with conventional factors such as the extent of the infection and different parameters of the subject. EXAMPLES The following examples are put forth so as to provide those of ordinary skill in the art with a description of how the compositions and methods described herein may be used, made, and evaluated, and are intended to be purely exemplary of the invention and are not intended to limit the scope of what the inventors regard as their invention. Example 1: Preparation of Fc constructs Reverse translations of the amino acids comprising the protein constructs (SEQ ID NOs: 1, 3, 5, 7, 9, 12, and 14) were synthesized by solid-phase synthesis. The oligonucleotide templates were cloned into pcDNA3.1 (Life Technologies, Carlsbad, CA, USA) at the cloning sites BamHI and XhoI (New England Biolabs, Ipswich, MA, USA) and included signal sequences derived from the human Interleukin-2 or human albumin. The pcDNA3.1 plasmids were transformed into Top10 E. coli cells (LifeTech). DNA was amplified, extracted, and purified using the PURELINK® HiPure Plasmid Filter Maxiprep Kit (LifeTech). The plasmid DNA is delivered, using the EXPIFECTAMINE™ 293 Transfection Kit (LifeTech), into HEK-293 cells per the manufacturer’s protocol. Cells were centrifuged, filtered, and the supernatants were purified using MabSelect Sure Resin (GE Healthcare, Chicago, IL, USA). Purified molecules were analyzed using 4-12% Bis Tris SDS PAGE gels by loading 1-2 µg of each molecule into the gel, and staining using instant Blue staining. Each gel included a molecular weight ladder with the indicated molecular weight standards. Reduced and non-reduced lanes are denoted by “R” and “NR”. FIGs.5-11 show non-reducing and reducing SDS-PAGE of an Fc domain formed from Fc domain monomers having the sequences of SEQ ID NOs: 1, 3, 5, 7, 9, 12, and 14, respectively. Example 2. Synthesis of h-IgG1 Fc-PEG4-azide Preparation of 0.05M PEG4-azido NHS ester solution in DMF/PBS: 195.8 mg of PEG4-azido NHS ester was dissolved in 0.500 mL of DMF at 0 °C and diluted to 9.88 mL by adding PBS buffer at 0 °C. This solution was used for preparing other PEG4-azido Fc with variety of DAR values by adjusting the equivalents of this PEG4-azido NHS ester solution. Preparation of PEG4-azido Fc: 0.05M PEG4-azidoNHS ester PBS buffer solution (9.88 mL, 494.0 μmol, 9.5 equivalents) was added to a solution of h-IgG1 Fc (SEQ ID NO: 4) (3027 mg in 213.0 mL of pH 7.4 PBS, MW~58,200 Da, 16.5 μmol) and the mixture was shaken gently for 2 hours at ambient temperature. The solution was concentrated by using 10 centrifugal concentrators (30,000 MWCO, 15
mL) to a volume of ~1.5 mL. The crude mixture was diluted 1:10 in PBS pH 7.4, and concentrated again. This wash procedure was repeated for total of three times. The small molecule reagent was removed with this wash procedure. The concentrated Fc-PEG4-azide (SEQ ID NO: 4) was diluted to 213.0 mL with pH 7.4 PBS 1x buffer and ready for Click conjugation. The purified material was quantified using a NANODROP™ UV visible spectrophotometer (using a calculated extinction coefficient based on the amino acid sequence of h-IgG1). Yield is quantitative after purification. The Fc-PEG4-azide (SEQ ID NO: 35) was prepared analogously. Example 3. Synthesis of recombinant mouse serum albumin (MSA)-PEG4-azide PEG4-azidoNHS ester (98%, 81.7 μmol, 4.5 equivalents, 32.4 mg in 0.3 mL of DMF and diluted to 1.63 mL with pH 7.4 PBS 1x buffer solution) was added to a solution of recombinant mouse serum albumin (SEQ ID NO: 71) (1200 mg in 75.0 mL of pH 7.4 PBS, MW~66,000 Da, 18.2 μmol) and the mixture was shaken gently for 12 hours at ambient temperature. The solution was concentrated using a centrifugal concentrator (30,000 MWCO) to a volume of ~1.5 mL. The crude mixture was diluted 1:10 in PBS pH 7.4, and concentrated again. This wash procedure was repeated for total of three times. The small molecule reagent was removed with this wash procedure. The concentrated MSA-PEG4-azide was diluted to 75.0 mL with pH 7.4 PBS 1x buffer and ready for Click conjugation. The purified material was quantified using a NANODROP™ UV visible spectrophotometer (using a calculated extinction coefficient based on the amino acid sequence of h-IgG1). Yield is quantitative after purification. DAR = 3.5 determined by MALDI. The DAR value can be adjusted by altering the equivalents of PEG4-azido NHS ester similar to h-IgG1 Fc (Example 2). Example 4. Synthesis of Int-1 Int-1 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference. Example 5. Synthesis of Int-2 Int-2 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference. Example 6. Synthesis of Int-3 Int-3 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference. Example 7. Synthesis of Int-4 Int-4 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference.
Example 8. Synthesis of conjugates – Click conjugation A preparation of 0.0050M CuSO4 in PBS buffer solution Click reagent was performed. Briefly, 10.0 mg CuSO4 was dissolved in 12.53 mL PBS, next 6.00 mL of the CuSO4 solution and added 51.7 mg BTTAA (CAS# 1334179-85-9) and 297.2 mg sodium ascorbate to give the Click reagent solution (0.0050M CuSO4, 0.020M BTTAA and 0.25M sodium ascorbate). This Click reagent solution was used for all subsequent conjugations. General procedure for Click conjugation of payload: a solution of azido functionalized Fc was added to a 15 mL centrifuge tube containing alkyne derivatized small molecule (2 equivalents for each DAR). After gently shaking to dissolve all solids, the mixture was treated with the Click reagent solution of (L-ascorbic acid sodium, 0.25 M, 400 equivalents, copper (II) sulfate 0.0050M, 8 equivalents, and BTTAA 0.020M, 32 equivalents). The resulting mixture was gently rotated for 6 hours at ambient temperature. It was purified by affinity chromatography over a protein A column, followed size exclusion chromatography (as described herein). Maldi TOF analysis of the purified final product gave an average mass, average DAR and Yield listed in Table 3 below. Table 3. Conjugates and properties
*The terminal Lys residue of the Fc domain may be cleaved upon expression and purification, e.g., SEQ ID NO: 64 coverts to SEQ ID NO: 73 Example 9. General procedure for purification of conjugates. The crude mixture was diluted 1:10 in PBS pH 7.4, and purified using MabSelect Sure Resin (GE Healthcare, Chicago, IL, USA), followed by size exclusion chromatography. (HiLoad 26/600 Superdex200 pg, GE Healthcare, Chicago, IL, USA). Fractions containing purified conjugate were pooled and concentrated to approximately 20 mg/mL using a centrifugal concentrator (30,000 MWCO). Purified
material was quantified using a NANODROP™ UV visible spectrophotometer using a calculated extinction coefficient based on the amino acid sequence of hIgG1 Fc(myc). Purified molecules were analyzed using 4-12% Bis Tris SDS PAGE gels by loading 1 μg of each molecule into the gel, and staining using Instant Blue (Expedeon, San Diego, CA, USA). Each gel included a molecular weight ladder with the indicated molecular weight standards. Yields were calculated and purity determined by Agilent Analytical HPLC. Product peak and MW were found by MALDI MS and a final DAR calculated. Example 10. gp120 glycoprotein binding assay Nunc MaxiSorp flat-bottom 96-well plates (12-565-136, Fisher Scientific) were coated with recombinant HIV-1 GP120 (SAE0071, Sigma) at 2 μg/mL in PBS (pH 7.4) (10-010-049, Fisher Scientific) overnight at 4°C (100 μL, 0.2 μg/well). Plates were washed (5 x 300 μL) with wash buffer (PBS 0.05% Tween 20) and blocked with 1% BSA (A5611-10G, Sigma; 200 μL/well) in wash buffer for 1 h at room temp on an orbital microplate shaker at 500 rpm (BT908, BT LabSystems). The blocking agent was removed and wells incubated with 3-fold serial dilutions of conjugate in sample diluent (0.5% BSA in PBS 0.025% Tween 20) starting at 1 μM for 1 h with shaking at room temp. After 5 x 300 μL washes, the plates were incubated with HRP conjugated donkey anti-human IgG Fc F(ab’)2 (709-036-098, Jackson ImmunoResearch) secondary antibody diluted 1:1,000 in sample diluent for 1 h with shaking at room temp. Plates were then washed (8 x 300 μL) and developed with TMB substrate (BD555214, Fisher Scientific) for 3-5 minutes at room temp. The reaction was stopped with 1N H2SO4 and the absorbance read at 450 nm using the EnSpire multimode plate reader (PerkinElmer). Half maximal effective concentration (EC50) was calculated with GraphPad Prism version 8 using nonlinear regression analysis (Sigmoidal, 4PL) of binding curves. Polyclonal goat anti-GP120 HRP (PA1-73097, Invitrogen) and unconjugated Fc molecule were run as the positive and negative binding controls, respectively. The results are provided in FIG.12 and in Table 4. Table 4. GP120 protein binding EC50 (nM)
Example 11. Activity of pre-conjugation intermediate (Int) compounds in an in vitro cell fusion assay Activity of HIV compounds was determined in an assay designed to measure the inhibition of cell- cell fusion which is an important step in the HIV infection process. Briefly, this assay measures the fusion of two cell lines, HeLa-CD4-LTR-β-Gal (catalog #1294) and HL2/3 cells (catalog #1294), obtained from the AIDS Research Reagent and Reference Program (Rockville, MD). HeLa-CD4-LTR-β-Gal cells were
plated at a density of 5 x 103 cells per well in a volume of 50 µL with 50 µL of nine serial logarithmic dilutions of compound in triplicate for one hour at 37ºC/5% CO2. Following the incubation, 100 µL of HL2/3 cells were added to the plates. The cultures were incubated for an additional 48 hours at 37ºC/5% CO2. Following the incubation, efficacy plates were evaluated for β-galactosidase production using a chemiluminescent substrate and toxicity plates were stained with XTT to evaluate cell viability. In these studies, cytotoxicity was also evaluated (TC50). Test materials were derived by measuring the reduction of the tetrazolium dye XTT (2,3-bis(2-methoxy-4-nitro-5-sulfophenyl)-5- [(phenylamino)carbonyl]-2H-tetrazolium hydroxide). XTT in metabolically active cells is metabolized by the mitochondrial enzyme NADPH oxidase to a soluble formazan product. XTT solution was prepared daily as a stock of 1 mg/mL in RPMI-1640 without additives. Phenazine methosulfate (PMS) solution was prepared at 0.15 mg/mL in DPBS and stored in the dark at -20°C. XTT/PMS stock was prepared immediately before use by adding 40 μL of PMS per mL of XTT solution.50 μL of XTT/PMS was added to each well of the plate and the plate incubated for 4 hours at 37°C. The 4 hour incubation has been empirically determined to be within the linear response range for XTT dye reduction with the indicated numbers of cells for each assay. The plates were sealed and inverted several times to mix the soluble formazan product and the plate was read at 450 nm (650 nm reference wavelength) with a Molecular Devices SpectraMax Plus 38496 well plate format spectrophotometer. This assay identified several compounds with EC50 values within 10-fold of the benchmark compound (Temsavir) (Table 5). Particularly active was Int-1 with an EC50 of less than 4 nM. Importantly, no cytotoxicity was evident for any compound at concentration tested. The combination of nM inhibition and no detectable cytotoxicity indicates this is a potent series with significant therapeutic potential. Table 5. Fusion inhibition activity of pre-conjugation intermediate (Int) compounds
Example 12. General procedure for Synthesis of azido Fc Preparation of PEG4-azido NHS ester solution (0.050 M) in DMF/PBS: 16.75 mg of PEG4-azido NHS ester was dissolved in 0.100 mL of DMF at 0 °C and diluted to 0.837 mL by adding PBS 1x buffer at 0 °C. This solution was used for preparing other PEG4-azido Fc with a variety of DAR values by adjusting the equivalents of this PEG4-azido NHS ester PBS solution. Pretreatment of h-lgG1 Fc, SEQ ID NO: 48 (107.2 mg in 8.800 mL of pH 7.4 PBS, MW~57891 Da, 1.852 μmol): The Fc solution was transferred into four centrifugal concentrators (30,000 MWCO, 15 mL) and diluted to 15 mL with PBS x1 buffer and concentrated to a volume of ~1.5 mL. The residue was diluted 1:10 in PBS pH 7.4, and concentrated again. This wash procedure was repeated for total of four times followed by dilution to 8.80 mL. Preparation of PEG4-azido Fc: 0.050M PEG4-azidoNHS ester PBS buffer solution (0.593 mL, 29.6 μmol, 16 equivalents) was added to above solution of h-IgG1 Fc (SEQ ID NO: 48) and the mixture was shaken rotated for 2 hours at ambient temperature. The solution was concentrated by using four centrifugal concentrators (30,000 MWCO, 15 mL) to a volume of ~1.5 mL. The crude mixture was diluted 1:10 in PBS pH 7.4, and concentrated again. This wash procedure was repeated for total of three times. The concentrated Fc-PEG4-azide was diluted to 8.80 mL with pH 7.4 PBS buffer and ready for Click conjugation. The purified material was quantified using a NANODROP™ UV visible spectrophotometer (using a calculated extinction coefficient based on the amino acid sequence of h-IgG1). Yield was quantitative after purification. Example 13. Synthesis of Int-12 Int-12 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference. Example 14. Synthesis of Int-13 Int-13 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference. Example 15. Synthesis of Int-14 Int-14 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference. Example 16. Synthesis of Int-15 Int-15 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference. Example 17. Synthesis of Int-16 Int-16 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference.
Example 18. Synthesis of Int-17 Int-17 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference. Example 19. Screening of Ints and conjugates in an in vitro cell fusion assay Activity of Ints was determined in an assay designed to measure the inhibition of cell-cell fusion which is an important step in the HIV infection process. Briefly, this assay measures the fusion of two cell lines, HeLa-CD4-LTR-β-Gal (catalog #1294) and HL2/3 cells (catalog #1294), obtained from the AIDS Research Reagent and Reference Program (Rockville, MD). HeLa-CD4-LTR-β-Gal cells were plated at a density of 5 x 103 cells per well in a volume of 50 µL, with 50 µL of nine serial half-logarithmic dilutions of compound in triplicate for one hour at 37ºC/5% CO2. Following the incubation, 100 µL of HL2/3 cells were added to the plates. The cultures were incubated for an additional 48 hours at 37ºC/5% CO2. Following the incubation, efficacy plates were evaluated for β-galactosidase production using a chemiluminescent substrate and toxicity plates were stained with XTT to evaluate cell viability. In these studies, cytotoxicity was also evaluated (TC50). Test materials were derived by measuring the reduction of the tetrazolium dye XTT (2,3-bis(2-methoxy-4-nitro-5-sulfophenyl)-5- [(phenylamino)carbonyl]-2H-tetrazolium hydroxide). XTT in metabolically active cells is metabolized by the mitochondrial enzyme NADPH oxidase to a soluble formazan product. XTT solution was prepared daily as a stock of 1 mg/mL in RPMI-1640 without additives. Phenazine methosulfate (PMS) solution was prepared at 0.15 mg/mL in DPBS and stored in the dark at -20°C. XTT/PMS stock was prepared immediately before use by adding 40 μL of PMS per mL of XTT solution. Fifty μL (50 μL) of XTT/PMS was added to each well of the plate and the plate incubated for 4 hours at 37°C. The 4 hour incubation has been empirically determined to be within the linear response range for XTT dye reduction with the indicated numbers of cells for each assay. The plates were sealed and inverted several times to mix the soluble formazan product and the plate was read at 450 nm (650 nm reference wavelength) with a Molecular Devices SpectraMax Plus 38496 well plate format spectrophotometer. This assay identified four compounds with EC50 values approximately equal to the benchmark compound (Temsavir) (Table 6). These compounds were highly potent at inhibiting cell fusion with EC50 values of less than 0.9 nM. One of these compounds, Int-17, also demonstrated no apparent loss of activity upon conjugation to an hIgG1 Fc (conjugate 5); this was an important finding. Lastly, no compounds showed cytotoxicity at the concentrations tested in this study. Therefore, for the most active compounds the difference between EC50 and cytotoxicity is greater than 10,000-fold. A prior fusion inhibition study also identified several highly active compounds (Int-2 and Int-4). However, both compounds lost significant potency upon conjugation (conjugates 2 and 3, respectively), further emphasizing the significance of conjugate 4.
Table 6. Fusion inhibition activity of HIV inhibitor compounds
Example 20. Synthesis of DMJ-II-121
DMJ-II-12 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference.
Example 21. Synthesis of Conjugate 8 A solution of azido functionalized Fc (50 mg, 28.43 mL, 0.862 µmol, 1.76 mg/mL; SEQ ID NO: 64, Example 2) was added to a 50 mL centrifuge tube following by addition of alkyne derivatized small molecule (15.83 mg, 0.012 mmol, Int-15, Example 16) in EPPES at pH 8.5, and a solution of copper (II) sulfate (1.1 mg, 0.0043 mmol) in water mixed with THTPA (0.43 mL, 0.0216 mmol, 50nM in water), aminoguanidine HCl (2.16 mL, 100 mM in water), and sodium ascorbate (2.16 mL, 100 mM in water). The resulting solution was gently shaken for 4 hours. It was purified by affinity chromatography over a protein A column, followed by size exclusion chromatography (as described in Example 8). Maldi TOF analysis of the purified final product gave an average mass of 60593 Da (DAR 2.5). Yield 12.71 mg, 25%. Example 22. Synthesis of Int-18 Int-18 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference. Example 23. Synthesis of Int-19 Int-19 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference. Example 24. Synthesis of Int-20 Int-20 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference. Example 25. Synthesis of Int-21 Int-21 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference. Example 26. Synthesis of Int-22 Int-22 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference. Example 27. Synthesis of Conjugate 9 Prepared the Click reagent solution: 0.0050M CuSO4 in PBS buffer solution: 10.0 mg CuSO4 was dissolved in 12.53 mL PBS, then took 6.00 mL this CuSO4 solution and added 64.8 mg BTTAA(CAS# 1334179-85-9) and 297 mg sodium ascorbate to give the Click reagent solution (0.0050M CuSO4, 0.025M BTTAA and 0.25M sodium ascorbate). To a solution of azido functionalized Fc (122.1 mg, 8.55 mL, 21.1 µmol, SEQ ID NO: 64, Example 2, DAR = 3.9, in 25mM MES, 150mM NaCl, pH6.0 buffer) in a 15 mL centrifuge tube was added an alkyne derivatized small molecule (25.0 mg, 19.0 mmol, 3.0 equivalents for each azido on the Fc, described in Example 26, Int-22) in 1.5 mL of MES buffer. After gently agitating, the mixture was treated
with the Click reagent solution (5.05 mL). The resulting mixture was gently rotated for 4 hours at ambient temperature. It was then purified by affinity chromatography over a protein A column, followed size exclusion chromatography (see general conjugate purification protocol). Maldi TOF analysis of the purified final product gave an average mass of 65,947 Da (DAR =3.4). Yield 50 mg/ 41%. Example 28. Synthesis of Int-23 Int-23 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference. Example 29. Synthesis of Int-24 Int-24 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference. Example 30. Synthesis of Int-25 Int-25 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference. Example 31. Synthesis of Int-26 Int-26 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference. Example 32. Synthesis of Int-27 Int-27 was synthesized according to the procedure found in WO 2020/252393, hereby incorporated by reference.
Example 33. Synthesis of Int-53
To a -15 °C stirring solution of Nα-Boc-Nδ-Cbz-L-ornithine (1.00 g, 2.729 mmol) and N- methylmorpholine (300 uL, 2.729 mmol) in THF (10.0 mL), it was added isobutylchlorofromate (355 uL, 2.729 mmol). After stirring for 5 minutes, a freshly prepared solution of sodium borohydride (310 mg, 8.188 mmol) in water (4.0 mL) was added. Upon reaction completion, water (10 mL) was added and the temperature raised to ambient, while stirring continued for 1 h. The resulting mixture was extracted with DCM (4 x 30 mL), and the combined organics were dried with magnesium sulfate, filtered and concentrated per rotatory evaporation. Residual volatiles were evaporated under high vacuum. This material was used in the next step without further purification. LCMS: [(M + H]]+ = 353.2.
Step b.
Under hydrogen atmosphere, a suspension of the product from step a (2.729 mmol, theoretical) and 20% palladium hydroxide on carbon (500 mg) in MeOH (20 mL), was stirred until full consumption of the starting material. The mixture was filtered and the filtrate concentrated per rotatory evaporation. Residual volatiles were evaporated under high vacuum. This material was used in the next step without further purification. Ions found by LCMS: [(M + H]]+ = 219.2. Step c.
To a 0 °C stirring solution of step b product (150 mg, 0.687 mmol), propargyl-PEG4-acid (179 mg, 0.687 mmol), and DIPEA (0.359 mL, 2.061 mmol) in DMF (4.0 mL) and DCM (0.5 mL), was added HATU (266 mg, 0.701 mmol). The temperature was raised to ambient and stirring was continued until complete as determined by LCMS. All the volatiles were removed per rotatory evaporation. The residue was purified by RP-C18 column using an Isco CombiFlash liquid chromatography eluted with 0% to 100% water and methanol, no modifier. Yield 0.186 g, 59%. Ions found by LCMS: [(M + H)]+ = 461.3. Step d.
The product from step c (186 mg, 0.404 mmol) was treated with 4.0 M solution of HCl in dioxane (3.0 mL) under stirring. Upon completion, all the volatiles were evaporated per rotatory evaporation and high vacuum. This material was used in the next step without further purification. Yield 0.161 g, quant. Ions found by LCMS: [(M + H]]+ = 361.2.
Step e.
To a 0 °C stirring solution of step d product (31 mg, 0.078 mmol), example XYZ step XYYY (40 mg, 0.078 mmol), HOBt hydrate (36 mg, 0.235 mmol, ~80%) and DIPEA (0.082 mL, 0.469 mmol) in DMF (3.0 mL) and DCM (0.5 mL), was added HATU (89 mg, 0.235 mmol). The temperature was raised to ambient and stirring was continued until complete as determined by LCMS. All the volatiles were removed per rotatory evaporation. The residue was purified by RP-C18 column using an Isco ACCQ liquid chromatography eluted with 0% to 100% water and acetonitrile, 0.1% TFA modifier. Yield 0.051 g, 76%. Ions found by LCMS: [(M + H)]+ = 854.2. Step f.
To a stirring solution of the product from step e (0.047 mg, 0.055 mmol), azido-PEG4- trifluorophenyl ester (24 mg, 0.058 mmol), BTTAA (1.2 mg, 0.0027 mmol), cupric sulfate (0.2 mg, 0.0014 mmol), in DMF (1.0 mL) and water (1.0 mL), it was added sodium ascorbate (5.4 mg, 0.028 mmol). Upon completion, acetic acid (0.099 mL, 1.725 mmol) was added, and the reaction was concentrate per rotatory evaporation. The residue was purified by RP-C18 column using an Isco ACCQ liquid chromatography eluted with 0% to 100% water and methanol, 0.1% TFA modifier. Yield 0.047 g, 60%. Ions found by LCMS: [(M + 2H)/2]+ = 638.3.
Example 34. Synthesis of Int-53
To a -15 °C stirring solution of Nα-Boc-Nδ-Cbz-L-ornithine (1.00 g, 2.729 mmol) and N- methylmorpholine (300 uL, 2.729 mmol) in THF (10.0 mL), it was added isobutylchlorofromate (355 uL, 2.729 mmol). After stirring for 5 minutes, a freshly prepared solution of sodium borohydride (310 mg, 8.188 mmol) in water (4.0 mL) was added. Upon reaction completion, water (10 mL) was added and the temperature raised to ambient, while stirring continued for 1 h. The resulting mixture was extracted with DCM (4 x 30 mL), and the combined organics were dried with magnesium sulfate, filtered and concentrated per rotatory evaporation. Residual volatiles were evaporated under high vacuum. This material was used in the next step without further purification. LCMS: [(M + H]]+ = 353.2.
Step b.
Under hydrogen atmosphere, a suspension of the product from step a (2.729 mmol, theoretical) and 20% palladium hydroxide on carbon (500 mg) in MeOH (20 mL), was stirred until full consumption of the starting material. The mixture was filtered and the filtrate concentrated per rotatory evaporation. Residual volatiles were evaporated under high vacuum. This material was used in the next step without further purification. Ions found by LCMS: [(M + H]]+ = 219.2. Step c.
To a 0 °C stirring solution of step b product (150 mg, 0.687 mmol), propargyl-PEG4-acid (179 mg, 0.687 mmol), and DIPEA (0.359 mL, 2.061 mmol) in DMF (4.0 mL) and DCM (0.5 mL), was added HATU (266 mg, 0.701 mmol). The temperature was raised to ambient and stirring was continued until complete as determined by LCMS. All the volatiles were removed per rotatory evaporation. The residue was purified by RP-C18 column using an Isco CombiFlash liquid chromatography eluted with 0% to 100% water and methanol, no modifier. Yield 0.186 g, 59%. Ions found by LCMS: [(M + H)]+ = 461.3. Step d.
The product from step c (186 mg, 0.404 mmol) was treated with 4.0 M solution of HCl in dioxane (3.0 mL) under stirring. Upon completion, all the volatiles were evaporated per rotatory evaporation and high vacuum. This material was used in the next step without further purification. Yield 0.161 g, quant. Ions found by LCMS: [(M + H]]+ = 361.2.
Step e.
To a 0 °C stirring solution of step d product (31 mg, 0.078 mmol), the triazole acid described in Example 5 (40 mg, 0.078 mmol), HOBt hydrate (36 mg, 0.235 mmol, ~80%), and DIPEA (0.082 mL, 0.469 mmol) in DMF (3.0 mL) and DCM (0.5 mL), was added HATU (89 mg, 0.235 mmol). The temperature was raised to ambient and stirring was continued until complete as determined by LCMS. All the volatiles were removed per rotatory evaporation. The residue was purified by RP-C18 column using an Isco ACCQ liquid chromatography eluted with 0% to 100% water and acetonitrile, 0.1% TFA modifier. Yield 0.051 g, 76%. Ions found by LCMS: [(M + H)]+ = 854.2. Step f.
To a stirring solution of the product from step e (0.047 mg, 0.055 mmol), azido-PEG4- trifluorophenyl ester (24 mg, 0.058 mmol), BTTAA (1.2 mg, 0.0027 mmol), cupric sulfate (0.2 mg, 0.0014 mmol), in DMF (1.0 mL) and water (1.0 mL), it was added sodium ascorbate (5.4 mg, 0.028 mmol). Upon completion, acetic acid (0.099 mL, 1.725 mmol) was added, and the reaction was concentrate per rotatory evaporation. The residue was purified by RP-C18 column using an Isco ACCQ liquid chromatography eluted with 0% to 100% water and methanol, 0.1% TFA modifier. Yield 0.047 g, 60%. Ions found by LCMS: [(M + 2H)/2]+ = 638.3.
Example 35. Synthesis of Int-56
To a solution of 1-Boc-2-hydroxymethyl-piperazine (1.01 g, 4.68 mmol) and 3- [(benzyloxycarbonyl)amino]propionaldehyde (0.97 g, 4.68 mmol) in DCE (25 mL) was added sodium triacetoxyborohydride (2.97 g, 14.04 mmol) followed by acetic acid (1.1 mL, 18.72 mmol). The solution was then stirred for 16h at which point LC-MS showed 90% consumption of starting material. The crude reaction was then washed with 1N NaOH (2 x 50 mL). The organic layer was then concentrated and the crude material was purified via flash chromatography (0-100% hexanes and EtOAc). LCMS: [M + H]+ = 408.3. Steb b.
To tert-butyl 2-(hydroxymethyl)-4-{3- [(phenylmethoxy)carbonylamino]propyl}piperazinecarboxylate) was added HCl in dioxane (12 mL, 4M). The solution was stirred for 1h at which point LC-MS showed complete consumption of starting material, and the solvent was removed. The crude material was used in the next step without further purification. LCMS: [M + H]+ = 308.2.
Step c.
To a solution of N-{3-[3-(hydroxymethyl)piperazinyl]propyl}(phenylmethoxy)carboxamide hydrochloride salt (1.5 g, 3.95 mmol) in Hunigs base (15 mL) and DMF (2 mL) was added propargyl- PEG4-mesylate (1.57 g, 5.07 mmol). The solution was then stirred for 48h at 60 °C at which point LC-MS showed 50% consumption of starting material. The crude mixture was concentrated and then purified via flash chromatography (0-70% EtOAc and MeOH:NH4OH). Yield 0.47 g, 19.2% (Yield for 3 steps). LCMS: [M + H]+ = 522.3. Step d.
To N-{3-[3-(hydroxymethyl)-4-(2-{2-[2-(2-prop-2- ynyloxyethoxy)ethoxy]ethoxy}ethyl)piperazinyl]propyl}(phenylmethoxy)carboxamide (0.47 g, 0.91 mmol) was added TFA (10 mL) followed by thioanisole (2.79 g, 22.53 mmol). The solution was cooled to 0 °C and then bromotrimethylsilane (0.59 mL, 4.50 mmol) was added dropwise. The solution was stirred for 1h at which point LC-MS showed complete consumption of starting material. The solvent was removed and the crude mixture was washed with hexane (2 x 20 mL) to remove unwanted thioanisole. The crude mixture was then purified via flash chromatography (0-70% EtOAc and MeOH:NH4OH). Yield 0.25 g, 71.6%. LCMS: [M + H]+ = 388.4. Step e.
To a solution of triazole acid core described in Example 5 in DMF (4 mL) was added Hunigs base (0.50 mL, 2.83 mmol) followed by HOBt (0.19 g, 1.42 mmol) and HATU (0.43 g, 1.13 mmol). The solution was stirred for 30 min at which point LC-MS showed complete consumption of starting material. The crude material was then directly purified by reversed phase HPLC (0-100% H2O and CH3CN using 0.1% TFA). The product containing fractions were combined frozen, and lyophilized. Yield 0.16 g, 64.0%. LCMS: [M + H]+ = 881.2.
Step f.
To a solution of the alkyne (110 mg, 0.12 mmol) and 2,4,6-trifluorophenyl 3-(2-{2-[2-(3-diazo-3- azaprop-3-enyloxy)ethoxy]ethoxy}ethoxy)propanoate (68 mg, 0.16 mmol) in DMF (4 mL) was added a solution of sodium ascorbate (25 mg, 0.12 mmol) and copper(II) sulfate (5 mg, 0.03 mmol) in H2O (2 mL). The solution was stirred for 30 min at which point LC-MS showed complete consumption of starting material. The reaction was acidified with acetic acid (2 drops) and the crude material was then directly purified by reversed phase HPLC (0-100% H2O and CH3CN using 0.1% TFA). The product containing fractions were combined frozen, and lyophilized. Yield 65 mg, 40.0%. LCMS: [M + 2H]+2 = 652.0. Example 36. Synthesis of Int-57
To a solution of benzyl 1-piperazinecarboxylate (4.4 g, 20 mmol) in anhydrous THF (20 ml) was added K2CO3 (4.15 g, 30 mmol) and 1,3-dibromopropane (20.2 g, 100 mmol). The mixture was heated at 70℃ overnight, then cooled to room temperature. The salt was filtered off, and the filtrate was
concentrated by rotary evaporation. The residue was purified through silica gel chromatography (80 g, 20% to 100% EtOAc and hexane). Yield 4.48 g, 65.5%. Ion found by LCMS: [M + H]+ = 341.0. Step b.
To a solution of the step-a product (1.56 g, 4.57 mmol) in anhydrous DMF (5 ml) was added K2CO3 (691 mg, 5 mmol) and 2,2-dimethyl-1,3-dioxan-5-amine (779.5 mg, 5.94 mmol). The mixture was heated at 70℃ overnight, then cooled to room temperature. The salt was filtered off, and the filtrate was concentrated by rotary evaporation. The residue was purified by RPLC (100 g, 0% to 70% acetonitrile and water). Yield 1.47 mg, 82.2%. Ion found by LCMS: [M + H]+ = 392.0. Step c.
To a solution of the step-b product (760.9 mg, 1.94 mmol) in anhydrous DMF (3 ml) was added DIPEA (501.5 mg, 3.88 mmol), 3(-(Boc)-aminopropyl bromide (695.3 mg, 2.92 mmol) and NaI (75 mg, 0.5 mmol). The mixture was stirred at 50℃ for 3 days. It was then was purified by RPLC (100 g, 5% to 90% acetonitrile and water). Yield 679.6 mg, 64%. Ion found by LCMS: [M + H]+ = 549.0. Step d.
The step-c product (679.6 mg, 1.24 mmol) was dissolved in MeOH (15 ml) and added with AcOH (74.3 mg, 1.24 mmol)) and Pd(OH)2 (20% on carbon, 350 mg). The resulting mixture was stirred under hydrogen atmosphere for 3 hours. Pd(OH)2/C was filtered off, and the filtrate was concentrated by rotary evaporation and further dried under high vacuum. Yield 588.5 mg, quantitative yield. Ion found by LCMS [M + H]+ = 415.0, [M + H - Boc]+ = 315.0. Step e.
To a solution of the step-d product (588.5 mg, 1.24 mmol) in anhydrous DMF (2 ml) and THF (2 ml) was added K2CO3 (514 mg 3.72 mmol) and propargyl-PEG4-mesyl ester (465.5 mg, 1.5 mmol). The resulting mixture was heated at 70℃ for 1 day. The salt was filtered off, and the filtrate was concentrated
and purified by HPLC (0% to 70% acetonitrile and water). Yield 253.2 mg, 32.5%. Ion found by LCMS: [M + H]+ = 629.4, [(M + 2H)/2]+ = 315.0. Step f.
The step-e product (253.2 mg, 0.403 mmol) was dissolved in acetonitrile/water (1:1, 4 ml) and added with 6N HCl aqueous solution (0.5 ml, 3 mmol). The reaction was stirred at 60℃ for 5 hours, then cooled to room temperature. After the pH was adjusted to about 8 with 1M KOH (3.5 ml), the solution was lyophilized. The residue was then re-dissolved in MeOH and KCl was filtered off. The filtrate was concentrated by rotary evaporation and further dried under high vacuum. Yield 197 mg, quantitative yield. Ion found by LCMS: [M + H]+ = 489.4, [(M + 2H)/2]+ = 245.2. Step g.
A mixture of triazole acid core described in Example 5 (50mg, 0.0976 mmol) and HOBT hydrate (76 mg, 0.5 mmol) was dissolved in anhydrous DMF (1 ml) and Et3N (606 mg, 6 mmol) by gently heated with a heat-gun. After cooled to room temperature, the mixture was added with HATU (114 mg, 0.3 mmol), followed by the addition of a solution of the step-f product (113.4 mg, 0.12 mmol) in anhydrous DMF (0.3 ml). Addition DMF (0.2 ml x 3) was used to rinse the flask and combined to the reaction flask. The reaction mixture was stirred for 1 hour and then directly purified by HPLC: 5% to 50% acetonitrile and water, using 0.1% TFA as modifier. Yield 42 mg, 30%. Ions found by LCMS: [M + H]+ = 981.8, [(M + 2H)/2]+ = 491.6, [(M + 3H)/3]+ = 328.2. Step h. Synthesis of Int-57
Inte-51 (34.6 mg, 0.024 mmol) was dissolved in DMF (1 ml), and the solution was cooled in an ice-water bath. TFA (15 µl) was added, followed by the addition of Azido-PEG4-trifluorophenyl ester ( 12.2 mg, 0.029 mmol), a pre-mixed solution of THPTA (3.5 mg, 0.008 mmol) and sodium ascorbate (40 mg, 0.202 mmol) in water (0.5 ml) and Cu2SO4 (1.6 mg, 0.01 mmol). The ice-water bath was removed, and the reaction mixture was stirred for 30 minutes. It was then directly purified by RPLC (50 g, 5% to 80% acetonitrile and water, using 0.1% TFA as modifier). Yield 30.7 mg, 68.8%. Ions found by LCMS: [(M + 2H)/2]+ = 702, [(M + 3H)/3]+ = 468.4. Example 37. Synthesis of Int-74
To a solution of [(3S,5S)-5-(hydroxymethyl)-3-pyrrolidinyl] carbamic acid tert-butyl ester (432.6 mg, 2 mmol) in anhydrous DMF (2 ml) was added DIPEA (517 mg, 4 mmol), propargyl-PEG4-mesyl ester (682.9 mg, 2.2 mmol) and NaI (60 mg, 0.4 mmol). The resulting mixture was heated at 70 ⁰C for 3 hours. It was then directly purified through RPLC (100 g, 5% to 100% acetonitrile and water). Yield 509.4 mg, 59.2%. Ion found by LCMS: [M + H]+ = 431.
Step b.
To a solution of the product from the previous step in acetonitrile (6 ml) was added 6N HCl aqueous solution (2 ml) and water (3 ml). The resulting mixture was heated at 50 ⁰C overnight, then cooled to room temperature and lyophilized. Yield 480 mg, quantitative yield. Ion found by LCMS: [M + H]+ = 331.0. Step c.
A mixture of previously described triazole acid (in Example 5) (25.6 mg, 0.05 mmol) and the product from the previous step (40.3 mg, 0.1 mmol) were dissolved in anhydrous DMF (0.5 ml) and Et3N (303 mg, 3 mmol) then gently heated with a heat-gun. After cooled to room temperature, the resulting mixture was treated with HATU (57 mg, 0.075 mmol) and stirred for 2 hour. It was then directly purified by HPLC (5 to 50% acetonitrile and water, using 0.1% TFA as modifier). Yield 37.5 mg, 71.3%. Ions found by LCMS: [M + Na]+ = 846.2, [M + H]+ = 824.4, [(M + 2H)/2]+ = 412.8. Step d.
The product from the previous step (35 mg, 0.0333 mmol) was dissolved in DMF (0.5 ml). After cooled in an ice-water bath, the solution was treated with TFA (15 µl) and Azido-PEG4-trifluorophenyl ester (21 mg, 0.05 mmol). To this solution was added a pre-mixed solution of THPTA (3.5 mg, 0.008 mmol) and sodium ascorbate (20 mg, 0.101 mmol) in water (0.5 ml) was then added, followed by Cu2SO4 (1.6 mg, 0.01 mmol) and MeOH (0.5 ml). The ice-water bath was removed, and the resulting mixture was stirred for 30 minutes. It was then directly purified by RPLC (50 g, 5% to 80% acetonitrile and water, using
0.1% TFA as modifier). Yield 36 mg, 73.4%. Ion found by LCMS: [M + H]+ = 1244.4, [(M + 2H)/2]+ = 622.8. Example 38. Screening of HIV lead compounds in an in vitro cell fusion assay Activity of HIV compounds was determined in an assay designed to measure the inhibition of cell- cell fusion mediated by gp120 and CD4 interaction which is an important step in the HIV infection process. Briefly, this assay measures the fusion of two cell lines, HeLa-CD4-LTR-β-Gal (catalog #1294) and HL2/3 cells (catalog #1299), obtained from the AIDS Research Reagent and Reference Program (Rockville, MD). HeLa-CD4-LTR-β-Gal cells were plated at a density of 5 x 103 cells per well in a volume of 50 µL, with 50 µL of nine serial half-logarithmic dilutions of compound in triplicate for one hour at 37 ºC/5% CO2. Following the incubation, 100 µL of HL2/3 cells were added to the plates. The cultures were incubated for an additional 48 hours at 37 ºC/5% CO2. Following the incubation, efficacy plates were evaluated for β-galactosidase production using a chemiluminescent substrate and toxicity plates were stained with XTT to evaluate cell viability. In these studies cytotoxicity was also evaluated (TC50). Test materials were derived by measuring the reduction of the tetrazolium dye XTT (2,3-bis(2-methoxy-4-nitro-5-sulfophenyl)-5- [(phenylamino)carbonyl]-2H-tetrazolium hydroxide). XTT in metabolically active cells is metabolized by the mitochondrial enzyme NADPH oxidase to a soluble formazan product. XTT solution was prepared daily as a stock of 1 mg/mL in RPMI-1640 without additives. Phenazine methosulfate (PMS) solution was prepared at 0.15 mg/mL in DPBS and stored in the dark at -20°C. XTT/PMS stock was prepared immediately before use by adding 40 μL of PMS per mL of XTT solution. Fifty μL (50 µL) of XTT/PMS was added to each well of the plate and the plate incubated for 4 hours at 37°C. The 4 hour incubation has been empirically determined to be within the linear response range for XTT dye reduction with the indicated numbers of cells for each assay. The plates were sealed and inverted several times to mix the soluble formazan product and the plate was read at 450 nm (650 nm reference wavelength) with a Molecular Devices SpectraMax Plus 38496 well plate format spectrophotometer. In this assay 7 Ints and 5 conjugates were run in with 2 control compounds (Chicago Sky Blue (CSB) and Enfuvirtide) know to prevent cell fusion mediated by binding of viral gp120 to the host cell receptor (CD4). CSB and Enfuvirtide had EC50 values of 201 and 409 nM, indicating moderate inhibition of cell fusion. In contrast, the 7 Ints ranged from 1.56 to 72.8 nM. Collectively, the Ints tested in this assay have significantly more activity than Enfuvirtide, an approved HIV therapeutic. Importantly, conjugate 5 was highly potent with an EC50 value of 3.62 nM. With the exception of conjugate 014 (EC50 value of 406 nM) the other AVCs were >500 nM. However, it is worth noting that although the EC50 for these compounds was greater than 500 nM a signal was detected at this concentration, suggesting the true value is not much higher than 500 nM. Most critically is the observation that at least one of these chemical series could be conjugated to an hIgG1 Fc and retain potent activity (Int-17 conjugate, conjugate 5). Lastly, no test articles showed cytotoxicity at the concentrations tested in this study.
Table 7. Activity of lead compounds in a cell fusion assay (EC50) and cytotoxicity (TC50).
Example 39. Synthesis of a conjugate including an Fc domain having a C220S/YTE quadruple mutation Preparation of the Click reagent solution: 0.0050M CuSO4 in PBS buffer solution: 10.0 mg CuSO4 was dissolved in 12.53 mL PBS, then took 5.00 mL this CuSO4 solution and added 43.1 mg BTTAA (CAS# 1334179-85-9) and 247.5 mg sodium ascorbate to give the Click reagent solution (0.0050M CuSO4, 0.020M BTTAA and 0.25M sodium ascorbate). To a solution of azido functionalized Fc having a C220S mutation and a YTE mutation (65.5 mg, 10.0 mL, 1.13 µmol, azido DAR~5.9, SEQ ID NO: 67) in a 15 mL centrifuge tube was added to an alkyne derivatized small molecule (3.0 equivalents per each azido of the Fc). After gently agitating to dissolve all solids, the mixture was treated with the Click reagent solution (1.80 mL). The resulting mixture was gently rotated for 12 hours at ambient temperature. It was purified by affinity chromatography over a protein A column, followed size exclusion chromatography (see general conjugate purification protocol). Maldi TOF analysis of the purified final product gave an average mass of 66,420 Da (DAR = 5.8). Yield 57 mg with 98% purity. Example 40. 30-day comparative non-human primate PK study following IV administration of a conjugate including an Fc domain having a C220S/YTE quadruple mutant A conjugate including an Fc domain having a C220S mutation and a YTE mutation (SEQ ID NO: 67) was synthesized as described in Example 39. A non-human primate PK study was performed to compare IV administration of the C220S/YTE Fc conjugate (SEQ ID NO: 67) to a conjugate including an Fc domain having a C220S mutation alone (SEQ ID NO: 64). Non-human primate (NHP) PK studies were performed by BTS Research (San Diego, CA) using male and female cynomolgus monkeys 5-9 years old with body weights ranging from 3.5-8.5 kg. NHPs were injected IV with 2 mg/kg of test article (0.4 mL/kg dose volume). Animals were housed under
standard IACUC approved housing conditions. At appropriate times animals were non-terminally bled (via femoral or cephalic veins) with blood collected in K2EDTA tubes to prevent coagulation. Collected blood was centrifuged (2,000 x g, for 10 minutes) and plasma withdrawn for analysis of test article concentrations over time. The plasma concentrations for the C220S/YTE Fc conjugate and the C220S conjugate at each time point were measured by sandwich ELISA. Briefly, test articles were captured on Fc-coated plates and then detected using a HRP-conjugated anti-human IgG-Fc antibody. Protein concentrations were calculated in GraphPad Prism using 4PL non-linear regression of the C220S/YTE Fc conjugate or C220S conjugate standard curves. A more detailed method description is provided above. The corresponding curves are shown in FIG.13. The C220S/YTE Fc conjugate demonstrates a significantly improved terminal half-life of ~45 days compared with ~10 days for the C220S Fc conjugate. AUCs for the C220S/YTE Fc conjugate are 2X greater than the AUCs for The C220S conjugate (Table 8).
Table 8. Monkey PK, C220S/YTE Fc conjugate vs. C220S Fc conjugate
Example 41. Screening of HIV compounds in an in vitro cell fusion assay Activity of HIV compounds was determined in an assay designed to measure the inhibition of cell- cell fusion mediated by gp120 and CD4 interaction which is an important step in the HIV infection process. Briefly, this assay measures the fusion of two cell lines, HeLa-CD4-LTR-β-Gal (catalog #1294) and HL2/3 cells (catalog #1299), obtained from the AIDS Research Reagent and Reference Program (Rockville, MD). HeLa-CD4-LTR-β-Gal cells were plated at a density of 5 x 103 cells per well in a volume of 50 µL, with 50 µL of nine serial half-logarithmic dilutions of compound in triplicate for one hour at 37 ºC/5% CO2. Following the incubation, 100 µL of HL2/3 cells were added to the plates. The cultures were incubated for an additional 48 hours at 37 °C/5% CO2. Following the incubation, efficacy plates were evaluated for β-galactosidase production using a chemiluminescent substrate and toxicity plates were stained with XTT to evaluate cell viability. In these studies, cytotoxicity was also evaluated (TC50). Test materials were derived by measuring the reduction of the tetrazolium dye XTT (2,3-bis(2-methoxy-4-nitro-5-sulfophenyl)-5- [(phenylamino)carbonyl]-2H-tetrazolium hydroxide). XTT in metabolically active cells is metabolized by the mitochondrial enzyme NADPH oxidase to a soluble formazan product. XTT solution was prepared daily as a stock of 1 mg/mL in RPMI-1640 without additives. Phenazine methosulfate (PMS) solution was prepared at 0.15 mg/mL in DPBS and stored in the dark at -20°C. XTT/PMS stock was prepared immediately before use by adding 40 µL of PMS per mL of XTT solution. Fifty µL (50 µL) of XTT/PMS was added to each well of the plate and the plate incubated for 4 hours at 37 °C. The 4 hour incubation has been empirically determined to be within the linear response range for XTT dye reduction with the indicated numbers of cells for each assay. The plates were sealed and inverted several times to mix the soluble formazan product and the plate was read at 450 nm (650 nm reference wavelength) with a Molecular Devices SpectraMax Plus 38496 well plate format spectrophotometer. In this assay 5 Ints and 4 conjugates were run with 2 control compounds (Chicago Sky Blue (CSB) and Enfuvirtide) know to prevent cell fusion mediated by binding of viral gp120 to the host cell receptor (CD4). CSB and Enfuvirtide had EC50 values of 781 and 358 nM, indicating moderate inhibition of cell fusion. In contrast, the 5 Ints tested ranged from <0.0051 to 70.1 nM. Collectively, the Ints tested in this assay show significantly more activity than Enfuvirtide, an approved HIV therapeutic. The four conjugates also demonstrated acceptable activity with the exception of Conjugate 14. Lastly, no test articles showed cytotoxicity at the concentrations tested in this study. Table 9. Activity of lead compounds in a cell fusion assay (EC50) and cytotoxicity (TC50)
Example 42. Synthesis of Conjugates via Click Conjugation Conjugates were prepared according to the general conjugation procedure described in Example 8, using the corresponding alkyne derivatized small molecule intermediate and Fc domain as shown in Table 10. Table 10. Conjugates and properties
Example 43. Synthesis of Conjugates – Tetrafluorophenyl ester General procedure for conjugation using tetrafluorophenyl ester: A solution of Fc in PBS buffer (pH = 7.4) and DMF was treated with a solution of tetrafluorophenyl ester dissolved in DMF. The pH was adjusted to ~7.5 to 8.0 with borate buffer (pH 8.5 ). The mixture was then gently rocked at room temperature. Maldi TOF after 3 hours shows an average DAR of 3 to 5. The crude conjugate was purified by dialysis in arginine buffer (200 mM Arginine, 120 mM NaCl , 1% Sucrose pH 6.0). Yields are typically 30% to 70%. DAR is determined by Maldi TOF of the purified conjugates and is typically 2 to 5. The yields and properties for conjugates synthesized using this general procedure are listed in Table 11 below. Table 11. Conjugates and properties
Example 44. Synthesis of Conjugates – Trifluorophenyl ester General procedure for conjugation using trifluorophenyl ester: A solution of Fc in PBS buffer (pH = 7.4) and DMF was treated with a solution of trifluorophenyl ester dissolved in DMF. The pH was adjusted to ~8.0 to 9.5 with borate buffer (pH 8.5-9.5). Then the mixture was gently rocked at room temperature. Maldi TOF after 3 hours shows an average DAR of 3 to 5. The crude conjugate was purified by dialysis in arginine buffer (200 mM Arginine, 120 mM NaCl , 1% Sucrose pH 6.0). Yields are typically 30% to 70%. DAR is determined by Maldi TOF of the purified conjugates and is typically 2 to 5. The yields and properties for conjugates synthesized using this general procedure are listed in Table 12 below. Table 12. Conjugates and properties
Example 45. Screening of HIV antiviral conjugates in an in vitro cell fusion inhibition assay Activity of HIV lead compounds was determined by ImQuest Biosciences (Frederick, MD) using an assay designed to measure the inhibition of cell-cell fusion mediated by gp120 and CD4, a key step in the HIV infection process. Briefly, this assay measures the fusion of two cell lines, HeLa-CD4-LTR-β-Gal (catalog #1470) and HL2/3 cells (catalog #1294), obtained from the NIH AIDS Research Reagent and Reference Program (Rockville, MD). HeLa-CD4-LTR-β-Gal cells were plated at a density of 5 x 103 cells per well in a volume of 50 µL, with 50 µL of nine serial half-logarithmic dilutions of compound in triplicate for 1 h at 37ºC/5% CO2. Following the incubation, 100 µL of HL2/3 cells were added to the plates. The cultures were incubated for an additional 48 h at 37ºC/5% CO2. Following the incubation, efficacy plates were evaluated for β-galactosidase production using a chemiluminescent substrate and buffer (ThermoFisher). Briefly, all media was removed from the efficacy plates and replaced with 50 µL of DPBS. Fifty microliters (50 µL) of Gal-Screen substrate diluted 1:25 in Gal Screen Buffer was added to all wells of the plate. The plate was incubated for 90 min at room temperature. Following the incubation, the contents of the wells were transferred to a clear bottom plate. The plate was covered and chemiluminescence was detected using a Microbeta Scintillation Counter. In these studies, cell viability was also evaluated. The concentration required to induce 50% toxicity (TC50) was derived by measuring the reduction of the tetrazolium dye XTT (2,3-bis(2-methoxy-4- nitro-5-sulfophenyl)-5-[(phenylamino)carbonyl]-2H-tetrazolium hydroxide). XTT in metabolically active cells is metabolized by the mitochondrial enzyme NADPH oxidase to a soluble formazan product. XTT solution was prepared daily as a stock of 1 mg/mL in RPMI-1640 without additives. Phenazine methosulfate (PMS) solution was prepared at 0.15 mg/mL in DPBS and stored in the dark at -20°C. XTT/PMS stock was prepared immediately before use by adding 40 µL of PMS per mL of XTT solution. Fifty µL (50 µL) of XTT/PMS was added to each well of the plate and the plate incubated for 4 h at 37°C. The 4 h incubation has been empirically determined to be within the linear response range for XTT dye reduction with the indicated numbers of cells for each assay. The plates were sealed and inverted several times to mix the soluble formazan product and the plate was read at 450 nm (650 nm reference wavelength) with a Molecular Devices SpectraMax Plus 38496 well plate format spectrophotometer. In this assay, 11 conjugates were evaluated in parallel with 2 control compounds, including Chicago Sky Blue (CSB; “Control 1”) and T-20 (“Control 2”), which inhibit cell fusion by binding HIV
glycoprotein gp120 and preventing interaction with the host cell receptor (CD4). CSB and T-20 were active at the expected concentrations with EC50 values of 727 ng/mL and 158 nM, respectively (Table 13). Each of the conjugates were evaluated for the ability to inhibit HIV-1-induced cell to cell fusion. The EC50 values defined in the assays ranged from 7.06 nM 915 nM (Table 13). Compounds Conjugate 29a, Conjugate 30a, and Conjugate 34 were the most active with EC50 values of 7.06 nM, 10.5 nM and 14.9 nM, respectively. Compounds Conjugate 5b, Conjugate 32, and Conjugate 36 were slightly less active than the most active compounds with EC50 values ranging from 22.8 nM to 43.6 nM. The remaining compounds had EC50 values ranging from 144 nM to 915 nM. All of the compounds were non-toxic up to the highest concentration evaluated (2000 nM with the exception of 1800 nM for compound Conjugate 15) (Table 13). Table 13. Evaluation of inhibition of cell to cell fusion in HeLa-CD4-LTR-β-Gal and HL2/3 cells. All compounds inhibited HIV-induced cell to cell fusion with EC50 values ranging from 7.06 nM to 915 nM.
“ND” – Therapeutic index (TI) could not be determined from EC50 and TC50 values Example 46. Screening of HIV antiviral small molecules and conjugates in an in vitro cell fusion inhibition assay Activity of HIV lead compounds was determined by using an assay designed to measure the inhibition of cell-cell fusion mediated by gp120 and CD4, a key step in the HIV infection process. Briefly, this assay measures the fusion of two cell lines, HeLa-CD4-LTR-β-Gal (catalog #1470) and HL2/3 cells (catalog #1294), obtained from the NIH AIDS Research Reagent and Reference Program (Rockville, MD). HL2/3 cells were plated at a density of 2 x 104 cells per well in a volume of 50 µL, with 50 µL of nine serial half-logarithmic dilutions of compound in triplicate for 1 h at 37ºC/5% CO2. Following the incubation,
100 µL of HeLa-CD4-LTR-β-Gal cells were added to the plates. The cultures were incubated for an additional 48 h at 37ºC/5% CO2. Following incubation, plates were evaluated for β-galactosidase production using a chemiluminescent substrate and buffer (ThermoFisher). Briefly, all media was removed from the efficacy plates and replaced with 100 µL of DPBS.100 microliters (100 µL) of Gal- Screen substrate diluted 1:25 in Gal Screen Buffer was added to all wells of the plate. The plate was incubated for 90 min at room temperature. Following the incubation, the contents of the wells were transferred to a clear bottom plate. The plate was covered and chemiluminescence was detected using an EnSpire multimode plate reader (PerkinElmer). In this assay, 58 compounds (26 small molecules and 32 conjugates) were evaluated in parallel with 2 control compounds, including Chicago Sky Blue (CSB) and Temsavir (“Control 3”), known to prevent cell fusion mediated by binding of viral gp120 to the host cell receptor (CD4). CSB and Temsavir were active at the expected concentrations with EC50 values of 510 and 1.05 nM, respectively (Table 14). Int-44 and Int-67 were the most potent compounds with sub-nM EC50s (Table 14). Several molecules had single-digit nM potency, including Temsavir, Int-39, Int-31, Int-55, Int-63, Int-59, Int-62, Int- 33, Int-51, Int-72, Int-50, Int-43, and Int-68. Table 14. Evaluation of inhibition of cell to cell fusion in HeLa-CD4-LTR-β-Gal and HL2/3 cells. All small molecule compounds inhibited HIV-induced cell to cell fusion with EC50 values ranging from 1.1E-05 to 5.1E+02 nM.
Each of the conjugates were evaluated for the ability to inhibit HIV-1-induced cell to cell fusion. The average EC50 values defined in the assays ranged from 1.9E-01 to 6.0E+03 nM (Table 15). Conjugate 14b was the most potent conjugate with an EC50 value of 1.99E-01 nM (Table 15). Several conjugates had single-digit nM potency, including Conjugate 27, Conjugate 29a, Conjugate 12b, Conjugate 37a, Conjugate 5b, and Conjugate 30a. The remaining compounds had EC50 values ranging from 1.14E+01 nM to 6.02E+03 nM (Table 15). Table 15. Evaluation of inhibition of cell to cell fusion in HeLa-CD4-LTR-β-Gal and HL2/3 cells. All conjugates inhibited HIV-induced cell to cell fusion with EC50 values ranging from 1.9E-01 to 6.0E+03 nM.
DAR, drug-to-antibody ratio Example 47.7-day mouse PK study comparing IV administration of Conjugate 5b and Fc control (SEQ ID NO: 73) at 5 mg/kg Mouse PK studies were performed using male BALB/c mice 6 weeks of age (n = 2 mice/group). Mice were injected intravenously (IV) via the tail vein with 5 mg/kg of test article (5 mL/kg dose volume). Animals were housed under standard IACUC approved housing conditions. At indicated times (0.25, 1, 2, 4, 24, 48, 120, 144, and 168 h), animals were non-terminally bled (retro-orbital, cheek, or by tail vein) with blood collected in K2EDTA tubes to prevent coagulation. Collected blood was centrifuged (2,000 x g, for 10 min) and plasma withdrawn for analysis of test article concentrations over time. The Fc plasma concentrations at each time point were measured by Fc-capture sandwich ELISA as follows. Nunc Maxisorp 96-well plates (cat no.12-565-136, Fisher Scientific) were coated overnight at 4 ^C with 0.1 µg/100 µL/well of goat anti-human IgG (Fcγ fragment specific; cat no.109-005-098, Jackson Immunoresearch) in carbonate buffer (cat no. C3041, MilliporeSigma). Plates were washed 5x with 300 µL/well PBST and blocked with 200 µL/well 5% non-fat dry milk (cat no.9999S, Cell Signaling) in PBST for 1 h at room temperature with shaking. Three-fold serial dilutions of the plasma samples were plated at 100 µL/well and incubated at room temperature for 2 h with shaking (sample diluent: 2.5% non-fat dry milk in PBS 0.025% Tween 20 + naïve mouse plasma final concentration of 1:900). Compound standard curves ranging from 0.03 to 55 ng/mL in duplicate, were run on each plate. Following the 2 h incubation, plates were washed 5x with 300 µL/well PBST. Conjugate bound to Fc on the plates was then probed with 100 µL/well of HRP conjugated anti-human IgG Fc F(ab’)2 (cat no.709-036-098, Jackson Immunoresearch) diluted 1:2,000 in sample diluent for 1 h at room temp with shaking. Plates were then washed 8x in 300 µL/well PBST and developed with 100 µL/well TMB substrate reagent (cat no.555214, BD) for 7-8 minutes. The reaction was stopped with 100 µL/well 1N H2SO4 and the absorbance read at 450 nm with an EnSpire multimode plate reader (PerkinElmer). Test article in plasma samples was interpolated using GraphPad Prism Version 8 following nonlinear regression analysis (Sigmoidal, 4PL analysis) of the standard curves. The resulting mean plasma concentrations were then used to calculate the total AUC for each plasma concentration-time profile.
The 7-day mouse PK profiles of Conjugate 5b and Fc control (SEQ ID NO:73) are shown in FIG. 15. Following IV administration of compound at 5 mg/kg, the average plasma exposure levels of Conjugate 5b were noninferior to SEQ ID NO:73, with AUCs of 2922 and 1766, respectively (FIG.15). Example 48. Synthesis of Int-47
Propargyl-PEG4-mesylate (1 g, 5.74 mmol), N-Boc-diamino propane (2.7 g, 8.60 mmol), and DIPEA (2 mL, 11.48 mmol) were stirred together in DMF (10 mL) at 70°C for 4 hours. The solvent was reduced by half on the rotary evaporator. The mixture was purified by reversed phase HPLC (0-80% ACN in DI water, 0.1% TFA modified, 30 minute gradient). The pure fractions were pooled and concentrated to afford the diamine as a light amber oil. Yield 1.9 g, 84%. Ion found by LC/MS [(M/2)+H]+ = 389.0.
Step b.
HATU (0.54 g, 1.41 mmol) was added to a stirring mixture of the intermediate described in the previous step (0.50, 1.29 mmol), 3-hydroxy-cyclobutanoic acid(cis/trans mixture) (0.16 g, 1.41 mmol) and DIPEA (0.9 mL, 5.1 mmol) in DMF (3 mL). The mixture was stirred at ambient temperature for 45 minutes, then purified directly by reversed phase HPLC (0-80% ACN in DI water, 0.1% TFA modified, 30 minute gradient). The pure fractions were pooled and concentrated to afford the diamine as a light amber oil. Ion found by LC/MS (M+H)+ = 487.0. The Boc-protected amino amide was stirred in a 5/1 mixture 4N HCl in dioxane and methanol (10 mL) for 45 minutes. The mixture was concentrated and the HCl salt was dried under high vacuum to afford the product as a clear oil. Yield 550 mg, 86%, 2 steps. Ion found by LC/MS [(M/2)+H]+ = 387.0. Step c.
HATU (52 mg, 0.12 mmol) was added to a stirring mixture of the amine HCL salt intermediate from step b. of this example, (45 mg, 0.11 mmol), and triazole carboxylic acid, (55 mg, 0.11 mmol, described in example 5 of Int- 2), and DIPEA (0.74 mL, 0.43 mmol) in DMF (2 mL). The mixture was stirred at ambient temperature for 45 minutes, then purified directly by reversed phase HPLC (0-80% ACN in DI water, 0.1% TFA modified, 30 minute gradient). The pure fractions were pooled and lyophilized to afford the product as a white solid. Yield 40 mg, 43%. Ion found by LC/MS [M+H]+ = 880.4. Step d.
Copper sulfate (0.4 mg, 0.0028 mmol), sodium ascorbate (17 mg, 0.085mmol), and BTTA (2 mg, 0.006 mmol), were pre-mixed in DI water (0.5 mL). This solution was added to a mixture of the alkyne intermediate, from step d of this example (25 mg, 0.28 mmol) and azido-PEG4-trifluorophenol-ester (13
mg, 0.31 mmol), in a mixture of DMF/H2O (3:1, 1 mL) , then cooled to 0°C via and ice-water bath. The ice bath was removed and the reaction was stirred for 25 minutes.1 drop of glacial acetic acid was added and the mixture was purified by reversed phase HPLC (0-80% ACN in DI water, 0.1% TFA modifier, 30 minute gradient). The pure fractions were pooled and lyophilized to afford the title compound as a white solid. Yield 20 mg, 54 %. Ion found by LC/MS [(M/2)+H]+ =651.2. Example 49. Synthesis of Int-98
A stirring solution of Cbz-piperazine (2.0g, 9.08 mmol), alkyl chloride (2.08 g, 13.62 mmol), potassium iodide (0.754g, 4.54 mmol), and potassium carbonate (3.76 g, 27.24 mmol), dissolved in acetonitrile (12 mL) was heated in a 65C oil bath for 24 h. Reaction progress was monitored by LCMS. The crude reaction mixture was filtered, concentrated and purified by RPLC (5% to 100% ACN/water, no TFA). Yield 2.56g, 63%. LCMS: [M + H]+ = 337.2
Step b.
Product from the previous step (2.56g, 5.68 mmol) was dissolved in acetonitrile (5mL) then treated with 1N HCl (20 mL). Reaction progress was monitored by LCMS. After 3h the reaction was concentrated and used in the next step without further purification. Yield 1.77g, 95%. LCMS: [M + H2O]+ = 309.2. Step c.
A solution of product from the previous step (0.350g, 1.119 mmol), disubstituted pyrolidine (0.242g, 1.119 mmol), and DIEA (0.097 mL, 0.559 mmol) dissolved in DCM (5 mL), were treated with sodium triacetoxyborohydride (0.356, 1.68 mmol) at room temperature. Reaction progress was monitored by LCMS. After 12h the reaction was concentrated and purified by RPLC (5% to 100% ACN/water with 0.1% TFA). Yield 0.804g, 50%. LCMS: [M + H]+ = 491.2 Step d.
A mixture of product from the previous step (0.404g, 0.668 mmol), and 5% Pd/C (0.5g) was dissolved in methanol (5mL), then vacuum flushed with hydrogen from a balloon and stirred fo 1h. The reaction was filtered through a pad of celite, concentrated, and used in the next step without further purification. Yield 0.342g, 88%. LCMS: [M + H]+ = 357.3 Step e.
Product from the previous step (0.342g, 0.585 mmol), propargyl-Peg4-mesylate (0.236g, 0.761 mmol), and potassium carbonate (0.404g, 2.92 mmol), dissolved in acetonitrile (5 mL) was stirred overnight in a 65C oil bath. Reaction progress was monitored by LCMS. The crude reaction was filtered, concentrated, and purified by RPLC (5% to 60% ACN/water). Yield 0.16 g, 34%. LCMS: [M + H]+ = 571.4
Step f.
A solution of product from the previous step (0.160g, 0.200 mmol) dissolved in acetonitrile (2mL) was treated with concentrated aq HCl (8 mL). LCMS after 5 min shows complete deprotection. The solution was concentrated, stored under high vacuum and used in the next step without further purification. LCMS: [M + H]+ = 471.4 Step g.
A solution of product from the previous step (0.179g, 0.137 mmol), and triazole-carboxylic acid (0.050g, 0.098 mmol, synthesis described in a previous example) dissolved in DMF (3 mL), was treated with DIEA (0.511 mL, 2.93 mmol), HOBT (0.026 g, 0.196), and HATU (0.074 g, 0.196 mmol). LCMS after 30min shows complete consumption of the triazole-carboxylic acid. The reaction was concentrated and purified by RPLC (5% to 100% ACN/water with 0.1% TFA). Yield 0.094g, 81%. LCMS: [M + H]+ = 964.4. Step h.
A solution of product from the previous step (0.094g, 0.079 mmol), and azido-Peg4- trifluorophenol ester (0.043g, 0.103 mmol) dissolved in DMF (1 mL), was treated with a solution of copper(II) sulfate (0.0031 g, 0.0197 mmol), sodium ascorbate (0.0156 g, 0.0788 mmol), and THPTA (0.0137g, 0.0315 mmol) dissolved in water (0.5 mL), and stirred room temperature for 30 min. Reaction progress was monitored by LCMS. The product was purified by RPLC (10% to 100% ACN/water with 0.1% TFA). Yield 0.106g, 78%. LCMS: [(M + 2H)/2]+ = 693.4.
The title compound was prepared analogously, where 4-chloro-1,1-dimethoxy butane was replaced with 3-chloro-1,1-diethoxypropane in step a of that procedure. LCMS: [(M + 2H)/2]+ = 686.4 Example 50. Synthesis of Int-106
Step a.
To a mixture of [(3S, 5S)-5-(hydroxymethyl)-3-pyrrolidinyl] carbamic acid tert-butyl ester (432.6 mg, 2 mmol) and K2CO3 (414.6 mg, 3 mmol) was added anhydrous THF (2 ml), DMF (1 ml) and propargyl bromide (285.6 mg, 2.4 mmol). The resulting mixture was stirred at room temperature for 20 hours. The salt was filtered off and washed with acetonitrile. The filtrate was concentrated by rotary evaporation and purified by HPLC (2 to 15% acetonitrile and water, using 0.1% TFA as modifier). Yield 238.7 mg, 32.4%. Ion found by LCMS: [M + H]+ = 255. Step b.
To a solution of step-a product (250 mg, 0.652 mmol) in acetonitrile (3 ml), was added 6N HCl aqueous solution (0.5 ml) and water (3 ml). The resulting mixture was heated at 70⁰C for 1.5 hours. It was
then cooled to room temperature, frozen, and lyophilized. Yield 145.2 mg, quantitative yield. Ion found by LCMS: [M + H]+ = 155.0. Step c.
A mixture of triazole acid (51.2 mg, 0.1 mmol) and the step-b product (22.8 mg, 0.1 mmol) was dissolved in anhydrous DMF (1 ml) and Et3N (606 mg, 6 mmol) by gently heated with a heat gun. After cooled to room temperature, the resulting solution was mixed with HATU (57 mg, 0.15 mmol) and stirred for 1 hour. Addition HATU (19 mg, 0.05 mmol) was then added, followed by the step-b product ( 11.4 mg, 0.05 mmol) in DMF (0.5 ml). The reaction was continued for 30 minutes, then directly purified by HPLC: 5 to 50% acetonitrile and water, using 0.1% TFA as modifier. Yield 43 mg, 49.1%. Ions found by LCMS: [M + H]+ = 647.8, [(M + 2H)/2]+ = 324.6. Step d.
To a solution of azido-PEG8-acid (1 g, 2.14 mmol) in anhydrous DCM (4 ml) was added 2,4,6- trifluorophenol (633.9 mg, 4.28 mmol) and EDC HCl (631 mg, 3.21 mmol). The resulting mixture was stirred at room temperature overnight. It was then purified by RPLC (100 g, 5 to 100% acetonitrile and water, using 0.1% TFA as modifier). Yield 831 mg, 65%. Ion found by LCMS: [M- N2]+ = 570.2. Step e.
A solution of the step-c product (35.8 mg, 0.0409 mmol) in DMF (1 ml) was mixed with TFA (20 µl) and cooled in an ice-water bath. It was then mixed with the step-d product (29.3 mg, 0.0491 mmol) and a premixed solution of THPTA (3.5 mg, 0.008 mmol) and sodium ascorbate (29 mg, 0.146 mmol) in water (0.5 ml), followed by Cu2SO4 (1.6 mg, 0.01 mmol). The ice-water bath was removed, and the resulting mixture was stirred for 40 minutes. It was then directly purified HPLC (5 to 70% acetonitrile and water, using 0.1% TFA as modifier). Yield 46.8 mg, 77.8%. Ions found by LCMS: [M + H]+ = 1244.6, [(M + 2H)/2]+ = 622.8. 30
Example 51. Synthesis of Int-108
The title compound was prepared as shown above where propargyl bromide in step-a was replaced with bromo-1-butyne and the reaction was heated at 70C⁰ for 1 day (Yield 250 mg, 32.6%, Ion found by LCMS: [M + H]+ = 661.8). Yield 22.3 mg, 47.9%. Ions found by LCMS: [M + H]+ = 1260.4, [(M + 2H)/2]+ = 630.4.
The title compound was prepared analogously to the synthesis of Int-108, where propargyl bromide in step-a was replaced with 6-chloro-1-hexyne and the reaction was heated at 70C⁰ for 2 days (Yield 909.3 mg, 73.7%, ion found by LCMS: [M + H]+ = 689.8). Yield 31.3 mg, 73.9%. Ions found by LCMS: [M + H]+ = 1287.5, [(M + 2H)/2]+ = 644.4.
Example 52. Synthesis of Int-115
To a cold solution of dimethyl 2-(3-((tert-butoxycarbonyl)amino)propyl)malonate (500 mg, 1.72 mmol, prepared as described in Angewandte Chemie, International Edition (2018), 57(22), 6527-6531) in DMF (6 ml) was added sodium hydride (72.5 mg, 1.81 mmol), followed N-(bromomethyl)phthalimide (0.45 mg, 1.9 mmol). The resulting suspension was stirred for 2.5 h at 0°C, and partitioned between water (6 ml) and dichloromethane (40 ml). The organic layer was separated, washed with water and sat. aq. NaHCO3, dried over anhydrous NaSO4 and concentrated. The residue was purified by flash chromatography (hexane/EtOAc 40%) to afford product. Yield 652 mg, 84 %. Ion found by LCMS: [M + H]+ = 449.1. Step b.
To a solution of step-a product (300 mg, 0.66 mmol) in methanol (3 ml) was added hydrazine hydrate (0.18 ml, 3.34 mmol). The solution was stirred under reflux for 4 hrs. It was then cooled, filtered, concentrated and purified by RPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier) to afford product. Yield 160 mg, 75 %. Ion found by LCMS: [M + H]+ = 319.1.
Step c.
To a solution of step-b product (160 mg, 0.37 mmol) in DMF (4 ml) was added propargyl-PEG4- NHS ester (158 mg, 0.44 mmol) and DIPEA (0.387 ml, 2.22 mmol). The reaction solution was stirred at r.t overnight, concentrated and purified by RPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier). Yield 142 mg, 68 %. Ion found by LCMS: [M + H]+ = 561.2. Step d.
To a cooled solution of step-c product (142 mg, 0.25 mmol) in THF (2.5 ml) was added LiBH4 (23.2 mg, 1.0 mmol). The reaction was stirred at 0oC to room temperature for 6 hrs, and then quenched with water, concentrated and purified by RPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier) to give product. Yield 91 mg, 69 %. Ion found by LCMS: [M + H]+ = 505.3. Step e.
The step-d product (91 mg, 0.17 mmol) wascooled and dissolved in HCl (2 ml, 4N in dioxane) and reaction was stirred for 2 hrs. The solution was concentrated to an oil and used without purification. Yield of HCl salt 72 mg. Ion found by LCMS: [M + H]+ = 504.2. Step f.
To a solution of triazole-carboxylic acid (77.4 mg, 0.151 mmol) in DMF (1 mL) was added HSPyU (124 mg, 0.30 mmol) and DIPEA (0.15 ml, 0.90 mmol) and the mixture was stirred for 30 min. Step-e product (72 mg, 0.181 mmol) was then added to above solution and LCMS after 30min shows complete consumption of the triazole-carboxylic acid. The reaction was concentrated and purified by semi- preparative HPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier) to give product. Yield 39 mg, 28%. LCMS: [M + H]+ = 898.4.
Step g.
A solution of step-f product (39 mg, 0.043 mmol) and azido-Peg4-trifluorophenol ester (23.8 mg, 0.056 mmol) in DMF (0.6 mL), was treated with a solution of copper(II) sulfate (1.73 mg, 0.010 mmol), sodium ascorbate (8.6 mg, 0.043 mmol), and THPTA (9.43 mg, 0.021 mmol) in water (0.3 mL). The reaction stirred at room temperature for 30 min. The product was purified by semi-preparative HPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier). Yield 29 mg, 50%. LCMS: [(M + 2H)/2]+ = 660.1 Example 53. Synthesis of Int-116
To a mixture of aldehyde (0.175 g, 0.84 mmol) and piperazine (0.389 g, 1.69 mmol) in DCE (25 mL) was added sodium triacetoxyborohydride (1.07 g, 5.06 mmol) followed by acetic acid (0.405 g, 6.75 mmol). The solution was stirred for 1h then more aldehyde (0.175 g, 0.84 mmol) was added. The solution was then stirred for 15h. Reaction progress was monitored by LCMS. The crude reaction mixture was
partitioned between DCE and 0.5N NaOH (25 mL). The aqueous layer was discarded and the excess DCE was removed and the crude mixture was purified by flash chromatography (0% to 40% 1:1 MeOH:NH4OH/EtOAc). Yield: 0.534 g, 75%. LCMS: [M + H]+ = 422.2. Step b.
To the neat piperazine alcohol from the previous step (0.534 g, 1.26 mmol) was added HCl in dioxane (12 mL). The mixture was stirred for 1h. Reaction progress was monitored by LCMS. The excess dioxane was removed and the crude material was used in the next step without additional purification. Yield: 0.403 g, 99%. LCMS: [M + H]+ = 322.2. Step c.
To a solution of product from the previous step (0.403 g, 1.26 mmol) in DMF (1 ml) and Hunigs Base (20 mL) was added the propargyl-PEG4-mesylate (0.588 g, 1.89 mmol). The solution was stirred at 70 °C for 72h. Reaction progress was monitored by LCMS. The excess solvent was removed and the crude mixture was purified by flash chromatography (0% to 40% 1:1 MeOH:NH4OH/EtOAc). Yield: 400 mg, 59%. LCMS: [M + H]+ = 536.4. Step d.
To a solution of free amino piperazine alcohol from the previous step (0.400 g, 0.75 mmol) in TFA (3 mL) was added thioanisole (2.31 g, 18.67 mmol) followed by dropwise addition of TMSBr (0.571 g, 3.73 mmol). The solution was stirred for 30 min. Reaction progress was monitored by LCMS. The excess solvents were removed and the crude material was washed with hexanes (2x10 mL). The crude mixture was then purified by flash chromatography (0% to 60% 1:1 MeOH:NH4OH/EtOAc). Yield: 140 mg, 46%. LCMS: [M + H]+ = 402.3. Step e.
To a solution of the free amine from the previous step (0.040 g, 0.10 mmol) and triazole- carboxylic acid (0.050 g, 0.10 mmol) in DMF (2 mL) was added Hunigs base (0.126 g, 1.0 mmol),
followed by HATU (0.148 g, 0.39 mmol) and HOBt (0.066 g, 0.48 mmol). LCMS after 1 hr showed complete consumption of the triazole-carboxylic acid. The mixture was directly loaded on the column and purified by semi-preparative HPLC (5% to 100% ACN/water). Yield 0.030 g, 33%. LCMS: [M + H]+ = 895.4. Step f.
A solution of product from the previous step (0.030 g, 0.033 mmol), and azido-PEG4- trifluorophenol ester (0.018 g, 0.043 mmol) dissolved in DMF (1.0 mL), was treated with a solution of copper(II) sulfate (0.001 g, 0.008 mmol), and sodium ascorbate (0.006 g, 0.033 mmol), dissolved in water (1.0 mL), and stirred at room temperature for 30 min. Reaction progress was monitored by LCMS. The mixture was directly loaded on the column and purified by semi-preparative HPLC (5% to 100% ACN/water). Yield 0.024 g, 51%. LCMS: [(M + 2H)/2]+ = 658.9.
The title compound was prepared analogously to the synthesis of Int-116 where tert-butyl 2- (hydroxymethyl)piperazinecarboxylate was used instead of tert-butyl 3-(2- hydroxyethyl)piperazinecarboxylate in Step a of that procedure. Ion found by LCMS: [(M + 2H)/2]+ = 652.0
Example 54. Synthesis of Int-119
Step a. A mixture of t-butyl(5-formyl-2,2-dimethyl-1,3-dioxan-5-yl) carbamate (1.9 g, 7.3 mmol) and ammonium acetate (5.6 g, 73.2 mmoL)in methanol (30 mL) were stirred at ambient temperature for 1 hour. Sodium cyanoborohydride (1.2 g, 18.3 mmol) was added and the reaction was stirred for 12 hours then concentrated to a volume of approx.5 mL. The mixture was purified by reversed phase HPLC (0- 60% ACN in DI water, 0.1% TFA modifier, 30 minute gradient) and the pure fractions were concentrated to afford the amine-TFA salt as a clear oil. Ion found by LC/MS [M-boc+H]+ = 161.2. The amine was taken up in acetonitrile (25 mL) and DIPEA (2.5 mL, 14.7 mmol) and Fmoc-OSu (2.7 g, 8.1 mmoL was added and the mixture was stirred for 1 hour, diluted with water (50 mL) and extracted into ethyl acetate (3x, 30 mL0. The combined organic extracts were dried over sodium sulfate and concentrated. The crude material was purified by reversed phase HPLC (0-90% ACN in DI water, 0.1% TFA modifier, 30 minute gradient) and the pure fractions were concentrated to afford the title compound as a white solid. Yield 530 mg, 45%, 2 steps. Ion found by LC/MS [M+Na]+ = 505.2.
Step b.
The intermediate from step a. of this example (530m g, 1.1 mmol) was stirred in 4N HCl in dioxane (20 mL) for 30 minutes. The mixture was concentrated and dried under high vacuum. Ion found by LC/MS [M+H]+ = 343.2. The amine-HCl salt was dissolved in methanol (30 mL) and CBZ-amino- propanal (227 mg, 1.1 mmol) was added and the mixture was stirred for 30 minutes at which point sodium cyanoborohydride (140 mg, 2.2 mmol) was added and the mixture was stirred for 12 hours at ambient temperature. The mixture was concentrated and purified by reversed phase HPLC (0-60% ACN in DI water, 0.1% TFA modifier, 30 minute gradient) and the pure fractions were concentrated to afford the CBZ protected amine intermediate, ion found by LC/MS [M+H]+ = 534.2. The intermediate was stirred in methanol in the presence of 5% Pd/C (75 mg) for 2 hours. The mixture was filtered and concentrated to afford the free amine which was carried forward without further purification. Yield 438 mg, 61%, 3 steps. Ion found by LC/MS [M+H]+ = 400.2 Step c.
HATU (67 mg, 0.18 mmol) was added to a stirring mixture of the intermediate from step b. of this example (76 mg, 0.19 mmol), and the triazole carboxylic acid (75 mg, 0.15 mmol described in Example 5 of Int- 2), and DIPEA (0.10 uL, 0.58 mmol) in DMF (2 mL). The reaction was stirred for 45 minutes and then purified by reversed phase HPLC (0-95% ACN in DI water, 0.1% TFA modifier, 30 minute gradient). The pure fractions were pooled and concentrated to afford the intermediate as a clear oil. Yield 75 mg, 58%. Ion found by LC/MS [M+H]+ = 893.4. Step d.
DBU (60 uL, 0.41 mmol) was added to a stirring mixture of the intermediate described in step c. of this example (75 mg, 0.084 mmol) in DMF (2 mL) and the mixture was stirred for 20 minutes. Propargyl-PEG4-carboxylic acid (33 mg, 0.13 mmol) was added followed by HATU (42 mg 0.11 mmol)
and the reaction was stirred for 45 minutes. The mixture was purified by reversed phase HPLC (0-95% ACN in DI water, 0.1% TFA modifier, 30 minute gradient). The pure fractions were pooled and concentrated to afford the intermediate as a clear oil. Yield 27 mg, 35%. Ion found by LC/MS [M+H]+ = 913.4. Step e.
Copper sulfate (0.5 mg, 0.003 mmol), sodium ascorbate (17 mg, 0.089 mmol), and BTTA (2.5 mg, 0.006 mmol), were pre-mixed in DI water (0.5 mL) and then added to a mixture of the alkyne intermediate (27 mg, 0.029mmol), and azido-peg4-trifluorophenol-ester (12 mg, 0.29 mmol), in DMF/H2O (3:1, 1 mL) , cooled to 0°C via and ice water bath. The ice bath was removed and the reaction was stirred for 25 minutes.1 drop of glacial acetic acid was added and the mixture was purified by reversed phase HPLC (0-80% ACN in DI water, 0.1% TFA modifier, 30 minute gradient). The pure fractions were pooled and lyophilized to afford the title compound as a white solid. Yield 17 mg, 43%. Ion found by LC/MS [(M/2)+H]+ = 667.8. Example 55. Synthesis of Int-121
Step a.
A mixture of (5-amino-2,2-dimethyl-1,3-dioxan-5-yl)methanol (1.61 g, 10 mmol), K2CO3 (2.07 g, 15 mmol), and 3-(Boc-amino)propyl bromide (2.86 g, 12 mmol) in anhydrous dioxane (10 ml) was heated 70⁰C overnight. Additional amount of 3-(Boc-amino)propyl bromide (2.15 g, 9 mmol) was added, and the reaction was continued for 1 more day, then cooled to room temperature. The salt was filtered off and washed with acetonitrile. The filtrate was dried over Na2SO4, concentrated by rotary evaporation, and further dried under high vacuum. The crude product (4.74 g) was carried to the subsequent step without further purification. Ion found by LCMS: [M + H]+ = 319.0. Step b.
To a solution of the step-a crude product in DCM (20 ml) was added 3- [benzyloxycarbonyl)amino]-1-propanal (2.49 g, 12 mmol) and acetic acid (900 mg, 15 mmol). The resulting mixture was stirred at room temperature for 30 minutes, then sodium triacetoxyborohydride (3.15 g, 15 mmol) was added in portions. After 2 hours, additional amount of sodium triacetoxyborohydride (1.25 g, 6 mmol) was added, and the reaction was stirred overnight. MeOH (10 ml) was added, and the reaction was stirred for 20 minutes. The solid was filtered off through celite. The filtrate was concentrated by rotary evaporation and purified by RPLC (100 g, 5 to 80% acetonitrile and water). Yield 3.67 g, 72% over two steps. Ion found by LCMS: [M + H]+ = 510.0. Step c.
To a solution of the step-b product (1.05 g, 1.93 mmol) in MeOH (25 ml) was added Pd(OH)2 on carbon (850 mg). The mixture was stirred under hydrogen for 6 hours. Pd(OH)2/C was then filtered off, and the filtrate was concentrated by rotary evaporation. The product was further dried under high vacuum and carried to the subsequent step without purification. Ion found by LCMS: [M + H]+ = 376.0, [M - Boc + H]+ = 276.0.
Step d.
To a solution of the step-c product (1.25 g, 3.33 mmol) in anhydrous DMF (4 ml) was added benzyl bis(2-chloroethyl)carbamate (1.38 g, 5 mmol), triethylamine (12 ml) and NaI (150 mg, 1 mmol). The resulting mixture was heated at 75⁰C to 100⁰C overnight. After concentrating by rotary evaporation, the crude product was purified by RPLC (100 g, 5 to 100% acetonitrile and water). Yield 315 mg, 16.3%. Ions found by LCMS: [M + H]+ = 579.0, [M - Boc + H]+ = 479.0. Step e.
To a solution of the step-d product (315 mg, 0.544 mmol) in MeOH (10 ml) was added Pd(OH)2 on carbon (150 mg). The mixture was stirred under hydrogen for 2 hours. Pd(OH)2/C was then filtered off, and the filtrate was concentrated by rotary evaporation. The product was dried under high vacuum and carried to the subsequent step without purification. Ion found by LCMS: [M + H]+ = 445.0 Step f.
To a solution of the step-e product (245 mg, 0.55 mmol) in anhydrous DMF (1 ml) was added propargyl-PEG4-mesyl ester (223.5 mg, 0.72 mmol) and DIEPA (129.2 mg, 1 mmol). The resulting mixture was heated at 60⁰C for 1 day, then directly purified by RPLC (50 g, 5 to 70% acetonitrile and water). Yield 129 mg, 35.6%. Ions found by LCMS: [M + H]+ = 659.0, [(M + 2H)/2]+ = 330.2. Step g.
The step-f product (129 mg, 0.196 mmol) was dissolved in acetonitrile (2 ml) and water (0.5 ml). then 6N HCl aqueous solution (0.5 ml) was added, and the reaction mixture was heated at 70⁰C for 20 minutes. It was then concentrated by rotary evaporation, and excess HCl was further removed by azeotrope/evaporation with acetonitrile. The product was further dried under high vacuum. Yield 141.4 mg, quantitative yield. Ions found by LCMS: [M + H]+ = 519.0.
Step h.
A mixture of triazole acid (51.2 mg, 0.1 mmol) and the step-g product (61.4 mg, 0.1 mmol) were dissolved in anhydrous DMF (1 ml) and Et3N (606 mg, 6 mmol) by gently heating with a heat gun. After cooling to room temperature, the resulting solution was mized with HATU (45.6 mg, 0.12 mmol) and stirred for 1 hour. Addition HATU (19 mg, 0.05 mmol) was added, followed by the step-g product (20 mg, 0.03 mmol). The reaction was continued for 2 hours, then directly purified by HPLC: 5 to 50% acetonitrile and water, using 0.1% TFA as modifier. Yield 46.2 mg, 31.4%. Ions found by LCMS: [M + H]+ = 1011.6, [(M + 2H)/2]+ = 506.4.
A solution of product from step h (46.2 mg, 0.0314 mmol) in DMF (1 ml) was mixed with TFA (20 µl) and cooled in an ice-water bath. It was then mixed with Azido-PEG4-trifluorophenyl ester (116 mg, 0.38 mmol) and a premixed solution of THPTA (3.5 mg, 0.008 mmol) and sodium ascorbate (29 mg, 0.146 mmol) in water (0.5 ml), followed by Cu2SO4 (1.6 mg, 0.01 mmol). The ice-water bath was removed, and the resulting mixture was stirred for 40 minutes. It was then directly purified by HPLC (5 to 80% acetonitrile and water, using 0.1% TFA as modifier). Yield 53 mg, 89.3%. Ions found by LCMS: [M + H]+ = 1432.6, [(M + 2H)/2]+ = 716.8, [(M + 3H)/3]+ = 478.0.
To a solution of 2,2-dimethyl-1,3-dioxan-5-amine (524.8 mg, 4 mmol) in anhydrous THF (6 ml) was added tert-butyl (2S)-3-chloro-2-hydroxypropyl carbamate (1 g, 4.77 mmol) and K2CO3 (995 mg, 7.2 mmol). The resulting mixture was heated at 75⁰C for 3 days. The salt was filtered off, and the filtrate was dried over Na2SO4, concentrated by rotary evaporation, and further dried under high vacuum. The crude product (1.36 g) was carried to the subsequent step without further purification. Ion found by LCMS: [M + H]+ = 305.2. Step b.
To a solution of the step-a crude product in DCM (8 ml) was added 3- [(benzyloxycarbonyl)amino]-1-propanal (1.08 g, 5.2 mmol) and acetic acid (360 mg, 6 mmol). The resulting mixture was stirred at room temperature for 30 minutes, then mixed with sodium triacetoxyborohydride (1.49 g, 7 mmol). After 2 hours, MeOH (5 ml) was added, and the reaction was stirred for 20 minutes. The solid was filtered off through celite, and the filtrate was concentrated by rotary evaporation. The residue was purified by RPLC (100 g, 5 to 80% acetonitrile and water). Yield 1.51 g, 75.9% over two steps. Ion found by LCMS: [M + H]+ = 496.0.
Step c.
To a solution of the step-b product (1.51 mg, 3.03 mmol) in MeOH (20 ml) was added Pd(OH)2 on carbon (750 mg). The mixture was stirred under hydrogen for 4 hours. Pd(OH)2/C was then filtered off, and the filtrate was concentrated by rotary evaporation. The product was further dried under high vacuum and carried to the subsequent step without purification. Ion found by LCMS: [M + H]+ = 362.0. Step d.
To a solution of the step-c product (101.4 mg, 0.28 mmol) in anhydrous DMF (0.5 ml) was added propargyl-PEG4-NHS ester (121.3 mg, 0.34 mmol) and DIPEA (65 mg, 0.5 mmol). The reaction mixture was stirred at room temperature for 30 minutes. It was then directly purified by HPLC (0 to 70% acetonitrile and water, using 0.1% TFA as modifier). Yield 192 mg, 95.7%. Ion found by LCMS: [M + H]+ = 604.0. Step e.
The step-d product (192 mg, 0.268 mmol) was dissolved in acetonitrile (1 ml) and water (0.5 ml). 6N HCl aqueous solution (0.5 ml) was added, and the reaction mixture was heated at 60⁰C for 10 minutes. It was then concentrated by rotary evaporation, and excess HCl was further removed by azeotrope/evaporation with acetonitrile. The residue was further dried under high vacuum. Yield 140 mg, quantitative yield. Ion found by LCMS: [M + H]+ = 464.0. Step f.
A mixture of the triazole acid described herein (51.2 mg, 0.1 mmol) and the step-e product (53.8 mg, 0.1 mmol) was dissolved in anhydrous DMF (1 ml) and Et3N (606 mg, 6 mmol) by gently heated with a heat gun. After cooling to room temperature, the resulting solution was mixed with HATU (45.6 mg, 0.12 mmol) and stirred for 1 hour. Additional HATU (19 mg, 0.05 mmol) was added, followed by the step-e product (24 mg, 0.05 mmol) in DMF (0.5 ml). The reaction was continued for 1 more hour, then directly
purified by HPLC: 5 to 50% acetonitrile and water, using 0.1% TFA as modifier. Yield 36.3 mg, 30.6%. Ions found by LCMS: [(M + 2H)/2]+ = 956.6, [(M + 2H)/2]+ = 479.0. Step g.
A solution of the step-f product (36.3 mg, 0.0306 mmol) in DMF (1 ml) was treated with 20 µl of TFA and cooled in an ice-water bath. It was then mixed with azido-PEG4-trifluorophenyl ester (17.9 mg, 0.424 mmol) and a premixed solution of THPTA (3.5 mg, 0.008 mmol) and sodium ascorbate (29 mg, 0.146 mmol) in water (0.5 ml), followed by Cu2SO4 (1.6 mg, 0.01 mmol). The ice-water bath was removed, and the resulting mixture was stirred for 40 minutes. It was then directly purified HPLC (5 to 80% acetonitrile and water, using 0.1% TFA as modifier). Yield 42 mg, 85.4%. Ions found by LCMS: [M + H]+ = 1377.4, [(M + 2H)/2]+ = 689.4. Example 57. Synthesis of Int-126
To a solution of t-butyl piperazine-1-carboxylate (931.5 mg, 5 mmol) in anhydrous DMF (5 ml) was added DIPEA (1.29 g, 10 mmol) and propargyl-PEG4-mesyl ester (1.71 g, 5.5 mmol). The resulting mixture was heated at 70 ⁰C for 20 hours. It was then directly purified by RPLC (150 g, 0 to 40% acetonitrile and water, using 0.1% TFA as modifier). Yield 2.55 g, 98.9%. Ion found by LCMS: [M + H]+ = 401.4.
Step b.
To a solution of the step-a product (2.55 g, 4.95 mmol) in acetonitrile (5 ml) was added water (5 ml) and 6N HCl aqueous solution (4 ml). The reaction mixture was heated at 60⁰C for 2 hours, then concentrated by rotary evaporation. Excess HCl was further removed by azeotrope/evaporation with acetonitrile. The residue was further dried under high vacuum. Yield 1.88 g, quantitative yield. Ion found by LCMS: [M + H]+ = 301.0 Step c.
To a solution of the step-b product (790 mg, 2.11 mmol) in anhydrous dioxane (5 ml) was added K2CO3 (1.17 g, 8.5 mmol) and 3-(boc-amino)propyl bromide (754.8 mg, 3.17 mmol). The resulting mixture was heated at 70 ⁰C overnight. It was then purified by RPLC ( 100 g, 5 to 70% acetonitrile and water). Yield 505 mg, 52.1%. Ion found by LCMS: [M + H]+ = 458.0. Step d.
To a solution of the step-c product (505 mg, 1.1 mmol) in acetonitrile (2 ml) was added water (2 ml) and 6N HCl aqueous solution (1 ml). The reaction mixture was heated at 70⁰C for 3 hours, then concentrated by rotary evaporation. Excess HCl was further removed by azeotrope/evaporation with acetonitrile. The residue was further dried under high vacuum. Yield 516 mg, quantitative yield. Ion found by LCMS: [M + H]+ = 358.0. Step e.
To a solution of the step-d product (384.8 mg, 0.82 mmol) in anhydrous DMF (2 ml) was added K2CO3 (566.6 mg, 4.1 mmol) and 4-(boc-aminomethyl)-6-chloropyrimidine (220 mg, 0.9 mmol). The resulting mixture was heated at 70⁰C for 20 hours. It was then purified by HPLC (5 to 40% acetonitrile and water, using 0.1% TFA as modifier). Yield 390.6 mg, 52.4%. Ions found by LCMS: [M + H]+ = 565.0, [(M + 2H)/2]+ = 283.0. Step f.
To a solution of the step-e product (309.6 mg, 0.43 mmol) in acetonitrile (2 ml) was added water (2 ml) and 6N HCl aqueous solution (1 ml). The reaction mixture was heated at 70⁰C for 45 minutes, then concentrated by rotary evaporation. Excess HCl was further removed by azeotrope/evaporation with acetonitrile. The residue was further dried under high vacuum. Yield 247.6 mg, quantitative yield. Ions found by LCMS: [M + H]+ = 465.4, [(M + 2H)/2]+ = 233.0. Step g.
A mixture of triazole acid (51.2 mg, 0.1 mmol) and the step-f product (61.4 mg, 0.1 mmol) were dissolved in anhydrous DMF (1 ml) and Et3N (606 mg, 6 mmol) and gently heated with a heat gun. After cooling to room temperature, the resulting solution was treated with HATU (57 mg, 0.15 mmol), stirred for 1 hour, then treated with the step-f product (61.4 mg, 0.1 mmol). The reaction was continued for 2 more hours, then directly purified by semi-preparative HPLC: 5 to 50% acetonitrile and water, using 0.1% TFA as modifier. Yield 52.1 mg, 36.8%. Ions found by LCMS: [M + H]+ = 957.8, [(M + 2H)/2]+ = 497.4. Step h.
A solution of the step-g product (35.4 mg, 0.025 mmol) in DMF (1 ml) was treated with 40 µl of TFA, and cooled in an ice-water bath. It was then mixed with azido-PEG4-trifluorophenyl ester (12.7 mg, 0.3 mmol) and a premixed solution of THPTA (3.5 mg, 0.008 mmol) and sodium ascorbate (29 mg, 0.146 mmol) in water (0.5 ml), followed by Cu2SO4 (1.6 mg, 0.01 mmol). The ice-water bath was removed, and the resulting mixture was stirred for 40 minutes. It was then directly purified HPLC (5 to 70% acetonitrile and water, using 0.1% TFA as modifier). Yield 40.2 mg, 87.6%. Ions found by LCMS: [M + H]+ = 1378.6, [(M + 2H)/2]+ = 689.8, [(M + 3H)/3]+ = 460.4.
Example 58. Synthesis o Int-129
Step a.
A mixture of piperazine bromide (2.4g, 7.02 mmol, prepared as described in ACS Medicinal Chemistry Letters, 9(5), 446-451; 2018), amine-acetonide (1.02g, 7.74 mmol), and potassium carbonate (1.94g, 14.07 mmol) in dioxane (10 mL), were heated at 75C for 24 hrs. Reaction progress was monitored by LCMS. The crude mixture was filtered, and concentrated to an oil, which was purified by flash chromatography (0% to 10% MeOH/DCM). Yield: 1.59g, 58%. LCMS: [M + H]+ = 392.2 Step b.
A mixture of product from the previous step (1.59g, 2.57 mmol), alkyl bromide (1.62g, 6.42 mmol), potassium iodide (0.43 g, 2.57 mmol), and potassium carbonate (3.55g, 25.67 mmol) dissolved in dioxane (7.5 mL) and DMF (2.5 mL) were heated at 65C for 18h. Reaction progress was monitored by LCMS. The crude reaction was filtered, concentrated and purified by RPLC (5% ACN to 100% ACM/water). Yield 1.15g, 80%. LCMS: [M + H]+ = 563.4
Step c.
A mixture of product from the previous step (1.15g, 2.04 mmol), and 5% Pd/C (0.5g), were dissolved in MeOH (10mL), then vacuum flushed with hydrogen from a balloon. Reaction was complete after 2h as monitored by LCMS. The crude mixture was filtered through a pad of celite, concentrated, and used in the next step without addition purification. LCMS: [M + H]+ = 429.4 Step d.
A mixture of product from the previous step (0.88 g, 2.04 mmol), propargyl-Peg4-mesylate (0.76 g, 2.45 mmol), and potassium carbonate (0.846 g, 6.12 mmol), dissolved in acetonitrile (8 mL), were heated at 65C for 12h. The mixture was filtered, concentrated, and purified by RPLC (5% ACN to 100% ACM/water). Yield 0.65 g, 50%. LCMS: [M + H]+ = 643.4 Step e.
Product from the previous step (0.100g, 0.156 mmol) was stirred in acetonitrile (1.5 mL), water (0.75 mL), and concentrated HCL aq (0.75 mL) for 1h, then concentrated to an oil and used in the next step without further purification. Yield of HCL salt 0.112 g. LCMS: [M + H]+ = 503.4 Step f.
A solution of product from the previous step (0.078g, 0.156 mmol), and triazole-carboxylic acid (0.050g, 0.098 mmol, described herein) dissolved in DMF (2 mL), was treated with DIEA (0.341 mL, 1.96 mmol), HOBT (0.026 g, 0.196), and HATU (0.074 g, 0.196 mmol). LCMS after 30min shows complete consumption of the triazole-carboxylic acid. The reaction was concentrated and purified by semi- preparative HPLC (5% to 100% ACN/water with 0.1% TFA). Yield 0.055g, 42%. LCMS: [M + H]+ = 996.4
Step g.
A solution of product from the previous step (0.068g, 0.051 mmol), and azido-Peg4- trifluorophenol ester (0.028g, 0.066 mmol) dissolved in DMF (1 mL), was treated with a solution of copper(II) sulfate (0.0020 g, 0.0127 mmol), sodium ascorbate (0.0101 g, 0.0508 mmol), and THPTA (0.00883g, 0.0203 mmol) dissolved in water (0.5 mL), and stirred room temperature for 30 min. Reaction progress was monitored by LCMS. The product was purified by RPLC (10% to 100% MeOH/water). Yield 0.027g, 30%. LCMS: [(M + 2H)/2]+ = 709.4. Example 59. Synthesis of Int-132
A mixture of t-butyl(5-formyl-2,2-dimethyl-1,3-dioxan-5-yl) carbamate (0.5 g, 1.9 mmol) and Cbz- piperazine (0.51 g, 2.3 mmoL) in methanol (20 mL) was stirred at ambient temperature for 1 hour. Sodium cyanoborohydride (0.31 g, 4.8 mmol) was added and the reaction was stirred for 12 hours then
concentrated to a volume of approx.5 mL. The mixture was purified by reversed phase HPLC (0-70% ACN in DI water, 0.1% TFA modifier, 30 minute gradient) The pure fractions were concentrated to afford the intermediate as a clear oil. Ion found by LCMS [M+H]+ = 464.2. The intermediate was stirred in methanol (15 mL) in the presence of 5% Pd/C (30 mg) under 1 atmosphere of hydrogen for 45 minutes. The mixture was filtered, concentrated and taken up in acetonitrile (15 mL). Fmoc-OSu (0.67 g, 2 mmol) was added and the mixture was stirred for 1 hour, diluted with water (50 mL) and extracted into ethyl acetate (3x, 30 mL. The combined organic extracts were dried over sodium sulfate and concentrated. The crude material was purified by reversed phase HPLC (0-90% ACN in DI water, 0.1% TFA modifier, 30 minute gradient) and the pure fractions were concentrated to afford the title compound as a white solid. Yield 250 mg, 23%, 3 steps. Ion found by LC/MS [M+Na]+ = 575.4. Step b.
The intermediate from step a. of this example (250 mg, 0.45 mmol) was stirred in 4N HCl in dioxane (5mL) for 40 minutes. The mixture was concentrated and dried under high vacuum. Ion found by LC/MS [M+H]+ = 412.2. The amine-HCl salt and CBZ-amino-propanal (112 mg, 0.54 mmol) were dissolved in methanol (10 mL), stirred for 30 minutes. Sodium cyanoborohydride (75mg, 1.1 mmol) was added and the mixture was stirred for 12 hours at ambient temperature. The mixture was concentrated and purified by reversed phase HPLC (0-80% ACN in DI water, 0.1% TFA modifier, 30 minute gradient). The pure fractions were concentrated to afford the CBZ protected amine intermediate. Ion found by LC/MS [M+H]+ = 603.2. The intermediate was stirred in methanol in the presence of 5% Pd/C (25 mg) for 2 hours. The mixture was filtered and concentrated to afford the free amine which was carried forward without further purification. Yield 111 mg, 57%, 3 steps. Ion found by LC/MS [M+H]+ = 469.2. Step c.
HATU (72 mg, 0.19 mmol) was added to a stirring mixture of the intermediate from step b. of this example (75 mg, 0.16 mmol), and the triazole carboxylic acid, (described in example 5 of Int- 2, 81 mg, 0.16 mmol), and DIPEA (0.11 uL, 0.64 mmol) in DMF (2 mL). The reaction was stirred for 45 minutes and purified directly by reversed phase HPLC (0-95% ACN in DI water, 0.1% TFA modifier, 30 minute gradient). The pure fractions were pooled and concentrated to afford the intermediate as a clear oil. Yield 53 mg, 35%. Ion found by LC/MS [M+H]+ = 962.4.
Step d.
DBU (73 uL, 0.52 mmol) was added to a stirring mixture of the intermediate described in step c. of this example (100 mg, 0.10 mmol) in DMF (2 mL) and the mixture was stirred for 20 minutes. Propargyl-peg4-carboxylic acid (41 mg, 0.16 mmol) was added followed by HATU (47 mg 0.12 mmol) and the reaction was stirred for 45 minutes. The mixture was purified by reversed phase HPLC (0-95% ACN in DI water, 0.1% TFA modifier, 30 minute gradient). The pure fractions were pooled and concentrated to afford the intermediate as a clear oil. Yield 40 mg, 39 %. Ion found by LC/MS [M+H]+ = 982.2.
Copper sulfate (0.6 mg, 0.004 mmol), sodium ascorbate (24 mg, 0.12mmol), and BTTA (3.5 mg, 0.008 mmol), pre-mixed in DI water (0.5 mL), were added to a mixture of the alkyne intermediate, (described in step d. of this example, 40 mg, 0.40 mmol) and azido-peg4-trifluorophenol-ester (17 mg, 0.40 mmol), in a mixture of DMF/H2O (3:1, 1 mL) , and cooled to 0°C with an ice-water bath. The ice bath was removed and the reaction was stirred for 25 minutes.1 drop of glacial acetic acid was added and the mixture was purified by reversed phase HPLC (0-80% ACN in DI water, 0.1% TFA modifier, 30 minute gradient). The pure fractions were pooled and lyophilized to afford the title compound as a white solid. Yield 29 mg, 50%. Ion found by LC/MS [(M/2)+H]+ =702.4.
A mixture of bromide (2.72g, 11.4 mmol), amine-acetonide (1.00g, 7.62 mmol), and potassium carbonate (3.16 g, 22.9 mmol) in dioxane (3 mL), were heated at 75C for 18 hrs. Reaction progress was monitored by LCMS. The crude mixture was filtered, and concentrated to an oil, which was purified by flash chromatography (0% to 10% MeOH/DCM). Yield: 0.823 g, 37.4%. LCMS: [M + H]+ = 289.2 Step b.
A mixture of product from the previous step (0.823 g, 2.85 mmol), alkyl chloride (0.871 g, 5.71 mmol), potassium iodide (0.474 g, 2.85 mmol), and potassium carbonate (1.18 g, 8.56 mmol) dissolved in DMF (10 mL) was stirred at 75C for 48h. Reaction progress was monitored by LCMS. The crude mixture was filtered, concentrated, and purified by RPLC (5% to 100% ACN/water). Yield 1.05g, 71%. LCMS: [M + H]+ = 405.4
Step c.
A solution of product from the previous step (1.05g 2.60 mmol), was dissolved in (1:1) MeOH/water (20 ml), then treated with concentrated HCL (0.5 mL) and stirred at RT for 2h. The crude product was concentrated, stored under high vacuum and used in the next step without purification. LCMS: [M + H]+ = 319.2 Step d.
A solution of product from the previous step (0.715g, 2.25 mmol), and Cbz-piperazine (0.495 g, 2.25 mmol), acetic acid (0.5 mL) dissolved in DCM (7 mL) was treated with sodium triacetoxy borohydride (0.714 g, 3.37 mmol) at RT for 12hr. The reaction was filtered, concentrated, and purified by RPLC (5% to 100% ACN/water with 0.1% TFA). Yield 0.116g, 10%. LCMS: [M + H]+ = 523.4 Step e.
Product from the previous step (0.116 g, 0.220 mmol), and 20% Pd(OH)2 (0.100 g) was dissolved in MeOH (2 mL), then vacuum flushed with hydrogen from a balloon. Reaction was complete by LCMS after 20 min. LCMS: [M + H]+ = 389.3 Step f.
Product from the previous step (0.085 g, 0.219 mmol), propargyl-Peg4-mesylate (0.081g, 0.263 mmol), and potassium carbonate (0.091g, 0.656 mmol), dissolved in acetonitrile (2mL) were heated at 65C for 12h. The reaction was filtered, concentrated, and purified by RPLC (5% to 100% ACN/water). Yield 0.106g, 51%. LCMS: [M + H]+ = 603.4
Step g.
Product from the previous step (0.106 g, 0.112 mmol), was dissolved in 4M HCl/dioxane (2ml) and stirred for 15 min. The crude product was concentrated and stored under high vacuum for 12h. The crude product was used in the next step without purification. Step h.
A solution of product from the previous step (0.095g, 0.147 mmol), and triazole-carboxylic acid (0.050g, 0.098 mmol, described herein) dissolved in DMF (2 mL), was treated with DIEA (0.170 mL, 0.98 mmol), HOBT (0.026 g, 0.196), and HATU (0.074 g, 0.196 mmol). LCMS after 30min shows complete consumption of the triazole-carboxylic acid. The reaction was concentrated and purified by semi- preparative HPLC (5% to 100% ACN/water with 0.1% TFA). Yield 0.060g, 46%. LCMS: [M + H]+ = 996.4 Step i.
A solution of product from the previous step (0.060g, 0.045 mmol), and azido-Peg4- trifluorophenol ester (0.025g, 0.058 mmol) dissolved in DMF (1 mL), was treated with a solution of copper(II) sulfate (0.0018 g, 0.0112 mmol), sodium ascorbate (0.0089 g, 0.0448 mmol), and THPTA (0.00779g, 0.0179 mmol) dissolved in water (0.5 mL), and stirred room temperature for 30 min. Reaction progress was monitored by LCMS. The product was purified by RPLC (10% to 100% ACN/water with 0.1% TFA). Yield 0.054g, 68%. LCMS: [(M + 2H)/2]+ = 709.4
Example 61. Synthesis of Int-134
Step a.
To a solution of benzyl 1-piperazinecarboxylate (4.4 g, 20 mmol) in anhydrous THF (20 ml) was added K2CO3 (4.15 g, 30 mmol) and 1,3-dibromopropane (20.2 g, 100 mmol). The mixture was heated at 70℃ overnight, then cooled to room temperature. The salt was filtered off, and the filtrate was concentrated by rotary evaporation and purified by silica gel chromatography (80 g, 20 to 100% EtOAc and hexane). Yield 4.48 g, 65.5%. Ion found by LCMS: [M + H]+ = 341.0. Step b.
To a solution of the step-a product (944.7 mg, 2.77 mmol) in anhydrous DMF (3 ml) was added K2CO3 (459 mg, 3.32 mmol) and ethanolamine (338.5 mg, 5.54 mmol). The mixture was heated at 70℃ for 3 hours, then cooled to room temperature. The salt was filtered off, and the filtrate was concentrated by rotary evaporation and purified by RPLC (100 g, 0 to 70% acetonitrile and water). Yield 708 mg, 79.2%. Ion found by LCMS: [M + H]+ = 322.0. Step c.
To a solution of the step-b product (708 mg, 2.19 mmol) in anhydrous DMF (2.5 ml) was added DIPEA (568.7 mg, 4.4 mmol), 3-(Boc)-aminopropyl bromide (785.7 mg, 3.3 mmol) and NaI (105 mg, 0.7 mmol). The mixture was stirred at 50℃ for 1 day, then purified by RPLC (100 g, 5 to 80% acetonitrile and water). Yield 793 mg, 75.7%. Ion found by LCMS: [M + H]+ = 479.0.
Step d.
To a solution of the step-c product (793 mg, 1.66 mmol) in MeOH (15 ml) was added acetic acid (99.4 mg, 1.66 mmol)) and Pd(OH)2 (20% on carbon, 350 mg). The resulting mixture was stirred under hydrogen atmosphere for 3 hours. Pd(OH)2/C was filtered off, and the filtrate was concentrated by rotary evaporation. The crude product was further dried under high vacuum and carried to the subsequent without purification. Yield 618 mg, 92%. Ion found by LCMS [M + H]+ = 345.0, [M - Boc + H]+ = 245.0. Step e.
To a solution of the step-d (618 mg, 1.53 mmol) in anhydrous THF (3 ml) and DMF (1 ml) was added K2CO3 (624.3 mg 4.59 mmol), propargyl-PEG4-mesyl ester (552.5 mg, 1.78 mmol) and NaI (45 mg, 0.3 mmol). The resulting mixture was heated at 70℃ overnight. The salt was filtered off, and the filtrate was concentrated and purified by HPLC (0 to 70% acetonitrile and water). Yield 262.4 mg, 31.7%. Ion found by LCMS: [M + H]+ = 559.2. Step f.
The step-e product (262.4 mg, 0.47 mmol) was dissolved in acetonitrile/water (1:1, 4 ml) and mixed with 6N HCl aqueous solution (0.4 ml, 2.4 mmol). The reaction was heated at 50℃ for 6 hours, then cooled to room temperature. After the pH was adjusted to about 8 with 1M KOH (2.4 ml), the solution was concentrated by rotary evaporation. The residue was re-dissolved in MeOH/DCM (~1:1), and KCl was filtered off. The filtrate was concentrated by rotary evaporation and further dried under high vacuum. Yield 216 mg, quantitative yield. Ion found by LCMS: [M + H]+ = 459, [(M + 2H)/2]+ = 230.0. Step g.
A mixture of triazole acid (51.2 mg, 0.1 mmol, described herein) and the step-f product (86.1 mg, 0.189 mmol) was dissolved in anhydrous DMF (2 ml) and Et3N (606 mg, 6 mmol) by gently heated with a heat gun. After cooling to room temperature, the resulting solution was mixed with HATU (57 mg, 0.15 mmol) and stirred for 1 hour. Addition HATU (19 mg, 0.05 mmol) was added, followed by the step-g product (24 mg, 0.05 mmol) in DMF (0.5 ml). The reaction was continued for 1 more hour, then directly
purified by HPLC: 5 to 50% acetonitrile and water, using 0.1% TFA as modifier. Yield 40 mg, 28.4%. Ions found by LCMS: [M + H]+ = 951.8, [(M + 2H)/2]+ = 476.5, [(M + 3H)/3]+ = 318.2. Step h.
A solution of the step-g product (20 mg, 0.0142 mmol) in DMF:MeOH (1:1, 1 ml) was mixed with TFA (20 µl) and cooled in an ice-water bath. It was then mixed with azido-PEG4-trifluorophenyl ester (7.2 mg, 0.017 mmol) and a premixed solution of THPTA (3.5 mg, 0.008 mmol) and sodium ascorbate (29 mg, 0.146 mmol) in water (0.5 ml), followed by Cu2SO4 (1.6 mg, 0.01 mmol). The ice-water bath was removed, and the resulting mixture was stirred for 40 minutes. It was then directly purified HPLC (5 to 70% acetonitrile and water, using 0.1% TFA as modifier). Yield 16.3 mg, 62.7%. Ions found by LCMS: [M + H]+ = 1373.5, [(M + 2H)/2]+ = 687.4, [(M + 3H)/3]+ = 458.8. Example 62. Synthesis of Int-139
To a cooled solution of malonate derivative (700 mg, 2.4 mmol, prepared as described in Angewandte Chemie, International Edition (2018), 57(22), 6527-6531) in DMF (5 ml) was added sodium hydride (100 mg, 2.6 mmol), followed by benzyl 4-(3-bromopropyl)piperazine-1-carboxylate (1 g, 2.93 mmol, prepared as described in ACS Medicinal Chemistry Letters (2018), 9(5), 446-451). The resulting suspension was stirred for 2.5 h at 0°C and then partitioned between water and
dichloromethane. The organic phase was separated, washed with water and sat. aq. NaHCO3 and then dried over anhydrous Na2SO4 and concentrate. The residue was purified by flash chromatography (hexane/EtOAc 40%) to afford product. Yield 900 mg, 67 %. Ion found by LCMS: [M + H]+ = 550.3. Step b. A mixture of step-a product (500 mg, 0.71 mmol) and 20% Pd(OH)2 (100 mg, 0.143 mmol) were stirred in MeOH (10 ml), then vacuum flushed with hydrogen from a balloon. Reaction was complete after 2h as monitored by LCMS. The crude mixture was filtered through a pad of celite, concentrated, and used in the next step without addition purification. LCMS: [M + H]+ = 416.2. Step c.
A mixture of step-b product (410 mg, 0.61 mmol), propargyl-Peg4-mesylate (210 mg, 0.674 mmol), and potassium carbonate (254 mg, 1.839 mmol) in acetonitrile (2 mL) were heated at 65oC for 12h. The mixture was filtered, concentrated, and purified by RPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier). Yield 305 mg, 78%. LCMS: [M + H]+ = 630.3. Step d.
To a cool solution of step-c product (100 mg, 0.15 mmol) in THF (1.5 ml) was added LiBH4 (14.5 mg, 0.63 mmol), and the solution was stirred at 0oC -r.t for 6 hrs. The reaction solution was quenched with dis water, concentrated and purified by RPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier). Yield 42 mg, 46 %. Ion found by LCMS: [M + H]+ = 574.4. Step e.
The step-d product (42 mg, 0.052 mmol) was dissolved in HCl (4N in dioxane). The reaction was stirred for 1h, and then concentrated and used in the next step without further purification. Yield of HCL salt 35 mg. LCMS: [M + H]+ = 474.3.
Step f.
To a solution of step-e product (35 mg, 0.052 mmol) in DMF (0.5 ml) was added triazole- carboxylic acid (32.1 mg, 0.062 mmol), DIPEA (0.089 mL, 0.51 mmol), HOBT (14.1 mg, 0.10 mmol), and HATU (39.8 mg, 0.10 mmol). LCMS after 30min shows complete consumption of the triazole-carboxylic acid. The reaction was concentrated and purified by semi-preparative HPLC (5% to 100% ACN/water). Yield 12 mg, 23.6%. LCMS: [M + H]+ =967.5. Step g.
A solution of step-f product (12 mg, 0.011 mmol), and azido-Peg4-trifluorophenol ester (6 mg, 0.014 mmol) in DMF (0.6 mL), was treated with a solution of copper (II) sulfate (0.4 mg, 0.00277 mmol), sodium ascorbate (2.19 mg, 0.011 mmol), and THPTA (2.41 mg, 0.0055 mmol) in water (0.3 mL), and stirred at room temperature for 30 min. Reaction progress was monitored by LCMS. The product was purified by semi-preparative HPLC (5% to 100% ACN/water). Yield 8 mg, 51.9%. LCMS: [(M + 2H)/2]+ = 695.6. Example 63. Synthesis of Int-140
Step a.
Propargyl bromide (0.675 g of 80% solution in toluene) was added dropwise over 10 minutes to a 0C solution of Cbz-piperazine (1.00 g, 4.54 mmol), and potassium carbonate (1.25 g, 9.08 mmol) stirring in acetonitrile (60 mL). The reaction was kept at 0C for 2hr, then filtered, concentrated, and used in the next step without further purification. LCMS: [M + H]+ = 259.2 Step b.
A solution of product from the previous step (0.200g, 0.774 mmol), and aryl bromide (0.389g, 0.774 mmol), dissolved in DMF (2 mL) and diisopropyl amine (3 mL), was vacuum flushed with nitrogen gas, then charged with Pd(PPh3)2Cl2 (0.0217g, 0.0310 mmol) and copper iodide (0.0118 g, 0.0620 mmol) and heated in a 100C oil bath for 10 minutes. . The reaction concentrated and purified by RPLC (10% to 100% ACN/water no TFA modifier). Yield 0.197g, 55%. LCMS: [M + H]+ = 466.2 Step c.
Product from the previous step (0.197 g, 0.430 mmol), 20% Pd(OH)2/C (0.200 g) and methanol (4mL) was vacuum flushed with hydrogen from a balloon and stirred for 1hr under a hydrogen atmosphere. Reaction progress was monitored by LCMS. The reaction was filtered through a pad of celite, concentrated, and used in the next step without further purification. LCMS: [M + H]+ = 336.2 Step d.
A mixture of product from the previous step (0.121g, 0.361 mmol), propargyl-Peg4-mesylate (0.134 g, 0.433 mmol), and potassium carbonate (0.150g, 1.08 mmol) dissolved in acetonitrile, was stirred for 12h in a 65C oil bath. Reaction progress was monitored by LCMS. The reaction was filtered, concentrated, and purified by RPLC (5% to 100% ACN/water with 0.1% TFA). Yield 0.121 g, 50.5 %. LCMS: [M + H]+ = 550.4
Step e.
Product from the previous step (0.121g, 0.182 mmol) was dissolved in 4M HCl in dioxane (3mL), stirred for 30 minutes, then concentrated and used in the next step without further purification. Crude yield 0.100 g, 92.1%. LCMS: [M + H]+ = 450.4 Step f.
A solution of product from the previous step (0.100g, 0.168 mmol), and triazole-carboxylic acid (0.050g, 0.098 mmol, described here) dissolved in DMF (2 mL), was treated with DIEA (0.206 mL, 1.19 mmol), HOBT (0.026 g, 0.196), and HATU (0.056 g, 0.148 mmol). LCMS after 30min shows complete consumption of the triazole-carboxylic acid. The reaction was concentrated and purified by RPLC (5% to 100% ACN/water with 0.1% TFA). Yield 0.043g, 37%. LCMS: [M + H]+ = 943.4 Step
A solution of product from the previous step (0.043g, 0.046 mmol), and azido-Peg4- trifluorophenol ester (0.025g, 0.058 mmol) dissolved in DMF (1 mL), was treated with a solution of copper(II) sulfate (0.0018 g, 0.0112 mmol), sodium ascorbate (0.0090 g, 0.0456 mmol), and THPTA (0.00792g, 0.0182 mmol) dissolved in water (0.5 mL), and stirred room temperature for 30 min. Reaction progress was monitored by LCMS. The product was purified by RPLC (10% to 100% ACN/water with 0.1% TFA). Yield 0.044g, 61%. LCMS: [(M + 2H)/2]+ = 683.0
Example 64. Synthesis of Int-141
To a cool solution of dimethyl malonate (3.58 g, 27.1 mmol) in THF (60 ml) was added NaH (1.1 g, 60% NaH in mineral oil, 29.8 mmol) in portions. The resulting suspension was stirred for 30 mins, concentrated and dried in vacuum. The residue was re-dissolved in methanol and tert-butyl (4- bromopropyl) carbamate (3.42 g, 13.5 mmol) was added. The reaction was refluxed for 3 h. Upon completion (monitored by TLC), the solvent was concentrated and added 50 mL of saturated aqueous NH4Cl. The aqueous phase was then extracted with CH2Cl2 (3 X 50 mL). The combined organic extracts were dried over Na2SO4 anhydrous, filtered and concentrated and the residue was purified by flash chromatography (gradient, 0 → 40% EtOAc/hexanes) to afford product. Yield 1.2 g, 30 %. Ion found by LCMS: [M + H]+ = 304.1.
Step b.
To a cold solution of step-a product (422 mg, 1.39 mmol) in DMF (5 ml) was added sodium hydride (58 mg, 1.46 mmol), followed N-(Bromomethyl)phthalimide (382 mg, 1.53 mmol). The resulting suspension was stirred for 2.5 h at 0°C and then partitioned between water and dichloromethane. The organic phase was separated, washed with water and sat. aq. NaHCO3 and then dried over anhydrous NaSO4 and concentrate. The residue was purified by flash chromatography (hexane/EtOAc=40%) to afford product. Yield 491 mg, 76 %. Ion found by LCMS: [M + H]+ = 463.2. Step c.
To a solution of step-b product (491 mg, 1.06 mmol) in methanol (5 ml) was added hydrazine hydride (0.3 ml, 5.3 mmol). The solution was stirred under reflux for 4 hrs, cooled, filtered and concentrated and the residue was purified by RPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier). Yield 352 mg, 95.7 %. Ion found by LCMS: [M + H]+ = 333.1. Step d.
To a solution of step-c product (169 mg, 0.34 mmol) in DMF (4 ml) was added propargyl-PEG4- NHS (121 mg, 0.34 mmol) and Et3N (0.14 ml, 1.02 mmol). The solution was stirred at r.t overnight, concentrated and the residue was purified by RPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier). Yield 87 mg, 44 %. Ion found by LCMS: [M + H]+ = 575.3. Step e.
To a cool solution of step-d product (87 mg, 0.15 mmol) in THF (1.5 ml) was added LiBH4 (14 mg, 0.6 mmol), and the solution was stirred at 0oC -r.t for 6 hrs. The reaction solution was quenched with dis water, concentrated and purified by RPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier). Yield 31 mg, 39 %. Ion found by LCMS: [M + H]+ = 519.3.
Step f.
Step-e product (31 mg, 0.06 mmol) was dissolved in a cool solution of HCl (1.5 ml, 4N in dioxane) and reaction was stirred for 2 hr. The solution was concentrated to an oil and used as an intermediate without purification. Yield of HCl salt 25 mg. Ion found by LCMS: [M + H]+ = 419.2. Step g.
To a solution of product-f (25 mg, 0.06 mmol) in DMF (1 mL) was added triazole-carboxylic acid (36.8 mg, 0.072 mmol), DIPEA (0.062 mL, 0.36 mmol), HOBT (16.2 mg, 0.12 mmol), and HATU (45.6 mg, 0.36 mmol). LCMS after 30min shows complete consumption of the triazole-carboxylic acid. The reaction was concentrated and purified by semi-preparative HPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier). Yield 37 mg, 67%. LCMS: [M + H]+ = 912.4. Step h.
A mixture of product-g (20 mg, 0.022 mmol), and azido-Peg4-trifluorophenol ester (12 mg, 0.028 mmol) in DMF (0.6 mL) was treated with a solution of copper(II) sulfate (0.87 mg, 0.0054 mmol), sodium ascorbate (4.34 mg, 0.021 mmol), and THPTA (4.76 mg, 10.9 mmol) in water (0.3 mL), and stirred room temperature for 30 min. Reaction progress was monitored by LCMS. The product was purified by semi- preparative HPLC (5% to 100% ACN/water). Yield 18 mg, 62%. LCMS: [(M + 2H)/2]+ = 667.5. Example 65. Synthesis of Conjugate 54 To a solution of Fc carrier (7.5 ml, 150 mg, 0.002578 mmol, SEQ ID NO: 76) in sodium acetate buffer (pH 5.0, 20 mM) was added DMF (2 ml) and Int-106 (38 mg, 0.0258 mmol) in DMF (1 ml + 3 x 0.2 ml to wash the glassware). After the resulting mixture was gently mixed for 5 minutes, its pH was adjusted to about 8.5 with borate buffer (0.8 mL, pH 8.5, 1M). The reaction was rotated overnight and then purified by dialysis. Maldi TOF analysis of the purified final product gave an average mass of 63,678 Da (DAR = 4.9). Yield: 72 mg, 48%.
Example 66. Synthesis of Conjugate 55 To a solution of Fc carrier (5.45 ml, 109 mg, 0.001874 mmol, SEQ ID NO: 76) in sodium acetate buffer (pH5.0, 20 mM) was added DMF (1.4 ml) and Int-108 (22.3 mg, 0.015 mmol) in DMF (1 ml + 3 x 0.2 ml to wash the glassware). After the resulting mixture was gently mixed for 5 minutes, its pH was adjusted to about 8.5 with borate buffer solution (0.6 ml, 1M, pH 8.5). The reaction was rotated overnight, then purified by dialysis. Maldi TOF analysis of the purified final product gave an average mass of 63,048 Da (DAR = 4.3). Yield: 66 mg, 60%. Example 67. Synthesis of Conjugate 57 The title compound was prepared analogously to synthesis of Conjugate 55 where Int-108 was replaced with Int-112. Maldi TOF analysis of the purified final product gave an average mass of 63445 Da (DAR = 4.5). Yield: 88 mg, 58%. Example 68. Synthesis of Conjugate 60 To a solution of Fc carrier (6.0 ml, 120 mg, 0.00206 mmol, SEQ ID NO: 76) in sodium acetate buffer (20mM, pH 5.0) was added DMF (1 ml) and Int-121 (24.2 mg, 0.0128 mmol) in DMF (1 ml + 3 x 0.2 ml to wash the glassware). After the resulting mixture was gently mixed for 10 minutes, its pH was adjusted to ~ 8.5 with borate buffer (1M, pH 8.5) solution in three portions (0.25 ml x 2 + 0.1 ml). The reaction was rotated overnight, then purified by dialysis. Maldi TOF analysis of the purified final product gave an average mass of 61,328 Da (DAR = 2.4). Yield: 105 mg, 88%. Example 69. Synthesis of Conjugate 61 To a solution of Fc carrier (SEQ ID NO: 76)( 5.57 ml, 111 mg, 0.00153 mmol) in sodium acetate buffer (pH 5.0, 20 mM) was added DMF (1 ml) and Int-122 (24.2 mg, 0.0153 mmol) in DMF (1 ml + 3 x 0.2 ml to wash the glassware). After the resulting mixture was rocked for 10 minutes, its pH was adjusted to about 8.5 with borate buffer(pH 8.5, 1 M) solution in three portions (0.25 ml x 2 + 0.1 ml). The reaction was gently rotated overnight and then purified by dialysis. Maldi TOF analysis of the purified final product gave an average mass of 63,068 Da (DAR = 3.9). Yield: 89 mg, 81%. Example 70. Synthesis of Conjugate 65 To a solution of Fc carrier (5 ml, 100 mg, 0.001719 mmol, SEQ ID NO: 76) in sodium acetate buffer (20mM, pH 5.0), was added DMF (1 ml) and Int-126 (25.2 mg, 0.01373 mmol) in DMF (1 ml + 3 x 0.2 ml to wash the glassware). After the resulting mixture was gently mixed for 10 minutes, its pH was adjusted to about 8.5 with borate buffer solution (1M) in three portions (3 x 0.2 ml). The reaction was gently rotated overnight and then purified by dialysis. Maldi TOF analysis of the purified final product gave an average mass of 60,714 Da (DAR = 1.9). Yield: 66.2 mg, 66.2%. Example 71. Synthesis of Conjugate 71 To a solution of Fc carrier (3.24 ml, 64.7 mg, 0.0089 mmol, SEQ ID NO: 76) in sodium acetate buffer (pH 5.0, 20 mM) was added DMF (0.8 ml) and a solution of Int-134 (16.3 mg, 0.0089 mmol) in DMF
(0.6 ml). Addition DMF (0.2 ml x 3) was used to wash the glassware and combined with the reaction mixture. After gently mixing for 5 minutes, borate buffer (pH 8.5, 1 M) solution (0.4 ml) was added to adjust the pH to ~8.5. The reaction was gently rotated overnight and then purified by dialysis. Batch 1: Maldi-TOF analysis of the purified final product gave an average mass of 61,018 Da; DAR = 2.4; and yield = 34.5 mg, 53%. Batch 2: Average mass (Maldi-TOF) = 62,949 Da; DAR 3.9; and yield = 75.2 mg, 84%. Example 72. Synthesis of Conjugates – Trifluorophenyl ester General procedure for conjugation using trifluorophenyl ester: A solution of Fc in PBS buffer (pH = 7.4) and DMF was treated with a solution of trifluorophenyl ester dissolved in DMF. The pH was adjusted to ~8.0 to 9.5 with borate buffer (pH 8.5-9.5). Then the mixture was gently rocked at room temperature. Maldi TOF after 3 hours shows an average DAR of 3 to 5. The crude conjugate was purified by dialysis in arginine buffer (200 mM Arginine, 120 mM NaCl , 1% Sucrose pH 6.0). Yields are typically 30% to 70%. DAR is determined by Maldi TOF of the purified conjugates and is typically 2 to 5. The yields and properties for conjugates synthesized using this general procedure are listed in Table 16a below. Table 16a. Conjugates and properties
Example 73. Synthesis of Conjugates – Click conjugation A preparation of 0.0050M CuSO4 in PBS buffer solution Click reagent was performed. Briefly, 10.0 mg CuSO4 was dissolved in 12.53 mL PBS, next 6.00 mL of the CuSO4 solution and added 51.7 mg BTTAA (CAS# 1334179-85-9) and 297.2 mg sodium ascorbate to give the Click reagent solution (0.0050M CuSO4, 0.020M BTTAA and 0.25M sodium ascorbate). This Click reagent solution was used for all subsequent conjugations. General procedure for Click conjugation of payload: a solution of azido functionalized Fc was added to a 15 mL centrifuge tube containing alkyne derivatized small molecule (2 equivalents for each DAR). After gently shaking to dissolve all solids, the mixture was treated with the Click reagent solution of (L-ascorbic acid sodium, 0.25 M, 400 equivalents, copper (II) sulfate 0.0050M, 8 equivalents, and BTTAA 0.020M, 32 equivalents). The resulting mixture was gently rotated for 6 hours at ambient temperature. It was purified by affinity chromatography over a protein A column, followed size exclusion chromatography (as described herein). Maldi TOF analysis of the purified final product gave an average mass, average DAR and Yield listed in Table 16b below. Table 16b. Conjugates and properties
Example 74. Activity against HIV-1 IIIB Strain (HIV-1IIIB) in a cytopathic effects assay using CEM-SS Cells Materials. This study determined the activity of conjugates, comparators, and buffers against HIV-1IIIB in a CPE assay with a T-cell lymphoblastic cell line obtained from the AIDS Research and
Reference Program (Rockville, MD). Controls consisted of AZT (Sigma-Aldrich), Temsavir (Astatech, PA), conjugate buffer (200 mM Arginine, 120 mM NaCl, 1% Sucrose, pH 6.0), and Temsavir buffer (DMSO/PEG300/Tween-80/PBS). Methods – Evaluation of Inhibition of HIV-1IIIB and Cellular Toxicity in CEM-SS Cells. Fifty microliters (50 µL) of CEM-SS cells at a density of 2.5 x103 cells/well in 10% complete RPMI-1640 (10% FBS with 1% L-glutamine and 1% Penicillin/Streptomycin) media were plated in a 96-well round bottom plate. One-hundred microliters (100 µL) of each compound at 6 concentrations were added in triplicate followed by 50 µL of HIV-1IIIB at a pre-determined titer. The cultures were incubated for 6 days at 37°C/5% CO2. Following the incubation, the cells were stained with XTT for evaluation of compound efficacy and cellular toxicity, as described below. AZT was evaluated in parallel as an assay positive control compound. Data Analysis and Evaluation. Microsoft Excel 2010 in combination with XLfit4 was used to analyze and graph data. EC25, EC50 and EC95 (25%, 50% and 95% inhibition of virus replication), TC25, TC50 and TC95 (25%, 50%, and 95% reduction in cell viability) and a therapeutic index (TI, TC25/EC25, TC50/EC50, TC95/ EC95) were calculated. Results. Six (6) compounds and two buffers which were used for solubilization of the test compounds were evaluated for activity against HIV-1IIIB in CEM-SS cells7. The conjugate buffer and Temsavir buffer were inactive up to the lowest dilution evaluated (1:20,000 and 1:1,000,000, respectively). All of the compounds and buffers were non-toxic up to the highest concentration evaluated. AZT was evaluated in parallel as a positive control compound and was active at the expected concentration.
Example 75. Activity of Conjugate 29a and Conjugate 29b against 5 HIV-1 strains in a cell-based assay with human peripheral blood mononuclear cells (PMBCs) Materials. This study determined the activity of Conjugate 29a and Conjugate 29b against wild type HIV-1 isolates covering 3 different clades and co-receptor specificities obtained from the AIDS Research and Reference Reagent Program (Rockville, MD) (Table 18). Controls consisted of AZT (Sigma-Aldrich), Temsavir (Astatech, PA), and compound buffer (200 mM Arginine, 120 mM NaCl, 1% Sucrose, pH 6.0)
Human PBMCs were isolated from whole blood from three donors. The leukophoresed blood was diluted 1:1 with DPBS and was layered over 15 mL of Ficoll-Hypaque density gradient. These tubes were centrifuged for 30 min at 1800 rpm. Banded PBMCs were gently aspirated from the resulting interface and subsequently washed three times with DPBS by low-speed centrifugation. After the final wash, cells were enumerated by Trypan Blue dye exclusion and re-suspended at 1 x 106 cells/mL in RPMI 1640 with 15% Fetal Bovine Serum (FBS), 2 mmol/L L-glutamine, 2 µg/mL PHA-P, 100 Units/mL penicillin and 100 μg/mL streptomycin were incubated for 72 hours at 37oC/5% CO2. Following the incubation, the PBMCs were centrifuged and resuspended in tissue culture medium (RPMI 1640 with 15% FBS, 2 mmol/L L-glutamine, 100 U/mL penicillin, 100 µg/mL streptomycin and 3.6 ng/mL recombinant human IL-2). The cultures were then maintained until use by half culture volume change with fresh IL-2 containing tissue culture medium every 3 to 4 days. Assay conditions. Assays were initiated with PBMCs that had been induced to proliferate with PHA-P for 72 hours. For the PBMC assay, PHA-P stimulated PBMCs from three donors were pooled together to minimize the variability that occurs when cells from individual donors are used, resuspended in fresh tissue culture medium at 1 x 106 cells/mL and plated in the interior wells of a 96-well round bottom microtiter plate at 50 µL/well. One-hundred microliters (100 µL) of each two times the concentration of compound in assay media was added to designated wells of the round-bottom 96-well plate containing the cells, in triplicate. Immediately following addition of the compound to the wells, 50 µL of a pre- determined dilution of virus was added. After 7 days in culture at 5% CO2/37°C HIV-1 replication was quantified by the measurement of cell-free HIV-1 RT activity in the tissue culture supernatant. Cytotoxicity was evaluated using the tetrazolium dye XTT following well known standard methods. Quantification of reverse transcriptase (RT) activity. Virus production by untreated and treated cells was quantified by measuring reverse transcriptase in cell-free supernatants using a standard radioactive incorporation polymerization assay. Tritiated thymidine triphosphate (TTP) was purchased at 1 Ci/mL and 1 µL was used per enzyme reaction. Poly rA and oligo dT were prepared at concentrations of 0.5 mg/mL and 1.7 Units/mL, respectively, from a stock solution which was kept at -20°C. The RT reaction buffer was prepared fresh and consists of 125 µL of 1 M EGTA, 125 µL of dH2O, 125 µL of 20% Triton X-100, 50 µL of 1 M Tris (pH 7.4), 50 µL of 1 M DTT, and 40 µL of 1 M MgCl2. For each reaction, 1 µL of TTP, 4 µL of dH2O, 2.5 µL of rAdT and 2.5 µL of reaction buffer were mixed. Ten microliters (10 µL) of this reaction mixture was placed in a round bottom microtiter plate with 15 µL of virus containing
supernatant. The plate was incubated at 37°C in a humidified incubator for 60 to 90 minutes. Following the incubation, 10 µL of the reaction volume was spotted onto Hybond -N+ paper in the appropriate plate format, washed 5 times for 5 minutes each in a 5% sodium phosphate buffer, 2 times for 1 minute each in distilled water, 2 times for 1 minute each in 70% reagent alcohol, and then air dried. The dried paper was placed in a plastic sleeve and 4 mL of Opti-Fluor O was added to each sleeve. Incorporated radioactivity was quantified utilizing a Wallac 1450 Microbeta Trilux liquid scintillation counter. Data analysis. Microsoft Excel 2010 was used to analyze and graph data. EC50 and EC90 (50% and 90% inhibition of virus replication), TC50 and TC90 (50% and 90% reduction in cell viability) and a therapeutic index (TI, TC50/EC50, TC90/ EC90) were determined. Results. Against HIV-1US/92/727 Conjugate 29a and Conjugate 29b demonstrated significant potency (Table 19). Activity of the comparator molecules AZT and Temsavir were 1.94 and 0.002 nM, respectively. Importantly there was no cytotoxicity with Conjugate 29a and Conjugate 29b against this, or any strain, in the panel (TC50 > 250 nM). Against HIV-1HT/92/599, Conjugate demonstrated single digit potency. Potency against HIV-1HT/92/594 was 0.49 nM for Conjugate 1. Conjugate 1 also displayed potent activity against the Subtype C strain (HIV-1ZA/97/009). Likewise, Conjugate 1 demonstrated significant activity against the A Subtype (HIV-1UG/92/037). Significantly, Conjugate 1 was more potent than the Temsavir comparator against the A Subtype. Conjugate 29a and Conjugate 29b were highly active against 5 clinically relevant HIV-1 strains with EC50 values in the single digit (or better) nM range. The HIV-1 panel tested in this study also included 3 different Subtypes with 3 different co-receptor tropisms indicating broad coverage by Conjugate 29a and Conjugate 29b. Lastly, this assay was conducted with actual human PBMCs isolated from 3 different donors, making the assay, and results, all the more meaningful.
Example 76. Screening of HIV antiviral small molecules and conjugates in an in vitro cell fusion inhibition assay Activity of compounds was determined by using an assay designed to measure the inhibition of cell-cell fusion mediated by gp120 and CD4, a key step in the HIV infection process. Briefly, this assay measures the fusion of two cell lines, HeLa-CD4-LTR-β-Gal (catalog #1470) and HL2/3 cells (catalog #1294), obtained from the NIH AIDS Research Reagent and Reference Program (Rockville, MD). HL2/3 cells were plated at a density of 2 x 104 cells per well in a volume of 50 µL, with 50 µL of nine serial half- logarithmic dilutions of compound in triplicate for 1 h at 37ºC/5% CO2. Following the incubation, 100 µL of HeLa-CD4-LTR-β-Gal cells were added to the plates. The cultures were incubated for an additional 48 h at 37ºC/5% CO2. Following incubation, plates were evaluated for β-galactosidase production using a chemiluminescent substrate and buffer (ThermoFisher). Briefly, all media was removed from the efficacy plates and replaced with 100 µL of DPBS. One hundred microliters of Gal-Screen substrate diluted 1:25 in Gal Screen Buffer was added to all wells of the plate. The plate was incubated for 90 min at room temperature. Following the incubation, the contents of the wells were transferred to a clear bottom white plate. The plate was covered and chemiluminescence was detected using an EnSpire multimode plate reader (PerkinElmer). In this assay, compounds were evaluated in parallel with 2 control compounds, including Chicago Sky Blue (CSB) and Temsavir, known to prevent cell fusion mediated by binding of viral gp120 to the host cell receptor (CD4). CSB and Temsavir were active at the expected concentrations with EC50 values of 3.1E+03 and 1.7 nM, respectively (Table 20). Several molecules showed sub-nM or single-digit nM potency. Table 20. Evaluation of inhibition of cell to cell fusion in HeLa-CD4-LTR-β-Gal and HL2/3 cells. All compounds inhibited HIV-induced cell to cell fusion with EC50 values ranging from 1.1E-05 to 3.1E+03 nM.
The conjugates were also evaluated for the ability to inhibit HIV-1-induced cell to cell fusion. The EC50 values defined in the assays are shown below in Table 21. Several conjugates had single-digit nM potency (Table 11) Table 21. Evaluation of inhibition of cell to cell fusion in HeLa-CD4-LTR-β-Gal and HL2/3 cells.
DAR, drug-to-antibody ratio Example 77.7-day mouse PK study comparing IV administration of Conjugate 5b, Conjugate 29a and Fc control (SEQ ID NO: 72) at 5 mg/kg Mouse PK studies were performed using male BALB/c mice 6 weeks of age (n = 2 mice/group). Mice were injected intravenously (IV) via the tail vein with 5 mg/kg of test article (5 mL/kg dose volume). Animals were housed under standard IACUC approved housing conditions. At indicated times (0.25, 1, 2, 4, 24, 48, 120, 144, and 168 h), animals were non-terminally bled (retro-orbital, cheek, or by tail vein) with blood collected in K2EDTA tubes to prevent coagulation. Collected blood was centrifuged (2,000 x g, for 10 min) and plasma withdrawn for analysis of test article concentrations over time. The Fc plasma concentrations at each time point were measured by Fc-capture sandwich ELISA as described above. The 7-day mouse PK profiles of Conjugate 5b, Conjugate 29a, and Fc control (SEQ ID NO: 72) are shown in FIG.25. Following IV administration of compound at 5 mg/kg, the average plasma exposure levels of Conjugate 5b (AUC = 2257) and Conjugate 29a (AUC = 2035) were superior to Fc control (SEQ ID NO: 72) (AUC = 1239) (FIG.25). Example 78. Screening using MT4-CCR5-Luc cell line and infectious virus Compounds and conjugates were screened in a reporter cell-line based antiviral assay (using an MT4-CCR5-Luc cell line and infectious virus) with a luciferase readout, following a protocol similar to the protocol described in Thomas et al., Journal of Immunological Methods, 480: 112766 (2020). The EC50 values are shown in Table 22, Table 23, and Table 24 (each datapoint is averaged from n = 2 runs). Table 22
Table 23
Table 24
Example 79. Activity of Conjugate 71 against HIV-1 clinical isolates strains in a cell-based assay with human peripheral blood mononuclear cells Materials. This study determined the activity of Conjugate 71 against clinical HIV-1 isolates covering five different clades, three different co-receptor specificities, and contained three multi-drug resistant isolates (MDR) obtained from the AIDS Research and Reference Reagent Program (Rockville, MD) (Table 25). Controls consisted of AZT (Sigma-Aldrich), Temsavir (Astatech, PA), and Nevirapine. Table 25. HIV strains tested
Human PBMCs were isolated from whole blood from three donors. The leukophoresed blood was diluted 1:1 with DPBS and was layered over 15 mL of Ficoll-Hypaque density gradient. These tubes were centrifuged for 30 min at 1800 rpm. Banded PBMCs were gently aspirated from the resulting interface and subsequently washed three times with DPBS by low-speed centrifugation. After the final wash, cells were enumerated by Trypan Blue dye exclusion and re-suspended at 1 x 106 cells/mL in
RPMI 1640 with 15% Fetal Bovine Serum (FBS), 2 mmol/L L-glutamine, 2 μg/mL PHA-P, 100 Units/mL penicillin and 100 μg/mL streptomycin were incubated for 72 hours at 37 oC/5% CO2. Following the incubation, the PBMCs were centrifuged and resuspended in tissue culture medium (RPMI 1640 with 15% FBS, 2 mmol/L L-glutamine, 100 U/mL penicillin, 100 μg/mL streptomycin and 3.6 ng/mL recombinant human IL-2). The cultures were then maintained until use by half culture volume change with fresh IL-2 containing tissue culture medium every 3 to 4 days. Assay conditions. Assays were initiated with PBMCs that had been induced to proliferate with PHA-P for 72 hours. For the PBMC assay, PHA-P stimulated PBMCs from three donors were pooled together to minimize the variability that occurs when cells from individual donors are used, resuspended in fresh tissue culture medium at 1 x 106 cells/mL and plated in the interior wells of a 96-well round bottom microtiter plate at 50 µL/well. One-hundred microliters (100 µL) of each two times the concentration of compound in assay media was added to designated wells of the round-bottom 96-well plate containing the cells, in triplicate. Immediately following addition of the compound to the wells, 50 µL of a pre- determined dilution of virus was added. After 7 days in culture at 5% CO2/37°C HIV-1 replication was quantified by the measurement of cell-free HIV-1 RT activity in the tissue culture supernatant. Cytotoxicity was evaluated using the tetrazolium dye XTT following well known standard methods. Quantification of reverse transcriptase (RT) activity. Virus production by untreated and treated cells was quantified by measuring reverse transcriptase in cell-free supernatants using a standard radioactive incorporation polymerization assay. Tritiated thymidine triphosphate (TTP) was purchased at 1 Ci/mL and 1 µL was used per enzyme reaction. Poly rA and oligo dT were prepared at concentrations of 0.5 mg/mL and 1.7 Units/mL, respectively, from a stock solution which was kept at -20°C. The RT reaction buffer was prepared fresh and consists of 125 µL of 1 M EGTA, 125 µL of dH2O, 125 µL of 20% Triton X-100, 50 µL of 1 M Tris (pH 7.4), 50 µL of 1 M DTT, and 40 µL of 1 M MgCl2. For each reaction, 1 µL of TTP, 4 µL of dH2O, 2.5 µL of rAdT and 2.5 µL of reaction buffer were mixed. Ten microliters (10 µL) of this reaction mixture was placed in a round bottom microtiter plate with 15 µL of virus containing supernatant. The plate was incubated at 37 °C in a humidified incubator for 60 to 90 minutes. Following the incubation, 10 µL of the reaction volume was spotted onto Hybond -N+ paper in the appropriate plate format, washed 5 times for 5 minutes each in a 5% sodium phosphate buffer, 2 times for 1 minute each in distilled water, 2 times for 1 minute each in 70% reagent alcohol, and then air dried. The dried paper was placed in a plastic sleeve and 4 mL of Opti-Fluor O was added to each sleeve. Incorporated radioactivity was quantified utilizing a Wallac 1450 Microbeta Trilux liquid scintillation counter. Data analysis. Microsoft Excel 2010 was used to analyze and graph data. EC50 and EC90 (50% and 90% inhibition of virus replication), TC50 and TC90 (50% and 90% reduction in cell viability) and a therapeutic index (TI, TC50/EC50, TC90/EC90) were determined. Results. Conjugate 71 is a conjugate with single small molecule (SM) targeting moieties (TM), which generated EC50 values ranging from 0.00828 to >100 (Table 26 and Table 27). The comparators, AZT and Temsavir, generated EC50 ranges of 0.011 to >500 and 0.0003 to >100 nM respectively (Table 26 and Table 27).
Conclusions. Against a panel of eleven clinical HIV isolates in a relevant assay using hPBMCs, Conjugate 71 demonstrated potency on par or better than two approved therapeutics against HIV. This includes coverage of MDR isolates. Lastly, cytotoxicity was undetectable for Conjugate 71. Table 26. Activity of test articles against various HIV isolates (CPE assay, hPBMC)
Table 27. Activity of test articles against multi-drug resistant HIV isolates (CPE assay, hPBMC)
Example 80. Synthesis of Int-183
A mixture of 1-(3-Bromopropyl)piperazine (1.50 g, 4.4 mmol), amino (1.53 g, 8.8 mmol) and Et3N (1.83 ml, 13.1 mmol) in acetonitrile (10 mL) were heated at 85oC for 30 min. The mixture was filtered, concentrated, and purified by RPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier). Yield 1.20 g, 63 %. LCMS: [M + H]+ = 435.5.
Step b.
To a solution of step-a product (500 mg, 0.75 mmol) in DMF (8 ml) was added 4-imidazolecarboxylic acid (85mg, 0.75mmol), HATU (574 mg, 1.5 mmol) and DIPEA (0.79 ml, 4.5 mmol). The reaction solution was stirred at r.t for 2 hours, concentrated and purified by RPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier). Yield 520 mg, 79%. Ion found by LCMS: [M + H]+ = 529.6. Step c.
A mixture of step-b product (500 mg, 0.94 mmol) and 20% Pd(OH)2 (100 mg) were dissolved in MeOH (10 ml), then vacuum flushed with hydrogen from a balloon. Reaction was complete after 2h as monitored by LCMS. The crude mixture was filtered through a pad of celite, concentrated, and used in the next step without addition purification. LCMS: [M + H]+ = 395.5. Step d.
A mixture of step-c product (298 mg, 0.75 mmol), propargyl-Peg4-mesylate (257 mg, 0.83 mmol), and potassium carbonate (310 mg, 2.26 mmol) in acetonitrile (2 mL) were heated at 65oC for 12 h. The mixture was filtered, concentrated, and purified by RPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier). Yield 242 mg, 52%. LCMS: [M + H]+ = 609.7. Step e.
The step-d product (242 mg, 0.29 mmol) was dissolved in a solution of HCl (2 ml, 4N in dioxane) and reaction was stirred for 2 hrs. The solution was concentrated to an oil and used in the next step without purification. Yield of HCl salt 147 mg. Ion found by LCMS: [M + H]+ = 509.6. Step f.
To a solution of product-e (147 mg, 0.29 mmol) in DMF (1 mL) was added triazole-carboxylic acid (98 mg, 0.19 mmol), DIPEA (0.2 mL, 1.15 mmol), HOBT (52 mg, 0.38 mmol), and HATU (146 mg, 0.38 mmol). LCMS after 30min shows complete consumption of the triazole-carboxylic acid. The reaction was concentrated and purified by semi-preparative HPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier). Yield 110 mg, 68%. LCMS: [(M + 2H)/2]+ =502.1. Step
A mixture of product-f (40 mg, 0.032 mmol), and azido-Peg4-trifluorophenol ester (18 mg, 0.042 mmol) in DMF (0.6 mL) was treated with a solution of copper(II) sulfate (1.2 mg, 0.008 mmol), sodium ascorbate (6.4 mg, 0.032 mmol), and THPTA (7 mg, 16.2 mmol) in water (0.6 mL), and stirred room temperature for 30 min. Reaction progress was monitored by LCMS. The product was purified by semi- preparative HPLC (5% to 100% ACN/water). Yield mg, %. LCMS: [(M + 2H)/2]+ = 712.7. Example 81. Synthesis of Int-190
A mixture of starting amine (177 mg, 0.4 mmol), imidazolebromide (125 mg, 0.516 mmol), and potassium carbonate (357 mg, 2.58 mmol) in acetonitrile (2 mL) were heated at 65oC for 12h. The mixture was filtered, concentrated, and purified by RPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier). Yield 160 mg, 61%. LCMS: [M + H]+ = 515.6.
Step b.
A mixture of step-a product (160 mg, 0.71 mmol) and 20% Pd(OH)2 (32 mg) were dissolved in MeOH (5 ml), then vacuum flushed with hydrogen from a balloon. Reaction was complete after 2h as monitored by LCMS. The crude mixture was filtered through a pad of celite, concentrated, and used in the next step without purification. LCMS: [M + H]+ = 381.5. Step c.
A mixture of step-b product (115 mg, 0.25 mmol), propargyl-Peg4-mesylate (95 mg, 0.3 mmol), and potassium carbonate (213 mg, 1.5 mmol) were dissolved in acetonitrile (2 mL) and heated at 65oC for 12h. The mixture was filtered, concentrated, and purified by RPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier). Yield 68 mg, 45%. LCMS: [M + H]+ = 595.7. Step d.
The step-c product (68 mg, 0.1 mmol) was dissolved in HCl (2 ml, 4N in dioxane) and stirred for 2 hrs. The solution was concentrated to an oil and used in the next step without purification. Yield of HCl salt 47.4 mg. Ion found by LCMS: [M + H]+ = 495.6. Step e.
To a solution of product from step-d (47.4 mg, 0.1 mmol) in DMF (1 mL) was added triazole- carboxylic acid (41 mg, 0.08 mmol), DIPEA (0.083 mL, 0.48 mmol), HOBT (21.6 mg, 0.16 mmol), and HATU (61 mg, 0.16 mmol). LCMS after 30min shows complete consumption of the triazole-carboxylic acid. The reaction was concentrated and purified by semi-preparative HPLC (5% to 95% acetonitrile and water, using 0.1% TFA as modifier). Yield 44 mg, 55%. LCMS: [M + H]+ = 989.1.
Step f.
A mixture of product from step-e (22 mg, 0.022 mmol), and azido-Peg4-trifluorophenol ester (12 mg, 0.028 mmol) in DMF (0.6 mL) was treated with a solution of copper(II) sulfate (0.9 mg, 0.0055 mmol), sodium ascorbate (4.4 mg, 0.022 mmol), and THPTA (4.8 mg, 0.011 mmol) in water (0.6 mL), and stirred room temperature for 30 min. Reaction progress was monitored by LCMS. The product was purified by semi-preparative HPLC (5% to 100% ACN/water). Yield 20 mg, 63%. LCMS: [(M + 2H)/2]+ = 705.7.
NUMBERED EMBODIMENTS 1. A conjugate described by any one of formulas (D-I), (M-I), (1), or (2):
(D-I) (M-I) (1) (2) wherein each A1 and each A2 is independently described by formula (A-I) or (A-II):
wherein Q is selected from the group consisting of: ,
, S is selected from the group consisting of:
R1, R2, R3, are each independently selected from H, OH, halogen, nitrile, nitro, optionally substituted amine, optionally substituted sulfhydryl, optionally substituted carboxyl, optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C3-C20 cycloalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C5-C20 aryl, optionally substituted C3-C15 heteroaryl, and optionally substituted C1-C20 alkoxy; R4 is selected from optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, optionally substituted C3- C15 heteroaryl, and a bond; R5 is selected from H or optionally substituted C1-C6 alkyl; R6 is selected from optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C3-C15 heteroaryl; U1, U2, U3, U4, and U5 are each independently selected from H, optionally substituted C1-C20 alkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C3-C15 heteroaryl. R7 and Y are each independently selected from
R8 are each independently selected from H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 alkylene, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C2-C15 heteroaryl; R9 are each independently selected from optionally substituted C1-C20 alkylene, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C2-C15 heteroaryl; x is 1 or 2; k is 0, 1, 2, 3, 4, or 5; Ar is selected from the group consisting of optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C3-C15 heteroaryl; n is 1 or 2; each E comprises an Fc domain monomer, an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide; L is a linker covalently attached to E and to each Y of each A1 or each A1 and A2; T is an integer from 1 to 20, and each squiggly line in formulas (D-I), (M-I), (1), or (2) indicates that L is covalently attached to E; or a pharmaceutically acceptable salt thereof. 2. The conjugate of embodiment 1, wherein each A1 and each A2 is independently described by formula (A-I).
3. The conjugate of embodiment 2, wherein each A1 and each A2 is independently described by any one of formulas (A-Ia)-(A-Ih):
wherein each X is independently C or N; or a pharmaceutically acceptable salt thereof. 4. The conjugate of embodiment 3, wherein each A1 and each A2 is independently described by any one of formulas (A-Ia-i)-(A-Ih-i):
or a pharmaceutically acceptable salt thereof. 5. The conjugate of embodiment 3, wherein each A1 and each A2 is independently described by any one of formulas (A-Ia-ii)-(A-Ih-ii):
6. The conjugate of embodiment 1, wherein each A1 and each A2 is independently described by any one of formulas (A-Ii)-(A-Ip):
wherein each X is independently C or N; or a pharmaceutically acceptable salt thereof. 7. The conjugate of embodiment 1, wherein each A1 and each A2 is independently described by any one of formulas (A-Iq)-(A-Ix):
or a pharmaceutically acceptable salt thereof.
8. The conjugate of embodiment 1, wherein the conjugate is described by formula (M-I):
wherein each A1 is independently described by formula (A-I); each E comprises an Fc domain monomer, and the squiggly line connected to the E indicates that each A1-L is covalently attached to E; or a pharmaceutically acceptable salt thereof. 9. The conjugate of embodiment 8, wherein the conjugate is described by formula (M-II):
wherein X is C, O, or N, or a pharmaceutically acceptable salt thereof. 10. The conjugate of embodiment 9, wherein the conjugate is described by formula (M-III):
or a pharmaceutically acceptable salt thereof.
11. The conjugate of embodiment 10, wherein the conjugate is described by formula (M-III-1):
or a pharmaceutically acceptable salt thereof. 12. The conjugate of embodiment 11, wherein the conjugate is described by formula (M-III-2):
wherein L’ is the remainder of L, and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. 13. The conjugate of embodiment 12, wherein L’ is a nitrogen atom.
14. The conjugate of embodiment 10, wherein the conjugate is described by formula (M-III-3):
or a pharmaceutically acceptable salt thereof. 15. The conjugate of embodiment 14, wherein the conjugate is described by formula (M-III-4):
wherein L’ is the remainder of L, and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. 16. The conjugate of embodiment 15, wherein L’ is a nitrogen atom.
17. The conjugate of embodiment 10, wherein the conjugate is described by formula (M-III-5):
or a pharmaceutically acceptable salt thereof. 18. The conjugate of embodiment 17, wherein the conjugate is described by formula (M-III-6):
wherein L’ is the remainder of L, and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. 19. The conjugate of embodiment 18, wherein L’ is a nitrogen atom.
20. The conjugate of embodiment 9, wherein the conjugate is described by formula (M-IV):
or a pharmaceutically acceptable salt thereof. 21. The conjugate of embodiment 20, wherein the conjugate is described by formula (M-IV-1):
or a pharmaceutically acceptable salt thereof.
22. The conjugate of embodiment 21, wherein the conjugate is described by formula (M-IV-2):
wherein L’ is the remainder of L, and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. 23. The conjugate of embodiment 22, wherein L’ is a nitrogen atom. 24. The conjugate of embodiment 20, wherein the conjugate is described by formula (M-IV-3):
or a pharmaceutically acceptable salt thereof.
25. The conjugate of embodiment 24, wherein the conjugate is described by formula (M-IV-4):
wherein L’ is the remainder of L, and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. 26. The conjugate of embodiment 25, wherein L’ is a nitrogen atom. 27. The conjugate of embodiment 20, wherein the conjugate is described by formula (M-IV-5):
or a pharmaceutically acceptable salt thereof.
28. The conjugate of embodiment 27, wherein the conjugate is described by formula (M-IV-6):
wherein L’ is the remainder of L, and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. 29. The conjugate of embodiment 28, wherein L’ is a nitrogen atom. 30. The conjugate of embodiment 20, wherein the conjugate is described by formula (M-IV-7):
or a pharmaceutically acceptable salt thereof.
31. The conjugate of embodiment 30, wherein the conjugate is described by formula (M-IV-8):
or a pharmaceutically acceptable salt thereof. 32. The conjugate of embodiment 31, wherein the conjugate is described by formula (M-IV-9):
wherein L’ is the remainder of L, and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof.
33. The conjugate of embodiment 9, wherein the conjugate is described by formula (M-V):
or a pharmaceutically acceptable salt thereof. 34. The conjugate of embodiment 33, wherein the conjugate is described by formula (M-V-1):
or a pharmaceutically acceptable salt thereof.
35. The conjugate of embodiment 34, wherein the conjugate is described by formula (M-V-2):
wherein L’ is the remainder of L, and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. 36. The conjugate of embodiment 35, wherein L’ is a nitrogen atom. 37. The conjugate of embodiment 33, wherein the conjugate is described by formula (M-V-3):
or a pharmaceutically acceptable salt thereof.
38. The conjugate of embodiment 35, wherein the conjugate is described by formula (M-V-4):
wherein L’ is the remainder of L, and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. 39. The conjugate of embodiment 38, wherein L’ is a nitrogen atom. 40. The conjugate of embodiment 33, wherein the conjugate is described by formula (M-V-5):
or a pharmaceutically acceptable salt thereof.
41. The conjugate of embodiment 40, wherein the conjugate is described by formula (M-V-6):
wherein L’ is the remainder of L, and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. 42. The conjugate of embodiment 41, wherein L’ is a nitrogen atom. 43. The conjugate of embodiment 33, wherein the conjugate is described by formula (M-V-7):
or a pharmaceutically acceptable salt thereof.
44. The conjugate of embodiment 43, wherein the conjugate is described by formula (M-V-8):
or a pharmaceutically acceptable salt thereof. 45. The conjugate of embodiment 44, wherein the conjugate is described by formula (M-V-9):
wherein L’ is the remainder of L, and y1 is an integer from 1-20 , or a pharmaceutically acceptable salt thereof.
46. The conjugate of embodiment 9, wherein the conjugate is described by formula (M-VI):
or a pharmaceutically acceptable salt thereof. 47. The conjugate of embodiment 46, wherein the conjugate is described by formula (M-VI-1):
or a pharmaceutically acceptable salt thereof.
48. The conjugate of embodiment 46, wherein the conjugate is described by formula (M-VI-2):
wherein L’ is the remainder of L, and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. 49. The conjugate of embodiment 48, wherein L’ is a nitrogen atom. 50. The conjugate of embodiment 46, wherein the conjugate is described by formula (M-VI-3):
or a pharmaceutically acceptable salt thereof.
51. The conjugate of embodiment 46, wherein the conjugate is described by formula (M-VI-4):
wherein L’ is the remainder of L, and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. 52. The conjugate of embodiment 51, wherein L’ is a nitrogen atom. 53. The conjugate of embodiment 46, wherein the conjugate is described by formula (M-VI-5):
or a pharmaceutically acceptable salt thereof.
54. The conjugate of embodiment 53, wherein the conjugate is described by formula (M-VI-6):
wherein L’ is the remainder of L, and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. 55. The conjugate of embodiment 54, wherein L’ is a nitrogen atom. 56. The conjugate of embodiment 46, wherein the conjugate is described by formula (M-VI-7):
wherein L’ is the remainder of L, or a pharmaceutically acceptable salt thereof.
57. The conjugate of embodiment 46, wherein the conjugate is described by formula (M-VI-8):
wherein L’ is the remainder of L, or a pharmaceutically acceptable salt thereof. 58. The conjugate of embodiment 8, wherein the conjugate is described by formula (M-VII):
wherein X is C, O, or N, or a pharmaceutically acceptable salt thereof.
59. The conjugate of embodiment 58, wherein the conjugate is described by formula (M-VIII):
or a pharmaceutically acceptable salt thereof. 60. The conjugate of embodiment 59, wherein the conjugate is described by formula (M-VIII-1):
wherein L’ is the remainder of L, and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. 61. The conjugate of embodiment 60, wherein L’ is a nitrogen atom.
62. The conjugate of emb
odiment 58, wherein the conjugate is described by formula (M-IX):
or a pharmaceutically acceptable salt thereof. 63. The conjugate of embodiment 62, wherein the conjugate is described by formula (M-IX-1):
wherein L’ is the remainder of L, and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. 64. The conjugate of embodiment 63, wherein L’ is a nitrogen atom.
65. The conjugate of embodiment 58, wherein the conjugate is described by formula (M-X):
or a pharmaceutically acceptable salt thereof. 66. The conjugate of embodiment 65, wherein the conjugate is described by formula (M-X-1):
wherein L’ is the remainder of L, and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. 67. The conjugate of embodiment 66, wherein L’ is a nitrogen atom.
68. The conjugate of embodiment 58, wherein the conjugate is described by formula (M-XI):
or a pharmaceutically acceptable salt thereof. 69. The conjugate of embodiment 68, wherein the conjugate is described by formula (M-XI-1):
wherein L’ is the remainder of L, and y1 is an integer from 1-20, or a pharmaceutically acceptable salt thereof. 70. The conjugate of embodiment 69, wherein L’ is a nitrogen atom.
71. The conjugate of embodiment 8, wherein the conjugate is described by formula (M-XII):
wherein X is C, O, or N, or a pharmaceutically acceptable salt thereof. 72. The conjugate of embodiment 71, wherein the conjugate is described by formula (M-XII-1):
or a pharmaceutically acceptable salt thereof. 73. The conjugate of embodiment 72, wherein the conjugate is described by formula (M-XII-2):
or a pharmaceutically acceptable salt thereof.
74. The conjugate of embodiment 8, wherein the conjugate is described by formula (M-XIII):
or a pharmaceutically acceptable salt thereof. 75. The conjugate of embodiment 74, wherein the conjugate is described by formula (M-XIII-1):
or a pharmaceutically acceptable salt thereof. 76. The conjugate of embodiment 75, wherein the conjugate is described by formula (M-XIII-2):
or a pharmaceutically acceptable salt thereof.
77. The conjugate of any one of embodiments 1-76, wherein L or L’ comprises one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, optionally substituted C3-C15 heteroarylene, O, S, NRi,
, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino, wherein each Ri is, independently, H, optionally substituted C1-C20 alkyl, optionally substituted C1- C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3- C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl. 78. The conjugate of embodiment 77, wherein the backbone of L or L’ consists of one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, optionally substituted C3-C15 heteroarylene, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or
imino, wherein each Ri is, independently, H, optionally substituted C1-C20 alkyl, optionally substituted C1- C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3- C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl. 79. The conjugate of embodiment 77 or 78, wherein L or L’ is oxo substituted.
80. The conjugate of any one of embodiments 1-79, wherein the backbone of L or L’ comprises no more than 250 atoms. 81. The conjugate of any one of embodiments 1-80, wherein L or L’ is capable of forming an amide, a carbamate, a sulfonyl, or a urea linkage. 82. The conjugate of any one of embodiments 1-80, wherein L or L’ is a bond. 83. The conjugate of any one of embodiments 1-80, wherein L or L’ is an atom. 84. The conjugate of any one of embodiments 1-83, wherein each L is described by formula (M- L): J1-(Q1)g-(T1)h-(Q2)i-(T2)j-(Q3)k-(T3)l-(Q4)m-(T4)n-(Q5)o-J2 wherein J1 is a bond attached to A1; J2 is a bond attached to E or a functional group capable of reacting with a functional group conjugated to E (e.g., maleimide and cysteine, amine and activated carboxylic acid (e.g., carboxylic acid activated by tetrafluorophenol or trifluorophenol), thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and alkyne, and alkene and tetrazine); each of Q1, Q2, Q3, Q4, and Q5 is, independently, optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C2-C15 heteroarylene; each of T1, T2, T3, T4 is, independently, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino; Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4- C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C2-C15 heteroaryl; and each of g, h, i, j, k, l, m, n, and o is, independently, 0, 1, or 2; or a pharmaceutically acceptable salt thereof.
85. The conjugate of any one of embodiments 1-84, wherein each L is described by formula (M-L-I): J1-(Q1)g-(T1)h-(Q2)i-(T2)j-(Q3)k-(T3)l-(Q4)m-(T4)n-(Q5)o-J2 wherein J1 is a bond attached A1; J2 is a bond attached to E; each of Q1, Q2, Q3, Q4, and Q5 is, independently, optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C2-C15 heteroarylene; each of T1, T2, T3, T4 is, independently, O, S, NRi,
, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino, wherein each Ri is, independently, H, optionally substituted C1-C20 alkyl, optionally substituted C1- C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3- C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C2-C15 heteroaryl; and each of g, h, i, j, k, l, m, n, and o is, independently, 0 or 1. 86. The conjugate of embodiment 85, wherein L comprises:
, ,
, where each e is, independently, an integer from 1 to 20. 87. The conjugate of embodiment 85, wherein L comprises: ,
,
, where each e is, independently, an integer from 1 to 20. 88. The conjugate of any one of embodiments 1-87, wherein the squiggly line connected to E indicates that the L of each A1-L or each A1-L-A2 is covalently attached to a nitrogen atom of a solvent- exposed lysine of E.
89. The conjugate of any one of embodiments 1-87, wherein the squiggly line connected to E indicates that the L of each A1-L or each A1-L-A2 is covalently attached to the sulfur atom of a solvent- exposed cysteine of E. 90. The conjugate of any one of embodiments 1-89, wherein each E is an Fc domain monomer. 91. The conjugate of embodiment 90, wherein n is 2, and each E dimerizes to form an Fc domain. 92. The conjugate of any one of embodiments 1-91, wherein each E has the sequence of any one of SEQ ID NOs: 1-95 and 125-153. 93. The conjugate of embodiment 92, wherein each E comprises the sequence of SEQ ID NO: 72 or SEQ ID NO: 73. 94. A conjugate selected from any one of conjugates 1-121. 95. The conjugate of embodiment 94, wherein the conjugate is selected from any one of conjugates 5b, 9b, 12b, 13b, 14b, 15-28, 29a, 29b, 30a, 31-36, 37a, and 38-48. 96. The conjugate of embodiment 94, wherein the conjugate is selected from any one of conjugates 30b, 37b, and 49-76. 97. The conjugate of embodiment 94, wherein the conjugate is selected from any one of conjugates 77-121. 98. A conjugate described by a formula of Table 2. 99. The conjugate of any one of embodiments 1-98, wherein T is 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10. 100. A population of conjugates of any one of embodiments 1-98, wherein the average value of T is 1 to 10. 101. A population of conjugates of embodiment 100, wherein the average value of T is 1 to 5. 102. A pharmaceutical composition comprising a conjugate of any one of embodiments 1-99, or a population of conjugates of embodiments 100 or 101, or a pharmaceutically acceptable salt thereof, and a pharmaceutically acceptable excipient.
103. A method for the treatment of a subject having a viral infection or presumed to have a viral infection, the method comprising administering to the subject an effective amount of a conjugate of any one of embodiments 1-99, a population of conjugates of embodiments 100 or 101, or a composition of embodiment 102. 104. A method for the prophylactic treatment of a viral infection in a subject in need thereof, the method comprising administering to the subject an effective amount of a conjugate of any one of embodiments 1-99, a population of conjugates of embodiments 100 or 101, or a composition of embodiment 102. 105. The method of embodiment 103 or 104, wherein the viral infection is caused by human immunodeficiency virus (HIV). 106. The method of embodiment 105, wherein the HIV is HIV-1 or HIV-2. 107. The method of any one of embodiments 103-106, wherein the subject is immunocompromised. 108. The method of any one of embodiments 103-107, wherein the subject has been diagnosed with humoral immune deficiency, T cell deficiency, neutropenia, asplenia, or complement deficiency. 109. The method of any one of embodiments 103-108, wherein the subject is being treated or is about to be treated with an immunosuppressive therapy. 110. The method of any one of embodiments 103-109, wherein said subject has been diagnosed with a disease which causes immunosuppression. 111. The method of embodiment 110, wherein the disease is cancer. 112. The method of embodiment 111, wherein the cancer is leukemia, lymphoma, or multiple myeloma. 113. The method of any one of embodiments 103-112, wherein the subject has undergone or is about to undergo hematopoietic stem cell transplantation. 114. The method of any one of embodiments 103-113, wherein the subject has undergone or is about to undergo an organ transplant. 115. The method of any one of embodiments 103-114, wherein the conjugate of composition is administered intramuscularly, intravenously, intradermally, intraarterially, intraperitoneally, intralesionally,
intracranially, intraarticularly, intraprostatically, intrapleurally, intratracheally, intranasally, intravitreally, intravaginally, intrarectally, topically, intratumorally, peritoneally, subcutaneously, subconjunctival, intravesicularlly, mucosally, intrapericardially, intraumbilically, intraocularally, orally, locally, by inhalation, by injection, or by infusion. 116. The method of any one of embodiments 103-115, wherein the subject is treated with a second therapeutic agent. 117. The method of embodiment 116, wherein the second therapeutic agent is an antiviral agent. 118. The method of embodiment 117, wherein the antiviral agent is selected from an integrase inhibitor, a nucleoside reverse transcriptase inhibitor (NRTI), a non-nucleoside reverse transcriptase inhibitor (NNRTI), a protease inhibitor, an inhibitor of viral entry, a CCR5 antagonist, or a CYP3A inhibitor. 119. The method of embodiment 118, wherein the integrase inhibitor is selected from dolutegravir, elvitegravir, or raltegravir. 120. The method of embodiment 118, wherein the nucleoside reverse transcriptase inhibitor (NRTI) is selected from abacavir, lamivudine, zidovudine, emtricitabine, tenofovir, emtricitabine, didanosine, or stavudine. 121. The method of embodiment 118, wherein the non-nucleoside reverse transcriptase inhibitor (NNRTI) is selected from efavirenz, etravirine, nevirapine, rilpivirine, or delavirdine. 122. The method of embodiment 118, wherein the protease inhibitor is selected from atazanavir, cobicistat, darunavir, cobicistat, lopinavir, ritonavir, fosamprenavir, tipranavir, nelfinavir, indinavir, or saquinavir. 123. The method of embodiment 118, wherein the inhibitor of viral entry is enfuvirtide. 124. The method of embodiment 118, wherein the CCR5 antagonist is maraviroc. 125. The method of embodiment 118, wherein the CYP3A inhibitor is cobicistat or ritonavir. 126. A method of synthesizing a conjugate of any one of embodiments 1-99, a population of conjugates of any one of embodiments 100 or 101, or pharmaceutical composition of embodiment 102, the method comprising: (a) providing a first composition including E;
(b) providing a second composition including a compound of formula (DF-I) or salt thereof:
wherein L’ is the remainder of L; m is 0, 1, 2, 3, or 4; and each R is, independently, halo, cyano, nitro, optionally substituted C1-C6 alkyl group, or optionally substituted C1-C6 heteroalkyl group; and (c) combining the first composition, the second composition, and a buffer to form a mixture. 127. A method of synthesizing a conjugate of any one of embodiments 1-99, a population of conjugates of any one of embodiments 100 or 101, or pharmaceutical composition of embodiment 102, the method comprising: (a) providing a first composition including E; (b) providing a second composition
including a compound of formula (MF-I) or salt thereof:
wherein L’ is the remainder of L; m is 0, 1, 2, 3, or 4; and each R is, independently, halo, cyano, nitro, optionally substituted C1-C6 alkyl group, or optionally substituted C1-C6 heteroalkyl group; and (c) combining the first composition, the second composition, and a buffer to form a mixture. 128. A method of synthesizing a conjugate of any one of embodiments 1-99, a population of conjugates of any one of embodiments 100 or 101, or pharmaceutical composition of embodiment 102, the method comprising: (a) providing a first composition including E;
(b) providing a second composition including a compound of formula (DF-II) or salt thereof:
wherein G is optionally substituted C1-C6 alkylene, optionally substituted C1-C6 heteroalkylene, optionally substituted C2-C6 alkenylene, optionally substituted C2-C6 heteroalkenylene, optionally substituted C2-C6 alkynylene, optionally substituted C2-C6 heteroalkynylene, optionally substituted C3-C10 cycloalkylene, optionally substituted C2-C10 heterocycloalkylene, optionally substituted C6-C10 arylene, or optionally substituted C2-C10 heteroarylene; L’-G-L’’ is the remainder of L; m is 0, 1, 2, 3, or 4; and each R is, independently, halo, cyano, nitro, optionally substituted C1-C6 alkyl group, or optionally substituted C1-C6 heteroalkyl group; and (c) combining the first composition, the second composition, and a buffer to form a mixture. 129. A method of synthesizing a conjugate of any one of embodiments 1-99, a population of conjugates of any one of embodiments 100 or 101, or pharmaceutical composition of embodiment 102, the method comprising: (a) providing a first composition including E; (b) providing a second composition including a compound of formula (MF-II) or salt thereof:
wherein G is optionally substituted C1-C6 alkylene, optionally substituted C1-C6 heteroalkylene, optionally substituted C2-C6 alkenylene, optionally substituted C2-C6 heteroalkenylene, optionally substituted C2-C6
alkynylene, optionally substituted C2-C6 heteroalkynylene, optionally substituted C3-C10 cycloalkylene, optionally substituted C2-C10 heterocycloalkylene, optionally substituted C6-C10 arylene, or optionally substituted C2-C10 heteroarylene; L’-G-L’’ is the remainder of L; m is 0, 1, 2, 3, or 4; and each R is, independently, halo, cyano, nitro, optionally substituted C1-C6 alkyl group, or optionally substituted C1-C6 heteroalkyl group; and (c) combining the first composition, the second composition, and a buffer to form a mixture. 130. A method of synthesizing a conjugate of any one of embodiments 1-99, a population of conjugates of any one of embodiments 100 or 101, or pharmaceutical composition of embodiment 102, the method comprising: (a) providing a first composition including formula (D-G3-A) or a salt thereof:
where Ga is a functional group that reacts with Gb to form G; (b) providing a second composition including formula (D-G3-B) or a salt thereof:
where Gb is a functional group that reacts with Ga to form G; and (c) combining the first composition and the second composition to form a first mixture, where m is 0, 1, 2, 3, or 4; and each R is, independently, halo, cyano, nitro, optionally substituted C1-C6 alkyl group, or optionally substituted C1-C6 heteroalkyl group. 131. A method of synthesizing a conjugate of any one of embodiments 1-99, a population of conjugates of any one of embodiments 100 or 101, or pharmaceutical composition of embodiment 102, the method comprising: (a) providing a first composition including formula (M-G3-A) or a salt thereof:
where Ga is a functional group that reacts with Gb to form G;
(b) providing a second composition including formula (M-G3-B) or a salt thereof:
where Gb is a functional group that reacts with Ga to form G; and (c) combining the first composition and the second composition to form a first mixture.
Other Embodiments While the invention has been described in connection with specific embodiments thereof, it will be understood that it is capable of further modifications and this application is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departures from the invention that come within known or customary practice within the art to which the invention pertains and may be applied to the essential features hereinbefore set forth, and follows in the scope of the claims. All publications, patents, and patent applications mentioned in the above specification are hereby incorporated by reference to the same extent as if each individual publication, patent or patent application was specifically and individually indicated to be incorporated by reference in its entirety. Detailed descriptions of one or more preferred embodiments are provided herein. It is to be understood, however, that the present invention may be embodied in various forms. Therefore, specific details disclosed herein are not to be interpreted as limiting, but rather as a basis for the claims and as a representative basis for teaching one skilled in the art to employ the present invention in any appropriate manner.
Claims
Claims 1. A conjugate described by any one of formulas (D-I), (M-I), (1), or (2):
wherein each A1 and each A2 is independently described by formula (A-I) or (A-II):
wherein Q is selected from the group consisting of:
, S is selected from the group consisting of:
R1, R2, R3, are each independently selected from H, OH, halogen, nitrile, nitro, optionally substituted amine, optionally substituted sulfhydryl, optionally substituted carboxyl, optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C3-C20 cycloalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C5-C20 aryl, optionally substituted C3-C15 heteroaryl, and optionally substituted C1-C20 alkoxy; R4 is selected from optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, optionally substituted C3- C15 heteroaryl, and a bond; R5 is selected from H or optionally substituted C1-C6 alkyl; R6 is selected from optionally substituted C1-C20 alkyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C3-C15 heteroaryl; U1, U2, U3, U4, and U5 are each independently selected from H, optionally substituted C1-C20 alkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C3-C15 heteroaryl. R7 and Y are each independently selected from
R8 are each independently selected from H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 alkylene, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C2-C15 heteroaryl; R9 are each independently selected from optionally substituted C1-C20 alkylene, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C2-C15 heteroaryl; x is 1 or 2; k is 0, 1, 2, 3, 4, or 5; Ar is selected from the group consisting of optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C5-C15 aryl, and optionally substituted C3-C15 heteroaryl; n is 1 or 2; each E comprises an Fc domain monomer, an albumin protein, an albumin protein-binding peptide, or an Fc-binding peptide; L is a linker covalently attached to E and to each Y of each A1 or each A1 and A2; T is an integer from 1 to 20, and each squiggly line in formulas (D-I), (M-I), (1), or (2) indicates that L is covalently attached to E; or a pharmaceutically acceptable salt thereof.
2. The conjugate of claim 1, wherein each A1 and each A2 is independently described by formula (A-I).
4. The conjugate of claim 3, wherein each A1 and each A2 is independently described by any one of formulas (A-Ia-ii)-(A-Ih-ii):
5. The conjugate of claim 1, wherein the conjugate is described by formula (M-I):
wherein each A1 is independently described by formula (A-I); each E comprises an Fc domain monomer, and the squiggly line connected to the E indicates that each A1-L is covalently attached to E; or a pharmaceutically acceptable salt thereof.
10. The conjugate of claim 9, wherein L’ is a nitrogen atom.
13. The conjugate of claim 12, wherein L’ is a nitrogen atom.
16. The conjugate of claim 15, wherein L’ is a nitrogen atom.
19. The conjugate of any one of claims 1-18, wherein L or L’ comprises one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, optionally substituted C3-C15 heteroarylene, O, S, NRi,
, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino, wherein each Ri is, independently, H, optionally substituted C1-C20 alkyl, optionally substituted C1- C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3- C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl.
20. The conjugate of claim 19, wherein the backbone of L or L’ consists of one or more optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally
substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, optionally substituted C3-C15 heteroarylene, O, S, NRi, , P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or
imino, wherein each Ri is, independently, H, optionally substituted C1-C20 alkyl, optionally substituted C1- C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3- C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl.
21. The conjugate of claim 19 or 20, wherein L or L’ is oxo substituted.
22. The conjugate of any one of claims 1-21, wherein the backbone of L or L’ comprises no more than 250 atoms.
23. The conjugate of any one of claims 1-22, wherein L or L’ is capable of forming an amide, a carbamate, a sulfonyl, or a urea linkage.
24. The conjugate of any one of claims 1-22, wherein L or L’ is a bond.
25. The conjugate of any one of claims 1-22, wherein L or L’ is an atom.
26. The conjugate of any one of claims 1-25, wherein each L is described by formula (M-L): J1-(Q1)g-(T1)h-(Q2)i-(T2)j-(Q3)k-(T3)l-(Q4)m-(T4)n-(Q5)o-J2 wherein J1 is a bond attached to A1; J2 is a bond attached to E or a functional group capable of reacting with a functional group conjugated to E (e.g., maleimide and cysteine, amine and activated carboxylic acid (e.g., carboxylic acid activated by tetrafluorophenol or trifluorophenol), thiol and maleimide, activated sulfonic acid and amine, isocyanate and amine, azide and alkyne, and alkene and tetrazine); each of Q1, Q2, Q3, Q4, and Q5 is, independently, optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20
heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C2-C15 heteroarylene; each of T1, T2, T3, T4 is, independently, O, S, NRi, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino; Ri is H, optionally substituted C1-C20 alkyl, optionally substituted C1-C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3-C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4- C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C2-C15 heteroaryl; and each of g, h, i, j, k, l, m, n, and o is, independently, 0, 1, or 2; or a pharmaceutically acceptable salt thereof.
27. The conjugate of any one of claims 1-26, wherein each L is described by formula (M-L-I): J1-(Q1)g-(T1)h-(Q2)i-(T2)j-(Q3)k-(T3)l-(Q4)m-(T4)n-(Q5)o-J2 wherein J1 is a bond attached A1; J2 is a bond attached to E; each of Q1, Q2, Q3, Q4, and Q5 is, independently, optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, optionally substituted C2-C20 heteroalkynylene, optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C3-C15 heteroarylene; each of T1, T2, T3, T4 is, independently, O, S, NRi,
, P, carbonyl, thiocarbonyl, sulfonyl, phosphate, phosphoryl, or imino, wherein each Ri is, independently, H, optionally substituted C1-C20 alkyl, optionally substituted C1- C20 heteroalkyl, optionally substituted C2-C20 alkenyl, optionally substituted C2-C20 heteroalkenyl, optionally substituted C2-C20 alkynyl, optionally substituted C2-C20 heteroalkynyl, optionally substituted C3- C20 cycloalkyl, optionally substituted C2-C20 heterocycloalkyl, optionally substituted C4-C20 cycloalkenyl, optionally substituted C4-C20 heterocycloalkenyl, optionally substituted C8-C20 cycloalkynyl, optionally substituted C8-C20 heterocycloalkynyl, optionally substituted C5-C15 aryl, or optionally substituted C3-C15 heteroaryl; and each of g, h, i, j, k, l, m, n, and o is, independently, 0 or 1.
28. The conjugate of claim 26 or 27, wherein L is a linker described by formula (M-L-Ia): J1-Q1-Q2-Q3-Q4-Q5-J2.
29. The conjugate of claim 26 or 27, wherein L is a linker described by formula (M-L-Ib): J1-Q2-Q3-Q4-Q5-J2.
30 The conjugate of claim 26 or 27, wherein L is a linker described by formula (M-L-Ic): J1-Q1-Q3-Q4-Q5-J2.
31. The conjugate of claim 26 or 27, wherein L is a linker described by formula (M-L-Id): J1-Q2-T2-Q3-Q4-Q5-J2.
32. The conjugate of any one of claims 26-31, wherein Q1 is optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, or optionally substituted C2-C20 heteroalkynylene; Q2 is optionally substituted C3-C20 cycloalkylene, optionally substituted C2-C20 heterocycloalkylene, optionally substituted C4-C20 cycloalkenylene, optionally substituted C4-C20 heterocycloalkenylene, optionally substituted C8-C20 cycloalkynylene, optionally substituted C8-C20 heterocycloalkynylene, optionally substituted C5-C15 arylene, or optionally substituted C2-C15 heteroarylene; Q3 is optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, or optionally substituted C2-C20 heteroalkynylene; Q4 is optionally substituted C5-C15 arylene or optionally substituted C2-C15 heteroarylene; and Q5 is optionally substituted C1-C20 alkylene, optionally substituted C1-C20 heteroalkylene, optionally substituted C2-C20 alkenylene, optionally substituted C2-C20 heteroalkenylene, optionally substituted C2-C20 alkynylene, or optionally substituted C2-C20 heteroalkynylene.
34. The conjugate of any one of claims 1-33, wherein the squiggly line connected to E indicates that the L of each A1-L or each A1-L-A2 is covalently attached to a nitrogen atom of a solvent-exposed lysine of E.
35. The conjugate of any one of claims 1-33, wherein the squiggly line connected to E indicates that the L of each A1-L or each A1-L-A2 is covalently attached to the sulfur atom of a solvent-exposed cysteine of E.
36. The conjugate of any one of claims 1-35, wherein each E is an Fc domain monomer.
37. The conjugate of claim 36, wherein n is 2, and each E dimerizes to form an Fc domain.
38. The conjugate of any one of claims 1-37, wherein each E has the sequence of any one of SEQ ID NOs: 1-95 and 125-153.
39. The conjugate of claim 38, wherein each E comprises the sequence of SEQ ID NO: 72 or SEQ ID NO: 73.
40. A conjugate selected from any one of conjugates 1-121.
41. The conjugate of claim 40, wherein the conjugate is selected from any one of conjugates 5b, 9b, 12b, 13b, 14b, 15-28, 29a, 29b, 30a, 31-36, 37a, and 38-48.
42. The conjugate of claim 40, wherein the conjugate is selected from any one of conjugates 30b, 37b, and 49-76.
43. The conjugate of claim 40, wherein the conjugate is selected from any one of conjugates 77- 121.
44. A conjugate described by a formula of Table 2.
45. The conjugate of any one of claims 1-44, wherein T is 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10.
46. A population of conjugates of any one of claims 1-45, wherein the average value of T is 1 to 10.
47. A population of conjugates of claim 46, wherein the average value of T is 1 to 5.
48. A pharmaceutical composition comprising a conjugate of any one of claims 1-45, or a population of conjugates of claims 46 or 47, or a pharmaceutically acceptable salt thereof, and a pharmaceutically acceptable excipient.
49. A method for the treatment of a subject having a viral infection or presumed to have a viral infection, the method comprising administering to the subject an effective amount of a conjugate of any one of claims 1-45, a population of conjugates of claims 46 or 47, or a composition of claim 48.
50. A method for the prophylactic treatment of a viral infection in a subject in need thereof, the method comprising administering to the subject an effective amount of a conjugate of any one of claims 1-45, a population of conjugates of claims 46 or 47, or a composition of claim 48.
51. The method of claim 49 or 50, wherein the viral infection is caused by human immunodeficiency virus (HIV).
52. The method of claim 51, wherein the HIV is HIV-1 or HIV-2.
53. The method of any one of claims 49-52, wherein the subject is immunocompromised.
54. The method of any one of claims 49-53, wherein the subject has been diagnosed with humoral immune deficiency, T cell deficiency, neutropenia, asplenia, or complement deficiency.
55. The method of any one of claims 49-54, wherein the subject is being treated or is about to be treated with an immunosuppressive therapy.
56. The method of any one of claims 49-55, wherein said subject has been diagnosed with a disease which causes immunosuppression.
57. The method of claim 56, wherein the disease is cancer.
58. The method of claim 57, wherein the cancer is leukemia, lymphoma, or multiple myeloma.
59. The method of any one of claims 49-58, wherein the subject has undergone or is about to undergo hematopoietic stem cell transplantation.
60. The method of any one of claims 49-59, wherein the subject has undergone or is about to undergo an organ transplant.
61. The method of any one of claims 49-60, wherein the conjugate of composition is administered intramuscularly, intravenously, intradermally, intraarterially, intraperitoneally, intralesionally,
intracranially, intraarticularly, intraprostatically, intrapleurally, intratracheally, intranasally, intravitreally, intravaginally, intrarectally, topically, intratumorally, peritoneally, subcutaneously, subconjunctival, intravesicularlly, mucosally, intrapericardially, intraumbilically, intraocularally, orally, locally, by inhalation, by injection, or by infusion.
62. The method of any one of claims 49-61, wherein the subject is treated with a second therapeutic agent.
63. The method of claim 62, wherein the second therapeutic agent is an antiviral agent.
64. The method of claim 63, wherein the antiviral agent is selected from an integrase inhibitor, a nucleoside reverse transcriptase inhibitor (NRTI), a non-nucleoside reverse transcriptase inhibitor (NNRTI), a protease inhibitor, an inhibitor of viral entry, a CCR5 antagonist, or a CYP3A inhibitor.
65. The method of claim 64, wherein the integrase inhibitor is selected from dolutegravir, elvitegravir, or raltegravir.
66. The method of claim 64, wherein the nucleoside reverse transcriptase inhibitor (NRTI) is selected from abacavir, lamivudine, zidovudine, emtricitabine, tenofovir, emtricitabine, didanosine, or stavudine.
67. The method of claim 64, wherein the non-nucleoside reverse transcriptase inhibitor (NNRTI) is selected from efavirenz, etravirine, nevirapine, rilpivirine, or delavirdine.
68. The method of claim 64, wherein the protease inhibitor is selected from atazanavir, cobicistat, darunavir, cobicistat, lopinavir, ritonavir, fosamprenavir, tipranavir, nelfinavir, indinavir, or saquinavir.
69. The method of claim 64, wherein the inhibitor of viral entry is enfuvirtide.
70. The method of claim 64, wherein the CCR5 antagonist is maraviroc.
71. The method of claim 64, wherein the CYP3A inhibitor is cobicistat or ritonavir.
72. A method of synthesizing a conjugate of any one of claims 1-45, a population of conjugates of any one of claims 46 or 47, or pharmaceutical composition of claim 48, the method comprising: (a) providing a first composition including E;
(b) providing a second composition including a compound of formula (MF-I) or salt thereof:
wherein L’ is the remainder of L; m is 0, 1, 2, 3, or 4; and each R is, independently, halo, cyano, nitro, optionally substituted C1-C6 alkyl group, or optionally substituted C1-C6 heteroalkyl group; and (c) combining the first composition, the second composition, and a buffer to form a mixture.
73. A method of synthesizing a conjugate of any one of claims 1-45, a population of conjugates of any one of claims 46 or 47, or pharmaceutical composition of claim 48, the method comprising: (a) providing a first composition including E; (b) providing a second composition
including a compound of formula (MF-II) or salt thereof:
wherein G is optionally substituted C1-C6 alkylene, optionally substituted C1-C6 heteroalkylene, optionally substituted C2-C6 alkenylene, optionally substituted C2-C6 heteroalkenylene, optionally substituted C2-C6 alkynylene, optionally substituted C2-C6 heteroalkynylene, optionally substituted C3-C10 cycloalkylene, optionally substituted C2-C10 heterocycloalkylene, optionally substituted C6-C10 arylene, or optionally substituted C2-C10 heteroarylene; L’-G-L’’ is the remainder of L; m is 0, 1, 2, 3, or 4; and each R is, independently, halo, cyano, nitro, optionally substituted C1-C6 alkyl group, or optionally substituted C1-C6 heteroalkyl group; and (c) combining the first composition, the second composition, and a buffer to form a mixture.
Applications Claiming Priority (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202063126947P | 2020-12-17 | 2020-12-17 | |
US63/126,947 | 2020-12-17 | ||
US202163159959P | 2021-03-11 | 2021-03-11 | |
US63/159,959 | 2021-03-11 | ||
US202163187906P | 2021-05-12 | 2021-05-12 | |
US63/187,906 | 2021-05-12 | ||
US202163246758P | 2021-09-21 | 2021-09-21 | |
US63/246,758 | 2021-09-21 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022133281A1 true WO2022133281A1 (en) | 2022-06-23 |
Family
ID=80123110
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2021/064151 WO2022133281A1 (en) | 2020-12-17 | 2021-12-17 | Compositions and methods for the treatment of human immunodeficiency virus |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2022133281A1 (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050287153A1 (en) | 2002-06-28 | 2005-12-29 | Genentech, Inc. | Serum albumin binding peptides for tumor targeting |
US20170081389A1 (en) | 2009-02-11 | 2017-03-23 | Albumedix A/S | Albumin variants and conjugates |
WO2020051498A1 (en) * | 2018-09-06 | 2020-03-12 | Cidara Therapeutics, Inc. | Compositions and methods for the treatment of viral infections |
WO2020252393A1 (en) | 2019-06-13 | 2020-12-17 | Cidara Therapeutics, Inc. | Compositions and methods for the treatment of human immunodeficiency virus |
-
2021
- 2021-12-17 WO PCT/US2021/064151 patent/WO2022133281A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050287153A1 (en) | 2002-06-28 | 2005-12-29 | Genentech, Inc. | Serum albumin binding peptides for tumor targeting |
US20170081389A1 (en) | 2009-02-11 | 2017-03-23 | Albumedix A/S | Albumin variants and conjugates |
WO2020051498A1 (en) * | 2018-09-06 | 2020-03-12 | Cidara Therapeutics, Inc. | Compositions and methods for the treatment of viral infections |
WO2020252393A1 (en) | 2019-06-13 | 2020-12-17 | Cidara Therapeutics, Inc. | Compositions and methods for the treatment of human immunodeficiency virus |
Non-Patent Citations (21)
Title |
---|
"Pharmaceutical Preformulation and Formulation", 2009, CRC PRESS |
"Remington: The Science and Practice of Pharmacy", 2012, LIPPINCOTT WILLIAMS & WILKINS |
"Swissprot", Database accession no. P14639-1 |
"UniProtKB", Database accession no. P01861 |
AGARWALL. P. ET AL., BIOCONJUGATE CHEM., vol. 26, 2015, pages 176 - 192 |
BOSSE ET AL., J CLIN PHARMACOL, vol. 45, no. 1, 2005, pages 57 - 67 |
CARTER ET AL., SCIENCE, vol. 244, no. 4909, 1998, pages 1195 - 8 |
DEMARCO ET AL., INTERNATIONAL JOURNAL FOR PARASITOLOGY, vol. 37, no. 11, 2007, pages 1201 - 1208 |
J. SWARBRICKJ. C. BOYLANMARCEL DEKKE: "Encyclopedia of Pharmaceutical Technology", 2013 |
KHAN ET AL., INT. J. BIOL. MACROMOL., vol. 30, no. 3-4, 2002, pages 171 - 8 |
LAI YEN-TING ET AL: "Lattice engineering enables definition of molecular features allowing for potent small-molecule inhibition of HIV-1 entry", NATURE COMMUNICATIONS, vol. 10, 47, 3 January 2019 (2019-01-03), pages 1 - 11, XP055907525, DOI: 10.1038/s41467-018-07851-1 * |
LEGER ET AL., BIOORG MED CHEM LETT, vol. 14, no. 17, 2004, pages 4395 - 8 |
LU ET AL., CURR. TOP. MED. CHEM., vol. 16, no. 10, pages 1074 - 1090 |
MAEDA, A. ET AL.: "Identification of human IgG1 variant with enhanced FcRn binding and without increased binding to rheumatoid factor autoantibody", MABS, vol. 9, no. 5, 2017, pages 844 - 853, XP055827394, DOI: 10.1080/19420862.2017.1314873 |
PETERS ET AL.: "All About Albumin: Biochemistry, Genetics and Medical Applications", 1996, ACADEMIC PRESS, INC., pages: 170 - 181 |
SAZINSKY S.L. ET AL.: "Aglycosylated immunoglobulin G1 variants productively engage activating Fc receptors", PNAS, vol. 105, no. 51, 2008, pages 20167 - 20172, XP009143223, DOI: 10.1073/pnas.0809257105 |
SLEEP ET AL., BIOTECHNOLOGY, vol. 8, no. 1, 1990, pages 42 - 6 |
SUGIO ET AL., PROTEIN ENG., vol. 12, no. 6, 1999, pages 439 - 46 |
THIBAUDEAU ET AL., BIOCONJUG CHEM, vol. 16, no. 4, 2005, pages 1000 - 8 |
VIDARSSON ET AL.: "IgG subclasses and allotypes: from structure to effector function", FRONTIERS IN IMMUNOLOGY, vol. 5, no. 520, 2014, pages 1 - 17, XP055429512, DOI: 10.3389/fimmu.2014.00520 |
WIBMER ET AL., CURR. OPIN. HIV AIDS, vol. 10, no. 3, 2015, pages 135 - 143 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11833213B2 (en) | Compositions and methods for the treatment of viral infections | |
US20230079107A1 (en) | Compositions and methods for the treatment of human immunodeficiency virus | |
US20230398226A1 (en) | Compositions and methods for the treatment of viral infections | |
US20230082611A1 (en) | Compositions and methods for the treatment of respiratory syncytial virus | |
WO2021050612A1 (en) | Compositions and methods for the treatment of respiratory syncytial virus | |
WO2022133281A1 (en) | Compositions and methods for the treatment of human immunodeficiency virus | |
EP4551563A2 (en) | Fc conjugates including an inhibitor of cd73 and uses thereof | |
RU2816717C2 (en) | Compositions and methods for treating viral infections | |
WO2021081515A2 (en) | Compositions and methods for the treatment of human immunodeficiency virus | |
BR112022004058B1 (en) | CONJUGATE AND POPULATION OF CONJUGATES | |
HK40057792A (en) | Compositions and methods for the treatment of viral infections |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21851903 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21851903 Country of ref document: EP Kind code of ref document: A1 |