GPR15L   

GtoPdb Ligand ID: 10567

Synonyms: antimicrobial peptide-57 | colon-derived SUSD2 binding factor
Comment: This peptide is a product of the human C10orf99 gene. It is proposed as an agonistic ligand of the orphan GPCR, GPR15, and has been referred to as GPR15L [1]. Structurally, GPR15L is predicted to contain two intramolecular disulphide bonds, which implies a similarity to CC family cytokines.
Species: Human
2D Structure
Click here for structure editor
SMILES / InChI / InChIKey
Canonical SMILES NCCCCC(C(=O)NC(C(=O)NC(C(=O)N1CCCC1C(=O)NC(C(=O)N1CCCC1C(=O)NC(C(=O)NC(C(=O)NC(C(=O)NC(C(=O)NC(C(=O)N1CCCC1C(=O)NCC(=O)NC(C(=O)NC(C(=O)N1CCCC1C(=O)NC(C(=O)NC(C(=O)O)C(C)C)CCC(=O)N)CC(C)C)C)C(C)C)C(C)C)Cc1c[nH]c2c1cccc2)CC(C)C)CCCN=C(N)N)CCC(=O)O)CCC(=O)O)CC(C)C)NC(=O)C1CSSCC(NC(=O)C(NC(=O)C(NC(=O)C(C(O)C)NC(=O)C(NC(=O)C(NC(=O)CNC(=O)C(NC(=O)C(Cc2c[nH]c3c2cccc3)NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C2CCCN2C(=O)C(NC(=O)C(NC(=O)C(CCCCN)N)CCCN=C(N)N)CCCN=C(N)N)C)CCCCN)C)CO)CCCN=C(N)N)CCCN=C(N)N)CCCN=C(N)N)CC(C)C)C(=O)NC2CSSCC(C(=O)NC(C(=O)N3C(C(=O)N1)CCC3)CCCCN)NC(=O)C(CC(C)C)NC(=O)C(CCCN=C(N)N)NC(=O)C(NC(=O)C(Cc1c[nH]cn1)NC(=O)C(NC(=O)CNC(=O)C(CCCCN)NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C(NC(=O)C1N(C(=O)C(NC(=O)C3N(C(=O)C(NC(=O)C(NC(=O)C(NC2=O)Cc2c[nH]cn2)CCCN=C(N)N)C(C)C)CCC3)CO)CCC1)CC(=O)N)CO)C(O)C)CC(=O)N)Cc1c[nH]cn1)C(C)C
Isomeric SMILES NCCCC[C@@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@H](C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@H](C(=O)O)C(C)C)CCC(=O)N)CC(C)C)C)C(C)C)C(C)C)Cc1c[nH]c2c1cccc2)CC(C)C)CCCN=C(N)N)CCC(=O)O)CCC(=O)O)CC(C)C)NC(=O)[C@@H]1CSSC[C@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](Cc2c[nH]c3c2cccc3)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H]2CCCN2C(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCCN)N)CCCN=C(N)N)CCCN=C(N)N)C)CCCCN)C)CO)CCCN=C(N)N)CCCN=C(N)N)CCCN=C(N)N)CC(C)C)C(=O)N[C@H]2CSSC[C@@H](C(=O)N[C@H](C(=O)N3[C@H](C(=O)N1)CCC3)CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H]1N(C(=O)[C@@H](NC(=O)[C@H]3N(C(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC2=O)Cc2c[nH]cn2)CCCN=C(N)N)C(C)C)CCC3)CO)CCC1)CC(=O)N)CO)[C@H](O)C)CC(=O)N)Cc1c[nH]cn1)C(C)C
InChI Key MKYBCIBFBPPBKT-QSWODBGPSA-N
Peptide Sequence
MRLLVLSSLLCILLLCFSIFSTEGKRRPAKAWSGRRTRLCCHRVPSPNSTNLKGHHVRLCKPCKLEPEPRLWVVPGALPQ
V
H-Lys-Arg-Arg-Pro-Ala-Lys-Ala-Trp-Ser-Gly-Arg-Arg-Thr-Arg-Leu-Cys(1)-Cys(2)-His-Arg-Val-Pro-Ser-Pro-Asn-Ser-Thr-Asn-Leu-Lys-Gly-His-His-Val-Arg-Leu-Cys(2)-Lys-Pro-Cys(1)-Lys-Leu-Glu-Pro-Glu-Pro-Arg-Leu-Trp-Val-Val-Pro-Gly-Ala-Leu-Pro-Gln-Val-OH
Post-translational Modification
Amino acids 1-24 (MRLLVLSSLLCILLLCFSIFSTEG) represent a signal peptide and have been removed to show the mature peptide sequence in the 3 letter code.
Download 2D Structure
Canonical SMILES Download
Isomeric SMILES Download

Molecular structure representations generated using Open Babel