|
|
Title:
Structure-based hepatitis C virus drug design
United States Patent: 7,657,385
Issued: February 2, 2010
Inventors: Bolognesi;
Martino (Torre d'Isola, IT), Grandi; Guido (Segrate, IT)
Assignee: Novartis Vaccines
and Diagnostics, S.R.L. (Siena, IT)
Appl. No.: 10/312,490
Filed: July 3, 2001
PCT Filed: July 03, 2001
PCT No.: PCT/IB01/01450
371(c)(1),(2),(4) Date: April
22, 2003
PCT Pub. No.: WO02/02631
PCT Pub. Date: January 10,
2002
|
|
|
Patheon
|
Abstract
The extracellular loop of CD81 is a
cellular receptor for the E2 protein of hepatitis C virus. A CD81 crystal
structure has been elucidated and is provided for use in the
structure-based design of compounds which bind to CD81 and thus block the
binding of HCV. Methods such as docking and de novodrug design can be
used.
Description of the
Invention
TECHNICAL FIELD
This invention is in the field of the use of structure-based drug design
methods to identify compounds that bind to CD81, which is a cell-surface
receptor for hepatitis C virus (HCV).
BACKGROUND ART
Chronic HCV infection occurs in about 3% of the world's population. It is
a major cause of liver disease, but effective anti-HCV drugs are not yet
available.
In October 1998, the extracellular loop of CD81 was reported as a cellular
receptor for the E2 protein of HCV [1]. It is an object of the invention
to facilitate rational structure-based drug design of chemical compounds
that interact with CD81 to inhibit the binding of HCV.
DISCLOSURE OF THE INVENTION
The invention is based on the elucidation of a crystal structure for the
large extracellular loop (LEL; amino acids 113-201) of human CD81. This
facilitates structure-based design of compounds which bind to CD81 and
thus block the binding of HCV.
The invention provides a computer-based method for identifying a ligand
for CD81, comprising the steps of: (a) providing a 3D structural
representation of the large extracellular loop of CD81 in a storage medium
on a computer; and (b) using the computer to apply structure-based drug
design techniques to the structural representation.
Structure-Based Drug Design Techniques
Structure-based drug design techniques can be applied to the structural
representation of the LEL in order to identify compounds that interact
with CD81 to block HCV binding. A variety of suitable techniques [e.g.
ref. 2] are available to the skilled person.
Software packages for implementing molecular modelling techniques for use
in structure-based drug design include SYBYL [3], AMBER [4], CERIUS.sup.2
[5], INSIGHT II [5], CATALYST [5], QUANTA [5], HYPERCHEM [6], CHEMSITE [7]
etc.
This software can be used to determine binding surfaces of the LEL in
order to reveal features such as van der Waals contacts, electrostatic
interactions, and/or hydrogen bonding opportunities. These binding
surfaces may be used as follows:
Docking
Docking aligns the 3D structures of two or more molecules to predict the
conformation of a complex formed from the molecules [e.g. ref 8].
According to the present invention, molecules are docked with the LEL
structure to assess their ability to interact with CD81.
Docking can be accomplished by either geometric matching of the ligand and
its receptor or by minimising the energy of interaction. Geometric
matching algorithms are preferred because of their relative speed.
Suitable docking algorithms include, but are not limited to: DOCK.[9], the
prototypical program for structure-based drug design. AUTODOCK [10, 4],
which docks ligands in a flexible manner to receptors using grid-based
Monte Carlo simulated annealing. The flexible nature of the AUTODOCK
procedure helps to avoid bias (e.g. in orientation and conformation of the
ligand in the active site) introduced by the user researcher [11] because,
whilst the starting conformation in a rigid docking is normally biased
towards an minimum energy conformation of the ligand, the binding
conformation may be of relatively high conformational energy [12].
MOE-DOCK [13], in which a simulated annealing search algorithm is used to
flexibly dock ligands. A grid-based energy evaluation is used to score
docked conformations. FLExX [3], which docks conformationally flexible
ligands into a binding site using an incremental construction algorithm
that builds the ligand in the site. Docked conformations are scored based
on the strength of ligand-receptor interactions. GOLD [14], a genetic
algorithm for flexible ligand docking, with full ligand and partial
protein flexibility. Energy functions are partly based on conformation and
non-bonded contact information. AFFINITY [5], which uses a two step
process to dock ligands. First, initial placements of the ligand within
the receptor are made using a Monte Carlo type procedure to search both
conformational and Cartesian space. Second, a simulated annealing phase
optimises the location of each ligand placement. During this phase,
AFFINITY holds the `bulk` of the receptor (atoms not in the binding site)
rigid, while the binding site atoms and ligand atoms are movable.
C.sup.2.LigandFit [5], which uses the energy of the ligand-receptor
complex to automatically find best binding modes. Stochastic
conformational search technique are used, and the best results from the
conformational sampling are retained. A grid method is used to evaluate
non-bonded interactions between the rigid receptor and the flexible ligand
atoms.
Preferably, the docking algorithm is used, in a `high throughput` mode, in
which members of large structural libraries of potential ligands are
screened against the receptor structure [15].
Suitable structural libraries include the ACD (Available Chemical
Directory, from MDL Inc), AsInEx, Bionet, ComGenex, the Derwent World Drug
Index (WDI), the Contact Service Company database, LaboTest, ChemBridge
Express Pick, ChemStar, BioByteMasterFile, Orion, SALOR, TRIAD, ILIAD, the
National Cancer Institute database (NCI), and the Aldrich, Fluka, Sigman
and Maybridge catalogs. These are commercially available (e.g. the HTS
Chemicals collections from Oxford Molecular, or the LeadQuest.TM. files
from Tripos).
Pharmacophore hypotheses
A pharmacophore (i.e. a collection of chemical features and 3D constraints
that expresses specific characteristics responsible for activity) can be
defined for the LEL. The pharmacophore preferably includes
surface-accessible features, more preferably including hydrogen bond
donors and acceptors, charged/ionisable groups, and/or hydrophobic
patches. These may be weighted depending on their relative importance in
conferring activity [16].
Pharmacophores can be determined using software such as CATALYST
(including HypoGen or HipHop) [5], CERIUS2, or constructed by hand from a
known conformation of a lead compound. The pharmacophore can be used to
screen structural libraries, using a program such as CATALYST [5]. The
CLIX program [17] can also be used, which searches for orientations of
candidate molecules in structural databases that yield maximum spatial
coincidence with chemical groups which interact with the receptor.
de novo Compound Design
The binding surface or pharmacophore of the LEL can be used to map
favourable interaction positions for functional groups (e.g. protons,
hydroxyl groups, amine groups, hydrophobic groups and/or divalent cations)
or small molecule fragments. Compounds can then be designed de novo in
which the relevant functional groups are located in the correct spatial
relationship to interact with CD81.
Once functional groups or small molecule fragments which can interact with
specific sites in the CD81 binding surface have been identified, they can
be linked in a single compound using either bridging fragments with the
correct size and geometry or frameworks which can support the functional
groups at favourable orientations, thereby providing a compound according
to the invention. Whilst linking of functional groups in this way can be
done manually, perhaps with the help of software such as QUANTA or SYBYL,
automated or semi-automated de novo design approaches are also available:
MCDLNG [18], which fills a receptor binding site with a close-packed array
of generic atoms and uses a Monte Carlo procedure to randomly vary atom
types, positions, bonding arrangements and other properties. MCSS/HOOK
[19, 20, 5], which links multiple functional groups with molecular
templates taken from a database. LUDI [21, 5], which computes the points
of interaction that would ideally be fulfilled by a ligand, places
fragments in the binding site based on their ability to interact with the
receptor, and then connects them to produce a ligand. GROW [22], which
starts with an initial `seed` fragment (placed manually or automatically)
and grows the ligand outwards. SPROUT [23], suite which includes modules
to: identify favourable hydrogen bonding and hydrophobic regions within a
binding pocket (HIPPO module); select functional groups and position them
at target sites to form starting fragments for structure generation (EleFAnT);
generate skeletons that satisfy the steric constraints of the binding
pocket by growing spacer fragments onto the start fragments and then
connecting the resulting part skeletons (SPIDeR); substitute hetero atoms
into the skeletons to generate molecules with the electrostatic properties
that are complementary to those of the receptor site (MARABOU). The
solutions can be clustered and scored using the ALLigaTOR module. LEAPFROG
[3], which evaluates ligands by making small stepwise structural changes
and rapidly evaluating the binding energy of the new compound. Changes are
kept or discarded based on the altered binding energy, and structures
evolve to increase the interaction energy with the receptor. GROUPBUILD
[24], which uses a library of common organic templates and a complete
empirical force field description of the non-bonding interactions between
a ligand and receptor to construct ligands that have chemically reasonable
structure and have steric and electrostatic properties complimentary to
the receptor binding site. CAVEAT [25], which designs linking units to
constrain acyclic molecules. RASSE [26] The LEL Binding Site
To simplify computational complexity, algorithms for docking and ligand
design will typically focus only on the binding site of a receptor--it is
pointless to attempt to dock a ligand with a region in the receptor which
is known not to be involved. Binding site identification is included in
some algorithms (e.g. C.sup.2.LigandFit, the `Binding Site Analysis`
module of INSIGHT II, the SPHGEN routine of DOCK). Some manual guidance
may be required (e.g. AFFINITY).
Where a binding site has to be defined for the CD81-LEL, this should
include amino acid residue Phe186. It may also include one or more of
Leu154, Thr163, Ile181, Ile182, Leu185, Glu188, and Asp196. The binding
site may include the whole of helix D.
The Structural Representation
The invention involves the use of a 3D structural representation of the
LEL. This may be a representation of (a) the complete LEL, (b) a fragment
of CD81 that comprises the LEL, or (c) a fragment of the LEL which
includes the amino acids which interact with HCV E2 protein.
The structural representation is preferably based on or derived from the
atomic co-ordinates cd81lel.pdb as set out herein, which represents the
LEL dimer [see also refs. 27, 28 & 29]. Suitable structural
representations include 3D models and molecular surfaces derived from
these atomic co-ordinates.
Variants of cd81lel.pdb can also be used for the invention, such as
variants in which the r.m.s. deviation of the x, y and z co-ordinates for
all heavy (i.e. not hydrogen) atoms are all less than 2.5 .ANG. (e.g. less
than 2 .ANG., preferably less than 1 .ANG., and more preferably less than
0.5 .ANG. or less than 0.1 .ANG.) compared with cd81lel.pdb. Co-ordinate
transformations which retain the 3D spatial relationships of atoms may
also be used to give suitable variants.
Preferred fragments of the LEL whose co-ordinates can be used in the
invention include amino acid residue Phe186. The fragments may also
include one or more of Leu154, Thr163, Ile181, Ile182, Leu185, Glu188, and
Asp196.
It is preferred that the methods of the invention use only one protein
chain i.e. only of the monomers in cd81lel.pdb. Where only one monomer is
used, it is preferred to use the first monomer (residues 113-202) rather
than the second (residues 213-302).
The water molecules in cd81lel.pdb can optionally be omitted when
performing the methods of the invention.
The atomic co-ordinates given herein can also be used as the basis of
models of further protein structures. For example, a homology model could
be based on the LEL structure of the present invention. The co-ordinates
can also be used in the solution or refinement of further crystal
structures of CD81.
The Storage Medium
The storage medium in which the LEL structural representation is provided
is preferably random-access memory (RAM), but may also be read-only memory
(ROM e.g. CDROM), or a diskette. The storage medium may be local to the
computer, or may be remote (e.g. a networked storage medium, including the
internet).
The invention also provides a computer-readable medium for a computer,
characterised in that the medium contains atomic co-ordinates and/or a 3D
structural representation of the LEL of CD81. The atomic co-ordinates are
preferably cd81lel.pdb or variants thereof.
Any suitable computer can be used in the present invention.
Testing Compounds
The methods may comprise the further steps of: (c) providing a compound
identified by said structure-based drug design techniques; and (d)
contacting said compound with CD81, or a fragment thereof containing the
LEL, and assaying the interaction between them.
The assay may be of a competitive nature. For example, the assay may
include HCV E2 protein (either purified, or in the context of the HCV
virion), such that E2 and the compound compete for binding to CD81.
Compounds and Their Uses
The methods of the invention identify compounds that can interact with
CD81. These compounds may be designed de novo, may be known compounds, or
may be based on known compounds. The compounds may be useful
pharmaceuticals themselves, or may be prototypes which can be used for
further pharmaceutical refinement (i.e. lead compounds) in order to
improve binding affinity or other pharmacologically important features
(e.g. bio-availability, toxicology, metabolism, pharmacokinetics etc.).
The invention thus provides: (i) a compound identified using the methods
of the invention; (ii) a compound identified using the methods of the
invention for use as a pharmaceutical; (iii) the use of a compound
identified using the methods of the invention in the manufacture of a
medicament for treating hepatitis C infection; and (iv) a method of
treating a patient with hepatitis C infection, comprising administering an
effective amount of a compound identified using the methods of the
invention.
These compounds preferably interact with CD81 with a binding constant in
the micromolar or, more preferably, nanomolar range or stronger.
As well as being useful compounds individually, ligands identified in
silico by the structure-based design techniques can also be used to
suggest libraries of compounds for `traditional` in vitro or in vivo
screening methods. Important pharmaceutical motifs in the ligands can be
identified and mimicked in compound libraries (e.g. combinatorial
libraries) for screening for CD81-binding activity.
Crystals
The invention also provides a composition comprising an extracellular
region (e.g. the LEL) of CD81 in crystalline form. The crystal can be used
for diffraction studies e.g. X-ray or neutron diffraction.
The crystal is preferably in the monoclinic space group P2.sub.1 (a=31.5 .ANG.,
b=77.2 .ANG., c=38.5 .ANG., .beta.=107.4.degree.), with two molecules per
asymmetric unit (Vm of 2.16 .ANG..sup.3/dalton).
In some embodiments, the composition may include ligands which are co-crystallised
with the CD81 fragment; in other embodiments the composition may be
essentially pure protein.
MODES FOR CARRYING OUT THE INVENTION
CD81-LEL Production
CD81-LEL was purified from a recombinant E. coli strain as a fusion
protein with the IgG binding domain of the S. aureus protein A. After
purification of the chimeric protein, CD81-LEL was separated from the
fusion by specific protease cleavage and further purified for the
crystallization experiments. The purified protein was fully active as
judged by the recombinant E2/CD81 inhibition of binding assay [1].
CD81-LEL Crystallisation
Crystals of CD81-LEL were obtained by mixing purified CD81-LEL at 10
mg/mil with 0.1M MES buffer (pH 6.0), 0.1M sodium chloride, and 10%
PEG4000. The crystals belong to the monoclinic space group P2.sub.1
(a=31.5 .ANG., b=77.2 .ANG., c=38.5 .ANG., .beta.=107.4.degree.), with two
molecules per asymmetric unit (Vm of 2.16 .ANG..sup.3/dalton). Crystals
were flash-cooled at 100K with addition of 20% glycerol as cryoprotectant.
Whilst native crystals could be obtained easily, it was difficult to
obtain suitable heavy atom derivatives for multiple isomorphous
replacement techniques.
Three suitable derivatives were eventually produced, based on Lu, Hg and
Pt
-- see Original Patent.
Diffraction and Structure Solving
Native and derivative diffraction data were collected in house. Additional
native data (at 1.6 .ANG. resolution) were collected on beamline ID14 (ESRF,
Grenoble, France), on a MAR CCD detector. All data were processed using
DENZO and SCALEPACK [31] and merged using the CCP4 program suite [32].
Crystallographic phases were calculated with CCP4 programs and refined
using SHARP [33] and SOLOMON [34]. The resulting electron density maps
allowed about 80% of the two independent molecules to be traced. Model
building and inspection was based on the O suite [35]. The structure was
refined using CNS [36] and REFMAC [37]0.5% of the unique data were used to
monitor the free R-factor. The final values for general R-factor and free
R-factor are 18.7% and 23.8%, respectively. The refined model consists of
176 amino acids, with 194 water molecules, at 1.60 .ANG. resolution. No
residues are found in disallowed regions of the Ramachandran plot.
Residues 238-241 are disordered in a A'B'loop.
Analysis of the Crystal Structure
FIG. 1A (see Original Patent) shows the CD81-LEL tertiary structure, which
is essentially composed of five .alpha.-helices (A, B, C, D, E) spanning
residues Asn115-Asp137, Ala140-Asp155, Leu165-Asn172, Asn180-Phe186 and
Asp189-Gly200, respectively (amino acids numbered according to the full
CD81 sequence; refs 38 & 39). A short 3.sub.10 helical segment covers
residues Leu162-Ala164. The anti-parallel A- and E-helices can be seen as
the stalk of a mushroom-shaped molecule, whose head domain (about 60
residues) is built by packing of the shorter B-, C- and D-helices and
their intervening loops (loops are defined by the helices they connect
i.e. AB, . . . , DE). A DALI search [40] of 3D protein structures did not
show significant structural homology of the CD81-LEL fold to any known
protein tertiary structure.
The dimeric assembly observed in the crystallographic asymmetric unit is
shown in FIG. 1B (see Original Patent). Two CD81-LEL chains assemble
around a local two-fold axis, displaying intermolecular contacts mainly at
the A:A' helix interface (a ' symbol distinguishes features of the partner
subunit) and along the B-helix, which is in contact with the C-terminal
region of the facing protomer. The subunit interface (986 .ANG..sup.2) is
composed of apolar residues for about 70% of its contact region.
The A:A' interface segment involves primarily residues Val114, Ile119,
Val123, Gln125, Phe126 and Gln129, in both antiparallel helices, together
with solvent mediated interactions between exposed polar side chains at
the rim of the contact region. The second interface region includes
residues Asn142, Val146, Thr149, Phe150, Thr153 & Leu154 which contact
Leu197', Phe198', Ser199' & Gly200'. No water molecules are buried at the
association interface.
Bivalent molecules (such as specific antibodies) capable of recognising
human CD81 have been reported to have a higher affinity than E2 for CD81
[41]. In agreement with the crystal structure, this suggests that CD81
exists as a homodimeric species at the cell surface. In the crystal
structure, the distance between the D- and D'-helices is about 30 .ANG.,
close to the distance between antigen-binding sites in an assembled
antibody molecule.
The C-terminal His-tag does not appear to have a structural influence
because the proximity of N- and C-termini within each chain, permitted by
the antiparallel arrangement of A- and E-helices, is compatible with their
topological location between TM3 and TM4 of native CD81 [39]. Moreover,
due to the quaternary structure two-fold symmetry, the N- and C-termini of
the two protomers fall in a restricted area, but on opposite faces of the
assembled dimer (FIG. 1B (see Original Patent)). This quaternary
organization is compatible with inter- and intra-cellular aggregation of
CD81 in homo-dimeric species, through the association interface described.
The overall shape of the CD81-LEL dimer is further characterized in FIG.
1C (see Original Patent), as a surface displaying electrostatic potential.
Besides the localization of negative potential in the central region of
the dimer, a low polarity patch is present in a surface region comprising
the C- and D-helices. In the crystal packing this region is virtually
solvent inaccessible, due to extended contacts with a symmetry equivalent
dimer.
The CD81-LEL head domain is essentially composed of the last two turns of
the A-helix, the B-, C-, D-helices, the intervening segments, and the DE
loop. The domain fold is firstly stabilized by two tetraspanin-invariant
intramolecular disulfide bridges [1, 38, 39, 41], which originate from two
adjacent Cys residues (156 & 157) and are oriented approximately in
opposite directions within the head domain (FIG. 3A (see Original Patent)).
Their topological connections are Cys156--Cys190 and Cys157--Cys175: the
first involves residues located at the C-terminus of the B-helix (Cys156)
and at the N-terminus of the E-helix (Cys190); the second connects the BC
segment (Cys157) with the CD loop region (Cys175). The latter region is
fully exposed to solvent and specifically kinked towards the head domain
core by virtue of the Cys157--Cys175 bridge. Of the four Cys residues,
only Cys 175 is partly accessible to solvent (20 .ANG..sup.2).
The head domain is built around the core residue Tyr127, fully buried and
surrounded by Leu131, Va147, Phe150, His151 and by the. Asp128--His191
hydrogen-bonded salt-bridge. Tyr127 OH atom is hydrogen-bonded to His151
NE2 atom (2.7 .ANG.) and is 3.61 .ANG. from the Cys190S.gamma. atom (FIG.
3B (see Original Patent)). The imidazole ring of His151 is nestled between
the two disulfide bridges, and is at hydrogen-bonding distance from Cys190
S.gamma. (3.56 .ANG.). A hydrogen-bonding network involving Tyr127, His151
and Cys190 may play a role in regulating the redox properties of one or
both disulfide bridges. Tyr127 is strictly conserved between species in
CD81 (FIG. 2 (see Original Patent)), and in all the CD9 sequences known to
date; residue 191 is either His or Gln in amino acid sequences
representing different tetraspanin sub-families (FIG. 2).
Sequence database surveys show that the tetraspanin family is composed of
several sub-families (CD9, CD37, CD63, CD53, CD82, CD151 and others; ref.
38), for a total of about 160 amino acid sequences currently recognized.
Among these, CD81 and CD9 families display close sequence (and thus
structural) homology (23% residue identities are observed between human
CD81-LEL and human CD9-LEL). FIG. 2 shows multiple amino acid sequence
alignments of CD81-LEL from seven different species, together with human
CD9-LEL and other tetraspanins. Sequence-based dendrogram analysis
indicates that, within the sequences included in FIG. 2, human CD82-LEL is
the one most distantly related to CD81-LEL (only 9% identical residues).
Besides the clearly recognized tetraspanin four-Cys consensus motif,
inspection of the CD81 and CD9 alignment in FIG. 2 shows strict residue
conservation at thirteen residues. Among these, four are charged amino
acids (Lys116, Asp117, Lys148, and Lys201), which are totally solvent
exposed in the assembled CD81-LEL dimer. The remaining nine conserved
residues can be divided into three structural classes: (a) Tyr127, His151
and Ile194 are buried residues involved in intramolecular interactions
stabilizing the head domain and its contacts to the E-helix, as described
above. (b) Val123, Phe126, Leu154 and Phe198, which are primarily involved
in subunit interface contacts, together with residues Ile119 and Phe150,
conservatively substituted at the association interface. Conservation of
the interface hydrophobic residues can be recognized throughout the
sequence alignment of FIG. 2 (see Original Patent), suggesting that the
observed subunit interface has a general functional significance for both
homo- or hetero-dimeric tetraspanin association related to intra- or
inter-cellular recognition processes. (c) Gly158 and Pro176, both of which
display unique positions and conformations (as indicated by their
Ramachandran .PHI.,.PSI. pairs) within the head domain fold, occurring
after the Cys156-Cys157 pair and after Cys175. The selection of Gly and
Pro at sites 158 and 176, respectively, may be required by the structural
constraints imposed on the CD loop by the Cys157-Cys175 bridge, coding its
specific kink towards the B-helix.
Analysis of the available sequences indicates that tetraspanin LEL
segments display enhanced residue variability, deletions or insertions in
the protein segments 158-174 and 176-189, which are comprised between the
two disulfide bridges (see FIG. 2). The two segments span the BC loop, the
C- and the D-helices and the DE hinge i.e. mostly solvent-exposed regions
of the head domain (see FIGS. 1B & 3A (see Original Patent)). The
structural location and sequence variability of this protein region, as
opposed to generally conserved protein interface and core regions,
suggests its involvement in species- or tetraspanin-specific recognition
processes.
No binding of the HCV E2 glycoprotein to african green monkey (Chlorocebus
aethiops) CD81 has been observed, but the LELs of human and AGM CD81
differ at only five residues: 154, 163, 186, 188 and 197 [1]. Conversely,
E2 does bind to tamarin (Saguinus oedipus) CD81 with high affinity, and it
differs from human CD81 only at residues 155, 163, 169, 180 and 196. Three
of the AGM mutations have been engineered into human CD81-LEL and their
binding properties versus recombinant E2 and anti-CD81 antibodies have
been analysed in vitro [41]. Phe186 is the most critical residue affecting
E2 binding, since the F186A mutation in human CD81-LEL fully impairs
binding to E2 and to specific antibodies, the introduction of Phe186 in
AGM CD81-LEL restores E2 binding, and tamarin CD81 displays Phe186 [1, 41,
42]. The Thr163Ala mutation marginally improves human CD81 binding to E2,
whereas Asp196Glu mutation decreases binding to E2 [41].
From a structural viewpoint, the tamarin and AGM mutated residues map in
the head domain, with the exception of residues 196 and 197, which fall
next to the C-terminus (see FIG. 3C), Leu197 being a subunit interface
residue in human CD81. Residues 155, 163, 180, 186 and 188 are solvent
exposed, such that their conservative substitution should not affect the
domain conformation. On the other hand, the substitution of residues 154
and 169 is subject to polarity and residue size restrictions, since they
occur at buried locations next to both disulfide bridges. In human CD81
Thr163 is located in the short 3.sub.10 segment preceding the C-helix, and
Phe186 is the last residue in the D-helix. The two antiparallel helices
build up a sort of narrow "canyon" sub-structure, whose floor is
essentially defined by the Cys157--Cys175 disulfide connecting the CD loop
to the domain core. Residues 163 and 186 are located at the canyon end
opposing the CD loop (FIGS. 3A & 3C).
The mutant AGM and tamarin residues which affect E2 binding map to the C-,
D-, E-helices, and the intervening segments i.e. residues mostly comprised
within the two conserved disulfide bridges and including the 179-193
sequence stretch recognised as the minimal epitope for E2 binding [41].
Conformational integrity of this region is likely related to the oxidised
state of the disulfide bridges, particularly the Cys157-Cys175 bridge;
reduction of the disulfides impairs E2 and antibody recognition
experiments [41, 43]. Sequence alignments show that the identified region
displays highest residue variability within the CD81-LEL subfamily (FIG.
2), but also within other members of the tetraspanin family, which may
bear specific residue deletions and insertions affecting the tertiary
structure according to their molecular recognition requirements.
The conservation in the CD81 sequences of the hydrophobic residues Ile181,
Ile182, Leu185, and Phe186 belonging to the D-helix (FIG. 2) is peculiar,
since their solvent-exposed location (FIGS. 1C & 3A (see Original Patent))
should select against their conservation through species. Such an uncommon
structural property, the extended intermolecular crystal contacts observed
in this region, and the dramatic loss of E2 affinity related to Phe86
mutation, suggest that the D-helix region is the docking site for the
viral glycoprotein E2.
Claim 1 of 11 Claims
1. A method for identifying a ligand for
the large extracellular loop of CD81, comprising the steps of (a)
providing a 3D structural representation of the large extracellular loop
of CD81, wherein the 3D structural representation comprises a D helix
having a residue Phe186, and wherein the 3D structural representation of
the large extracellular loop of CD81 is set forth in the atomic
co-ordinates of Table 1 (see Original Patent), or variants of the atomic
co-ordinates of Table 1 thereof, in which the r.m.s. deviation of the x, y
and z co-ordinates for all heavy atoms is less than 2.5 .ANG.; (b)
providing a 3D structural representation of a potential ligand; (c) using
a computer to dock the 3D structural representation of the potential
ligand with the 3D structural representation of the large extracellular
loop of CD81 at a site comprising the residue Phe186, wherein a potential
ligand that docks with the large extracellular loop of CD81 at the site
comprising the residue Phe186 is identified as a ligand for the large
extracellular loop of CD81; (d) contacting the ligand identified in step
(c) with CD81 or a fragment thereof containing the large extracellular
loop, and (e) assaying the interaction between the ligand and CD81 or the
fragment thereof containing the large extracellular loop to determine
whether the ligand identified in step (c) is a ligand for the large
extracellular loop of CD81. ____________________________________________
If you want to learn more
about this patent, please go directly to the U.S.
Patent and Trademark Office Web site to access the full
patent.
|