Internet for Pharmaceutical and Biotech Communities
| Newsletter | Advertising |
 
 
 

  

Pharm/Biotech
Resources

Outsourcing Guide

Cont. Education

Software/Reports

Training Courses

Web Seminars

Jobs

Buyer's Guide

Home Page

Pharm Patents /
Licensing

Pharm News

Federal Register

Pharm Stocks

FDA Links

FDA Warning Letters

FDA Doc/cGMP

Pharm/Biotech Events

Consultants

Advertiser Info

Newsletter Subscription

Web Links

Suggestions

Site Map
 

 
   

 

  Pharmaceutical Patents  

 

Title:  Structure-based hepatitis C virus drug design
United States Patent: 
7,657,385
Issued: 
February 2, 2010

Inventors:
 Bolognesi; Martino (Torre d'Isola, IT), Grandi; Guido (Segrate, IT)
Assignee:
  Novartis Vaccines and Diagnostics, S.R.L. (Siena, IT)
Appl. No.: 
10/312,490
Filed:
 July 3, 2001
PCT Filed:
 July 03, 2001
PCT No.:
 PCT/IB01/01450
371(c)(1),(2),(4) Date:
 April 22, 2003
PCT Pub. No.:
 WO02/02631
PCT Pub. Date:
 January 10, 2002


 

Patheon


Abstract

The extracellular loop of CD81 is a cellular receptor for the E2 protein of hepatitis C virus. A CD81 crystal structure has been elucidated and is provided for use in the structure-based design of compounds which bind to CD81 and thus block the binding of HCV. Methods such as docking and de novodrug design can be used.

Description of the Invention

TECHNICAL FIELD

This invention is in the field of the use of structure-based drug design methods to identify compounds that bind to CD81, which is a cell-surface receptor for hepatitis C virus (HCV).

BACKGROUND ART

Chronic HCV infection occurs in about 3% of the world's population. It is a major cause of liver disease, but effective anti-HCV drugs are not yet available.

In October 1998, the extracellular loop of CD81 was reported as a cellular receptor for the E2 protein of HCV [1]. It is an object of the invention to facilitate rational structure-based drug design of chemical compounds that interact with CD81 to inhibit the binding of HCV.

DISCLOSURE OF THE INVENTION

The invention is based on the elucidation of a crystal structure for the large extracellular loop (LEL; amino acids 113-201) of human CD81. This facilitates structure-based design of compounds which bind to CD81 and thus block the binding of HCV.

The invention provides a computer-based method for identifying a ligand for CD81, comprising the steps of: (a) providing a 3D structural representation of the large extracellular loop of CD81 in a storage medium on a computer; and (b) using the computer to apply structure-based drug design techniques to the structural representation.

Structure-Based Drug Design Techniques

Structure-based drug design techniques can be applied to the structural representation of the LEL in order to identify compounds that interact with CD81 to block HCV binding. A variety of suitable techniques [e.g. ref. 2] are available to the skilled person.

Software packages for implementing molecular modelling techniques for use in structure-based drug design include SYBYL [3], AMBER [4], CERIUS.sup.2 [5], INSIGHT II [5], CATALYST [5], QUANTA [5], HYPERCHEM [6], CHEMSITE [7] etc.

This software can be used to determine binding surfaces of the LEL in order to reveal features such as van der Waals contacts, electrostatic interactions, and/or hydrogen bonding opportunities. These binding surfaces may be used as follows:

Docking

Docking aligns the 3D structures of two or more molecules to predict the conformation of a complex formed from the molecules [e.g. ref 8]. According to the present invention, molecules are docked with the LEL structure to assess their ability to interact with CD81.

Docking can be accomplished by either geometric matching of the ligand and its receptor or by minimising the energy of interaction. Geometric matching algorithms are preferred because of their relative speed.

Suitable docking algorithms include, but are not limited to: DOCK.[9], the prototypical program for structure-based drug design. AUTODOCK [10, 4], which docks ligands in a flexible manner to receptors using grid-based Monte Carlo simulated annealing. The flexible nature of the AUTODOCK procedure helps to avoid bias (e.g. in orientation and conformation of the ligand in the active site) introduced by the user researcher [11] because, whilst the starting conformation in a rigid docking is normally biased towards an minimum energy conformation of the ligand, the binding conformation may be of relatively high conformational energy [12]. MOE-DOCK [13], in which a simulated annealing search algorithm is used to flexibly dock ligands. A grid-based energy evaluation is used to score docked conformations. FLExX [3], which docks conformationally flexible ligands into a binding site using an incremental construction algorithm that builds the ligand in the site. Docked conformations are scored based on the strength of ligand-receptor interactions. GOLD [14], a genetic algorithm for flexible ligand docking, with full ligand and partial protein flexibility. Energy functions are partly based on conformation and non-bonded contact information. AFFINITY [5], which uses a two step process to dock ligands. First, initial placements of the ligand within the receptor are made using a Monte Carlo type procedure to search both conformational and Cartesian space. Second, a simulated annealing phase optimises the location of each ligand placement. During this phase, AFFINITY holds the `bulk` of the receptor (atoms not in the binding site) rigid, while the binding site atoms and ligand atoms are movable. C.sup.2.LigandFit [5], which uses the energy of the ligand-receptor complex to automatically find best binding modes. Stochastic conformational search technique are used, and the best results from the conformational sampling are retained. A grid method is used to evaluate non-bonded interactions between the rigid receptor and the flexible ligand atoms.

Preferably, the docking algorithm is used, in a `high throughput` mode, in which members of large structural libraries of potential ligands are screened against the receptor structure [15].

Suitable structural libraries include the ACD (Available Chemical Directory, from MDL Inc), AsInEx, Bionet, ComGenex, the Derwent World Drug Index (WDI), the Contact Service Company database, LaboTest, ChemBridge Express Pick, ChemStar, BioByteMasterFile, Orion, SALOR, TRIAD, ILIAD, the National Cancer Institute database (NCI), and the Aldrich, Fluka, Sigman and Maybridge catalogs. These are commercially available (e.g. the HTS Chemicals collections from Oxford Molecular, or the LeadQuest.TM. files from Tripos).

Pharmacophore hypotheses

A pharmacophore (i.e. a collection of chemical features and 3D constraints that expresses specific characteristics responsible for activity) can be defined for the LEL. The pharmacophore preferably includes surface-accessible features, more preferably including hydrogen bond donors and acceptors, charged/ionisable groups, and/or hydrophobic patches. These may be weighted depending on their relative importance in conferring activity [16].

Pharmacophores can be determined using software such as CATALYST (including HypoGen or HipHop) [5], CERIUS2, or constructed by hand from a known conformation of a lead compound. The pharmacophore can be used to screen structural libraries, using a program such as CATALYST [5]. The CLIX program [17] can also be used, which searches for orientations of candidate molecules in structural databases that yield maximum spatial coincidence with chemical groups which interact with the receptor.

de novo Compound Design

The binding surface or pharmacophore of the LEL can be used to map favourable interaction positions for functional groups (e.g. protons, hydroxyl groups, amine groups, hydrophobic groups and/or divalent cations) or small molecule fragments. Compounds can then be designed de novo in which the relevant functional groups are located in the correct spatial relationship to interact with CD81.

Once functional groups or small molecule fragments which can interact with specific sites in the CD81 binding surface have been identified, they can be linked in a single compound using either bridging fragments with the correct size and geometry or frameworks which can support the functional groups at favourable orientations, thereby providing a compound according to the invention. Whilst linking of functional groups in this way can be done manually, perhaps with the help of software such as QUANTA or SYBYL, automated or semi-automated de novo design approaches are also available: MCDLNG [18], which fills a receptor binding site with a close-packed array of generic atoms and uses a Monte Carlo procedure to randomly vary atom types, positions, bonding arrangements and other properties. MCSS/HOOK [19, 20, 5], which links multiple functional groups with molecular templates taken from a database. LUDI [21, 5], which computes the points of interaction that would ideally be fulfilled by a ligand, places fragments in the binding site based on their ability to interact with the receptor, and then connects them to produce a ligand. GROW [22], which starts with an initial `seed` fragment (placed manually or automatically) and grows the ligand outwards. SPROUT [23], suite which includes modules to: identify favourable hydrogen bonding and hydrophobic regions within a binding pocket (HIPPO module); select functional groups and position them at target sites to form starting fragments for structure generation (EleFAnT); generate skeletons that satisfy the steric constraints of the binding pocket by growing spacer fragments onto the start fragments and then connecting the resulting part skeletons (SPIDeR); substitute hetero atoms into the skeletons to generate molecules with the electrostatic properties that are complementary to those of the receptor site (MARABOU). The solutions can be clustered and scored using the ALLigaTOR module. LEAPFROG [3], which evaluates ligands by making small stepwise structural changes and rapidly evaluating the binding energy of the new compound. Changes are kept or discarded based on the altered binding energy, and structures evolve to increase the interaction energy with the receptor. GROUPBUILD [24], which uses a library of common organic templates and a complete empirical force field description of the non-bonding interactions between a ligand and receptor to construct ligands that have chemically reasonable structure and have steric and electrostatic properties complimentary to the receptor binding site. CAVEAT [25], which designs linking units to constrain acyclic molecules. RASSE [26] The LEL Binding Site

To simplify computational complexity, algorithms for docking and ligand design will typically focus only on the binding site of a receptor--it is pointless to attempt to dock a ligand with a region in the receptor which is known not to be involved. Binding site identification is included in some algorithms (e.g. C.sup.2.LigandFit, the `Binding Site Analysis` module of INSIGHT II, the SPHGEN routine of DOCK). Some manual guidance may be required (e.g. AFFINITY).

Where a binding site has to be defined for the CD81-LEL, this should include amino acid residue Phe186. It may also include one or more of Leu154, Thr163, Ile181, Ile182, Leu185, Glu188, and Asp196. The binding site may include the whole of helix D.

The Structural Representation

The invention involves the use of a 3D structural representation of the LEL. This may be a representation of (a) the complete LEL, (b) a fragment of CD81 that comprises the LEL, or (c) a fragment of the LEL which includes the amino acids which interact with HCV E2 protein.

The structural representation is preferably based on or derived from the atomic co-ordinates cd81lel.pdb as set out herein, which represents the LEL dimer [see also refs. 27, 28 & 29]. Suitable structural representations include 3D models and molecular surfaces derived from these atomic co-ordinates.

Variants of cd81lel.pdb can also be used for the invention, such as variants in which the r.m.s. deviation of the x, y and z co-ordinates for all heavy (i.e. not hydrogen) atoms are all less than 2.5 .ANG. (e.g. less than 2 .ANG., preferably less than 1 .ANG., and more preferably less than 0.5 .ANG. or less than 0.1 .ANG.) compared with cd81lel.pdb. Co-ordinate transformations which retain the 3D spatial relationships of atoms may also be used to give suitable variants.

Preferred fragments of the LEL whose co-ordinates can be used in the invention include amino acid residue Phe186. The fragments may also include one or more of Leu154, Thr163, Ile181, Ile182, Leu185, Glu188, and Asp196.

It is preferred that the methods of the invention use only one protein chain i.e. only of the monomers in cd81lel.pdb. Where only one monomer is used, it is preferred to use the first monomer (residues 113-202) rather than the second (residues 213-302).

The water molecules in cd81lel.pdb can optionally be omitted when performing the methods of the invention.

The atomic co-ordinates given herein can also be used as the basis of models of further protein structures. For example, a homology model could be based on the LEL structure of the present invention. The co-ordinates can also be used in the solution or refinement of further crystal structures of CD81.

The Storage Medium

The storage medium in which the LEL structural representation is provided is preferably random-access memory (RAM), but may also be read-only memory (ROM e.g. CDROM), or a diskette. The storage medium may be local to the computer, or may be remote (e.g. a networked storage medium, including the internet).

The invention also provides a computer-readable medium for a computer, characterised in that the medium contains atomic co-ordinates and/or a 3D structural representation of the LEL of CD81. The atomic co-ordinates are preferably cd81lel.pdb or variants thereof.

Any suitable computer can be used in the present invention.

Testing Compounds

The methods may comprise the further steps of: (c) providing a compound identified by said structure-based drug design techniques; and (d) contacting said compound with CD81, or a fragment thereof containing the LEL, and assaying the interaction between them.

The assay may be of a competitive nature. For example, the assay may include HCV E2 protein (either purified, or in the context of the HCV virion), such that E2 and the compound compete for binding to CD81.

Compounds and Their Uses

The methods of the invention identify compounds that can interact with CD81. These compounds may be designed de novo, may be known compounds, or may be based on known compounds. The compounds may be useful pharmaceuticals themselves, or may be prototypes which can be used for further pharmaceutical refinement (i.e. lead compounds) in order to improve binding affinity or other pharmacologically important features (e.g. bio-availability, toxicology, metabolism, pharmacokinetics etc.).

The invention thus provides: (i) a compound identified using the methods of the invention; (ii) a compound identified using the methods of the invention for use as a pharmaceutical; (iii) the use of a compound identified using the methods of the invention in the manufacture of a medicament for treating hepatitis C infection; and (iv) a method of treating a patient with hepatitis C infection, comprising administering an effective amount of a compound identified using the methods of the invention.

These compounds preferably interact with CD81 with a binding constant in the micromolar or, more preferably, nanomolar range or stronger.

As well as being useful compounds individually, ligands identified in silico by the structure-based design techniques can also be used to suggest libraries of compounds for `traditional` in vitro or in vivo screening methods. Important pharmaceutical motifs in the ligands can be identified and mimicked in compound libraries (e.g. combinatorial libraries) for screening for CD81-binding activity.

Crystals

The invention also provides a composition comprising an extracellular region (e.g. the LEL) of CD81 in crystalline form. The crystal can be used for diffraction studies e.g. X-ray or neutron diffraction.

The crystal is preferably in the monoclinic space group P2.sub.1 (a=31.5 .ANG., b=77.2 .ANG., c=38.5 .ANG., .beta.=107.4.degree.), with two molecules per asymmetric unit (Vm of 2.16 .ANG..sup.3/dalton).

In some embodiments, the composition may include ligands which are co-crystallised with the CD81 fragment; in other embodiments the composition may be essentially pure protein.


MODES FOR CARRYING OUT THE INVENTION

CD81-LEL Production

CD81-LEL was purified from a recombinant E. coli strain as a fusion protein with the IgG binding domain of the S. aureus protein A. After purification of the chimeric protein, CD81-LEL was separated from the fusion by specific protease cleavage and further purified for the crystallization experiments. The purified protein was fully active as judged by the recombinant E2/CD81 inhibition of binding assay [1].

CD81-LEL Crystallisation

Crystals of CD81-LEL were obtained by mixing purified CD81-LEL at 10 mg/mil with 0.1M MES buffer (pH 6.0), 0.1M sodium chloride, and 10% PEG4000. The crystals belong to the monoclinic space group P2.sub.1 (a=31.5 .ANG., b=77.2 .ANG., c=38.5 .ANG., .beta.=107.4.degree.), with two molecules per asymmetric unit (Vm of 2.16 .ANG..sup.3/dalton). Crystals were flash-cooled at 100K with addition of 20% glycerol as cryoprotectant.

Whilst native crystals could be obtained easily, it was difficult to obtain suitable heavy atom derivatives for multiple isomorphous replacement techniques.

Three suitable derivatives were eventually produced, based on Lu, Hg and Pt -- see Original Patent.

Diffraction and Structure Solving

Native and derivative diffraction data were collected in house. Additional native data (at 1.6 .ANG. resolution) were collected on beamline ID14 (ESRF, Grenoble, France), on a MAR CCD detector. All data were processed using DENZO and SCALEPACK [31] and merged using the CCP4 program suite [32]. Crystallographic phases were calculated with CCP4 programs and refined using SHARP [33] and SOLOMON [34]. The resulting electron density maps allowed about 80% of the two independent molecules to be traced. Model building and inspection was based on the O suite [35]. The structure was refined using CNS [36] and REFMAC [37]0.5% of the unique data were used to monitor the free R-factor. The final values for general R-factor and free R-factor are 18.7% and 23.8%, respectively. The refined model consists of 176 amino acids, with 194 water molecules, at 1.60 .ANG. resolution. No residues are found in disallowed regions of the Ramachandran plot. Residues 238-241 are disordered in a A'B'loop.

Analysis of the Crystal Structure

FIG. 1A (see Original Patent) shows the CD81-LEL tertiary structure, which is essentially composed of five .alpha.-helices (A, B, C, D, E) spanning residues Asn115-Asp137, Ala140-Asp155, Leu165-Asn172, Asn180-Phe186 and Asp189-Gly200, respectively (amino acids numbered according to the full CD81 sequence; refs 38 & 39). A short 3.sub.10 helical segment covers residues Leu162-Ala164. The anti-parallel A- and E-helices can be seen as the stalk of a mushroom-shaped molecule, whose head domain (about 60 residues) is built by packing of the shorter B-, C- and D-helices and their intervening loops (loops are defined by the helices they connect i.e. AB, . . . , DE). A DALI search [40] of 3D protein structures did not show significant structural homology of the CD81-LEL fold to any known protein tertiary structure.

The dimeric assembly observed in the crystallographic asymmetric unit is shown in FIG. 1B (see Original Patent). Two CD81-LEL chains assemble around a local two-fold axis, displaying intermolecular contacts mainly at the A:A' helix interface (a ' symbol distinguishes features of the partner subunit) and along the B-helix, which is in contact with the C-terminal region of the facing protomer. The subunit interface (986 .ANG..sup.2) is composed of apolar residues for about 70% of its contact region.

The A:A' interface segment involves primarily residues Val114, Ile119, Val123, Gln125, Phe126 and Gln129, in both antiparallel helices, together with solvent mediated interactions between exposed polar side chains at the rim of the contact region. The second interface region includes residues Asn142, Val146, Thr149, Phe150, Thr153 & Leu154 which contact Leu197', Phe198', Ser199' & Gly200'. No water molecules are buried at the association interface.

Bivalent molecules (such as specific antibodies) capable of recognising human CD81 have been reported to have a higher affinity than E2 for CD81 [41]. In agreement with the crystal structure, this suggests that CD81 exists as a homodimeric species at the cell surface. In the crystal structure, the distance between the D- and D'-helices is about 30 .ANG., close to the distance between antigen-binding sites in an assembled antibody molecule.

The C-terminal His-tag does not appear to have a structural influence because the proximity of N- and C-termini within each chain, permitted by the antiparallel arrangement of A- and E-helices, is compatible with their topological location between TM3 and TM4 of native CD81 [39]. Moreover, due to the quaternary structure two-fold symmetry, the N- and C-termini of the two protomers fall in a restricted area, but on opposite faces of the assembled dimer (FIG. 1B (see Original Patent)). This quaternary organization is compatible with inter- and intra-cellular aggregation of CD81 in homo-dimeric species, through the association interface described.

The overall shape of the CD81-LEL dimer is further characterized in FIG. 1C (see Original Patent), as a surface displaying electrostatic potential. Besides the localization of negative potential in the central region of the dimer, a low polarity patch is present in a surface region comprising the C- and D-helices. In the crystal packing this region is virtually solvent inaccessible, due to extended contacts with a symmetry equivalent dimer.

The CD81-LEL head domain is essentially composed of the last two turns of the A-helix, the B-, C-, D-helices, the intervening segments, and the DE loop. The domain fold is firstly stabilized by two tetraspanin-invariant intramolecular disulfide bridges [1, 38, 39, 41], which originate from two adjacent Cys residues (156 & 157) and are oriented approximately in opposite directions within the head domain (FIG. 3A (see Original Patent)). Their topological connections are Cys156--Cys190 and Cys157--Cys175: the first involves residues located at the C-terminus of the B-helix (Cys156) and at the N-terminus of the E-helix (Cys190); the second connects the BC segment (Cys157) with the CD loop region (Cys175). The latter region is fully exposed to solvent and specifically kinked towards the head domain core by virtue of the Cys157--Cys175 bridge. Of the four Cys residues, only Cys 175 is partly accessible to solvent (20 .ANG..sup.2).

The head domain is built around the core residue Tyr127, fully buried and surrounded by Leu131, Va147, Phe150, His151 and by the. Asp128--His191 hydrogen-bonded salt-bridge. Tyr127 OH atom is hydrogen-bonded to His151 NE2 atom (2.7 .ANG.) and is 3.61 .ANG. from the Cys190S.gamma. atom (FIG. 3B (see Original Patent)). The imidazole ring of His151 is nestled between the two disulfide bridges, and is at hydrogen-bonding distance from Cys190 S.gamma. (3.56 .ANG.). A hydrogen-bonding network involving Tyr127, His151 and Cys190 may play a role in regulating the redox properties of one or both disulfide bridges. Tyr127 is strictly conserved between species in CD81 (FIG. 2 (see Original Patent)), and in all the CD9 sequences known to date; residue 191 is either His or Gln in amino acid sequences representing different tetraspanin sub-families (FIG. 2).

Sequence database surveys show that the tetraspanin family is composed of several sub-families (CD9, CD37, CD63, CD53, CD82, CD151 and others; ref. 38), for a total of about 160 amino acid sequences currently recognized. Among these, CD81 and CD9 families display close sequence (and thus structural) homology (23% residue identities are observed between human CD81-LEL and human CD9-LEL). FIG. 2 shows multiple amino acid sequence alignments of CD81-LEL from seven different species, together with human CD9-LEL and other tetraspanins. Sequence-based dendrogram analysis indicates that, within the sequences included in FIG. 2, human CD82-LEL is the one most distantly related to CD81-LEL (only 9% identical residues).

Besides the clearly recognized tetraspanin four-Cys consensus motif, inspection of the CD81 and CD9 alignment in FIG. 2 shows strict residue conservation at thirteen residues. Among these, four are charged amino acids (Lys116, Asp117, Lys148, and Lys201), which are totally solvent exposed in the assembled CD81-LEL dimer. The remaining nine conserved residues can be divided into three structural classes: (a) Tyr127, His151 and Ile194 are buried residues involved in intramolecular interactions stabilizing the head domain and its contacts to the E-helix, as described above. (b) Val123, Phe126, Leu154 and Phe198, which are primarily involved in subunit interface contacts, together with residues Ile119 and Phe150, conservatively substituted at the association interface. Conservation of the interface hydrophobic residues can be recognized throughout the sequence alignment of FIG. 2 (see Original Patent), suggesting that the observed subunit interface has a general functional significance for both homo- or hetero-dimeric tetraspanin association related to intra- or inter-cellular recognition processes. (c) Gly158 and Pro176, both of which display unique positions and conformations (as indicated by their Ramachandran .PHI.,.PSI. pairs) within the head domain fold, occurring after the Cys156-Cys157 pair and after Cys175. The selection of Gly and Pro at sites 158 and 176, respectively, may be required by the structural constraints imposed on the CD loop by the Cys157-Cys175 bridge, coding its specific kink towards the B-helix.

Analysis of the available sequences indicates that tetraspanin LEL segments display enhanced residue variability, deletions or insertions in the protein segments 158-174 and 176-189, which are comprised between the two disulfide bridges (see FIG. 2). The two segments span the BC loop, the C- and the D-helices and the DE hinge i.e. mostly solvent-exposed regions of the head domain (see FIGS. 1B & 3A (see Original Patent)). The structural location and sequence variability of this protein region, as opposed to generally conserved protein interface and core regions, suggests its involvement in species- or tetraspanin-specific recognition processes.

No binding of the HCV E2 glycoprotein to african green monkey (Chlorocebus aethiops) CD81 has been observed, but the LELs of human and AGM CD81 differ at only five residues: 154, 163, 186, 188 and 197 [1]. Conversely, E2 does bind to tamarin (Saguinus oedipus) CD81 with high affinity, and it differs from human CD81 only at residues 155, 163, 169, 180 and 196. Three of the AGM mutations have been engineered into human CD81-LEL and their binding properties versus recombinant E2 and anti-CD81 antibodies have been analysed in vitro [41]. Phe186 is the most critical residue affecting E2 binding, since the F186A mutation in human CD81-LEL fully impairs binding to E2 and to specific antibodies, the introduction of Phe186 in AGM CD81-LEL restores E2 binding, and tamarin CD81 displays Phe186 [1, 41, 42]. The Thr163Ala mutation marginally improves human CD81 binding to E2, whereas Asp196Glu mutation decreases binding to E2 [41].

From a structural viewpoint, the tamarin and AGM mutated residues map in the head domain, with the exception of residues 196 and 197, which fall next to the C-terminus (see FIG. 3C), Leu197 being a subunit interface residue in human CD81. Residues 155, 163, 180, 186 and 188 are solvent exposed, such that their conservative substitution should not affect the domain conformation. On the other hand, the substitution of residues 154 and 169 is subject to polarity and residue size restrictions, since they occur at buried locations next to both disulfide bridges. In human CD81 Thr163 is located in the short 3.sub.10 segment preceding the C-helix, and Phe186 is the last residue in the D-helix. The two antiparallel helices build up a sort of narrow "canyon" sub-structure, whose floor is essentially defined by the Cys157--Cys175 disulfide connecting the CD loop to the domain core. Residues 163 and 186 are located at the canyon end opposing the CD loop (FIGS. 3A & 3C).

The mutant AGM and tamarin residues which affect E2 binding map to the C-, D-, E-helices, and the intervening segments i.e. residues mostly comprised within the two conserved disulfide bridges and including the 179-193 sequence stretch recognised as the minimal epitope for E2 binding [41]. Conformational integrity of this region is likely related to the oxidised state of the disulfide bridges, particularly the Cys157-Cys175 bridge; reduction of the disulfides impairs E2 and antibody recognition experiments [41, 43]. Sequence alignments show that the identified region displays highest residue variability within the CD81-LEL subfamily (FIG. 2), but also within other members of the tetraspanin family, which may bear specific residue deletions and insertions affecting the tertiary structure according to their molecular recognition requirements.

The conservation in the CD81 sequences of the hydrophobic residues Ile181, Ile182, Leu185, and Phe186 belonging to the D-helix (FIG. 2) is peculiar, since their solvent-exposed location (FIGS. 1C & 3A (see Original Patent)) should select against their conservation through species. Such an uncommon structural property, the extended intermolecular crystal contacts observed in this region, and the dramatic loss of E2 affinity related to Phe86 mutation, suggest that the D-helix region is the docking site for the viral glycoprotein E2.
 

Claim 1 of 11 Claims

1. A method for identifying a ligand for the large extracellular loop of CD81, comprising the steps of (a) providing a 3D structural representation of the large extracellular loop of CD81, wherein the 3D structural representation comprises a D helix having a residue Phe186, and wherein the 3D structural representation of the large extracellular loop of CD81 is set forth in the atomic co-ordinates of Table 1 (see Original Patent), or variants of the atomic co-ordinates of Table 1 thereof, in which the r.m.s. deviation of the x, y and z co-ordinates for all heavy atoms is less than 2.5 .ANG.; (b) providing a 3D structural representation of a potential ligand; (c) using a computer to dock the 3D structural representation of the potential ligand with the 3D structural representation of the large extracellular loop of CD81 at a site comprising the residue Phe186, wherein a potential ligand that docks with the large extracellular loop of CD81 at the site comprising the residue Phe186 is identified as a ligand for the large extracellular loop of CD81; (d) contacting the ligand identified in step (c) with CD81 or a fragment thereof containing the large extracellular loop, and (e) assaying the interaction between the ligand and CD81 or the fragment thereof containing the large extracellular loop to determine whether the ligand identified in step (c) is a ligand for the large extracellular loop of CD81.

____________________________________________
If you want to learn more about this patent, please go directly to the U.S. Patent and Trademark Office Web site to access the full patent.
 

 

     
[ Outsourcing Guide ] [ Cont. Education ] [ Software/Reports ] [ Training Courses ]
[ Web Seminars ] [ Jobs ] [ Consultants ] [ Buyer's Guide ] [ Advertiser Info ]

[ Home ] [ Pharm Patents / Licensing ] [ Pharm News ] [ Federal Register ]
[ Pharm Stocks ] [ FDA Links ] [ FDA Warning Letters ] [ FDA Doc/cGMP ]
[ Pharm/Biotech Events ] [ Newsletter Subscription ] [ Web Links ] [ Suggestions ]
[ Site Map ]