WO2003019140A2 - Method for molecular subshape similarity matching - Google Patents
- Publication number
- WO2003019140A2 (PCT/US2002/026844)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- query
- subshape
- molecule
- target
- shape
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6803—General methods of protein analysis not limited to specific proteins or families of proteins
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B15/00—ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16C—COMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
- G16C20/00—Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
- G16C20/40—Searching chemical structures or physicochemical data
Definitions
- the field of the instant invention relates to molecular modeling, and in particular to methods for matching three-dimensional molecular structures based upon similarity in molecular subshapes.
- FIG. 1A shows a simplified schematic view of the interaction of a drug, referred to here as query molecule 100, with a biological macromolecule 102 such as a protein.
- Drug 100 may function by projecting an active region 100a into a receptor site 102a having a distinctive three-dimensional shape.
- Fig. 1B shows a schematic view of the role of shape similarity matching in drug design.
- the researcher conducting such in silico screening may therefore conduct a search of a database 110 containing two- or three-dimensional information of a large number of molecules, posing a query in the form of the three-dimensional shape 104 of query molecule 100 in order to identify a target molecule shape 112 having features shaped similarly to those of query molecule 100.
- Fig. 1C shows a schematic view of the interaction between similar subshapes of two differently-shaped drug molecules with the same receptor.
- the affinity of a molecule 100 to receptor site 102a is generally dictated by molecule subshape such as active regions 100a or 112a, rather than by the overall shape of the molecule 100 or 112.
- shape matching processes may be computationally-intensive, requiring expensive equipment and extended periods of time to perform.
- These shape matching processes also tend to emphasize overall molecule shape similarity, and may fail to recognize similarity between subshapes of the molecules.
- FIG. 2 shows a conventional molecular similarity matching method which identifies the centroid 114 of query molecule shape 104 and the centroid 116 of target molecule 112. Centroids 114 and 116 of query and target molecule shapes 104 and 112, respectively, are placed at the origin of a three-dimensional grid.
- a database comprising three-dimensional target molecule shapes may be effectively searched for similarity to a subshape of a query molecule.
- Methods in accordance with embodiments of the present invention thus enable a researcher to identify promising candidates for biological screening experiments during the drug discovery process, based upon only a query molecule of known three-dimensional structure exhibiting affinity to a particular receptor.
- triangle matching is first performed between a query subshape triangle representative of a local volume distribution within the query molecule, and a target subshape triangle representative of a local volume distribution within a target molecule of the database.
- Shape matching is then performed between the query molecule and the target molecule, based upon overlap of the target molecule and query molecule as determined by alignment of matched query and target subshape triangles. Comparison of a characteristic direction assigned to each vertex of the subshape triangles based upon a principal axis of sampled local volume may eliminate unsuitable subshape triangle pairs from consideration prior to the shape matching step, thereby optimizing computational efficiency.
- An embodiment of a method for comparing a query molecule shape with a target molecule shape comprises sampling a distribution of local volume of the query molecule shape to generate a plurality of skeleton points, each skeleton point including a location and a direction. The direction of each skeleton point is determined by a principal axis of the sampled local volume distribution of the query molecule shape.
- a distribution of local volume of the target molecule shape is sampled to generate a plurality of terminal points fewer in number than the skeleton points, each terminal point including a location and a direction. The direction of each terminal point is determined by a principal axis of the sampled local volume distribution of the target molecule shape.
- a query subshape triangle is created from three skeleton points.
- a target subshape triangle is created from three terminal points.
- the query subshape triangle and the target subshape triangle are matched to determine an optimal translation and rotation of the target subshape triangle relative to the query subshape triangle to align corresponding skeleton and terminal points.
- the query molecule shape and the target molecule shape are overlapped by alignment of matched target and query subshape triangles, and the overlapped query and target molecule shapes are compared.
- a query molecule shape is provided.
- a distribution of local volume of the query molecule shape is sampled to generate a plurality of skeleton points.
- Each skeleton point includes a location and a direction, the direction determined by a principal axis of the sampled local volume distribution of the query molecule shape.
- a distribution of local volume of a target molecule shape is sampled to generate a plurality of terminal points fewer in number than the skeleton points.
- Each terminal point includes a location and a direction, the direction determined by a principal axis of the sampled local volume distribution of the target molecule shape.
- Query subshape triangles are created from three skeleton points.
- Target subshape triangles are created from three terminal points.
- Triangle matching values are determined for query/target subshape triangle pairs having an optimal translation and rotation relative to one another. Net direction differences of corresponding skeleton points and terminal points are determined only for vertices of query/target subshape triangle pairs satisfying a triangle matching threshold.
- the query molecule shape and the target molecule shape are overlapped by alignment of the query subshape triangle and the target subshape triangle only for query/target subshape triangle pairs satisfying a net direction difference threshold. The overlapped query and target molecule shapes are then compared.
- Fig. 1A shows a simplified schematic view of the interaction of a drug with a biological macromolecule.
- Fig. 1B shows a schematic view of the role of shape similarity matching in drug design.
- Fig. 1C shows a schematic view of the interaction between similar subshapes of two differently-shaped drug molecules with the same receptor.
- Fig. 2 shows a schematic view of a conventional molecular shape matching technique
- FIG. 3 shows a schematic view of one embodiment of a molecular subshape matching technique in accordance with the present invention
- FIG. 4 shows a flowchart of steps performed for molecular subshape matching in accordance with one embodiment of the present invention
- Figs. 5A-5K show schematic views of the steps of Fig. 4;
- FIG. 6A is a flowchart showing the steps of one embodiment of a method in accordance with the present invention for generating skeleton points from a target molecule shape
- FIG. 6B is a flowchart showing the steps of one embodiment of a method in accordance with the present invention for generating a minimum required number of initial skeleton points
- Fig. 6C shows a schematic view of the iterative process for generating additional initial skeleton points of a query molecule shape
- FIG. 7 shows a schematic view of the succession of various filtering steps performed in accordance with an embodiment of the present invention
- FIG. 8A is a simplified diagram of a computing device for processing information according to an embodiment of the present invention.
- FIG. 8B is an illustration of basic subsystems in the computer system of Fig. 8A.
- FIG. 9 is a simplified block diagram of an embodiment of a software program used to perform subshape matching in accordance with the present invention.
- Figs. 10A-10C show the two-dimensional connectivities of a query molecule and two target molecules utilized in an experiment to demonstrate an embodiment of a method in accordance with the present invention.
- Figs. 11A-11E show the three-dimensional alignment of the query molecule and the target molecules obtained from the first experiment.
- Fig. 12 is a flow chart showing the steps of a method for applying a subshape-matched target molecule in accordance with an embodiment of the present invention to identify possible drug leads.
- Fig. 13A shows the two-dimensional connectivity of the ltlp and ppp molecules.
- Fig. 13B shows a three-dimensional representation of the ltlp and ppp ligands aligned as bound to thermolysin.
- Fig. 14A shows a simplified cross-sectional view of a two-dimensional representation of molecule shape volume encoding.
- Fig. 14B shows a simplified cross-sectional view of a two-dimensional representation of molecule shape surface encoding.
- FIG. 3 shows a schematic view of operation of a method in accordance with one embodiment of the present invention.
- a database comprising three-dimensional target molecule shapes 112 may be effectively searched for similarity to a subshape 104a of query molecule 100 having overall shape 104.
- triangle matching is first performed between a query subshape triangle 140 representative of a local volume distribution within query molecule 100, and a target subshape triangle 142 representative of a local volume distribution within target molecule 112.
- Fig. 4 shows a schematic flow chart of one embodiment of a method 400 in accordance with the present invention for searching for three-dimensional target molecule shapes matching a query molecule shape.
- Figs. 5A-5K show schematic views of the corresponding steps in the method of Fig. 4.
- query molecule shape 104 of Fig. 5 A is provided.
- the shape of the query molecule may be determined from conformational analysis of the molecule. Examples of approaches for analyzing the three-dimensional conformation of a molecule are presented by Smellie et al., "Conformational Analysis by intersection: Ring conformation", Proc. of the 217th Meeting of the ACS, Anaheim (1999), and by Smellie et al., "Conformational Analysis by Intersection", J. Comput. Chem. Vol. _, No. _, pp.
- the shape of the query molecule may be determined from experimental results, for example through multi-dimensional NMR spectroscopy studies, circular dichroism (CD) spectroscopy studies, or x-ray crystallography of the query molecule 100 bound to receptor 102.
- target molecule shape 112 of Fig. 5B is provided.
- Shape 112 of the target molecule may be obtained from information contained in a relevant database, which may contain a direct representation of the configuration of the molecule in three-dimensional space.
- formats for presenting representations of molecules in space include, but are not limited to, the SMILES format from Daylight Chemical Information Systems, Mission Viejo, California, and described by Weininger, in "SMILES 1. Introduction and Encoding Rules", J. Chem. Inf. Comput. Sci. 28, 31 (1988), incorporated herein by reference, the MOL2 format by Tripos Inc. of St.
- the three-dimensional shape 112 of the target molecule can be derived from conformational analysis of two-dimensional molecular connectivity information.
- the query and target molecule shapes may be superimposed onto a three-dimensional cubic grid to facilitate encoding of the molecule shape.
- Grid spacing, the edge length of each cube of the lattice, may be specified by the user and determines the accuracy of the shape encoding.
- the volume of a molecule shape may be encoded using a bit vector whose length is proportional to the number of grid points.
- FIG. 14A shows a simplified cross-sectional view of a two-dimensional representation of volume encoding.
- Molecule 1400 is represented by shape 1402.
- Grid points falling within interior region 1404 are assigned an occupancy value of 3.
- the occupancy value of a grid point decreases gradually moving beyond the van der Waals surface of the molecule.
- $r_w$ is the van der Waals radius of atom 1400a
- $r_1 = r_w + \dfrac{r_b}{s_g}$
- $r_2 = r_w + \dfrac{2 r_b}{s_g}$
- $r_w$ = van der Waals radius of the heavy atom
- $r_b$ = user-specified parameter
- $s_g$ = grid spacing.
- grid points falling within interior surface region 1406 are assigned a value of 2
- grid points falling within exterior surface region 1408 are assigned a value of 1
- grid points falling outside of the molecule shape entirely are assigned a value of 0.
- the occupancy at each grid point may be assigned based on its distance to the closest heavy atom.
- Two bits are assigned to store the occupancy of an individual grid point, allowing for four distinct values based on whether the grid point is located at the interior of the molecule, the exterior of the molecule, or at the interior or exterior surface of the molecule.
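The four occupancy levels and their two-bit storage can be sketched as follows. This is only an illustration under stated assumptions, not the implementation of the described method: the function names `occupancy`, `pack_occupancies`, and `unpack_occupancy` are hypothetical, and the choice of `r_w`, `r_w + r_b/s_g`, and `r_w + 2*r_b/s_g` as the boundaries between the interior, interior surface, and exterior surface regions is an interpretation of the parameters listed above.

```python
def occupancy(dist, r_w, r_b, s_g):
    """Return an occupancy value 0-3 from the distance to the closest heavy atom.

    Assumption: the region boundaries are r_w, r1 = r_w + r_b/s_g and
    r2 = r_w + 2*r_b/s_g, with dist expressed in the same units as r_w.
    """
    r1 = r_w + r_b / s_g
    r2 = r_w + 2.0 * r_b / s_g
    if dist <= r_w:
        return 3  # interior
    if dist <= r1:
        return 2  # interior surface region
    if dist <= r2:
        return 1  # exterior surface region
    return 0      # outside the molecule shape


def pack_occupancies(values):
    """Pack 2-bit occupancy values into a bytearray, four grid points per byte."""
    packed = bytearray((len(values) + 3) // 4)
    for i, v in enumerate(values):
        packed[i // 4] |= (v & 0b11) << (2 * (i % 4))
    return packed


def unpack_occupancy(packed, i):
    """Read back the occupancy value of grid point i."""
    return (packed[i // 4] >> (2 * (i % 4))) & 0b11
```

With two bits per grid point, the packed vector grows linearly with the number of grid points, which is consistent with the bit-vector encoding described above.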
- Although a grid for volume encoding is described, the invention is not limited to this particular technique, and the volumes of molecule shapes can be encoded in other ways, for example based upon the coordinates of the atoms comprising the molecule.
- a distribution of local volume of the query molecule shape is sampled to generate a plurality of skeleton points 120 of Fig. 5C.
- Figs. 6A-6B discussed below, provide a detailed description of the generation of skeleton points.
- Each skeleton point 120 includes a characteristic location 120a and direction 120b, and may further include one or more chemical feature types 120c.
- the location 120a of a particular skeleton point corresponds to its position within the three-dimensional query molecule shape.
- the direction 120b of the skeleton point is determined by a principal axis of the sampled local volume corresponding to that particular skeleton point.
- Chemical feature types 120c of the skeleton point are determined by chemical groups proximate to that skeleton point.
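One way to obtain a direction from a sampled local volume is to take the dominant eigenvector of the covariance matrix of the sampled grid points. The sketch below does this by power iteration in plain Python. The function name `principal_axis`, the representation of the local volume as a list of 3-D points, and the use of power iteration are all assumptions made for illustration; the text does not specify how the principal axis is computed.

```python
import math


def principal_axis(points, iterations=200):
    """Estimate the dominant principal axis of a 3-D point cloud.

    Uses power iteration on the covariance matrix about the centroid.
    """
    n = len(points)
    c = [sum(p[k] for p in points) / n for k in range(3)]
    # 3x3 covariance matrix of the sampled points about their centroid
    cov = [[sum((p[i] - c[i]) * (p[j] - c[j]) for p in points) / n
            for j in range(3)] for i in range(3)]
    s = 1.0 / math.sqrt(3.0)
    v = [s, s, s]  # arbitrary unit starting vector
    for _ in range(iterations):
        w = [sum(cov[i][j] * v[j] for j in range(3)) for i in range(3)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            break  # degenerate cloud: keep the current vector
        v = [x / norm for x in w]
    return v
```

The returned axis has no intrinsic sign, and the iteration can stall if the starting vector happens to be orthogonal to the dominant axis; a production implementation would more likely use a library eigen-solver.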
- a distribution of local volume of the target molecule shape is sampled to generate a plurality of terminal points 122 of Fig. 5D.
- the number of terminal points 122 is fewer than the number of skeleton points of the query molecule shape.
- each terminal point 122 includes a characteristic location 122a and direction 122b, and may also include one or more chemical feature types 122c.
- the location of a particular terminal point corresponds to its position within the three-dimensional target molecule shape.
- the direction of the terminal point is determined by a principal axis of the sampled local volume corresponding to the particular terminal point.
- Chemical feature types of the terminal points may be determined by the identities of chemical groups proximate to the terminal point. A description of the generation of terminal points is also provided below.
- In a fifth step 410, as shown in Fig. 5E, the distance 130a between selected pairs of skeleton points 120 and the distance 130b between selected pairs of terminal points 122 are measured and compared. Based upon the difference between distances 130a and 130b, an edge matching value is calculated.
- In a sixth step 412, as shown in Fig. 5F, query subshape triangles 140 and target subshape triangles 142 are assembled from three pairs of skeleton points 120 and terminal points 122, respectively, with corresponding subshape triangle edges formed from pairs of skeleton points and pairs of terminal points. For each subshape triangle pair, the edge matching values just calculated for each edge pair are checked against an edge matching threshold value. Only subshape triangle pairs having all three corresponding edges satisfying this edge matching threshold are passed on to the next step.
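The edge matching filter of these two steps can be sketched as follows. This is a minimal illustration: the names `edge_matching_value` and `triangle_passes_edge_filter` are hypothetical, and using the absolute difference of edge lengths as the edge matching value is an assumption, since the text only says the value is based upon the difference between the distances.

```python
import math


def edge_matching_value(p1, p2, q1, q2):
    """Absolute difference between a query edge length and a target edge length."""
    return abs(math.dist(p1, p2) - math.dist(q1, q2))


def triangle_passes_edge_filter(query_tri, target_tri, threshold):
    """True only if all three corresponding edges differ by at most threshold.

    query_tri and target_tri are sequences of three (x, y, z) points whose
    vertices correspond by index.
    """
    edges = [(0, 1), (1, 2), (2, 0)]
    return all(
        edge_matching_value(query_tri[i], query_tri[j],
                            target_tri[i], target_tri[j]) <= threshold
        for i, j in edges
    )
```

Because only distances are compared, this check needs no alignment of the two triangles, which is what makes it cheap enough to run before triangle matching.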
- query subshape triangle 140 is matched with target subshape triangle 142 and a triangle matching value is generated.
- the triangle matching process produces a triangle matching value and a translation 146 and rotation 148 of the target subshape triangle 142 relative to the query subshape triangle 140.
- the triangle matching value may be based upon the root mean square difference (RMSD) between the query and target subshape triangles.
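A triangle matching value of this kind can be illustrated with a simple frame-based superposition. Note that this is a swapped-in simplification rather than a least-squares fit: each triangle gets an orthonormal frame (centroid, direction to its first vertex, plane normal), the target is rigidly moved so the two frames coincide, and the RMSD over the three vertex pairs is reported. All names are hypothetical, vertices are assumed to correspond by index, and the method by which the optimal translation and rotation are actually found is not specified in the text.

```python
import math


def _sub(a, b):
    return [a[k] - b[k] for k in range(3)]


def _dot(a, b):
    return sum(a[k] * b[k] for k in range(3))


def _cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def _unit(a):
    n = math.sqrt(_dot(a, a))
    return [x / n for x in a]


def _frame(tri):
    """Centroid and right-handed orthonormal frame of a non-degenerate triangle."""
    c = [sum(p[k] for p in tri) / 3.0 for k in range(3)]
    e1 = _unit(_sub(tri[0], c))                                   # in-plane axis
    e3 = _unit(_cross(_sub(tri[1], tri[0]), _sub(tri[2], tri[0])))  # plane normal
    e2 = _cross(e3, e1)                                           # completes the frame
    return c, (e1, e2, e3)


def align_and_rmsd(query_tri, target_tri):
    """Rigidly move target_tri onto query_tri's frame; return (moved, rmsd)."""
    cq, fq = _frame(query_tri)
    ct, ft = _frame(target_tri)
    moved = []
    for p in target_tri:
        # coordinates of p in the target frame, re-expressed in the query frame
        local = [_dot(_sub(p, ct), axis) for axis in ft]
        moved.append([cq[k] + sum(local[a] * fq[a][k] for a in range(3))
                      for k in range(3)])
    sq = sum(_dot(_sub(m, q), _sub(m, q)) for m, q in zip(moved, query_tri))
    return moved, math.sqrt(sq / 3.0)
```

For congruent triangles the RMSD is zero up to rounding; for non-congruent ones the value is an upper bound on the RMSD that an optimal superposition would achieve.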
- subshape triangle pairs having a triangle matching value satisfying a triangle matching threshold are subjected to feature matching.
- Specifically, as shown in Fig. 5H, chemical feature types 120c and 122c proximate to corresponding vertices of matched query/target subshape triangle pairs 144 are compared, and a feature difference is calculated.
- feature types that may be identified proximate to subshape triangle vertices include hydrogen bond donors, hydrogen bond acceptors, positive charges, negative charges, aromatic groups, and hydrophobic groups.
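A feature difference between corresponding vertices could be computed, for example, by counting feature types present at one vertex but absent at its counterpart. The sketch below assumes exactly that; the function name `feature_difference`, the string labels, and the symmetric-difference count are illustrative choices, not a scoring scheme given in the text.

```python
def feature_difference(query_features, target_features):
    """Count feature types that appear at a vertex but not at its counterpart.

    Each argument is a sequence of three collections of feature labels,
    one per triangle vertex, with vertices corresponding by index.
    """
    return sum(len(set(q) ^ set(t))
               for q, t in zip(query_features, target_features))
```

A triangle pair would then be kept only if this count does not exceed the feature difference threshold.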
- In a ninth step 418, matched subshape triangles satisfying a feature difference threshold are subjected to direction matching. Specifically, as shown in Fig. 5I, directions 120b and 122b assigned to corresponding skeleton points 120 and terminal points 122 of the vertices of matched query/target subshape triangle pairs 144 are compared, and a net direction difference is calculated.
- This calculation includes taking the sum of the sines (or cosines) of angles θ created by the respective directions of each corresponding skeleton point and terminal point.
- the directions of the skeleton/terminal points considered during this direction matching step reflect rotation/translation of the respective subshape triangles performed during the previous triangle matching step.
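The sum-of-sines form of the net direction difference can be sketched as below. The names are hypothetical, and the target directions are assumed to have already been rotated by the transformation found during triangle matching. Using the sine has the convenient property that parallel and antiparallel directions both score zero, which suits an axis with no preferred sign; that reading is an interpretation rather than something stated in the text.

```python
import math


def _norm(a):
    return math.sqrt(sum(x * x for x in a))


def sine_between(u, v):
    """Sine of the angle between two non-zero 3-D vectors, via the cross product."""
    cross = (u[1] * v[2] - u[2] * v[1],
             u[2] * v[0] - u[0] * v[2],
             u[0] * v[1] - u[1] * v[0])
    return _norm(cross) / (_norm(u) * _norm(v))


def net_direction_difference(query_dirs, target_dirs):
    """Sum of sines of the angles between corresponding vertex directions."""
    return sum(sine_between(u, v) for u, v in zip(query_dirs, target_dirs))
```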
- molecule volumes corresponding to matched subshape triangles satisfying a direction matching threshold are overlapped.
- query molecule shape 104 and target molecule shape 112 are overlapped. This overlap is based upon alignment of query and target subshape triangles satisfying a direction difference threshold value.
- the result of this overlap produces three volumes: a volume 190 common to the query and target molecule shapes, a volume 192 of the query molecule shape projecting outside of the target molecule shape, and a volume 194 of the target molecule shape projecting outside of the query molecule shape.
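When both shapes are encoded on the same grid, the three volumes can be obtained with set operations on occupied grid cells. The sketch below assumes each aligned shape is given as a collection of integer grid indices and that every cell has the same volume; the function name `overlap_volumes` and the `cell_volume` parameter are hypothetical.

```python
def overlap_volumes(query_cells, target_cells, cell_volume=1.0):
    """Return (common, query_only, target_only) volumes of two aligned shapes.

    Each shape is a collection of (i, j, k) grid indices on a shared grid.
    """
    q, t = set(query_cells), set(target_cells)
    common = len(q & t) * cell_volume       # volume shared by both shapes
    query_only = len(q - t) * cell_volume   # query projecting outside target
    target_only = len(t - q) * cell_volume  # target projecting outside query
    return common, query_only, target_only
```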
- In an eleventh step 422, shape matching is performed for overlapped query and target molecule shapes, and a shape matching value is calculated.
- the shape matching value generated during this step reflects the overlapped and non-overlapped volumes of the target and query molecule shapes; a high degree of overlap between the query and target molecule shapes indicates a favored match.
- target and query molecule shapes are overlaid onto a three-dimensional cubic grid.
- a shape matching value is determined based on the position of the grid point relative to the surface of the molecule shapes.
- the shape matching value considers grid points occupying volumes common to the molecule shapes, grid points occupying volumes of the query shape projecting out of the target volume, and grid points occupying volumes of the target shape projecting out of the query shape.
- Because shape matching in accordance with embodiments of the present invention is performed based upon alignment of subshape triangles 140 and 142, more accurate matching of subshapes between molecules 100 and 112 will result.
- the relative size of the target and query molecules may influence the algorithm utilized to quantify overlap between the molecule volumes or surfaces.
- One approach for calculating overlap between the target and query molecule volumes or shells during shape matching is the protrusion distance calculated according to equation (1):
- where S is the volume of the smaller shape protruding out of the larger shape, and
- N is the volume of the smaller shape.
- The Tanimoto distance is a second measure of molecular overlap.
- Because the Tanimoto distance metric assigns equal weight to the volumes of the query and target molecules regardless of their relative size, this metric is particularly suited for shape matching between query and target molecules of approximately the same size.
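The two overlap measures can be sketched from the common, query-only, and target-only volumes of the aligned shapes. Both formulas here are assumptions: equation (1) is not reproduced in this text, so the protrusion distance is taken to be the ratio S/N of the quantities defined above, and the Tanimoto distance is taken in its common form of one minus intersection over union. The function names are hypothetical.

```python
def protrusion_distance(common, query_only, target_only):
    """Assumed form S / N: fraction of the smaller shape lying outside the larger.

    Raises ZeroDivisionError if the smaller shape has zero volume.
    """
    query_vol = common + query_only
    target_vol = common + target_only
    if query_vol <= target_vol:
        s, n = query_only, query_vol    # query is the smaller shape
    else:
        s, n = target_only, target_vol  # target is the smaller shape
    return s / n


def tanimoto_distance(common, query_only, target_only):
    """Assumed form 1 - |A and B| / |A or B| on the two shape volumes."""
    union = common + query_only + target_only
    return 1.0 - common / union
```

In this reading, a small shape sitting almost entirely inside a much larger one scores well on protrusion distance but poorly on Tanimoto distance, which matches the remark that the Tanimoto metric suits molecules of similar size.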
- the relative size of the target and query molecules may also influence the particular shapes of the molecules that are overlapped during the shape matching step.
- the chemical activity of a molecule is determined predominantly by shapes and functionalities presented on the surface of the molecule and available to interact with the chemical or biological environment, including other molecules such as receptors or enzymes. Accordingly, during shape matching it may be valuable to emphasize the importance of overlap or non-overlap between portions of the molecular shapes proximate to the surface.
- embodiments of subshape matching processes in accordance with the present invention may utilize a shape matching step based upon overlap between the volumes of shells representing the surface regions (i.e. outlines) of the aligned molecular shapes, rather than the entire volume of the molecular shapes.
- In this surface matching approach, interior volumes of the molecule shapes are not considered during calculation of overlap, whether using the protrusion or Tanimoto distance measures previously described.
- FIG. 14B shows a simplified cross-sectional view of an alternative embodiment in accordance with the present invention for use in surface matching, wherein grid points falling within the interior 1404 or exterior 1412 of molecule shape 1402 are accorded no value, and grid points occurring in shells 1414, 1416, 1418, 1420, or 1422 at or near the surface are assigned higher grid occupancy values, thereby creating a molecule volume in the shape of a shell.
- the relative sizes of the target and query molecules may influence both the choice of shapes (i.e. molecule volume or molecule shell volume) that are overlapped during the shape matching process, and the algorithm utilized to quantify that overlap.
- a protein bump checking step is performed for query and target molecule shapes that satisfy a shape matching threshold.
- This protein bump checking step is depicted in Fig. 5K.
- Query molecule 100 is oriented within receptor 102a of macromolecule shape 102, which is typically a protein but could be another type of macromolecule such as a nucleic acid.
- target molecule shape 112 is then substituted for the query molecule shape 104 within receptor 102a, and an overlap 196 between target molecule shape 112 and protein shape 102 is identified.
- a protein overlap value reflecting overlap between the target molecule shape and the protein shape is then calculated. This protein overlap value reflects the volume overlapped between the protein and target molecule shapes. Unlike the shape matching value calculated previously, a high degree of overlap between the protein shape and the target molecule shape would indicate a disfavored interaction between target and receptor.
- target molecule shapes satisfying a protein overlap threshold are identified as close matches. These ultimately selected target molecules are favorable candidates for actual screening experiments to determine affinity to the receptor.
- one aspect of embodiments of methods in accordance with the present invention is that shape matching between query and target molecules is based upon alignment of subshape triangles representative of a local molecule volume distribution. These subshape triangles are in turn generated from skeleton and terminal points resulting from sampling of local volume distributions.
- Fig. 6A shows a flow chart illustrating the steps of one embodiment of a method in accordance with the present invention for generating skeleton points.
- In a first step 602 of skeleton point generation process 600, the known three-dimensional shape of a query molecule is overlaid onto a three-dimensional grid.
- grid points falling within the query molecule shape are identified.
- a local volume is sampled at each of the encompassed grid points by positioning spheres of a consistent radius at each encompassed grid point, and then calculating the fraction of the volume of the sphere occupied by the query molecule shape.
- Spheres falling below a minimum volume fraction are determined as defining the boundaries of the query molecule shape.
- centers of these minimum volume fraction spheres are clustered together to produce initial skeleton points.
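The sphere-sampling step can be sketched on an integer grid as follows. The shape is assumed to be a set of occupied `(i, j, k)` grid indices, the sphere volume is approximated by counting grid cells, and the names `volume_fraction` and `boundary_points` are hypothetical. The subsequent clustering of low-fraction sphere centres into initial skeleton points is not shown.

```python
import math


def volume_fraction(center, occupied, radius, spacing=1.0):
    """Fraction of grid cells within radius of center that lie inside the shape."""
    reach = int(math.floor(radius / spacing))
    cx, cy, cz = center
    inside = total = 0
    for dx in range(-reach, reach + 1):
        for dy in range(-reach, reach + 1):
            for dz in range(-reach, reach + 1):
                # keep only cells whose centres fall inside the sampling sphere
                if (dx * dx + dy * dy + dz * dz) * spacing * spacing <= radius * radius:
                    total += 1
                    if (cx + dx, cy + dy, cz + dz) in occupied:
                        inside += 1
    return inside / total


def boundary_points(occupied, radius, max_fraction, spacing=1.0):
    """Occupied grid points whose sampling sphere is filled below max_fraction."""
    return [p for p in occupied
            if volume_fraction(p, occupied, radius, spacing) < max_fraction]
```

For a solid 5x5x5 block sampled with a radius of one grid spacing, only the eight corners fall below a fraction of 0.6, illustrating how low-fraction spheres pick out the extremities of a shape.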
- Fig. 6B is a flowchart showing the steps of one embodiment of a method in accordance with the present invention for generating additional initial skeleton points.
- a pair (A and B) of existing initial skeleton points is selected.
- middle point C is determined along a line segment joining initial skeleton points A and B.
- a sphere having center O and radius r is placed at middle point C.
- centroid D of the query molecule shape volume contained within the sphere is computed.
- In a step 660, the distance between points O and D is computed.
- distance OD is compared to a threshold value. If this distance OD is greater than the threshold value, in a step 664, the sphere is moved such that O coincides with D. Steps 658, 660, and 662 are then repeated until the distance between sphere center O and centroid D is less than the threshold value. As shown in Fig. 6C, this iterative procedure attempts to place point D within query molecule shape 104, even if the original midpoint of the line AB was located outside of query molecule shape 104.
- In a step 666, the distance d from centroid D to the closest initial skeleton point is computed.
- In a step 668, the location of centroid D and the corresponding distance d are stored.
- a determination as to whether centroids have been determined for all pairs of skeleton points is made. Where additional pairs of untested initial skeleton points remain, steps 652, 654, 656, 658, 660, 662, and, if necessary, step 664, are repeated until all pairs of initial skeleton points have served as a basis for the generation of centroids.
- a new initial skeleton point is chosen from the stored points such that the new initial skeleton point selected corresponds to the maximum distance d.
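The iterative recentring of the sphere and the final selection by maximum distance d can be sketched as below. The shape volume is approximated by a list of sample points (for example occupied grid points), and the function names, the `tol` and `max_iter` parameters, and the return of `None` for an empty sphere are all hypothetical choices for illustration.

```python
import math


def settle_on_centroid(start, shape_points, radius, tol=1e-3, max_iter=100):
    """Move sphere centre O to the centroid D of enclosed shape points until |OD| < tol."""
    o = tuple(start)
    for _ in range(max_iter):
        inside = [p for p in shape_points if math.dist(o, p) <= radius]
        if not inside:
            return None  # the sphere contains no part of the shape
        d = tuple(sum(p[k] for p in inside) / len(inside) for k in range(3))
        if math.dist(o, d) < tol:
            return d
        o = d  # recentre the sphere on the centroid and repeat
    return o


def pick_new_skeleton_point(candidates, skeleton_points):
    """Choose the stored centroid farthest from its nearest existing skeleton point."""
    return max(candidates,
               key=lambda c: min(math.dist(c, s) for s in skeleton_points))
```

In an L-shaped example with skeleton points at the two ends, the midpoint between them lies outside the shape, and the iteration pulls the sphere centre toward the corner of the L where the shape actually is.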
- supplemental skeleton points are then added to characterize remaining portions of the query molecule shape.
- a center of a sphere of a given radius is placed at each grid point encompassed within the query molecule shape.
- the centroid of the query molecule shape volume inside the sphere is then computed and stored, along with the volume fraction of the molecule shape falling within the sphere. Spheres falling above a minimum volume fraction are determined as defining the backbone of the query molecule.
- centroids that are close to one another are removed.
- this filtering step 614 can involve starting from each skeleton point (initial and supplemental) and drawing a sphere of a particular radius. All stored high volume fraction centroids falling within the sphere are considered. The centroid of the sphere having the largest corresponding volume fraction is added as a supplemental skeleton point. Remaining centroids within the sphere are discarded. This process is repeated over all the initial and supplemental skeleton points generated.
- the supplemental skeleton points, together with the initial skeleton points define the backbone/skeleton of the query molecule shape.
- the total number of skeleton points varies according to the overall size of the query molecule, with a range of between 25-100 skeleton points being typical.
- a truncated version of the same process is utilized to sample local volumes of the target molecule shape and thereby generate terminal points.
- the primary difference between the process for generating skeleton points of the query molecule and the process for generating terminal points of the target molecule is that supplemental terminal points are not generated. This is because the desirable character of the three-dimensional query molecule shape has already been established through its affinity to the receptor of interest.
- a target molecule shape is merely one of many present in the database. Accordingly, in certain embodiments in accordance with the present invention only the initial terminal points are generated for a target molecule shape. Supplemental terminal points are not generated. Limiting the number of terminal points in this manner reduces the number of target subshape triangles, and thus the number of possible combinations of matched query and target subshape triangles, to a quantity manageable by the processing power generally available to personal computers or workstations.
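The effect of limiting the number of terminal points can be seen from a simple count of unordered triangles. The sketch below is only a back-of-the-envelope illustration: the function name is hypothetical, vertex orderings are ignored, and the point counts used in the example (50 skeleton points, 8 terminal points) are illustrative values rather than figures taken from the text for terminal points.

```python
from math import comb


def triangle_pair_count(n_skeleton, n_terminal):
    """Number of (query triangle, target triangle) pairs, ignoring vertex order."""
    return comb(n_skeleton, 3) * comb(n_terminal, 3)


# 50 skeleton points against 8 terminal points versus 50 against 50
few_terminal = triangle_pair_count(50, 8)
many_terminal = triangle_pair_count(50, 50)
```

Because the count grows with the cube of the number of points on each side, keeping the target side small has a large effect on the total work per database entry.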
- Methods of molecular similarity matching in accordance with embodiments of the present invention offer a number of advantages over conventional methods.
- One advantage is that the method recognizes similarity between molecule subshapes. This is because shape matching is performed upon query and target molecules aligned according to matched subshape triangles that reflect local, rather than overall molecule shape.
- Still another advantage of methods in accordance with embodiments of the present invention is efficiency in the allocation of computing power. The use of multiple subshape triangles as a basis for similarity matching increases the number of possibilities that must be considered. Calculations of the present method requiring the most computational power are triangle matching, shape matching, and protein bump checking steps.
- embodiments in accordance with the present invention perform a number of filtering steps to eliminate unpromising subshape orientations from further consideration prior to performance of the computationally-intensive steps.
- the results of this filtering are shown in Fig. 7, which schematically depicts the successively smaller number of possible combinations that must be evaluated by each method step.
- Fig. 7 shows that edge matching initially filters out a large number of potential subshape triangles of disproportionate size prior to the triangle matching step. In this manner, the number of possible subshape triangle pairs to be created from all skeleton points and all terminal points is substantially reduced.
- feature matching and direction matching serve to filter additional unpromising matched subshape triangle pairs. In this manner, the possible number of material subshape triangles is successively reduced from on the order of one million to on the order of 100 or fewer.
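- The filtering cascade described for Fig. 7 can be illustrated with a minimal Python sketch. This is not the patented implementation: the candidate record fields (`edge_error`, `triangle_error`, `features_agree`, `axis_angle_deg`), the thresholds, and the function name `run_cascade` are all assumptions made for illustration. The sketch only shows the general idea of applying inexpensive filters first and counting the survivors after each stage.

```python
from typing import Any, Callable, Dict, Iterable, List, Sequence, Tuple

# Hypothetical record describing one candidate query/target triangle pairing.
Candidate = Dict[str, Any]
Filter = Tuple[str, Callable[[Candidate], bool]]


def run_cascade(
    candidates: Iterable[Candidate], filters: Sequence[Filter]
) -> Tuple[List[Candidate], List[Tuple[str, int]]]:
    """Apply the filters in order and record how many candidates survive each stage."""
    survivors = list(candidates)
    counts = [("start", len(survivors))]
    for name, keep in filters:
        survivors = [c for c in survivors if keep(c)]
        counts.append((name, len(survivors)))
    return survivors, counts


# Toy candidates with made-up scores; the thresholds below are illustrative only.
toy_candidates = [
    {"edge_error": 0.2, "triangle_error": 0.1, "features_agree": True, "axis_angle_deg": 10.0},
    {"edge_error": 1.5, "triangle_error": 0.1, "features_agree": True, "axis_angle_deg": 5.0},
    {"edge_error": 0.1, "triangle_error": 0.9, "features_agree": True, "axis_angle_deg": 8.0},
    {"edge_error": 0.3, "triangle_error": 0.2, "features_agree": False, "axis_angle_deg": 8.0},
    {"edge_error": 0.3, "triangle_error": 0.2, "features_agree": True, "axis_angle_deg": 70.0},
]
stages = [
    ("edge matching", lambda c: c["edge_error"] <= 0.5),
    ("triangle matching", lambda c: c["triangle_error"] <= 0.5),
    ("feature matching", lambda c: c["features_agree"]),
    ("direction matching", lambda c: c["axis_angle_deg"] <= 30.0),
]
survivors, counts = run_cascade(toy_candidates, stages)
for stage_name, remaining in counts:
    print(f"{stage_name}: {remaining} candidate(s) remain")
```

- Only the candidates that survive every stage would be passed on to the more expensive shape matching and protein bump checking steps.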
- Additional filtering may be accomplished during the shape matching step to eliminate unpromising molecular overlaps.
- surface encoding refines volumes of molecule shapes into shells or surfaces, removing from consideration alignments not implicating the molecule surface considered especially relevant to chemical activity.
- Such surface matching is particularly valuable in reducing unhelpful matches involving a smaller molecule shape aligned wholly within the interior of a larger molecule shape.
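- One way to picture surface encoding is on a voxel grid. The sketch below is an assumption-laden illustration, not the encoding used by the method: it represents a molecule shape as a set of integer voxel coordinates and keeps only voxels that have at least one empty face neighbor. The names `surface_shell` and `NEIGHBOR_OFFSETS` are hypothetical.

```python
from typing import Set, Tuple

Voxel = Tuple[int, int, int]

# The six face-adjacent neighbors of a voxel on a cubic grid.
NEIGHBOR_OFFSETS = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]


def surface_shell(volume: Set[Voxel]) -> Set[Voxel]:
    """Return the voxels of `volume` that touch at least one empty neighbor."""
    shell = set()
    for (x, y, z) in volume:
        for dx, dy, dz in NEIGHBOR_OFFSETS:
            if (x + dx, y + dy, z + dz) not in volume:
                shell.add((x, y, z))
                break
    return shell


# A 3 x 3 x 3 block stands in for a large molecule shape.
large_shape = {(x, y, z) for x in range(3) for y in range(3) for z in range(3)}
large_shell = surface_shell(large_shape)

# A single voxel buried at the center stands in for a small shape placed in the interior.
buried_shape = {(1, 1, 1)}
volume_overlap = len(buried_shape & large_shape)
shell_overlap = len(buried_shape & large_shell)
print(volume_overlap, shell_overlap)
```

- In this toy case the buried shape overlaps the full volume but not the shell, which is the kind of interior alignment that surface matching is described as discounting.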
- the protein bump checking performed by embodiments of methods in accordance with the present invention ensures that only the most promising matched pairs of query/subshape triangles are considered. This is particularly important given the complexity of most receptor shapes and the large quantity of computing resources that must be allocated to describe overlap between the target and a receptor shape.
- A method in accordance with an embodiment of the present invention may dispense entirely with the protein matching step, basing subshape matching solely upon overlap of molecule shapes based upon alignment of matched subshape triangles.
- The order in which the steps are performed can also be changed without limiting the scope of the invention claimed herein.
- A method could perform feature matching either before or after direction matching, and still remain within the scope of the present invention.
- FIG. 8A is a simplified diagram of a computing device for processing information according to an embodiment of the present invention. This diagram is merely an example which should not limit the scope of the claims herein. One of ordinary skill in the art would recognize many other variations, modifications, and alternatives. Embodiments according to the present invention can be implemented in a single application program such as a browser, or can be implemented as multiple programs in a distributed computing environment, such as a workstation, personal computer or a remote terminal in a client server relationship.
- FIG. 8A shows a computer system 810 including a display device 820, a display screen 830, a cabinet 840, a keyboard 850, and a mouse 870.
- Mouse 870 and keyboard 850 are representative "user input devices."
- Mouse 870 includes buttons 880 for selection of buttons on a graphical user interface device.
- Other examples of user input devices are a touch screen, light pen, track ball, data glove, microphone, and so forth.
- Fig. 8A is representative of but one type of system for embodying the present invention. It will be readily apparent to one of ordinary skill in the art that many system types and configurations are suitable for use in conjunction with the present invention.
- Computer system 810 includes a Pentium™ class based computer, running the LINUX or Windows™ NT operating system by Microsoft Corporation.
- Mouse 870 can have one or more buttons such as buttons 880.
- Cabinet 840 houses familiar computer components such as disk drives, a processor, storage device, etc. Storage devices include, but are not limited to, disk drives, magnetic tape, solid state memory, bubble memory, etc.
- Cabinet 840 can include additional hardware such as input/output (I/O) interface cards for connecting computer system 810 to external devices, external storage, other computers or additional peripherals, further described below.
- FIG. 8B is an illustration of basic subsystems in computer system 810 of Fig. 8 A.
- This diagram is merely an illustration and should not limit the scope of the claims herein.
- the subsystems are interconnected via a system bus 875. Additional subsystems such as a printer 874, a keyboard 878, a fixed disk 879, a monitor 876, which is coupled to a display adapter 882, and others are shown.
- Peripherals and input/output (I/O) devices which couple to an I/O controller 871, can be connected to the computer system by any number of means known in the art, such as a serial port 877.
- serial port 877 can be used to connect the computer system to a modem 881, which in turn connects to a wide area network such as the Internet, a mouse input device, or a scanner.
- the interconnection via system bus allows a central processor 873 to communicate with each subsystem and to control the execution of instructions from system memory 872 or the fixed disk 879, as well as the exchange of information between subsystems.
- Other arrangements of subsystems and interconnections are readily achievable by those of ordinary skill in the art.
- Fig. 9 presents a simplified block diagram of one embodiment of a software program 900 used to perform subshape matching in accordance with the present invention.
- Program 900 exhibits a two-tier architecture that includes an interface 902 as the first tier.
- interface 902 may be written in the PYTHON programming language.
- Interface 902 is in communication with a database 903 containing information relevant to a large number of possible target molecules. This information may be organized in the form of a number of searchable categories, such as molecule name, biological activity, two-dimensional connectivity, and three-dimensional molecular shape.
- Database 903 is the MDL Drug Data Report™ (MDDR) available from MDL Information Systems, Inc. of San Leandro, California.
- the second tier of software program 900 includes a C++ library 906 and may further include a visualization module 908.
- Visualization module 908 enables the display and manipulation of various shapes during subshape matching in accordance with embodiments of the present invention.
- Visualization module 908 is in communication with molecular imaging tool 950 for display of three-dimensional molecular shapes.
- A number of independent software programs are available to serve as a molecular imaging tool, including the WebLab™ software program manufactured by Accelrys, Inc. of San Diego, California.
- C++ library 906 is formed from a number of components.
- A conformer generation module 930 enables the generation of three-dimensional conformers based upon two-dimensional molecular connectivities.
- A skeleton/terminal point generation module 932 performs the localized volume sampling and point generation techniques performed on the three-dimensional molecular volumes as described above in conjunction with Figs. 6A-B.
- An edge matching module 934 performs edge matching of distances between skeleton point pairs and terminal point pairs as previously described.
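- As a rough illustration of edge matching, the Python sketch below pairs up point-to-point distances from a query shape and a target shape when they agree within a tolerance. It is a sketch under stated assumptions: the function names `pair_distances` and `match_edges`, the tolerance value, and the example coordinates are hypothetical, and the sketch does not distinguish skeleton points from terminal points as module 934 is described as doing.

```python
import math
from itertools import combinations
from typing import Dict, List, Sequence, Tuple

Point = Tuple[float, float, float]
Edge = Tuple[int, int]


def pair_distances(points: Sequence[Point]) -> Dict[Edge, float]:
    """Distance for every unordered pair of points, keyed by the pair's indices."""
    return {
        (i, j): math.dist(points[i], points[j])
        for i, j in combinations(range(len(points)), 2)
    }


def match_edges(
    query_points: Sequence[Point],
    target_points: Sequence[Point],
    tolerance: float = 0.5,
) -> List[Tuple[Edge, Edge]]:
    """Return (query edge, target edge) pairs whose lengths differ by at most `tolerance`."""
    query_edges = pair_distances(query_points)
    target_edges = pair_distances(target_points)
    return [
        (q_edge, t_edge)
        for q_edge, q_len in query_edges.items()
        for t_edge, t_len in target_edges.items()
        if abs(q_len - t_len) <= tolerance
    ]


# Made-up coordinates: the query edges have lengths 3, 4 and 5.
query = [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0), (0.0, 4.0, 0.0)]
target = [(1.0, 1.0, 1.0), (1.0, 1.0, 4.2), (9.0, 1.0, 1.0)]
matched_edges = match_edges(query, target)
print(matched_edges)
```

- Edges whose lengths have no counterpart in the other shape drop out here, so no triangle needs to be built from them in the later triangle matching step.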
- a triangle matching module 936 performs the matching of query/target subshape triangle pairs assembled from the skeleton/terminal points.
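- A simple way to compare two subshape triangles is by their side lengths, which do not change under rotation or translation. The sketch below assumes that criterion for illustration only; the names `side_lengths` and `triangles_match` and the tolerance are hypothetical, and the comparison ignores vertex type (skeleton versus terminal) and handedness.

```python
import math
from typing import Tuple

Point = Tuple[float, float, float]
Triangle = Tuple[Point, Point, Point]


def side_lengths(triangle: Triangle) -> Tuple[float, ...]:
    """Sorted side lengths, so the result does not depend on vertex order."""
    a, b, c = triangle
    return tuple(sorted((math.dist(a, b), math.dist(b, c), math.dist(c, a))))


def triangles_match(query: Triangle, target: Triangle, tolerance: float = 0.5) -> bool:
    """True when every corresponding sorted side length agrees within `tolerance`."""
    return all(
        abs(q - t) <= tolerance
        for q, t in zip(side_lengths(query), side_lengths(target))
    )


# Made-up triangles: the second is the first moved and with its vertices reordered.
query_triangle = ((0.0, 0.0, 0.0), (3.0, 0.0, 0.0), (0.0, 4.0, 0.0))
moved_triangle = ((10.0, 10.0, 14.0), (10.0, 10.0, 10.0), (13.0, 10.0, 10.0))
small_triangle = ((0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0))

print(triangles_match(query_triangle, moved_triangle))
print(triangles_match(query_triangle, small_triangle))
```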
- a feature matching module 938 compares the chemical environment proximate to the vertices of matched query/target subshape triangles.
- a direction matching module 940 of C++ library 906 compares the principal axes of local sampled volume at the vertices of the matched subshape triangles.
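- Direction matching can be sketched as a test on the angle between two principal axes. The code below assumes both axes are already expressed in a common frame (for example, after aligning the matched triangles) and treats an axis and its negation as the same direction. The names `axis_angle_deg` and `directions_match` and the 30 degree threshold are assumptions, not values taken from the method.

```python
import math
from typing import Sequence


def axis_angle_deg(u: Sequence[float], v: Sequence[float]) -> float:
    """Angle in degrees between two axes, ignoring their sign (range 0 to 90)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    if norm == 0.0:
        raise ValueError("axes must be non-zero vectors")
    # Clamp to 1.0 to guard against rounding slightly above the valid acos range.
    cos_angle = min(1.0, abs(dot) / norm)
    return math.degrees(math.acos(cos_angle))


def directions_match(u: Sequence[float], v: Sequence[float], max_angle_deg: float = 30.0) -> bool:
    """True when the two axes are within `max_angle_deg` of each other."""
    return axis_angle_deg(u, v) <= max_angle_deg


print(directions_match((1.0, 0.0, 0.0), (-1.0, 0.0, 0.0)))  # opposite sign, same axis
print(directions_match((1.0, 0.0, 0.0), (1.0, 1.0, 0.0)))   # 45 degrees apart
```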
- A shape matching module 942 performs alignment of the matched subshape triangles, overlaps the query molecule shape and the target molecule shape based upon this alignment, and then calculates relative overlap between query and target molecule shapes as previously described in connection with Fig. 5J.
- A protein matching module 944 performs alignment of the target molecule shape within the active site of the protein, and calculates relative overlap between protein and target molecule shapes as previously described in connection with Fig. 5K. Protein matching module 944 may obtain three-dimensional protein shapes from database 903.
- Embodiments of subshape matching methods and codes in accordance with the present invention may be utilized in performing a number of applications in connection with the discovery and testing of pharmaceutical compounds.
- One possible application for methods of subshape matching in accordance with embodiments of the present invention is in conformational evaluation of potential therapeutic compounds. For example, experimental results may reveal that several ligands exhibit affinity to a particular receptor, but the actual three-dimensional orientation of only one of the ligands is known or suspected.
- An embodiment of subshape similarity matching in accordance with the present invention could utilize the known shape of the bound ligand as the query shape, and various conformers of the other active ligands as the target shapes.
- Such a conformational evaluation would reveal for the target molecules the conformation(s) in which they may bind to the receptor. Further, the three-dimensional orientations of these target molecule binding conformations can also be studied to reveal key ligand-receptor interactions during the binding process.
- the method was utilized to evaluate conformers for active compounds. Specifically, an experiment was performed comparing the molecular alignment predicted by molecular subshape matching versus the actual molecular alignment as revealed by crystallographic data. Crystallographic data was available showing the alignment of one large ligand (fibrinogen), and each of two smaller ligands (NAPAP, PPACK) within the same receptor of a protein (thrombin). Using this empirically-derived alignment information, the larger ligand (fibrinogen) was utilized as a query molecule shape for subshape matching from a database containing the smaller ligands as target molecule shapes.
- TABLE C shows the results of this subshape matching experiment. The last column of TABLE C quantifies the difference in RMSD between the molecular alignment predicted by subshape matching in accordance with an embodiment of the present invention, and the actual molecular alignment empirically determined from prior crystallographic studies.
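- For readers unfamiliar with the RMSD measure mentioned for TABLE C, the sketch below computes the root-mean-square deviation between two sets of corresponding atom coordinates. It assumes the predicted and crystallographic poses list the same atoms in the same order and share a coordinate frame; the function name `rmsd` and the coordinates are illustrative and are not data from the experiment.

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, float, float]


def rmsd(predicted: Sequence[Point], reference: Sequence[Point]) -> float:
    """Root-mean-square deviation between corresponding points of two poses."""
    if len(predicted) != len(reference) or not predicted:
        raise ValueError("poses must be non-empty and contain the same number of points")
    squared = sum(math.dist(p, r) ** 2 for p, r in zip(predicted, reference))
    return math.sqrt(squared / len(predicted))


# Made-up two-atom poses: the predicted pose is shifted by 1 unit along z.
predicted_pose = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
crystal_pose = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
print(rmsd(predicted_pose, crystal_pose))
```

- A smaller RMSD indicates that the predicted alignment lies closer to the crystallographic one.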
- Figs. 11A-11E show the three-dimensional orientation of the fibrinogen query molecule as aligned in the thrombin receptor, and the NAPAP and PPACK target molecules, respectively, as obtained from this experiment.
- Another possible application for methods of subshape matching in accordance with embodiments of the present invention is searching of a database of three-dimensional molecule shapes to identify molecules possessing subshapes similar to one or more generic molecules active against a particular receptor or enzyme.
- subshape matching against a set of 83,178 small molecules from the MDDR database was performed utilizing the NAPAP ligand (known to bind to thrombin) as the query molecule.
- the number of thrombin-active compounds in the resulting pool of matched target molecules would be expected to be higher than in the original set of 83,000+ molecules.
- the Tanimoto distance overlap approach was employed to evaluate the shape matching step.
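- The commonly used Tanimoto coefficient scores two shapes by the size of their intersection divided by the size of their union. The sketch below applies that general definition to voxel sets; it is an assumption that this matches the overlap measure used in the example, and the name `tanimoto_overlap` and the toy shapes are hypothetical.

```python
from typing import Set, Tuple

Voxel = Tuple[int, int, int]


def tanimoto_overlap(shape_a: Set[Voxel], shape_b: Set[Voxel]) -> float:
    """Shared voxels divided by the voxels covered by either shape (0.0 to 1.0)."""
    union = len(shape_a | shape_b)
    if union == 0:
        return 0.0
    return len(shape_a & shape_b) / union


# Two made-up rod-like shapes, each four voxels long, sharing two voxels.
query_shape = {(x, 0, 0) for x in range(0, 4)}
target_shape = {(x, 0, 0) for x in range(2, 6)}
print(round(tanimoto_overlap(query_shape, target_shape), 3))
```

- A score of 1.0 means the two aligned shapes coincide exactly, and a score of 0.0 means they share no volume.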
- the database search performed in this example enabled determination of the versatility of the code for handling a wide variety of molecules, the approximate time required to perform the subshape matching, and the behavior of different shape comparison measures.
- a conformer-generating program was utilized to generate an average of about 65 low-energy conformers for each of the 83,178 compounds. The total conformer library was then searched for subshape similarity to the NAPAP ligand shape posed as bound to the thrombin receptor.
- TABLE D lists the total number of selected compounds passing all of the subshape matching filters ($N_{aTotal}$), along with the corresponding number of these selected compounds which are annotated as "thrombin inhibitors" in the MDDR ($N_{ap}$).
- The final number presented in TABLE D for each search is the enrichment value ($E_t$) for activity in the compounds selected for each experiment. This enrichment is calculated according to Equation (3) below as the density of thrombin actives in the selected set over the density in the original pool of compounds:
- $$E_t = \frac{N_{ap} / N_{aTotal}}{\text{density of thrombin actives in the original pool}} \qquad (3)$$
- where $N_{ap}$ = number of "thrombin active" target molecules selected, and
- $N_{aTotal}$ = total number of target molecules selected.
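- The enrichment calculation can be worked through in a few lines of Python. In this sketch the argument names for the original pool (`n_active_pool`, `n_total_pool`) are assumptions, since the corresponding symbols are not defined in the surviving text, and the numbers are invented for illustration; they are not the values of TABLE D.

```python
def enrichment(
    n_active_selected: int,
    n_total_selected: int,
    n_active_pool: int,
    n_total_pool: int,
) -> float:
    """Density of actives in the selected set divided by the density in the original pool."""
    if n_total_selected <= 0 or n_total_pool <= 0 or n_active_pool <= 0:
        raise ValueError("totals and the pool's active count must be positive")
    selected_density = n_active_selected / n_total_selected
    pool_density = n_active_pool / n_total_pool
    return selected_density / pool_density


# Invented example: 50 actives among 1,000 selected compounds,
# drawn from a pool of 100,000 compounds containing 500 actives.
example_enrichment = enrichment(50, 1_000, 500, 100_000)
print(f"enrichment is about {example_enrichment:.1f}")
```

- An enrichment above 1 means the selected set is richer in annotated actives than the pool it was drawn from.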
- combinatorial synthesis utilizes a molecular scaffold as a starting point, mixing these scaffolds with molecules that react with the scaffold to form side chains.
- Discussion of the use of scaffolds in drug design is presented by Lee et al., "Scaffold Architecture and Pharmacophoric Properties of Natural Products and Trade Drugs: Application in the Design of Natural Product-Based Combinatorial Libraries", J. Comb. Chem, 3, 284-89 (2001), and by Lewell et al, "RECAP-Retrosynthetic Combinatorial Analysis Procedure: A Powerful New Technique for Identifying Privileged Molecular Fragments with Useful Applications in Combinatorial Chemistry", J. Chem. Inf. Comput.
- an embodiment of shape matching in accordance with the present invention comprises the steps of identifying subshape similarity between a query shape defined by a molecule known to be active toward a particular receptor, and a target shape defined by a template or side chain to the template.
- The query molecule could typically correspond in size to a drug-like molecule having a molecular weight of between about 400 and 700, with the target molecule corresponding to a molecule fragment having a molecular weight of 100 or less.
- A further possible application for subshape matching in accordance with embodiments of the present invention is the superposition or alignment of molecules on the basis of their molecular shape, and sometimes also in combination with other molecular features such as the presence of certain chemical functionalities. Such spatial superpositions may not be immediately apparent to a user, particularly when viewing subshape similarities and taking into account conformational flexibility.
- For example, FIG. 13A shows the 2-D representations of two ligands (ltlp and ppp) which do not apparently share common structural features or shapes.
- Experimental evidence has revealed the ltlp and ppp ligands to be bound to the protein thermolysin in the manner indicated by the superimposed three-dimensional shapes of Fig. 13B.
- While not intuitively obvious, such molecular superpositions may be useful for determining binding modes, or key interactions between ligand and receptor; see Bohm et al., "What Can We Learn from Molecular Recognition in Protein-Ligand Complexes for the Design of New Drugs?", Angew. Chem. Int. Ed. Engl. 35, 2588-2614 (1996), hereby incorporated by reference for all purposes.
- Yet another example of a possible application for subshape similarity matching in accordance with embodiments of the present invention is in performing docking studies. Docking involves the creation of a shape of the space representing the receptor site itself, rather than the shape of a ligand known to bind to that receptor. In performing docking studies, complementarity in shape between the target molecule and the receptor is important because short contacts will typically result in high repulsive energies. Where the three-dimensional structure of the receptor is known, the shape of the active site can be deduced from the receptor's atomic coordinates using techniques and programs known in the art. For example, the use of protein structural information for drug design is often referred to as Structure-Based Drug Design.
- an embodiment of a method of performing docking studies in accordance with the present invention comprises the steps of identifying a target molecule exhibiting subshape similarity with a query shape defined by the binding volume present on a particular receptor structure.
- a binding volume would typically correspond in size to the shape occupied by a larger molecule (i.e. M.W. > 1000).
- Still another example of an application for subshape similarity matching in accordance with embodiments of the present invention is in the field of Quantitative Structure-Activity Relationships (QSAR).
- The QSAR technique is explained in detail by Hoekman et al. in Exploring QSAR, ACS, Washington, D.C. (1995).
- 3D QSAR studies rely on alignment of multiple molecules, with the superimposition often based on molecular shape.
- An example of using subshape technology for this application may involve utilizing a low energy conformation of the largest of the molecules as a query shape. The target shape can then be obtained from the remaining molecules to compare to the query shape. Once a consistent alignment has been obtained for all the molecules, follow-up methods such as Comparative Molecular Field Analysis can be employed.
- a shape catalog is generated from three-dimensional conformations of molecules known to be active. This step may involve comparison of subshapes in accordance with an embodiment of the present invention to pick the most diverse set of shapes for the catalog.
- shapes of conformers known to be active could serve as the query molecule shape and other conformers of known active compounds could serve as the target molecule shape.
- A shape in the catalog may be used as the query shape, with the target shape obtained from molecules in an input data set comprising both active and inactive molecules.
- Locations of the chemical features on the target shape are marked. Based on these chemical feature locations and the subshape matched target molecules, a fingerprint in the form of a bit string can be generated for molecules of the input data set.
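- To make the idea of a bit-string fingerprint concrete, the sketch below assigns one bit to each (catalog shape, feature location) pair and sets the bit when a molecule, matched to that shape, presents a chemical feature at that location. The bit layout, the labels such as `shape_A` and `donor@v1`, and the function name `build_fingerprint` are all hypothetical; the text does not specify how the bits are actually defined.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical bit definition: (catalog shape id, feature location label).
BitKey = Tuple[str, str]


def build_fingerprint(bit_order: List[BitKey], hits: Dict[str, Set[str]]) -> str:
    """Return a string of '0'/'1' characters, one per entry of `bit_order`.

    `hits` maps a catalog shape id to the feature locations observed on a
    molecule that was subshape-matched to that catalog shape.
    """
    return "".join(
        "1" if location in hits.get(shape_id, set()) else "0"
        for shape_id, location in bit_order
    )


# Invented catalog layout and one invented molecule.
bit_order = [
    ("shape_A", "donor@v1"),
    ("shape_A", "acceptor@v2"),
    ("shape_B", "donor@v1"),
    ("shape_B", "hydrophobe@v3"),
]
molecule_hits = {
    "shape_A": {"acceptor@v2"},
    "shape_B": {"donor@v1", "hydrophobe@v3"},
}
fingerprint = build_fingerprint(bit_order, molecule_hits)
print(fingerprint)
```

- Fixed-length bit strings of this kind can be fed to standard machine learning methods, with each bit acting as one input feature.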
- The fingerprints are analyzed using various machine learning techniques to identify a small number of query shapes, and chemical feature locations on them, that are important for activity.
- A query shape identified through machine learning analysis of the fingerprints can then be used as the query molecule shape to search through a database of target molecules to identify compounds of the database suitable for biological assaying.
- the target molecule may be further evaluated as a potentially useful lead candidate in the process of drug discovery.
- FIG. 12 is a flow chart showing the steps of a method 1200 for applying a subshape-matched target molecule in accordance with an embodiment of the present invention to identify possible drug leads.
- a target molecule exhibiting desirable subshape characteristics is identified as described in detail above.
- the first step 1202 of the method 1200 shown in FIG. 12 thus corresponds to step 422 of FIG. 4.
- the target molecule identified by subshape matching in accordance with an embodiment of the present invention is procured.
- the target molecule can be procured in a number of ways.
- One approach is to synthesize the molecule in the library.
- Such synthesis can comprise conventional techniques, or more efficiently can employ combinatorial synthesis strategies wherein large numbers of organic compounds are created in parallel by linking chemical building blocks in all possible combinations.
- Such combinatorial synthesis approaches may involve solid phase synthesis wherein the molecules are anchored to beads, or may involve solution phase synthesis wherein the molecules are present in solution. Either or both solid or solution phase combinatorial synthesis techniques could be utilized to procure a subshape-matched target molecule identified in accordance with embodiments of the present invention.
- Another alternative approach for procuring a target molecule identified by subshape matching in accordance with the present invention is to purchase existing molecules from commercial sources.
- Examples of commercial sources of molecules suitable for procuring members of a gene family screening library created in accordance with an embodiment of the present invention include, but are not limited to, Pharmacopeia Inc. of Princeton, New Jersey, Sigma-Aldrich Corp. of St. Louis, Missouri, Maybridge Plc. of Tintagel, Cornwall, U.K., Chembridge Corp. of San Diego, CA, and Albany Molecular Research, of Albany, NY.
- the procured target molecule identified by subshape matching in accordance with the present invention can be screened for activity.
- Screening of the target molecule can take the form of biological assays conducted outside of living tissue (in vitro).
- Assay formats for measurement of enzyme activity or receptor binding include, but are not limited to, electrophoresis, scintillation proximity, ELISAs, immunoprecipitation, western blotting, and bead-based methods.
- detection techniques for application with biological assays include, but are not limited to, the use of time-resolved fluorescence, resonance energy transfer (FRET), fluorescence polarization, radioisotopic tracers, and chemiluminescent or colorimetric substrates.
- in vitro screening techniques for use in conjunction with gene family screening libraries created in accordance with the present invention include, but are not limited to, binding assays, enzyme activity assays, and cell-based assays such as functional assays and metabolism assays.
- One or more of the screening techniques described above can be performed with different levels of throughput.
- High-throughput screening of compounds is a standard approach in pharmaceutical research to discover new lead compounds for drug design.
- High-throughput screening typically involves the use of ninety-six or a greater number of wells per plate.
- Such high-throughput screening methods have discovered novel molecules, dissimilar to known ligands, that nevertheless bind to the target receptor at micromolar or submicromolar concentrations.
- procured subshape-matched molecules in accordance with the embodiments of the present invention can be subjected to screening in living tissue (in vivo).
- In vivo assays include but are not limited to evaluation of a subshape-matched molecule's activity in rodents, dogs, primates, or any other species. This evaluation may include testing of the molecules in a suitable pharmacological model of a particular disease state, wherein physiological or behavioral changes in an animal are monitored. Such animals may be normal (wild-type) or genetically-modified, or may be subject to a particular experimental protocol.
- Data produced from in vivo assays may include but is not limited to physical examination, histological (organ/tissue) or behavioral observations, post-mortem examinations, and gene-expression analyses from tissue samples of animals exposed to library molecules.
- Subshape-matched molecules may effectively reduce the size, weight and/or adipose tissue density of animals fed a high-fat diet, as a model for human obesity and diabetes, or may produce a response associated with reduced anxiety in a behavioral test, or may alter normal gene-expression in a given tissue as a result of interacting with an appropriate biological target.
- Screening "in silico", within the silicon of the integrated circuits comprising a computer processor or memory, is emerging as an increasingly useful technique.
- In silico screening, also known as virtual screening, relies upon electronic representations of the molecules in two or three dimensions, rather than upon the physical molecules themselves.
- In silico screening may permit a researcher to rapidly compare and evaluate similarity between a subshape-matched target molecule and other structures, such as receptors or other molecules with previously-demonstrated activity against a particular receptor.
- In silico screening examples include the use of structurally-definite molecular substructures (e.g., privileged [sub]structures), structurally-definite molecular fragments, structurally-definite chemical scaffolds, or structurally-definite sidechains.
- In silico screening can comprise searching the subshape-matched target molecules with at least one class of structurally-abstract molecule descriptors.
- The selection of the class of structurally-abstract molecule descriptors can be based on any suitable structurally-abstract characteristic, feature or property of a molecule. Therefore, structurally-abstract molecule descriptor classes include pharmacophore descriptors, atom path-length descriptors, BCUT descriptors, and other biophysical descriptors (e.g., solubility) known to one skilled in the art.
- A discussion of BCUT descriptors is given by Pearlman et al. in "Metric Validation and the Receptor-Relevant Subspace Concept", J. Chem. Inf. Comput. Sci., 39, 28-35 (1999), hereby incorporated by reference for all purposes.
- Target molecules identified by subshape matching which evidence desirable activity in vitro, in vivo, in silico, or in some combination thereof, against certain desired objectives are designated as 'hits', and may be validated and further optimized to identify leads and ultimately, drug candidates and drugs.
- A typical sequence of screening that makes the most efficient use of resources is initial screening of subshape matched target molecules in silico, followed by in vitro screening of subshape matched target molecules revealed as promising in silico, followed by in vivo screening of subshape matched target molecules revealed as promising in vitro.
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Physics & Mathematics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Crystallography & Structural Chemistry (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Theoretical Computer Science (AREA)
- Biotechnology (AREA)
- Biophysics (AREA)
- Hematology (AREA)
- Evolutionary Biology (AREA)
- Medical Informatics (AREA)
- Urology & Nephrology (AREA)
- Biomedical Technology (AREA)
- Immunology (AREA)
- Computing Systems (AREA)
- Cell Biology (AREA)
- Microbiology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Food Science & Technology (AREA)
- Medicinal Chemistry (AREA)
- Analytical Chemistry (AREA)
- Biochemistry (AREA)
- General Physics & Mathematics (AREA)
- Pathology (AREA)
- Image Generation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2002329816A AU2002329816A1 (en) | 2001-08-23 | 2002-08-22 | Method for molecular subshape similarity matching |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US31438001P | 2001-08-23 | 2001-08-23 | |
US60/314,380 | 2001-08-23 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2003019140A2 true WO2003019140A2 (en) | 2003-03-06 |
WO2003019140A3 WO2003019140A3 (en) | 2004-04-15 |
Family
ID=23219725
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2002/026844 WO2003019140A2 (en) | 2001-08-23 | 2002-08-22 | Method for molecular subshape similarity matching |
Country Status (2)
Country | Link |
---|---|
AU (1) | AU2002329816A1 (en) |
WO (1) | WO2003019140A2 (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102799779A (en) * | 2012-07-16 | 2012-11-28 | 中山大学 | Molecular volume calculating method and shape comparing method of two molecules |
WO2014089359A1 (en) * | 2012-12-05 | 2014-06-12 | Hudson Robotics, Inc. | System for the efficient discovery of new therapeutics drugs |
US10515715B1 (en) | 2019-06-25 | 2019-12-24 | Colgate-Palmolive Company | Systems and methods for evaluating compositions |
CN115116553A (en) * | 2021-03-19 | 2022-09-27 | 合肥本源量子计算科技有限责任公司 | Molecular parameter configuration method, device, medium and electronic device |
WO2023123023A1 (en) * | 2021-12-29 | 2023-07-06 | 深圳晶泰科技有限公司 | Method and device for screening molecules and application thereof |
-
2002
- 2002-08-22 WO PCT/US2002/026844 patent/WO2003019140A2/en not_active Application Discontinuation
- 2002-08-22 AU AU2002329816A patent/AU2002329816A1/en not_active Abandoned
Non-Patent Citations (3)
Title |
---|
BITTAR ET AL.: 'Automatic reconstruction of unstructured 3D data: combining a medial axis and implicit surfaces' COMPUTER GRAPHICS FORUM vol. 14, no. 3, pages 457 - 468, XP002972808 * |
FERMIN ET AL.: 'Planar motion detected by randomized triangle matching' PATTERN RECOGNITION LETTERS vol. 18, no. 8, August 1997, pages 741 - 749, XP004103472 * |
LEMMEN ET AL.: 'FLEXS: a method for fast flexible ligand superposition' J. MEDICINAL CHEMISTRY vol. 41, 1998, pages 4502 - 4520, XP002241912 * |
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102799779A (en) * | 2012-07-16 | 2012-11-28 | 中山大学 | Molecular volume calculating method and shape comparing method of two molecules |
WO2014012309A1 (en) * | 2012-07-16 | 2014-01-23 | 中山大学 | Calculation method for molecular volume and comparison method for shapes of two molecules |
US9811642B2 (en) | 2012-07-16 | 2017-11-07 | Iprecision Medicine Technology, Inc. | Methods for shape comparison between drug molecules |
WO2014089359A1 (en) * | 2012-12-05 | 2014-06-12 | Hudson Robotics, Inc. | System for the efficient discovery of new therapeutics drugs |
US10839942B1 (en) | 2019-06-25 | 2020-11-17 | Colgate-Palmolive Company | Systems and methods for preparing a product |
US10839941B1 (en) | 2019-06-25 | 2020-11-17 | Colgate-Palmolive Company | Systems and methods for evaluating compositions |
US10515715B1 (en) | 2019-06-25 | 2019-12-24 | Colgate-Palmolive Company | Systems and methods for evaluating compositions |
US10861588B1 (en) | 2019-06-25 | 2020-12-08 | Colgate-Palmolive Company | Systems and methods for preparing compositions |
US11315663B2 (en) | 2019-06-25 | 2022-04-26 | Colgate-Palmolive Company | Systems and methods for producing personal care products |
US11342049B2 (en) | 2019-06-25 | 2022-05-24 | Colgate-Palmolive Company | Systems and methods for preparing a product |
US11728012B2 (en) | 2019-06-25 | 2023-08-15 | Colgate-Palmolive Company | Systems and methods for preparing a product |
US12165749B2 (en) | 2019-06-25 | 2024-12-10 | Colgate-Palmolive Company | Systems and methods for preparing compositions |
CN115116553A (en) * | 2021-03-19 | 2022-09-27 | 合肥本源量子计算科技有限责任公司 | Molecular parameter configuration method, device, medium and electronic device |
WO2023123023A1 (en) * | 2021-12-29 | 2023-07-06 | 深圳晶泰科技有限公司 | Method and device for screening molecules and application thereof |
Also Published As
Publication number | Publication date |
---|---|
WO2003019140A3 (en) | 2004-04-15 |
AU2002329816A1 (en) | 2003-03-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Krovat et al. | Recent advances in docking and scoring | |
Jónsdóttir et al. | Prediction methods and databases within chemoinformatics: emphasis on drugs and drug candidates | |
Desaphy et al. | Comparison and druggability prediction of protein–ligand binding sites from pharmacophore-annotated cavity shapes | |
Smith et al. | Prediction of protein–protein interactions by docking methods | |
Villoutreix et al. | Free resources to assist structure-based virtual ligand screening experiments | |
Chang et al. | Pharmacophore-based discovery of ligands for drug transporters | |
Biesiada et al. | Survey of public domain software for docking simulations and virtual screening | |
JP5032120B2 (en) | Method and apparatus for classifying molecules | |
T Garcia-Sosa et al. | Molecular property filters describing pharmacokinetics and drug binding | |
Patel et al. | A review on computational software tools for drug design and discovery | |
Putta et al. | A novel subshape molecular descriptor | |
Khan et al. | Modern methods & web resources in drug design & discovery | |
Langer et al. | Virtual combinatorial chemistry and in silico screening: Efficient tools for lead structure discovery? | |
Guterres et al. | CHARMM-GUI LBS finder & refiner for ligand binding site prediction and refinement | |
Brooijmans | Docking methods, ligand design, and validating data sets in the structural genomic era | |
Lauria et al. | Drugs polypharmacology by in silico methods: new opportunities in drug discovery | |
Kasahara et al. | Comprehensive classification and diversity assessment of atomic contacts in protein–small ligand interactions | |
Bisht et al. | Emerging need of today: significant utilization of various databases and softwares in drug design and development | |
WO2003019140A2 (en) | Method for molecular subshape similarity matching | |
Bologa et al. | How to prepare a compound collection prior to virtual screening | |
JP5093110B2 (en) | Ligand search method | |
Andreev et al. | Colabind: A Cloud-Based Approach for Prediction of Binding Sites Using Coarse-Grained Simulations with Molecular Probes | |
Balakin et al. | Rational design approaches to chemical libraries for hit identification | |
JP2005018447A (en) | Method for searching receptor-ligand stable complex structure | |
Preto et al. | Molecular dynamics and related computational methods with applications to drug discovery | |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states | Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BY BZ CA CH CN CO CR CU CZ DE DM DZ EC EE ES FI GB GD GE GH HR HU ID IL IN IS JP KE KG KP KR LC LK LR LS LT LU LV MA MD MG MN MW MX MZ NO NZ OM PH PL PT RU SD SE SG SI SK SL TJ TM TN TR TZ UA UG UZ VC VN YU ZA ZM; Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG UZ VC VN YU ZA ZM ZW |
AL | Designated countries for regional patents | Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ UG ZM ZW AM AZ BY KG KZ RU TJ TM AT BE BG CH CY CZ DK EE ES FI FR GB GR IE IT LU MC PT SE SK TR BF BJ CF CG CI GA GN GQ GW ML MR NE SN TD TG; Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LU MC NL PT SE SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
32PN | Ep: public notification in the ep bulletin as address of the addressee cannot be established | Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 69(1) EPC (EPO FORM 1205A OF 22.06.2004) |
122 | Ep: pct application non-entry in european phase | ||
NENP | Non-entry into the national phase | Ref country code: JP |
WWW | Wipo information: withdrawn in national office | Country of ref document: JP |