|
May 2002, Volume 1, Issue 3Development of Quantitative Structure-Activity Relationships (QSARs) and Their Use for Priority Setting in the Testing Strategy of Endocrine DisruptorsWeida Tong1, Roger Perkins1, Hong Fang1, Huixiao Hong1, Qian Xie1, William Branham2, Daniel M. Sheehan2*, Jeanne Farr Anson2 1 Logicon ROW Sciences, Jefferson, AR 720792FDA's National Center for Toxicological Research (NCTR), Jefferson, AR 72079 *Current address: Daniel M. Sheehan and Associates, Little Rock, AR 72202 AbstractConsiderable scientific, regulatory and popular press attention has been devoted to the Endocrine Disrupting Chemicals (EDCs), of which estrogenic chemicals figure prominently. A large number of potential estrogenic EDCs are associated with products regulated by the Food and Drug Administration (FDA), including plastics used in food packaging, phytoestrogens, food additives, pharmaceuticals, cosmetics, etc. Recent legislation mandates the U.S. Environmental Protection Agency (EPA), a sister regulatory agency, to develop a screening and testing program for potential EDCs in drinking water and food additives. Under the legislation, a large number of chemicals will undergo various in vitro and in vivo assays for their potential estrogenicity, as well as other hormonal activities. There is a crucial need to set priority for these chemicals to reduce the cost and speed the screening and testing process. At the FDA National Center for Toxicological Research (NCTR), quantitative structure-activity relationships (QSARs) is a major component of the Endocrine Disruptor Knowledge Base (EDKB) project - a prototype Toxicological Knowledge Base. By integrating experimentation and modeling, a series of QSAR models have been developed and validated in the project. These models are integrated into a "Four-Phase" scheme, with each successive phase eliminating unlikely estrogen receptor (ER) binders, resulting in a priority listing of chemicals for regulatory application. The system performance has been validated using several data sets with known estrogenic activity and, subsequently, applied to three environmental data sets identified by the EPA. It has also been used to assess estrogenic activity of chemicals of concern at other Centers within the FDA, namely the Center for Food Safety and Applied Nutrition (CFSAN) and the Center for Drug Evaluation and Research (CDER). The rigorous validation of the integrated system is ongoing via the interagency agreement (IAG) between EPA and NCTR. The approach presented here for estrogen is anticipated to be equally applicable for other receptor-mediated, endocrine disrupting mechanisms, e.g., androgen receptor binding, and other toxicity endpoints. IntroductionWhile the toxicological sciences have advanced steadily over past decades owing to constant improvement in experimental techniques and instrumentation, and in the general understanding of biological mechanisms, the overall paradigm is largely unchanged. Specifically, models for risk assessment are predominantly dependent on slow and expensive testing in laboratory animals, since epidemiological data are insufficient and testing in humans is not possible (except for drugs during the advanced stages of development). A fundamental tenet of this paradigm is that effects in laboratory animals can be extrapolated to humans (or other animals). Intrinsic variability in the response of the test species, and the number of animals tested, limits the resolving power of the affordable protocols to determine a No Observed Adverse Effect Level (NOAEL), the dose which provides no statistically significant increase in adverse effects above the control value. This, together with the need to extrapolate to humans, results in the use of safety factors, generally in the range of two or more orders of magnitude to determine an allowable dose for non-genotoxic chemicals. In 1995, the National Center for Toxicological Research (NCTR) reoriented its strategic goals to begin to alter the paradigm of toxicological research, taking direct aim at increasing regulatory efficiency by reducing the time, expense and animal use in the regulation process. One primary strategic goal was the development of knowledge bases, that were defined to be computer-based systems that unify applicable literature and data and provide computational models to predict a chemicals toxic potential, predict experimental needs and improve regulatory risk assessment capability. The inspiration for the Estrogen Knowledge Base (EKB) program came from a center wide challenge issued by then NCTR Director, Dr. Bernard Schwetz, to develop a knowledge base with the capability to extend the predictability of existing data. Scientists within the Division of Reproductive and Developmental Toxicology suggested that estrogens might provide an appropriate area to develop a prototype. Knowing that NCTR scientists had been engaged in estrogen related research for more than two decades, Dr. Schwetz posed the questions whether they could recognize an estrogen receptor ligand, solely based on chemical structure, and whether models based on existing data could be used to develop models to make such a prediction. The answers were "no" and "lets try," respectively. The "let us try" subsequently developed into a concept to develop a prototype knowledge base to predict estrogenicity, and a grant from the FDAs Office of Womens Health enabled the acquisition of the required computer hardware and software. The earliest models, based on 13 chemicals tested at NCTR, proved very inadequate. Despite the clear statement in endocrinology textbooks that a single nuclear receptor protein that exhibited high ligand specificity mediated most estrogenic effects, it became quickly apparent that the estrogen receptor binds chemical structures of surprisingly broad diversity. Next, we used the extensive data published in the literature to build models based on 50-chemical training sets. While these models were much improved, we learned that better models were possible, given a proper database. During the course of our work on the early models, a major scientific and regulatory issue developed surrounding environmental chemicals, termed endocrine disrupting chemicals (EDCs), suspected of disrupting endocrine function by mimicking natural hormones in experimental animals, wildlife and humans. There was a constantly growing concern among the scientific community, government regulators and the public that EDCs in the environment were adversely affecting human and wildlife health (1, 2). Adverse outcomes had been observed in experimental animals and wildlife; potential effects in humans included reproductive and developmental toxicity, carcinogenesis, immunotoxicity, and neurotoxicity, among others (3). The scientific debate surrounding EDCs grew contentious, in part owing to the fact that some suspected EDCs are high production volume, economically important chemicals. The public and regulatory concerns led to government regulatory actions and expanded research across Europe, Japan and North America (4, 5). In 1996, the U.S. Congress mandated that the Environmental Protection Agency (EPA) develop a strategy for screening and testing a large number of chemicals found in drinking water, food additives and other sources for their endocrine disruption potential (4). In response to Congressional action, the EPA established the Endocrine Disruptor Screening and Testing Advisory Committee (EDSTAC), which includes scientific expertise from government, academia and industry. EDSTAC recommended a two-tier (Tier 1 screening and Tier 2 testing) strategy to screen and test for estrogenic, androgenic and thyroid endpoints for a large number of chemicals. To accomplish this, chemicals will be screened (Tier 1) using a multiple-endpoint strategy that includes more than 20 different in vitro and in vivo assays recommended by EDSTAC (6). Although more than ~87,000 chemicals were initially selected for evaluation, many were polymers or otherwise unlikely to bind to steroid receptors, leaving about ~58,000 chemicals for evaluation in Tier 1. The number that will progress to the testing step (Tier 2) (7) is not known. Processing all chemicals through both tiers, if required, will require many years and extensive resources. Hence, the EPA has adopted an approach requiring priority setting before Tier 1 (www.epa.gov/scipoly/oscpendo/), and where QSAR predictions are likely to prove of particular utility. The EKB multidisciplinary team of researchers developed a plan to meet the newly important challenge for precise and validated predictive models that was reviewed and endorsed by the NCTR Scientific Advisory Board (SAB) in 1997. The most important finding of the early model building (8-12) effort was that, despite decades of testing for estrogenicity, the existing data were inadequate to construct robust QSAR models (SETAC chapter) (12). Accordingly, the EKB plan called for developing a model training data set by conducting ER binding measurements in a validated assay. The chemicals to be tested would be chosen by the computational chemists to obtain a training set spanning the broad range of chemical structures of ER ligands, both agonists and antagonists. Furthermore, there needed to be several types of models ranging from fast and easy classification methods to rule out inactive chemicals, to highly accurate but time-intensive, three-dimensional QSAR models to quantify binding affinity for active chemicals. As the scope of the EKB project expanded in the late 1990's, to include all Endocrine Disrupting Compounds, we renamed the prototype knowledge base the Endocrine Disrupter Knowledge Base (EDKB). A follow up second SAB review occurred in 2000, which supported placing the EDKB database on the WEB (http://edkb.fda.gov). As the program developed, additional funding was provided by the OWH, in 1998 and 2000, by a Cooperative Research And Development Agreement (CRADA) with the American Chemistry Council (ACC), formerly the Chemical Manufacturers Association (CMA), and again in 2001, with continued support to 2005, by EPA. The objective of the EDC priority setting is to rank order a large number of chemicals for more resource-intensive and costly Tier 1 evaluations from most important to least important. There are a number of criteria that can be used for this purpose, such as production volume, persistence and fate in the environment, human exposure levels, etc. Most of the 58,000 chemicals required for assay have no biological data. Both QSARs and transcription-based high throughput pre-screening (HTPS) were recommended by the EDSTAC as the primary source of biological effect information for priority setting. However, in a pilot study undertaken by EPA, the HTPS system did not perform well (13), such that the EDKB models could prove critical in EDC screening and testing program. The compounds assayed at NCTR were selected based on providing uniform coverage of the diverse chemical structure space of chemicals that bind the receptors, as well as coverage of an activity range extending down to a million fold below that of the endogenous hormones.The model training set designed for chemical structure diversity comprises 130 ER binders and 100 non-binders (14, 15). The large number of inactive chemicals, included in the training sets, enables models to be trained to distinguish active from inactive compounds. Many SAR, QSAR and chemometric predictive models were developed using the many powerful commercial software packages that are now routinely applied in drug discovery and development. In the end, the rigorous, three-dimensional, QSAR method of Comparative Molecular Field Analysis (CoMFA) was selected for quantitative prediction of receptor binding affinity (16). Three different types of models (structural alerts, pharmacophores and classification methods) were combined (17) to make qualitative prediction (i.e., active or inactive) of ER binding activity. In the sections that follow, we present results of integrating the qualitative and quantitative predictive models into a sequential "Four-Phase" scheme (18, 19) according to the strength of each type of model. Hierarchical sequencing of the models allows faster models to be used to eliminate the majority of inactive chemicals with an extremely low rate of false negatives. The more time-consuming but more precise models can be used to refine predictions for an increasingly smaller number of remaining chemicals. The application of the more refined models further eliminates true negatives, as well as false positives, from earlier models. Results are presented suggesting that the use of this scheme could eliminate from testing about 90% of the chemicals of potential concern in the national screening and testing program. Should the ER-binding models be used for priority setting, the program begun as a prototype effort will have matured to one of the most significant uses of QSAR in the regulatory application. Currently, the EDKB team is completing models for prediction of binding to the androgen receptor (AR). A validation program is also now underway via an Interagency Agreement between FDA/NCTR and EPA. A large number of chemicals will be tested blind by the models, for both ER and AR binding, and the predictions then compared to assay results from an outside contract laboratory. The validation results will define whether and how model predictions are used in priority setting in the EPA's Endocrine Disruptor Screening and Testing Program. Quantitative Structure-Activity Relationships (QSARs)QSAR modeling employs statistical approaches to correlate and rationalize variations in the biological activity of a series of chemicals with variations in their molecular structures. The molecular structure is often represented by a set of quantities commonly known as molecular descriptors. QSARs have been applied extensively in a wide range of scientific disciplines including chemistry, biology and toxicology (20, 21). In both drug discovery and environmental toxicology (22), QSAR models are now regarded as a scientifically credible tool for predicting and classifying the biological activities of untested chemicals. As we enter the informatics era, QSAR has become essential in the drug discovery process as a screening and enrichment tool to eliminate, from further development, those chemicals lacking drug-like properties (23-25) or those chemicals predicted to elicit a toxic response. This developing scenario portends the spread of QSAR, beyond the pharmaceutical industry, to human and environmental regulatory authorities for use in toxicology (16, 19, 26-30). The EDSTAC considers QSAR as an important part of its priority setting process (31). QSARs can be used to evaluate untested chemicals to provide biological data for use in priority setting (32-36). The basic assumption in every QSAR model is that a chemicals physical and chemical properties and its biological activities are predicated by its structure (37). Since structural descriptors of a chemical can be determined by computational means more efficiently than its biological activity using in vitro or in vivo approaches, a statistically validated QSAR model is capable of predicting the biological activity of a new chemical in lieu of the time-consuming and labor-intensive processes of chemical synthesis and biological evaluation. Applied judiciously, QSAR can save substantial amounts of time, money, and human resources. A major advantage of QSARs, to priority setting, is the efficiency of scale when applied to a large number of chemicals. When several endpoints are analyzed simultaneously, the efficiency of scale of computation is even more pronounced. The first step in developing a QSAR model is acquisition of a training set of chemicals that have known activities. Second, descriptors representing molecular structure of individual chemicals (i.e., hydrophobicity, structural fragments, charged surface area, the number of hydrogen bonds, solubility, and etc.) are calculated. Then, a correlation between descriptors and activity for the training set is evaluated by employing various statistical approaches to determine the most statistically significant relationship (the QSAR model). A proper validation is required to ensure the models predictive value for the chemicals not used in the training set. With adequately validated performance, such models can be used to predict activities of untested potential EDCs. Obtaining a good quality QSAR model heavily depends on many factors, in the approach, particularly on quality of biological data, descriptor selection and statistical methods:
QSAR models are useful in research for purposes beyond prediction (39). Additional benefits that may accrue include: (1) leveraging existing structure-activity data; (2) providing insights into mechanism or identifying an alternative mechanism (e.g., metabolism) of action; (3) identifying important chemical structure characteristics; (4) suggesting new design strategies and synthetic targets; (5) narrowing the dose range for a planned assay; (6) assisting in generation of new hypotheses to guide further research; and (7) revealing chemicals that deviate from the QSAR model. The NCTR Model Development ProcessIn the past few years, a number of QSAR models have been developed for ligand binding to the ER (8-11, 40-44). Most of these QSAR models were constructed using the Comparative Molecular Field Analysis (CoMFA). Although a predictive QSAR model is dependent on a number of factors, a training set with a broad representation over the chemistry space is critical to ensure its predictive capability for a large number of diverse chemicals. Unfortunately, most QSAR models for ER binding, developed previously, were based on data sets available in the literature, which to date had been small data sets with limited structural diversity (8, 42, 43). Although these models yield good statistical results and explain some structural determinants for ER binding, they have limited applicability in predicting the ER-ligand binding affinity of chemicals that, in fact, cover a wide range of structural diversity. In order to obtain an adequate training set to develop a more robust QSAR model for regulatory purpose, a recursive process (see Figure 1, following this paragraph) has been adopted at NCTR by integrating assay and QSAR modeling to determine chemicals for the training set (12, 45) along with the model construction. The process is highly interdisciplinary, involving computational chemists, biologists and experimental toxicologists. As depicted in Figure 1 (following this paragraph), the process starts with an initial set of chemicals from literatures for QSAR modeling. Next, the preliminary QSAR models are used prospectively to define a set of chemicals that will further improve models robustness and predictive capability. The new chemicals are assayed, and these data are then used to challenge and refine the QSAR models. Through this process, we identified and assayed ~230 chemicals for final model construction. This NCTR data set contains chemicals that were selected to cover the structural diversity of chemicals (see Figure 2a, following this paragraph) that bind to ER with an activity distribution ranging over six orders of magnitude (see Figure 2b, following this paragraph), which is an essential requirement for a robust predictive model for structurally diverse estrogens. The NCTR data set is a highly consistent data set for use in developing models for estrogens.
Figure 1. Depiction of the recursive process used by NCTR to develop QSAR models for predicting estrogen receptor binding. The process starts with an initial set of chemicals from literatures for QSAR modeling. Next, the preliminary QSAR models are used prospectively to define a set of chemicals that will further improve models robustness and predictive capability. The new chemicals are assayed, and these data are then used to challenge and refine the QSAR models. The process output is the models for use in toxicological regulation. Validation of the model is critical, particularly with respect to confirming minimal false negative predictions. Figure 2A. Figure 2B.
Figures 2(A and B). The distribution of (A) chemical classes and (B) binding activity for the NCTR data set. The estrogen receptor binding activity is represented as RBA (Relative Binding Affinity). The RBA for the endogenous ligand, 17ß-estradiol, was set to 100. The assay used in the process to provide ER binding data is a rat uterine cytosol ER competitive binding assay that is considered the gold standard for in vitro ER assays (14, 15). We found a strong linear correlation for ER binding affinities among a diverse group of chemicals assayed with ER derived from our rat uterine cytosol and human ERalpha (46). Furthermore, the rat ER binding data also correlated strongly with the results from assays measuring estrogenicity using a downstream event, i.e., a yeast-based reporter gene assay and MCF-7 cell proliferation assay. Importantly, chemicals positive in uterotrophic responses (in vivo estrogenic activity) are also positive in the ER binding assay, indicating that binding affinity is a good predictor of in vivo activity with few false negatives observed (47). These findings demonstrate that ER binding is the major determinant for estrogenic EDCs, and the prediction of the rat ER binding affinity provides an important piece of information for priority setting. In this process, a model validation step is specifically emphasized to ensure the models predictive value for priority setting purposes (12). Internal and external validation were included in the process; each provides a different level of confidence for the models predictivity. Generally, the model is first validated using leave-one-out cross validation. In this method, each chemical in the training set is systematically excluded once from the data set, after which its activity is predicted by a model derived from the remaining chemicals. This internal validation assesses the models extrapolation within the training set. Sometimes, we employ leave-N-out cross-validation to achieve more robust internal validation; a procedure similar to leave-one-out, but in this one, we systematically exclude one group of chemicals after randomly dividing the training set into N groups. When additional data are available, the model is used to predict other chemicals not used in the training set, but which have known activities (the testing set). This external validation assesses the models predictive capability for untested chemicals. Several benefits accrue from the integration of the experimental and modeling efforts. Immediate feedback can be given to the experimentalists so that suspected problems can be rapidly investigated. Also, as the predictive and diversity models evolve, the modelers can select the chemicals for subsequent testing, based on considerations of structural diversity and the activity range. While cross validation using the training set remains an important part of the model validation, each new data point directly from the lab become a challenge to the evolving model, the results of which can then be assessed by the joint team of the modeler and experimentalist. Each time the model is challenged, the result is either further confirmation of its validity, identification of a limitation or an outlier prediction. Failure of the model will, in turn, provide important information. This may include identification of the need for new data based on a rational understanding of the dependence of activity on structure. Alternatively, it may help elucidate which of many mechanisms may play an important role in a specific chemicals response; for example, delineating agonists from antagonists, or defining where metabolism may be important. Regardless of the cause of model failure, in essence, a research hypothesis is spawned that should lead to new data and/or an improved training set, and an improvement to what is a living model. The NCTR "Four-Phase" SystemPriority setting using QSAR is widely applied in the process of drug discovery. The objective of priority setting in pharmaceutical industry is to increase the chance of finding active chemicals or "hits" that are more likely to be developed into "leads". Hence, false positives are of great concern. In contrast, a good priority setting method for regulatory application should generate a small fraction of false negatives (chemicals predicted to fail to bind to their receptor, but which actually bind). False negatives constitute a crucial error, because they will receive a relatively lower priority for evaluation, in Tier 1, and may remain in use for many years. Furthermore, the methods should provide reasonable quantitative accuracy for true positives, as those with higher affinities will generally be of higher priority. Based on these considerations, we have developed an integrated computational system that rationally combines different QSAR models into a sequential "Four-Phase" scheme according to the strength of each type of model (see Figure 3, following this paragraph). A progressive Phase paradigm is used as a screen to reduce the number of chemicals to be considered in each subsequent Phase. Therefore, these four phases work in a hierarchical way to incrementally reduce the size of a data set with increasing precision of prediction. Within each phase, different models have been selected to work complementarily in representing key activity-determining structure features to minimize the rate of false negatives. For predicting ER binding affinity, the models comprised of these four phases were:
Figure 3. Overview diagram of the NCTR "Four-Phase" approach for priority setting. In Phase I, chemicals with molecular weight <94 or >1000 or containing no ring structure will be rejected. In Phase II, three types of approach (structural alerts, pharmacophores, and classification methods) with total 11 models are used to make a qualitative activity prediction. In Phase III, a 3D QSAR/CoMFA model is used to make a more accurate quantitative activity prediction. Phase IV, an expert system is expected to combine information gained from Phase II and Phase III and other sources to make a decision on priority setting. Different phases are hierarchical; different methods within each phase are complementary.
Figure 4. Plot of CoMFA-calculated log relative binding affinity (logRBA) versus observed logRBA for the NCTR data set. r2= 0.91; q2=0.76.
Figure 5. Fold differences for experimental measurements and CoMFA predicted results.
The NCTR "Four-Phase" system has been validated by a number of existing data sets, including the NCTR ER binding dataset (14) , the E-SCREEN assay data (48), the yeast two-hybrid reporter gene assay data (49), and other data sets (41, 50-54). To date, the system has produced no false negatives, which is critical in priority setting for regulatory purposes. The same integrated scheme is being extended to include endpoints of other endocrine disrupting mechanisms (e.g., AR binding) at NCTR. Regulatory ApplicationThe NCTR "Four-Phase" system is a primary candidate of priority setting tool for chemical entry to Tier 1 screening. The system was recently applied to three environmental data sets, recognized by EPA, as representative subsets of potential EDCs:
Table 1 (which follows this paragraph) summarizes the priority setting results for these data set using the NCTR "Four-Phase" system. When only the Phase I and II protocols are used, the system dramatically reduced the number of potential estrogens by some 80-85%, demonstrating effectiveness in eliminating these most unlikely ER binders from further expensive experimentation. The Phase III CoMFA model further reduces the data size by about 5-10%. Importantly, the quantitative binding affinity prediction from Phase III provides an important rank-order value for priority setting. Table 1. Size reduction of three environmental data sets processed by the NCTR "Four-Phase" system
Concluding RemarksEDCs have potential adverse effect on human beings and wildlife. The potential of chemicals to interfere with estrogen functions may be related to their ability to mimic estrogen and bind to ER. The potential to mimic estrogen and bind to ER can be quantitatively predicted using computational methods such as QSARs. QSARs are an important part of priority setting to determine which chemicals should be experimentally evaluated first in Tier 1 screening recommended by EDSTAC. In conjunction with EPA, the EDKB project team at the FDA/NCTR has developed a number of QSAR models for prediction of chemical binding to the ER. These models are integrated into a "Four-Phase" scheme, which we have shown to demonstrate efficiency and accuracy for priority setting of potential estrogenic EDCs for use, by regulators, at EPA. We anticipate that the same scheme will be equivalently applicable to other mechanisms (e.g., androgen receptor binding) involved in endocrine disruption and other toxicity endpoints. The stringent requirement for developing models for toxicity mechanisms is appropriately designed training data set similar to the one employed here for the ER-binding models. Properly validated data allow the structural rules that govern activity to be determined and used to develop robust predictive models. While the results presented here clearly show both the feasibility and utility of using QSARs for priority setting, it is important to note that predictions from any model are intrinsically no better than the experimental data employed for modeling. Any limitations of the assay used to generate the training data apply equally to the models predictions. Moreover, false negatives and false positives depend on the defined cut-off value to distinguish active from inactive for the models only providing YES/NO prediction. As the cut off value is lowered, it is likely that error will increase even for a well designed and executed assay, and false positives and false negatives will both increase. Similarly, more false prediction might be introduced for chemicals with activity close to the cut-off value. The issue for a large number of chemicals of the rate of false positives and false negatives, in predicted values, must be dealt with experimentally by running assays on a sufficiently large number of chemicals to characterize the rates. Therefore, our current model validation process with EPA is an important step to ensure the models quality for regulatory application. AcknowledgementThe EDKB team gratefully acknowledges funding from the FDAs Office of Womens Health that enabled the EDKB Program to begin and make early rapid progress. The American Chemical Council, through a Cooperative Research and Development Agreement, supported the estrogen and androgen receptor competitive binding assays. Finally, through an Interagency Agreement, the Environmental Protection Agency provides support for predictive model development and validation. The EDKB team acknowledges the continued support of Dr. Bern Schwetz who initiated the concept of knowledge bases at the NCTR. References1. Hileman, B. 1994. Environmental estrogens linked to reproductive abnormalities, cancer. Chem Eng News 72:19-23. 2. Hileman, B. 1997. Hormone Disrupter Research Expands. C & EN:24-25. 3. Kavlock, R. J., Daston, G. P., DeRosa, C., Fenner-Crisp, P., Gray, L.E., Kaattari, S., Lucier, G., Luster, M., Mac, M. J., Maczka, C., Miller, R., Moore, J., Rolland, R., Scott, G., Sheehan, D. M., Sinks, T., Tilson, H. A.1996. Research needs for the risk assessment of health and environmental effects of endocrine disruptors: a report of the U.S. EPA-sponsored workshop. Environmental Health Perspectives 104 Suppl 4:715-40. 4. U.S. Congress. 1996. The Food Quality Protection Act (FQPA) and the Safe Drinking Water Act (SDWA). 5. 104th U.S. Congress. 1996. Food Quality Protection Act 21 U.S.C. 346a(p). The Safe Drinking Water Act (42 U.S.C. (300j-17). Public Law 104-182. 6. Gray, L. E., Jr. 1998. Tiered screening and testing strategy for xenoestrogens and antiandrogens. Toxicol Lett 102-103:677-80. 7. Patlak, M. 1996. A testing deadline for endocrine disrupters. Environmental Science & Technology 30:540A-544A. 8. Tong, W., Perkins, R., Strelitz, R., Collantes, E. R., Keenan, S., Welsh, W.J., Branham, W.S., Sheehan, D. M. 1997. Quantitative structure-activity relationships (QSARs) for estrogen binding to the estrogen receptor: predictions across species. Environmental Health Perspectives 105:1116-24. 9. Tong, W., Perkins, R., Xing, L., Welsh, W. J., Sheehan, D. M. 1997. QSAR models for binding of estrogenic compounds to estrogen receptor alpha and beta subtypes. Endocrinology 138:4022-5. 10. Tong, W., Lowis, D. R., Perkins, R., Chen, Y., Welsh, W. J., Goddette, D. W., Heritage, T. W., Sheehan, D. M. 1998. Evaluation of quantitative structure-activity relationship methods for large-scale prediction of chemicals binding to the estrogen receptor. Journal of Chemical Information and Computer Sciences 38:669-77. 11. Xing, L., Welsh, W.J., Tong, W., Perkins, R., Sheehan, D. M. 1999. Comparison of estrogen receptor alpha and beta subtypes based on comparative molecular field analysis (CoMFA). SAR/QSAR Environmental Research 10:215-237. 12. Perkins, R., Anson, J., Blair, R., Branham, W. S., Dial, S., Fang, H., Hass, B. S., Moland, C., Shi, L., Tong, W., Welsh, W., Walker, J. D., Sheehan, D. 2001. The Endocrine Disruptor Knowledge Base (EDKB), A Prototype Toxicological Knowledge Base for Endocrine Disrupting Chemicals. In: Handbook on Quantitative Structure Activity Relationships (QSARs) for Predicting Chemical Endocrine Disruption Potentials (Walker JD, ed). Pensacola, FL:SETAC press. 13. E/ELetter. 1999. EPA Scientific Advisors: HTPS Needs More Work; Mixtures Low Priority. In: Endocrine/Estrogen Letter. 14. Blair, R., Fang, H., Branham, W. S., Hass, B., Dial, S. L., Moland, C. L., Tong, W., Shi, L., Perkins, R., Sheehan, D. M. 2000. Estrogen Receptor Relative Binding Affinities of 188 Natural and Xenochemicals: Structural Diversity of Ligands. Toxicological Science 54:138-153. 15. Branham, W. S., Dial, S. L., Moland, C. L., Hass, B., Blair, R., Fang, H., Shi, L., Tong, W., Perkins, R., Sheehan, D. M. 2002. Binding of Phytoestrogens and Mycoestrogens to the Rat Uterine Estrogen Receptor. Journal of Nutrition [in press]. 16. Shi, L. M., Fang, H., Tong, W., Wu, J., Perkins, R., Blair, R., Branham, W., Sheehan, D. 2002. QSAR models using a large diverse set of estrogens. Journal Chemical Information and Computer Sciences 41:186-195. 17. Hong, H., Tong, W., Fang, H., Shi, L. M., Xie, Q., Wu, J., Perkins, R., Walker, J., Branham, W., Sheehan, D. 2002. Prediction of Estrogen Receptor Binding for 58,000 chemicals using an Integrated system of a tree-based model with structural alerts. Environmental Health Perspectives 110:29-36. 18. Shi, L. M., Tong, W, Fang, H., Perkins, R., Wu, J., Tu, M., Blair, R., Branham, W., Walker, J., Waller, C., Sheehan, D. 2002. An integrated "4-Phase" approach for setting endocrine disruption screening priorities - Phase I and II predictions of estrogen receptor binding affinity. SAR and QSAR in Environmental Research (in press). 19. Tong, W., Perkins, R., Wu, J., Shi, L., Tu, M., Fang, H., Blair, R., Branham, W., Sheehan, D. M. 1999. An Intergrated Computational Approach for Prioritizing Potential Estrogens. In: International Symposium on Environmental Endocrine Disruptors, Kobe, Japan, Dec 9-11. 20. Hansch, C., Leo, A.1995. Exploring QSAR - Fundamentals and applications in chemistry and biology. Washington, D.C.:The American Chemical Society. 21. Hansch, C., Telzer, B. R., Zhang, L. 1995. Comparative QSAR in toxicology: examples from teratology and cancer chemotherapy of aniline mustards. Crit Rev Toxicol 25:67-89. 22. Bradbury, S. 1995. Quantitative structure-activity relationship and ecological risk assessment: an overview of predictive aquatic toxicology research. Toxicology Letter 79:229-237. 23. Hopfinger, A. J. 1997. Practical applications of computer-aided drug design. In: Practical applications of computer-aided design (Charifson, P. S., ed). New York:Marcel-Dekker, 105-164. 24. Kubinyi, H., Folkers, G., Martin, Y.C. 1998. 3D QSAR IN DRUG DESIGN - RECENT ADVANCES. Perspectives in Drug Discovery & Design 12:R 5-R 7. 25. Lipinski, C. A., Lombardo, F., Dominy, B. W., Feeney, P. J. 1997. Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Advanced Drug Delivery Reviews 23:3-25. 26. Bradbury, S. P. 1994. Predicting modes of toxic action from chemical structure: an overview. SAR QSAR Environ Res 2:89-104. 27. Russom, C. L., Bradbury, S. P., Carlson, A. R. 1995. Use of knowledge bases and QSARs to estimate the relative ecological risk of agrichemicals: a problem formulation exercise. SAR QSAR Environ Res 4:83-95. 28. Benigni, R., Richard, A. M. 1998. Quantitative structure-based modeling applied to characterization and prediction of chemical toxicity. Methods 14:264-76. 29. Schultz, T. W., Seward, J. R. 2000. Health-effects related structure-toxicity relationships: a paradigm for the first decade of the new millennium. Sci Total Environ 249:73-84. 30. Hansch, C., Hoekman, D., Leo, A., Zhang, L., Li, P. 1995. The expanding role of quantitative structure-activity relationships (QSAR) in toxicology. Toxicol Lett 79:45-53. 31. EDSTAC. http://www.epa.gov/opptintr/opptendo/finalrpt.htm. In: Final Report. 32. Walker, J. D., Brink, R. H. 1989. New cost-effective, computerized approaches to selecting chemicals for priority testing consideration. In: Aquatic Toxicology and Environmental Fate, vol 11 (G.W. Suter, I., Lewis, M. A., eds):Philadelphia, 507-536. 33. Walker, J. D. 1991. Chemical selection by the TSCA Interagency Testing Committee: use of computerized substructure searching to identify chemical groups for health effects, chemical fate and ecological effects testing. Sci Total Environ 109-110:691-700. 34. Walker, J. D. 1995. Estimation methods used by the TSCA Interagency Testing Committee to prioritize chemicals for testing: Exposure and biological effects scoring and structure activity relationships. Toxicology Modeling 1:123-141. 35. Walker, J. D., Gray, D. A., Pepling, M. 1999. Past and Future Strategies for Sorting and Ranking Chemicals: Applications to the 1998 Drinking Water Contaminants List Chemicals. In: Identifying Future Drinking Water Contaminants. Washington, D.C.:National Academy Press; 51-102. 36. Walker, J. D., Gray, D. A. 2001. The Substructure-based Computerized Chemical Selection Expert System (SuCCSES): Providing Chemical Right-to-Know (CRTK) Information on Potential Actions of Structurally-Related Chemical Classes on the Environment and Human Health. In: Handbook on Quantitative Structure Activity Relationships (QSARs) for Predicting Effects of Chemicals on Environmental-Human Health Interactions. (Walker, J. D., ed). Pensacola, FL.:SETAC Press. 37. Johnson, M., Maggiora, G. M. 1990. Concepts and Applications of Molecular Similarity. New York:Wiley. 38. Tetko, I. V., Aksenova, T. I., Volkovich, V. V., Kasheva, T. N., Filipov, D. V., Welsh, W. J., Livingstone, D. J., Villa, A. E. P. 2000. Polynomial neural network for linear and non-linear model selection in quantitative-structure activity relationship studies on the Internet. SAR QSAR Environ Res 11:263-80. 39. McKinney, J. D., Richard, A., Waller, C., Newman, M. C., Gerberick, F. 2000. The practice of structure activity relationships (SAR) in toxicology. Toxicol Sci 56:8-17. 40. Bradbury, S., Mekenyan, O. G., Ankley, G. T. 1996. Quantitative structure-activity relationships for polychlorinated hydroxybiphenyl estrogen receptor binding affinity - An assessment of conformer flexibility. Environmental Toxicology & Chemistry 15:1945-1954. 41. Waller, C. L., Oprea, T. I., Chae, K., Park, H. K., Korach, K. S., Laws, S. C., Wiese, T. E., Kelce, W. R., Gray, L. E., Jr. 1996. Ligand-based identification of environmental estrogens. Chem Res Toxicol 9:1240-8. 42. Wiese, T.E., Polin, L. A., Palomino, E., Brooks, S. C. 1997. Induction of the estrogen specific mitogenic response of MCF-7 cells by selected analogues of estradiol-17 beta: a 3D QSAR study. Journal of Medicinal Chemistry 40:3659-69. 43. Sadler, B. R., Cho, S. J., Ishaq, K. S., Chae, K., Korach, K. S. 1998. Three-dimensional quantitative structure-activity relationship study of nonsteroidal estrogen receptor ligands using the comparative molecular field analysis/cross-validated r2-guided region selection approach. Journal of Medicinal Chemistry 41:2261-7. 44. Zheng, W., Tropsha, A. 2000. A Novel Variable Selection QSAR Approach based on the K-Nearest Neighbor Principle. Journal of Chemical Information and Computer Sciences 40:185-194. 45. Tong, W., Welsh, W. J., Shi, L., Fang, H., Perkins, R. 2001. SAR approaches and applications. Environmental Toxicology and Chemistry (Accepted). 46. Fang, H., Tong, W., Perkins, R., Soto, A., Prechtl, N., Sheehan, D. M. 2000. Quantitative comparison of in vitro assays for estrogenic activity. Environmental Health Perspectives 108:723-729. 47. Zacharewski, T. 1998. Identification and assessment of endocrine disruptors: limitations of in vivo and in vitro assays. Environmental Health Perspectives 106 Suppl 2:577-82. 48. Soto, A. M., Sonnenschein, C., Chung, K. L., Fernandez, M. F., Olea, N., Serrano, F. O. 1995. The E-SCREEN Assay as a Tool to Identify Estrogens: an update on Estrogenic Environmental Pollutants. Environmental Health Perspectives 103 Suppl 7:113-22. 49. Nishirara, T., Nishikawa, J., Kanayama, T., Dakeyama, F., Saito, K., Imagawa, S., Takatori, S., Kitagawa, Y., Hori, S., Utsumi, H. 2000. Estrogenic Activities of 517 Chemicals by Yeast Two-Hybrid Assay. Journal of Health Science 46:282-298. 50. Gaido, K. W., Leonard, L. S., Lovell, S., Gould, J.C., Babai, D., Portier, C. J., McDonnell, D. P. 1997. Evaluation of chemicals with endocrine modulating activity in a yeast-based steroid hormone receptor gene transcription assay. Toxicology & Applied Pharmacology 143:205-12. 51. Coldham, N. G., Dave, M., Sivapathasundaram, S., McDonnell, D. P., Connor, C., Sauer, M. J.1997. Evaluation of a recombinant yeast cell estrogen screening assay. Environmental Health Perspectives 105:734-42. 52. Routledge, E. J., Sumpter, J. P. 1997. Structural features of alkylphenolic chemicals associated with estrogenic activity. Journal of Biological Chemistry 272:3280-8. 53. Routledge, E. J., Parker, J., Odum, J., Ashby, J., Sumpter, J. P. 1998. Some alkyl hydroxy benzoate preservatives (parabens) are estrogenic. Toxicology & Applied Pharmacology 153:12-9. 54. Harris, C. A., Henttu, P., Parker, M. G., Sumpter, J. P. 1997. The estrogenic activity of phthalate esters in vitro. Environmental Health Perspectives 105:802-11. 55. Walker, J. D., Waller, C. W., Kane, S. 2001. The Endocrine Disruption Priority Setting Database (EDPSD): A Tool to Rapidly Sort and Prioritize Chemicals for Endocrine Disruption Screening and Testing. In: Handbook on Quantitative Structure Activity Relationships (QSARs) for Predicting Chemical Endocrine Disruption Potentials (Walker, J. D., ed). Pensacola, FL:SETAC press. Glossary
The AuthorsPhotos/Bill Branham; Anson Photo/V. B. Taylor
Weida Tong is Manager of Bioinformatics/Computational Science Group of Northrop Grumman Health Solutions located at the US FDA's National Center for Toxicological Research (NCTR). He is also an adjunct assistant professor in the Department of Pharmaceutical Sciences at the University of Arkansas for Medical Sciences. Dr. Tong received his Ph.D. in 1990, and was a research associate in computational chemistry for six years at the University of Missouri-St. Louis. In 1996, he joined a multidisciplinary research team at NCTR that has been focusing on development of computational toxicology tools for risk assessment, and bioinformatics approaches for genomics and proteomics. Dr. Tong's primary research interests encompass computational chemistry, bioinformatics, chemoinformatics, and scientific visualization applied to toxicology. His specialty areas included data mining, QSARs, 3D database searching, programming, molecular modeling, and chemometrics. Currently, he manages the bioinformatics/computational science group that supports toxicological and regulatory research at NCTR through the provision of modern computing technologies.
Roger Perkins received his B.S. and M.S. in Nuclear Engineering Sciences at the University of Florida and has since had a diverse career centered in computational science. He currently is Director of the Scientific Computing Division of Northrop Grumman Health Solutions. He also is Program Manager of a diverse information technology contract at the FDA/NCTR located in Jefferson, Arkansas. This contract provides systems, network, software engineers, statistics, and computational science support. Mr. Perkins spent 10 years as a reactor physicist in the U.S. Fast Breeder Reactor Program. He was part of the management team at the inception of the National Science Foundations San Diego Supercomputer Center, in 1985, prior to becoming Director of the Navys Advanced Scientific Computational Center in 1990. He became manager of the information technology contract at NCTR, in 1994, where he has been directly involved in a range of computational science efforts.
Hong Fang received her Ph.D. in Chemistry from the University of Missouri-St. Louis in 1995, and continued her post-doctoral work at the same university. She next worked for two years as a forensic scientist with the St. Louis Police Department. In 1997, she began a two-year post-doctoral assignment at the NCTR, in the Division of Reproductive and Developmental Toxicology working for Dr. Daniel Sheehan, in the Endocrine Disruptor Knowledge Base (EDKB) program. Dr. Fangs EDKB work ranged from design and conduct of assays to data analysis, and then using the experimental data to develop computer-based predictive models. In 1999, she joined the Northrop Grumman Health Solutions contract at NCTR as a Senior Computational Scientist, where she continued EDKB work developing models to predict estrogen and androgen activity. She is now developing models to predict chemically-induced carcinogenicity and has participated in development of the NCTR bioinformatics capability.
Huixiao Hong received his Ph.D. in computational chemistry at Nanjing University, in 1990, in China. From 1990 to 1992, he worked on computer-assisted organic synthesis at the Maxwell Institute for Molecular Sciences at Leeds University in England as a postdoctoral research fellow. Dr. Hong held an Associate Professor position at Nanjing University, from 1992 to 1995, where he investigated computer-assisted molecular design and developed a suite of software for analytical instrumentation simulations. From 1995 to 1998, he worked at the National Cancer Institute (NCI) at National Institutes of Health (NIH) as a Visiting Scientist where he discovered and designed more than 60 potent HIV Integrase inhibitors and Cyclooxygenase (COX) inhibitors through molecular modeling and pharmacophore identification and searching on NCI database. He also identified a Mononucleotide binding site in HIV Integrase, and elucidated the stereoselective binding mode of ibuprofen in Cyclooxygenase-1 (COX1) using docking and molecular dynamics simulation. He held a Research Scientist position at Sumitomo Chemical Co., Ltd. in Japan, from 1998 to 2000, where his research was mining two-dimensional NMR data and developing software for structure elucidation from two-dimensional NMR spectra. Dr. Hong joined the computational science group of Northrop Grumman at NCTR, in 2000, and has been engaged in both analysis and methods development for chemoinformatics and bioinformatics.
Qian Xie received his Ph.D. in Chemistry from Beijing Normal University, Beijing, China in 1991. He worked as an assistant and then associate professor in the laboratory of computer chemistry of the Institute of Chemical Metallurgy, Chinese Academy of Science, Beijing, China, from 1991 to 1995, where his research focused on computer-aided drug design for agricultural chemicals. He was a research associate in the biochemistry department of Hong Kong University of Science and Technology, Hong Kong from 1996 to 1997, where he worked on protein recognition and geometric complements. From 1998 to 1999, he worked as a research associate in the department of biochemistry and biophysics at Washington State University, Pullman, Washington, where his research involved computer simulations of electron transfer proteins. In 2000, he joined the bioinformatics/computational sciences group of Northrop Grumman at NCTR and is currently working on bioinformatics and structure-activity relationship models for estrogens and androgens of environmental chemicals.
William (Bill) Branham received a Master of Science degree in 1972 from West Virginia University. Following a four-year commitment to the Air Force, he came to the NCTR to work with Dr. Daniel Sheehan as part of the Hormone Research Program. During the 25 years of their working relationship, Mr. Branham published a series of papers describing the toxic effects of estrogens and antiestrogens on the development of the reproductive tract in the female rat. This work was, in part, responsible for increased precautionary FDA warnings to deter the prescription of antiestrogenic drugs during pregnancy. Mr. Branham is currently working under the direction of Dr. Jim Fuscoe, in the Center for Functional Genomics, on a project to bring microarray technology to the NCTR.
Daniel Sheehan received his Ph.D. in Biochemistry from the University of Tennessee's Biomedical School located at the Oak Ridge National Laboratories. He did postdoctoral studies at Baylor University, in the laboratory of Bert O'Malley, investigating receptor-mediated regulation of cellular physiology. He joined the NCTR approximately 25 years ago and has spent the majority of that time investigating estrogen agonist and antagonist effects on uterine morpholgy and physiology. Several years ago, he initiated a collaboration to build knowledge bases at the NCTR beginning with the Estrogen Disruptor Knowledge Base. He retired from the FDA in the summer of 2001.
Jeanne Farr Anson is the NCTR Associate Director for the Office of Planning, Finance and Information Technology. In 1974, Ms. Anson began her career in FDA as a microbiologist studying unscheduled DNA synthesis using tissue culture. She went on to study nuclear proteins to identify biomarkers of toxic stress using flow cytometric and autoradiographic techniques. At this juncture, she returned to school at night and obtained her Masters in Health Services Administration. She moved into management at NCTR, in 1990, and is active in promoting the computational biosciences for use in the development of knowledge bases (including development of a data warehouse) and predictive models. She serves as project officer for an IAG with EPA to validate androgen receptor binding as a screen for endocrine disrupting activity. Regulatory Research Perspectives
|