FDA Maximum Recommended Daily Dose (FDAMDD) - Maximum recommended daily dose

Endpoint Definition

Maximum recommended daily dose (or maximum recommended therapeutic dose) values were determined from pharmaceutical clinical trials that employed an oral route of exposure and daily treatments, usually for 3-12 months. Drugs were given as single or divided dose treatment regimens to achieve desired pharmacological effects. Roughly 5% of the pharmaceuticals in the FDAMDD data filewere antineoplastics and anesthetics and were administered intravenously and/or intramuscularly. When separate MRDDs were reported for different routes of exposure, only the oral MRDD was included in the data file and only MRDD values reported for the average adult patient were used. Pharmaceuticals that are administered orally are usually tested over a limited range of doses and have MRDDs reported as mg/day. The mg/day unit was converted to mg/kg-body weight (bw)/day based upon an average adult weighing 60 kg. In contrast, the dose unit for most antineoplastic drug MRDDs is reported as mg/m 2 which was converted to mg/kg-bw/day using the formula mg/kg-bw/day = mg/m 2/37 for an average adult. Additionally, a few drugs had MRDDs reported in parts per million (ppm) which were converted to mg/kg-bw/day on the basis that 1000 ppm equals 25 mg/kg-bw/day for an average 60 kg adult. These MRDD values were the basis of the QSAR analysis in (Matthews et al, 2004).
MRDD values were extracted from Martindale: The Extra Pharmacopoeia (1973, 1983, and 1993) and The Physicians Desk Reference (1995 and 1999).
Maximum recommended daily dose measure, Dose_MRDD_mg, converted to millimoles: Dose_MRDD_mmol = Dose_MRDD_mg / STRUCTURE_MolecularWeight. Note that this mg to mmol conversion in FDAMDD assumes that the compound dose in mg corresponds to the dose of the active ingredient in a formulation. [
Details ] [ Original data ]

Algorithm Definition

lazar obtains predictions from the experimental results of compounds with similar structures ( neighbors ). For differentiated predictions chemical similarities are always determined in respect to the endpoint under investigation . A detailled description and formal definition of the lazar regression algorithm has been published in: You can donwload the source code for lazar ( GNU General Public License ) with git :
git clone git://github.com/helma/lazar-core.git

Applicability Domain Definition

The applicability domain (AD) of the training set is characterized by the confidence index of a prediction (high confidence index: close to the applicability domain of the training set/reliable prediction, low confidence: far from the applicability domain of the trainingset/unreliable prediction). The confidence index considers (i) the similarity and number of neighbors and (ii) contradictory examples within the neighbors. A formal definition can be found in: The reliability of predictions decreases gradually with increasing distance from the applicability domain (i.e. decreasing confidence index). Figure 1 shows this dependency visually.

For simplicity we provide also results for an applicability domain definition with a sharp border at a confidence index of 0.2 . These results are summarized in Table 1 and indicated by the grey area in Figure 1 and by the black triangles in Figure 2 . Misclassifications within the applicability domain are summarized in the table of misclassifications .

The presence of substructures that are unknown to the training set ( unknown fragments ) is another factor that limits the applicability domain. As the training data cannot provide any information about unknown fragments, their relevance has to be evaluated by an expert user (as a rule of thumb large fragments are of less concern, because all shorter subfragments have been evaluated by the system). For this reason the presence/absence of unknown fragments is not considered in the formal applicability domain definition, but their presence is indicated in the table of misclassifications .

Validation Results (leave-one-out crossvalidation)

Definition and experimental comparison with external validation procedures:

Table 1: Predictions within applicability domain

r 2 0.71
RMSE 0.66
Mean error 0.47
Predictions within 1 log unit 422 from 474 ( 89.0 %)

Predictions with a confidence > 0.2 are considered to be within the applicability domain (see Applicability Domain Definition )

Table 2: All predictions

r 2 0.53
RMSE 0.82
Mean error 0.59
Predictions within 1 log unit 910 from 1112 ( 81.8 %)

Poor indication of the overall performance. Depends predominatly on the fraction of compounds beyond the applicability domain, which are by definition poorly predictable (see Applicability Domain Definition )

Figure 1: Predictions within 1 log unit vs. prediction confidence

Cumulative_accuracies

Depicts the dependency of predictive accuracy on the confidence index (i.e. the distance to the applicability domain, see Applicability Domain Definition ). Fluctuations at the left hand side of the figure are statistical artefacts (small sample sizes) and therefore irrelevant.
The table of misclassifications shows all misclassified instances within the applicability domain.

Figure 2: Correlation of predicted vs meaѕured values

Correlation

Predictions within the applicability domain are marked in black.

Mechanistic Interpretation

Neighbors

Neighbors are compounds that are similar in respect to maximum recommended daily dose . It is likely that compounds with high similarity act by similar mechanisms as the query compound. You can retrieve additional experimental data and literature citations for the neighbors and the query structure by following the "Search PubChem" links on the prediction page.

Fragments

Activating and deactivating parts of the query compound are highlighted in red and green. Fragments that are unknown (or too infrequent for statistical evaluation are marked in yellow. You can retrieve additional statistical information about the individual fragments by following the "Relevant Fragments" link. Please note that lazar predictions are based on neighbors and not on fragments. Fragments and their statistical significance are used for the calculation of activity specific similarities.
© in silico toxicology 2004-2008
Built with: lazar, OpenBabel, CDK, Ruby on Rails
JME Editor courtesy of Peter Ertl, Novartis