needs to evaluate the cumulative bivariate normal distribution,
which is computationally intensive. If no covariates are used, only
one set of computations need to be done for the entire data set. If
any covariates are used, a new computation must be computed for every
observation.