Bioequivalence Analysis (BE)
Bioequivalence.jl is a package for performing bioequivalence analysis.
The full API is available in the next section and provides the signatures and examples for using all the available functionality.
Quickstart
In order to use Bioequivalence.jl, add the package through
using Pkg
Pkg.add("Bioequivalence")
or using the package REPL
]add Bioequivalence
You can then load the library through
using Bioequivalence
A bioequivalence study is an instance of the type BioequivalenceStudy
and can be constructed through the pumas_be
function.
What is bioequivalence?
Bioequivalence is a concept in pharmacokinetics that captures the idea that various pharmaceutical products administrated in a similar manner (e.g., same molar dose of the same active ingredient, route of administration) can be expected to have, for all intents and purposes, the same effect on individuals in a defined population.
Clinical studies collect data which can be analyzed such as through noncompartmental analysis to obtain insightful descriptives about the contentration curve also known as pharmacokinetic endpoints. These endpoints relate to the rate (e.g., maximum concentration, time of peak concentration) and extent of absorption (e.g., area under the curve). Bioequivalence relies on the study design and pharmacokinetic endpoints from clinical trials or simulation models to make a determination about the expected effects of formulations.
Three major types of bioequivalence are regularly used:
Average (ABE): are the mean values of the distributions of the pharmacokinetic endpoints for the reference and the test formulations similar enough? The concept is the most popular with a rise in adoption in the early 1990's by the United States and the European Union. It is considered to be the easiest criterion for a new formulation to achieve bioequivalence. It is required by most regulatories agencies for the product to be approved under a bioequivalence process.
Population (PBE): are the distributions of the pharmacokinetic endpoints for the reference and the test formulations similar enough? In this case, it is not longer comparing just the expected value of the distributions but the full distribution. PBE is especially important for determining prescribability or the decision to assign a patient one of formulations as part of a treatment for the first time.
Individual (IBE): are the distributions of the pharmacokinetic endpoints for the reference and the test formulations similar enough across a large proportion of the intended population? IBE is particularly relevant for switchability or the decision to substitute an ongoing regimen (change formulation) without detrimental effects to the patient.
PBE and IBE can be assessed through two different methods:
constant scaling: the regulatory agency provides a value to be used in determining PBE or IBE.
reference scaling: the estimated total variance of the reference formulation in determining PBE or IBE.
mixed scaling: use reference scaling when the estimated total variance of the reference formulation is greater than that of the test formulation and the constant scaling otherwise.
One argument for using the mixed scaling is that if the estimate of the total variance of the test formulation is greater than of the reference it bioequivalence would be very conservative.
The reference scaling system is most used when working with highly variable drugs (HVD), those with intrasubject variability > 30%, and narrow therapeutic index drugs (NTI), those drugs where small differences in dose or blood concentration may lead to serious therapeutic failures and/or adverse drug reactions that are life-threatening or result in persistent or significant disability or incapacity.
Designs
There are three major categories of bioequivalence study desings.
Nonparametric for endpoints such as time of maximum concentration which typically do not have (or can easily transformed) a normal-like distribution.
Parallel designs which are typically used when crossover designs are not feasible.
Crossover (replicated and nonreplicated) designs which are the most commonly used by the industry.
Designs are fully characterized by:
- Subjects: participants in the study (each is assigned to a sequence)
- Formulations: the different formulations being compared (i.e., reference and additional test formulations)
- Periods: each dosing period at which each subject is administrated a formulation based on the sequence it has been assigned to
- Sequences: a dosing regimen which establishes what formulation is given at each period
The periods should be spaced enough such that there are no carryover effects from dosings in the previous periods.
Nonparametric analysis (i.e., x formulations, y sequences, z periods)
The nonparametric design performs a Wilcoxon signed rank test of the null hypothesis that the distribution of the reference formulation and the distribution of an alternative formulation have the same median.
When there are no tied ranks and ≤ 50 samples, or tied ranks and ≤ 15 samples, it will perform an exact signed rank test or approximate it otherwise.
Since the study design can be inferred from the data argument (i.e., based on sequences, formulations, and periods), the inferred study design approach will be automatically selected. Once can manually overwrite the method for the nonparametric option by selecting nonparametric = true
in pumas_be
.
Parallel design (i.e., x formulations, y periods, z sequences, e.g. R|S|T
)
Perform a Welch's t-test (i.e., unequal variance two-sample t-test) of the null hypothesis that the distribution of the reference formulation and the distribution of an alternative formulation comes have equal means. The number of degrees of freedom of the test uses the Welch-Satterthwaite equation:
Crossover designs
Crossover designs are divided into two categories:
Replicated
Nonreplicated
Nonreplicated designs have subjects assigned to distinct formulations in each period.
Replicated crossover designs are those with subjects receiving the same formulation more than once. A key feature of replicated designs is that it allows to estimate within-subject variances per formulation which are a component for assessing PBE and IPE.
Common crossover designs:
Name | Number of Formulations | Number of Periods | Number of Sequences | Example | Replicated |
---|---|---|---|---|---|
2x2 | 2 | 2 | 2 | RT|TR | false |
Balaam | 2 | 2 | 2 | RR|RT|TR|TT | true |
Dual | 2 | 3 | 2 | RTT|TRR | true |
Inner | 2 | 4 | 2 | RRTT|TTRR | true |
Outer | 2 | 4 | 2 | RTRT|TRTR | true |
Williams 3 | 3 | 3 | 6 | RST|RTS|SRT|STR|TRS|TSR | false |
Williams 4 | 4 | 4 | 4 | ADBC|BACD|CBDA|DCAB | false |
Replicated designs are preferred, particularly the inner and outer designs.
If employing the dual design, it is recommended to have a larger sample size in order to achieve the same level of statistical power.
In the United States, there is a minimum requirement of at least 12 evaluable subjects for any bioequivalence study.
For analyzing more than two formulations at a time, the Williams designs, a generalized latin square, is the preferred design given its statistical power.
There are two ways to analyze crossover designs:
Linear model
Performs a linear regression with the following model
where βⱼ, j ∈ [1, 2, 3, 4], are vectors for features where formulation
uses the dummy variable coding and sequence
and period
use contrast coding.
Linear mixed model
log(endpoint) ~ formulation + sequence + period + (1 | id)
The linear mixed model corresponds to
proc mixed data = data method = ml;
class sequence subject period formulation;
model ln_endpoint = sequence period formulation;
random subject(sequence);
in SAS.
One can request to use the restricted maximum likelihood (REML) objective to match SAS default value through passing the reml = true
argument to pumas_be
.
Per the Food and Drug Administration (US regulatory agency) guidance, replicated crossover designs should employ the linear mixed model approach while nonreplicated crossover desings should employ a linear model (linear mixed models for nonreplicated crossover desings are also acceptable).
Each design has been tested using various sources including:
- Chow, Shein-Chung, and Jen-pei Liu. 2009. Design and Analysis of Bioavailability and Bioequivalence Studies. 3rd ed. Chapman & Hall/CRC Biostatistics Series 27. Boca Raton: CRC Press. DOI: 10.1201/9781420011678.
- Fuglsang, Anders, Helmut Schütz, and Detlew Labes. 2015. "Reference Datasets for Bioequivalence Trials in a Two-Group Parallel Design." The AAPS Journal 17 (2): 400–404. DOI: 10.1208/s12248-014-9704-6.
- Patterson, Scott D, and Byron Jones. 2017. Bioequivalence and Statistics in Clinical Pharmacology. 2nd ed. Chapman & Hall/CRC Biostatistics Series. DOI: 10.1201/9781315374161.
- Schütz, Helmut, Detlew Labes, and Anders Fuglsang. 2014. "Reference Datasets for 2-Treatment, 2-Sequence, 2-Period Bioequivalence Studies." The AAPS Journal 16 (6): 1292–97. DOI: 10.1208/s12248-014-9661-0.
API
Public
Bioequivalence.Bioequivalence
— ModuleBioequivalence.jl
This module offers a suite of routines for bioequivalence (BE) analysis.
Bioequivalence.BioequivalenceStudy
— TypeBioequivalenceStudy
Return a bioequivalence study.
See also: pumas_be
.
Fields
data::DataFrame
data used for the studydata_stats::NamedTuple
total::Int
refers to the number of observations the data passed to the function had.used_for_analysis::Int
refers to the number of observations used for fitting the model (e.g., drop missing values)formulation::DataFrame
gives a DataFrame with the summary statistics of the statistical model's response by formulationsequence::DataFrame
gives a DataFrame with the summary statistics of the statistical model's response by sequenceperiod::DataFrame
gives a DataFrame with the summary statistics of the statistical model's response by period
design::NamedTuple
number of subjects in each sequencemodel
statistical models used for the analysisresult::DataFrame
results for inference
Examples
julia> data = Bioequivalence.testdata("PJ2017_4_5")
186×5 DataFrame
│ Row │ id │ sequence │ period │ AUC │ Cmax │
│ │ Int64 │ Cat… │ Int64 │ Int64? │ Int64? │
├─────┼───────┼──────────┼────────┼─────────┼─────────┤
│ 1 │ 1 │ SRT │ 1 │ 7260 │ 1633 │
│ 2 │ 1 │ SRT │ 2 │ 6463 │ 1366 │
│ 3 │ 1 │ SRT │ 3 │ 8759 │ 2141 │
│ 4 │ 2 │ RTS │ 1 │ 3457 │ 776 │
│ 5 │ 2 │ RTS │ 2 │ 6556 │ 2387 │
│ 6 │ 2 │ RTS │ 3 │ 4081 │ 1355 │
│ 7 │ 4 │ TSR │ 1 │ 4006 │ 1326 │
│ 8 │ 4 │ TSR │ 2 │ 4879 │ 1028 │
│ 9 │ 4 │ TSR │ 3 │ 3817 │ 1052 │
│ 10 │ 5 │ STR │ 1 │ 4250 │ 945 │
⋮
│ 176 │ 61 │ RTS │ 2 │ 2947 │ 744 │
│ 177 │ 61 │ RTS │ 3 │ 3779 │ 1144 │
│ 178 │ 62 │ SRT │ 1 │ 5787 │ 1461 │
│ 179 │ 62 │ SRT │ 2 │ 7069 │ 1995 │
│ 180 │ 62 │ SRT │ 3 │ 6530 │ 1236 │
│ 181 │ 63 │ TRS │ 1 │ 2204 │ 495 │
│ 182 │ 63 │ TRS │ 2 │ 2927 │ 770 │
│ 183 │ 63 │ TRS │ 3 │ missing │ missing │
│ 184 │ 67 │ RST │ 1 │ 4045 │ 1025 │
│ 185 │ 67 │ RST │ 2 │ 7865 │ 2668 │
│ 186 │ 67 │ RST │ 3 │ missing │ missing │
julia> output = pumas_be(data, endpoint = :Cmax, method = :lmm, reml = true)
Design: RST|RTS|SRT|STR|TRS|TSR
Sequences: RST|RTS|SRT|STR|TRS|TSR (6)
Periods: 1:3 (3)
Subjects per Sequence: (RST = 9, RTS = 11, SRT = 11, STR = 10, TRS = 11, TSR = 10)
Average Bioequivalence
───────────────────────────────────────────────────────────────────────
PE SE lnLB lnUB GMR LB UB
───────────────────────────────────────────────────────────────────────
S - R 0.468471 0.0525592 0.381334 0.555607 1.59755 1.4642 1.743
T - R 0.259687 0.0525372 0.172587 0.346787 1.29652 1.1883 1.4146
───────────────────────────────────────────────────────────────────────
julia> output.data_stats.formulation
3×10 DataFrame
│ Row │ formulation │ exp_mean │ mean │ std │ min │ q25 │ median │ q75 │ max │ n │
│ │ Cat… │ Float64 │ Float64 │ Float64 │ Float64 │ Float64 │ Float64 │ Float64 │ Float64 │ Int64 │
├─────┼─────────────┼──────────┼─────────┼──────────┼─────────┼─────────┼─────────┼─────────┼─────────┼───────┤
│ 1 │ 'R' │ 837.478 │ 6.7304 │ 0.466938 │ 5.89715 │ 6.3257 │ 6.66568 │ 7.09589 │ 7.71334 │ 62 │
│ 2 │ 'S' │ 1339.96 │ 7.20039 │ 0.419893 │ 6.09131 │ 6.92952 │ 7.20117 │ 7.5251 │ 8.02027 │ 61 │
│ 3 │ 'T' │ 1078.08 │ 6.98294 │ 0.473224 │ 5.75574 │ 6.62539 │ 7.00851 │ 7.29641 │ 7.90286 │ 61 │
julia> output.data_stats.sequence
6×10 DataFrame
│ Row │ sequence │ exp_mean │ mean │ std │ min │ q25 │ median │ q75 │ max │ n │
│ │ Cat… │ Float64 │ Float64 │ Float64 │ Float64 │ Float64 │ Float64 │ Float64 │ Float64 │ Int64 │
├─────┼──────────┼──────────┼─────────┼──────────┼─────────┼─────────┼─────────┼─────────┼─────────┼───────┤
│ 1 │ RST │ 997.282 │ 6.90503 │ 0.493722 │ 5.90263 │ 6.6385 │ 6.92755 │ 7.17642 │ 7.88908 │ 26 │
│ 2 │ RTS │ 1084.03 │ 6.98844 │ 0.499403 │ 6.09131 │ 6.4677 │ 7.05618 │ 7.32449 │ 7.77779 │ 33 │
│ 3 │ SRT │ 1187.43 │ 7.07954 │ 0.482725 │ 5.89715 │ 6.63068 │ 7.11964 │ 7.45124 │ 7.90286 │ 33 │
│ 4 │ STR │ 869.299 │ 6.76769 │ 0.404857 │ 6.04501 │ 6.4758 │ 6.8663 │ 6.95607 │ 7.71913 │ 30 │
│ 5 │ TRS │ 1180.5 │ 7.07369 │ 0.466733 │ 6.20456 │ 6.71254 │ 7.11698 │ 7.39368 │ 7.96797 │ 32 │
│ 6 │ TSR │ 1071.51 │ 6.97682 │ 0.556906 │ 5.75574 │ 6.69448 │ 7.02452 │ 7.28049 │ 8.02027 │ 30 │
julia> output.data_stats.period
3×10 DataFrame
│ Row │ period │ exp_mean │ mean │ std │ min │ q25 │ median │ q75 │ max │ n │
│ │ Cat… │ Float64 │ Float64 │ Float64 │ Float64 │ Float64 │ Float64 │ Float64 │ Float64 │ Int64 │
├─────┼────────┼──────────┼─────────┼──────────┼─────────┼─────────┼─────────┼─────────┼─────────┼───────┤
│ 1 │ 1 │ 1009.89 │ 6.9176 │ 0.498246 │ 5.75574 │ 6.46653 │ 6.97018 │ 7.28186 │ 7.72356 │ 62 │
│ 2 │ 2 │ 1108.56 │ 7.01082 │ 0.460876 │ 5.89715 │ 6.63035 │ 7.06641 │ 7.31235 │ 8.02027 │ 62 │
│ 3 │ 3 │ 1076.82 │ 6.98176 │ 0.516518 │ 6.03787 │ 6.63167 │ 6.99805 │ 7.30986 │ 7.96797 │ 60 │
julia> output.model
Linear mixed model fit by REML
Cmax ~ 1 + formulation + sequence + period + (1 | id)
REML criterion at convergence: 197.54813471940628
Variance components:
Column Variance Std.Dev.
id (Intercept) 0.121775198 0.34896303
Residual 0.084569128 0.29080772
Number of obs: 184; levels of grouping factors: 62
Fixed-effects parameters:
────────────────────────────────────────────────────────────
Estimate Std.Error z value P(>|z|)
────────────────────────────────────────────────────────────
(Intercept) 6.72421 0.0578144 116.307 <1e-99
formulation: S 0.468471 0.0525592 8.9132 <1e-18
formulation: T 0.259687 0.0525372 4.94292 <1e-6
sequence: RTS 0.0215114 0.107371 0.200346 0.8412
sequence: SRT 0.112618 0.107371 1.04887 0.2942
sequence: STR -0.19924 0.111523 -1.78653 0.0740
sequence: TRS 0.101829 0.107712 0.945378 0.3445
sequence: TSR 0.0098946 0.111523 0.0887222 0.9293
period: 2 0.0505335 0.0302924 1.66819 0.0953
period: 3 0.00726902 0.0306267 0.237342 0.8124
────────────────────────────────────────────────────────────
julia> output.model_stats.Wald
────────────────────────────────────────────────────────
Wald Distribution p-value
────────────────────────────────────────────────────────
Formulation 39.914 FDist(ν1=2.0, ν2=116.0) <1e-13
Sequence 0.898625 FDist(ν1=5.0, ν2=56.0) 0.4885
Period 2.17727 FDist(ν1=2.0, ν2=116.0) 0.1180
────────────────────────────────────────────────────────
julia> output.model_stats.lsmeans
──────────────────────────────────────────────────────────────────────────────
exp_Mean Mean Standard Deviation t-statistic Distribution p-value
──────────────────────────────────────────────────────────────────────────────
R 832.312 6.72421 0.0578144 116.307 TDist(ν=118.0) <1e-99
S 1329.66 7.19268 0.0580685 123.865 TDist(ν=118.0) <1e-99
T 1079.11 6.98389 0.0581118 120.18 TDist(ν=118.0) <1e-99
──────────────────────────────────────────────────────────────────────────────
Bioequivalence.generate_design
— Methodgenerate_design(design::AbstractString,
amt::Union{Number,AbstractVector{<:Number}},
formulation::AbstractVector,
subjects_per_sequence::Union{<:Integer,AbstractVector{<:Integer}},
)::DataFrame
Returns a DataFrame with id, sequence, period, formulation, amt, evid, cmt, and time. It can be used to quickly set up data for Pumas, NCA, and Bioequivalence. In order to add covariates, use innerjoin
to join the result of this function with another DataFrame with covariates.
The following designs are available:
- "Parallel" => 'A':'A' + num_formulations - 1
- "2x2" => ["RT", "TR"]
- "Balaam" => ["RR", "RT", "TR", "TT"]
- "Dual" => ["RTT", "TRR"]
- "2S4P1" => ["RTTR", "TRRT"]
- "2S4P2" => ["RTRT", "TRTR"]
- "WD3F" => ["ABC", "ACB", "BAC", "BCA", "CAB", "CBA"]
- "WD4F" => ["ABCD", "CADB", "DCBA", "BDAC"]
Examples
julia> using DataFrames, Random
julia> skeleton = generate_design("Parallel", 100, ["tablet", "soft", "hard"], 10)
30×8 DataFrame
│ Row │ id │ sequence │ period │ formulation │ amt │ time │ evid │ cmt │
│ │ Int64 │ Cat… │ Int64 │ Cat… │ Int64 │ Int64 │ Int64 │ Int64 │
├─────┼───────┼──────────┼────────┼─────────────┼───────┼───────┼───────┼───────┤
│ 1 │ 1 │ A │ 1 │ tablet │ 100 │ 0 │ 4 │ 1 │
│ 2 │ 2 │ A │ 1 │ tablet │ 100 │ 0 │ 4 │ 1 │
│ 3 │ 3 │ A │ 1 │ tablet │ 100 │ 0 │ 4 │ 1 │
│ 4 │ 4 │ A │ 1 │ tablet │ 100 │ 0 │ 4 │ 1 │
│ 5 │ 5 │ A │ 1 │ tablet │ 100 │ 0 │ 4 │ 1 │
│ 6 │ 6 │ A │ 1 │ tablet │ 100 │ 0 │ 4 │ 1 │
│ 7 │ 7 │ A │ 1 │ tablet │ 100 │ 0 │ 4 │ 1 │
│ 8 │ 8 │ A │ 1 │ tablet │ 100 │ 0 │ 4 │ 1 │
│ 9 │ 9 │ A │ 1 │ tablet │ 100 │ 0 │ 4 │ 1 │
│ 10 │ 10 │ A │ 1 │ tablet │ 100 │ 0 │ 4 │ 1 │
⋮
│ 20 │ 20 │ B │ 1 │ soft │ 100 │ 0 │ 4 │ 1 │
│ 21 │ 21 │ C │ 1 │ hard │ 100 │ 0 │ 4 │ 1 │
│ 22 │ 22 │ C │ 1 │ hard │ 100 │ 0 │ 4 │ 1 │
│ 23 │ 23 │ C │ 1 │ hard │ 100 │ 0 │ 4 │ 1 │
│ 24 │ 24 │ C │ 1 │ hard │ 100 │ 0 │ 4 │ 1 │
│ 25 │ 25 │ C │ 1 │ hard │ 100 │ 0 │ 4 │ 1 │
│ 26 │ 26 │ C │ 1 │ hard │ 100 │ 0 │ 4 │ 1 │
│ 27 │ 27 │ C │ 1 │ hard │ 100 │ 0 │ 4 │ 1 │
│ 28 │ 28 │ C │ 1 │ hard │ 100 │ 0 │ 4 │ 1 │
│ 29 │ 29 │ C │ 1 │ hard │ 100 │ 0 │ 4 │ 1 │
│ 30 │ 30 │ C │ 1 │ hard │ 100 │ 0 │ 4 │ 1 │
julia> skeleton = generate_design("2S4P2", [50, 25], ["tablet", "capsule"], [10, 9])
76×8 DataFrame
│ Row │ id │ sequence │ period │ formulation │ amt │ time │ evid │ cmt │
│ │ Int64 │ Cat… │ Int64 │ Cat… │ Int64 │ Int64 │ Int64 │ Int64 │
├─────┼───────┼──────────┼────────┼─────────────┼───────┼───────┼───────┼───────┤
│ 1 │ 1 │ RTRT │ 1 │ tablet │ 50 │ 0 │ 4 │ 1 │
│ 2 │ 1 │ RTRT │ 2 │ capsule │ 25 │ 0 │ 4 │ 1 │
│ 3 │ 1 │ RTRT │ 3 │ tablet │ 50 │ 0 │ 4 │ 1 │
│ 4 │ 1 │ RTRT │ 4 │ capsule │ 25 │ 0 │ 4 │ 1 │
│ 5 │ 2 │ RTRT │ 1 │ tablet │ 50 │ 0 │ 4 │ 1 │
│ 6 │ 2 │ RTRT │ 2 │ capsule │ 25 │ 0 │ 4 │ 1 │
│ 7 │ 2 │ RTRT │ 3 │ tablet │ 50 │ 0 │ 4 │ 1 │
│ 8 │ 2 │ RTRT │ 4 │ capsule │ 25 │ 0 │ 4 │ 1 │
│ 9 │ 3 │ RTRT │ 1 │ tablet │ 50 │ 0 │ 4 │ 1 │
│ 10 │ 3 │ RTRT │ 2 │ capsule │ 25 │ 0 │ 4 │ 1 │
⋮
│ 66 │ 17 │ TRTR │ 2 │ tablet │ 50 │ 0 │ 4 │ 1 │
│ 67 │ 17 │ TRTR │ 3 │ capsule │ 25 │ 0 │ 4 │ 1 │
│ 68 │ 17 │ TRTR │ 4 │ tablet │ 50 │ 0 │ 4 │ 1 │
│ 69 │ 18 │ TRTR │ 1 │ capsule │ 25 │ 0 │ 4 │ 1 │
│ 70 │ 18 │ TRTR │ 2 │ tablet │ 50 │ 0 │ 4 │ 1 │
│ 71 │ 18 │ TRTR │ 3 │ capsule │ 25 │ 0 │ 4 │ 1 │
│ 72 │ 18 │ TRTR │ 4 │ tablet │ 50 │ 0 │ 4 │ 1 │
│ 73 │ 19 │ TRTR │ 1 │ capsule │ 25 │ 0 │ 4 │ 1 │
│ 74 │ 19 │ TRTR │ 2 │ tablet │ 50 │ 0 │ 4 │ 1 │
│ 75 │ 19 │ TRTR │ 3 │ capsule │ 25 │ 0 │ 4 │ 1 │
│ 76 │ 19 │ TRTR │ 4 │ tablet │ 50 │ 0 │ 4 │ 1 │
julia> Random.seed!(0);
julia> data = innerjoin(skeleton,
DataFrame(id = 1:size(skeleton, 1),
wt = rand(100:200, size(skeleton, 1)),
age = rand(25:85, size(skeleton, 1))),
on = :id)
76×10 DataFrame
│ Row │ id │ sequence │ period │ formulation │ amt │ time │ evid │ cmt │ wt │ age │
│ │ Int64 │ Cat… │ Int64 │ Cat… │ Int64 │ Int64 │ Int64 │ Int64 │ Int64 │ Int64 │
├─────┼───────┼──────────┼────────┼─────────────┼───────┼───────┼───────┼───────┼───────┼───────┤
│ 1 │ 1 │ RTRT │ 1 │ tablet │ 50 │ 0 │ 4 │ 1 │ 164 │ 63 │
│ 2 │ 1 │ RTRT │ 2 │ capsule │ 25 │ 0 │ 4 │ 1 │ 164 │ 63 │
│ 3 │ 1 │ RTRT │ 3 │ tablet │ 50 │ 0 │ 4 │ 1 │ 164 │ 63 │
│ 4 │ 1 │ RTRT │ 4 │ capsule │ 25 │ 0 │ 4 │ 1 │ 164 │ 63 │
│ 5 │ 2 │ RTRT │ 1 │ tablet │ 50 │ 0 │ 4 │ 1 │ 182 │ 59 │
│ 6 │ 2 │ RTRT │ 2 │ capsule │ 25 │ 0 │ 4 │ 1 │ 182 │ 59 │
│ 7 │ 2 │ RTRT │ 3 │ tablet │ 50 │ 0 │ 4 │ 1 │ 182 │ 59 │
│ 8 │ 2 │ RTRT │ 4 │ capsule │ 25 │ 0 │ 4 │ 1 │ 182 │ 59 │
│ 9 │ 3 │ RTRT │ 1 │ tablet │ 50 │ 0 │ 4 │ 1 │ 125 │ 76 │
│ 10 │ 3 │ RTRT │ 2 │ capsule │ 25 │ 0 │ 4 │ 1 │ 125 │ 76 │
⋮
│ 66 │ 17 │ TRTR │ 2 │ tablet │ 50 │ 0 │ 4 │ 1 │ 185 │ 62 │
│ 67 │ 17 │ TRTR │ 3 │ capsule │ 25 │ 0 │ 4 │ 1 │ 185 │ 62 │
│ 68 │ 17 │ TRTR │ 4 │ tablet │ 50 │ 0 │ 4 │ 1 │ 185 │ 62 │
│ 69 │ 18 │ TRTR │ 1 │ capsule │ 25 │ 0 │ 4 │ 1 │ 195 │ 79 │
│ 70 │ 18 │ TRTR │ 2 │ tablet │ 50 │ 0 │ 4 │ 1 │ 195 │ 79 │
│ 71 │ 18 │ TRTR │ 3 │ capsule │ 25 │ 0 │ 4 │ 1 │ 195 │ 79 │
│ 72 │ 18 │ TRTR │ 4 │ tablet │ 50 │ 0 │ 4 │ 1 │ 195 │ 79 │
│ 73 │ 19 │ TRTR │ 1 │ capsule │ 25 │ 0 │ 4 │ 1 │ 149 │ 33 │
│ 74 │ 19 │ TRTR │ 2 │ tablet │ 50 │ 0 │ 4 │ 1 │ 149 │ 33 │
│ 75 │ 19 │ TRTR │ 3 │ capsule │ 25 │ 0 │ 4 │ 1 │ 149 │ 33 │
│ 76 │ 19 │ TRTR │ 4 │ tablet │ 50 │ 0 │ 4 │ 1 │ 149 │ 33 │
Bioequivalence.pumas_be
— Methodpumas_be(data::AbstractDataFrame;
endpoint::Union{Integer, Symbol} = :AUC,
logtransformed::Bool = false,
σw₀::Real = 0.1,
𝛥::Real = 1.11,
id::Union{Integer, Symbol} = :id,
sequence::Union{Integer, Symbol} = :sequence,
period::Union{Integer, Symbol} = :period,
reference::Union{Nothing, Char} = nothing,
method::Symbol = occursin(r"(?i)tmax", string(endpoint)) ? :nonpar : :fda,
reml::Bool = false,
)::BioequivalenceStudy
BioequivalenceStudy
constructor.
See also: BioequivalenceStudy
.
Arguments
data
: must haveid
,sequence
,period
, and anendpoint
.endpoint
: which variable is the endpoint?logtransformed
: has the endpoint been log transformed?id
: which variable is the subject identifier?sequence
: which variable is the sequence?period
: which variable is the period?σw₀
: regulatory constant for reference-scaled estimates𝛥
: auxiliary parameter for reference-scaled estimatesreference
: which formulation is the reference?
For example, in a design with RTTR|TRRT one can specify 'R' to be the reference. By default, the reference is taken to be the first character (alphabetically).
method
: Which statistical method should it use?
Values can be
:fda
(default value; choose linear models if nonreplicated and linear mixed models if replicated):lm
use a linear model:lmm
use a linear mixed model:nonpar
use a nonparametric model (default if name of endpoint includestmax
ignoring case)reml
: if the design uses a linear mixed model, should it optimize REML instead of ML?
Current designs include:
- Nonparametric
- Parallel (e.g., R|T, A|B|C)
- 2x2 (e.g., RT|TR)
- Balaam (i.e., 2 formulations, 4 sequences, 2 periods, e.g.,
RR|RT|TR|TT
) - Dual (i.e., 2 formulations, 2 sequences, 3 periods, e.g.,
RTT|TRR
) - Inner (i.e., 2 formulations, 2 sequences, 4 periods, e.g.,
RRTT|TTRR
) - Outer (i.e., 2 formulations, 2 sequences, 4 periods, e.g.,
RTRT|TRTR
) - Williams 3 (i.e., 3 formulations, 6 sequences, 3 periods, e.g.,
RST|RTS|SRT|STR|TRS|TSR
) - Williams 4 (i.e., 4 formulations, 4 sequences, 4 periods, e.g.,
ADBC|BACD|CBDA|DCAB
)
Examples
julia> data = Bioequivalence.testdata("SLF2014_1")
36×4 DataFrame
│ Row │ id │ sequence │ period │ AUC │
│ │ Int64 │ Cat… │ Int64 │ Float64 │
├─────┼───────┼──────────┼────────┼─────────┤
│ 1 │ 1 │ RT │ 1 │ 181.09 │
│ 2 │ 1 │ RT │ 2 │ 210.14 │
│ 3 │ 2 │ RT │ 1 │ 114.48 │
│ 4 │ 2 │ RT │ 2 │ 98.72 │
│ 5 │ 3 │ TR │ 1 │ 225.95 │
│ 6 │ 3 │ TR │ 2 │ 241.09 │
│ 7 │ 4 │ RT │ 1 │ 176.91 │
│ 8 │ 4 │ RT │ 2 │ 186.65 │
│ 9 │ 5 │ TR │ 1 │ 147.01 │
│ 10 │ 5 │ TR │ 2 │ 139.56 │
⋮
│ 26 │ 13 │ TR │ 2 │ 165.09 │
│ 27 │ 14 │ TR │ 1 │ 179.96 │
│ 28 │ 14 │ TR │ 2 │ 181.09 │
│ 29 │ 15 │ TR │ 1 │ 173.86 │
│ 30 │ 15 │ TR │ 2 │ 206.66 │
│ 31 │ 16 │ RT │ 1 │ 144.0 │
│ 32 │ 16 │ RT │ 2 │ 143.25 │
│ 33 │ 17 │ RT │ 1 │ 185.1 │
│ 34 │ 17 │ RT │ 2 │ 192.22 │
│ 35 │ 18 │ TR │ 1 │ 117.99 │
│ 36 │ 18 │ TR │ 2 │ 125.5 │
julia> output = pumas_be(data)
Design: RT|TR
Sequences: RT|TR (2)
Periods: 1:2 (2)
Subjects per Sequence: (RT = 9, TR = 9)
Average Bioequivalence
──────────────────────────────────────────────────────────────────────────────
PE SE lnLB lnUB GMR LB UB
──────────────────────────────────────────────────────────────────────────────
T - R -0.0503868 0.026658 -0.0969286 -0.00384499 0.950862 0.9076 0.9962
──────────────────────────────────────────────────────────────────────────────
julia> data = Bioequivalence.testdata("PJ2017_4_1")
285×5 DataFrame
│ Row │ id │ sequence │ period │ AUC │ Cmax │
│ │ Int64 │ Cat… │ Int64 │ Float64? │ Float64? │
├─────┼───────┼──────────┼────────┼──────────┼──────────┤
│ 1 │ 101 │ TRR │ 1 │ 12.26 │ 0.511 │
│ 2 │ 101 │ TRR │ 2 │ 16.19 │ 0.688 │
│ 3 │ 101 │ TRR │ 3 │ 11.34 │ 0.533 │
│ 4 │ 102 │ TRR │ 1 │ 397.98 │ 13.27 │
│ 5 │ 102 │ TRR │ 2 │ 267.63 │ 7.933 │
│ 6 │ 102 │ TRR │ 3 │ 487.55 │ 12.952 │
│ 7 │ 103 │ TRR │ 1 │ 243.81 │ 16.771 │
│ 8 │ 103 │ TRR │ 2 │ 141.7 │ 6.926 │
│ 9 │ 103 │ TRR │ 3 │ 198.44 │ 9.257 │
│ 10 │ 109 │ TRR │ 1 │ 182.52 │ 8.816 │
⋮
│ 275 │ 186 │ RTT │ 2 │ 28.82 │ 1.65 │
│ 276 │ 186 │ RTT │ 3 │ 87.63 │ 4.87 │
│ 277 │ 190 │ RTT │ 1 │ 82.78 │ 3.88 │
│ 278 │ 190 │ RTT │ 2 │ 164.56 │ 7.37 │
│ 279 │ 190 │ RTT │ 3 │ 213.98 │ 7.01 │
│ 280 │ 191 │ RTT │ 1 │ 98.86 │ 4.59 │
│ 281 │ 191 │ RTT │ 2 │ 99.02 │ 2.96 │
│ 282 │ 191 │ RTT │ 3 │ 75.48 │ 2.38 │
│ 283 │ 194 │ RTT │ 1 │ 21.29 │ 1.51 │
│ 284 │ 194 │ RTT │ 2 │ 46.3 │ 2.74 │
│ 285 │ 194 │ RTT │ 3 │ 15.41 │ 1.41 │
julia> output = pumas_be(data)
Design: RTT|TRR
Sequences: RTT|TRR (2)
Periods: 1:3 (3)
Subjects per Sequence: (RTT = 46, TRR = 48)
Regulatory constant for reference-scaled estimates: 0.10
Auxiliary parameter for reference-scaled estimates: 1.11
Average Bioequivalence
──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
PE SE lnLB lnUB GMR LB UB scLB scUB Varratio VarLB VarUB
──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
T - R -0.0285211 0.0674897 -0.140116 0.0830736 0.971882 0.8692 1.0867 0.4525 2.2095 1.53684 1.1907 1.9909
──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
Private
Base.summary
— Methodreference_scaled!(obj::BioequivalenceStudy)
Adds the reference-scaled parameters when applicable.
Bioequivalence._compute_σ²wv
— Method_compute_σ²wv(data::AbstractDataFrame, sequence::Union{AbstractString,Symbol})::DataFrame
Used by compute_σ²wv
internally to compute the value per formulation. The data
argument assumes a single formulation.
Bioequivalence.abe
— Functionabe(::Type{T},
data::AbstractDataFrame,
test::AbstractChar,
reference::AbstractChar) where T <: Union{ApproximateSignedRankTest,
UnequalVarianceTTest}::DataFrame
Average Bioequivalence Bioequivalence Modeling
Bioequivalence.compute_σ²wv
— Methodcompute_σ²wv(data::AbstractDataFrame, id::Symbol, sequence::Symbol, endpoint::Symbol)::DataFrame
Returns the values per formulation for assessing population bioequivalence.
See also: _compute_σ²wv
.
Bioequivalence.detect_design
— Methoddetect_design(sequences::AbstractVector)::Tuple{Symbol,Vector{Char}}
The study design: Parallel, Crossover, Balaam, Higher or throws an error. Sequences based on the implied formulation and order.
Bioequivalence.implied_design
— Methodimplied_design(data::AbstractDataFrame)::NamedTuple
Sequence and number of subjects per sequence
Bioequivalence.lsmeans
— Functionlsmeans(model::Union{LinearMixedModel, TableRegressionModel{<:LinearModel}},
formulations::AbstractVector{<:Char},
)::CoefTable
Return the least squares geometric means for the formulations.
Bioequivalence.summarize
— Methodsummarize(data, endpoint::Symbol)
Returns the summary statistics for endpoint.
Bioequivalence.testdata
— Methodtestdata(filename::AbstractString)::DataFrame
Return the test dataset requested.
Examples
julia> designs = readdir(joinpath(dirname(pathof(Bioequivalence)), "..", "data"))
7-element Array{String,1}:
"2S2P"
"2S4P"
"Balaam"
"Dual"
"Nonparametric"
"Parallel"
"Williams"
julia> crossover_datasets = readdir(joinpath(dirname(pathof(Bioequivalence)), "..", "data", designs[1]))
11-element Array{String,1}:
"CL2009_3_6_1.tsv"
"PJ2017_3_1.tsv"
"PJ2017_3_12.tsv"
"SLF2014_1.tsv"
"SLF2014_2.tsv"
"SLF2014_3.tsv"
"SLF2014_4.tsv"
"SLF2014_5.tsv"
"SLF2014_6.tsv"
"SLF2014_7.tsv"
"SLF2014_8.tsv"
julia> Bioequivalence.testdata("PJ2017_3_1")
64×5 DataFrame
│ Row │ id │ sequence │ period │ AUC │ Cmax │
│ │ Int64 │ Cat… │ Int64 │ Int64 │ Int64 │
├─────┼───────┼──────────┼────────┼───────┼───────┤
│ 1 │ 1 │ RT │ 1 │ 2849 │ 499 │
│ 2 │ 1 │ RT │ 2 │ 2230 │ 436 │
│ 3 │ 2 │ TR │ 1 │ 2025 │ 438 │
│ 4 │ 2 │ TR │ 2 │ 2000 │ 361 │
│ 5 │ 3 │ TR │ 1 │ 2090 │ 535 │
│ 6 │ 3 │ TR │ 2 │ 1826 │ 558 │
│ 7 │ 4 │ RT │ 1 │ 2790 │ 733 │
│ 8 │ 4 │ RT │ 2 │ 2864 │ 416 │
│ 9 │ 5 │ RT │ 1 │ 2112 │ 344 │
│ 10 │ 5 │ RT │ 2 │ 1744 │ 48 │
⋮
│ 54 │ 29 │ RT │ 2 │ 1147 │ 221 │
│ 55 │ 30 │ TR │ 1 │ 2519 │ 537 │
│ 56 │ 30 │ TR │ 2 │ 1941 │ 400 │
│ 57 │ 31 │ RT │ 1 │ 1696 │ 390 │
│ 58 │ 31 │ RT │ 2 │ 1801 │ 350 │
│ 59 │ 34 │ RT │ 1 │ 1737 │ 425 │
│ 60 │ 34 │ RT │ 2 │ 1655 │ 319 │
│ 61 │ 35 │ TR │ 1 │ 1560 │ 463 │
│ 62 │ 35 │ TR │ 2 │ 1629 │ 372 │
│ 63 │ 36 │ RT │ 1 │ 2040 │ 464 │
│ 64 │ 36 │ RT │ 2 │ 2199 │ 384 │
Bioequivalence.walds_tests
— Methodwalds_tests(model::LinearMixedModel)::CoefTable
Wald test provdes a statistical test assessing whether the model parameters are jointly statistically significant from zero.
The degrees of freedom are computed à la containment method from SAS Software.
The standard errors are computed à la model-based method from SAS Software.
Bioequivalence.walds_tests
— Methodwalds_tests(model::TableRegressionModel{<:LinearModel})::CoefTable
Wald test provdes a statistical test assessing whether the model parameters are jointly statistically significant from zero.