# 5 Data Specification

The function IQdataNCA takes the source dataset and extends it by adding columns containing information about the handling of the data and the methods used in the NCA analysis. In this way, all options and choices by the user about the analysis are stored in one place. This augmented dataset, namely the IQdataNCA object, is then ready for the NCA parameter calculation.

## 5.1 IQdataNCA function

[Insert link here to IQdataNCA R DocumentATION]

## 5.2 IQdataNCA object

The IQdataNCA object is the source dataset augmented by a fixed number of columns which are added through the function. These added columns may be empty or contain default values if they cannot be derived from the data or user input. Note that some column names from the source dataset need to be mapped beforehand according to the IQdataNCA specification.

The types of columns in the IQdataNCA object are:

• Required: columns from the source dataset which are minimally required by IQdataNCA
• Conditionally required: columns from the source dataset, conditionally required depending on profile and administration type
• Optional: columns from the source dataset used for annotation of the dataset
• Covariate: columns from the source dataset
• Index/Flag/Comment: defined by user input, these columns flag records which are to be ignored, with corresponding comments
• Time: columns containing time data, derived from the original time data of the source data set
• Blloq: columns containing information about the handling of blloq values, defined by user input
• Concentration: columns containing concentration data, derived from the original concentration data of the source data set
• Slope: parameters calculated by the best/manual-slope algorithm
• NCA: columns indicating the AUC calculation method to be used, and intervals for corresponding interval AUC calculation
Table 5.1: Columns of IQdataNCA object
Column Label Description Comment
Required
USUBJID Unique subject identifier Unique subject identifier Always to be provided
STUDYID Study ID Study ID
COMPOUND Name of compound Name of the compound
ANALYTE Name of analyte Name of the analyte Might be available in PCTEST
MATRIX Matrix definition Biological matrix PCSPEC
PROFILE Annotation of distinct profiles Unique name for the same type of profiles to be analyzed together. User defined - could be based on one or multiple columns in the source data.
PROFTYPE Type of profile “SD” for single dose, “FD” for first dose, “SS” for steady-state. “FD” is generally treated the same as “SD”. The main difference is that the follow-up will be limited by the dosing interval. Always to be provided
GROUP Treatment group information Treatment group information, used for stratification and annotation in figures, tables, listings. Might be available as ACTARMCD or ACTARM in adpc. Could also be constructed based on DOSE+DOSEUNIT and potentially other identifiers.
GROUPN Treatment group dose information Numeric representation for GROUP. In fixed dose dosing the GROUPN value should be identical to DOSE column. In relative dose dosing (e.g. X mg/kg or X ng/m2) GROUPN = X. The information is used for stratification and annotation in figures, tables, listings. Always to be provided
GROUPU Unit of GROUPN Unit of GROUPN. The unit of the numeric group representation. E.g. “mg”, “mg/kg”, “mg/m2”, etc.
DAY Actual Study Day of PK sampling Day of PK sampling PCDAY
ATIME Actual relative time to previous dose Actual relative time of concentration observation in TIMEUNIT units. Difference between PCDTC and PCRFTDTC in TIMEUNIT units. If some information is missing then set to NA (single records or all records). Imputation in certain cases can be done but for now should be done manually, entering info what was done into the COMMENTR column
NTIME Nominal relative time to previous dose Nominal / protocol relative time of concentration observations in TIMEUNIT units. PCTPTNUM If not available set to NA (single records or all records)
TIMEUNIT Time unit Used time unit. Definition of the general time unit for analysis. Key time and duration columns need to be in this unit. “Minutes”, “Hours”, “Days”, “Weeks” “Minutes”, “Hours”, “Days”, “Weeks” are currently valid entries
ACONC Actual observed concentrations Actual observed concentrations in CONCUNIT units. NA: missing. 0: BLLOQ. “Value”: actual value. If other alphanumeric character then interpret as no value available and set to NA To be used for listings. 0 then to be reported only as BLQ.

Based on ACONC=PCSTRESN
ACONC[PCORRES==“BLQ“] = 0

If units allow then can also be determined by: ACONC = PCORRES ACONC[PCORRES==“BLQ”] = 0 as.numeric(ACONC)
CONCUNIT Unit of observed concentrations Unit of observed concentrations PCSTRESU or PCORRESU
LLOQ Lower limit of quantification Lower limit of quantification in CONCUNIT units PCLLOQ
ADM Administration type Administration type to be considered in NCA PK parameter determination. “BOLUS”, “INFUSION”, or “EXTRAVASCULAR“ Could be based on EXROUTE but user adaptation needed to required values: “BOLUS”, “INFUSION”, or “EXTRAVASCULAR“ (non-case sensitive)
DOSE Actual dose amount Actual dose amount given with previous dose (absolute amount) in DOSEUNIT unit. EXDOSE (potentially conversion to absolute dose needed)
DOSEUNIT Dose unit Unit of DOSE (always absolute units only allowed: g, mg, ug, ng, pg) EXDOSU. Allowed values: “g”,“mg”,“ug”,“ng”,“pg”
Conditionally required
TAU Dosing interval for steady-state analysis Nominal dosing interval in TIMEUNIT time units for steady-state and first-dose analysis (SS, FD PROFTYPE) Required for profiles that are defined by PROFTYPE==”SS” or “FD”. Otherwise NA
ADUR Actual duration of administration Actual duration of administration in TIMEUNIT unit for ADM=“INFUSION” Required for IV infusion (ADM = “INFUSION”). Otherwise NA. Defined as difference of EXENDTC and PCRFTDTC. If EXENDTC is missing but PCRFTDTC is defined then ADUR=0. If PCRFTDTC is missing the ADUR=NA.
NDUR Nominal duration of administration Nominal (per protocol) duration of administration (>0: infusion, =0: bolus) in TIMEUNIT unit. Required for IV infusion (ADM = “INFUSION”). Otherwise NA.
Optional
VISIT Visit Name Visit Name If not available value set to NA. Main use is for annotation of the dataset
VISITNUM Visit Number Visit Number
PCTPT Nominal/protocol Time Point Name Nominal/protocol Time Point Name
PCDTC Date/Time of PK sampling Date and time of PK sampling.
EXSTDTC Start Date/Time of Treatment Date and time of treatment start (related to the profile measured).
Covariate
PERIOD Period of study Period of study in sequential or cross-over trials These are all considered covariates. They can be used for stratification of results. But also are displayed in a systematic manner in listings.
SEQUENCE Sequence Sequence of treatment in cross-over studies
COUNTRY Country identifier Country identifier for reporting purposes
SITEID Site identifier Site identifier for reporting purposes
AGE Age in years Age of subject in years
SEX Sex Gender of subject
RACE Race Race of subject
Index/Flag/Comment
IX Index of profile observations 1:N – numbering of profile records. The use of the index is mainly the selection of records for slope calculation

1:N based on time information, starting with 1 for each profile (PROFILE) within a subject (USUBJID)

This column is always derived even if present already. It is defined based on TIME! So it has to be generated after the derivation (and potential imputation) of the TIME column
COMPTYPE Type of compound Type of compound (“exogenous” or “endogenous”) User defined
IGNOREI Reason for ignoring the subject If not NA then the entry is assumed to define the reason why this subject is ignored in the analysis. This subject still appears in INDIVIDUAL listings and figures - but not in summary results. The entries in IGNOREI within a USUBJID and PROFILE combination have to be unique.
IGNORER Reason for ignoring the record If not NA then the entry is assumed to define the reason why this record is ignored in the totality of the analysis. Records ingored via IGNORER do not appear in listings and individual figures.
IGNORSUM Reason for ignoring record in summary and inferential procedures If not NA then the entry is assumed to define the reason why this record is ignored in the summary and inferential procedures.

User defined

Automatic definition: If TIME and/or ACONC is NA ACONC>=LLOQ for TIME<=0 and PROFTYPE=“SD” or “FD”
IGNORNCA Reason for ignoring record in NCA PK parameter determination If not NA then the entry is assumed to define the reason why this record is ignored in the calculation of the NCA PK parameters.
COMMENTR Record specific comments Record specific comments. Adding comments and imputation information as needed. Multiple comments can be present and then should be separated by “:::” for ease of later parsing and extraction. COMMENTR needs to include the reason for exclusion (if FLAGLTF/SUM/NCA is set to 0 for this record). Comments will be displayed in record listings. Default: "" (empty string) A recors is IGNORED from the whole analysis if the COMMENTR column contains “IGNORED RECORD”. In addition a reason should be provided as comment as well.
COMMENTI Individual subject specific comments Individual subject specific comments. Multiple comments can be present and then should be separated by “:::” for ease of later parsing and extraction. They can also be present in different rows for this subject. The aggregated information out of all rows in this subject and exploded by ::: would be applicable. If a subject is excluded from the analysis, COMMENTI needs to include the reason for exclusion. Comments will be displayed in record listings for the specific subject. Default: "" (empty string) A subjects is ignored from the whole analysis if the COMMENTI column containes “IGNORED SUBJECT”. In addition a reason should be provided as comment as well.
Time
ATAFD Actual time since first dose Absolute / actual time since first dose in TIMEUNIT units Either user provided or set based on ATIME
NTAFD Nominal time since first dose Absolute nominal / protocol time since first dose in TIMEUNIT units Either user provided or set based on NTIME
FLAGTIME Definition if actual or nominal time “actual”: actual time used in NCA PK parameter determination “nominal”: nominal time used in NCA PK parameter determination Default: “actual” It will be checked if ATIME is defined. If not “nominal” will be used.
FATIMIMP Time imputation flag Used in case if basis for analysis is actual time but some actual time point is missing. In this case the user can decide to impute this time point if the nominal time is available.“asis” or “nominal”. “asis”: if missing then it will not be imputed and not be considered in the analysis. If “nominal” then it will be imputed based on NTIME, NTAFD, NDUR and impact TIME, TAFD, and DUR

Default: “asis”

If imputation is done then COMMENTR for these records needs to be set
TAFD Analysis time since first dose Analysis time derived/selected from ATAFD and NTAFD in TIMEUNIT units.

Derived based on ATAFD and NTAFD, depending on flag FLAGTIME

Imputation of actual time with nominal time possible if NTAFD defined
TIME Analysis relative time to previous dose Analysis time derived/selected from ATIME and NTIME in TIMEUNIT units. Used for NCA PK parameter determination.

Derived based on ATIME and NTIME, depending on flag FLAGTIME

Imputation of actual time with nominal time possible if NTIME defined
DUR Analysis administration duration Administration duration used in analysis, derived based on ADUR and NDUR in TIMEUNIT unit. Used for NCA PK parameter determination when ADM=“INFUSION” Derived based on ADUR and NDUR depending on flag FLAGTIME
Blloq
BLLOQ BLLOQ flag BLLOQ flag, 1=value is <LLOQ, 0 value indicates >=LLOQ as.numeric(ACONC lessthan LLOQ)
BLLOQPR Pre first >=LLOQ value Flag indicating BLLOQ values before first non-BLLOQ value Derived based on ACONC and LLOQ. NA BLLOQ (due to ACONC NA or BLLOQ “missing” records not considered). Ignored records (general and NCA) will not be considered.
BLLOQIN BLLOQ inbetween >=LLOQ values Flag indicating BLLOQ values inbetween non-BLLOQ values or a single trailing BLLOQ value.
BLLOQP1 First BLLOQ post last >=LLOQ value Flag indicating the first BLLOQ value after the last non-BLLOQ value if at least two BLLOQ values come in sequence.
BLLOQPO 2nd to last BLLOQ post last >= LLOQ Flag indicating the 2nd-last BLLOQ value after the last non-BLLOQ value
FLGBLQPR BLLOQ handling pre first >=LLOQ record Handling BLLOQ values before first observation above LLOQ. Options: “asis”, “0”, " LLOQ/2“,”LLOQ“,”missing". User definition. Default: “0”
FLGBLQIN BLLOQ handling inbetween >=LLOQ records Handling BLLOQ values between observations above LLOQ or a single BLLOQ value as last sample. Options: “asis”, “0”, " LLOQ/2“,” LLOQ “,”missing". User definition. Default: “missing”
FLGBLQP1 BLLOQ handling for first BLLOQ record post last >= LLOQ record Handling first BLLOQ value after last observation above LLOQ if at least 2 BLLOQ values come in sequence. Options: “asis”, “0”, " LLOQ/2“,” LLOQ “,”missing". User definition. Default: “LLOQ/2”
FLGBLQPO BLLOQ handling for BLLOQ records following first BLQ record post last >= LLOQ record Handling BLLOQ values after first BLLOQ post last observation above LLOQ. Options: “asis”, “0”, " LLOQ/2“,” LLOQ “,”missing". User definition. Default: “missing”
FGBQPLIN BLLOQ handling in plots lin Y axis Handling BLLOQ values in plots on linear Y axis. Options: “asis”, “0”, " LLOQ/2“,” LLOQ “,”missing“. In addition it can be defined”asCONC", which will use the same rules as defined for the NCA analysis with flags FLGBLQPR, FLGBLQIN, FLGBLQP1, FLGBLQPO User definition. Default: “asCONC”
FGBQPLOG BLLOQ handling in plots – log Y axis Handling BLLOQ values in plots on log Y axis. Options: “asis”, “0”, " LLOQ/2“,” LLOQ “,”missing“. In addition it can be defined”asCONC", which will use the same rules as defined for the NCA analysis with flags FLGBLQPR, FLGBLQIN, FLGBLQP1, FLGBLQPO
Concentration
CONC Analysis concentrations Analysis concentrations for summary tables and the NCA PK parameter determination. Derived based on ACONC and BLLOQ handling methods (BLLOQPR, BLLOQIN, BLLOQPO, FLGBLQPR, FLGBLQIN, FLGBLQPO) and potentially other considerations
CONCPLIN Concentration used in Lin Y plots Concentration used in plots with linear Y axis. This could be different from CONC, if desired. Derived based on ACONC and BLLOQ and FGBQPLIN.
CONCPLOG Concentration used in Log Y plots Concentration used in plots with log Y axis. This could be different from CONC, if desired. Derived based on ACONC and BLLOQ and FGBQPLOG.
Slope
FLGSLOPE Flag defining how slope determined Definition of how the terminal slope will be defined. 2 settings possible “bestslope” or “manual” Set based on user settings. Default: “bestslope”. If slope points for a subject defined then “manual” is set
SLOPETOL Tolerance for best slope calculation Tolerance setting for best slope calculation. Default: 0.0001
SLOPEPT Definition if used for slope calc 0: this point IS NOT included in the slope calculation 1: this point IS used in the slope calculation. Info set using the functions slope_IQRdataNCA
R2 R Squared Calculated by best or manual slope calculation. Definition repeated in IQRparamNCA_Specification_edit.xlsx