5 Data Specification

5.1 IQdataNCA Function

The function IQdataNCA takes the source data set and extends it by adding columns containing information about the handling of the data and the methods used in the NCA. In this way, all options and choices by the user about the analysis are stored in one place. This augmented data set, namely the IQdataNCA object, is then ready for the NCA parameter calculation.

5.2 IQdataNCA Object

The IQdataNCA object is the source data set augmented by a fixed number of columns which are added through the function. These added columns may be empty or contain default values if they cannot be derived from the data or user input. Note that some column names from the source data set need to be mapped beforehand according to the IQdataNCA specification.

The types of columns in the IQdataNCA object are:

Required: columns from the source data set which are minimally required by IQdataNCA,
Conditionally required: columns from the source data set, conditionally required depending on profile and administration type,
Optional: columns from the source data set used for annotation of the data set,
Covariate: columns from the source data set,
Index/Flag/Comment: defined by user input, these columns flag records which are to be ignored, with corresponding comments,
Time: columns containing time data, derived from the original time data of the source data set,
Blloq: columns containing information about the handling values below the LLOQ, defined by user input,
Concentration: columns containing concentration data, derived from the original concentration data of the source data set,
Slope: parameters calculated by the best/manual-slope algorithm
NCA: columns indicating the AUC calculation method to be used, and intervals for corresponding interval AUC calculation.

Table 5.1: **Columns of IQdataNCA object**
Column	Label	Description	Comment
Required
USUBJID	Unique subject identifier	Unique subject identifier	Always to be provided
STUDYID	Study ID	Study ID	Always to be provided
COMPOUND	Name of compound	Name of the compound	Always to be provided
ANALYTE	Name of analyte	Name of the analyte	Might be available in PCTEST
MATRIX	Matrix definition	Biological matrix	PCSPEC
PROFILE	Annotation of distinct profiles	Unique name for the same type of profiles to be analyzed together.	User defined - could be based on one or multiple columns in the source data.
PROFTYPE	Type of profile	“SD” for single dose, “FD” for first dose, “SS” for steady-state. “FD” is generally treated the same as “SD”. The main difference is that the follow-up will be limited by the dosing interval.	Always to be provided
GROUP	Treatment group information	Treatment group information, used for stratification and annotation in figures, tables, listings.	Might be available as ACTARMCD or ACTARM in adpc. Could also be constructed based on DOSE+DOSEUNIT and potentially other identifiers.
GROUPN	Treatment group dose information	Numeric representation for GROUP. In fixed dose dosing the GROUPN value should be identical to DOSE column. In relative dose dosing (e.g. X mg/kg or X ng/m2) GROUPN = X. The information is used for stratification and annotation in figures, tables, listings.	Always to be provided
GROUPU	Unit of GROUPN	Unit of GROUPN. The unit of the numeric group representation. E.g. “mg”, “mg/kg”, “mg/m2”, etc.	Always to be provided
DAY	Actual Study Day of PK sampling	Day of PK sampling	PCDAY
ATIME	Actual relative time to previous dose	Actual relative time of concentration observation in TIMEUNIT units.	Difference between PCDTC and PCRFTDTC in TIMEUNIT units. If some information is missing then set to NA (single records or all records). Imputation in certain cases can be done but for now should be done manually, entering info what was done into the COMMENTR column
NTIME	Nominal relative time to previous dose	Nominal / protocol relative time of concentration observations in TIMEUNIT units.	PCTPTNUM If not available set to NA (single records or all records)
TIMEUNIT	Time unit	Used time unit. Definition of the general time unit for analysis. Key time and duration columns need to be in this unit. “Minutes”, “Hours”, “Days”, “Weeks”	“Minutes”, “Hours”, “Days”, “Weeks” are currently valid entries
ACONC	Actual observed concentrations	Actual observed concentrations in CONCUNIT units. NA: missing. 0: BLLOQ. “Value”: actual value. If other alphanumeric character then interpret as no value available and set to NA To be used for listings. 0 then to be reported only as BLQ.	Based on ACONC=PCSTRESN ACONC[PCORRES==“BLQ“] = 0 If units allow then can also be determined by: ACONC = PCORRES ACONC[PCORRES==“BLQ”] = 0 as.numeric(ACONC)
CONCUNIT	Unit of observed concentrations	Unit of observed concentrations	PCSTRESU or PCORRESU
LLOQ	Lower limit of quantification	Lower limit of quantification in CONCUNIT units	PCLLOQ
ADM	Administration type	Administration type to be considered in NCA PK parameter determination. “BOLUS”, “INFUSION”, or “EXTRAVASCULAR“	Could be based on EXROUTE but user adaptation needed to required values: “BOLUS”, “INFUSION”, or “EXTRAVASCULAR“ (non-case sensitive)
DOSE	Actual dose	Actual dose given with previous dose (absolute amount or relative amount, e.g., per kg body weight or per m2 body surface area) in DOSEUNIT unit.	EXDOSE (absolute or relative dose)
DOSEUNIT	Dose unit	Unit of DOSE (accepted units of absolute doses are: g, mg, ug, ng, pg). Typical relative dose units are mg/kg or mg/m2. However, expressions other than kg or m2 are also accepted as long as the first part is an accepted absolute dose unit.	EXDOSU. Allowed values: “g”,“mg”,“ug”,“ng”,“pg”, and combinations thereof with body size descriptor for relative doses
Conditionally required
TAU	Dosing interval for steady-state analysis	Nominal dosing interval in TIMEUNIT time units for steady-state and first-dose analysis (SS, FD PROFTYPE)	Required for profiles that are defined by PROFTYPE==”SS” or “FD”. Otherwise NA
ADUR	Actual duration of administration	Actual duration of administration in TIMEUNIT unit for ADM=“INFUSION”	Required for IV infusion (ADM = “INFUSION”). Otherwise NA. Defined as difference of EXENDTC and PCRFTDTC. If EXENDTC is missing but PCRFTDTC is defined then ADUR=0. If PCRFTDTC is missing the ADUR=NA.
NDUR	Nominal duration of administration	Nominal (per protocol) duration of administration (>0: infusion, =0: bolus) in TIMEUNIT unit.	Required for IV infusion (ADM = “INFUSION”). Otherwise NA.
Optional
VISIT	Visit Name	Visit Name	If not available value set to NA. Main use is for annotation of the data set
VISITNUM	Visit Number	Visit Number	If not available value set to NA. Main use is for annotation of the data set
PCTPT	Nominal/protocol Time Point Name	Nominal/protocol Time Point Name	If not available value set to NA. Main use is for annotation of the data set
PCDTC	Date/Time of PK sampling	Date and time of PK sampling.	If not available value set to NA. Main use is for annotation of the data set
EXSTDTC	Start Date/Time of Treatment	Date and time of treatment start (related to the profile measured).	If not available value set to NA. Main use is for annotation of the data set
Covariate
PERIOD	Period of study	Period of study in sequential or cross-over trials	These are all considered covariates. They can be used for stratification of results. But also are displayed in a systematic manner in listings.
SEQUENCE	Sequence	Sequence of treatment in cross-over studies	These are all considered covariates. They can be used for stratification of results. But also are displayed in a systematic manner in listings.
COUNTRY	Country identifier	Country identifier for reporting purposes	These are all considered covariates. They can be used for stratification of results. But also are displayed in a systematic manner in listings.
SITEID	Site identifier	Site identifier for reporting purposes	These are all considered covariates. They can be used for stratification of results. But also are displayed in a systematic manner in listings.
AGE	Age in years	Age of subject in years	These are all considered covariates. They can be used for stratification of results. But also are displayed in a systematic manner in listings.
SEX	Sex	Gender of subject	These are all considered covariates. They can be used for stratification of results. But also are displayed in a systematic manner in listings.
RACE	Race	Race of subject	These are all considered covariates. They can be used for stratification of results. But also are displayed in a systematic manner in listings.
Index/Flag/Comment
IX	Index of profile observations	1:N – numbering of profile records. The use of the index is mainly the selection of records for slope calculation	1:N based on time information, starting with 1 for each profile (PROFILE) within a subject (USUBJID) This column is always derived even if present already. It is defined based on TIME! So it has to be generated after the derivation (and potential imputation) of the TIME column
COMPTYPE	Type of compound	Type of compound (“exogenous” or “endogenous”)	User defined
IGNOREI	Reason for ignoring the subject	If not NA then the entry is assumed to define the reason why this subject is ignored in the analysis. This subject still appears in INDIVIDUAL listings and figures - but not in summary results. The entries in IGNOREI within a USUBJID and PROFILE combination have to be unique.	User defined
IGNORER	Reason for ignoring the record	If not NA then the entry is assumed to define the reason why this record is ignored in the totality of the analysis. Records ingored via IGNORER do not appear in listings and individual figures.	User defined
IGNORSUM	Reason for ignoring record in summary and inferential procedures	If not NA then the entry is assumed to define the reason why this record is ignored in the summary and inferential procedures.	User defined Automatic definition: If TIME and/or ACONC is NA ACONC>=LLOQ for TIME<=0 and PROFTYPE=“SD” or “FD”
IGNORNCA	Reason for ignoring record in NCA PK parameter determination	If not NA then the entry is assumed to define the reason why this record is ignored in the calculation of the NCA PK parameters.	User defined Automatic definition: If TIME and/or ACONC is NA ACONC>=LLOQ for TIME<=0 and PROFTYPE=“SD” or “FD”
COMMENTR	Record specific comments	Record specific comments. Adding comments and imputation information as needed. Multiple comments can be present and then should be separated by “:::” for ease of later parsing and extraction. COMMENTR needs to include the reason for exclusion (if FLAGLTF/SUM/NCA is set to 0 for this record). Comments will be displayed in record listings.	Default: “” (empty string) A recors is IGNORED from the whole analysis if the COMMENTR column contains “IGNORED RECORD”. In addition a reason should be provided as comment as well.
COMMENTI	Individual subject specific comments	Individual subject specific comments. Multiple comments can be present and then should be separated by “:::” for ease of later parsing and extraction. They can also be present in different rows for this subject. The aggregated information out of all rows in this subject and exploded by ::: would be applicable. If a subject is excluded from the analysis, COMMENTI needs to include the reason for exclusion. Comments will be displayed in record listings for the specific subject.	Default: “” (empty string) A subjects is ignored from the whole analysis if the COMMENTI column containes “IGNORED SUBJECT”. In addition a reason should be provided as comment as well.
Time
ATAFD	Actual time since first dose	Absolute / actual time since first dose in TIMEUNIT units	Either user provided or set based on ATIME
NTAFD	Nominal time since first dose	Absolute nominal / protocol time since first dose in TIMEUNIT units	Either user provided or set based on NTIME
FLAGTIME	Definition if actual or nominal time	“actual”: actual time used in NCA PK parameter determination “nominal”: nominal time used in NCA PK parameter determination	Default: “actual” It will be checked if ATIME is defined. If not “nominal” will be used.
FATIMIMP	Time imputation flag	Used in case if basis for analysis is actual time but some actual time point is missing. In this case the user can decide to impute this time point if the nominal time is available.“asis” or “nominal”. “asis”: if missing then it will not be imputed and not be considered in the analysis. If “nominal” then it will be imputed based on NTIME, NTAFD, NDUR and impact TIME, TAFD, and DUR	Default: “asis” If imputation is done then COMMENTR for these records needs to be set
TAFD	Analysis time since first dose	Analysis time derived/selected from ATAFD and NTAFD in TIMEUNIT units.	Derived based on ATAFD and NTAFD, depending on flag FLAGTIME Imputation of actual time with nominal time possible if NTAFD defined
TIME	Analysis relative time to previous dose	Analysis time derived/selected from ATIME and NTIME in TIMEUNIT units. Used for NCA PK parameter determination.	Derived based on ATIME and NTIME, depending on flag FLAGTIME Imputation of actual time with nominal time possible if NTIME defined
DUR	Analysis administration duration	Administration duration used in analysis, derived based on ADUR and NDUR in TIMEUNIT unit. Used for NCA PK parameter determination when ADM=“INFUSION”	Derived based on ADUR and NDUR depending on flag FLAGTIME
Blloq
BLLOQ	BLLOQ flag	BLLOQ flag, 1=value is <LLOQ, 0 value indicates >=LLOQ	as.numeric(ACONC lessthan LLOQ)
BLLOQPR	Pre first >=LLOQ value	Flag indicating BLLOQ values before first non-BLLOQ value	Derived based on ACONC and LLOQ. NA BLLOQ (due to ACONC NA or BLLOQ “missing” records not considered). Ignored records (general and NCA) will not be considered.
BLLOQIN	BLLOQ inbetween >=LLOQ values	Flag indicating BLLOQ values inbetween non-BLLOQ values or a single trailing BLLOQ value.	Derived based on ACONC and LLOQ. NA BLLOQ (due to ACONC NA or BLLOQ “missing” records not considered). Ignored records (general and NCA) will not be considered.
BLLOQP1	First BLLOQ post last >=LLOQ value	Flag indicating the first BLLOQ value after the last non-BLLOQ value if at least two BLLOQ values come in sequence.	Derived based on ACONC and LLOQ. NA BLLOQ (due to ACONC NA or BLLOQ “missing” records not considered). Ignored records (general and NCA) will not be considered.
BLLOQPO	2nd to last BLLOQ post last >= LLOQ	Flag indicating the 2nd-last BLLOQ value after the last non-BLLOQ value	Derived based on ACONC and LLOQ. NA BLLOQ (due to ACONC NA or BLLOQ “missing” records not considered). Ignored records (general and NCA) will not be considered.
FLGBLQPR	BLLOQ handling pre first >=LLOQ record	Handling BLLOQ values before first observation above LLOQ. Options: “asis”, “0”, ” LLOQ/2”, “LLOQ”, “missing”.	User definition. Default: “0”
FLGBLQIN	BLLOQ handling inbetween >=LLOQ records	Handling BLLOQ values between observations above LLOQ or a single BLLOQ value as last sample. Options: “asis”, “0”, ” LLOQ/2”, ” LLOQ “,”missing”.	User definition. Default: “missing”
FLGBLQP1	BLLOQ handling for first BLLOQ record post last >= LLOQ record	Handling first BLLOQ value after last observation above LLOQ if at least 2 BLLOQ values come in sequence. Options: “asis”, “0”, ” LLOQ/2”, ” LLOQ “,”missing”.	User definition. Default: “LLOQ/2”
FLGBLQPO	BLLOQ handling for BLLOQ records following first BLQ record post last >= LLOQ record	Handling BLLOQ values after first BLLOQ post last observation above LLOQ. Options: “asis”, “0”, ” LLOQ/2”, ” LLOQ “,”missing”.	User definition. Default: “missing”
FGBQPLIN	BLLOQ handling in plots lin Y axis	Handling BLLOQ values in plots on linear Y axis. Options: “asis”, “0”, ” LLOQ/2”, ” LLOQ “,”missing”. In addition it can be defined “asCONC”, which will use the same rules as defined for the NCA analysis with flags FLGBLQPR, FLGBLQIN, FLGBLQP1, FLGBLQPO	User definition. Default: “asCONC”
FGBQPLOG	BLLOQ handling in plots – log Y axis	Handling BLLOQ values in plots on log Y axis. Options: “asis”, “0”, ” LLOQ/2”, ” LLOQ “,”missing”. In addition it can be defined “asCONC”, which will use the same rules as defined for the NCA analysis with flags FLGBLQPR, FLGBLQIN, FLGBLQP1, FLGBLQPO	User definition. Default: “asCONC”
Concentration
CONC	Analysis concentrations	Analysis concentrations for summary tables and the NCA PK parameter determination.	Derived based on ACONC and BLLOQ handling methods (BLLOQPR, BLLOQIN, BLLOQPO, FLGBLQPR, FLGBLQIN, FLGBLQPO) and potentially other considerations
CONCPLIN	Concentration used in Lin Y plots	Concentration used in plots with linear Y axis. This could be different from CONC, if desired.	Derived based on ACONC and BLLOQ and FGBQPLIN.
CONCPLOG	Concentration used in Log Y plots	Concentration used in plots with log Y axis. This could be different from CONC, if desired.	Derived based on ACONC and BLLOQ and FGBQPLOG.
Slope
FLGSLOPE	Flag defining how slope determined	Definition of how the terminal slope will be defined. 2 settings possible “bestslope” or “manual”	Set based on user settings. Default: “bestslope”. If slope points for a subject defined then “manual” is set
SLOPETOL	Tolerance for best slope calculation	Tolerance setting for best slope calculation.	Default: 0.0001
SLOPEPT	Definition if used for slope calc	0: this point IS NOT included in the slope calculation 1: this point IS used in the slope calculation.	Info set using the functions slope_IQRdataNCA
R2	R Squared	Calculated by best or manual slope calculation. Definition repeated in IQRparamNCA_Specification_edit.xlsx
R2ADJ	R Squared Adjusted	Calculated by best or manual slope calculation. Definition repeated in IQRparamNCA_Specification_edit.xlsx
LAMZNPT	Number of Points for Lambda z	Calculated by best or manual slope calculation. Definition repeated in IQRparamNCA_Specification_edit.xlsx
LAMZ	Lambda z	Calculated by best or manual slope calculation. Definition repeated in IQRparamNCA_Specification_edit.xlsx
LAMZICPT	Intercept of regression	Calculated by best or manual slope calculation. Definition repeated in IQRparamNCA_Specification_edit.xlsx
CORRXY	Correlation Between TimeX and Log ConcY	Calculated by best or manual slope calculation. Definition repeated in IQRparamNCA_Specification_edit.xlsx
LAMZLL	Lambda z Lower Limit	Calculated by best or manual slope calculation. Definition repeated in IQRparamNCA_Specification_edit.xlsx
LAMZUL	Lambda z Upper Limit	Calculated by best or manual slope calculation. Definition repeated in IQRparamNCA_Specification_edit.xlsx
CLSTP	Last Nonzero Conc Pred	Calculated by best or manual slope calculation. Definition repeated in IQRparamNCA_Specification_edit.xlsx
NCA
AUCMETHD	AUC calculation method	Definition of the AUC calculation method. “Linear Log”, “LinearUp LogDown”, “Linear LinearInterpolation”, “Linear LinearLogInterpolation”.	Same methods as available in Winnonlin (https://onlinehelp.certara.com/phoenix/8.2/topics/nca.htm#XREF_52753_Options_tab)
AUCINVAL	AUC interval definition	Definition of the AUC intervals to consider in the calculation. Example: “[0;24];[0;48]”	Same methods as available in Winnonlin (https://onlinehelp.certara.com/phoenix/8.2/topics/nca.htm#XREF_52753_Options_tab)