In the free-response ROC (FROC) paradigm (Barnes, Sones, and Tesic 1985) the observer TBA. FROC data consists of mark-rating pairs, where a mark a location that was considered suspicious for presence of a localized lesion and the rating is the confidence level. By adopting a proximity criterion, each mark is classified by the investigator as a lesion localization (LL) - if it is close to a real lesion - or a non-lesion localization (NL) otherwise.
The observer gives a single rating to each ROI, denoted \({{R}_{k}}\). The rating can be an integer or quasi- continuous (e.g., 0 – 100), or a floating point value, as long as higher numbers represent greater confidence in presence of one or more lesions in the ROI.
ROI-level-normal ratings are stored in the NL field and ROI-level-abnormal ratings are stored in the LL field.
One can think of the ROI paradimg as similar to the FROC paradigm, but with localization accuracy restricted to belonging to a region (one cannot distinguish muttiple lesions within a region). Unlike the FROC paradigm, a rating is required for every ROI.
An example simulated ROI dataset is included as dataset04.
str(dataset04)
#> List of 8
#> $ NL : num [1:5, 1:4, 1:200, 1:7] -Inf -Inf 1 -Inf -Inf ...
#> $ LL : num [1:5, 1:4, 1:100, 1:3] 4 5 4 5 4 3 5 4 4 3 ...
#> $ lesionNum : int [1:100] 1 1 1 1 1 1 1 1 1 1 ...
#> $ lesionID : num [1:100, 1:3] 1 1 1 1 1 1 1 1 1 1 ...
#> $ lesionWeight: num [1:100, 1:3] 1 1 1 1 1 1 1 1 1 1 ...
#> $ dataType : chr "FROC"
#> $ modalityID : Named chr [1:5] "1" "2" "3" "4" ...
#> ..- attr(*, "names")= chr [1:5] "1" "2" "3" "4" ...
#> $ readerID : Named chr [1:4] "1" "3" "4" "5"
#> ..- attr(*, "names")= chr [1:4] "1" "3" "4" "5"
dataset04$NL[1,1,1,]
#> [1] -Inf -Inf -Inf -Inf -Inf -Inf -Inf
mean(dataset04$NL[,,1:50,])
#> [1] -Inf
dataset04$NL[1,1,51,]
#> [1] -Inf -Inf -Inf -Inf -Inf -Inf -Inf
dataset04$lesionNum[1]
#> [1] 1
dataset04$LL[1,1,1,]
#> [1] 4 -Inf -Inf
x <- dataset04$LL;mean(x[is.finite(x)])
#> [1] 3.678532Examination of the output reveals that:
This is a 5-treatment 4-reader dataset, with 100 non-diseased cases and 100 diseased cases.
For treatment 1, reader 1, case 1 (the 1st non-diseased case) the 4 ratings are -, -, -, -, -, -, -. The mean of all ratings on non-diseased cases is -.
$lesionNum field is 1.The ratings of the 2 ROI-level-abnormal ROIs on this case are 4. The mean rating over all ROI-level-abnormal ROIs is 3.6785323.
An Excel file in JAFROC format containing simulated ROI data corresponding to dataset04, is included with the distribution. The first command (below) finds the location of the file and the second command reads it and saves it to a dataset object ds.
fileName <- system.file(
"extdata", "includedFrocData.xlsx", package = "RJafroc", mustWork = TRUE)
ds <- DfReadDataFile(fileName)
ds$dataType
#> [1] "FROC"The DfReadDataFile function automatically recognizes that this is an ROI dataset. Its structure is similar to the JAFROC format Excel file, with some important differences, noted below. It contains three worksheets:
Truth worksheet - this indicates which cases are diseased and which are non-diseased and the number of ROI-level-abnormal region on each case.
CaseID and 40 abnormal cases (labeled 51-90).LesionID field for each normal case (e.g., CaseID = 1) is zero and there is one row per case. For abnormal cases, this field has a variable number of entries, ranging from 1 to 4. As an example, there are two rows for CaseID = 51 in the Excel file: one with LesionID = 2 and one with LesionID = 3.Weights field is always zero (this field is not used in ROI analysis).
FP (or NL) worksheet - this lists the ratings of ROI-level-normal regions.
ReaderID = 1, ModalityID = 1 and CaseID = 1 there are 4 rows, corresponding to the 4 ROI-level-normal regions in this case. The corresponding ratings are . The pattern repeats for other treatments and readers, but the rating are, of course, different.CaseID is represented in the FP worksheet (a rare exception could occur if a case-level abnormal case has 4 abnormal regions).TP (or LL) worksheet - this lists the ratings of ROI-level-abnormal regions.
CaseID = 1-50 in the TP worksheet.CaseID in the TP worksheet is 51, which corresponds to the first abnormal case.CaseID in the Truth worksheet there were two entries with LesionID = 2 and 3. These must match the LesionID’s listed for this case in the TP worksheet. Complementing these two entries, in the FP worksheet for CaseID = 51, there are 2 entries corresponding to the two ROI-level-normal regions in this case.TP and FP worksheets is always 4.Barnes, Gary T., Richard A. Sones, and Mike M. Tesic. 1985. “Digital Chest Radiography: Performance Evaluation of a Prototype Unit.” Journal Article. Radiology 154: 801–6.