Translate this page into:
Diagnostic accuracy of the Yokohama system for reporting breast fine-needle aspiration cytology and breast imaging-reporting and data system in breast lesions
-
Received: ,
Accepted: ,
How to cite this article: Shakthivel V, Ahuja S, Zaheer S, Swarna S, Singh M, Chintamani C. Diagnostic accuracy of the Yokohama system for reporting breast fine-needle aspiration cytology and breast imaging-reporting and data system in breast lesions. CytoJournal. 2026;23:20. doi: 10.25259/Cytojournal_167_2025
Abstract
Objectives:
The Yokohama system standardizes fine-needle aspiration cytology (FNAC) reporting for breast lesions, while the breast imaging-reporting and data system (BI-RADS) categorizes lesions based on imaging risk assessment. This study aimed to evaluate the diagnostic accuracy of FNAC using the Yokohama System in correlation with BI-RADS classifications.
Material and Methods:
A retrospective analysis was conducted on 188 breast lesion cases that underwent FNAC and were categorized using the BI-RADS system. The breast FNAs were classified according to the Yokohama System and compared with histopathological diagnoses to calculate the risk of malignancy (ROM). The ROM for the BI-RADS categories was also determined. Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated for both.
Results:
According to the Yokohama classification, 5.9%, 52.1%, 4.8%, 8.5%, and 28.7% were classified as non-diagnostic, benign, atypical, suspicious, and malignant, respectively. The majority of cases were classified as BIRADS 3 (23.9%) and 5 (23.9%), followed by 4a (22.3%). Higher Yokohama and BI-RADS categories were more prevalent among malignant cases, with 64.9% in Yokohama category 5 and 51.9% in BI-RADS 5. Sensitivity was highest in Yokohama Scenario C (81.8%), while specificity (96.4%) and PPV (92.6%) were highest in Scenario A. When BI-RADS 4b, 4c, 5, and 6 were considered malignant, Scenario C had the highest sensitivity (87.0%) and NPV (90.4%), while Scenario B demonstrated the highest diagnostic accuracy (87.7%).
Conclusion:
FNAC using the Yokohama System and imaging-based BI-RADS classification both show high diagnostic accuracy for breast lesions. While their individual performance is comparable, further studies are needed to explore whether combining these modalities improves diagnostic outcomes in clinically indeterminate cases.
Keywords
Breast
Breast imaging-reporting and data system
Risk of malignancy
Sensitivity
Yokohama
INTRODUCTION
Female breast cancer has surpassed lung cancer as the most commonly diagnosed cancer, with an estimated 2.3 million new cases (11.7%), followed by lung (11.4%), as reported in the Global Cancer Statistics 2020.[1] In India, it contributes to 13.5% of all cancer cases and 10.6% of cancer-related deaths, with a cumulative risk of 2.81.[1] Despite advances in cancer care, late-stage diagnosis remains a significant challenge, particularly in developing countries. Early detection through effective screening and diagnostic methods is essential to improving outcomes. Fine-needle aspiration cytology (FNAC) has gained prominence among the diagnostic tools available for its simplicity, rapid results, and cost-effectiveness.
FNAC, while widely used, faces challenges such as variability in diagnostic accuracy due to differences in reporting standards, sample quality, and observer expertise.[2,3] To address these limitations, the International Academy of Cytology developed the Yokohama System for Reporting Breast FNAC. This standardized framework aims to enhance diagnostic accuracy, reduce inter-observer variability, and improve communication between cytopathologists and clinicians, thereby optimizing patient care.[4,5] Preliminary studies have demonstrated its potential, but most of the evidence is based on retrospective analyses, and prospective evaluations remain limited.
To bridge this gap, the current study aims to evaluate the diagnostic accuracy of the Yokohama System for Reporting Breast FNAC. It specifically seeks to compare the system’s findings with histopathological results to validate its accuracy. In addition, the study aims to explore the diagnostic utility of the Breast Imaging-Reporting and Data System (BI-RADS) in breast lesion evaluation, correlating it with histopathology. By assessing the risk of malignancy (ROM) for each diagnostic category under the Yokohama System and BI-RADS, the study intends to provide objective insights to support better clinical decision-making and optimize patient outcomes.
MATERIAL AND METHODS
Study design and setting
This cross-sectional study was conducted over a period of 18 months, in collaboration with the Departments of Pathology, Surgery, and Radiology. The study was done after approval of the Institutional Ethics Committee (Vardhman Mahavir Medical College and Safdarjung Hospital, New Delhi) vide no. IEC/VMMC/SJH/Thesis/2023-03/CC-245 and in accordance with the Declaration of Helsinki. Patients presenting with breast lesions who underwent FNAC followed by biopsy/resection were included. Patients who had received neo-adjuvant therapy were excluded. All eligible breast cancer patients were included, and cases were selected consecutively. In this study, FNAC was performed in 188 cases, of which 155 cases had subsequent histopathology reports available for correlation. In 33 cases, FNAC was done without a follow-up histopathology report due to a lack of biopsy/resection. In addition, 12 cases had histopathological reports available, but did not undergo FNAC. FNAC was reported without knowledge of histopathological findings to ensure blinding. Similarly, histopathology was reported without prior knowledge of FNAC results to maintain objectivity and prevent bias. The baseline clinical and lesion characteristics (such as age, lesion size, and clinical symptoms) of the 33 cases without histopathology follow-up were not systematically different from those included (n = 155). These cases were excluded solely due to a lack of histopathology follow-up and not because of any intrinsic clinical or pathological differences.
Sample size calculation
The sample size was calculated to determine the diagnostic accuracy of FNAC in detecting malignancy, using histopathology as the gold standard. The Wilson Score Interval method was used for a proportion-based accuracy study. Assuming a 95% confidence interval, an expected accuracy of 90% based on Ahuja and Malviya, and an absolute error margin of 5%, the required sample size was estimated using the formula:[6] −
Where Z = 1.96 (for 95% confidence), accuracy = 0.90, and d = 0.05. Substituting values, the calculated sample size was 138.2, which was adjusted for a 10% margin of error, resulting in a final sample size of 153, rounded to 155 participants. The calculation assumes a proportion-based estimation of diagnostic accuracy using FNAC and histopathology results, application of the Wilson Score Interval method for confidence interval estimation, random sampling of patients presenting with breast lesions, independence of observations, and a sufficiently large sample to ensure a precise and stable estimate of diagnostic accuracy.
BI-RADS and Yokohama system classification
BI-RADS Classification includes 7 categories (0 = incomplete, 1 = negative, 2 = benign, 3 = probably benign, 4 = suspicious [4a: low suspicion; 4b: moderate suspicion; 4c: high suspicion], 5 = highly suggestive of malignancy, and 6 = proven malignancy).[7,8] Yokohama System includes 5 categories (1 = insufficient, 2 = benign, 3 = atypical, 4 = suspicious, and 5 = malignant).[9]
Data collection and preparation
Clinical details were obtained from patient records. Mammography was performed to assign the BI-RADS score. FNAC was done following standard procedures after obtaining informed consent. FNAC was performed using a 22-23-gauge disposable needle attached to a 10 mL syringe. The puncture site was selected based on clinical and radiological guidance, with preference for the most representative and accessible area of the lesion. Multiple passes (typically 2-3) were made to ensure adequate sampling. The aspirated material was expelled onto clean glass slides, smeared, and immediately fixed in 95% ethanol for Papanicolaou (Hi-Media; RM5375; India) staining, while air-dried smears were prepared for May Grunwald Giemsa (Hi-Media; GRM945; India) staining. In cases where sample adequacy was uncertain, rapid on-site evaluation (ROSE) was employed when feasible. Biopsy specimens were fixed in formalin, embedded in paraffin, and stained with Hematoxylin and Eosin (H&E, Biolab; CY1577; India). When necessary, additional immunohistochemical staining was performed. However, in the present study, the criteria for defining “when necessary” were not pre-specified. The cases that underwent immunohistochemistry typically included those with atypical cytological features or where the nature of the lesion remained uncertain after routine staining. The specific markers applied (such as Estrogen receptor, Progesterone receptor, HER2, and Cytokeratin 5/6) were chosen on a case-by-case basis, but the exact selection strategy and interpretation criteria were not standardized. Furthermore, the study did not analyze the incremental impact of immunohistochemistry on the diagnostic accuracy of either the Yokohama System or BI-RADS classifications.
Data analysis
For analysis, cytological classifications were grouped into three categories based on their degree of malignancy suspicion to facilitate comparison with histopathological diagnoses (Scenario A: malignant category considered positive, Scenario B: malignant and suspicious category considered positive, and Scenario C: malignant, suspicious, and atypical category considered positive). This categorization assessed the impact of expanding diagnostic thresholds on diagnostic accuracy calculations. All statistical analyses were performed using the Statistical Package for the Social Sciences version 26.0. Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and diagnostic accuracy were calculated using histopathological results as the gold standard. The ROM was calculated for each diagnostic category. Similarly, the diagnostic accuracy of BI-RADS scores was assessed using the same criteria.
RESULTS
Demographics and lesion distribution
A total of 188 patients were enrolled in the study, with ages ranging from 14-75 years (mean age: 35.3 years). The cohort predominantly consisted of younger patients, with the majority falling into the age groups of 14-30 years (40.4%) and 31-45 years (38.3%). The sex distribution was predominantly female (99.5%), with one male patient diagnosed with invasive papillary carcinoma.
Malignant lesions were more commonly observed in patients aged ≥45 years, with the mean age for malignant lesions being 44.5 years, compared to a mean age of 28.9 years for benign lesions. This age difference between benign and malignant lesions was statistically significant, highlighting the role of age as a factor in breast cancer diagnosis.
Lesion characteristics
Histopathology results showed that, out of the 188 lesions, 77 (41%) were malignant, while 111 (59%) were benign. Among the benign lesions, the most frequent types were fibroadenomas, which accounted for 33.5% of the total cases. Other benign lesions included benign ductal hyperplasia (11.7%) and fibroadenomatoid hyperplasia (6.9%). The most common malignant lesions were invasive ductal carcinoma (IDC) (27.1%) and intraductal carcinoma (21.1%).
The breakdown of the malignant lesions included both in situ and invasive cancers, with 45.4% of malignant lesions being invasive and the remaining 54.6% being in situ, which reflects the varied presentation of breast malignancies. Histopathology findings confirmed the presence of various tumor subtypes, including IDC, invasive lobular carcinoma, and invasive papillary carcinoma, with IDC being the most prevalent.
BI-RADS and Yokohama classification
According to the Yokohama classification, 5.9% (11 cases) of lesions were classified as non-diagnostic (Category 1), 52.1% (98 cases) as benign (Category 2), while 4.8% (9 cases) were categorized as atypical (Category 3), 8.5% (16 cases) as suspicious (Category 4), and 28.7% (54 cases) as malignant (Category 5).
The majority of cases were classified as BI-RADS 3 (23.9%) and 5 (23.9%), followed by 4a (22.3%). Categories 4b (10.6%) and 2 (9.0%) were less common, while 4c (5.3%) and 6 (4.8%) had the lowest frequencies. No cases were assigned to categories 0 or 1 [Figures 1 and 2].

- Cytopathological, breast imaging-reporting and data system (BI-RADS), histopathological images in a case of fibroadenoma. (a) Smears exhibit monolayered sheet of ductal cells with overlying myoepithelial cells in a case of fibroadenoma (Yokohama category 2) (scale bar: 50 µm; 400x, MayGrünwald-Giemsa), (b) Radiological images show a BI-RADS 3 lesion – probably benign, (c) Sections show an intracanalicular pattern with compressed ducts surrounded by proliferating stroma (scale bar: 100 µm; 200x, Hematoxylin and eosin).

- Cytopathological, breast imaging-reporting and data system (BI-RADS), histopathological images in a case of ductal carcinoma. (a) Smears exhibit a discohesive cluster of atypical ductal cells with moderate pleomorphism in a case of ductal carcinoma (Yokohama category 5) (scale bar: 50 µm; 400x, May-Grünwald-Giemsa), (b) Radiological images show BI-RADS 5 lesion, highly suggestive of malignancy, (c) Sections show sheets and clusters of atypical ductal cells with enlarged hyperchromatic nuclei, moderate cytoplasm, and prominent nucleoli (scale bar: 100 µm; 200x, Hematoxylin and eosin).
Diagnostic performance of FNAC
FNAC was performed on all lesions, with the results compared against histopathology as the gold standard. On histopathology, 111 (59%) cases turned out to be non-malignant, and the remaining 77 (41%) were malignant. Among patients with malignant histopathology diagnosis, the higher Yokohama category was more prevalent than the lower Yokohama category. A total of 50 (64.9%) patients with malignant diagnoses in histopathology were in the Yokohama category 5. Among patients with malignant histopathology diagnosis, 5 (6.5%) belonged to Category 1 of the Yokohama classification. 9 (11.7%) belonged to Category 2, whereas 3 (3.9%) and 10 (13.0%) belonged to Categories 3 and 4, respectively.
Among patients with non-malignant histopathology diagnosis, 6 (5.4%) belonged to Category 1 of the Yokohama classification. 89 (80.2%) belonged to Category 2, whereas 6 (5.4%) belonged to Categories 3 and 4, respectively. Only 4 (3.6%) belonged to Category 5. The ROM for the Yokohama system was 45.5%, 9.2%, 33.3%, 62.5%, and 92.6% in Categories 1, 2, 3, 4, and 5, respectively [Table 1].
| Yokohama category | Histopathological diagnosis (%) | Total cases (%) | Histologically confirmed malignant cases | Risk of malignancy (%) |
|---|---|---|---|---|
| 1 | 6 (5.4%) NonMalignant, 5 (6.5%) Malignant | 11 (5.9) | 5 | 45.5 |
| 2 | 89 (80.2%) NonMalignant, 9 (11.7%) Malignant | 98 (52.1) | 9 | 9.2 |
| 3 | 6 (5.4%) NonMalignant, 3 (3.9%) Malignant | 9 (4.8) | 3 | 33.3 |
| 4 | 6 (5.4%) NonMalignant, 10 (13.0%) Malignant | 16 (8.5) | 10 | 62.5 |
| 5 | 4 (3.6%) NonMalignant, 50 (64.9%) Malignant | 54 (28.7) | 50 | 92.6 |
| Total | 111 (100.0%) NonMalignant, 77 (100.0%) Malignant | 188 (100.0) | - | - |
The analysis of false positives and false negatives revealed that FNAC yielded false-positive results in 4 cases (3.6%), wherein lesions classified as malignant by FNAC were found to be benign on histopathology. These cases primarily consisted of fibroadenomas with atypical features and sclerosing adenosis, both of which can mimic malignancy cytologically. Cytologically, these cases exhibited high cellularity with overlapping epithelial fragments, mild nuclear pleomorphism, and occasional prominent nucleoli, features that led to their categorization as malignant on FNAC. These findings highlight specific benign entities that may mimic carcinoma on cytology, particularly in the absence of architectural correlation. Inclusion of these entities in malignant reporting thresholds lowered the specificity to 96.4%, underscoring the importance of cautious interpretation when such cytomorphological overlaps are present.
False-negative results were observed in 9 cases (11.7%), where lesions initially categorized as benign by FNAC were later confirmed to be malignant on histopathology. The predominant factors contributing to false-negative diagnoses included sampling errors, low cellularity of aspirates, and the presence of well-differentiated carcinomas that exhibited minimal cytological atypia.
A higher BI-RADS score was more prevalent among malignant cases. No malignancies were observed in BIRADS Categories 0, 1, or 2. The distribution of cases across BIRADS scores has been detailed in Table 2. The ROM for each BI-RADS category was 0.0% for Category 2, 6.7% for Category 3, 16.7% for Category 4a, 40.0% for Category 4b, 100% for Category 4c, 88.9% for Category 5, and 100% for Category 6 [Table 2].
| BIRADS score | Histopathological diagnosis (%) | Total cases (%) | Histologically confirmed malignant cases | Risk of malignancy (%) |
|---|---|---|---|---|
| 0 | 0 (0.0%) NonMalignant, 0 (0.0%) Malignant | 0 (0.0) | NA | NA |
| 1 | 0 (0.0%) NonMalignant, 0 (0.0%) Malignant | 0 (0.0) | NA | NA |
| 2 | 17 (15.3%) NonMalignant, 0 (0.0%) Malignant | 17 (15.3) | 0 | 0.0 |
| 3 | 42 (37.8%) NonMalignant, 3 (3.9%) Malignant | 45 (23.9) | 3 | 6.7 |
| 4a | 35 (31.5%) NonMalignant, 7 (9.1%) Malignant | 42 (22.3) | 7 | 16.7 |
| 4b | 12 (10.8%) NonMalignant, 8 (10.4%) Malignant | 20 (10.6) | 8 | 40.0 |
| 4c | 0 (0.0%) NonMalignant, 10 (13.0%) Malignant | 10 (5.3) | 10 | 100.0 |
| 5 | 5 (4.5%) NonMalignant, 40 (51.9%) Malignant | 45 (23.9) | 40 | 88.9 |
| 6 | 0 (0.0%) NonMalignant, 9 (11.7%) Malignant | 9 (4.8) | 9 | 100.0 |
| Total | 111 (100.0%) NonMalignant, 77 (100.0%) Malignant | 188 (100.0) | - | - |
BI-RADS: Breast imaging-reporting and data system
When using the Yokohama classification, sensitivity was highest for Scenario C, with a value of 81.8%. Specificity was highest in Scenario A with 96.4%. Maximum PPV was achieved by Scenario A with a value of 92.6%, and maximum NPV was achieved by Scenario C with a value of 87.2%. The Scenario with the highest diagnostic accuracy was Scenario B with 85.6% [Table 3].
| Yokohama classification | Scenario A (%) | Scenario B (%) | Scenario C(%) |
|---|---|---|---|
| Sensitivity | 64.9 (53.275.5) | 77.9 (67.086.6) | 81.8 (71.489.7) |
| Specificity | 96.4 (91.099.0) | 90.9 (84.195.6) | 85.6 (77.691.5) |
| PPV | 92.6 (82.597.1) | 85.7 (76.691.6) | 79.7 (71.286.2) |
| NPV | 79.8 (74.584.3) | 85.6 (79.590.1) | 87.2 (80.891.6) |
| Diagnostic accuracy | 83.5 (77.488.5) | 85.6 (79.890.3) | 84.0 (78.088.9) |
Scenario A (Yokohama category 5 is considered malignant), Scenario B (Yokohama categories 5 and 4 are considered malignant), Scenario C (Yokohama categories 5, 4, and 3 are considered malignant). PPV: Positive predictive value, NPV: Negative predictive value
The highest sensitivity and NPV (87.0% and 90.4%) were seen in Scenario C when BI-RADS scores 4b, 4c, 5, and 6 are considered malignant. The specificity was maximum in both Scenarios A and B, with 95.5% each. Maximum PPV was seen in Scenario B with 92.2%. Maximum diagnostic accuracy was seen in Scenario B, which was 87.7% [Table 4].
DISCUSSION
The “triple test” approach, combining clinical examination, imaging, and pathological analysis, has long been recognized as a cornerstone in the evaluation of breast lesions. This method ensures a comprehensive diagnostic process, optimizing the accuracy of breast cancer diagnosis and, ultimately, patient management. Conventionally, clinical examination, mammography, and FNAC have been integral to this diagnostic triad. More recently, ultrasound and core needle biopsy (CNB) have been incorporated into the evaluation of breast lesions, particularly for younger patients. This multi-modality approach enhances the sensitivity and specificity of diagnostic results, leading to more reliable outcomes, particularly in the context of malignant lesions.[10]
| BIRADS classification | Scenario A (%) | Scenario B (%) | Scenario C(%) |
|---|---|---|---|
| Sensitivity | 63.6 (51.974.3) | 76.6 (65.685.5) | 87.0 (77.493.6) |
| Specificity | 95.5 (89.898.5) | 95.5 (89.898.5) | 84.6 (76.690.8) |
| PPV | 90.7 (80.495.9) | 92.2 (83.296.6) | 79.7 (71.686.0) |
| NPV | 79.1 (73.783.6) | 85.5 (79.789.8) | 90.4 (84.094.4) |
| Diagnostic accuracy | 82.4 (76.287.6) | 87.7 (82.292.1) | 85.6 (79.890.3) |
Scenario A (BI-RADS scores 5 and 6 are considered malignant), Scenario B (BI-RADS scores 4c, 5, and 6 are considered malignant), Scenario C (BI-RADS scores 4b, 4c, 5, and 6 are considered malignant). BI-RADS: Breast Imaging-Reporting and Data System, PPV: Positive predictive value, NPV: Negative predictive value
FNAC remains a valuable diagnostic tool due to its simplicity, cost-effectiveness, and quick results. It is widely used in tertiary care settings where large numbers of core needle biopsies might be impractical. FNAC’s role extends beyond the identification of lesions; it can also be employed to assess hormone receptor status through immunohistochemistry, an essential aspect of breast cancer subtyping. However, while FNAC provides rapid results, CNB offers more detailed histological information, including tumor grading, which is essential for treatment planning. Although FNAC is favored for superficial and accessible lumps, CNB is often preferred for atypical, suspicious, or deep-seated lesions due to its ability to provide a more comprehensive evaluation of the lesion’s nature.[11]
A crucial aspect of the diagnostic process is the establishment of standardized reporting systems. As highlighted by previous studies, implementing a uniform system for categorizing breast lesions improves the communication between clinicians and pathologists, reducing the risk of diagnostic errors and improving patient outcomes. Both FNAC and CNB contribute significantly to the diagnostic workflow, with their roles tailored to the specific characteristics of the lesion and patient.
This study included 188 patients, predominantly females (99.5%). The average age of the patients was 35.3 years, with the majority presenting with benign lesions. Interestingly, malignancies were more frequently observed in older age groups, with an average age of 44.5 years for malignant cases. This finding aligns with established literature, supporting the notion that increasing age is a significant risk factor for breast cancer.[2,5,6] In contrast, the mean age for patients with non-malignant lesions was significantly lower, averaging 28.9 years.
The study utilized the Yokohama classification system, which was applied to categorize the lesions based on their cytological features. The majority of patients (52.1%) were classified into Yokohama category 2, which is consistent with the findings of previous studies (e.g., Wong et al., Ahuja and Malviya, De Rosa et al., McHugh et al., Montezuma et al., Nigam et al.).[2,6,12-15] In contrast, Yokohama category 3 (atypical) was the least common category (4.8%), which also mirrors the findings of other researchers. This classification was instrumental in understanding the spectrum of lesions encountered and their correlation with malignancy. A significant finding was that the sensitivity and specificity of the Yokohama system varied across its categories, with Scenario C (which includes atypical, suspicious, and malignant categories) showing the highest sensitivity (81.8%), while Scenario A (malignant cases) demonstrated the highest specificity (96.4%).
When comparing the sensitivity, specificity, PPV, and NPV of the Yokohama system in the current study with those reported in other studies, several similarities and differences emerged. The overall diagnostic accuracy of Scenario B (suspicious and malignant lesions) in this study was 85.6%, which is comparable to the findings of Wong et al., Ahuja and Malviya, De Rosa et al., and Nigam et al., who observed similar accuracy in their respective populations.[2,6,12,15] The PPV in Scenario A (malignant lesions) was 92.6%, consistent with the high specificity and the reduced likelihood of false positives, while Scenario C achieved the highest NPV (87.2%), signifying the reduced likelihood of malignancy when the test is negative [Table 5].
| Category included | Wong et al.[2] | Ahuja and Malviya[6] | De Rosa et al.[12] | McHugh et al.[13] | Montezuma et al.[14] | Agarwal et al.[11] |
Nigam et al.[15] | Presentstudy |
|---|---|---|---|---|---|---|---|---|
| Number of cases | 536 | 224 | 1616 | 199 | 755 | 299 | 123 | 188 |
| Only the malignant category is taken as positive (%) | ||||||||
| Sensitivity | 75.4 | 79.2 | 82.2 | 65.4 | 68.7 | 86.7 | 73.6 | 64.9 |
| Specificity | 100 | 100 | 97.8 | 95.9 | 100 | 100 | 98.5 | 96.4 |
| PPV | 100 | 100 | 98.8 | 91.1 | 100 | 100 | 97.5 | 92.6 |
| NPV | 80.7 | 90.9 | 71.0 | 81.1 | 87.7 | 71.2 | 82.1 | 79.8 |
| Accuracy | 87.9 | 93.2 | 87.0 | 83.9 | 90.3 | 90.0 | 87.3 | 83.5 |
| Suspicious of malignancy and malignant taken as positive (%) | ||||||||
| Sensitivity | 92.0 | 91.7 | 93.7 | 79.5 | 83.3 | 96.0 | 81.1 | 77.9 |
| Specificity | 97.8 | 98.7 | 90.8 | 85.1 | 99.8 | 91.9 | 95.4 | 90.9 |
| PPV | 97.6 | 97.1 | 95.8 | 77.5 | 99.5 | 97.3 | 93.5 | 85.7 |
| NPV | 92.7 | 96.1 | 86.6 | 86.6 | 93.0 | 88.3 | 86.1 | 85.6 |
| Accuracy | 95.0 | 96.4 | 92.8 | 82.9 | 94.7 | 95.0 | 88.9 | 85.6 |
| Atypical, suspicious, and malignant categories taken as positive (%) | ||||||||
| Sensitivity | 98.9 | 97.2 | 98.9 | 84.6 | 98.3 | 98.2 | 92.5 | 81.8 |
| Specificity | 62.1 | 86.0 | 46.3 | 75.2 | 54.8 | 59.5 | 81.5 | 85.6 |
| PPV | 71.7 | 77.0 | 80.5 | 68.8 | 49.2 | 88.0 | 80.3 | 79.7 |
| NPV | 98.3 | 98.5 | 95.1 | 88.3 | 98.6 | 91.7 | 92.9 | 87.2 |
| Accuracy | 80.2 | 89.6 | 82.7 | 78.9 | 68.2 | 88.6 | 86.4 | 84.0 |
PPV: Positive predictive value, NPV: Negative predictive value
The comparison of these results with those from other studies, such as those conducted by Wong et al. and McHugh et al., reinforces the reliability of the Yokohama system in the assessment of breast lesions, though the choice of categories influences diagnostic performance.[2,13] Notably, the sensitivity of the present study was lower than in some studies, such as those by Wong et al., where a higher sensitivity was achieved when multiple categories (including atypical and suspicious) were considered as positive.[2] This discrepancy may be attributed to differences in sample size, patient demographics, or regional variations in lesion characteristics.
In addition, the study addressed the issue of discordance between cytological and histopathological diagnoses, which can arise from inadequate samples or technical limitations in FNAC. Inadequate smears, such as those in the Yokohama category 1, may lead to false negatives or misclassification.
This study observed that 5/11 cases in the inadequate category (Yokohama category 1) were found to be malignant upon histopathological examination. This highlights the potential drawbacks of FNAC, particularly when sampling issues arise, especially in deep-seated or challenging lesions. To address this, the authors recommend the use of ROSE to improve sample adequacy and suggest that CNB might be a more reliable approach for patients with inadequate FNAC results, as it provides better tissue sampling.
Regarding the comparison of the BI-RADS system with histopathological diagnoses, the present study observed that higher BI-RADS scores were more likely to be associated with malignancy. The sensitivity and specificity of the BIRADS system in this study were comparable to those seen in other studies, with Scenario C (which includes 4b, 4c, 5, 6) achieving the highest sensitivity (100%) and NPV (100%). The highest diagnostic accuracy was seen in Scenario B, with a value of 87.7%. This reinforces the complementary nature of radiological and pathological assessments, emphasizing the importance of the “triple test” approach. The overall findings indicate that the BI-RADS system was more sensitive and had a better NPV than the Yokohama system. However, the Yokohama system demonstrated better specificity and PPV, making it particularly useful in confirming malignancy.
SUMMARY
In summary, this study highlights the comparable diagnostic performance of FNAC and imaging-based systems (BIRADS and Yokohama classifications) in evaluating breast lesions. While each modality demonstrates high individual accuracy, the current findings do not provide direct evidence of a complementary role. Rather, FNAC and imaging appear to offer similar diagnostic capabilities. However, the potential for complementarity remains relevant in clinical contexts where limitations in one modality, such as inadequate FNAC smears or indeterminate imaging, may be mitigated by another. For instance, false negatives in FNAC due to poor cellularity may be further evaluated using high BI-RADS scores that prompt additional tissue sampling. Conversely, equivocal imaging findings (e.g., BI-RADS 4a) may benefit from cytological correlation.
It should be emphasized that, knowing these benefits of using the triple test, it should be promoted more in LMICs. Given its ability to enhance diagnostic accuracy and improve patient outcomes, wider adoption in resource-limited settings could be crucial in early breast cancer detection and management. Integrating FNAC, CNB, and imaging techniques in routine clinical practice can significantly reduce diagnostic delays and ensure timely treatment interventions. Strengthening the implementation of the triple test in LMICs through capacity-building, training programs, and standardized reporting systems would enhance its accessibility and reliability, ultimately improving breast cancer care in these regions.
Therefore, although this study does not directly establish the compensatory value of combining modalities, it supports the rationale for a multi-modal diagnostic framework, particularly in ambiguous or borderline cases. Although in the present study, the sample size was sufficient to estimate overall diagnostic accuracy, it was not specifically powered for precision of sensitivity and specificity estimates. The relatively wide confidence intervals, particularly for sensitivity, highlight the need for larger, stratified samples in future studies to improve statistical precision in these key parameters. Future studies should investigate this potential complementarity more rigorously by analyzing discordant cases and evaluating how integration improves diagnostic confidence or alters clinical decision-making.
ACKNOWLEDGMENTS
Not applicable.
AVAILABILITY OF DATA AND MATERIALS
All data generated or analyzed during this study are included in this article. Further inquiries can be directed to the corresponding author.
ABBREVIATIONS
BIRADS: Breast imaging-reporting and data system
CNB: Core needle biopsy
FNAC: fine-needle aspiration cytology
IDC: Invasive ductal carcinoma
NPV: Negative predictive value
PPV: Positive predictive value
ROM: Risk of malignancy
ROSE: Rapid onsite evaluation
AUTHOR CONTRIBUTIONS
SV: Formal analysis, data curation; SA: Conceptualization, methodology, writing original draft; SZ: Conceptualization, methodology, formal analysis, resources, writing – review and editing, supervision; SS: Conceptualization, resources, writing – review and editing; MS: Resources, supervision, writing – review and editing; CC: Conceptualization, resources, writing – review and editing. All authors meet ICMJE authorship requirements.
ETHICS APPROVAL AND CONSENT TO PARTICIPATE
The study was done after approval of the Institutional Ethics Committee (VMMC and SJH) vide no. IEC/VMMC/SJH/Thesis/2023-03/CC-245. Written informed consent was taken from the participants to participate in the study.
CONFLICTS OF INTEREST
The authors declare no conflicts of interest.
EDITORIAL/PEER REVIEW
To ensure the integrity and highest quality of CytoJournal publications, the review process of this manuscript was conducted under a double-blind model (authors are blinded for reviewers and vice versa) through an automatic online system.
FUNDING: None.
References
- Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA A Cancer J Clin. 2021;71:209-49.
- [CrossRef] [PubMed] [Google Scholar]
- The international academy of cytology Yokohama system for reporting breast fine needle aspiration biopsy cytopathology: A single institutional retrospective study of the application of the system categories and the impact of rapid onsite evaluation. Acta Cytol. 2019;63:280-91.
- [CrossRef] [PubMed] [Google Scholar]
- IAC standardized reporting of breast fine-needle aspiration biopsy cytology. Acta Cytol. 2017;61:3-6.
- [CrossRef] [PubMed] [Google Scholar]
- The use of the international academy of cytology Yokohama system for reporting breast fine-needle aspiration biopsy. Am J Clin Pathol. 2023;159:138-45.
- [CrossRef] [PubMed] [Google Scholar]
- IAC standardized reporting of breast fine-needle aspiration cytology, Yokohama 2016: A critical appraisal over a 2 year period. Breast Dis. 2019;38:109-15.
- [CrossRef] [PubMed] [Google Scholar]
- Categorization of breast fine needle aspirates using the international academy of cytology Yokohama system along with assessment of risk of malignancy and diagnostic accuracy in a tertiary care centre. J Cytol. 2021;38:158-63.
- [CrossRef] [PubMed] [Google Scholar]
- Mammography BIRADS grading In: StatPearls. Treasure Island, FL: StatPearls Publishing; 2023. Available from: https://www.ncbi.nlm.nih.gov/books/nbk541075 [Last accessed on 2025 Sep 13]
- [Google Scholar]
- Breast imaging reporting and data system In: StatPearls. Treasure Island, FL: StatPearls Publishing; 2024. Available from: https://www.ncbi.nlm.nih.gov/books/nbk459169 [Last accessed on 2024 Oct 07]
- [Google Scholar]
- The international academy of cytology Yokohama system for reporting breast fine-needle aspiration biopsy cytopathology. Acta Cytol. 2019;63:257-73.
- [CrossRef] [Google Scholar]
- A modified triple test for palpable breast masses: The value of ultrasound and core needle biopsy. Ann Surg Oncol. 2013;20:850-5.
- [CrossRef] [PubMed] [Google Scholar]
- Accuracy of the International academy of cytology Yokohama system of breast cytology reporting for fine needle aspiration biopsy of the breast in a dedicated breast care setting. Diagn Cytopathol. 2021;49:195-202.
- [CrossRef] [PubMed] [Google Scholar]
- The continuing role of breast fine-needle aspiration biopsy after the introduction of the IAC Yokohama system for reporting breast fine needle aspiration biopsy cytopathology. Diagn Cytopathol. 2020;48:1244-53.
- [CrossRef] [PubMed] [Google Scholar]
- Concordance of breast fine needle aspiration cytology interpretation with subsequent surgical pathology: An 18-year review from a single subSaharan African institution. Cytopathology. 2019;30:519-25.
- [CrossRef] [PubMed] [Google Scholar]
- Breast fine needle aspiration biopsy cytology using the newly proposed IAC Yokohama system for reporting breast cytopathology: The experience of a single institution. Acta Cytol. 2019;63:1-6.
- [CrossRef] [PubMed] [Google Scholar]
- The International Academy of Cytology standardized reporting of breast fine-needle aspiration biopsy cytology: A 2 year's retrospective study with application of categories and their assessment for risk of malignancy. Cytojournal. 2021;18:27.
- [CrossRef] [PubMed] [Google Scholar]

