Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Filter by Categories
Abstracts
Book Review
Case Report
Case Series
CMAS‡ - Pancreas - EUS-FNA Cytopathology (PSC guidelines) S1:1 of 5
CMAS‡ - Pancreas - EUS-FNA Cytopathology (PSC guidelines) S1:3 of 5
CMAS‡ - Pancreas - EUS-FNA Cytopathology (PSC guidelines) S1:4 of 5
CMAS‡ - Pancreas -Sampling Techniques for Cytopathology (PSC guidelines) S1:2 of 5
CMAS‡ - Pancreas- EUS-FNA Cytopathology (PSC guidelines) S1:5 of 5
Commentary
Correction
CytoJournal Monograph Related Review Series
CytoJournal Monograph Related Review Series (CMAS), Editorial
CytoJournal Monograph Related Review Series: Editorial
Cytojournal Quiz Case
Editorial
Erratum
Letter to Editor
Letter to the Editor
Letters to Editor
Methodology
Methodology Article
Methodology Articles
Original Article
Pap Smear Collection and Preparation: Key Points
Quiz Case
Research
Research Article
Review
Review Article
Systematic Review and Meta Analysis
View Point
View/Download PDF

Translate this page into:

Research Article
2026
:23;
20
doi:
10.25259/Cytojournal_167_2025

Diagnostic accuracy of the Yokohama system for reporting breast fine-needle aspiration cytology and breast imaging-reporting and data system in breast lesions

Department of Pathology, Vardhman Mahavir Medical College and Safdarjung Hospital, New Delhi, India.
Department of Radiology, Vardhman Mahavir Medical College and Safdarjung Hospital, New Delhi, India.
Department of Surgery, Vardhman Mahavir Medical College and Safdarjung Hospital, New Delhi, India.
Author image
Corresponding author: Sufian Zaheer, Department of Pathology, Vardhman Mahavir Medical College and Safdarjung Hospital, New Delhi, India. sufianzaheer@gmail.com
Licence
This is an open-access article distributed under the terms of the Creative Commons Attribution-Non Commercial-Share Alike 4.0 License, which allows others to remix, transform, and build upon the work non-commercially, as long as the author is credited and the new creations are licensed under the identical terms.

How to cite this article: Shakthivel V, Ahuja S, Zaheer S, Swarna S, Singh M, Chintamani C. Diagnostic accuracy of the Yokohama system for reporting breast fine-needle aspiration cytology and breast imaging-reporting and data system in breast lesions. CytoJournal. 2026;23:20. doi: 10.25259/Cytojournal_167_2025

Abstract

Objectives:

The Yokohama system standardizes fine-needle aspiration cytology (FNAC) reporting for breast lesions, while the breast imaging-reporting and data system (BI-RADS) categorizes lesions based on imaging risk assessment. This study aimed to evaluate the diagnostic accuracy of FNAC using the Yokohama System in correlation with BI-RADS classifications.

Material and Methods:

A retrospective analysis was conducted on 188 breast lesion cases that underwent FNAC and were categorized using the BI-RADS system. The breast FNAs were classified according to the Yokohama System and compared with histopathological diagnoses to calculate the risk of malignancy (ROM). The ROM for the BI-RADS categories was also determined. Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated for both.

Results:

According to the Yokohama classification, 5.9%, 52.1%, 4.8%, 8.5%, and 28.7% were classified as non-diagnostic, benign, atypical, suspicious, and malignant, respectively. The majority of cases were classified as BIRADS 3 (23.9%) and 5 (23.9%), followed by 4a (22.3%). Higher Yokohama and BI-RADS categories were more prevalent among malignant cases, with 64.9% in Yokohama category 5 and 51.9% in BI-RADS 5. Sensitivity was highest in Yokohama Scenario C (81.8%), while specificity (96.4%) and PPV (92.6%) were highest in Scenario A. When BI-RADS 4b, 4c, 5, and 6 were considered malignant, Scenario C had the highest sensitivity (87.0%) and NPV (90.4%), while Scenario B demonstrated the highest diagnostic accuracy (87.7%).

Conclusion:

FNAC using the Yokohama System and imaging-based BI-RADS classification both show high diagnostic accuracy for breast lesions. While their individual performance is comparable, further studies are needed to explore whether combining these modalities improves diagnostic outcomes in clinically indeterminate cases.

Keywords

Breast
Breast imaging-reporting and data system
Risk of malignancy
Sensitivity
Yokohama

INTRODUCTION

Female breast cancer has surpassed lung cancer as the most commonly diagnosed cancer, with an estimated 2.3 million new cases (11.7%), followed by lung (11.4%), as reported in the Global Cancer Statistics 2020.[1] In India, it contributes to 13.5% of all cancer cases and 10.6% of cancer-related deaths, with a cumulative risk of 2.81.[1] Despite advances in cancer care, late-stage diagnosis remains a significant challenge, particularly in developing countries. Early detection through effective screening and diagnostic methods is essential to improving outcomes. Fine-needle aspiration cytology (FNAC) has gained prominence among the diagnostic tools available for its simplicity, rapid results, and cost-effectiveness.

FNAC, while widely used, faces challenges such as variability in diagnostic accuracy due to differences in reporting standards, sample quality, and observer expertise.[2,3] To address these limitations, the International Academy of Cytology developed the Yokohama System for Reporting Breast FNAC. This standardized framework aims to enhance diagnostic accuracy, reduce inter-observer variability, and improve communication between cytopathologists and clinicians, thereby optimizing patient care.[4,5] Preliminary studies have demonstrated its potential, but most of the evidence is based on retrospective analyses, and prospective evaluations remain limited.

To bridge this gap, the current study aims to evaluate the diagnostic accuracy of the Yokohama System for Reporting Breast FNAC. It specifically seeks to compare the system’s findings with histopathological results to validate its accuracy. In addition, the study aims to explore the diagnostic utility of the Breast Imaging-Reporting and Data System (BI-RADS) in breast lesion evaluation, correlating it with histopathology. By assessing the risk of malignancy (ROM) for each diagnostic category under the Yokohama System and BI-RADS, the study intends to provide objective insights to support better clinical decision-making and optimize patient outcomes.

MATERIAL AND METHODS

Study design and setting

This cross-sectional study was conducted over a period of 18 months, in collaboration with the Departments of Pathology, Surgery, and Radiology. The study was done after approval of the Institutional Ethics Committee (Vardhman Mahavir Medical College and Safdarjung Hospital, New Delhi) vide no. IEC/VMMC/SJH/Thesis/2023-03/CC-245 and in accordance with the Declaration of Helsinki. Patients presenting with breast lesions who underwent FNAC followed by biopsy/resection were included. Patients who had received neo-adjuvant therapy were excluded. All eligible breast cancer patients were included, and cases were selected consecutively. In this study, FNAC was performed in 188 cases, of which 155 cases had subsequent histopathology reports available for correlation. In 33 cases, FNAC was done without a follow-up histopathology report due to a lack of biopsy/resection. In addition, 12 cases had histopathological reports available, but did not undergo FNAC. FNAC was reported without knowledge of histopathological findings to ensure blinding. Similarly, histopathology was reported without prior knowledge of FNAC results to maintain objectivity and prevent bias. The baseline clinical and lesion characteristics (such as age, lesion size, and clinical symptoms) of the 33 cases without histopathology follow-up were not systematically different from those included (n = 155). These cases were excluded solely due to a lack of histopathology follow-up and not because of any intrinsic clinical or pathological differences.

Sample size calculation

The sample size was calculated to determine the diagnostic accuracy of FNAC in detecting malignancy, using histopathology as the gold standard. The Wilson Score Interval method was used for a proportion-based accuracy study. Assuming a 95% confidence interval, an expected accuracy of 90% based on Ahuja and Malviya, and an absolute error margin of 5%, the required sample size was estimated using the formula:[6]

N=Z2Accuracy1Accuracyd2

Where Z = 1.96 (for 95% confidence), accuracy = 0.90, and d = 0.05. Substituting values, the calculated sample size was 138.2, which was adjusted for a 10% margin of error, resulting in a final sample size of 153, rounded to 155 participants. The calculation assumes a proportion-based estimation of diagnostic accuracy using FNAC and histopathology results, application of the Wilson Score Interval method for confidence interval estimation, random sampling of patients presenting with breast lesions, independence of observations, and a sufficiently large sample to ensure a precise and stable estimate of diagnostic accuracy.

BI-RADS and Yokohama system classification

BI-RADS Classification includes 7 categories (0 = incomplete, 1 = negative, 2 = benign, 3 = probably benign, 4 = suspicious [4a: low suspicion; 4b: moderate suspicion; 4c: high suspicion], 5 = highly suggestive of malignancy, and 6 = proven malignancy).[7,8] Yokohama System includes 5 categories (1 = insufficient, 2 = benign, 3 = atypical, 4 = suspicious, and 5 = malignant).[9]

Data collection and preparation

Clinical details were obtained from patient records. Mammography was performed to assign the BI-RADS score. FNAC was done following standard procedures after obtaining informed consent. FNAC was performed using a 22-23-gauge disposable needle attached to a 10 mL syringe. The puncture site was selected based on clinical and radiological guidance, with preference for the most representative and accessible area of the lesion. Multiple passes (typically 2-3) were made to ensure adequate sampling. The aspirated material was expelled onto clean glass slides, smeared, and immediately fixed in 95% ethanol for Papanicolaou (Hi-Media; RM5375; India) staining, while air-dried smears were prepared for May Grunwald Giemsa (Hi-Media; GRM945; India) staining. In cases where sample adequacy was uncertain, rapid on-site evaluation (ROSE) was employed when feasible. Biopsy specimens were fixed in formalin, embedded in paraffin, and stained with Hematoxylin and Eosin (H&E, Biolab; CY1577; India). When necessary, additional immunohistochemical staining was performed. However, in the present study, the criteria for defining “when necessary” were not pre-specified. The cases that underwent immunohistochemistry typically included those with atypical cytological features or where the nature of the lesion remained uncertain after routine staining. The specific markers applied (such as Estrogen receptor, Progesterone receptor, HER2, and Cytokeratin 5/6) were chosen on a case-by-case basis, but the exact selection strategy and interpretation criteria were not standardized. Furthermore, the study did not analyze the incremental impact of immunohistochemistry on the diagnostic accuracy of either the Yokohama System or BI-RADS classifications.

Data analysis

For analysis, cytological classifications were grouped into three categories based on their degree of malignancy suspicion to facilitate comparison with histopathological diagnoses (Scenario A: malignant category considered positive, Scenario B: malignant and suspicious category considered positive, and Scenario C: malignant, suspicious, and atypical category considered positive). This categorization assessed the impact of expanding diagnostic thresholds on diagnostic accuracy calculations. All statistical analyses were performed using the Statistical Package for the Social Sciences version 26.0. Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and diagnostic accuracy were calculated using histopathological results as the gold standard. The ROM was calculated for each diagnostic category. Similarly, the diagnostic accuracy of BI-RADS scores was assessed using the same criteria.

RESULTS

Demographics and lesion distribution

A total of 188 patients were enrolled in the study, with ages ranging from 14-75 years (mean age: 35.3 years). The cohort predominantly consisted of younger patients, with the majority falling into the age groups of 14-30 years (40.4%) and 31-45 years (38.3%). The sex distribution was predominantly female (99.5%), with one male patient diagnosed with invasive papillary carcinoma.

Malignant lesions were more commonly observed in patients aged ≥45 years, with the mean age for malignant lesions being 44.5 years, compared to a mean age of 28.9 years for benign lesions. This age difference between benign and malignant lesions was statistically significant, highlighting the role of age as a factor in breast cancer diagnosis.

Lesion characteristics

Histopathology results showed that, out of the 188 lesions, 77 (41%) were malignant, while 111 (59%) were benign. Among the benign lesions, the most frequent types were fibroadenomas, which accounted for 33.5% of the total cases. Other benign lesions included benign ductal hyperplasia (11.7%) and fibroadenomatoid hyperplasia (6.9%). The most common malignant lesions were invasive ductal carcinoma (IDC) (27.1%) and intraductal carcinoma (21.1%).

The breakdown of the malignant lesions included both in situ and invasive cancers, with 45.4% of malignant lesions being invasive and the remaining 54.6% being in situ, which reflects the varied presentation of breast malignancies. Histopathology findings confirmed the presence of various tumor subtypes, including IDC, invasive lobular carcinoma, and invasive papillary carcinoma, with IDC being the most prevalent.

BI-RADS and Yokohama classification

According to the Yokohama classification, 5.9% (11 cases) of lesions were classified as non-diagnostic (Category 1), 52.1% (98 cases) as benign (Category 2), while 4.8% (9 cases) were categorized as atypical (Category 3), 8.5% (16 cases) as suspicious (Category 4), and 28.7% (54 cases) as malignant (Category 5).

The majority of cases were classified as BI-RADS 3 (23.9%) and 5 (23.9%), followed by 4a (22.3%). Categories 4b (10.6%) and 2 (9.0%) were less common, while 4c (5.3%) and 6 (4.8%) had the lowest frequencies. No cases were assigned to categories 0 or 1 [Figures 1 and 2].

Cytopathological, breast imaging-reporting and data system (BI-RADS), histopathological images in a case of fibroadenoma. (a) Smears exhibit monolayered sheet of ductal cells with overlying myoepithelial cells in a case of fibroadenoma (Yokohama category 2) (scale bar: 50 µm; 400x, MayGrünwald-Giemsa), (b) Radiological images show a BI-RADS 3 lesion – probably benign, (c) Sections show an intracanalicular pattern with compressed ducts surrounded by proliferating stroma (scale bar: 100 µm; 200x, Hematoxylin and eosin).
Figure 1:
Cytopathological, breast imaging-reporting and data system (BI-RADS), histopathological images in a case of fibroadenoma. (a) Smears exhibit monolayered sheet of ductal cells with overlying myoepithelial cells in a case of fibroadenoma (Yokohama category 2) (scale bar: 50 µm; 400x, MayGrünwald-Giemsa), (b) Radiological images show a BI-RADS 3 lesion – probably benign, (c) Sections show an intracanalicular pattern with compressed ducts surrounded by proliferating stroma (scale bar: 100 µm; 200x, Hematoxylin and eosin).
Cytopathological, breast imaging-reporting and data system (BI-RADS), histopathological images in a case of ductal carcinoma. (a) Smears exhibit a discohesive cluster of atypical ductal cells with moderate pleomorphism in a case of ductal carcinoma (Yokohama category 5) (scale bar: 50 µm; 400x, May-Grünwald-Giemsa), (b) Radiological images show BI-RADS 5 lesion, highly suggestive of malignancy, (c) Sections show sheets and clusters of atypical ductal cells with enlarged hyperchromatic nuclei, moderate cytoplasm, and prominent nucleoli (scale bar: 100 µm; 200x, Hematoxylin and eosin).
Figure 2:
Cytopathological, breast imaging-reporting and data system (BI-RADS), histopathological images in a case of ductal carcinoma. (a) Smears exhibit a discohesive cluster of atypical ductal cells with moderate pleomorphism in a case of ductal carcinoma (Yokohama category 5) (scale bar: 50 µm; 400x, May-Grünwald-Giemsa), (b) Radiological images show BI-RADS 5 lesion, highly suggestive of malignancy, (c) Sections show sheets and clusters of atypical ductal cells with enlarged hyperchromatic nuclei, moderate cytoplasm, and prominent nucleoli (scale bar: 100 µm; 200x, Hematoxylin and eosin).

Diagnostic performance of FNAC

FNAC was performed on all lesions, with the results compared against histopathology as the gold standard. On histopathology, 111 (59%) cases turned out to be non-malignant, and the remaining 77 (41%) were malignant. Among patients with malignant histopathology diagnosis, the higher Yokohama category was more prevalent than the lower Yokohama category. A total of 50 (64.9%) patients with malignant diagnoses in histopathology were in the Yokohama category 5. Among patients with malignant histopathology diagnosis, 5 (6.5%) belonged to Category 1 of the Yokohama classification. 9 (11.7%) belonged to Category 2, whereas 3 (3.9%) and 10 (13.0%) belonged to Categories 3 and 4, respectively.

Among patients with non-malignant histopathology diagnosis, 6 (5.4%) belonged to Category 1 of the Yokohama classification. 89 (80.2%) belonged to Category 2, whereas 6 (5.4%) belonged to Categories 3 and 4, respectively. Only 4 (3.6%) belonged to Category 5. The ROM for the Yokohama system was 45.5%, 9.2%, 33.3%, 62.5%, and 92.6% in Categories 1, 2, 3, 4, and 5, respectively [Table 1].

Table 1: Accuracy and risk of malignancy for Yokohama classification.
Yokohama category Histopathological diagnosis (%) Total cases (%) Histologically confirmed malignant cases Risk of malignancy (%)
1 6 (5.4%) NonMalignant, 5 (6.5%) Malignant 11 (5.9) 5 45.5
2 89 (80.2%) NonMalignant, 9 (11.7%) Malignant 98 (52.1) 9 9.2
3 6 (5.4%) NonMalignant, 3 (3.9%) Malignant 9 (4.8) 3 33.3
4 6 (5.4%) NonMalignant, 10 (13.0%) Malignant 16 (8.5) 10 62.5
5 4 (3.6%) NonMalignant, 50 (64.9%) Malignant 54 (28.7) 50 92.6
Total 111 (100.0%) NonMalignant, 77 (100.0%) Malignant 188 (100.0) - -

The analysis of false positives and false negatives revealed that FNAC yielded false-positive results in 4 cases (3.6%), wherein lesions classified as malignant by FNAC were found to be benign on histopathology. These cases primarily consisted of fibroadenomas with atypical features and sclerosing adenosis, both of which can mimic malignancy cytologically. Cytologically, these cases exhibited high cellularity with overlapping epithelial fragments, mild nuclear pleomorphism, and occasional prominent nucleoli, features that led to their categorization as malignant on FNAC. These findings highlight specific benign entities that may mimic carcinoma on cytology, particularly in the absence of architectural correlation. Inclusion of these entities in malignant reporting thresholds lowered the specificity to 96.4%, underscoring the importance of cautious interpretation when such cytomorphological overlaps are present.

False-negative results were observed in 9 cases (11.7%), where lesions initially categorized as benign by FNAC were later confirmed to be malignant on histopathology. The predominant factors contributing to false-negative diagnoses included sampling errors, low cellularity of aspirates, and the presence of well-differentiated carcinomas that exhibited minimal cytological atypia.

A higher BI-RADS score was more prevalent among malignant cases. No malignancies were observed in BIRADS Categories 0, 1, or 2. The distribution of cases across BIRADS scores has been detailed in Table 2. The ROM for each BI-RADS category was 0.0% for Category 2, 6.7% for Category 3, 16.7% for Category 4a, 40.0% for Category 4b, 100% for Category 4c, 88.9% for Category 5, and 100% for Category 6 [Table 2].

Table 2: Accuracy and risk of malignancy for BI-RADS classification.
BIRADS score Histopathological diagnosis (%) Total cases (%) Histologically confirmed malignant cases Risk of malignancy (%)
0 0 (0.0%) NonMalignant, 0 (0.0%) Malignant 0 (0.0) NA NA
1 0 (0.0%) NonMalignant, 0 (0.0%) Malignant 0 (0.0) NA NA
2 17 (15.3%) NonMalignant, 0 (0.0%) Malignant 17 (15.3) 0 0.0
3 42 (37.8%) NonMalignant, 3 (3.9%) Malignant 45 (23.9) 3 6.7
4a 35 (31.5%) NonMalignant, 7 (9.1%) Malignant 42 (22.3) 7 16.7
4b 12 (10.8%) NonMalignant, 8 (10.4%) Malignant 20 (10.6) 8 40.0
4c 0 (0.0%) NonMalignant, 10 (13.0%) Malignant 10 (5.3) 10 100.0
5 5 (4.5%) NonMalignant, 40 (51.9%) Malignant 45 (23.9) 40 88.9
6 0 (0.0%) NonMalignant, 9 (11.7%) Malignant 9 (4.8) 9 100.0
Total 111 (100.0%) NonMalignant, 77 (100.0%) Malignant 188 (100.0) - -

BI-RADS: Breast imaging-reporting and data system

When using the Yokohama classification, sensitivity was highest for Scenario C, with a value of 81.8%. Specificity was highest in Scenario A with 96.4%. Maximum PPV was achieved by Scenario A with a value of 92.6%, and maximum NPV was achieved by Scenario C with a value of 87.2%. The Scenario with the highest diagnostic accuracy was Scenario B with 85.6% [Table 3].

Table 3: Sensitivity, specificity, positive predictive value, negative predictive value, and diagnostic accuracy of Yokohama classification for Scenarios A, B, and C.
Yokohama classification Scenario A (%) Scenario B (%) Scenario C(%)
Sensitivity 64.9 (53.275.5) 77.9 (67.086.6) 81.8 (71.489.7)
Specificity 96.4 (91.099.0) 90.9 (84.195.6) 85.6 (77.691.5)
PPV 92.6 (82.597.1) 85.7 (76.691.6) 79.7 (71.286.2)
NPV 79.8 (74.584.3) 85.6 (79.590.1) 87.2 (80.891.6)
Diagnostic accuracy 83.5 (77.488.5) 85.6 (79.890.3) 84.0 (78.088.9)

Scenario A (Yokohama category 5 is considered malignant), Scenario B (Yokohama categories 5 and 4 are considered malignant), Scenario C (Yokohama categories 5, 4, and 3 are considered malignant). PPV: Positive predictive value, NPV: Negative predictive value

The highest sensitivity and NPV (87.0% and 90.4%) were seen in Scenario C when BI-RADS scores 4b, 4c, 5, and 6 are considered malignant. The specificity was maximum in both Scenarios A and B, with 95.5% each. Maximum PPV was seen in Scenario B with 92.2%. Maximum diagnostic accuracy was seen in Scenario B, which was 87.7% [Table 4].

DISCUSSION

The “triple test” approach, combining clinical examination, imaging, and pathological analysis, has long been recognized as a cornerstone in the evaluation of breast lesions. This method ensures a comprehensive diagnostic process, optimizing the accuracy of breast cancer diagnosis and, ultimately, patient management. Conventionally, clinical examination, mammography, and FNAC have been integral to this diagnostic triad. More recently, ultrasound and core needle biopsy (CNB) have been incorporated into the evaluation of breast lesions, particularly for younger patients. This multi-modality approach enhances the sensitivity and specificity of diagnostic results, leading to more reliable outcomes, particularly in the context of malignant lesions.[10]

Table 4: Sensitivity, specificity, positive predictive value, negative predictive value, and diagnostic accuracy of BI-RADS classification- Scenarios A, B, C.
BIRADS classification Scenario A (%) Scenario B (%) Scenario C(%)
Sensitivity 63.6 (51.974.3) 76.6 (65.685.5) 87.0 (77.493.6)
Specificity 95.5 (89.898.5) 95.5 (89.898.5) 84.6 (76.690.8)
PPV 90.7 (80.495.9) 92.2 (83.296.6) 79.7 (71.686.0)
NPV 79.1 (73.783.6) 85.5 (79.789.8) 90.4 (84.094.4)
Diagnostic accuracy 82.4 (76.287.6) 87.7 (82.292.1) 85.6 (79.890.3)

Scenario A (BI-RADS scores 5 and 6 are considered malignant), Scenario B (BI-RADS scores 4c, 5, and 6 are considered malignant), Scenario C (BI-RADS scores 4b, 4c, 5, and 6 are considered malignant). BI-RADS: Breast Imaging-Reporting and Data System, PPV: Positive predictive value, NPV: Negative predictive value

FNAC remains a valuable diagnostic tool due to its simplicity, cost-effectiveness, and quick results. It is widely used in tertiary care settings where large numbers of core needle biopsies might be impractical. FNAC’s role extends beyond the identification of lesions; it can also be employed to assess hormone receptor status through immunohistochemistry, an essential aspect of breast cancer subtyping. However, while FNAC provides rapid results, CNB offers more detailed histological information, including tumor grading, which is essential for treatment planning. Although FNAC is favored for superficial and accessible lumps, CNB is often preferred for atypical, suspicious, or deep-seated lesions due to its ability to provide a more comprehensive evaluation of the lesion’s nature.[11]

A crucial aspect of the diagnostic process is the establishment of standardized reporting systems. As highlighted by previous studies, implementing a uniform system for categorizing breast lesions improves the communication between clinicians and pathologists, reducing the risk of diagnostic errors and improving patient outcomes. Both FNAC and CNB contribute significantly to the diagnostic workflow, with their roles tailored to the specific characteristics of the lesion and patient.

This study included 188 patients, predominantly females (99.5%). The average age of the patients was 35.3 years, with the majority presenting with benign lesions. Interestingly, malignancies were more frequently observed in older age groups, with an average age of 44.5 years for malignant cases. This finding aligns with established literature, supporting the notion that increasing age is a significant risk factor for breast cancer.[2,5,6] In contrast, the mean age for patients with non-malignant lesions was significantly lower, averaging 28.9 years.

The study utilized the Yokohama classification system, which was applied to categorize the lesions based on their cytological features. The majority of patients (52.1%) were classified into Yokohama category 2, which is consistent with the findings of previous studies (e.g., Wong et al., Ahuja and Malviya, De Rosa et al., McHugh et al., Montezuma et al., Nigam et al.).[2,6,12-15] In contrast, Yokohama category 3 (atypical) was the least common category (4.8%), which also mirrors the findings of other researchers. This classification was instrumental in understanding the spectrum of lesions encountered and their correlation with malignancy. A significant finding was that the sensitivity and specificity of the Yokohama system varied across its categories, with Scenario C (which includes atypical, suspicious, and malignant categories) showing the highest sensitivity (81.8%), while Scenario A (malignant cases) demonstrated the highest specificity (96.4%).

When comparing the sensitivity, specificity, PPV, and NPV of the Yokohama system in the current study with those reported in other studies, several similarities and differences emerged. The overall diagnostic accuracy of Scenario B (suspicious and malignant lesions) in this study was 85.6%, which is comparable to the findings of Wong et al., Ahuja and Malviya, De Rosa et al., and Nigam et al., who observed similar accuracy in their respective populations.[2,6,12,15] The PPV in Scenario A (malignant lesions) was 92.6%, consistent with the high specificity and the reduced likelihood of false positives, while Scenario C achieved the highest NPV (87.2%), signifying the reduced likelihood of malignancy when the test is negative [Table 5].

Table 5: Comparison of the performance analysis of the Yokohama system of previous studies.
Category included Wong et al.[2] Ahuja and Malviya[6] De Rosa et al.[12] McHugh et al.[13] Montezuma et al.[14] Agarwal
et al.[11]
Nigam et al.[15] Presentstudy
Number of cases 536 224 1616 199 755 299 123 188
Only the malignant category is taken as positive (%)
  Sensitivity 75.4 79.2 82.2 65.4 68.7 86.7 73.6 64.9
  Specificity 100 100 97.8 95.9 100 100 98.5 96.4
  PPV 100 100 98.8 91.1 100 100 97.5 92.6
  NPV 80.7 90.9 71.0 81.1 87.7 71.2 82.1 79.8
  Accuracy 87.9 93.2 87.0 83.9 90.3 90.0 87.3 83.5
Suspicious of malignancy and malignant taken as positive (%)
  Sensitivity 92.0 91.7 93.7 79.5 83.3 96.0 81.1 77.9
  Specificity 97.8 98.7 90.8 85.1 99.8 91.9 95.4 90.9
  PPV 97.6 97.1 95.8 77.5 99.5 97.3 93.5 85.7
  NPV 92.7 96.1 86.6 86.6 93.0 88.3 86.1 85.6
  Accuracy 95.0 96.4 92.8 82.9 94.7 95.0 88.9 85.6
Atypical, suspicious, and malignant categories taken as positive (%)
  Sensitivity 98.9 97.2 98.9 84.6 98.3 98.2 92.5 81.8
  Specificity 62.1 86.0 46.3 75.2 54.8 59.5 81.5 85.6
  PPV 71.7 77.0 80.5 68.8 49.2 88.0 80.3 79.7
  NPV 98.3 98.5 95.1 88.3 98.6 91.7 92.9 87.2
  Accuracy 80.2 89.6 82.7 78.9 68.2 88.6 86.4 84.0

PPV: Positive predictive value, NPV: Negative predictive value

The comparison of these results with those from other studies, such as those conducted by Wong et al. and McHugh et al., reinforces the reliability of the Yokohama system in the assessment of breast lesions, though the choice of categories influences diagnostic performance.[2,13] Notably, the sensitivity of the present study was lower than in some studies, such as those by Wong et al., where a higher sensitivity was achieved when multiple categories (including atypical and suspicious) were considered as positive.[2] This discrepancy may be attributed to differences in sample size, patient demographics, or regional variations in lesion characteristics.

In addition, the study addressed the issue of discordance between cytological and histopathological diagnoses, which can arise from inadequate samples or technical limitations in FNAC. Inadequate smears, such as those in the Yokohama category 1, may lead to false negatives or misclassification.

This study observed that 5/11 cases in the inadequate category (Yokohama category 1) were found to be malignant upon histopathological examination. This highlights the potential drawbacks of FNAC, particularly when sampling issues arise, especially in deep-seated or challenging lesions. To address this, the authors recommend the use of ROSE to improve sample adequacy and suggest that CNB might be a more reliable approach for patients with inadequate FNAC results, as it provides better tissue sampling.

Regarding the comparison of the BI-RADS system with histopathological diagnoses, the present study observed that higher BI-RADS scores were more likely to be associated with malignancy. The sensitivity and specificity of the BIRADS system in this study were comparable to those seen in other studies, with Scenario C (which includes 4b, 4c, 5, 6) achieving the highest sensitivity (100%) and NPV (100%). The highest diagnostic accuracy was seen in Scenario B, with a value of 87.7%. This reinforces the complementary nature of radiological and pathological assessments, emphasizing the importance of the “triple test” approach. The overall findings indicate that the BI-RADS system was more sensitive and had a better NPV than the Yokohama system. However, the Yokohama system demonstrated better specificity and PPV, making it particularly useful in confirming malignancy.

SUMMARY

In summary, this study highlights the comparable diagnostic performance of FNAC and imaging-based systems (BIRADS and Yokohama classifications) in evaluating breast lesions. While each modality demonstrates high individual accuracy, the current findings do not provide direct evidence of a complementary role. Rather, FNAC and imaging appear to offer similar diagnostic capabilities. However, the potential for complementarity remains relevant in clinical contexts where limitations in one modality, such as inadequate FNAC smears or indeterminate imaging, may be mitigated by another. For instance, false negatives in FNAC due to poor cellularity may be further evaluated using high BI-RADS scores that prompt additional tissue sampling. Conversely, equivocal imaging findings (e.g., BI-RADS 4a) may benefit from cytological correlation.

It should be emphasized that, knowing these benefits of using the triple test, it should be promoted more in LMICs. Given its ability to enhance diagnostic accuracy and improve patient outcomes, wider adoption in resource-limited settings could be crucial in early breast cancer detection and management. Integrating FNAC, CNB, and imaging techniques in routine clinical practice can significantly reduce diagnostic delays and ensure timely treatment interventions. Strengthening the implementation of the triple test in LMICs through capacity-building, training programs, and standardized reporting systems would enhance its accessibility and reliability, ultimately improving breast cancer care in these regions.

Therefore, although this study does not directly establish the compensatory value of combining modalities, it supports the rationale for a multi-modal diagnostic framework, particularly in ambiguous or borderline cases. Although in the present study, the sample size was sufficient to estimate overall diagnostic accuracy, it was not specifically powered for precision of sensitivity and specificity estimates. The relatively wide confidence intervals, particularly for sensitivity, highlight the need for larger, stratified samples in future studies to improve statistical precision in these key parameters. Future studies should investigate this potential complementarity more rigorously by analyzing discordant cases and evaluating how integration improves diagnostic confidence or alters clinical decision-making.

ACKNOWLEDGMENTS

Not applicable.

AVAILABILITY OF DATA AND MATERIALS

All data generated or analyzed during this study are included in this article. Further inquiries can be directed to the corresponding author.

ABBREVIATIONS

BIRADS: Breast imaging-reporting and data system

CNB: Core needle biopsy

FNAC: fine-needle aspiration cytology

IDC: Invasive ductal carcinoma

NPV: Negative predictive value

PPV: Positive predictive value

ROM: Risk of malignancy

ROSE: Rapid onsite evaluation

AUTHOR CONTRIBUTIONS

SV: Formal analysis, data curation; SA: Conceptualization, methodology, writing original draft; SZ: Conceptualization, methodology, formal analysis, resources, writing – review and editing, supervision; SS: Conceptualization, resources, writing – review and editing; MS: Resources, supervision, writing – review and editing; CC: Conceptualization, resources, writing – review and editing. All authors meet ICMJE authorship requirements.

ETHICS APPROVAL AND CONSENT TO PARTICIPATE

The study was done after approval of the Institutional Ethics Committee (VMMC and SJH) vide no. IEC/VMMC/SJH/Thesis/2023-03/CC-245. Written informed consent was taken from the participants to participate in the study.

CONFLICTS OF INTEREST

The authors declare no conflicts of interest.

EDITORIAL/PEER REVIEW

To ensure the integrity and highest quality of CytoJournal publications, the review process of this manuscript was conducted under a double-blind model (authors are blinded for reviewers and vice versa) through an automatic online system.

FUNDING: None.

References

  1. , , , , , , et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA A Cancer J Clin. 2021;71:209-49.
    [CrossRef] [PubMed] [Google Scholar]
  2. , , , , , . The international academy of cytology Yokohama system for reporting breast fine needle aspiration biopsy cytopathology: A single institutional retrospective study of the application of the system categories and the impact of rapid onsite evaluation. Acta Cytol. 2019;63:280-91.
    [CrossRef] [PubMed] [Google Scholar]
  3. , , . IAC standardized reporting of breast fine-needle aspiration biopsy cytology. Acta Cytol. 2017;61:3-6.
    [CrossRef] [PubMed] [Google Scholar]
  4. , , , , , , et al. The use of the international academy of cytology Yokohama system for reporting breast fine-needle aspiration biopsy. Am J Clin Pathol. 2023;159:138-45.
    [CrossRef] [PubMed] [Google Scholar]
  5. , , , , , , et al. IAC standardized reporting of breast fine-needle aspiration cytology, Yokohama 2016: A critical appraisal over a 2 year period. Breast Dis. 2019;38:109-15.
    [CrossRef] [PubMed] [Google Scholar]
  6. , . Categorization of breast fine needle aspirates using the international academy of cytology Yokohama system along with assessment of risk of malignancy and diagnostic accuracy in a tertiary care centre. J Cytol. 2021;38:158-63.
    [CrossRef] [PubMed] [Google Scholar]
  7. , . Mammography BIRADS grading In: StatPearls. Treasure Island, FL: StatPearls Publishing; . Available from: https://www.ncbi.nlm.nih.gov/books/nbk541075 [Last accessed on 2025 Sep 13]
    [Google Scholar]
  8. , , . Breast imaging reporting and data system In: StatPearls. Treasure Island, FL: StatPearls Publishing; . Available from: https://www.ncbi.nlm.nih.gov/books/nbk459169 [Last accessed on 2024 Oct 07]
    [Google Scholar]
  9. , , , , , , et al. The international academy of cytology Yokohama system for reporting breast fine-needle aspiration biopsy cytopathology. Acta Cytol. 2019;63:257-73.
    [CrossRef] [Google Scholar]
  10. , , , , , , et al. A modified triple test for palpable breast masses: The value of ultrasound and core needle biopsy. Ann Surg Oncol. 2013;20:850-5.
    [CrossRef] [PubMed] [Google Scholar]
  11. , , , , , , et al. Accuracy of the International academy of cytology Yokohama system of breast cytology reporting for fine needle aspiration biopsy of the breast in a dedicated breast care setting. Diagn Cytopathol. 2021;49:195-202.
    [CrossRef] [PubMed] [Google Scholar]
  12. , , , , , , et al. The continuing role of breast fine-needle aspiration biopsy after the introduction of the IAC Yokohama system for reporting breast fine needle aspiration biopsy cytopathology. Diagn Cytopathol. 2020;48:1244-53.
    [CrossRef] [PubMed] [Google Scholar]
  13. , , . Concordance of breast fine needle aspiration cytology interpretation with subsequent surgical pathology: An 18-year review from a single subSaharan African institution. Cytopathology. 2019;30:519-25.
    [CrossRef] [PubMed] [Google Scholar]
  14. , , . Breast fine needle aspiration biopsy cytology using the newly proposed IAC Yokohama system for reporting breast cytopathology: The experience of a single institution. Acta Cytol. 2019;63:1-6.
    [CrossRef] [PubMed] [Google Scholar]
  15. , , , , , . The International Academy of Cytology standardized reporting of breast fine-needle aspiration biopsy cytology: A 2 year's retrospective study with application of categories and their assessment for risk of malignancy. Cytojournal. 2021;18:27.
    [CrossRef] [PubMed] [Google Scholar]
Show Sections