Benchmarking multiple instance learning architectures from patches to pathology for prostate cancer detection and grading using attention-based weak supervision

Access: Open
Language: English
Authors: Butt, Naveed Anwer; Sarwat, Dilawaiz; Delgado Noya, Irene (irene.delgado@uneatlantico.es); Tutusaus, Kilian (kilian.tutusaus@uneatlantico.es); Samee, Nagwan Abdel; Ashraf, Imran
Citation: (2026) Benchmarking multiple instance learning architectures from patches to pathology for prostate cancer detection and grading using attention-based weak supervision. Scientific Reports. ISSN 2045-2322


Abstract

Histopathological evaluation is necessary for the diagnosis and grading of prostate cancer, which remains one of the most common cancers in men worldwide. Traditional evaluation is time-consuming, prone to inter-observer variability, and difficult to scale. The clinical usefulness of current AI systems is limited by their need for comprehensive pixel-level annotations. The objective of this research is to develop and evaluate, through a large-scale benchmarking study, a weakly supervised deep learning framework that minimizes the need for annotation and ensures interpretability for automated prostate cancer diagnosis and International Society of Urological Pathology (ISUP) grading using whole slide images (WSIs). The study tested six recent multiple instance learning (MIL) architectures (CLAM-MB, CLAM-SB, ILRA-MIL, AC-MIL, AMD-MIL, WiKG-MIL), three feature encoders (ResNet50, CTransPath, UNI2), and four patch extraction techniques (varying patch size and overlap) on the PANDA dataset (10,616 WSIs), yielding 72 experimental configurations (6 × 3 × 4). The methodology used distributed cloud computing to process over 31 million tissue patches and applied attention mechanisms, with Grad-CAM visualizations supporting clinical interpretability. The best configuration (UNI2 encoder with ILRA-MIL, 256 × 256 patches, 50% overlap) achieved 78.75% accuracy and a quadratic weighted kappa (QWK) of 90.12%, outperforming traditional methods and approaching expert pathologist-level diagnostic capability. Overlapping smaller patches offered the best balance of spatial resolution and contextual information, while domain-specific foundation models performed noticeably better than generic encoders. This work is the first large-scale, comprehensive comparison of weakly supervised MIL methods for prostate cancer diagnosis and grading. The proposed approach combines strong diagnostic performance with scalability, practical feasibility through cloud computing, and interpretability through visualization tools.
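
The abstract reports quadratic weighted kappa (QWK) as its headline grading metric. The following is a minimal sketch of how that metric is commonly computed, not the authors' evaluation code. It assumes ISUP grades are encoded as integers 0 to 5; the function name and the sample labels are illustrative only.

```python
def quadratic_weighted_kappa(y_true, y_pred, num_classes):
    """Cohen's kappa with quadratic disagreement weights.

    Labels are assumed to be integers in range(num_classes).
    Returns a value in [-1, 1]; 1 means perfect agreement.
    """
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and equal in length")
    if num_classes < 2:
        raise ValueError("num_classes must be at least 2")

    n = len(y_true)

    # Observed confusion matrix: observed[i][j] counts true grade i predicted as j.
    observed = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        observed[t][p] += 1

    # Marginal histograms of true and predicted grades.
    true_hist = [sum(row) for row in observed]
    pred_hist = [sum(observed[i][j] for i in range(num_classes))
                 for j in range(num_classes)]

    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(num_classes):
        for j in range(num_classes):
            # Quadratic penalty: grows with the squared distance between grades.
            weight = (i - j) ** 2 / (num_classes - 1) ** 2
            expected = true_hist[i] * pred_hist[j] / n  # agreement expected by chance
            weighted_observed += weight * observed[i][j]
            weighted_expected += weight * expected

    if weighted_expected == 0.0:
        # Degenerate case: every label and prediction falls in one class.
        # Treated as perfect agreement here (an assumption of this sketch).
        return 1.0
    return 1.0 - weighted_observed / weighted_expected


if __name__ == "__main__":
    # Hypothetical ISUP grades (0-5) for eight slides; not data from the study.
    truth = [0, 1, 2, 3, 4, 5, 2, 3]
    preds = [0, 1, 2, 4, 4, 5, 1, 3]
    print(quadratic_weighted_kappa(truth, preds, num_classes=6))
```

Because the penalty is quadratic in the grade distance, confusing ISUP grade 1 with grade 5 costs far more than confusing grade 3 with grade 4, which is why QWK suits ordinal grading better than plain accuracy. On this scale a value reported as 90.12% corresponds to 0.9012.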

Document Type: Article
Keywords: Prostate cancer detection; Weakly supervised learning; Multiple instance learning; Whole slide images; ISUP grading
Subject Classification: Subjects > Biomedicine
Subjects > Engineering
Divisions: Universidad Europea del Atlántico > Research > Scientific Production
Fundación Universitaria Internacional de Colombia > Research > Scientific Production
Universidad Internacional Iberoamericana México > Research > Scientific Production
Universidad Internacional Iberoamericana Puerto Rico > Research > Scientific Production
Universidad Internacional do Cuanza > Research > Scientific Production
Universidad de La Romana > Research > Scientific Production
Deposited: 13 Mar 2026 23:30
Last Modified: 13 Mar 2026 23:30
URI: https://repositorio.uniromana.edu.do/id/eprint/27825



Inflammatory potential of the diet and self-rated quality of life in Italian adults

Background: Dietary quality is widely acknowledged as a key factor in maintaining good health. Recommendations that promote plant-based eating patterns are largely grounded in evidence showing that dietary choices can modulate immune function. In line with this hypothesis, diet may be considered a potential driver of persistent low-grade inflammation. Quality of life (QoL), on the other hand, serves as a broad indicator that encompasses both physical and psychological wellbeing. Aim: The purpose of this cross-sectional study was to examine the relationship between the inflammatory potential of the diet and QoL in a population sample of Italian adults. Design: A total of 1,936 participants completed a 110-item food frequency questionnaire to assess eating habits. The inflammatory potential of their diet was calculated using the dietary inflammatory score (DIS). Quality of life was measured with the Manchester Short Appraisal (MANSA). Results: Higher DIS values, reflecting a more pro-inflammatory diet, were linked to a reduced likelihood of reporting high QoL (OR = 0.56; 95% CI: 0.40–0.78). Several specific domains of QoL, including general life satisfaction, social relationships, personal safety, satisfaction with cohabitation, physical health, and mental health, also showed significant associations with DIS. Conclusion: The findings suggest an association between the inflammatory potential of the diet and QoL.

Authors: Francesca Giampieri (francesca.giampieri@uneatlantico.es), Justyna Godos, Giuseppe Caruso, Marco Antonio Olvera-Moreira, Fabrizio Furnari, Andrea Di Mauro, Irma Dominguez Azpíroz (irma.dominguez@unini.edu.mx), Raynier Zambrano-Villacres, Evelyn Frias-Toral, Fabio Galvano, Giuseppe Grosso


Innovative Application of Chatbots in Clinical Nutrition Education: The E+DIEting_Lab Experience in University Students

Background/Objectives: The growing integration of Artificial Intelligence (AI) and chatbots in health professional education offers innovative methods to enhance learning and clinical preparedness. This study aimed to evaluate the educational impact of the E+DIEting_Lab chatbot platform, and the perceptions of university students of Human Nutrition and Dietetics regarding its utility, usability, and design, when implemented in clinical nutrition training. Methods: The platform was piloted from December 2023 to April 2025 with 475 students from multiple European universities. All 475 students completed the initial survey and 305 finished the follow-up evaluation, representing a 36% attrition rate. Participants completed surveys before and after interacting with the chatbots, assessing prior experience, knowledge, skills, and attitudes. Data were analyzed using descriptive statistics and independent samples t-tests to compare pre- and post-intervention perceptions. Results: Most students were female (75.4%), with six languages and diverse institutions represented. Students reported clear perceived learning gains: 79.7% reported that their practical skills in clinical dietetics and communication were updated, 90% felt that new digital tools improved classroom practice, and 73.9% reported enhanced interpersonal skills. Self-rated competence in using chatbots as learning tools increased significantly, with mean knowledge scores rising from 2.32 to 2.66 and skills from 2.39 to 2.79 on a 0–5 Likert scale (p < 0.001 for both). Perceived effectiveness and usefulness of chatbots as self-learning tools remained positive but showed a small decline after use (effectiveness from 3.63 to 3.42; usefulness from 3.63 to 3.45), suggesting that hands-on experience refined, but did not diminish, students’ overall favorable views of the platform. Conclusions: The implementation and pilot evaluation of the E+DIEting_Lab self-learning virtual patient chatbot platform demonstrate that structured digital simulation tools can significantly improve perceived clinical nutrition competences. These findings support chatbot adoption in dietetics curricula and inform future digital education innovations.

Authors: Iñaki Elío Pascual (inaki.elio@uneatlantico.es), Kilian Tutusaus (kilian.tutusaus@uneatlantico.es), Imanol Eguren García (imanol.eguren@uneatlantico.es), Álvaro Lasarte García, Arturo Ortega-Mansilla (arturo.ortega@uneatlantico.es), Thomas Prola (thomas.prola@uneatlantico.es), Sandra Sumalla Cano (sandra.sumalla@uneatlantico.es)


Suicide Ideation Detection Using Social Media Data and Ensemble Machine Learning Model

Identifying the emotional state of individuals has useful applications, particularly in reducing the risk of suicide. Users’ posts on social media platforms can provide cues to their emotional state. Clinical approaches to suicide ideation detection rely primarily on evaluation by psychologists and other medical experts, which is time-consuming and requires medical expertise. Machine learning approaches have shown potential in automating suicide ideation detection. In this regard, this study presents a soft voting ensemble model (SVEM) that combines random forest, logistic regression, and stochastic gradient descent classifiers using soft voting. In addition, for the robust training of SVEM, a hybrid feature engineering approach is proposed that combines term frequency-inverse document frequency and the bag of words. For experimental evaluation, the “Suicide Watch” and “Depression” subreddits on the Reddit platform are used. Results indicate that the proposed SVEM model achieves an accuracy of 94%, better than existing approaches. The model also shows robust performance in precision, recall, and F1, each with a score of 0.93. ERT and deep learning models are also used, and comparison with these models indicates better performance of the SVEM model. Gated recurrent unit, long short-term memory, and recurrent neural network models reach an accuracy of 92%, while the convolutional neural network obtains an accuracy of 91%. SVEM’s computational complexity is also low compared to deep learning models. Further, this study highlights the importance of explainability in healthcare applications such as suicidal ideation detection, where the use of LIME provides valuable insights into the contribution of different features. In addition, k-fold cross-validation further validates the performance of the proposed approach.
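
The soft voting step named in this abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names and the probability values are hypothetical, and the three probability lists merely stand in for the outputs of the random forest, logistic regression, and stochastic gradient descent classifiers. Soft voting averages the per-class probabilities of the base classifiers and picks the class with the highest average, whereas hard voting counts each classifier's top choice.

```python
from collections import Counter


def soft_vote(prob_sets, weights=None):
    """Average per-class probabilities across classifiers for one sample.

    prob_sets: one list of class probabilities per classifier.
    weights:   optional per-classifier weights (equal weights by default).
    Returns (averaged_probabilities, predicted_class_index).
    """
    if not prob_sets:
        raise ValueError("at least one classifier output is required")
    num_classes = len(prob_sets[0])
    if any(len(p) != num_classes for p in prob_sets):
        raise ValueError("all classifiers must report the same number of classes")
    if weights is None:
        weights = [1.0] * len(prob_sets)
    if len(weights) != len(prob_sets):
        raise ValueError("one weight per classifier is required")

    total = sum(weights)
    averaged = [
        sum(w * p[c] for w, p in zip(weights, prob_sets)) / total
        for c in range(num_classes)
    ]
    predicted = max(range(num_classes), key=averaged.__getitem__)
    return averaged, predicted


def hard_vote(prob_sets):
    """Majority vote over each classifier's most probable class."""
    votes = Counter(max(range(len(p)), key=p.__getitem__) for p in prob_sets)
    return votes.most_common(1)[0][0]


# Hypothetical outputs of three base classifiers for a single post
# (class 0 and class 1); these numbers are illustrative, not from the study.
example_probs = [
    [0.90, 0.10],  # stand-in for the random forest
    [0.45, 0.55],  # stand-in for logistic regression
    [0.45, 0.55],  # stand-in for the SGD classifier
]

if __name__ == "__main__":
    print(soft_vote(example_probs))
    print(hard_vote(example_probs))
```

In this illustrative case the two schemes disagree: two of three classifiers narrowly prefer class 1, but the one confident classifier pulls the averaged probability toward class 0. That sensitivity to confidence is the usual motivation for choosing soft over hard voting.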

Authors: Erol KINA, Jin-Ghoo Choi, Abid Ishaq, Rahman Shafique, Mónica Gracia Villar (monica.gracia@uneatlantico.es), Eduardo René Silva Alvarado (eduardo.silva@funiber.org), Isabel de la Torre Diez, Imran Ashraf


In silico prediction, molecular docking and simulation of natural flavonoid apigenin and xanthoangelol E against human metapneumovirus

Human metapneumovirus (hMPV) is one of the potential pandemic pathogens, and it is a concern for elderly subjects and immunocompromised patients. There is no vaccine or specific antiviral available for hMPV. We conducted an in silico study to predict initial antiviral candidates against human metapneumovirus. Our methodology included protein modeling, stability assessment, molecular docking, molecular simulation, analysis of non-covalent interactions, bioavailability, carcinogenicity, and pharmacokinetic profiling. We identified four plant-derived bio-compounds as antiviral candidates. Among these compounds, apigenin showed the highest binding affinity, with values of −8.0 kcal/mol for the hMPV-F protein and −7.6 kcal/mol for the hMPV-N protein. Molecular dynamics simulations and further analyses confirmed that the docked protein-ligand complexes exhibited acceptable stability compared to two standard antiviral drugs. Additionally, these four compounds yielded satisfactory outcomes in bioavailability, drug-likeness, ADME-Tox (absorption, distribution, metabolism, excretion, and toxicity), and STopTox analyses. This study highlights the potential of apigenin and xanthoangelol E as initial antiviral candidates, underscoring the need for wet-lab evaluation and for preclinical and clinical trials against human metapneumovirus infection.

Authors: Hasan Huzayfa Rahaman, Afsana Khan, Nadim Sharif, Wasifuddin Ahmed, Nazmul Sharif, Rista Majumder, Silvia Aparicio Obregón (silvia.aparicio@uneatlantico.es), Rubén Calderón Iglesias (ruben.calderon@uneatlantico.es), Isabel De la Torre Díez, Shuvra Kanti Dey