KoopaML: A Graphical Platform for Building Machine Learning Pipelines Adapted to Health Professionals.

Authors

DOI:

https://doi.org/10.9781/ijimai.2023.01.006

Keywords:

Artificial Intelligence, Human-Computer Interaction (HCI), Health, Information System, Medical Data, Medical Images
Supporting Agencies
This research work has been supported by the Spanish Ministry of Education and Vocational Training under an FPU fellowship (FPU17/03276). This research was partially funded by the Spanish Government Ministry of Economy and Competitiveness through the DEFINES project grant number (TIN2016-80172-R) and the Ministry of Science and Innovation through the AVisSA project grant number (PID2020-118345RB-I00). This work was also supported by national (PI14/00695, PIE14/00066, PI17/00145, DTS19/00098, PI19/00658, PI19/00656 Institute of Health Carlos III, Spanish Ministry of Economy and Competitiveness and cofunded by ERDF/ESF, “Investing in your future”) and community (GRS 2033/A/19, GRS 2030/A/19, GRS 2031/A/19, GRS 2032/A/19, SACYL, Junta Castilla y León) competitive grants.

Abstract

Machine Learning (ML) has extended its use in several domains to support complex analyses of data. The medical field, in which significant quantities of data are continuously generated, is one of the domains that can benefit from the application of ML pipelines to solve specific problems such as diagnosis, classification, disease detection, segmentation, assessment of organ functions, etc. However, while health professionals are experts in their domain, they can lack programming and theoretical skills regarding ML applications. Therefore, it is necessary to train health professionals in using these paradigms to get the most out of the application of ML algorithms to their data. In this work, we present a platform to assist non-expert users in defining ML pipelines in the health domain. The system’s design focuses on providing an educational experience to understand how ML algorithms work and how to interpret their outcomes and on fostering a flexible architecture to allow the evolution of the available components, algorithms, and heuristics.

Downloads

Download data is not yet available.

References

G. Litjens et al., “A survey on deep learning in medical image analysis,” (in eng), Med Image Anal, vol. 42, pp. 60-88, Dec 2017, doi: 10.1016/j.media.2017.07.005.

S. González Izard, R. Sánchez Torres, Ó. Alonso Plaza, J. A. Juanes Méndez, and F. J. García-Peñalvo, “Nextmed: Automatic Imaging Segmentation, 3D Reconstruction, and 3D Model Visualization Platform Using Augmented and Virtual Reality,” (in eng), Sensors (Basel), vol. 20, no. 10, p. 2962, 2020, doi: 10.3390/s20102962.

S. G. Izard, J. A. Juanes, F. J. García Peñalvo, J. M. G. Estella, M. J. S. Ledesma, and P. Ruisoto, “Virtual Reality as an Educational and Training Tool for Medicine,” Journal of Medical Systems, vol. 42, no. 3, p. 50, 2018/02/01 2018, doi: 10.1007/s10916-018-0900-2.

J. C. Weyerer and P. F. Langer, “Garbage in, garbage out: The vicious cycle of ai-based discrimination in the public sector,” in Proceedings of the 20th Annual International Conference on Digital Government Research, Dubai, United Arab Emirates, 2019, pp. 509-511, doi: https://doi.org/10.1145/3325112.3328220

X. Ferrer, T. van Nuenen, J. M. Such, M. Coté, and N. Criado, “Bias and Discrimination in AI: a cross-disciplinary perspective,” IEEE Technology and Society Magazine, vol. 40, no. 2, pp. 72-80, 2021, doi: 10.1109/MTS.2021.3056293.

S. Hoffman, “The Emerging Hazard of AI‐Related Health Care Discrimination,” Hastings Center Report, vol. 51, no. 1, pp. 8-9, 2021, doi: 10.1002/hast.1203.

S. Wachter, B. Mittelstadt, and C. Russell, “Why fairness cannot be automated: Bridging the gap between EU non-discrimination law and AI,” Computer Law & Security Review, vol. 41, p. 105567, 2021, doi: 10.2139/ssrn.3547922.

M. Abadi et al., “TensorFlow: A System for Large-Scale Machine Learning,” in 12th USENIX Symposium on Operating Systems Design and Implementation OSDI 16. Savannah, GA: USENIX Association, 2016, pp. 265-283.

R. Anil et al., “Apache Mahout: Machine Learning on Distributed Dataflow Systems,” Journal of Machine Learning Research, vol. 21, no. 127, pp. 1-6, 2020. [Online]. Available: https://jmlr.org/papers/v21/18-800.html

E. Frank, M. Hall, G. Holmes, R. Kirkby, B. Pfahringer, and I. H. Witten, “Weka-A Machine Learning Workbench for Data Mining,” in Data Mining and Knowledge Discovery Handbook, O. Maimon and L. Rokach Eds. Boston, MA: Springer, 2009.

M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten, “The WEKA data mining software: an update,” SIGKDD Explor. Newsl., vol. 11, no. 1, pp. 10–18, 2009, doi: 10.1145/1656274.1656278.

A. Vázquez-Ingelmo, F. J. García-Peñalvo, and R. Therón, “Information Dashboards and Tailoring Capabilities - A Systematic Literature Review,” IEEE Access, vol. 7, pp. 109673-109688, 2019, doi: 10.1109/ ACCESS.2019.2933472.

A. Sarikaya, M. Correll, L. Bartram, M. Tory, and D. Fisher, “What Do We Talk About When We Talk About Dashboards?,” IEEE Transactions on Visualization Computer Graphics, vol. 25, no. 1, pp. 682 - 692, 2018, doi: 10.1109/TVCG.2018.2864903.

S. Few, Information dashboard design. Sebastopol‎, CA, USA: O’Reilly Media, 2006.

S. Land and S. Fischer, “Rapid miner 5,” Rapid-I GmbH, 2012.

J. Bosch, “From software product lines to software ecosystems,” in SPLC, 2009, vol. 9, pp. 111-119.

L. Chen, M. Ali Babar, and N. Ali, “Variability management in software product lines: a systematic review,” 2009.

P. Clements and L. Northrop, Software product lines. Addison-Wesley Boston, 2002.

C. Kästner, S. Apel, and M. Kuhlemann, “Granularity in software product lines,” in 2008 ACM/IEEE 30th International Conference on Software Engineering, 2008: IEEE, pp. 311-320.

J. Van Gurp, J. Bosch, and M. Svahnberg, “On the notion of variability in software product lines,” in Proceedings Working IEEE/IFIP Conference on Software Architecture, 2001: IEEE, pp. 45-54.

S. Lampa, J. Alvarsson, and O. Spjuth, “Towards agile large-scale predictive modelling in drug discovery with flow-based programming design principles,” Journal of Cheminformatics, vol. 8, no. 1, p. 67, 2016/11/24 2016, doi: 10.1186/s13321-016-0179-6.

S. Lampa, M. Dahlö, J. Alvarsson, and O. Spjuth, “SciPipe: A workflow library for agile development of complex and dynamic bioinformatics pipelines,” GigaScience, vol. 8, no. 5, 2019, doi: 10.1093/gigascience/giz044.

R. Skjong and B. H. Wentworth, “Expert judgment and risk perception,” in Proceedings of the Eleventh (2001) International Offshore and Polar Engineering Conference (Stavanger, Norway, June 17-22, 2001): International Society of Offshore and Polar Engineers, 2001.

S. Brown. “The C4 Model for Software Architecture.” https://c4model.com/ (accessed 16-05-022).

A. García-Holgado et al., “User-Centered Design Approach for a Machine Learning Platform for Medical Purpose,” Cham, 2021: Springer International Publishing, in Human-Computer Interaction, pp. 237-249.

A. Vázquez Ingelmo, A. García-Holgado, F. J. García-Peñalvo, and R. Therón Sánchez, “A Meta-modeling Approach to Take into Account Data Domain Characteristics and Relationships in Information Visualizations,” in 9th World Conference on Information Systems and Technologies, Azores, Portugal, Á. Rocha, H. Adeli, G. Dzemyda, F. Moreira, and A. M. R. Correia, Eds., 2021, vol. 2: Springer Nature, in Trends and Innovations in Information Systems and Technologies, WorldCIST 2021, pp. 570-580, doi: 10.1007/978-3-030-72651-5_54. [Online]. Available: http://hdl.handle.net/10366/145626

A. Vázquez-Ingelmo, A. García-Holgado, F. J. García-Peñalvo, and R. Therón, “Proof-of-concept of an information visualization classification approach based on their fine-grained features,” Expert Systems, vol. n/a, no. n/a, p. e12872, In Press, doi: https://doi.org/10.1111/exsy.12872

A. Vázquez-Ingelmo, F. J. García-Peñalvo, and R. Therón, “Taking advantage of the software product line paradigm to generate customized user interfaces for decision-making processes: a case study on university employability,” PeerJ Computer Science, vol. 5, p. e203, 2019/07/01 2019, doi: 10.7717/peerj-cs.203.

C. Schaffer, “Selecting a classification method by cross-validation,” Machine Learning, vol. 13, no. 1, pp. 135-143, 1993.

F. García-Peñalvo et al., “Application of Artificial Intelligence Algorithms Within the Medical Context for Non-Specialized Users: the CARTIER-IA Platform,” International Journal of Interactive Multimedia and Artificial Intelligence, vol. 6, no. 6, 2021.

A. García-Holgado and F. J. García-Peñalvo, “Validation of the learning ecosystem metamodel using transformation rules,” Future Generation Computer Systems, vol. 91, pp. 300-310, 2019/02/01/ 2019, doi: https://doi.org/10.1016/j.future.2018.09.011

A. Martínez-Rojas, A. Jiménez-Ramírez, and J. Enríquez, “Towards a Unified Model Representation of Machine Learning Knowledge,” presented at the Proceedings of the 15th International Conference on Web Information Systems and Technologies, Vienna, Austria, 2019. [Online]. Available: https://doi.org/10.5220/0008559204700476

C. Kumar, M. Käppel, N. Schützenmeier, P. Eisenhuth, and S. Jablonski, “A Comparative Study for the Selection of Machine Learning Algorithms based on Descriptive Parameters,” in Proceedings of the 8th International Conference on Data Science, Technology and Applications. Volume 1. DATA, Prague, Czech Republic, 2019, pp. 408-415, doi: 10.5220/0008117404080415.

Downloads

Published

2024-06-01
Metrics
Views/Downloads
  • Abstract
    212
  • PDF
    62

How to Cite

García Peñalvo, F. J., Vázquez Ingelmo, A., García Holgado, A., Sampedro Gómez, J., Sánchez Puente, A., Vicente Palacios, V., … L. Sánchez, P. (2024). KoopaML: A Graphical Platform for Building Machine Learning Pipelines Adapted to Health Professionals. International Journal of Interactive Multimedia and Artificial Intelligence, 8(6), 112–119. https://doi.org/10.9781/ijimai.2023.01.006

Most read articles by the same author(s)