Journal Article Luca Berardinelli, Vittoriano Muttillo, Romina Eramo, Hugo Bruneliere, Abbas Rahimi, Antonio Cicchetti, Joan Giner-Miguelez, Abel Gómez, Pasqualina Potena, Mehrdad Saadatmand Model Driven Engineering, Artificial Intelligence, and DevOps for Software and Systems Engineering: A Systematic Mapping Study of Synergies and Challenges In: ACM Trans. Softw. Eng. Methodol., 2025, ISSN: 1049-331X, (Just Accepted). Abstract | Links | BibTeX | Tags: Artificial Intelligence, Cloud Computing, Continuous Integration, Cyber-Physical Systems, DevOps, Internet of Things, Machine learning, Model-Driven Engineering @article{10.1145/3759454,
title = {Model Driven Engineering, Artificial Intelligence, and DevOps for Software and Systems Engineering: A Systematic Mapping Study of Synergies and Challenges},
author = { Luca Berardinelli and Vittoriano Muttillo and Romina Eramo and Hugo Bruneliere and Abbas Rahimi and Antonio Cicchetti and Joan Giner-Miguelez and Abel G\'{o}mez and Pasqualina Potena and Mehrdad Saadatmand},
url = {https://doi.org/10.1145/3759454},
doi = {10.1145/3759454},
issn = {1049-331X},
year = {2025},
date = {2025-08-01},
urldate = {2025-08-01},
journal = {ACM Trans. Softw. Eng. Methodol.},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
abstract = {This paper presents a systematic mapping study classifying existing scientific contributions on synergies of Model Driven Engineering (MDE), Artificial Intelligence/Machine Learning (AI/ML), and DevOps, with the overall objective of supporting the continuous development of Cyber-Physical Systems (CPSs). We collected papers from bibliographic sources and selected primary studies to analyse. Then, we characterised and classified the current state of the art, focusing on 1) main aspects already tackled at the intersection of at least two of the three studied areas, and 2) findings emerging from the analysis as a framework for potential future research, notably regarding the integration of the three studied areas. The results reveal that few approaches combine MDE, AI/ML, and DevOps for software and systems engineering. In contrast, several approaches have combined two of them, specifically MDE and DevOps. Approaches combining AI/ML with MDE or DevOps are also becoming more frequent and will most likely continue to progress in the future. These synergies cover a range of engineering activities, from requirements and design to monitoring, maintenance, and evolution. Open research challenges include advancing AI/ML, MDE, and DevOps integration, supporting scalable, data-oriented solutions, proposing new continuous engineering methods, and adapting DevOps practices to diverse systems.},
note = {Just Accepted},
keywords = {Artificial Intelligence, Cloud Computing, Continuous Integration, Cyber-Physical Systems, DevOps, Internet of Things, Machine learning, Model-Driven Engineering},
pubstate = {published},
tppubtype = {article}
}
This paper presents a systematic mapping study classifying existing scientific contributions on synergies of Model Driven Engineering (MDE), Artificial Intelligence/Machine Learning (AI/ML), and DevOps, with the overall objective of supporting the continuous development of Cyber-Physical Systems (CPSs). We collected papers from bibliographic sources and selected primary studies to analyse. Then, we characterised and classified the current state of the art, focusing on 1) main aspects already tackled at the intersection of at least two of the three studied areas, and 2) findings emerging from the analysis as a framework for potential future research, notably regarding the integration of the three studied areas. The results reveal that few approaches combine MDE, AI/ML, and DevOps for software and systems engineering. In contrast, several approaches have combined two of them, specifically MDE and DevOps. Approaches combining AI/ML with MDE or DevOps are also becoming more frequent and will most likely continue to progress in the future. These synergies cover a range of engineering activities, from requirements and design to monitoring, maintenance, and evolution. Open research challenges include advancing AI/ML, MDE, and DevOps integration, supporting scalable, data-oriented solutions, proposing new continuous engineering methods, and adapting DevOps practices to diverse systems. |
ConferenceJoan Giner-Miguelez, Abel Gómez, Jordi Cabot DataDoc Analyzer: A Tool for Analyzing the Documentation of Scientific Datasets Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, CIKM '23 Association for Computing Machinery, Birmingham, United Kingdom, 2023, ISBN: 9798400701245. Abstract | Links | BibTeX | Tags: Datasets, explainability, Fairness, large language models, Machine learning, reverse engineering @conference{Giner-Miguelez:CIKM:2023,
title = {DataDoc Analyzer: A Tool for Analyzing the Documentation of Scientific Datasets},
author = {Joan Giner-Miguelez and Abel G\'{o}mez and Jordi Cabot},
doi = {10.1145/3583780.3614737},
isbn = {9798400701245},
year = {2023},
date = {2023-10-01},
booktitle = {Proceedings of the 32nd ACM International Conference on Information and Knowledge Management},
pages = {5046\textendash5050},
publisher = {Association for Computing Machinery},
address = {Birmingham, United Kingdom},
series = {CIKM '23},
abstract = {Recent public regulatory initiatives and relevant voices in the ML community have identified the need to document datasets according to several dimensions to ensure the fairness and trustworthiness of machine learning systems. In this sense, the data-sharing practices in the scientific field have been quickly evolving in the last years, with more and more research works publishing technical documentation together with the data for replicability purposes. However, this documentation is written in natural language, and its structure, content focus, and composition vary, making them challenging to analyze.We present DataDoc Analyzer, a tool for analyzing the documentation of scientific datasets by extracting the details of the main dimensions required to analyze the fairness and potential biases. We believe that our tool could help improve the quality of scientific datasets, aid dataset curators during its documentation process, and be a helpful tool for empirical studies on the overall quality of the datasets used in the ML field. The tool implements an ML pipeline that uses Large Language Models at its core for information retrieval. DataDoc is open-source, and a public demo is published online.},
keywords = {Datasets, explainability, Fairness, large language models, Machine learning, reverse engineering},
pubstate = {published},
tppubtype = {conference}
}
Recent public regulatory initiatives and relevant voices in the ML community have identified the need to document datasets according to several dimensions to ensure the fairness and trustworthiness of machine learning systems. In this sense, the data-sharing practices in the scientific field have been quickly evolving in the last years, with more and more research works publishing technical documentation together with the data for replicability purposes. However, this documentation is written in natural language, and its structure, content focus, and composition vary, making them challenging to analyze.We present DataDoc Analyzer, a tool for analyzing the documentation of scientific datasets by extracting the details of the main dimensions required to analyze the fairness and potential biases. We believe that our tool could help improve the quality of scientific datasets, aid dataset curators during its documentation process, and be a helpful tool for empirical studies on the overall quality of the datasets used in the ML field. The tool implements an ML pipeline that uses Large Language Models at its core for information retrieval. DataDoc is open-source, and a public demo is published online. Full Text AvailableOpen Access |
Journal ArticleJoan Giner-Miguelez, Abel Gómez, Jordi Cabot A domain-specific language for describing machine learning datasets In: Journal of Computer Languages, vol. 76, pp. 101209, 2023, ISSN: 2590-1184. Abstract | Links | BibTeX | Tags: Datasets, Domain-specific languages, Fairness, Machine learning, MDE @article{Giner-Miguelez:COLA:2023,
title = {A domain-specific language for describing machine learning datasets},
author = {Joan Giner-Miguelez and Abel G\'{o}mez and Jordi Cabot},
doi = {10.1016/j.cola.2023.101209},
issn = {2590-1184},
year = {2023},
date = {2023-08-01},
urldate = {2023-08-01},
journal = {Journal of Computer Languages},
volume = {76},
pages = {101209},
abstract = {Datasets are essential for training and evaluating machine learning (ML) models. However, they are also at the root of many undesirable model behaviors, such as biased predictions. To address this issue, the machine learning community is proposing a data-centric cultural shift, where data issues are given the attention they deserve and more standard practices for gathering and describing datasets are discussed and established. So far, these proposals are mostly high-level guidelines described in natural language and, as such, they are difficult to formalize and apply to particular datasets. In this sense, and inspired by these proposals, we define a new domain-specific language (DSL) to precisely describe machine learning datasets in terms of their structure, provenance, and social concerns. We believe this DSL will facilitate any ML initiative to leverage and benefit from this data-centric shift in ML (e.g., selecting the most appropriate dataset for a new project or better replicating other ML results). The DSL is implemented as a Visual Studio Code plugin, and it has been published under an open-source license.},
keywords = {Datasets, Domain-specific languages, Fairness, Machine learning, MDE},
pubstate = {published},
tppubtype = {article}
}
Datasets are essential for training and evaluating machine learning (ML) models. However, they are also at the root of many undesirable model behaviors, such as biased predictions. To address this issue, the machine learning community is proposing a data-centric cultural shift, where data issues are given the attention they deserve and more standard practices for gathering and describing datasets are discussed and established. So far, these proposals are mostly high-level guidelines described in natural language and, as such, they are difficult to formalize and apply to particular datasets. In this sense, and inspired by these proposals, we define a new domain-specific language (DSL) to precisely describe machine learning datasets in terms of their structure, provenance, and social concerns. We believe this DSL will facilitate any ML initiative to leverage and benefit from this data-centric shift in ML (e.g., selecting the most appropriate dataset for a new project or better replicating other ML results). The DSL is implemented as a Visual Studio Code plugin, and it has been published under an open-source license. Full Text AvailableOpen Access |