Model-Driven Engineering, Process Discovery, and Reinforcement Learning Testing

Report on Current Developments in the Research Area

General Direction of the Field

The recent advancements in the research area are significantly shaping the future of model-driven software engineering (MDE), process discovery, and reinforcement learning (RL) testing. A common thread across these developments is the integration of artificial intelligence, particularly Large Language Models (LLMs), to enhance the accuracy, efficiency, and robustness of various methodologies.

In the realm of MDE, there is a notable shift towards leveraging LLMs for synthetic data generation. This approach aims to address the limitations of traditional methods that require extensive training data, often unavailable due to privacy concerns. The integration of LLMs in modeling operations is seen as a promising solution to generate realistic and relevant modeling events, thereby supporting designers in complex system modeling.

Process discovery is also witnessing a transformation with the incorporation of domain knowledge into the discovery process. Traditional methods have primarily relied on event logs, often neglecting valuable insights from domain experts and process documentation. The recent trend is to bridge this gap by using LLMs to translate domain-specific knowledge into actionable rules that guide the construction of process models. This approach not only enhances the accuracy of discovered models but also ensures they align with both domain expertise and actual process executions.

In the field of RL, there is a growing emphasis on mutation testing to ensure the robustness of RL agents before deployment. The focus is on developing mutation operators that mimic real faults, thereby providing a more realistic assessment of test scenarios. This approach is critical for complex sequential tasks where the reliability of RL agents is paramount.

Noteworthy Innovations

  1. Synthetic Trace Generation using LLMs: The integration of LLMs in generating synthetic modeling operations is a significant advancement, particularly in scenarios where traditional data collection methods are infeasible. This approach not only reduces dependency on extensive training data but also enhances the realism of generated modeling events.

  2. Imposing Rules in Process Discovery: The novel inductive mining approach that incorporates domain knowledge into process discovery is a notable innovation. This method leverages LLMs to translate domain-specific knowledge into rules that guide model construction, resulting in more accurate and domain-aligned process models.

  3. Mutation Testing for RL: The development of a mutation testing pipeline specifically for RL, based on real faults, is a noteworthy contribution. This approach provides a more realistic assessment of test scenarios, crucial for ensuring the robustness of RL agents in complex tasks.

Sources

Towards Synthetic Trace Generation of Modeling Operations using In-Context Learning Approach

muPRL: A Mutation Testing Pipeline for Deep Reinforcement Learning based on Real Faults

Imposing Rules in Process Discovery: an Inductive Mining Approach

Bridging Domain Knowledge and Process Discovery Using Large Language Models