What are data-driven abstractions in control theory?

Data-driven abstractions construct simplified finite models directly from sampled trajectories rather than from an explicit mathematical description of the system dynamics.

How does random exploration generate the necessary data?

Independent initial states are sampled according to a probability distribution, and finite-length input-output trajectories are recorded to build the abstraction.

What PAC guarantees does the method provide?

Probably Approximately Correct bounds quantify the probability that the abstraction correctly captures all behaviors of the concrete system for the sampled horizon.

Can the guarantees extend to arbitrarily long time horizons?

Yes, under additional assumptions such as Lipschitz continuity, the PAC properties carry over to longer horizons than those used for sampling.

Which specifications can be addressed with the resulting controllers?

Reach-avoid specifications and other properties amenable to abstraction-based synthesis are supported through the probabilistic alternating simulation relation.

How does this differ from traditional model-based abstraction?

Traditional methods require complete knowledge of the dynamics function, whereas this approach operates on black-box systems using only observed trajectories.

What role does scenario theory play?

Scenario theory supplies distribution-free statistical bounds that certify the generalization performance of the abstraction constructed from finite samples.

Are the authors affiliated with a specific institution?

Rudi Coppola, Andrea Peruffo, and Manuel Mazo are researchers at Delft University of Technology (TU Delft).

Where can the full paper be accessed?

The work appears in Automatica, where it can be reached through its ScienceDirect page, and is also available as an arXiv preprint.

What practical systems might benefit from this technique?

Applications include robotics, autonomous vehicles, and process control where obtaining accurate models is challenging.

Does the method require continuous or discrete inputs?

The framework assumes a finite discrete input set, which is common in many abstraction-based control problems.

How might this research influence future academic hiring?

Departments may seek faculty and researchers with expertise in data-driven formal methods and statistical verification techniques.

Data-Driven Abstractions for Control Systems via Random Exploration

A new paper published in Automatica presents a method for constructing data-driven symbolic abstractions for control systems using random exploration of finite-length trajectories. Titled 'Data-driven abstractions for control systems via random exploration,' the work is authored by Rudi Coppola, Andrea Peruffo, and Manuel Mazo from Delft University of Technology (TU Delft).

The approach addresses challenges in building abstractions for systems where obtaining an accurate mathematical model is difficult or costly. By sampling trajectories from the unknown dynamics under random initial conditions, the method creates finite-state models suitable for verification and controller synthesis while providing statistical guarantees.

Understanding Symbolic Abstractions in Dynamical Systems

Symbolic abstractions simplify complex continuous or hybrid dynamical systems into discrete models that preserve key behaviors. These finite-state representations enable the application of formal methods from computer science to prove properties such as safety or reachability and to synthesize controllers that guarantee desired outcomes. Traditional abstraction techniques typically require a precise model of the system dynamics, including knowledge of functions describing state evolution. In many practical scenarios, such as robotics or process control, deriving or identifying these models demands significant resources and may not capture all uncertainties accurately.

The new research shifts the paradigm by relying solely on sampled data. Researchers collect finite sequences of inputs and outputs generated by running the system from randomly chosen starting states. This data-driven construction avoids explicit model identification while still yielding abstractions that support control design.
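The data collection step described above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the names `sample_trajectories`, `toy_step`, `toy_observe`, and `toy_x0` are hypothetical, the toy dynamics are invented for the example, and drawing inputs uniformly at random from the finite input set is an assumption made here for simplicity.

```python
import math
import random
from typing import Callable, List, Sequence, Tuple

# A trajectory is stored as (inputs, outputs) with len(outputs) == len(inputs) + 1.
Trajectory = Tuple[Tuple[int, ...], Tuple[int, ...]]


def sample_trajectories(
    step: Callable[[float, int], float],
    observe: Callable[[float], int],
    sample_x0: Callable[[random.Random], float],
    inputs: Sequence[int],
    horizon: int,
    num_samples: int,
    seed: int = 0,
) -> List[Trajectory]:
    """Run a black-box system from independent random initial states."""
    rng = random.Random(seed)
    data: List[Trajectory] = []
    for _ in range(num_samples):
        x = sample_x0(rng)            # independent draw of the initial state
        us: List[int] = []
        ys: List[int] = [observe(x)]  # only outputs are recorded, never x itself
        for _ in range(horizon):
            u = rng.choice(inputs)    # assumption: inputs are picked uniformly
            x = step(x, u)            # the dynamics are only ever called, not inspected
            us.append(u)
            ys.append(observe(x))
        data.append((tuple(us), tuple(ys)))
    return data


# Hypothetical toy system used only to exercise the sampler.
def toy_step(x: float, u: int) -> float:
    return 0.5 * x + u


def toy_observe(x: float) -> int:
    return math.floor(x)  # output = index of the unit-width cell containing x


def toy_x0(rng: random.Random) -> float:
    return rng.uniform(-2.0, 2.0)


data = sample_trajectories(
    toy_step, toy_observe, toy_x0, inputs=(-1, 0, 1), horizon=5, num_samples=100
)
```

Each entry of `data` is one finite input-output sequence. Because every trajectory starts from its own independent draw of the initial state, the trajectories themselves can be treated as independent samples.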

The Role of Random Exploration and Scenario Theory

Central to the method is the random sampling of initial states to generate independent trajectories. Scenario theory, a framework that originated in sample-based (scenario) optimization, provides Probably Approximately Correct (PAC) bounds on the quality of the resulting abstraction. These bounds quantify the probability that the abstraction correctly over-approximates the behaviors of the concrete system, with explicit dependence on the number of samples collected.
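To give a feel for how such a bound depends on the sample count, the sketch below evaluates the classical scenario bound for convex scenario programs, in which the probability that the violation level exceeds ε is at most the binomial tail sum over i < d of C(N, i) ε^i (1 - ε)^(N - i). This is a swapped-in textbook bound used for illustration only; the paper may rely on a different scenario result, and the parameter `complexity` (playing the role of d) and both function names are hypothetical.

```python
from math import comb


def violation_tail(n_samples: int, epsilon: float, complexity: int) -> float:
    """Binomial tail: sum_{i < complexity} C(N, i) eps^i (1 - eps)^(N - i)."""
    return sum(
        comb(n_samples, i) * epsilon**i * (1.0 - epsilon) ** (n_samples - i)
        for i in range(complexity)
    )


def epsilon_for(n_samples: int, beta: float, complexity: int, tol: float = 1e-9) -> float:
    """Smallest eps (up to tol) whose tail is at most beta, found by bisection.

    The tail is decreasing in eps, equals 1 at eps = 0 and 0 at eps = 1
    (for 1 <= complexity <= n_samples), so bisection is well defined.
    Intended for small `complexity`; very large values can overflow floats.
    """
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if violation_tail(n_samples, mid, complexity) > beta:
            lo = mid  # bound not yet met: eps must be larger
        else:
            hi = mid  # bound met: try a smaller eps
    return hi


# With confidence at least 1 - beta, the violation probability is at most eps.
eps_1000 = epsilon_for(n_samples=1000, beta=1e-3, complexity=5)
eps_4000 = epsilon_for(n_samples=4000, beta=1e-3, complexity=5)
```

Under this bound, collecting more trajectories tightens ε for a fixed confidence level, which is the trade-off a practitioner faces when choosing the sample count.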

The authors introduce a probabilistic variant of alternating simulation relations. This relation ensures that controllers synthesized on the abstract model can be refined to controllers for the original system while preserving guarantees. The PAC properties hold for the horizon length corresponding to the sampled trajectories and, under additional mild assumptions such as Lipschitz continuity of the dynamics, can be extended to arbitrarily long horizons.

Key Technical Contributions and Guarantees

The paper focuses on deterministic control systems with finite input sets and unknown transition functions. By constructing Strongest Asynchronous ℓ-complete Abstractions from trajectory data, the approach captures memory-dependent behaviors through sequences of past inputs and outputs. This memory aspect proves essential for accurate control synthesis in black-box settings.
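The memory-based construction can be illustrated with a sliding window over each recorded trajectory. The sketch below is a simplification under stated assumptions: abstract states are windows of the last ℓ outputs only (the paper's construction may also track past inputs), trajectories are `(inputs, outputs)` pairs with one more output than inputs, and the function name `build_window_abstraction` is hypothetical.

```python
from collections import defaultdict
from typing import Dict, Hashable, Iterable, Sequence, Set, Tuple

Window = Tuple[Hashable, ...]
Transitions = Dict[Tuple[Window, Hashable], Set[Window]]


def build_window_abstraction(
    trajectories: Iterable[Tuple[Sequence[Hashable], Sequence[Hashable]]],
    ell: int,
) -> Transitions:
    """Map (window of ell outputs, input) to the set of observed next windows."""
    transitions: Dict[Tuple[Window, Hashable], Set[Window]] = defaultdict(set)
    for us, ys in trajectories:
        # Slide over every position, not just the start of the trajectory.
        for k in range(len(ys) - ell):
            src = tuple(ys[k : k + ell])
            dst = tuple(ys[k + 1 : k + 1 + ell])
            u = us[k + ell - 1]  # input applied between the last output of src and of dst
            transitions[(src, u)].add(dst)
    return dict(transitions)


# Two hypothetical trajectories that agree on the first two outputs and then diverge.
example = [
    ((0, 1), ("a", "b", "c")),
    ((0, 1), ("a", "b", "d")),
]
abstraction = build_window_abstraction(example, ell=2)
```

In the example, the window `("a", "b")` under input `1` has two observed successors, so the abstraction is nondeterministic even though the concrete system is deterministic. A larger ℓ typically reduces this nondeterminism at the cost of more abstract states.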

Extensive numerical benchmarks demonstrate the method on standard control examples, confirming that the generated abstractions support reach-avoid specifications with the predicted statistical confidence. The guarantees remain valid even when the underlying dynamics are nonlinear and only partially characterized.

Implications for Formal Methods and Control Engineering

This development bridges the gap between data-driven machine learning techniques and rigorous formal verification. Engineers working on safety-critical systems can derive controllers that carry probabilistic (PAC) guarantees without investing in full system identification. The framework supports synthesis for specifications expressed in temporal logic or as reach-avoid problems, common in autonomous systems and cyber-physical applications.

Academic researchers in systems and control may find new avenues for extending these ideas to stochastic systems or output-feedback settings. The statistical nature of the guarantees aligns well with modern emphasis on robustness under uncertainty.

Connections to Broader Research Landscape

The work builds on prior contributions by the same authors on data-driven abstractions for verification tasks. It advances beyond one-step transition sampling by using full trajectories, thereby enabling longer-horizon properties without violating independence assumptions required by scenario optimization.

Related techniques in the literature include interval Markov decision processes and growth-rate estimation, yet the current method distinguishes itself through its focus on control synthesis and explicit PAC extension mechanisms.

Practical Considerations for Implementation

Deploying the approach involves selecting an appropriate trajectory length and sample count to achieve desired confidence levels. Computational complexity scales with the number of collected behaviors, but the finite nature of the resulting abstraction keeps subsequent controller synthesis tractable using standard graph algorithms.
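As an example of the kind of graph algorithm involved, the sketch below solves a reach-avoid problem on a finite nondeterministic transition system by a backward fixed-point iteration. It is a generic textbook procedure, not code from the paper. The transition format (a dictionary from `(state, input)` to a set of successors) and the name `reach_avoid_controller` are assumptions, and inputs with no recorded transition from a state are treated as unavailable there.

```python
from typing import Dict, Hashable, Set, Tuple

State = Hashable
Input = Hashable


def reach_avoid_controller(
    transitions: Dict[Tuple[State, Input], Set[State]],
    target: Set[State],
    avoid: Set[State],
) -> Tuple[Set[State], Dict[State, Input]]:
    """Return the winning set and a memoryless controller defined on it."""
    winning: Set[State] = set(target) - set(avoid)
    controller: Dict[State, Input] = {}
    changed = True
    while changed:
        changed = False
        for (state, u), successors in transitions.items():
            if state in winning or state in avoid:
                continue
            # u is safe to play if every possible successor is already winning.
            if successors and successors <= winning:
                winning.add(state)
                controller[state] = u
                changed = True
    return winning, controller


# Small hypothetical abstraction: s0 can reach the goal through s1 under input "a".
toy_transitions = {
    ("s0", "a"): {"s1"},
    ("s0", "b"): {"s1", "bad"},
    ("s1", "a"): {"goal"},
    ("s2", "a"): {"bad", "goal"},
}
winning, controller = reach_avoid_controller(toy_transitions, {"goal"}, {"bad"})
```

The loop terminates because the winning set can only grow and the state set is finite. Requiring all successors to be winning is what makes the controller robust to the nondeterminism of the abstraction.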

Institutions with strong programs in control theory and formal methods are well positioned to incorporate these techniques into graduate curricula and research projects. The method's reliance on sampling rather than analytic models makes it particularly suitable for experimental platforms where dynamics are learned through interaction.

Future Directions and Open Questions

Extensions could address continuous input sets, partial observability, or integration with reinforcement learning for policy improvement. Combining the abstraction framework with online adaptation mechanisms represents another promising direction for handling time-varying systems.

As data-driven methods mature, the boundary between model-based and model-free control continues to blur, offering hybrid strategies that leverage both data and partial knowledge when available.

Relevance for Academic and Research Careers

PhD candidates and postdoctoral researchers specializing in control systems or formal verification may explore this area for thesis topics or publications. Faculty positions in electrical engineering, mechanical engineering, and computer science departments increasingly value expertise at the intersection of data science and rigorous systems analysis.

Universities seeking to strengthen their research portfolios in cyber-physical systems or autonomous technologies can reference such advances when recruiting talent or forming interdisciplinary teams.

Access the original publication through the ScienceDirect abstract page or the open arXiv preprint. The authors' affiliation with TU Delft provides additional context on the institutional research environment supporting this line of inquiry.
