Formal Methods Update Meeting 2026

Talks and Abstracts

Tutorials

Tutorial on Reactive Synthesis

Shibashis Guha

Reactive synthesis is the problem of automatically constructing a system that interacts continuously with an environment while satisfying a formal specification. Instead of verifying a manually designed controller, synthesis aims to generate a correct-by-construction implementation directly from the specification itself. A central framework for this area is synthesis from Linear Temporal Logic (LTL), where desired behaviors are expressed using temporal operators describing how the system should evolve over infinite executions. The origins of the field trace back to the seminal “Church synthesis problem,” posed by Alonzo Church in the 1950s, which asked whether one can automatically construct a machine satisfying a logical specification against all possible inputs from an adversarial environment. Church’s question established the deep connection between logic, automata, and infinite games that still underpins modern reactive synthesis. Since then, LTL synthesis has become a foundational topic in formal methods, with applications ranging from hardware design and communication protocols to robotics and autonomous systems, while also driving major advances in automata theory, game theory, and algorithmic verification.

Applied Formal Methods in Cryptography

Society for Electronic Transactions and Security

This training aims to provide hands-on experience in formal verification methods for cryptographic systems and secure communication mechanisms. The sessions will introduce the foundations of formal verification and demonstrate how mathematical reasoning is used to establish strong security guarantees in adversarial environments. Participants will understand why conventional testing alone is insufficient for validating security-critical systems against malicious behavior and complex execution scenarios.

The training will cover symbolic, computational and implementation-level verification approaches for analyzing properties such as confidentiality, authentication, integrity, correctness, and memory safety. Through practical demonstrations and guided examples, attendees will gain exposure to attacker modeling, automated verification techniques, proof construction and secure implementation analysis.

Invited Talks

Decision Tree Ensembles: A Sweet Spot between Machine Learning and Formal Methods

Akshay S.

Decision trees and their ensembles are among the most widely used machine learning models, particularly for tabular data. Their discrete and structured nature also makes them attractive targets for formal reasoning. In this talk, we will survey the state of the art in the verification of decision tree ensembles and discuss recent advances in the area. We will see how formal reasoning and symbolic techniques enable rigorous analysis of decision tree ensembles, with applications to robustness, explainability, and sensitivity, and conclude by discussing some recent developments in sensitivity verification and quantification.

Edit Distance of Finite State Transducers

Aiswarya C.

We lift metrics over words to metrics over word-to-word transductions, by defining the distance between two transductions as the supremum of the distances of their respective outputs over all inputs. This allows to compare transducers beyond equivalence. Two transducers are close (resp. k-close) with respect to a metric if their distance is finite (resp. at most k). Over integer-valued metrics computing the distance between transducers is equivalent to deciding the closeness and k-closeness problems. For common integer-valued edit distances such as Hamming, transposition, conjugacy and Levenshtein family of distances, we show that the closeness and the k-closeness problems are decidable for functional transducers. Hence, the distance with respect to these metrics is also computable. Finally, we relate the notion of distance between functions to the notions of diameter of a relation and index of a relation in another. We show that computing edit distance between functional transducers is equivalent to computing diameter of a rational relation and both are a specific instance of the index problem of rational relations.

Joint work with Amaldev Manuel and Saina Sunny (ICALP’24).

Proof Systems for QBF Synthesis: Extracting Skolem and Herbrand Functions

Supratik Chakraborty

Strategy extraction in QBF proof systems usually attempts to extract winning strategies from valid proofs. However, an alternative (and arguably more powerful) view is to extract Skolem/Herbrand functions, or equivalently synthesis of the game values at all intermediate points. In this talk, we investigate the existence and properties of such proof systems from which one can extract Skolem and Herbrand functions. We propose such a proof system for QBF, which we show is sound and complete, and from which extraction of Skolem/Herbrand functions can be performed, and game values computed, in polynomial time. We also show that this system is optimal among all proof systems that allow efficient extraction of Skolem/Herbrand functions. We provide conditional lower bound results for our new proof system and compare it to several existing/standard proof systems for QBF that have been studied in the literature, showing interesting orthogonality results. Finally, we provide a compilation algorithm that takes an arbitrary QBF and synthesizes a proof in our system, from which Skolem and Herbrand functions can be easily computed.

Joint work with S. Akshay, Olaf Beyersdorff, Lea Kasche, Meena Mahajan and Luc Nicolas Spachmann. To be presented at SAT 2026.

Verifying Error Correction Coding Algorithms using Symbolic Trajectory Evaluation

Ramit Das

We present a usecase of datapath/algorithmic verification in the context of Error Correction Algorithms for Hardware Systems. The lattice based symbolic simulation methodology of STE comes to our rescue for this complex problem. We shall dive into what the tool promises and get a peek into industrial challenges in the domain of datapath verification.

Understanding the IC3 verification algorithm

Deepak D'Souza

The IC3 algorithm for verifying safety properties was proposed by Aaron Bradley in 2011. Since then the approach has found widespread use across domains including hardware, software, and discrete-time systems. In this talk we will describe the basic algorithm in the setting of boolean systems, its extension to software verification, and try to bring out the reasons for its effectiveness.

Sure-Almost-Sure and Sure-Limit-Sure Window Mean Payoff in Markov Decision Processes

Pranshu Gaba

Given rationals α and β, the sure-almost-sure problem for a quantitative objective φ in a Markov decision process (MDP) asks if one can simultaneously ensure that all outcomes of the MDP have φ-value at least α (i.e., sure α satisfaction), and with probability 1 the outcome has φ-value at least β (i.e., almost-sure β satisfaction). Moreover, if simultaneous satisfaction of objectives is possible, then one would also like to construct a strategy that achieves this. Even if both sure satisfaction and almost-sure satisfaction for an objective are known, combining the two is often non-trivial and requires novel techniques and approaches. In this talk, we look at the sure-almost-sure problem for window mean-payoff objectives. The window mean-payoff objective strengthens the standard mean-payoff objective by requiring that the average payoff from each point in the play become greater than the threshold in at most ℓ steps. For the window mean-payoff objective, we show that while the computational complexity of the sure-almost-sure problem matches that of sure satisfaction and almost-sure satisfaction, the memory requirement of winning strategies for sure-almost-sure satisfaction is greater than that of both sure and almost-sure satisfaction.

Autoformalization by Combinator Code Generation

Siddhartha Gadgil

Autoformalization is the translation of mathematics in natural language into a formal language that is machine-verifiable. In our system LeanAide, we introduce and use an autoformalization framework based on Combinator Code Generation, where we combine handlers for different mathematical constructs in a manner analogous to Combinator Parsing. We will discuss this approach and its various advantages, including extensibility and human readability of the generated code.

A Modal Approach Towards Substitutions

Sujata Ghosh

Substitutions play a crucial role in a wide range of contexts, from analyzing the dynamics of social opinions and conducting mathematical computations to engaging in game-theoretical analysis. For many situations, considering one-step substitutions is often adequate. Yet, for more complex cases, iterative substitutions become indispensable. In this talk, we will describe logical frameworks that model both single-step and iterative substitutions. We will explore a number of properties of these logics, including their expressive strength, Hilbert-style proof systems, and satisfiability problems. We will also establish connections between our proposed frameworks and relevant existing ones in the literature.

Joint work with Dazhu Li, Fenrong Liu and Yaxin Tu.

Set Automata and Limits of Decidability of Two-Variable Logic on Data Words

Rishal S. P.

We extend the two-variable logic on data words with guarded regular binary predicates of the form L(x, y) that is true if positions x and y have the same data value and the factor strictly between x and y is in the regular language L. We characterise the class of monoids for which the extension of the two-variable logic with guarded predicates recognised by the monoid is decidable, namely the class of idempotent monoids whose two-sided ideals are linearly ordered. For this, we introduce an automata formalism, set automata, that has an undecidable emptiness problem. A set automaton is a finite state automaton with a fixed number of sets that can store data values. During the run, the automaton can add or remove the current data value from the sets as well as assign a union of sets to each set. The latter kind of updates can be represented as an element of a semigroup of relations on the sets. The acceptance of a run is determined by the state as well as the sets in which the data values are present at the end of the run. We identify a subclass of set automata called ordered quasi-normal set automata that has a decidable emptiness problem by reduction to the emptiness problem of ordered multicounter automata. We show that the two-variable logic extended with guarded regular predicates recognised by a semigroup S is expressively equivalent to a quasi-normal set automaton with the semigroup of relations S. In particular, if S is a linear band monoid then the resulting automaton is ordered, and the decidability result follows.

Joint work with Shibashis Guha (TIFR) and Amaldev Manuel (IIT Goa).

Certified Regular Expression Matcher in Rocq

Piyush Kurur

The talk is about designing certified hardware, i.e. hardware with a formal proof of correctness, in Rocq. We present a library for generating certified hardware for regular expression matching. The user gives a regular expression encoded using our eDSL in Rocq, and the hardware matcher is obtained using the synthesize function of our library. We prove the correctness of the synthesize function: the language of the input regular expression is the same as the language accepted by the generated hardware. This means the hardware is verified end-to-end with no particular effort from the user (push-button certified matcher). Experimental results show that the matcher is very efficient even when hardly any effort has gone into optimization other than choosing the right algorithm (Ken Thompson’s algorithm). The code is available from the public repository.

Circuit Complexity of Regular Languages

Amaldev Manuel

Many natural classes of languages defined via logic, automata, or finite semigroups form varieties, i.e., they are closed under Boolean operations, inverse morphisms, and quotients by words. This connection extends to Boolean circuits if inverse length-multiplying morphisms replace inverse morphisms; the resulting classes are called lm-varieties (Straubing, 2002). An Eilenberg-type correspondence holds: lm-varieties of languages correspond to classes of syntactic morphisms satisfying suitable HSP conditions. As an example, a strengthening of the Furst–Saxe–Sipser theorem (Parity is not in AC0) characterises the regular languages computed by constant-depth polynomial-size circuits as exactly the lm-variety QA, i.e., those whose syntactic morphisms have aperiodic stable semigroups (Barrington et al. 1992). We use the framework of lm-varieties to give an effective classification of the circuit complexity of regular languages.

Joint work with Stefan Göller (University of Kassel).

TBA

Ravindra Metta

Abstract TBD.

Refinement Types: A Powerful Machinery for Verification, Synthesis and Testing

Ashish Mishra

Abstract TBD.

Synthesizing POMDP Policies: Sampling meets Model-checking via Learning

Sayan Mukherjee

Partially Observable Markov Decision Processes (POMDPs) are the standard framework for decision-making under uncertainty. While sampling-based methods scale well, they lack formal correctness guarantees, making them unsuitable for safety-critical applications. Conversely, formal synthesis techniques provide correctness-by-construction but often struggle with scalability, as general POMDP synthesis is undecidable. To bridge this gap, we propose a synthesis framework that integrates sampling, automata learning, and model-checking. Inspired by Angluin’s L* algorithm, our approach utilizes sampling as a membership oracle and model-checking as an equivalence oracle. This enables the synthesis of finite-state controllers with formal guarantees, provided the sampling-induced policy is regular. We establish a relative completeness result for this framework. Experimental results from our prototypical implementation demonstrate that this method successfully solves threshold-safety problems that remain challenging for existing formal synthesis tools. We believe our algorithm serves as a valuable component in a portfolio approach to tackling the inherent difficulty of POMDP synthesis problems.

Joint work with Debraj Chakraborty, Anirban Majumdar, Prince Mathew and Jean-François Raskin. To appear in CAV 2026.

On Robustness of Linear Classifiers to Targeted Data Poisoning

Sumanth Prabhu

Data poisoning is a training-time attack that undermines the trustworthiness of learned models. In a targeted data poisoning attack, an adversary manipulates the training dataset to alter the classification of a targeted test point. Given the typically large size of training dataset, manual detection of poisoning is difficult. An alternative is to automatically measure a dataset’s robustness against such an attack. In this talk, I will present our recent result that this is an NP-complete problem for linear classifiers. I will also present our technique that finds lower and upper bounds of robustness using constraint solving.

Joint work with Nakshtra Gupta, Supratik Chakraborty, and Venkatesh R. Appeared in AAAI’26.

Bridging Bit-Vectors and Natural Numbers in Isabelle/HOL

Shweta Rajiv

Bit-vectors are a core tool in software and hardware verification, and modern solvers handle fixed-width bit-vectors very well using techniques like bit blasting. However, these approaches struggle with large bit widths and do not generalize to parametric bit-vectors, where the size is symbolic and reasoning must work uniformly for all widths. In this talk, I present a mechanized translation from parametric bit-vectors to natural numbers in Isabelle/HOL. The approach builds on prior work on int blasting, encoding bit-vector operations as arithmetic over naturals with explicit bounds. We show that this translation preserves semantics for a wide range of operations, including arithmetic, logical, and bit-level constructs. The formalization is developed in two styles: a shallow embedding that supports automated rewriting of bit-vector goals into natural number formulas, and a deep embedding that establishes a global correctness theorem. These results provide a sound and fully verified reduction from parametric bit-vector reasoning to natural number arithmetic.

TBA

Abhisekh Sankaran

Abstract TBD.

Formal Verification of End-to-End Verifiable Election Protocols

Subodh Sharma

End-to-end verifiable (E2E-V) election protocols use cryptographic techniques to provide strong guarantees of vote integrity, privacy, and transparency. However, the complexity of these protocols makes rigorous validation essential. This talk explores how formal methods can be used to specify and verify key election properties, including ballot secrecy, cast-as-intended, recorded-as-cast, and tallied-as-recorded guarantees. We will survey verification approaches based on model checking, theorem proving, and automated protocol analysis, and discuss lessons learned from the formal analysis of real-world voting systems. The talk concludes with emerging challenges at the intersection of cryptography, formal verification, and trustworthy digital democracy.

State Reachability Under Weak Memory: From Decidability Frontiers to Visibility-Guided Testing

Abhishek Singh

This talk surveys the state of the art in state reachability for weak memory models (WMMs), focusing on declarative as well as operational semantics for the release-acquire (RA) fragment of C/C++. We first review the theoretical landscape: the decidability of state reachability for full C/C++ concurrency, the RA fragment, and other closely related variants restricted by different sets of axiomatic constraints. We then examine existing model checking approaches, which scale poorly, and popular testing frameworks (including probabilistic concurrency testing and concurrency fuzzers), which find bugs quickly but rarely trigger subtle visibility violations. Finally, we briefly outline our group’s ongoing research on a visibility-guided fuzzer that generates skeleton (abstract execution) graphs and mutates skeletons to maximize behavioral diversity. This shifts testing from hoping to stumble upon weak behaviors to systematically proposing and instantiating them.

From Convergence to Confidence: Push-Button Verification for Replicated Data Types

K. C. Sivaramakrishnan

Replicated Data Types (RDTs) underpin a growing class of geo-distributed applications, but what it means for an RDT to be correct, and how to prove it, remains surprisingly subtle. Strong eventual consistency guarantees only that replicas eventually agree; replication-aware linearizability further requires that whatever state they agree on is reachable by linearising the updates each replica has actually observed. A compelling specification, but one that has historically demanded painstaking, hand-written proofs. In this talk, I will trace a multi-year arc aimed at replacing those proofs with push-button verification for Mergeable Replicated Data Types (MRDTs). Automated proof assistants and, more recently, agentic proof-oriented programming are steadily eroding the need for manual scaffolding. The story, I hope, illustrates something broader: that the gap between “we believe this merges correctly” and “we have proved it merges correctly” is narrowing to the point where correct-by-construction replicated data is becoming a realistic engineering proposition rather than a research aspiration.

Regular Expressions over Countable Words

Sreejith A V

We investigate the expressive power of regular expressions for languages of countable words and establish their expressive equivalence with logical and algebraic characterizations. Our goal is to extend the classical theory of regular languages, defined over finite words and characterized by automata, monadic second-order logic, and regular expressions, to the setting of countable words.

Generalized Timed Automata

B. Srivathsan

Timed automata are a widely adopted formalism for modeling real-time systems, using clocks to enforce timing constraints on transitions. In this talk, we present the idea of “future clocks”, which are in some sense, dual to ordinary clocks. Future clocks are like timers: they predict the time to a future event, and remain active until they time out. “Generalized timed automata” allow both clocks and future clocks. In the talk, we will introduce this model and describe some of its properties. We will then demonstrate their usefulness in logic-to-automata translations.

Based on joint works with S. Akshay, Prerak Contractor, Paul Gastin, R. Govind and Aniruddha Joshi.

Intruder Theories

S. P. Suresh

The Dolev-Yao model is the primary abstraction used in the symbolic analysis of cryptographic protocols. In this model, cryptographic operations are abstracted as operators in a term algebra and rules in a formal derivation system on messages. Special properties of some cryptographic operations like XOR, homomorphic encryption etc. are modelled as equation (or rewrite) theories, called “intruder theories” in the literature. In this lecture, we introduce the example of XOR, study its equation theory and how it can be incorporated into the derivation system. We also study the verification problem for protocols that use XOR, and we study a further extension to the protocol model that communicates assertions as well as terms.

Joint work with R Ramanujam (Azim Premji University) and Vaishnavi Sundararajan (IIT Delhi).

Incorrectness Proofs with Underapproximation Invariants

Gourav Takhar

Incorrectness logic provides a theoretical framework to reason about program faults. While incorrectness logic has been immensely successful at explaining bug-finding tools, automating incorrectness logic, especially its backwards variant rule has remained challenging. The tools based on incorrectness logic, instead, analyze bounded models of program using a loop unrolling rule that unrolls loops to finite bounds. While such analysis via loop unrolling is quite effective for shallow bugs, discovering deep bugs remains a challenge. In this work, we propose a new proof system, I3, that, instead of requiring backwards variants, allows partial incorrectness proofs with a new kind of invariant, the underapproximation invariant. Unlike regular invariants that must hold on all program traces, underapproximation invariants are required to hold only on a subset of traces. We also show that, under certain sufficient conditions, a partial incorrectness proof does guarantee total incorrectness. Given the success of invariant inference in automating partial correctness proofs with Floyd-Hoare logic, we believe that I3 should also be easier to automate. We implement our proof system within a prover, I3Prove, that is able to automatically infer suitable underapproximation invariants en route to incorrectness proofs. We compare I3Prove against leading bug-finding tools from the SV-COMP23 competition, viz. UAutomizer, CPAChecker, and Veriabs. We also compare it with a recent proposal, Banal, that uses abstract interpretation to reason about incorrectness claims. On our benchmark suite of 106 incorrect programs, I3Prove is able to prove the incorrectness claims in 97 programs, while the best of the baseline tools is only able to solve 69 instances.