Publications at Software and Computational Systems Lab

Compact view

Publications of year 2026

Articles in journal or book chapters

Salih Ates, Dirk Beyer, Po-Chun Chien, and Nian-Ze Lee. Bridging Hardware and Software Analysis with Btor2C: A Word-Level-Circuit-to-C Translator (Extended Version). International Journal on Software Tools for Technology Transfer (STTT), 2026. doi:10.1007/s10009-026-00847-z
Keyword(s): Software Model Checking, Cooperative Verification, Btor2 Funding: DFG-CONVEY, DFG-BRIDGE Publisher's Version PDF Supplement
Artifact(s)
1. doi:10.5281/zenodo.16933839
Abstract

Across the broad research field concerned with analyzing computing systems, algorithms and tools revolve around the modeling languages used to describe the systems, hindering their applications to similar problems of systems in other modeling languages. For example, the research communities for formal verification and testing of hardware and software share common theoretical foundations and solving methods, including symbolic encoding, satisfiability solving, and abstraction refinement. Nevertheless, it requires significant effort for one community to benefit from the advancements of the other, as analyzers assume different modeling languages for input instances. To bridge the gap between hardware and software analysis, we propose Btor2C, a translator from word-level sequential circuits in the Btor2 language to C programs. We choose the Btor2 language as frontend because its simple syntax and bit-precise semantics make it a suitable intermediate representation for analysis purposes. Using Btor2C, we translate Btor2 circuits from the Hardware Model Checking Competitions into C programs and analyze them using tools from the Intl. Competitions on Software Verification and Testing. Our results show that software analyzers can complement hardware model checkers for enhanced quality assurance: Prominently, the software verifier CBMC (with Btor2C for preprocessing) found more bugs than the best hardware model checkers ABC and AVR in our experiment.

BibTeX Entry

@article{Btor2C-STTT, author = {Salih Ates and Dirk Beyer and Po-Chun Chien and Nian-Ze Lee}, title = {Bridging Hardware and Software Analysis with \textsc{Btor2C}: {A} Word-Level-Circuit-to-{C} Translator (Extended Version)}, journal = {International Journal on Software Tools for Technology Transfer (STTT)}, volume = {}, number = {}, pages = {}, year = {2026}, doi = {10.1007/s10009-026-00847-z}, url = {https://www.sosy-lab.org/research/btor2c/}, pdf = {https://www.sosy-lab.org/research/pub/2026-STTT.Bridging_Hardware_and_Software_Analysis_with_Btor2C_A_Word-Level-Circuit-to-C_Translator_Extended_Version.pdf}, abstract = {Across the broad research field concerned with analyzing computing systems, algorithms and tools revolve around the modeling languages used to describe the systems, hindering their applications to similar problems of systems in other modeling languages. For example, the research communities for formal verification and testing of hardware and software share common theoretical foundations and solving methods, including symbolic encoding, satisfiability solving, and abstraction refinement. Nevertheless, it requires significant effort for one community to benefit from the advancements of the other, as analyzers assume different modeling languages for input instances. To bridge the gap between hardware and software analysis, we propose <a href="https://gitlab.com/sosy-lab/software/btor2c">Btor2C</a>, a translator from word-level sequential circuits in the <a href="https://doi.org/10.1007/978-3-319-96145-3_32">Btor2</a> language to C programs. We choose the Btor2 language as frontend because its simple syntax and bit-precise semantics make it a suitable intermediate representation for analysis purposes. Using Btor2C, we translate Btor2 circuits from the Hardware Model Checking Competitions into C programs and analyze them using tools from the Intl. Competitions on Software Verification and Testing. Our results show that software analyzers can complement hardware model checkers for enhanced quality assurance: Prominently, the software verifier CBMC (with Btor2C for preprocessing) found more bugs than the best hardware model checkers ABC and AVR in our experiment.}, keyword = {Software Model Checking, Cooperative Verification, Btor2}, artifact = {10.5281/zenodo.16933839}, funding = {DFG-CONVEY, DFG-BRIDGE}, }
Max Barth, Matthias Heizmann, and Jochen Hoenicke. A lazy and modular approach to int-blasting. Acta Informatica, 63(16), 2026. doi:10.1007/s00236-026-00529-y Funding: DFG-ConVeY, DFG-ReVeriX Publisher's Version PDF

BibTeX Entry

@article{intBlasting, author = {Max Barth and Matthias Heizmann and Jochen Hoenicke}, title = {A lazy and modular approach to int-blasting}, journal = {Acta Informatica}, volume = {63}, number = {16}, pages = {}, year = {2026}, doi = {10.1007/s00236-026-00529-y}, url = {}, pdf = {}, keyword = {}, annote = {}, artifact = {}, funding = {DFG-ConVeY, DFG-ReVeriX}, }
Franco Barbanera and Rolf Hennicker. Safe Orchestrated Multicomposition of Systems of Communicating Finite State Machines. Journal of Logical and Algebraic Methods in Programming:101109, 2026. doi:10.1016/j.jlamp.2026.101109 Publisher's Version

Abstract

The Participants-as-Interfaces (PaI) approach to system composition suggests that participants in a system can be considered interfaces to the outside world. Given a set of systems, one participant per system is chosen to play the role of an interface. When systems are composed, these interface participants are replaced by gateways that communicate with each other by forwarding messages. The PaI approach for systems of asynchronously communicating finite state machines (CFSMs) has been exploited in the literature for binary composition where the forwarding policy is necessarily unique. In this paper we consider the case of multiple system composition and extend preliminary work to the case where interactions among gateways can be mediated by additional orchestrating participants that comply with a given connection model. We represent the interactions among gateways as CFSM systems (called orchestrated connection policies) and prove that a number of relevant communication properties (e.g. deadlock-freedom, reception-error-freedom) are preserved by orchestrated PaI multicomposition, provided that the orchestrated connection policy used also satisfies the communication property in question.

BibTeX Entry

@article{HennickerJLAP26, author = {Franco Barbanera and Rolf Hennicker}, title = {Safe Orchestrated Multicomposition of Systems of Communicating Finite State Machines}, journal = {Journal of Logical and Algebraic Methods in Programming}, pages = {101109}, year = {2026}, doi = {10.1016/j.jlamp.2026.101109}, abstract = {The Participants-as-Interfaces (PaI) approach to system composition suggests that participants in a system can be considered interfaces to the outside world. Given a set of systems, one participant per system is chosen to play the role of an interface. When systems are composed, these interface participants are replaced by gateways that communicate with each other by forwarding messages. The PaI approach for systems of asynchronously communicating finite state machines (CFSMs) has been exploited in the literature for binary composition where the forwarding policy is necessarily unique. In this paper we consider the case of multiple system composition and extend preliminary work to the case where interactions among gateways can be mediated by additional orchestrating participants that comply with a given connection model. We represent the interactions among gateways as CFSM systems (called orchestrated connection policies) and prove that a number of relevant communication properties (e.g. deadlock-freedom, reception-error-freedom) are preserved by orchestrated PaI multicomposition, provided that the orchestrated connection policy used also satisfies the communication property in question.}, issn = {2352-2208}, }

Articles in conference or workshop proceedings

Dirk Beyer and Marian Lingsch-Rosenfeld. SvLibChecker: A Light-Weight Tool for Software Model Checking. In Proceedings of the 38th International Conference on Computer-Aided Verification (CAV 2026, Lisbon, Portugal, July 26-29), 2026. Springer. Keyword(s): Software Model Checking Funding: DFG-CONVEY PDF

BibTeX Entry

@inproceedings{CAV26b, author = {Dirk Beyer and Marian Lingsch-Rosenfeld}, title = {\textsc{SvLibChecker}: {A} Light-Weight Tool for Software Model Checking}, booktitle = {Proceedings of the 38th International Conference on Computer-Aided Verification ({CAV} 2026, Lisbon, Portugal, July 26-29)}, pages = {}, year = {2026}, publisher = {Springer}, url = {}, pdf = {https://www.sosy-lab.org/research/pub/2026-CAV.SvLibChecker_A_Light-Weight_Tool_for_Software_Model_Checking.pdf}, abstract = {}, keyword = {Software Model Checking}, annote = {to appear}, artifact = {}, doinone = {Unpublished: Last checked: 2026-03-22}, funding = {DFG-CONVEY}, video = {}, }

Additional Infos

to appear
Dirk Beyer, Marek Jankola, and Marian Lingsch-Rosenfeld. Transition Invariants Revisited: Termination Witnesses and Their Validation. In Proceedings of the 38th International Conference on Computer-Aided Verification (CAV 2026, Lisbon, Portugal, July 26-29), 2026. Springer. Keyword(s): Software Model Checking Funding: DFG-CONVEY PDF

BibTeX Entry

@inproceedings{CAV26a, author = {Dirk Beyer and Marek Jankola and Marian Lingsch-Rosenfeld}, title = {Transition Invariants Revisited: {Termination} Witnesses and Their Validation}, booktitle = {Proceedings of the 38th International Conference on Computer-Aided Verification ({CAV} 2026, Lisbon, Portugal, July 26-29)}, pages = {}, year = {2026}, publisher = {Springer}, url = {}, pdf = {https://www.sosy-lab.org/research/pub/2026-CAV.Transition_Invariants_Revisited_Termination_Witnesses_and_Their_Validation.pdf}, abstract = {}, keyword = {Software Model Checking}, annote = {to appear}, artifact = {}, doinone = {Unpublished: Last checked: 2026-03-22}, funding = {DFG-CONVEY}, video = {}, }

Additional Infos

to appear
Dirk Beyer, Po-Chun Chien, Bo-Yuan Huang, Nian-Ze Lee, and Thomas Lemberger. HarnessForge: Automated Extraction of Verification Tasks from Industry-Scale Software Projects. In Proceedings of the 34th ACM International Conference on the Foundations of Software Engineering (FSE Companion 2026, Montreal, Canada, July 5-9), 2026. ACM.
Keyword(s): Software Model Checking, Software Testing, Firmware verification Funding: DFG-CONVEY, DFG-BRIDGE, Intel PDF Video Supplement
Artifact(s)
1. doi:10.5281/zenodo.18348148
Abstract

We present HarnessForge, a command-line tool to streamline the extraction of verification tasks from industry-scale software projects written in C. Industry-scale code consists of multiple source and header files with various build processes, complicating the creation of verification tasks and hindering the applicability of off-the-shelf software verifiers. HarnessForge handles this complexity for verification engineers and tools, allowing harnesses to be structured independently from the code under verification. It automatically derives build commands, assembles relevant source files, and performs static program slicing to remove irrelevant components. To demonstrate its applicability, we use HarnessForge to create a total of 949 verification tasks from three projects: AWS C Common, GNU Coreutils, and Intel TDX Module. All created tasks were used in SV-COMP 2026. A demo video is available at youtu.be/wHPEfQ3NBFQ.

BibTeX Entry

@inproceedings{FSE26, author = {Dirk Beyer and Po-Chun Chien and Bo-Yuan Huang and Nian-Ze Lee and Thomas Lemberger}, title = {HarnessForge: Automated Extraction of Verification Tasks from Industry-Scale Software Projects}, booktitle = {Proceedings of the 34th {ACM} International Conference on the Foundations of Software Engineering ({FSE} Companion 2026, Montreal, Canada, July 5-9)}, pages = {}, year = {2026}, publisher = {ACM}, url = {https://gitlab.com/sosy-lab/software/harnessforge}, pdf = {https://www.sosy-lab.org/research/pub/2026-FSE.HarnessForge_Automated_Extraction_of_Verification_Tasks_from_Industry-Scale_Software_Projects.pdf}, abstract = {We present HarnessForge, a command-line tool to streamline the extraction of verification tasks from industry-scale software projects written in C. Industry-scale code consists of multiple source and header files with various build processes, complicating the creation of verification tasks and hindering the applicability of off-the-shelf software verifiers. HarnessForge handles this complexity for verification engineers and tools, allowing harnesses to be structured independently from the code under verification. It automatically derives build commands, assembles relevant source files, and performs static program slicing to remove irrelevant components. To demonstrate its applicability, we use HarnessForge to create a total of 949 verification tasks from three projects: AWS C Common, GNU Coreutils, and Intel TDX Module. All created tasks were used in SV-COMP 2026. A demo video is available at <a href="https://youtu.be/wHPEfQ3NBFQ">youtu.be/wHPEfQ3NBFQ</a>.}, keyword = {Software Model Checking, Software Testing, Firmware verification}, artifact = {10.5281/zenodo.18348148}, doinone = {Unpublished: Last checked: 2026-03-22 (10.1145/3803437.3806420)}, funding = {DFG-CONVEY, DFG-BRIDGE, Intel}, video = {https://youtu.be/wHPEfQ3NBFQ}, }
Dirk Beyer and Jan Strejček. Evaluating Software Verifiers for C, Java, and SV-LIB (Report on SV-COMP 2026). In Proceedings of the 31th International Conference on Tools and Algorithms for the Construction and Analysis of Systems (TACAS 2026, Turin, Italy, April 11-16), part 2, LNCS 16506, 2026. Springer. doi:10.1007/978-3-032-22749-2_23 Keyword(s): Competition on Software Verification (SV-COMP), Competition on Software Verification (SV-COMP Report), Software Model Checking Funding: DFG-CONVEY, DFG-BRIDGE Publisher's Version PDF Poster

BibTeX Entry

@inproceedings{TACAS26b, author = {Dirk~Beyer and Jan~Strejček}, title = {Evaluating Software Verifiers for {C}, {Java}, and {SV-LIB} (Report on {SV-COMP 2026})}, booktitle = {Proceedings of the 31th International Conference on Tools and Algorithms for the Construction and Analysis of Systems (TACAS~2026, Turin, Italy, April 11-16), part~2}, pages = {}, year = {2026}, series = {LNCS~16506}, publisher = {Springer}, doi = {10.1007/978-3-032-22749-2_23}, poster = {https://www.sosy-lab.org/research/pst/2026-03-17_PROG26_SVCOMP26.pdf}, keyword = {Competition on Software Verification (SV-COMP),Competition on Software Verification (SV-COMP Report),Software Model Checking}, annote = {<a href="https://www.sosy-lab.org/research/pst/2026-03-17_PROG26_SVCOMP26.pdf">Poster available online</a>.}, funding = {DFG-CONVEY, DFG-BRIDGE}, }

Additional Infos

Poster available online.
Dirk Beyer, Po-Chun Chien, Bo-Yuan Huang, Nian-Ze Lee, and Thomas Lemberger. A Case Study in Firmware Verification: Applying Formal Methods to Intel TDX Module. In Proceedings of the 31th International Conference on Tools and Algorithms for the Construction and Analysis of Systems (TACAS 2026, Turin, Italy, April 11-16), part 2, LNCS 16506, pages 42-64, 2026. Springer. doi:10.1007/978-3-032-22749-2_3
Keyword(s): Software Model Checking, Software Testing, Firmware verification Funding: DFG-CONVEY, DFG-BRIDGE, Intel Publisher's Version PDF Presentation Poster Supplement
Artifact(s)
1. doi:10.5281/zenodo.18371342
Abstract

Firmware underpins system security but remains challenging to verify due to hardware dependency, specialized coding idioms, and limited open-source examples. Manual verification approaches, while common in industry, are labor-intensive and difficult to scale. This paper presents a detailed case study on applying automatic formal methods for software to a security-critical firmware component in Intel Trust Domain Extensions (TDX), known as TDX Module. In this study, we employ six state-of-the-art C-program analyzers on the production TDX Module firmware, leveraging techniques ranging from bounded model checking and symbolic execution to abstract interpretation. Our empirical evaluation identifies obstacles unique to firmware, highlights harness-design decisions essential for verifying industry-scale code bases, and demonstrates opportunities in advanced slicing for more scalable verification. Although the case study focuses on TDX Module, the findings are broadly applicable to large-scale, low-level programs and have already influenced the software-verification community, such as standardizing nondeterministic object initialization. All verification tasks and proof harnesses are publicly released to foster reproducible research and future tool development.

BibTeX Entry

@inproceedings{TACAS26a, author = {Dirk Beyer and Po-Chun Chien and Bo-Yuan Huang and Nian-Ze Lee and Thomas Lemberger}, title = {{A} Case Study in Firmware Verification: {Applying} Formal Methods to {Intel TDX Module}}, booktitle = {Proceedings of the 31th International Conference on Tools and Algorithms for the Construction and Analysis of Systems (TACAS~2026, Turin, Italy, April 11-16), part~2}, pages = {42-64}, year = {2026}, series = {LNCS~16506}, publisher = {Springer}, doi = {10.1007/978-3-032-22749-2_3}, url = {https://www.sosy-lab.org/research/tdx-module-firmware-verification/}, pdf = {https://www.sosy-lab.org/research/pub/2026-TACAS.A_Case_Study_in_Firmware_Verification_Applying_Formal_Methods_to_Intel_TDX_Module.pdf}, presentation = {https://www.sosy-lab.org/research/prs/2026-04-13_TACAS_Intel_TDX_Nian-Ze.pdf}, poster = {https://www.sosy-lab.org/research/pst/2026-04-13_ETAPS_Firmware_Verification_TDX_Module_Poster.pdf}, abstract = {Firmware underpins system security but remains challenging to verify due to hardware dependency, specialized coding idioms, and limited open-source examples. Manual verification approaches, while common in industry, are labor-intensive and difficult to scale. This paper presents a detailed case study on applying automatic formal methods for software to a security-critical firmware component in Intel Trust Domain Extensions (TDX), known as TDX Module. In this study, we employ six state-of-the-art C-program analyzers on the production TDX Module firmware, leveraging techniques ranging from bounded model checking and symbolic execution to abstract interpretation. Our empirical evaluation identifies obstacles unique to firmware, highlights harness-design decisions essential for verifying industry-scale code bases, and demonstrates opportunities in advanced slicing for more scalable verification. Although the case study focuses on TDX Module, the findings are broadly applicable to large-scale, low-level programs and have already influenced the software-verification community, such as standardizing nondeterministic object initialization. All verification tasks and proof harnesses are publicly released to foster reproducible research and future tool development.}, keyword = {Software Model Checking, Software Testing, Firmware verification}, annote = {This article has been selected as an "<a href="https://etaps.org/2026/programme/">ETAPS Distinguished Paper</a>." An <a href="https://www.sosy-lab.org/research/tdx-module-firmware-verification/tacas26-appendix.pdf">appendix</a> to this article is available on our supplementary webpage.}, artifact = {10.5281/zenodo.18371342}, funding = {DFG-CONVEY, DFG-BRIDGE, Intel}, }

Additional Infos

This article has been selected as an "ETAPS Distinguished Paper." An appendix to this article is available on our supplementary webpage.
Dirk Beyer. Evaluating Tools for Automatic Software Testing (Report on Test-Comp 2026). In Proceedings of the 29th International Conference on Fundamental Approaches to Software Engineering (FASE 2026, Turin, Italy, April 11-16), LNCS 16504, 2026. Springer. doi:10.1007/978-3-032-22774-4_23 Keyword(s): Competition on Software Testing (Test-Comp), Competition on Software Testing (Test-Comp Report), Software Testing Funding: DFG-COOP Publisher's Version PDF Poster

BibTeX Entry

@inproceedings{FASE26b, author = {Dirk~Beyer}, title = {Evaluating Tools for Automatic Software Testing (Report on Test-Comp 2026)}, booktitle = {Proceedings of the 29th International Conference on Fundamental Approaches to Software Engineering (FASE~2026, Turin, Italy, April 11-16)}, pages = {}, year = {2026}, series = {LNCS~16504}, publisher = {Springer}, doi = {10.1007/978-3-032-22774-4_23}, poster = {https://www.sosy-lab.org/research/pst/2026-03-17_PROG26_TestComp26.pdf}, keyword = {Competition on Software Testing (Test-Comp),Competition on Software Testing (Test-Comp Report),Software Testing}, annote = {<a href="https://www.sosy-lab.org/research/pst/2026-03-17_PROG26_TestComp26.pdf">Poster available online</a>.}, funding = {DFG-COOP}, }

Additional Infos

Poster available online.
Dirk Beyer, Thomas Lemberger, and Henrik Wachowitz. Testing in Formal Verification via Witness Generation (Empirical Evaluation). In Proceedings of the 29th International Conference on Fundamental Approaches to Software Engineering (FASE 2026, Turin, Italy, April 11-16), LNCS 16504, 2026. Springer.
Keyword(s): Software Model Checking, Software Testing Funding: DFG-CONVEY, DFG-COOP PDF
Artifact(s)
1. doi:10.5281/zenodo.18190957
Abstract

Despite potential synergies, the communities surrounding formal software verifiers and automatic test generators have developed different formats to describe a path to an error. Test generators export a test case whose execution makes the error observable, while verifiers produce a violation witness, an abstract description of the error path. Previous work transformed violation witnesses into test cases and evaluated their effectiveness, and other work found that test generators are more effective in bug finding than formal verifiers. While there are hybrid approaches to formal verification that utilize testing, there is no empirical evaluation of the usefulness of the off-the-shelf use of test generators in formal verification. We change that by transforming test cases to violation witnesses. This allows the use of test generators for finding counterexamples in verification scenarios like the Competition on Software Verification (SV-COMP), both directly and as parts of bigger verification systems. In a large empirical evaluation we examine the potential improvements this use of test generators can add to formal verifiers.

BibTeX Entry

@inproceedings{FASE26a, author = {Dirk~Beyer and Thomas~Lemberger and Henrik~Wachowitz}, title = {Testing in Formal Verification via Witness Generation (Empirical Evaluation)}, booktitle = {Proceedings of the 29th International Conference on Fundamental Approaches to Software Engineering (FASE~2026, Turin, Italy, April 11-16)}, pages = {}, year = {2026}, series = {LNCS~16504}, publisher = {Springer}, url = {}, pdf = {https://www.sosy-lab.org/research/pub/2026-FASE.Testing_in_Formal_Verification_via_Witness_Generation.pdf}, abstract = {Despite potential synergies, the communities surrounding formal software verifiers and automatic test generators have developed different formats to describe a path to an error. Test generators export a test case whose execution makes the error observable, while verifiers produce a violation witness, an abstract description of the error path. Previous work transformed violation witnesses into test cases and evaluated their effectiveness, and other work found that test generators are more effective in bug finding than formal verifiers. While there are hybrid approaches to formal verification that utilize testing, there is no empirical evaluation of the usefulness of the off-the-shelf use of test generators in formal verification. We change that by transforming test cases to violation witnesses. This allows the use of test generators for finding counterexamples in verification scenarios like the Competition on Software Verification (SV-COMP), both directly and as parts of bigger verification systems. In a large empirical evaluation we examine the potential improvements this use of test generators can add to formal verifiers.}, keyword = {Software Model Checking, Software Testing}, artifact = {10.5281/zenodo.18190957}, doinone = {Unpublished: Last checked: 2026-03-22}, funding = {DFG-CONVEY, DFG-COOP}, }
Dirk Beyer. Find, Use, and Conserve Tools for Formal Methods. In Proc. Festschrift Podelski 65th Birthday, LNCS 14765, pages 75-91, 2026. Springer. doi:10.1007/978-3-032-13711-1_5 Publisher's Version PDF

BibTeX Entry

@inproceedings{Podelski65, author = {Dirk~Beyer}, title = {Find, Use, and Conserve Tools for Formal Methods}, booktitle = {Proc. Festschrift Podelski 65th Birthday}, pages = {75-91}, year = {2026}, series = {LNCS~14765}, publisher = {Springer}, doi = {10.1007/978-3-032-13711-1_5}, }
Manuel Bentele, Max Barth, Marcel Ebbinghaus, Jan Körner, Daniel Dietsch, Matthias Heizmann, Dominik Klumpp, Frank Schüssele, and Andreas Podelski. Ultimate Automizer with a One-Dimensional Memory Model - (Competition Contribution). In Proc. TACAS, LNCS 16506, pages 589-594, 2026. Springer. doi:10.1007/978-3-032-22749-2_37
Keyword(s): Ultimate Automizer, Software Model Checking Publisher's Version PDF
Artifact(s)
1. doi:10.5281/zenodo.17735224
BibTeX Entry

@inproceedings{UAutomizer-SVCOMP2026, author = {Manuel Bentele and Max Barth and Marcel Ebbinghaus and Jan K{\"{o}}rner and Daniel Dietsch and Matthias Heizmann and Dominik Klumpp and Frank Sch{\"{u}}ssele and Andreas Podelski}, title = {Ultimate Automizer with a One-Dimensional Memory Model - (Competition Contribution)}, booktitle = {Proc.\ {TACAS}}, pages = {589--594}, year = {2026}, series = {LNCS~16506}, publisher = {Springer}, doi = {10.1007/978-3-032-22749-2_37}, url = {}, pdf = {}, keyword = {Ultimate Automizer, Software Model Checking}, annote = {}, artifact = {10.5281/zenodo.17735224}, funding = {}, }
Max Barth, Daniel Dietsch, Matthias Heizmann, and Marie-Christine Jakobs. Ultimate TestGen: Combining Parallel Trace Abstraction and Symbolic Path Execution (Competition Contribution). In Proc. FASE, LNCS 16504, pages 492-496, 2026. Springer. doi:10.1007/978-3-032-22774-4_28
Keyword(s): Ultimate Automizer, Software Model Checking, Test-Case Generation Funding: DFG-ConVeY, DFG-ReVeriX Publisher's Version PDF
Artifact(s)
1. doi:10.5281/zenodo.17792491
BibTeX Entry

@inproceedings{UTestGen-TestComp2026, author = {Max Barth and Daniel Dietsch and Matthias Heizmann and Marie{-}Christine Jakobs}, title = {Ultimate TestGen: Combining Parallel Trace Abstraction and Symbolic Path Execution (Competition Contribution)}, booktitle = {Proc. {FASE}}, pages = {492--496}, year = {2026}, series = {LNCS~16504}, publisher = {Springer}, doi = {10.1007/978-3-032-22774-4_28}, url = {}, pdf = {}, keyword = {Ultimate Automizer, Software Model Checking, Test-Case Generation}, annote = {}, artifact = {10.5281/zenodo.17792491}, funding = {DFG-ConVeY, DFG-ReVeriX}, }
Max Barth, Daniel Dietsch, Matthias Heizmann, and Marie-Christine Jakobs. Ultimate Paralizer: Parallel Trace Abstraction (Competition Contribution). In Proc. TACAS, LNCS 16506, pages 595-599, 2026. Springer. doi:10.1007/978-3-032-22749-2_38
Keyword(s): Ultimate Automizer, Software Model Checking Funding: DFG-ConVeY, DFG-ReVeriX Publisher's Version PDF
Artifact(s)
1. doi:10.5281/zenodo.17555227
BibTeX Entry

@inproceedings{UParalizer, author = {Max Barth and Daniel Dietsch and Matthias Heizmann and Marie{-}Christine Jakobs}, title = {Ultimate Paralizer: Parallel Trace Abstraction (Competition Contribution)}, booktitle = {Proc. {TACAS}}, pages = {595--599}, year = {2026}, series = {LNCS~16506}, publisher = {Springer}, doi = {10.1007/978-3-032-22749-2_38}, url = {}, pdf = {}, keyword = {Ultimate Automizer, Software Model Checking}, annote = {}, artifact = {10.5281/zenodo.17555227}, funding = {DFG-ConVeY, DFG-ReVeriX}, }
Áron Ricardo Perez-Lopez, Po-Chun Chien, Florian Lonsing, Samantha Archer, Ahmed Irfan, and Clark Barrett. Pono 2.0: A Versatile SMT-Based Model Checker for Safety and Liveness (Long Tool Paper). In Proceedings of the 27th International Symposium on Formal Methods (FM 2026, Tokyo, Japan, May 18-22), LNCS 16557, 2026. Springer.
Keyword(s): Btor2, SMT Funding: DFG-CONVEY, DFG-BRIDGE PDF Supplement
Artifact(s)
1. doi:10.5281/zenodo.18680798
Abstract

We introduce an updated version of the Pono model checker. Pono is a versatile SMT-based model checker that integrates multiple verification algorithms and interfaces with a wide range of SMT solvers through a solver-agnostic back end. It emphasizes usability, offering support for commonly used input formats and providing C++ and Python APIs for programmatic access. The new version 2.0 introduces several important new features, including support for liveness properties, new interpolation-based safety-checking engines, a new VMT-LIB front end, and a number of usability and performance enhancements. An evaluation of the new version demonstrates significant improvements in performance over its previous version and comparable performance to other state-of- the-art model checkers. These results highlight Pono 2.0's effectiveness as a general-purpose and easily extensible verification platform.

BibTeX Entry

@inproceedings{Pono-FM26, author = {Áron Ricardo Perez-Lopez and Po-Chun Chien and Florian Lonsing and Samantha Archer and Ahmed Irfan and Clark Barrett}, title = {{Pono} 2.0: {A} Versatile {SMT}-Based Model Checker for Safety and Liveness (Long Tool Paper)}, booktitle = {Proceedings of the 27th International Symposium on Formal Methods (FM~2026, Tokyo, Japan, May 18-22)}, pages = {}, year = {2026}, series = {LNCS~16557}, publisher = {Springer}, url = {https://github.com/stanford-centaur/pono}, pdf = {https://www.sosy-lab.org/research/pub/2026-FM.Pono_2.0_A_Versatile_SMT-Based_Model_Checker_for_Safety_and_Liveness.pdf}, presentation = {}, abstract = {We introduce an updated version of the Pono model checker. Pono is a versatile SMT-based model checker that integrates multiple verification algorithms and interfaces with a wide range of SMT solvers through a solver-agnostic back end. It emphasizes usability, offering support for commonly used input formats and providing C++ and Python APIs for programmatic access. The new version 2.0 introduces several important new features, including support for liveness properties, new interpolation-based safety-checking engines, a new VMT-LIB front end, and a number of usability and performance enhancements. An evaluation of the new version demonstrates significant improvements in performance over its previous version and comparable performance to other state-of- the-art model checkers. These results highlight Pono 2.0's effectiveness as a general-purpose and easily extensible verification platform.}, keyword = {Btor2, SMT}, artifact = {10.5281/zenodo.18680798}, doinone = {Unpublished: Last checked: 2026-04-30 (10.1007/978-3-032-26220-2_1)}, funding = {DFG-CONVEY, DFG-BRIDGE}, }
Thomas Lemberger and Henrik Wachowitz. AFL-TC: Transforming Fuzzer Test Inputs for Test-Comp (Competition Contribution). In Proceedings of the 29th International Conference on Fundamental Approaches to Software Engineering (FASE 2026, Turin, Italy, April 11-16), LNCS 16504, 2026. Springer.
Keyword(s): Software Testing, Fuzzing Funding: DFG-CONVEY, DFG-COOP PDF Supplement
Artifact(s)
1. doi:10.5281/zenodo.18060896
Abstract

AFL-TC is a tool chain that integrates AFL++ into the environment of Test-Comp. Coverage-guided greybox fuzzers like AFL++ produce raw binary data that is given to programs as input on stdin, without any knowledge of how this data is interpreted. In contrast to that, Test-Comp requires structured XML descriptions of test cases that list a sequence of individual input values, which are read whenever the program calls an input function. Previous adaptations of fuzzers used tool-specific modifications for Test-Comp. Now, AFL-TC demonstrates a flexible solution that decouples the test generation from the Test-Comp format: AFL-TC first runs AFL++ (or any other tester that produces binary input for stdin), then replays each input with a test harness that (a) records how the test input is interpreted by the program and (b) outputs the recording as corresponding XML elements. To provide test cases early, AFL-TC employs a monitor that triggers a transformation whenever new test files are discovered. AFL-TC participated in both Test-Comp categories Cover-Error and Cover-Branches. It placed 6th overall, 4th among active participants, and best in the sub-category C.coverage-branches.Arrays

BibTeX Entry

@inproceedings{FASE26b, author = {Thomas~Lemberger and Henrik~Wachowitz}, title = {{AFL-TC}: Transforming Fuzzer Test Inputs for {Test-Comp} (Competition Contribution)}, booktitle = {Proceedings of the 29th International Conference on Fundamental Approaches to Software Engineering (FASE~2026, Turin, Italy, April 11-16)}, pages = {}, year = {2026}, series = {LNCS~16504}, publisher = {Springer}, doi = {}, url = {https://gitlab.com/sosy-lab/software/test-to-witness}, pdf = {https://www.sosy-lab.org/research/pub/2026-FASE.AFL-TC_Transforming_Fuzzer_Test_Inputs_for_Test-Comp.pdf}, abstract = {AFL-TC is a tool chain that integrates AFL++ into the environment of Test-Comp. Coverage-guided greybox fuzzers like AFL++ produce raw binary data that is given to programs as input on stdin, without any knowledge of how this data is interpreted. In contrast to that, Test-Comp requires structured XML descriptions of test cases that list a sequence of individual input values, which are read whenever the program calls an input function. Previous adaptations of fuzzers used tool-specific modifications for Test-Comp. Now, AFL-TC demonstrates a flexible solution that decouples the test generation from the Test-Comp format: AFL-TC first runs AFL++ (or any other tester that produces binary input for stdin), then replays each input with a test harness that (a) records how the test input is interpreted by the program and (b) outputs the recording as corresponding XML elements. To provide test cases early, AFL-TC employs a monitor that triggers a transformation whenever new test files are discovered. AFL-TC participated in both Test-Comp categories Cover-Error and Cover-Branches. It placed 6th overall, 4th among active participants, and best in the sub-category C.coverage-branches.Arrays}, keyword = {Software Testing, Fuzzing}, artifact = {10.5281/zenodo.18060896}, funding = {DFG-CONVEY, DFG-COOP}, }

Internal reports

Dirk Beyer, Po-Chun Chien, Bo-Yuan Huang, Nian-Ze Lee, and Thomas Lemberger. A Case Study in Firmware Verification: Applying Formal Methods to Intel TDX Module (Appendix). 2026.
Keyword(s): Software Model Checking, Software Testing, Firmware verification Funding: DFG-CONVEY, DFG-BRIDGE, Intel PDF Supplement
Artifact(s)
1. doi:10.5281/zenodo.18371342
BibTeX Entry

@techreport{Appendix26, author = {Dirk Beyer and Po-Chun Chien and Bo-Yuan Huang and Nian-Ze Lee and Thomas Lemberger}, title = {{A} Case Study in Firmware Verification: {Applying} Formal Methods to {Intel TDX Module} (Appendix)}, year = {2026}, doi = {}, url = {https://www.sosy-lab.org/research/tdx-module-firmware-verification/}, pdf = {https://www.sosy-lab.org/research/tdx-module-firmware-verification/tacas26-appendix.pdf}, keyword = {Software Model Checking, Software Testing, Firmware verification}, annote = {This is an appendix to our <a href="https://www.sosy-lab.org/research/bib/All/index.html#TACAS26a">TACAS 2026 paper</a>.}, artifact = {10.5281/zenodo.18371342}, funding = {DFG-CONVEY, DFG-BRIDGE, Intel}, }

Additional Infos

This is an appendix to our TACAS 2026 paper.

Theses and projects (PhD, MSc, BSc, Project)

Johannes Tim Wildberger. LLM-assisted Generation of Formal-Verification Harnesses for the Intel TDX Module Firmware. Master's Thesis, LMU Munich, Software Systems Lab, 2026. Keyword(s): Software Model Checking, Firmware verification PDF

BibTeX Entry

@misc{WildbergerHarnessForgeMCP, author = {Johannes Tim Wildberger}, title = {LLM-assisted Generation of Formal-Verification Harnesses for the Intel TDX Module Firmware}, year = {2026}, pdf = {https://www.sosy-lab.org/research/msc/2026.Wildberger.LLM-assisted_Generation_of_Formal-Verification_Harnesses_for_the_Intel_TDX_Module_Firmware.pdf}, keyword = {Software Model Checking, Firmware verification}, field = {Computer Science}, howpublished = {Master's Thesis, LMU Munich, Software Systems Lab}, }
Yuan Cui. Value Analysis with Initial Precision from YML Correctness Witness. Master's Thesis, LMU Munich, Software Systems Lab, 2026.

BibTeX Entry

@misc{WitnessReuseValueAnalysis, author = {Yuan Cui}, title = {Value Analysis with Initial Precision from YML Correctness Witness}, year = {2026}, field = {Computer Science}, howpublished = {Master's Thesis, LMU Munich, Software Systems Lab}, }
Yuemiao Xiang. Parallel Test-Case Generation via Test-Goal Splitting. Master's Thesis, LMU Munich, Software Systems Lab, 2026.

BibTeX Entry

@misc{ParallelTestingGoalSplit, author = {Yuemiao Xiang}, title = {Parallel Test-Case Generation via Test-Goal Splitting}, year = {2026}, field = {Computer Science}, howpublished = {Master's Thesis, LMU Munich, Software Systems Lab}, }

Disclaimer:

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All person copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

Last modified: Wed May 13 01:05:06 2026 UTC