Computing Applications India Region Special Section: Hot Topics

Impactful Research and Tooling for Program Correctness

By Priyanka Darke, Ravindra Metta, Raveendra Kumar Medicherla, and R. Venkatesh

Posted Nov 1 2022

Introduction
VeriAbs
VeriFuzz
References
Authors

checklist in front of a laptop computer, illustration

In 2020, poor-quality software systems led to financial losses of approximately USD 2.08 trillion in the U.S. alone.¹⁹ Formal methods, such as bounded model checking (BMC), help to improve software quality, but they often fail to scale to the size and complexity of software. To solve these problems, we have been developing two frameworks—VeriAbs and VeriFuzz—with novel techniques, tools, and strategies.

VeriAbs

Aim. Prove code correctness or generate tests that demonstrate correctness violations.

Innovation. VeriAbs implements several novel verification techniques, such as loop shrinking,²⁰ aimed at verifying programs with complex loops, and improving the operational efficiency of popular verification techniques, such as k-induction.^3,9 Further, in our experience, the application of an appropriate sequence of techniques is more effective at verifying complex programs than any single technique. As shown in Figure 1, VeriAbs implements four novel strategies aimed at verifying different kinds of code, where each strategy consists of a sequence of verification techniques:

Figure 1. VeriAbs architecture (S: Program Safe, F: Property Fails, U: Unknown).

Strategy 1 comprises only k-induction, which is effective on loops with unstructured control flow. This does not occur in real-world code but in synthetic programs to challenge verification engines. In practice, we switch off the first strategy and retain k-induction as the final step of the default strategy.

Strategy 2 is useful for verifying quantified properties, such as whether arrays are sorted. This kind of code is commonly found in the automotive sector, where domain-specific parameters and configurations are coded using arrays and are accessed using pointers. This strategy correctly verifies 87% of all quantified properties in automotive applications.¹⁷

Strategy 3 aims to verify implementations of reactive systems and email clients that admit inputs to loops with very short ranges. As a result, 50% of the properties of such benchmarks in the International Competition on Software Verification (SV-COMP)⁵ could be verified using this strategy.

Strategy 4 is the default strategy of VeriAbs and handles programs containing linear acceleration of variables, which are very common in practice. This strategy works well in practice; it helped not only with the verification of real-world software¹³ but also to eliminate false positives from those generated by static analysis tools,¹⁰ such as the TCS Embedded Code Analyzer (ECA).¹⁸

Impact. The main technical novelty of VeriAbs is in the various abstractions it implements to scale program verification, which resulted in four publications^7,13,17,20 and six patents. The architectural advances in VeriAbs are published in a tool paper¹ and several competition-contribution papers.^2,9,12,15 To benchmark against other verifiers, VeriAbs has participated in SV-COMP since 2017, competing with more than 30 state-of-the-art verifiers. The results show that VeriAbs solves about 20% more programs than the competition,⁵ helping VeriAbs win a gold medal in SV-COMP’s ReachSafety category each year since 2019.

VeriAbs has also been used in different industrial projects.^{8,10,14,16,21} This experience helped us refine the strategies in VeriAbs to successfully verify code sizes of 1.5 MLOC for properties such as array index out of bounds, division by zero, variable value overflow and underflow, and null pointer dereferences. VeriAbs verified around 60% of these properties across domains including automotive, office automation, networking, device drivers, and security.

VeriFuzz

Aim. Automatic test generation for the discovery of complex software errors related to functionality, security, and privacy.

Innovation. Figure 2 shows the architecture of the VeriFuzz framework, which combines coverage guided fuzzing (CGF), BMC, and static analysis (SA) technology in novel ways for automatic test-case generation for programs. VeriFuzz uses the open source engines AFL²⁴ and CBMC⁶ respectively for CGF and BMC, and TCS’s in-house SA framework PRISM.¹⁸

Figure 2. VeriFuzz architecture.

The input to VeriFuzz is a program P and a desired coverage ∅. First, VeriFuzz extracts some syntactic features and classifies P into a category K, which determines the parameters to the BMC and the fuzzer. Next, P is instrumented to measure coverage, and BMC is then invoked to generate an initial test corpus T_c. The fuzzer invokes P_e, the executable of instrumented P, with the initial corpus T_c and measures the initial coverage. Each test from T_c is mutated several times to generate a set of new test inputs T_g. P_e is repeatedly executed with each test input t_g ∈ T_g and coverage is measured. The fitness check determines whether t_g should be added to T_c or not. This process is repeated until ∅ is achieved. VeriFuzz is efficient (takes only a few minutes) and effective (able to achieve ∅).

Impact. VeriFuzz combines the strengths of BMC and CGF for efficient and effective test generation,²² which led to a nomination for a best paper award. The architectural advances in VeriFuzz are published in competition-contribution papers.^11,23 To benchmark it against state-of-the-art test engines, VeriFuzz has participated in the International Competition on Software Testing (Test-Comp)⁴ each year since 2019, competing with more than 10 state-of-the-art test engines and evaluated on more than 3,000 C benchmarks. VeriFuzz has earned seven gold and five silver medals in Test-Comp and discovered errors 5- to 30-times faster than other top tools. In the industry, VeriFuzz was recently evaluated on an already well-tested smart contract implementation, where it exposed security errors, especially in the handling of Unicode string character buffers that were hitherto undiscovered.

Submit an Article to CACM

CACM welcomes unsolicited submissions on topics of relevance and value to the computing community.

You Just Read

Impactful Research and Tooling for Program Correctness

View in the ACM Digital Library

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and full citation on the first page. Copyright for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior specific permission and/or fee. Request permission to publish from permissions@acm.org or fax (212) 869-0481.

DOI

10.1145/3551665

November 2022 Issue

Published: November 1, 2022

Vol. 65 No. 11

Pages: 52-53

Table of Contents

Join the Discussion (0)

Become a Member or Sign In to Post a Comment

The Latest from CACM

Explore More

BLOG@CACM Dec 30 2024

Is Computer Science a Profession? Should It Be?

Robin K. Hill

Architecture and Hardware

man standing between infinite lines of code, illustration

News Dec 27 2024

Where Art and Tech Click: Algorithmic Photography

Mark Halper

Artificial Intelligence and Machine Learning

birds flying against a blue sky in an algorithmic photo

BLOG@CACM Dec 26 2024

AI: Beyond the Headlines

Erdin Beshimov

Artificial Intelligence and Machine Learning

Shape the Future of Computing

ACM encourages its members to take a direct hand in shaping the future of the association. There are more ways than ever to get involved.

Get Involved

Communications of the ACM (CACM) is now a fully Open Access publication.

By opening CACM to the world, we hope to increase engagement among the broader computer science community and encourage non-members to discover the rich resources ACM has to offer.

Learn More

VeriAbs

VeriFuzz

Impactful Research and Tooling for Program Correctness

DOI

November 2022 Issue

Related Reading

Join the Discussion (0)

Become a Member or Sign In to Post a Comment

Shape the Future of Computing

Communications of the ACM (CACM) is now a fully Open Access publication.