Abstract
Deciding satisfiability of propositional formulas is a quintessential example of a hard problem, and yet the quest for fast algorithms for it has occupied computer scientists both in theory and in practice. In this article, we survey recent progress on designing algorithms with strong refutation guarantees for smoothed instances of the k-SAT problem. Smoothed instances are formed by slight random perturbations of arbitrary instances, and their study is a way to bridge the gap between worst-case and average-case models of problem instances. Our methods yield new algorithms for smoothed k-SAT instances with guarantees that match those for the significantly simpler and well-studied model of random formulas. Additionally, they have led to a novel and unexpected line of attack on some longstanding extremal combinatorial problems in graph theory and coding theory. As an example, we will discuss the resolution of a 2008 conjecture of Feige on the existence of short cycles in hypergraphs.
1. Introduction
The famous SAT problem asks us to determine if a given propositional formula is satisfiable. That is, can we set the formula’s variables to 0 (False) or 1 (True) so that the formula evaluates to 1 (True)? In this article, we will focus on the k-SAT problem, where the propositional formula is further restricted to be in k-CNF form, that is, a logical AND of a collection of k-clauses, each of which is a logical OR of at most k literals (variables or their logical negations). For example, (x1∨¬x2∨x3)∧(x2∨x4∨¬x5) is a 3-CNF formula in variables x1,x2,…,x5, where ∨, ∧, and ¬ denote the logical OR, AND, and NOT operations, respectively. Given a k-CNF formula, we are interested in either finding a satisfying truth assignment, if it exists, or a “refutation”—a short, easily-checkable proof that the formula is unsatisfiable. Despite its simplicity, k-SAT is phenomenally expressive and models a long list of important discrete optimization problems. A decades-long quest has thus focused on designing algorithms for k-SAT in both theory and practice.
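To make the convention concrete, here is a minimal sketch (ours, not the article's) of evaluating a k-CNF formula, using the common DIMACS-style encoding where the integer +i stands for x_i and −i stands for ¬x_i:

```python
# A minimal k-CNF evaluator (our sketch). Literal +i means x_i, -i means its
# negation; this DIMACS-style encoding is an assumption, not the article's.

def evaluate_cnf(clauses, assignment):
    """Return True iff every clause contains at least one satisfied literal.

    clauses: list of clauses, each a list of nonzero signed variable indices.
    assignment: dict mapping variable index -> True/False.
    """
    return all(
        any(assignment[abs(lit)] == (lit > 0) for lit in clause)
        for clause in clauses
    )

# The article's example: (x1 ∨ ¬x2 ∨ x3) ∧ (x2 ∨ x4 ∨ ¬x5)
phi = [[1, -2, 3], [2, 4, -5]]
x = {1: True, 2: True, 3: False, 4: False, 5: True}
print(evaluate_cnf(phi, x))  # this assignment satisfies both clauses
```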
In this article, we will focus on finding refutations—“obviously verifiable” polynomial size (that is, short) contradictions that confirm unsatisfiability of a k-SAT formula. For any formula, we can simply tabulate each of the 2^n possible truth assignments x together with a clause violated by x. This is an obviously verifiable refutation but clearly too long—it’s exponential in size. On the other hand, if we get lucky and our formula happens to contain the two 1-clauses (x1)∧(¬x1), then it is manifestly unsatisfiable and the two 1-clauses serve as an easily verifiable and short certificate of unsatisfiability. Of course, it’s unrealistic for such clauses to magically occur in interesting inputs. But we can often infer additional clauses that must also be satisfied if the input formula is satisfiable, and hope that such an obviously verifiable short contradiction arises among the inferred clauses. Such clause learning (via mechanical resolution rules for deriving new clauses) forms an integral part of practical SAT solvers.
In his famous 1972 paper,18 Karp proved that ascertaining satisfiability or finding refutations for 3-SAT formulas is NP-hard. Thus, finding a fast algorithm for 3-SAT that succeeds (even approximately) on all possible inputs is likely hard. One might naturally expect finding refutations to get easier as the number of clauses increases (more clauses means more possibilities for contradictions to manifest), and so perhaps one might hope that denser instances get easier? No such luck! As it turns out, unless a widely believed, stronger variant of the P≠NP conjecture fails, there are no polynomial time algorithms for refuting k-SAT formulas unless they have essentially the maximum possible ≈n^k (out of ≈n^k possible) clauses. In fact, even substantially beating brute-force search and finding sub-exponential (for example, 2^√n) time algorithms is ruled out for formulas with ≈n^{k−1} clauses.14
Despite the grim picture presented by these hardness results, the extraordinary modeling power of k-SAT has motivated a concerted research effort for finding fast heuristics for k-SAT. On the practical side, new algorithmic ideas along with advances in software engineering have made modern SAT solvers13 a powerful and indispensable tool with applications to solving practical instances of optimization problems in planning, model checking, and verification of software systems. By encoding the task of finding counter-examples to mathematical conjectures into SAT formulas, SAT solvers have even helped resolve longstanding mathematical conjectures.6 On the theoretical side, algorithms research has focused on more careful modeling of input instances to escape worst-case hardness under minimal assumptions. Such beyond worst-case input models for hard discrete optimization problems such as k-SAT now form a vibrant area11 of research in algorithm design.
1.1 The Smoothed k-SAT Model
In 2007, Feige proposed9 his smoothed k-SAT model as a way to circumvent the all-pervasive hardness of k-SAT. He was inspired by a groundbreaking work of Spielman and Teng on smoothed analysis of the simplex algorithm. The simplex algorithm for linear programming, introduced by Dantzig in 1947, presented an uncomfortable disconnect between theoretical predictions and practical performance—here was a fast practical algorithm that also provably needed exponential time in the worst case! In 2001, Spielman and Teng convincingly resolved23 this tension and showed that the simplex method runs in polynomial time on smoothed inputs—inputs obtained by adding a small random perturbation to an arbitrary instance. Such a perturbation, of the sort one might reasonably expect practical instances to naturally possess, is enough to remove all hardness in even carefully crafted instances.
Feige’s model involves an analogous smoothing of a worst-case k-SAT formula: randomly perturb each literal (that is, change an unnegated variable to a negated one and vice-versa) in each clause with some small probability, say 0.01, independently. For example, given the two clauses (x1∨¬x2∨x3)∧(x2∨x4∨¬x5), we imagine tossing 6 independent coins with bias 0.01, one for each literal in each of the two clauses. The smoothed version of the first clause will have, for example, x1 negated if the first coin lands on heads, x2 unnegated if the second coin lands on heads, and so on.
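The perturbation is easy to simulate. Below is a hedged sketch of the smoothing operation, using our own representation (clauses as lists of signed integers, +i for x_i and −i for ¬x_i), not anything from the article:

```python
import random

# A sketch of Feige's smoothing: flip each literal's negation sign
# independently with probability p. Only negation patterns can change;
# the clause structure (which variables appear where) is untouched.

def smooth(clauses, p=0.01, rng=None):
    rng = rng or random.Random()
    return [[-lit if rng.random() < p else lit for lit in clause]
            for clause in clauses]

phi = [[1, -2, 3], [2, 4, -5]]   # (x1 ∨ ¬x2 ∨ x3) ∧ (x2 ∨ x4 ∨ ¬x5)
perturbed = smooth(phi, p=0.01, rng=random.Random(0))
# Each variable still appears in the same clause position after smoothing.
assert all(abs(a) == abs(b)
           for ca, cb in zip(phi, perturbed)
           for a, b in zip(ca, cb))
```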
If the input formula ϕ has ≥Cn clauses in n variables for some large enough constant C, then the resulting smoothed formula is unsatisfiable with high probability over the random perturbation. Feige thus asked if the task of finding refutations for smoothed k-SAT formulas gets significantly easier when compared to worst-case formulas. Equivalently, given our discussion above, do smoothed k-SAT formulas with ≪n^{k−1} clauses admit efficient refutation algorithms?
Prior works2,8,22 showed that the answer is indeed yes for the random k-SAT model—a fully smoothed model where both the negation patterns and the clause structure, that is, the variables appearing in the clauses (which are worst-case in smoothed k-SAT), are chosen independently at random. However, those algorithms strongly exploit the abundant randomness in the choice of variables appearing in the clauses.
In this article, we will survey recent developments1,16,17 on a new class of algorithms, based on the eigenvalues of certain specialized Kikuchi matrices (introduced earlier24 for statistical inference and to simplify algorithms for random k-SAT22 for even k), that yield optimal (modulo certain hardness conjectures) algorithms for smoothed k-SAT. As a result, these new algorithms succeed in refuting smoothed k-SAT formulas with m≳n^{k/2}·log n clauses, that is, ≪n^{k−1}, in polynomial time, and significantly beat brute-force search if m≳n^{1+∊}. In fact, our guarantees for smoothed k-SAT match the best-known (and conjectured optimal) results for the significantly simpler and restricted setting of random k-SAT formulas, provide quantitative bounds on the number of clauses that every truth assignment must violate, and extend far beyond k-SAT to handle all logical constraint satisfaction problems.
1.2 Spectral Methods for Combinatorics
While the theoretical advances in algorithms for k-SAT haven’t yet influenced practical SAT-solving, they already have some surprising applications to long-open problems in other areas of mathematics. These include resolving Feige’s conjecture on small even covers (cycles) in hypergraphs,16,17 making progress on the decades-long quest for optimal bounds for locally decodable4 and locally correctable19 error-correcting codes, and problems7 in additive number theory that generalize the famous theorem of Szemerédi.
The principle behind such applications is analogous to how SAT solvers helped resolve mathematical conjectures by encoding the search for a proof into a SAT formula. Our theoretical analog strongly exploits the newfound ability to tackle k-SAT formulas with a worst-case clause structure. In this article, we will discuss an application of this method to proving Feige’s conjecture on small cycles in hypergraphs. Surprisingly, proving this conjecture will let us go full circle to show even better refutations for smoothed k-SAT.
Short Cycles in Graphs.
Feige’s conjecture10 is a generalization of a basic result about short cycles in graphs. Recall that a graph (aka network) is simply a collection of pairs, called edges (modeling pairwise associations), on n nodes. A cycle in such a network is a sequence of nodes v1→v2→…→vℓ→v1 that starts and ends at the same node such that every consecutive pair has an edge between them. In his famous 1978 book,5 mathematician Béla Bollobás conjectured that every graph with n nodes where every node, on average, has d>2 edges (this quantity is called the average degree), must have a cycle of length at most ≈log_{d−1}(n). When d=2, the network can be a single giant cycle on all n vertices, which clearly has no cycle of length ≤n−1. For any d>2, however, the conjecture implies that we cannot even avoid a cycle of length O(log n)—an exponentially smaller bound than n—and thus signals a phase transition in the extremal length of the smallest cycle as the average degree d crosses 2.
Bollobás’s conjecture is an example of a result in extremal combinatorics. Such results uncover a truth that holds for all (hence “extremal”) mathematical objects. Here, it says that no matter how we might build a graph on n nodes, so long as it has average degree d>2, we cannot avoid introducing a short cycle. In their elegant 2002 paper,3 mathematicians Alon, Hoory and Linial confirmed this conjecture.
Short Cycles in Hypergraphs.
Feige’s conjecture asks a similar question about short cycles in hypergraphs. A k-uniform hypergraph is a collection of subsets of size k on n nodes, called hyperedges, that model associations between a larger number of nodes (instead of 2 in graphs). A 2-uniform hypergraph is simply a graph. In order to pose Feige’s question, we will identify a key property of cycles in graphs and use it to motivate a generalized notion of cycles in hypergraphs. Observe that every vertex appears in either two or zero edges in any cycle in a graph. In particular, a cycle is an even subgraph—a subset of edges on which every vertex appears an even number of times. Even subgraphs naturally generalize to hypergraphs. We will define a hypergraph cycle, or even cover, to be a collection C1,C2,…,Cℓ of hyperedges such that every node of the hypergraph is included in an even number of the Ci’s (that is, an even subhypergraph). (See Figure 1a and 1b.)
This definition may look odd (or perhaps a little too even?) at first. A cycle in a graph has the appealing combinatorial structure of a loop; hypergraph cycles seem to lack such a combinatorial meaning. Why did Feige (and why should we) care about them? Feige’s motivation for studying short cycles actually stemmed from finding refutations for k-SAT formulas (see the next section). Here, we outline a different source of motivation that comes from deep connections to the theory of error-correcting codes, because hypergraph cycles naturally relate to solving systems of linear equations.
To see why, let’s associate a variable to every node of a graph and think of each edge as specifying a linear equation modulo 2. Thus, the edge 1∼2 between nodes 1 and 2 corresponds to the equation x1+x2=b mod 2 where b∈{0,1}. A cycle in the graph then naturally corresponds to a subset of 2-sparse (that is, each equation has only two non-zero coefficients) equations that is linearly dependent—that is, if you want this subset of equations to be satisfied, then the right hand side b of at least one of the equations is already fixed (that is, cannot be chosen independently) once you fix the choice of the right hand sides for all the others. As a simple example, consider the graph on 3 nodes with edges 1∼2, 2∼3 and 3∼1. Suppose you knew that some x satisfies the two equations corresponding to the first two edges, x1+x2=0 mod 2 and x2+x3=1 mod 2. Then, adding the left hand sides of the two equations gives x1+2x2+x3=1 mod 2, which is equivalent to x1+x3=1 mod 2 since 2x2=0 mod 2 regardless of the value of x2. Thus, the right hand side of the third equation is determined/dependent and cannot be chosen independently of the first two equations. A cycle in a k-uniform hypergraph similarly corresponds to a linearly dependent subset of the k-sparse (that is, each equation has k non-zero coefficients) equations corresponding to the hyperedges.
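The triangle example can be checked mechanically. The following sketch (ours; NumPy assumed) verifies that the left-hand sides of the three edge equations are linearly dependent over GF(2):

```python
import numpy as np

# The article's triangle example over GF(2): edges 1~2, 2~3, 3~1 give three
# 2-sparse equations. A cycle corresponds to a set of rows summing to zero
# mod 2, i.e., a linear dependency among the left-hand sides.

rows = np.array([[1, 1, 0],    # x1 + x2
                 [0, 1, 1],    # x2 + x3
                 [1, 0, 1]])   # x3 + x1

# The three rows sum to the zero vector mod 2: a linear dependency.
assert (rows.sum(axis=0) % 2 == 0).all()
# Equivalently, the first two rows already determine the third mod 2.
assert ((rows[0] + rows[1]) % 2 == rows[2]).all()
```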
The length of the smallest cycle thus equals the size of the smallest linearly dependent subset of equations in a given system. Understanding the size of such a set turns out to have a whole gamut of applications, especially in designing error-correcting codes. Error correcting codes (or just codes, in short) are a systematic method of adding redundancy to a message so that, when the message is transmitted across a noisy channel and incurs errors, one can still decode it uniquely thanks to the redundancy. These systematic methods, or codes, naturally involve adding “parity checks”, that is, the right hand sides of an appropriately chosen set of linear equations evaluated at the message. In such codes, the length of the smallest linearly dependent subset of equations naturally corresponds to the distance—a crucial quantity that controls the number of errors that can be corrected. Understanding the smallest linear dependencies in k-sparse equations thus turns out to be equivalent to understanding the best possible distance (and thus, the largest possible rate of errors that can be tolerated) of an important class of codes called low density parity check codes, introduced by Gallager in the 1960s, with numerous theoretical and practical applications.
Feige’s Conjecture and the Resolution.
As in the case of graphs, we are interested in the extremal trade-off between the average degree (that is, the average number of hyperedges containing a node) and the length of the smallest hypergraph cycle in a k-uniform hypergraph. Given the connection to linear dependencies above and the basic fact that every collection of m=n+1 equations in n variables is linearly dependent, whenever the average degree d=mk/n>k, the hypergraph must have a (rather long) cycle of length at most n+1. Making an analogy to graphs, one might expect that if d≫k, the hypergraph must have an O(log n)-length cycle, but this turns out to be false! Mathematicians Assaf Naor and Jacques Verstraëte in 2006 showed21 that one needs (and this is enough!) average degree d≳n^{k/2−1} in order for every hypergraph to have an O(log n)-length cycle. For k=2, this matches a coarse version of the bound for graphs, but for k≥3 it suggests a new regime between d≈1 and d≈√n (for k=3) that has no analog for graphs. What happens when, for example, d≈n^{0.25}? Feige conjectured a precise behavior for this regime:
Every k-uniform hypergraph on n nodes with average degree ≈(n/ℓ)^{k/2−1} has an (ℓ·log n)-length cycle.
Feige’s conjecture (up to some log n factors in d) was motivated by the hypothesis (and is, in fact, equivalent to it) that there is no better construction of hypergraphs avoiding short cycles than simply choosing a random hypergraph. It is thus analogous to the famous 1947 theorem of Shannon (the birthplace of modern coding theory!) that random error correcting codes are “best” in a precise sense.
Despite being a foundational statement about hypergraphs, the conjecture remained largely open, with only some partial progress on a special case by Alon and Feige in 2009. Now, by invoking the new algorithms for solving smoothed k-SAT, we can essentially completely resolve it — up to one additional log n factor in the degree bound! This was established first by the authors with additional log n factors, which were later17 trimmed down to a single log n factor.
Every k-uniform hypergraph on n nodes with average degree ≈(n/ℓ)^{k/2−1}·log n has an (ℓ·log n)-length cycle.
As we will explain later in this article, the proof of this theorem makes a new connection between the success of our spectral approach for smoothed k-SAT and the existence of short cycles in hypergraphs. It forms the first (of a growing list of) applications of spectral refutations via Kikuchi matrices in combinatorics.
2. A New Spectral Approach
Our approach for finding refutations for smoothed k-SAT formulas relies on continuous, linear-algebraic algorithms based on spectral methods—methods that use eigenvalues and eigenvectors of matrices built from the input. This is in contrast to the largely discrete algorithmic toolkit (such as resolution refutations and their generalizations) in modern practical SAT solvers. In fact, it has been known for more than 20 years that even random k-SAT formulas with ≪n^{k−1} clauses do not admit efficient resolution refutations—a natural formalization of combinatorial refutations.
The spectral approach for refuting k-SAT formulas was conceived15 by Goerdt and Krivelevich back in 2001. A decade and a half of work led to spectral methods with conjectured-optimal guarantees for refuting random k-SAT in 2017. These spectral methods, however, are rather brittle and strongly rely on the randomness in the variables appearing in each clause. For smoothed k-SAT, such methods provably fail since the variables appearing in the clauses are completely arbitrary. In this article, we will present a new class of spectral methods, based on Kikuchi matrices, which, when combined with combinatorial pre-processing, provide robust methods for refutation that significantly simplify the results for random k-SAT and extend to smoothed k-SAT with no loss in performance.
From k-SAT to Degree k Polynomials.
To bring in spectral methods, we will make a simple but conceptually important translation between refuting a k-SAT formula and finding certificates of upper bounds on the maximum value of a degree k polynomial. For this purpose, it will be more convenient to view truth assignments as +1 (True) and −1 (False). Given a k-SAT formula ϕ with m clauses, let Φ:{−1,1}^n→N map truth assignments x∈{−1,1}^n to the number of clauses satisfied by x. Then, Φ(x) is clearly the sum of m functions Φ_C, one for each clause C in ϕ, where Φ_C(x)=1 if and only if x satisfies clause C. Since Φ_C depends only on k of the {±1}-valued variables, it is a degree k polynomial. For example, for k=3 and C=(x1∨x2∨x3):
Φ_C(x) = 7/8 + (1/8)·(x1+x2+x3−x1x2−x2x3−x3x1+x1x2x3).  (1)

To refute the k-SAT formula ϕ, we will find an easily-checkable certificate of the fact that Φ(x)=∑_C Φ_C(x)<m for all x. In fact, we will certify a stronger bound of Φ(x)≤0.99m, that is, that every x must violate not just 1 but in fact 1% of the clauses.
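Identity (1) is easy to verify by brute force over the 8 possible ±1 assignments. A quick sketch (ours) in Python:

```python
from itertools import product

# Checking identity (1): with +1 = True, the polynomial equals 1 exactly when
# the clause (x1 ∨ x2 ∨ x3) is satisfied, i.e., unless all three are -1.
for x1, x2, x3 in product([-1, 1], repeat=3):
    poly = 7/8 + (x1 + x2 + x3 - x1*x2 - x2*x3 - x3*x1 + x1*x2*x3) / 8
    satisfied = not (x1 == x2 == x3 == -1)
    assert poly == (1 if satisfied else 0)
```

All the arithmetic happens in exact eighths, so the floating-point comparison is exact here.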
Observe that Φ_C is a sum of 8 (in general, 2^k) monomials, each of degree ≤3 (≤k, more generally). A standard idea, going back to the early 2000s, is to certify bounds on 8 different polynomials, each obtained by taking one of the 8 terms corresponding to each C. We can then obtain a bound on Φ(x) by adding up all the 8 quantities. For each such polynomial obtained from a smoothed 3-SAT formula, with a little more work that we omit here, we can also assume that the coefficient of each monomial is an independent, uniform {±1} value. We thus focus on strong refutation of semirandom homogeneous polynomials Ψ(x) of the form:
Ψ(x) = ∑_{C∈H} b_C·∏_{i∈C} x_i,  (2)

where H, the instance hypergraph, is simply the collection of m different sets C⊆[n] of size k (corresponding to the original clauses), and the b_C∈{±1} are chosen uniformly and independently for each C∈H. Here, strong refutation involves certifying that Ψ(x)≤∊m for a sufficiently tiny ∊>0.
2.1 From Quadratic Polynomials to Matrices
Let us show how spectral methods show up by starting with the simplest setting of k=2. Then, each C∈H is a set of size 2, or simply a pair {i,j}⊆[n], and Ψ(x) from (2) is a degree 2 polynomial in x. The idea is to view such a degree 2 polynomial as a quadratic form.
For an n×n matrix A, its quadratic form on a vector v equals v⊤Av = 〈v,Av〉 = ∑_{i,j≤n} v_i·v_j·A(i,j). This expression is a homogeneous quadratic polynomial in v. Indeed, every quadratic polynomial is a quadratic form of an associated matrix and vice-versa. For our Ψ, let the n×n matrix A be defined by:
A(i,j) = b_{i,j} if C={i,j}∈H, and 0 otherwise.

Then, for any x∈{−1,1}^n, x⊤Ax = ∑_{i,j} x_i·x_j·A(i,j) = 2Ψ(x), since each pair {i,j}∈H contributes twice, once via A(i,j) and once via A(j,i).
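The correspondence between Ψ and the quadratic form of A can be checked on a toy instance. The sketch below (our own small example; NumPy assumed) builds A for a few signed pairs and verifies x⊤Ax = 2Ψ(x):

```python
import numpy as np

# A toy k=2 instance: H is a set of pairs with random signs b, A is the
# associated symmetric matrix, and the quadratic form of any ±1 vector
# recovers 2·Ψ(x) (each pair contributes via both A[i,j] and A[j,i]).
rng = np.random.default_rng(0)
n = 6
H = [(0, 1), (1, 2), (2, 5), (0, 3), (3, 4)]
b = {C: rng.choice([-1, 1]) for C in H}

A = np.zeros((n, n))
for (i, j), sign in b.items():
    A[i, j] = A[j, i] = sign

x = rng.choice([-1, 1], size=n)
psi = sum(sign * x[i] * x[j] for (i, j), sign in b.items())
assert x @ A @ x == 2 * psi
```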
A basic result in linear algebra allows upper-bounding any quadratic form of A as:
v⊤Av ≤ ∥v∥₂²·∥A∥₂,

where ∥v∥₂ = √(∑_i v_i²) is the length of the vector v and ∥A∥₂ is the “spectral norm”, or the largest singular value, of A. Since x has {±1}-coordinates and thus length √n, we obtain that Ψ(x)≤n·∥A∥₂.
This bound on Ψ(x) is easily verifiable. Given Ψ (obtained easily from the k-SAT formula ϕ), we form the matrix A and use a linear time algorithm (called power iteration) to obtain good estimates on the spectral norm ∥A∥2.
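As a sketch of this step (ours, not the article's implementation), power iteration for a symmetric matrix takes only a few lines; for a non-symmetric matrix one would iterate with A⊤A instead:

```python
import numpy as np

# A minimal power-iteration sketch for estimating the spectral norm of a
# symmetric matrix: repeatedly apply A and renormalize; the iterate aligns
# with the eigenvector of largest |eigenvalue|, so ∥A v∥ tends to ∥A∥₂.
def spectral_norm_estimate(A, iters=200, seed=0):
    rng = np.random.default_rng(seed)
    v = rng.standard_normal(A.shape[0])
    for _ in range(iters):
        w = A @ v
        v = w / np.linalg.norm(w)
    return np.linalg.norm(A @ v)

# Toy symmetric ±1 matrix with eigenvalues {1, 1, -2}, so ∥A∥₂ = 2.
A = np.array([[0., 1., 1.], [1., 0., -1.], [1., -1., 0.]])
est = spectral_norm_estimate(A)
exact = max(abs(np.linalg.eigvalsh(A)))
assert abs(est - exact) < 1e-6
```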
To certify Ψ(x)≤∊m, we need to check that ∥A∥₂≤∊m/n. A is an example of a random matrix since its entries b_{i,j} are uniformly and independently distributed in {±1}. There is a well-developed theory for understanding the typical value of ∥A∥₂ for such random matrices that allows us to conclude that ∥A∥₂ ≲ √(Δ_max·log n), where Δ_max is the maximum number of non-zero entries in any row of the matrix A. If the pairs C={i,j} are “equidistributed”, that is, any variable i participates in roughly the same number of pairs, then we would expect Δ_max ≈ Δ_avg ≈ m/n, where Δ_avg is the average number of non-zero entries in a row of A. Thus, ∥A∥₂ ≲ √((m/n)·log n), which is ≤∊m/n if m≳n·log n.
What if the set of pairs H is not regular and some i is “over-represented” in the pairs of H? While we omit the formal details, it turns out that one can use an elegant reweighting trick (discovered in17) on the matrix A that effectively allows us to assume regularity whenever Δ_avg≫1.
2.2 Generalizing to Quartic Polynomials
The case of odd k turns out to be technically challenging, so let us skip k=3 and generalize the above approach to Ψ(x) of degree k=4 (that is, the case of 4-SAT). Now, Ψ(x) is not quadratic in x. We will instead view it as a quadratic form in (n choose 2) variables, each corresponding to a quadratic monomial x_i·x_j in the original assignment x. Let us write x∘2 for the vector in R^(n choose 2) indexed by pairs {i1,i2}⊆[n] with the entry at {i1,i2} given by x_{i1}·x_{i2}. Define the (n choose 2)×(n choose 2) matrix A:
A({i1,i2},{j1,j2}) = b_{i1,i2,j1,j2} if C={i1,i2,j1,j2}∈H, and 0 otherwise.

Then, as before, we can observe that:
(x∘2)⊤A(x∘2) = ∑_{{i,j},{k,ℓ}} x_i·x_j·x_k·x_ℓ·A({i,j},{k,ℓ}) = 6Ψ(x).

The factor 6 comes from the fact that there are (4 choose 2)=6 different ways to break a set C of size 4 into a pair of pairs, each of which arises as a term in the quadratic form above.
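The factor of 6 can be verified numerically. The sketch below (our own toy 4-uniform instance; NumPy assumed) builds the pair-indexed matrix and checks (x∘2)⊤A(x∘2) = 6Ψ(x):

```python
import numpy as np
from itertools import combinations

# Toy 4-uniform hypergraph with random ±1 signs; each size-4 set C splits
# into an (ordered) pair of pairs in C(4,2) = 6 ways, each filling one entry.
rng = np.random.default_rng(1)
n = 7
H = [(0, 1, 2, 3), (1, 2, 4, 5), (0, 3, 5, 6)]
b = {C: rng.choice([-1, 1]) for C in H}

pairs = list(combinations(range(n), 2))
idx = {p: t for t, p in enumerate(pairs)}
A = np.zeros((len(pairs), len(pairs)))
for C, sign in b.items():
    for S in combinations(C, 2):                  # 6 choices of the row pair S
        T = tuple(sorted(set(C) - set(S)))        # the column pair is C \ S
        A[idx[S], idx[T]] = sign

x = rng.choice([-1, 1], size=n)
x2 = np.array([x[i] * x[j] for (i, j) in pairs])  # the vector x∘2
psi = sum(sign * np.prod(x[list(C)]) for C, sign in b.items())
assert x2 @ A @ x2 == 6 * psi
```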
We can now write
Ψ(x) ≤ ∥x∘2∥₂²·∥A∥₂ = (n choose 2)·∥A∥₂.

And a similar appeal to results in random matrix theory tells us that, with high probability, ∥A∥₂ ≲ √(Δ_max·log(n choose 2)) ≲ √(Δ_max·log n), where Δ_max is the maximum number of non-zero entries in any row of A. Equivalently, Δ_max is the maximum number of 4-clauses that a pair {i,j} participates in. When all pairs behave roughly similarly, we will have Δ_max ≈ Δ_avg ≲ m/(n choose 2), in which case
Ψ(x) ≤ (n choose 2)·√((m/(n choose 2))·log n) ≤ ∊m

with high probability if m ≳ n²·log n.
Early proofs of such facts used different tools from random matrix theory and worked for random 4-SAT by utilizing that H is a random collection of 4-sets in that case. Our approach here explicitly reveals that only an equi-distribution (that is, Δmax≈Δavg) property of H is required for the success of this approach. This allows us, via a similar reweighting trick (that succeeds if Δavg≫1) discussed above, to obtain a result that works for arbitrary (worst-case) hypergraphs.
Let’s finish this part by noting our quantitative bounds. For k=2, our refutation succeeded when m≳n·log n. For k=4, we instead needed m≳n²·log n. Indeed, for arbitrary even k, a similar argument yields a bound of m≳n^{k/2}·log n—a significant improvement over the Ω(n^k) clauses required for refuting an unsatisfiable k-SAT formula in the worst-case, showing us the power of the spectral approach.
2.3 Beyond Basic Spectral Refutations
A smoothed k-SAT formula is unsatisfiable with high probability whenever it has m≳n clauses. But our spectral refutations above require m≳n²·log n—a bound higher by a factor ≈n (and n^{k/2−1} for arbitrary even k). This is because our approach fails whenever the average number of entries in a row of A, that is, Δ_avg=m/(n choose 2), is ≪1. The question of whether there are non-trivial refutations for k-SAT formulas when m≪n^{k/2} (now called the spectral threshold) remained open for more than a decade and a half. In the meantime, researchers found evidence, in the form of restricted lower bounds,20 that there may be no polynomial time refutation algorithm for m≪n^{k/2}. This, nevertheless, left open the possibility of significantly beating brute-force search below this threshold. This possibility was realized for random k-SAT in 2017. We will now discuss a significantly simpler (described essentially in full below!) spectral approach that succeeds even for smoothed k-SAT.
We will continue to work with k=4. As before, we will write Ψ(x) as a quadratic form, but instead of the natural matrices we discussed above, we will use Kikuchi matrices, which we introduce next. First, though, a piece of notation: for sets S,T⊆[n], we let S⊕T=(S∪T)∖(S∩T) denote the symmetric difference of S and T.
For any r∈N, the level-r Kikuchi matrix for Ψ is an (n choose r)×(n choose r) matrix with rows and columns indexed by sets S,T⊆[n] of size r and entries given by:

A(S,T) = b_C if S⊕T=C∈H, and 0 otherwise.
Observe that for r=2, the above Kikuchi matrices specialize to the (n choose 2)×(n choose 2) matrix we saw in the previous subsection. Let’s see why Ψ(x) is a quadratic form of A.
For any assignment x∈{±1}^n, denote by x∘r the (n choose r)-dimensional vector with coordinates indexed by sets S⊆[n] of size r and S-th entry given by x_S=∏_{i∈S} x_i. Then, for D=(4 choose 2)·(n−4 choose r−2), we have:

(x∘r)⊤A(x∘r) = ∑_{S,T} x_S·x_T·A(S,T) = ∑_{C∈H} b_C ∑_{S,T: S⊕T=C} x_C = D·Ψ(x).

Here, we used that since S⊕T=C, for any x∈{±1}^n, x_S·x_T = x_{S⊕T}·(x_{S∩T})² = x_C, as x_i²=1 for every i. The last equality holds because the number of pairs (S,T) of r-size sets with S⊕T=C is exactly D for any set C of size 4. Observe how our notational trick of switching to {±1}-valued truth assignments paid off here.
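The counting behind D can again be checked on a toy instance. The following sketch (ours; NumPy assumed) builds the level-r Kikuchi matrix for k=4 and verifies the identity (x∘r)⊤A(x∘r) = D·Ψ(x):

```python
import numpy as np
from itertools import combinations
from math import comb

# Level-r Kikuchi matrix for a toy 4-uniform hypergraph: rows/columns are
# r-subsets S, T of [n], A[S,T] = b_C when the symmetric difference S⊕T
# is a hyperedge C, and the quadratic form of x∘r equals D·Ψ(x) with
# D = C(4,2)·C(n-4, r-2) pairs (S,T) realizing each C.
rng = np.random.default_rng(2)
n, r = 7, 3
H = [(0, 1, 2, 3), (1, 2, 4, 5)]
b = {C: rng.choice([-1, 1]) for C in H}

subsets = [frozenset(S) for S in combinations(range(n), r)]
idx = {S: t for t, S in enumerate(subsets)}
A = np.zeros((len(subsets), len(subsets)))
for S in subsets:
    for T in subsets:
        C = tuple(sorted(S ^ T))          # symmetric difference S⊕T
        if C in b:
            A[idx[S], idx[T]] = b[C]

x = rng.choice([-1, 1], size=n)
xr = np.array([np.prod(x[list(S)]) for S in subsets])   # the vector x∘r
psi = sum(sign * np.prod(x[list(C)]) for C, sign in b.items())
D = comb(4, 2) * comb(n - 4, r - 2)
assert xr @ A @ xr == D * psi
```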
Given this simple observation, we can now again construct the spectral upper bound Ψ(x) ≤ (1/D)·∥x∘r∥₂²·∥A∥₂ = ((n choose r)/D)·∥A∥₂. Furthermore, it turns out that powerful tools of random matrix theory still allow us to conclude, as before, that
∥A∥₂ ≲ √(Δ_max·log(n choose r)) ≈ √(Δ_max·r·log n).  (3)

The Kikuchi Superpower: Density Increment.
Why might this upper bound be better? The meat is in the density increment. As r grows, the number of rows of A grows. But the number of non-zero entries of A grows even faster, giving us a net increase in Δ_avg. Indeed, let m=n²·log n/ℓ for some parameter ℓ∈N. Then, since each C∈H contributes D non-zero entries, Δ_avg = mD/(n choose r) ≈ (n²·log n/ℓ)·(r²/n²) ≈ r²·log n/ℓ. In particular, even when ℓ≫log n (and thus we have m=n²·log n/ℓ≪n² clauses), choosing r large enough still allows us to obtain Δ_avg≫1.
Surprisingly, the rest of the proof idea is more or less the same as before! Let us assume, as we did at first in both the previous subsections, that all rows of A have roughly an equal number of non-zero entries. Such a condition holds if H is a random collection of sets of size 4. Then, Δ_max ≲ Δ_avg ≈ r²·log n/ℓ. Plugging this back into (3) gives
Ψ(x) ≤ ((n choose r)/D)·∥A∥₂ ≲ (m/Δ_avg)·√(Δ_avg·r·log n) ≤ ∊m

(using (n choose r)/D = m/Δ_avg) if Δ_avg ≳ √(Δ_avg·r·log n)/∊, that is, if Δ_avg ≳ r·log n/∊². Since Δ_avg = r²·log n/ℓ, this condition holds if r≥ℓ/∊².
Furthermore, as in the previous two subsections, we can use a variant of our reweighting trick to generalize this argument to arbitrary H without any further assumptions. To verify this bound algorithmically (that is, to “check” our refutation), we need to construct the matrix A and compute its spectral norm. This requires a running time proportional to the dimension of the matrix, which scales as ≈n^r. So, all in all, we obtain a roughly n^{ℓ/∊²} time algorithm to certify that Ψ(x)≤∊m whenever m≥n²/ℓ for any ℓ∈N. When m≈n^{1+δ} for some small δ>0, that is, even slightly superlinear, the runtime of our algorithm is strictly sub-exponential (specifically ≈2^{n^{1−δ}}) and thus asymptotically beats brute-force search.
Handling odd k: We described our approach so far for even k. The case of odd k turns out to be a little more involved. This has been true for spectral algorithms ever since the earliest spectral algorithms for the problem. The polynomial time case (for example, when m≳n^{1.5}·log n for 3-SAT, analogous to m≳n²·log n for 4-SAT) was first resolved in a work of Abascal, Guruswami, and Kothari in 2021. The full trade-off required introducing the correct generalizations of the Kikuchi matrices that we have described above. The analysis of the spectral norms of such matrices requires more effort and some additional combinatorial ideas.
We will not formalize these ideas here but note the following eventual result that we derive as a consequence:
For any k∈N and ℓ∈N, given a semi-random homogeneous degree k polynomial Ψ(x) with m ≥ n·(n/ℓ)^{k/2−1}·log n non-zero coefficients, there is a 2^{O(ℓ·log n/∊²)} time spectral algorithm that certifies Ψ(x)≤∊m. Consequently, for any k, we can refute smoothed k-SAT formulas with m clauses in time 2^{O(ℓ·log n/∊²)} as well.
3. Proving Feige’s Conjecture
Let us now see how our spectral algorithms for smoothed k-SAT provide a resolution of Feige’s conjecture. Our approach can be thought of as a theoretical analog of encoding the search for a proof into (un)satisfiability of a SAT formula and then running a practical SAT solver. Given a hypergraph H, we will build a polynomial Ψ_sat that is satisfiable (that is, attains its maximum possible value) by construction. We will then show that if H has no short cycle, our spectral methods certify that Ψ_sat is in fact unsatisfiable—a contradiction that completes our proof. Of course, instead of using the computer to find such a refutation, we will “find” it (that is, argue its existence) analytically by appealing to our spectral algorithms.
Let us now describe our argument in more detail. We are given an arbitrary k-uniform hypergraph H on n nodes with average degree d≈(n/ℓ)^{k/2−1}·log n. Our goal is to show that H must have an (ℓ·log n)-length cycle. Starting from H, we will define a homogeneous degree k polynomial:
Ψ_sat = ∑_{C∈H} x_C, where x_C = ∏_{i∈C} x_i.

Observe that Ψ_sat is clearly satisfiable. Indeed, if x_i=1 for every i, then Ψ_sat(x)=|H|, the maximum possible value.
Our key claim below will argue, using the analysis of our spectral algorithm from above, that if H has no short cycle then Ψsat must in fact be unsatisfiable:
If H has no cycle of length ≈(ℓ·log n)/∊², then max_{x∈{±1}^n} Ψ_sat(x) ≤ ∊|H|.
We thus immediately hit a contradiction unless H has a cycle of length ≈ℓ·log n.
Let us now discuss why this claim must hold for even k. For r∈N, we let A_sat be the Kikuchi matrix for the polynomial Ψ_sat that we built in the previous section:
A_sat(S,T) = 1 if S⊕T∈H, and 0 otherwise.

Then, we have: Ψ_sat(x) ≤ (n choose r)·∥A_sat∥₂.
We will now argue a rather odd-looking fact. Consider the polynomial Ψ_b defined below for arbitrary b_C∈{±1}:
Ψ_b = ∑_{C∈H} b_C·x_C.

We also let Ψ_random be the special case when the b_C’s are chosen uniformly at random and independently. Notice that Ψ_random has the same form as the polynomial Ψ we analyzed in the previous section. Let A_b be the Kikuchi matrix built from Ψ_b:
A_b(S,T) = b_C if S ⊕ T = C ∈ H, and 0 otherwise.

We will argue that if H has no cycle of length log (n choose r) ≈ ℓ·log n, then ∥A_b∥_2 ≈ ∥A_sat∥_2, no matter what the values of the b_C's are. Now, from the previous section, we know that (n choose r)·∥A_b∥_2 ≤ ∊·|H| for random b; thus, for every x:
Ψ_sat(x) ≤ (n choose r)·∥A_sat∥_2 ≈ (n choose r)·∥A_b∥_2 ≤ ∊·|H|.

This claim can appear strange. How can it be that ∥A_b∥_2 does not depend on the b_C's at all? In a sense, our proof reveals how short cycles in H are "necessary" for ∥A_b∥_2 to be as small as it is in the previous section!
Trace Moments and Spectral Norms.
To relate ∥A_sat∥_2 and ∥A_random∥_2, and to bring in the cycles in H, we will utilize a classical connection between ∥B∥_2 and the so-called trace moments of a matrix B that, in turn, are related to a certain combinatorial count of walks on B.
For any N×N symmetric matrix B, let ∥B∥_2 = σ_1 ≥ σ_2 ≥ ⋯ ≥ σ_N ≥ 0 be its N singular values in descending order. The trace tr(B) is simply the sum of the diagonal entries of B. For any even 2t ∈ ℕ, a classical observation in linear algebra says that the trace of the 2t-th power of B equals the sum of the 2t-th powers of its singular values:
tr(B^{2t}) = σ_1^{2t} + σ_2^{2t} + ⋯ + σ_N^{2t}.

This is helpful because we can now write:
∥B∥_2^{2t} = σ_1^{2t} ≤ tr(B^{2t}) = ∑_{i=1}^{N} σ_i^{2t} ≤ N·σ_1^{2t} = N·∥B∥_2^{2t}.

That is, the 2t-th power of ∥B∥_2 equals tr(B^{2t}) up to a factor of N. Taking 2t = log N, taking 1/2t-th powers on both sides, and noting that N^{1/2t} = N^{1/log N} is an absolute constant (2, for base-2 logarithms) gives:

∥B∥_2 ≤ tr(B^{2t})^{1/2t} ≤ N^{1/log N}·∥B∥_2 = O(∥B∥_2).

Thus, for 2t ≈ log N, tr(B^{2t})^{1/2t} is a faithful approximation to ∥B∥_2.
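This chain of inequalities is easy to sanity-check numerically; the sketch below (not from the article) uses an arbitrary random symmetric matrix, with the size 64 and the power 2t = 10 chosen purely for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)
Nmat = 64
B = rng.standard_normal((Nmat, Nmat))
B = (B + B.T) / 2            # a random symmetric test matrix

t2 = 10                      # the even power 2t
norm2 = np.linalg.norm(B, 2)
trace_moment = np.trace(np.linalg.matrix_power(B, t2))

# ||B||_2^{2t} <= tr(B^{2t}) <= N * ||B||_2^{2t}
assert norm2 ** t2 <= trace_moment + 1e-6
assert trace_moment <= Nmat * norm2 ** t2 + 1e-6

# Hence tr(B^{2t})^{1/2t} approximates ||B||_2 within a factor N^{1/2t}.
approx = trace_moment ** (1 / t2)
assert norm2 <= approx <= Nmat ** (1 / t2) * norm2 + 1e-9
print("||B||_2 =", norm2, "trace-moment estimate =", approx)
```
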
Relating ∥A_sat∥_2 and ∥A_random∥_2 via Trace Moments.
Using the above connection, we will now focus on arguing that tr(A_sat^{2t}) = tr(A_random^{2t}) for 2t = log N = log (n choose r) ≈ r·log n. This gives us ∥A_sat∥_2 ≈ ∥A_random∥_2.
We now recall another classical observation from basic linear algebra that relates the trace moments of a matrix to a certain combinatorial count of "walks" on it. For A (standing for A_sat or A_b), we have:
tr(A^{2t}) = ∑_{S_1,S_2,…,S_{2t}} A(S_1,S_2)·A(S_2,S_3)⋯A(S_{2t},S_1).   (4)

That is, tr(A^{2t}) is the sum, over all sequences of 2t row indices (that is, subsets of [n] of size r), of the product of the entries of A on consecutive elements of the sequence.
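The identity (4) can be checked by brute force on any small symmetric matrix; in the sketch below (an illustrative choice, not a Kikuchi matrix), A is the adjacency matrix of a triangle, whose eigenvalues 2, −1, −1 give tr(A^4) = 16 + 1 + 1 = 18:

```python
import itertools
import numpy as np

A = np.array([[0., 1., 1.],
              [1., 0., 1.],
              [1., 1., 0.]])
t2 = 4  # the power 2t

# Brute-force walk sum: over all length-2t index sequences, multiply the
# entries on consecutive elements, closing the walk back at the start.
walk_sum = 0.0
Nn = A.shape[0]
for seq in itertools.product(range(Nn), repeat=t2):
    prod = 1.0
    for i in range(t2):
        prod *= A[seq[i], seq[(i + 1) % t2]]
    walk_sum += prod

assert abs(walk_sum - np.trace(np.linalg.matrix_power(A, t2))) < 1e-9
print(walk_sum)  # 18.0
```
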
Every entry of A is either 0 or ±1, and for each non-zero entry A(S_i, S_{i+1}), we have S_i ⊕ S_{i+1} = C for some C ∈ H. Thus, any 2t-sequence (S_1, S_2, …, S_{2t}) that contributes a non-zero value to the sum above (and thus contributes exactly ∏_{i=1}^{2t} b_{C_i} to tr(A_b^{2t})) must correspond to a 2t-tuple (C_1, C_2, …, C_{2t}) of hyperedges from H, one for each entry A(S_i, S_{i+1}).
More is true about such a tuple (C_1, C_2, …, C_{2t}), as we demonstrate in the crucial observation below. Let 1_{C_i} ∈ {0,1}^n be the 0-1 indicator of the set C_i ⊆ [n]. That is, 1_{C_i}(j) = 1 if and only if j ∈ C_i.
Any (C_1, C_2, …, C_{2t}) corresponding to a non-zero term in (4) satisfies ∑_{i=1}^{2t} 1_{C_i} = 0 mod 2.
This crucial observation is actually quite simple to prove. We know that 1_{S_i} + 1_{S_{i+1}} = 1_{C_i} mod 2 for every i, since S_i ⊕ S_{i+1} = C_i. If we add up all the left-hand sides, we get a sum over the 1_{S_i}'s in which every 1_{S_i} appears exactly twice (since S_i occurs in exactly two entries, A(S_{i−1}, S_i) and A(S_i, S_{i+1})). Thus the left-hand side, and therefore also the right-hand side, must add up to the 0 vector modulo 2.
Next, notice that the j-th entry satisfies ∑_{i=1}^{2t} 1_{C_i}(j) = 0 mod 2 if and only if j occurs in an even number of the C_i's. Thus, the above observation says that the (multi)set {C_1, C_2, …, C_{2t}} is a cycle, or an even cover. This appears exciting, since we now have a direct relationship between tr(A_sat^{2t}) and cycles in H!
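This observation can be verified exhaustively on a small example. The sketch below (the hypergraph is invented for illustration) enumerates all closed walks of length 4 in a small Kikuchi matrix and checks that the hyperedges they traverse XOR to the empty set, that is, cover every vertex an even number of times:

```python
import itertools

n, r = 5, 2
# A small 4-uniform hypergraph, invented for illustration.
H = [frozenset(s) for s in [(0, 1, 2, 3), (1, 2, 3, 4), (0, 1, 3, 4)]]
subsets = [frozenset(s) for s in itertools.combinations(range(n), r)]

def edge(S, T):
    """Hyperedge labeling the Kikuchi entry (S,T), or None if the entry is 0."""
    d = S ^ T
    return d if d in H else None

t2 = 4  # walk length 2t
checked = 0
for walk in itertools.product(subsets, repeat=t2):
    Cs = [edge(walk[i], walk[(i + 1) % t2]) for i in range(t2)]
    if any(C is None for C in Cs):
        continue  # a zero term in the trace sum
    # The symmetric difference of all the C_i's must be empty:
    # every vertex appears in an even number of the C_i's.
    acc = frozenset()
    for C in Cs:
        acc = acc ^ C
    assert acc == frozenset()
    checked += 1
print(checked, "non-zero walks, all with even vertex coverage")
```
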
There is a crucial snag, though: the same C could recur multiple times in (C_1, C_2, …, C_{2t}). Indeed, if C_i = C for every i, or, more generally, if the C_i's appear in pairs, then every element j ∈ [n] occurs in an even number of the C_i's for the trivial reason that the C_i's themselves occur in pairs. Let us call such 2t-tuples trivial cycles; the C_i's occur in pairs, so they do not correspond to genuine cycles in H.
Now for our endgame. For every trivial cycle, since the C_i's appear in pairs, the product ∏_{i=1}^{2t} b_{C_i} contains an even number of copies of each b_C. Since b_C² = 1, this product must equal 1 regardless of the C_i's! Thus, if H has no cycle of length 2t, every non-zero term is a trivial cycle, and so contributes exactly 1, no matter what the b_C's are. In particular, tr(A_sat^{2t}) (where all b_C's equal 1) must equal tr(A_b^{2t}) regardless of the b_C's.
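The endgame can be seen concretely in a minimal sketch (hypergraph invented for illustration). Below, the hyperedge indicator vectors are linearly independent over GF(2), so H has no even cover at all; consequently tr(A_b^4) comes out exactly the same for every choice of signs b:

```python
import itertools
import numpy as np

n, r = 8, 2
# Hyperedge indicators are linearly independent over GF(2): no subset XORs
# to the empty set, so this hypergraph has no even cover ("cycle") at all.
H = [frozenset(s) for s in [(0, 1, 2, 3), (2, 3, 4, 5), (4, 5, 6, 7)]]
subsets = [frozenset(s) for s in itertools.combinations(range(n), r)]
idx = {S: i for i, S in enumerate(subsets)}

def kikuchi(signs):
    """Kikuchi matrix A_b for a given assignment of signs b_C."""
    A = np.zeros((len(subsets),) * 2)
    for S, T in itertools.combinations(subsets, 2):
        C = S ^ T
        if C in H:
            A[idx[S], idx[T]] = A[idx[T], idx[S]] = signs[C]
    return A

rng = np.random.default_rng(1)
A_sat = kikuchi({C: 1.0 for C in H})
t_sat = np.trace(np.linalg.matrix_power(A_sat, 4))
for _ in range(10):
    signs = {C: float(rng.choice([-1.0, 1.0])) for C in H}
    A_b = kikuchi(signs)
    # All non-zero walk terms are trivial cycles, so the trace is sign-free.
    assert abs(np.trace(np.linalg.matrix_power(A_b, 4)) - t_sat) < 1e-6
print("tr(A_b^4) =", t_sat, "for every choice of signs")
```
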
This finishes our argument, but it is perhaps helpful to summarize it: we related ∥A∥_2 (for both A = A_sat and A = A_b) to tr(A^{2t}). We then related tr(A^{2t}) to a sum over 2t-tuples (S_1, S_2, …, S_{2t}). The non-zero terms in this sum correspond to ∏_{i=1}^{2t} b_{C_i} for tuples (C_1, C_2, …, C_{2t}) with C_i ∈ H such that ∑_i 1_{C_i} = 0 mod 2; this step crucially uses the structure of the Kikuchi matrices. If H has no cycle of length 2t = log (n choose r), then in every non-zero term the C_i's must occur in pairs, in which case ∏_{i=1}^{2t} b_{C_i} = 1 and is thus independent of what the b_{C_i}'s themselves are.
4. Even Smaller Refutations
In the final act of this article, we come full circle to show how the purely combinatorial Feige's conjecture yields a surprising corollary for refutations of k-SAT. We discussed a spectral algorithm that finds refutations for smoothed k-SAT whenever the number of clauses is m ≳ n^{k/2}·log n. Improving on this spectral threshold, even in the substantially specialized setting of random 3-SAT, has been open (with accumulating evidence that it may be impossible) ever since the 2004 work8 that obtained the first such result.
In a surprising twist from 2006, Feige, Kim, and Ofek proved12 that random 3-SAT formulas with m ≳ n^{1.4} clauses (significantly short of the spectral threshold of ≈ n^{1.5}) admit short, polynomial-size refutations with high probability. That is, there exists a polynomial-size certificate, based on a clever combination of spectral and combinatorial ideas, which, if given, can easily convince us of the formula's unsatisfiability. But despite nearly two decades of effort, we do not know of polynomial-time algorithms that find such a certificate. The FKO result forces us to grapple with the possibility that there may be a marked difference between the existence of short certificates for NP-hard problems and efficient algorithms to find them. No such gap is known (or expected!) for worst-case k-SAT, making this a truly average-case phenomenon. And no such gap is known for any other discrete optimization problem, even in the average case. Indeed, ever since its discovery, FKO has been a one-of-a-kind result with an aura of mystery around it.
Are the mysterious FKO certificates a quirk of the random 3-SAT model? Or should we expect analogs in more general instances? We will now sketch how our results from the previous two sections yield a surprising corollary:
With high probability, smoothed 3-SAT formulas with m ≳ n^{1.4}·log n clauses admit an easily-checkable, polynomial-size refutation. More generally, a similar result holds for smoothed k-SAT formulas with n^{k/2−δ_k} clauses, where δ_k > 0 depends only on k.
That is, the FKO result extends, essentially without quantitative change, to smoothed 3-SAT formulas. The existence of short cycles in hypergraphs plays a major role in obtaining this corollary; indeed, this was a principal motivation for Feige's conjecture back in 2008. At the time, since Conjecture 1 was not known, FKO's proof used a sophisticated application of the second moment method from probability theory with rather complicated calculations. Given Theorem 2, our new certificate and its analysis will be simple.
The idea for constructing such refutations is quite simple. Our spectral refutation worked by splitting the polynomial Φ into 8 different polynomials of degree ≤ 3 and then refuting each polynomial via our spectral algorithm. As before, we will do the splitting and use the spectral algorithm for all terms of degree ≤ 2, that is, for all except the homogeneous degree-3 polynomial corresponding to the last term in (1), for which we will use a "combinatorial" method. Importantly, for the degree ≤ 2 terms, our polynomial-time spectral algorithm from Section 2.1 only needs m ≳ (n/∊²)·log n (instead of ≈ n^{1.5}) to certify a bound of ∊.
The basic observation behind the combinatorial method is rather simple. Let H be the 3-uniform hypergraph of monomials appearing in Ψ. Let {C_1, C_2, …, C_t} be a cycle in H, and let Ψ_cycle = ∑_{i=1}^{t} b_{C_i}·x_{C_i} be the "fragment" of Ψ that only keeps the monomials corresponding to the cycle. Then, if ∏_{i=1}^{t} b_{C_i} = −1 (which happens with probability 1/2 over the choice of the b_{C_i}'s), we claim that Ψ_cycle(x) ≤ t − 1 for every x. That is, every x must in fact fall at least 1 short of the maximum value of t on such a fragment. Suppose not, and say for some x, Ψ_cycle(x) = t. Then x_{C_i} = b_{C_i} for every 1 ≤ i ≤ t. Thus, ∏_{i=1}^{t} x_{C_i} = ∏_{i=1}^{t} b_{C_i} = −1. But on the other hand, since {C_1, C_2, …, C_t} is a cycle, every j occurs in an even number of the C_i's, and thus ∏_{i=1}^{t} x_{C_i} = ∏_{j=1}^{n} x_j^{even} = 1. This is a contradiction!
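This fragment argument is easy to verify by brute force. In the sketch below, the length-4 even cover and the signs are invented for illustration; since the signs multiply to −1, no assignment achieves the maximum value t:

```python
import itertools

# A length-4 even cover: every vertex appears in an even number of the C_i's.
cycle = [frozenset(s) for s in [(0, 1, 2), (1, 2, 3), (3, 4, 5), (0, 4, 5)]]
acc = frozenset()
for C in cycle:
    acc = acc ^ C
assert acc == frozenset()  # confirm it really is an even cover

t = len(cycle)
n = 6
b = [1, 1, 1, -1]          # signs with product -1: a "violated" cycle
assert b[0] * b[1] * b[2] * b[3] == -1

def psi_cycle(x):
    """The fragment Psi_cycle(x) = sum_i b_{C_i} * x_{C_i}."""
    total = 0
    for bc, C in zip(b, cycle):
        term = bc
        for j in C:
            term *= x[j]
        total += term
    return total

# Exhaustively check that every x falls short of the maximum value t.
best = max(psi_cycle(list(x)) for x in itertools.product([-1, 1], repeat=n))
assert best <= t - 1
print("max over all x:", best)
```

In fact, a parity argument gives the slightly stronger bound t − 2 here, since the number of −1 terms must be odd; the article only needs the loss of 1.
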
Here is how this basic observation helps. Suppose H has m ≈ n^{1.5}·log n/√ℓ hyperedges. Then we know from Theorem 2 that H contains a cycle of length ≈ ℓ·log n. We can remove the hyperedges in this cycle and repeatedly find cycles of length ≈ ℓ·log n in the residual hypergraph. This process gives us a "cycle partition" of 99% of the hyperedges of H into cycles of length ≈ ℓ·log n. From our argument above, for about half of such cycles, the product of the corresponding b_{C_i}'s will turn out to be −1. Let us call such cycles violated. Thus, we can write:
Ψ(x) = ∑_{violated cycles in partition} Ψ_cycle(x) + Ψ_remaining(x).

For each violated cycle, every x must "lose" at least 1 relative to the maximum possible value of the polynomial. So, at any x, Ψ must fall short of its maximum value by at least the number of violated cycles in the partition. Thus, Ψ(x) ≤ m − O(m/(ℓ·log n)) = (1 − O(1/(ℓ·log n)))·m. This is a significantly weaker bound than that of our spectral algorithm (∊m), but it is still non-trivial. It is also efficiently verifiable given a list of the violated cycles (of size at most O(n^{1.5}), so polynomial size).
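Finding the cycles that seed such a partition reduces to linear algebra: an even cover is exactly a GF(2) linear dependency among the hyperedge indicator vectors. A minimal sketch (the helper name find_even_cover and the example hypergraph are invented; a real implementation would look for short dependencies, as Theorem 2 guarantees they exist):

```python
def find_even_cover(hyperedges):
    """Return indices of a subset of hyperedges whose symmetric difference
    is empty (an even cover / cycle), or None if none exists."""
    # Gaussian elimination over GF(2) on bitmask rows, tracking which
    # input rows combine into each reduced row.
    basis = {}  # pivot bit -> (reduced vector, set of contributing indices)
    for i, C in enumerate(hyperedges):
        vec = 0
        for j in C:
            vec |= 1 << j
        combo = {i}
        while vec:
            pivot = vec.bit_length() - 1
            if pivot not in basis:
                basis[pivot] = (vec, combo)
                break
            bvec, bcombo = basis[pivot]
            vec ^= bvec
            combo ^= bcombo  # symmetric difference of index sets
        else:
            return sorted(combo)  # vec reduced to 0: dependency found
    return None

H = [(0, 1, 2), (1, 2, 3), (3, 4, 5), (0, 4, 5), (0, 1, 5)]
cover = find_even_cover(H)
assert cover is not None
# Verify: every vertex is covered an even number of times.
acc = 0
for i in cover:
    for j in H[i]:
        acc ^= 0  # placeholder keeps loop explicit below
v_acc = 0
for i in cover:
    v = 0
    for j in H[i]:
        v |= 1 << j
    v_acc ^= v
assert v_acc == 0
print(cover)  # [0, 1, 2, 3]
```
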
The corollary follows by combining this combinatorial certificate with the spectral bound on the degree ≤ 2 parts of Φ. The precise parameters are obtained by optimizing ℓ above, but we omit the details here.
5. Conclusion
In this article, we surveyed a new class of spectral algorithms based on Kikuchi matrices to find refutations, that is, easily verifiable proofs of unsatisfiability, for smoothed k-SAT formulas. The guarantees we obtained were as strong as the best-known (and conjectured optimal) ones for the substantially simpler random k-SAT formulas and substantially surpass the best-possible (assuming standard hardness conjectures) running times for worst-case k-SAT formulas at every clause density. The approach generalizes to yield similar results for all logical constraint satisfaction problems. We also saw the resolution of the 2008 conjecture of Feige on short cycles in hypergraphs as an example application. And as a consequence, we saw how to extend the one-of-a-kind Feige-Kim-Ofek result from random k-SAT formulas to all smoothed k-SAT formulas. Taken together, the results show that, per the current state-of-the-art, smoothed k-SAT is no harder than the substantially simpler random k-SAT formulas for both refutation algorithms and existence of short certificates.
The Kikuchi matrix method, the method of proving results by finding spectral refutations of a related propositional formula, coming out of this line of work, appears to be a promising new line of attack on problems in combinatorics and coding theory. It is a pleasing theoretical analog of the powerful approach of resolving mathematical problems via practical SAT solvers: a decidedly "computer science" approach to solving problems in mathematics. A few more applications, including progress on some decades-old problems in the theory of local error-correcting codes,4,19 have already appeared, and we anticipate more such results in the near future. (See Figure 2a and 2b.)