620 smtlib2 parser tokenizer wrongly rejects set logic commands and has a non helpful error message for parse for 1 items by baierd · Pull Request #621 · sosy-lab/java-smt

baierd · 2026-03-20T14:44:48Z

Fixes issues from #620 by adding a state-machine for processing SMT-LIB2 programs.

Closes #620.

…formula is returned

SMTLIB2 parsing and add a check for set-info

daniel-raffler

Our SMTLIB2 parser, to be exact its tokenizer, rejects set-logic commands that are not at the very beginning of the SMT-LIB2 file. This is wrong, these commands are allowed to be everywhere in the file and may be repeated with distinct settings!

I think this is backwards. Here is the diagram from the standard:

(Source of diagram: https://smt-lib.org/papers/smt-lib-reference-v2.6-r2021-05-12.pdf , 4.1 General Requirements, page 53)

So set-logic must be used before anything else can be done, and set-info can appear anywhere in the file

Did you run into any parsing issues with their benchmarks?

src/org/sosy_lab/java_smt/basicimpl/AbstractFormulaManager.java

ThomasHaas · 2026-03-21T08:54:04Z

Did you run into any parsing issues with their benchmarks?

The issues he faces come from the smtlib files I gave in #613. The output we generate conforms to the shape required by SMT-COMP because we once submitted our benchmarks there.

daniel-raffler · 2026-03-21T10:12:16Z

Did you run into any parsing issues with their benchmarks?

The issues he faces come from the smtlib files I gave in #613. The output we generate conforms to the shape required by SMT-COMP because we once submitted our benchmarks there.

Thanks! I've had a look at it, and there seem to be two issues:

We expect set-logic to be the first command in the file
We currently don't support push/pop while parsing

In this case the problem is easily fixed by just commenting out some lines:
manyRegReads.smt2.txt

When I now try the tests, I get a runtime of 8s with logic ALL and 3s for the BV logic. However, this is only true after adding back in the push/pop around the assertions. Otherwise, the runtime is <1s for both cases, and the BV logic appears to be slightly slower

@baierd: We could easily fix the first problem by skipping set-option/set-info before the set-logic command. However, I'm not sure how push/pop could be handled by our parser

ThomasHaas · 2026-03-21T11:32:29Z

In this case the problem is easily fixed by just commenting out some lines: manyRegReads.smt2.txt

When I now try the tests, I get a runtime of 8s with logic ALL and 3s for the BV logic. However, this is only true after adding back in the push/pop around the assertions. Otherwise, the runtime is <1s for both cases, and the BV logic appears to be slightly slower

If the query is not incremental, Z3 will use the SAT Solver independently of the logic set (easy to see by executing Z3 with higher verbosity levels). I think most solvers ignore (set-logic) for non-incremental queries: they might only check syntactic conformance to the specified logic but otherwise exploit the fact that they know the whole query and what settings are best for solving it.
But for incremental queries, the solver has do expect the worst-case, i.e., that every further query introduces new logics.
This is particularly true over the programmatic API, where the solver simply cannot know about future queries; all he knows is the upper bound on logics given by (set-logic).

However, I'm not sure how push/pop could be handled by our parser

I think you would need a more general concept of an SMTQuery which is a list of SMTCommands: assert(formula), push, pop, check-sat. I don't think the existing API can do it.

kfriedberger

lgtm.

Why does the formatter or Checkstyle not warn about the String concatenation?
Please fix, then merge. Thanks.

kfriedberger · 2026-03-21T12:01:25Z

src/org/sosy_lab/java_smt/api/FormulaManager.java

    checkArgument(!formulas.isEmpty(), "No assertion found in the SMTLIB string.");
+    checkArgument(
+        formulas.size() == 1,
+        "More than one assertion found in the SMT-LIB string, " + "please use parseAll(String).");


String concatenation not required. Remove the PLUS.

src/org/sosy_lab/java_smt/basicimpl/AbstractFormulaManager.java

baierd · 2026-03-21T12:33:23Z

So set-logic must be used before anything else can be done, and set-info can appear anywhere in the file

If you want to solve the query, it is needed before something is asserted. Which is less strict.
But we don't want to solve the query, we parse it to a formula. Hence it does not matter for us currently, as Karlheinz already stated.

Btw. i left the token detection in for if we actually decide that we want to do something with it + documentation.

The standard however states that For benchmarks in the official repository a call to set-info with this attribute can occur only as the first command of a script., hence i wanted to allow it explicitly.

@ThomasHaas do you know whether comments are allowed to be placed before this set-info command?
Also, thank you for the clarification!

baierd · 2026-03-21T12:43:38Z

However, I'm not sure how push/pop could be handled by our parser.

If there is only push and pop, everything is easy, you just need a list of lists of formulas. The problem occurs when check-sat is used, because that can not be easily reflected with generic data-structures without being easily confusing or wrongly used.

We could just add a class ParsingResult that returns the information in a proper form. This could easily also reflect the other commands like set-logic as well. With such a structure it would also be easy to add Prover support (i.e. solve the formulas as dictated in the SMTLIB2), and therefore a parsing command that also solves could easily be added.

ThomasHaas · 2026-03-21T14:02:26Z

@ThomasHaas do you know whether comments are allowed to be placed before this set-info command? Also, thank you for the clarification!

I'm not sure if this is permitted in SMT-COMP, but does it really matter? You want to parse general smtlib2 which is far less restrictive: you can have comments everywhere and (set-info) doesn't need to be called at all.
Only when dumping whole queries to smtlib you might want to adhere to a more strict form.

daniel-raffler · 2026-03-21T19:09:06Z

If you want to solve the query, it is needed before something is asserted. Which is less strict.
But we don't want to solve the query, we parse it to a formula. Hence it does not matter for us currently, as Karlheinz already stated.

But shouldn't we at least try to keep to the standard? If we just want to allow set-info before set-logic, it's easy enough to just add a flag:

There is really no need to remove the check for that

The standard however states that For benchmarks in the official repository a call to set-info with this attribute can occur only as the first command of a script., hence i wanted to allow it explicitly.

The problem here is that MathSAT adds (set-info :source |printed by MathSAT|) at the top of its output when dumping formulas. This will then cause the new precondition to fail:

private String sanitize(String formulaStr) {
    List<String> tokens = Tokenizer.tokenize(formulaStr);

    StringBuilder builder = new StringBuilder();
    int pos = 0; // index of the current token

    for (String token : tokens) {
      if (pos == 0 && Tokenizer.isSetInfoToken(token)) {
        // set-info call is allowed to be the very first command in the benchmark, but can also
        // appear repeatedly throughout a SMT2 program
        // Technically it is even required to be the very first command, and it must set the
        //  :smt-lib-version attribute. See SMT-LIB 2.7 §3.11.3
        // TODO: check that version >= 2?
        Preconditions.checkArgument(token.contains(":smt-lib-version"));

The token is (set-info :source |printed by MathSAT|) here, and does not contain :smt-lib-version

We could probably fix this, but I'm not sure the check is even needed. According to the standard these are additional requirements for their benchmark files:

Benchmarks in the official SMT-LIB repository at www.smt-lib.org must satisfy additional
requirements on the meta-level information they contain and the order in which it appears.

:smt-lib-version, :source, :license, and :category must be set exactly once,

:status must be set as many times as needed so that each occurrence of the command
check-sat-assuming or check-sat in the benchmark is preceded (not necessarily immediately) by a corresponding :status info

Moreover, the set set-info call for attribute :smt-lib-version must be the very first com-
mand in the benchmark.

We're trying to parse "regular" SMTLIB files, so this shouldn't apply to us

If there is only push and pop, everything is easy, you just need a list of lists of formulas. The problem occurs when check-sat is used, because that can not be easily reflected with generic data-structures without being easily confusing or wrongly used.

Yes, that might be one way to solve it. In this particular example, however, the interesting assertion isn't even on the stack anymore at the end of the SMTLIB file. Instead, it was replaced by false, which makes the problem trivially unsat. We would need some way to get the assertions at different points in time. However, this might just become a little to confusing for the user

baierd · 2026-03-21T19:58:23Z

But shouldn't we at least try to keep to the standard?

Yes and no. We try to follow the standard as far as it makes sense. Many solvers do non-standard things and we adapt.

If we just want to allow set-info before set-logic, it's easy enough to just add a flag:

No. This is a clear example for a state-machine. Starting with flags etc. would lead to confusing/complex code.

And yes, as stated above, the rule with set-info having to specify :smt-lib-version first is only relevant for the competition, i will remove the precondition.
I find it curious that the standard specifies things about the competition though.

…luate them if there are none

…ct the actual rules of set-logic in the SMTLIB2 standard with some leniency due to solvers often assuming ALL to be set implicitly

…nal simplifications. This currently only checks that our SMTLIB2 out/input is sticking to some important rules, but can be used to actually control the flow of possible parsing with solving etc.

baierd · 2026-03-23T10:48:14Z

So the problem is the following: if we assume the diagram above to be accurate, most solvers don't produce valid SMT-LIB2. Since we know this, we allow some leniency, most notably; we assume implicit logic selection for logic ALL if there is none. This merges the 2 states essentially. Since we are only parsing, and not solving, the SAT_MODE and UNSAT_MODE are essentially merged into RESULT_MODE for us. And since we are not allowing reset currently, what is left is a very limited automaton with 2 states.

Btw. i also found that the diagram above is incomplete, as exit is for example not handled. But it is quite clear how it should be handled (additional sink state).

I now added a implementation that can easily switch between all of this, depending on a single boolean flag to control whether we allow implicit set-logic 'ALL'.

@daniel-raffler i found that Boolector seems to return an empty SMTLIB2 String in a test case (that i now disabled). Could you open a issue and take a look?

Also, i found that we throws exceptions every time we load Bitwuzlas options due to our "further options" implementation (in our code, not Bitwuzlas). While not part of this PR, I still added a small simplification to check for an empty options string to skip the evaluation (as it made my debugging harder ;D). @daniel-raffler would you be so kind and open another issue for this and take a look? Exception based control flow is Java is a very bad idea and we should not do it if possible.

…ts-set-logic-commands-and-has-a-non-helpful-error-message-for-parse-for-1-items

baierd · 2026-03-23T11:00:38Z

~~TODO: is a SMTLIB2 program with only (check-sat) a valid program if we assume that the logic is set implicitly? (Because we don't allow this currently)~~

Edit: nvm we actually have tests for this. I need to change the current behavior.

…pty strings to be parsed

…leading to exception to a checkArgument()

daniel-raffler

Looks fine to me! There are a few TODOs left open, and a lot of the methods in SMTLIB2ProgramStateMachine are currently unused. Do you plan to extend this later? Otherwise, we could probably simplify the class a bit

daniel-raffler · 2026-04-02T20:13:49Z

src/org/sosy_lab/java_smt/basicimpl/AbstractFormulaManager.java

-    int pos = 0; // index of the current token
+
+    // SMTLIB2ProgramState tracks that the SMTLIB2 query conforms to the rules outlined in the
+    // standard. SMTLIB2ProgramStateMachine models the


This comment seems to have been cut off

baierd added 3 commits March 20, 2026 14:51

Add check to parse() to return a proper error message in case only 1 …

2190f46

…formula is returned

Add SMTLIB2 tokenizer for set-info command

cf191f3

Allow set-logic in all cases in

62a97f9

SMTLIB2 parsing and add a check for set-info

baierd linked an issue Mar 20, 2026 that may be closed by this pull request

SMTLIB2 parser (tokenizer) wrongly rejects set-logic commands and has a non-helpful error message for parse() for > 1 items #620

Open

baierd requested review from daniel-raffler and kfriedberger March 20, 2026 14:44

daniel-raffler requested changes Mar 20, 2026

View reviewed changes

src/org/sosy_lab/java_smt/basicimpl/AbstractFormulaManager.java Outdated Show resolved Hide resolved

src/org/sosy_lab/java_smt/basicimpl/AbstractFormulaManager.java Outdated Show resolved Hide resolved

kfriedberger previously approved these changes Mar 21, 2026

View reviewed changes

baierd added 4 commits March 23, 2026 11:41

Add simplification to Bitwuzlas option resolving so that we don't eva…

237a4b5

…luate them if there are none

Refactor some SanitizerTest tests into dedicated test cases and refle…

1d589d9

…ct the actual rules of set-logic in the SMTLIB2 standard with some leniency due to solvers often assuming ALL to be set implicitly

Disable a VariableNamesTest for Boolector, as it returns an empty string

ce363b6

Add a SMTLIB2 state-machine according to the standard with some optio…

4b9656a

…nal simplifications. This currently only checks that our SMTLIB2 out/input is sticking to some important rules, but can be used to actually control the flow of possible parsing with solving etc.

baierd dismissed kfriedberger’s stale review via 4b9656a March 23, 2026 10:44

baierd added 3 commits March 23, 2026 11:49

Merge branch 'master' into 620-smtlib2-parser-tokenizer-wrongly-rejec…

9a93da1

…ts-set-logic-commands-and-has-a-non-helpful-error-message-for-parse-for-1-items

Remove wrongfully pushed debug code ;D

6af7ff2

Format

7ae56bb

baierd added 2 commits March 23, 2026 16:57

Remove assertion added with new smtlib state machine that prevents em…

4d71f4c

…pty strings to be parsed

Apply checkstyle suggestions to AbstractFormulaManager and switch if …

1dba0bb

…leading to exception to a checkArgument()

baierd requested review from daniel-raffler and kfriedberger March 24, 2026 16:22

daniel-raffler approved these changes Apr 2, 2026

View reviewed changes

Conversation

baierd commented Mar 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

daniel-raffler left a comment • edited by kfriedberger Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ThomasHaas commented Mar 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

daniel-raffler commented Mar 21, 2026

Uh oh!

ThomasHaas commented Mar 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kfriedberger left a comment

Choose a reason for hiding this comment

Uh oh!

kfriedberger Mar 21, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

baierd commented Mar 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

baierd commented Mar 21, 2026

Uh oh!

ThomasHaas commented Mar 21, 2026

Uh oh!

daniel-raffler commented Mar 21, 2026

Uh oh!

baierd commented Mar 21, 2026

Uh oh!

baierd commented Mar 23, 2026

Uh oh!

baierd commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

daniel-raffler left a comment

Choose a reason for hiding this comment

Uh oh!

daniel-raffler Apr 2, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

4 participants

baierd commented Mar 20, 2026 •

edited

Loading

daniel-raffler left a comment •

edited by kfriedberger

Loading

ThomasHaas commented Mar 21, 2026 •

edited

Loading

ThomasHaas commented Mar 21, 2026 •

edited

Loading

baierd commented Mar 21, 2026 •

edited

Loading

baierd commented Mar 23, 2026 •

edited

Loading