initial integration of jq. #4980

DavidKorczynski · 2021-01-15T17:22:15Z

Initial integration of jq. jq is a lightweight and flexible command-line JSON processor. It's a widely used command-line for handling json processing, and has more than 25K stars on Github. In essence am not sure which corporations use it as such (but I would assume many), but I use it frequently and would place it in some form of position similar to Binutils from a user perspective. It is, in essence, the sed command in the JSON world.

@jonathanmetzman @oliverchang this one is ready for review!

Signed-off-by: David Korczynski <[email protected]>

projects/jq/Dockerfile

nicowilliams · 2023-07-10T19:02:44Z

Do you need to specify that the inputs are JSON? Do you need to specify an initial test corpus?

Signed-off-by: David Korczynski <[email protected]>

nicowilliams · 2023-07-10T19:52:40Z

How does this know what sorts of inputs to start with?

nicowilliams · 2023-07-10T19:52:53Z

Also, I sent you gmail addresses as well.

Signed-off-by: David Korczynski <[email protected]>

DavidKorczynski · 2023-07-10T20:13:34Z

Do you need to specify that the inputs are JSON? Do you need to specify an initial test corpus?

We don't need it for now. I would prefer to leave it as is and let the fuzzer engine explore the codebase.

How does this know what sorts of inputs to start with?

In short: It starts with basically a NULL byte. The fuzzing engine then relies on mutating and adding arbitrary bytes to the seed, while consecutively running the seed against the target and measuring the code coverage using relevant instrumentation. If a given seed is determined to trigger new code coverage the seed is saved in a seed database. Thus, the seed database will contain a set of inputs that each trigger unique code paths. In this sense, it's a genetic mutational algorithm.

After running the fuzzer for 180 seconds, this is the code coverage achieved:

OSS-Fuzz will run it for a lot longer than 180 seconds, and will make code coverage reports available as well. As such, we can assess after a few days what's the missing code to be analysed and adjust accordingly using techniques e.g. (1) add a corpus; (2) add a dictionary; (3) add a new fuzzer; (4) modify the existing fuzzer.

Signed-off-by: David Korczynski <[email protected]>

nicowilliams · 2023-07-10T20:21:16Z

Do you need to specify that the inputs are JSON? Do you need to specify an initial test corpus?

We don't need it for now. I would prefer to leave it as is and let the fuzzer engine explore the codebase.

+1

How does this know what sorts of inputs to start with?

In short: It starts with basically a NULL byte. [...]

I imagine that starting with some test corpus might help find certain bugs faster than starting with one byte and going from there, but starting from one byte (or even no bytes) makes a lot of sense if you have lots of cycles to spare.

After running the fuzzer for 180 seconds, this is the code coverage achieved: [...]

That's quite good!

OSS-Fuzz will run it for a lot longer than 180 seconds, and will make code coverage reports available as well. As such, we can assess after a few days what's the missing code to be analysed and adjust accordingly using techniques e.g. (1) add a corpus; (2) add a dictionary; (3) add a new fuzzer; (4) modify the existing fuzzer.

Great! Thanks for the info!

I'll see about making more fuzzer interface functions available for fuzzing the language too, not just the JSON parser, as well as for fuzzing the streaming JSON parser. Should these be differently named source files?

nicowilliams · 2023-07-10T20:59:47Z

@DavidKorczynski how would one fuzz things that need authentication? I'd like to write fuzzer functions for Heimdal, but much of that codebase deals in cryptographic network protocols (mainly Kerberos, but also PKI). One idea I have is that the fuzzer interface can just create credentials as needed and create an envelope with credentials around the payload provided by the fuzzer, but this will reduce coverage. Anyways, there must be examples of codebases like that that are in OSS-Fuzz.

nicowilliams · 2023-07-10T21:20:36Z

Also, is there a link for a dashboard to check the fuzzer's progress?

DavidKorczynski · 2023-07-10T21:23:41Z

I'll see about making more fuzzer interface functions available for fuzzing the language too, not just the JSON parser, as well as for fuzzing the streaming JSON parser. Should these be differently named source files?

That would be great! Feel free to take over and adjust things however you like. I'm also happy to continue contributing fuzzers upstream.

@DavidKorczynski how would one fuzz things that need authentication? I'd like to write fuzzer functions for Heimdal, but much of that codebase deals in cryptographic network protocols (mainly Kerberos, but also PKI). One idea I have is that the fuzzer interface can just create credentials as needed and create an envelope with credentials around the payload provided by the fuzzer, but this will reduce coverage. Anyways, there must be examples of codebases like that that are in OSS-Fuzz.

Am giving Heimdal a look now and will get back on this.

Also, is there a link for a dashboard to check the fuzzer's progress?

Yes, once this PR is merged you should be able to track things on https://oss-fuzz.com as well as introspector.oss-fuzz.com See https://introspector.oss-fuzz.com/project-profile?project=liblouis for an example of how progress can be tracked, as well as further links to code coverage reports and Fuzz Introspector reports.

nicowilliams · 2023-07-10T21:31:16Z

Am giving Heimdal a look now and will get back on this.

You could start with one of the Heimdal ASN.1 compiler's READMEs where I document how I've fuzzed it with AFL.

ASN.1 of course doesn't have the credentials problem I mentioned above -- it's as easy to fuzz as jq's JSON parser.

nicowilliams · 2023-07-10T21:33:58Z

There's also things like ~~MIT Kerberos (ping @greghudson)~~ (EDIT: MIT Kerberos is already enrolled). All sorts of JWT, OAuth, and other things out there. I think in general the best thing to do may be to factor out all the crypto and fuzz just payloads and also envelopes where the fuzz interface will ignore credentials.

nicowilliams · 2023-07-10T23:06:59Z

Looking at https://github.com/google/oss-fuzz/tree/master/projects/krb5 it looks like doing fuzz testing for cryptographic and/or stateful protocols is just hard.

jonathanmetzman

lgtm

DavidKorczynski mentioned this pull request Jan 26, 2021

Added a first fuzzer for integration with OSS-Fuzz. jqlang/jq#2255

Merged

initial integration of jq.

3bee5ea

Signed-off-by: David Korczynski <[email protected]>

DavidKorczynski force-pushed the jq branch from 51d451a to 3bee5ea Compare July 10, 2023 18:14

nicowilliams reviewed Jul 10, 2023

View reviewed changes

projects/jq/Dockerfile Outdated Show resolved Hide resolved

DavidKorczynski added 3 commits July 10, 2023 12:43

use latest repo

e9a1840

Signed-off-by: David Korczynski <[email protected]>

use upstream fuzzer

c4773ef

Signed-off-by: David Korczynski <[email protected]>

Set nico to primary contact

f1fb60e

Signed-off-by: David Korczynski <[email protected]>

DavidKorczynski added 2 commits July 10, 2023 13:07

adjust to latest upstream

238ff49

Signed-off-by: David Korczynski <[email protected]>

set up maintainer emails

24909d9

Signed-off-by: David Korczynski <[email protected]>

set proper main website

9e7a7b2

Signed-off-by: David Korczynski <[email protected]>

DavidKorczynski marked this pull request as ready for review July 10, 2023 20:16

DavidKorczynski mentioned this pull request Jul 11, 2023

heimdal: initial integration #10678

Closed

jonathanmetzman approved these changes Jul 11, 2023

View reviewed changes

Merge branch 'master' into jq

18f97db

jonathanmetzman enabled auto-merge (squash) July 11, 2023 03:33

jonathanmetzman disabled auto-merge July 11, 2023 03:33

jonathanmetzman enabled auto-merge (squash) July 11, 2023 03:33

jonathanmetzman merged commit 330cc0c into google:master Jul 11, 2023
15 checks passed

itchyny mentioned this pull request Jul 31, 2023

Make and submit a Docker file to OSSFuzz jqlang/jq#2687

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

initial integration of jq. #4980

initial integration of jq. #4980

DavidKorczynski commented Jan 15, 2021 •

edited

Loading

nicowilliams commented Jul 10, 2023

nicowilliams commented Jul 10, 2023

nicowilliams commented Jul 10, 2023

DavidKorczynski commented Jul 10, 2023

nicowilliams commented Jul 10, 2023 •

edited

Loading

nicowilliams commented Jul 10, 2023

nicowilliams commented Jul 10, 2023

DavidKorczynski commented Jul 10, 2023

nicowilliams commented Jul 10, 2023

nicowilliams commented Jul 10, 2023 •

edited

Loading

nicowilliams commented Jul 10, 2023

jonathanmetzman left a comment

initial integration of jq. #4980

initial integration of jq. #4980

Conversation

DavidKorczynski commented Jan 15, 2021 • edited Loading

nicowilliams commented Jul 10, 2023

nicowilliams commented Jul 10, 2023

nicowilliams commented Jul 10, 2023

DavidKorczynski commented Jul 10, 2023

nicowilliams commented Jul 10, 2023 • edited Loading

nicowilliams commented Jul 10, 2023

nicowilliams commented Jul 10, 2023

DavidKorczynski commented Jul 10, 2023

nicowilliams commented Jul 10, 2023

nicowilliams commented Jul 10, 2023 • edited Loading

nicowilliams commented Jul 10, 2023

jonathanmetzman left a comment

Choose a reason for hiding this comment

DavidKorczynski commented Jan 15, 2021 •

edited

Loading

nicowilliams commented Jul 10, 2023 •

edited

Loading

nicowilliams commented Jul 10, 2023 •

edited

Loading