Releases · streamingfast/substreams

11 Apr 10:30

sduchesneau

v1.5.3

efb1b3d

v1.5.3

Performance, memory leak and bug fixes

Server

fix memory leak on substreams execution (by bumping wazero dependency)
prevent substreams-tier1 stopping if blocktype auto-detection times out
allow specifying blocktype directly in Tier1 config to skip auto-detection
fix missing error handling when writing output data to files. This could result in tier1 request just "hanging" waiting for the file never produced by tier2.
fix handling of dstore error in tier1 'execout walker' causing stalling issues on S3 or on unexpected storage errors
increase number of retries on storage when writing states or execouts (5 -> 10)
prevent slow squashing when loading each segment from full KV store (can happen when a stage contains multiple stores)

Gui

prevent 'gui' command from crashing on 'incomplete' spkgs without moduledocs (when using --skip-package-validation)

Assets 7

04 Apr 18:16

sduchesneau

v1.5.2

5194e34

v1.5.2

Fix a context leak causing tier1 responses to slow down progressively

Assets 7

03 Apr 20:59

sduchesneau

v1.5.1

622c902

v1.5.1

Fix a panic on tier2 when not using any wasm extension.
Fix a thread leak on metering GRPC emitter
Rollback scheduler optimisation: different stages can run concurrently if they are schedulable. This will prevent taking much time to execute when restarting close to HEAD.
Add substreams_tier2_active_requests and substreams_tier2_request_counter prometheus metrics
Fix the tools tier2call method to make it work with the new 'generic' tier2 (added necessary flags)

Assets 7

02 Apr 17:36

sduchesneau

v1.5.0

80b7ec1

v1.5.0

Operators

A single substreams-tier2 instance can now serve requests for multiple chains or networks. All network-specific parameters are now passed from Tier1 to Tier2 in the internal ProcessRange request.

Important

Since the tier2 services will now get the network information from the tier1 request, you must make sure that the file paths and network addresses will be the same for both tiers.

Tip

The cached 'partial' files no longer contain the "trace ID" in their filename, preventing accumulation of "unsquashed" partial store files. The system will delete files under '{modulehash}/state' named in this format{blocknumber}-{blocknumber}.{hexadecimal}.partial.zst when it runs into them.

Assets 7

22 Mar 18:09

sduchesneau

v1.4.0

9bb7dea

v1.4.0

Client

Implement a use feature, enabling a module to use an existing module by overriding its inputs or initial block. (Inputs should have the same output type than override module's inputs).
Check a usage of this new feature on the substreams-db-graph-converter repository.
Fix panic when using '--header (-H)' flag on gui command
When packing substreams, pick up docs from the README.md or README in the same directory as the manifest, when top-level package.doc is empty
Added "Total read bytes" summary at the end of 'substreams run' command

Server performance in "production-mode"

Some redundant reprocessing has been removed, along with a better usage of caches to reduce reading the blocks multiple times when it can be avoided. Concurrent requests may benefit the other's work to a certain extent (up to 75%!)(MISSING)

All module outputs are now cached. (previously, only the last module was cached, along with the "store snapshots", to allow parallel processing). (this will increase disk usage, there is no automatic removal of old module caches)
Tier2 will now read back mapper outputs (if they exist) to prevent running them again. Additionally, it will not read back the full blocks if its inputs can be satisfied from existing cached mapper outputs.
Tier2 will skip processing completely if it's processing the last stage and the output_module is a mapper that has already been processed (ex: when multiple requests are indexing the same data at the same time)
Tier2 will skip processing completely if it's processing a stage that is not the last, but all the stores and outputs have been processed and cached.
The "partial" store outputs no longer contain the trace ID in the filename, allowing them to be reused. If many requests point to the same modules being squashed, the squasher will detect if another Tier1 has squashed its file and reload the store from the produced full KV.
Scheduler modification: a stage now waits for the previous stage to have completed the same segment before running, to take advantage of the cached intermediate layers.
Improved file listing performance for Google Storage backends by 25%!

Operator concerns

Tier2 service now supports a maximum concurrent requests limit. Default set to 0 (unlimited).
Readiness metric for Substreams tier1 app is now named substreams_tier1 (was mistakenly called firehose before).
Added back deadiness metric for Substreams tiere app (named substreams_tier2).
Added metric substreams_tier1_active_worker_requests which gives the number of active Substreams worker requests a tier1 app is currently doing against tier2 nodes.
Added metric substreams_tier1_worker_request_counter which gives the total Substreams worker requests a tier1 app made against tier2 nodes.

Assets 7

07 Mar 19:14

maoueh

v1.3.7

e57757d

v1.3.7

Fixed substreams init generated The Graph GraphQL regarding wrong Bool types.
The substreams init command can now be used on Arbitrum Mainnet network.

Assets 7

01 Mar 16:03

sduchesneau

v1.3.6

47ddf07

v1.3.6

This release brings important server-side improvements regarding performance, especially while processing over historical blocks in production-mode.

Backend (through firehose-core)

Performance: prevent reprocessing jobs when there is only a mapper in production mode and everything is already cached
Performance: prevent "UpdateStats" from running too often and stalling other operations when running with a high parallel jobs count
Performance: fixed bug in scheduler ramp-up function sometimes waiting before raising the number of workers
Added support for authentication using api keys. The env variable can be specified with --substreams-api-key-envvar and defaults to SUBSTREAMS_API_KEY.
Added the output module's hash to the "incoming request" log
Added trace_id in grpc authentication calls
Bumped connect-go library to new "connectrpc.com/connect" location

Assets 7

07 Feb 15:46

sduchesneau

v1.3.5

3b0afbe

v1.3.5

Code generation

Added substreams init support for creating a substreams with data from fully-decoded Calls instead of only extracting events.

Assets 7

01 Feb 19:18

sduchesneau

v1.3.4

a3c6d69

v1.3.4

Code generation

Added substreams init support for creating a substreams with the "Dynamic DataSources" pattern (ex: a Factory contract creating pool contracts through the PoolCreated event)
Changed substreams init to always add prefixes the tables and entities with the project name
Fixed substreams init support for unnamed params and topics on log events

Assets 7

30 Jan 22:03

maoueh

v1.3.3

5c9067d

v1.3.3

Fixed substreams init generated code when dealing with Ethereum ABI events containing array types.

[!NOTE]
For now, the generated code only works with Postgres, an upcoming revision is going to lift that constraint.

Assets 7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Server

Gui

Operators

Client

Server performance in "production-mode"

Operator concerns

Backend (through firehose-core)

Code generation

Code generation

Releases: streamingfast/substreams

v1.5.3

Server

Gui

v1.5.2

v1.5.1

v1.5.0

Operators

v1.4.0

Client

Server performance in "production-mode"

Operator concerns

v1.3.7

v1.3.6

Backend (through firehose-core)

v1.3.5

Code generation

v1.3.4

Code generation

v1.3.3