Discussion: Should multiple related announcement types be combined in a single batch schema? #298

wesbiggs · 2025-03-06T17:21:40Z

To improve and simplify the ability for clients to construct threads from batches, we could make a single Parquet schema that can include all the content-related announcement types (think of each Parquet row as being a union of the fields in Broadcast, Reply, Reaction, Update, etc.) — many of these fields are overlapping anyway). Rather than posting separate batches for each of those types we get more efficient use of blockspace. Clients need to change the way they process the batch but IMO this will make thread construction and maintenance more efficient than less.

Pros:

For batch creators, fewer batches (and therefore on-chain transactions) represent
For batch consumers, fewer batches to consume

Cons:

Somewhat more complex schema parsing
Arguably less expressive schemas (as many columns would be declared optional
Small increase to bytes per row

Open questions:

Which Announcement Types would logically be grouped by this approach?
Dealing with legacy data?

wesbiggs · 2025-03-06T19:02:11Z

Notes from Community Call 2025-03-06:

Consider how Bloom filters would apply; size vs. false positives
Original intent was that not all applications would be interested in all announcement types, but that applied more to the differences between content announcements, graph announcements, etc., where several of these types have moved to become DSNP User Data instead.
Not an all-or-nothing; question is which types logically should be grouped together?
Polymorphic content increases computational complexity (but Parquet has some facilities for automatically handling polymorphism)
Benefits to batch producer decrease at high volumes of activity

enddynayn · 2025-03-06T19:22:38Z

My understanding is that grouping batches together will optimize blockspace usage. Is the current separation significantly impacting block space utilization?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Discussion: Should multiple related announcement types be combined in a single batch schema? #298

Discussion: Should multiple related announcement types be combined in a single batch schema? #298

wesbiggs commented Mar 6, 2025

wesbiggs commented Mar 6, 2025

Uh oh!

enddynayn commented Mar 6, 2025 •

edited

Loading

Uh oh!

Discussion: Should multiple related announcement types be combined in a single batch schema? #298

Discussion: Should multiple related announcement types be combined in a single batch schema? #298

Comments

wesbiggs commented Mar 6, 2025

wesbiggs commented Mar 6, 2025

Uh oh!

enddynayn commented Mar 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

enddynayn commented Mar 6, 2025 •

edited

Loading