Sharding math
Abstract
In the tree of metafeeds, sharding is a technique of using intermediate nodes in the tree to group the leaf nodes into clusters. Peers who replicate a portion of the metafeed tree are thus not forced to know about all leaf feeds.
However, what is the optimal sharding ratio - one that minimizes unnecessary replicated messages and feeds?
Definitions
Let Feeds be the total number of content feeds, and Feeds′ be the number of content feeds which the current peer is going to replicate. Feeds′ ≤ Feeds.
Let Shards be the number of shard feeds, and Shards′ the number of shard feeds the current peer is going to replicate.
Replicating refers to the number of feeds (either content feeds or metafeeds) in the tree that are replicated.
Overhead is the total number of metafeed messages replicated. It includes messages from the root metafeed and from the shard metafeeds. Does not include messages from content feeds.
If we replicate a shard feed, we say that we become aware of a content feed via the message on the shard feed declaring the content feed.
Awareness is the total number of content feeds that we become aware of by replicating shard feeds.
Theorem 1
Replicating = Feeds′ + Shards′ + 1.
Proof: the root metafeed is always replicated, thus it counts as 1. By definition, the only other types of feeds are content feeds and shard feeds, of which the ones we are replicating are Feeds′ and Shards′, respectively. ∎
Theorem 2
Overhead = Shards + Awareness.
Proof: we always replicate the root metafeed, which means we fetch all of its messages. Since the root metafeed only announces the existence of shard feeds, the number of messages in the root metafeed is Shards.
The other type of metafeed that we replicate are shard feeds. By definition, Awareness is the number of content feeds under the shard feeds that we’re replicating. Since shard feeds only announce the existence of content feeds, the number of messages in a shard correspond to the number of content feeds under that shard.
Overhead are the total number of metafeed messages, thus metafeed messages from the root ( Shards ) plus metafeed messages from all shards ( Awareness ). ∎
Case studies
Average sharding
Feeds = 6 | Feeds′ = 3 | Shards = 3 | Shards′ = 2 |
---|
- Replicating = 6
- Awareness = 4
- Overhead = 7
Min sharding
Feeds = 6 | Feeds′ = 3 | Shards = 1 | Shards′ = 1 |
---|
- Awareness = 6
- Replicating = 3 + 1 + 1 = 5
- Overhead = 1 + 6 = 7
In the general case, min sharding has:
Formula | Conclusion |
---|---|
Replicating = Feeds′ + 2 | :slightly_smiling_face: |
Awareness = Feeds | :neutral_face: |
Overhead = Feeds + 1 | :grimacing: |
Min sharding minimizes Shards but maximizes Awareness which leaves us with an overall large Overhead.
Max sharding
Feeds = 6 | Feeds′ = 3 |
---|---|
Shards = Feeds = 6 | Shards′ = Feeds′ = 3 |
- Awareness = 3
- Replicating = 3 + 3 + 1 = 7
- Overhead = 6 + 3 = 9
In the general case, max sharding has:
Formula | Conclusion |
---|---|
Replicating = 2 × Feeds′ + 1 | :grimacing: |
Awareness = Feeds′ | :slightly_smiling_face: |
Overhead = Feeds + Feeds′ | :grimacing: |
Max sharding minimizes Awareness but maximizes Shards which leaves us with an overall large Overhead.
Scenario: Shards < Feeds′
Because Shards′ ≤ Shards, it follows that Shards′ ≤ Feeds′.
So
Replicating = Feeds′ + Shards′ + 1
becomes
Replicating ≤ 2 × Feeds′ + 1
which means that Replicating = O(Feeds′).
Further, because Shards < Feeds′
$$
\frac{Feeds'}{Shards} > 1
$$
This means an even distribution of feeds across shards is highly likely to cause Awareness = Feeds, which leads to a large Overhead.
Realistic case with 16 shards (4-bit)
Feeds = 128 | Feeds′ = 32 | Shards = 16 |
---|
It’s reasonable to assume that one user has a dozens of apps, and up to a hundred private groups. Let’s set Feeds at 128, as a convenient power of two. Let’s set Feeds′ at 32, which means a few apps and a few dozen groups.
Because we have 16 shards and 128 feeds, there are on average 8 feeds in each shard (assuming random shard allocation). $\alpha = \frac{Feeds}{Shards} = 8$
Because we have 32 feeds-to-replicate and 16 shards, there are on average 2 chosen feeds in each shard. $\alpha' = \frac{Feeds'}{Shards} = 2$. Sometimes there are 0 chosen feeds in a shard. So let’s assume Shards′ = 14.
This means:
Formula | Conclusion | ||
---|---|---|---|
Replicating= | Feeds′ + 14 + 1= | 47 | :slightly_smiling_face: |
Awareness= | Shards′ × α= | 112 | :neutral_face: |
Overhead= | Shards + 112= | 128 | :grimacing: |
Realistic case with 64 shards (6-bit)
Feeds = 128 | Feeds′ = 32 | Shards = 64 |
---|
α = 2
α′ = 0.5
Shards′ ≤ 0.5 × Shards = 32
This means:
Formula | Conclusion | ||
---|---|---|---|
Replicating≤ | 32 + 32 + 1= | 65 | :slightly_smiling_face: |
Awareness≤ | Shards′ × α= | 64 | :slightly_smiling_face: |
Overhead≤ | Shards + 64= | 128 | :grimacing: |
Realistic case with 256 shards (8-bit)
Feeds = 128 | Feeds′ = 32 | Shards = 256 |
---|
In this case there are more shards than content feeds, so this most likely means that max sharding occurs.
α = 1
α′ = 0.25
Shards′ = Feeds′ = 32
This means:
Formula | Conclusion | ||
---|---|---|---|
Replicating= | 32 + 32 + 1= | 65 | :slightly_smiling_face: |
Awareness= | Feeds′= | 32 | :slightly_smiling_face: |
Overhead= | Feeds + Feeds′= | 160 | :grimacing: |
Realistic Case: clumping
Feeds = 128 | Feeds′ = 32 | Shards = 16 |
---|
Assume Staltz has dozens of apps, and up to a hundred private groups. So lets set his Feeds at 128, as a convenient power of two.
Mix wants to replicate some subset of those (he only uses a couple of apps, and isn’t in the same groups as Staltz) - set Feeds′ at 32.
Let us assume that each Application/ group consists of 3 feeds, and it’ “clumps” these into the same shard.
So we have 32 / 3 ~= 11 clumps
Expected number of groups 11 clumps would randomly land on with 16 shards:
1
2
3
4
5 ✓
6 ✓✓✓✓✓✓✓
7 ✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓
8 ✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓
9 ✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓✓
10 ✓✓✓✓✓✓✓✓✓✓✓
11 ✓✓
12
13
14
15
16
(see code)
const shards = 16
const clumps = 11
const shardsUsedCount = {}
for (let i = 0; i < 1000; i++) {
const shardsPicked = new Set()
for (let c = 0; c < clumps; c++) {
const shardPick = Math.floor((Math.random() * shards)) + 1
.add(shardPick)
shardsPicked
}
if (!shardsUsedCount[shardsPicked.size]) shardsUsedCount[shardsPicked.size] = 0
.size]++
shardsUsedCount[shardsPicked
}
for (let s = 1; s <= shards; s++) {
const count = Math.round((shardsUsedCount[s] || 0) / 6)
console.log(
.toString().padStart(2),
snew Array(count).fill('✓').join('')
) }
So, about 8 of our shards need replicating (half in this case). Which means our Total Awareness is at about 50%, which seems great!
If each app is a clump of 2 feeds on average, we need to replicate 10/16 shards ~= 60%
- Shards′ = 8
- α = 8
- α′ = 2
- Replicating = 32 + 8 + 1 = 41
- Awareness = Shards′ × α = 8 × 8 = 64
- Overhead = Shards + 64 = 80
Conclusion
It’s important to minimize both Replicating and Overhead. But Overhead is directly proportional to Awareness, which means we must minimize Awareness. We know that Awareness is at its lowest when Shards is at its highest, but Overhead is also directly proportional to Shards, so we must minimize Shards too.
Maybe we should aim for Feeds′ < Shards < Feeds as a general rule?
Shards = 1 < Feeds′ < Feeds is min sharding where the overhead is O(M) which is pretty bad. The “realistic case Shards = 8” is not min sharding, but it is quite close to min sharding because Shards < Feeds′ < Feeds, and overhead is pretty bad.
On the other hand, Feeds′ < Shards = Feeds is max sharding, and the result is even worse, we end up with O(Feeds + Feeds′) overhead and O(2 × Feeds′) feeds to replicate.
Shards = Feeds′ < Feeds is somewhat an optimal situation (“realistic case S=64
”) but it quickly becomes Shards < Feeds′ when Feeds′ increases over time (e.g. joining new groups).
So we want Shards to be greater than Feeds′, but significantly smaller than Feeds. Thus Feeds′ < Shards < Feeds.
However, if we have clumping and noticing that Shards = Feeds′ is an optimal solution, we might have low overhead after all if Shards < Feeds′, as long as we avoid min sharding.