distributed authorization of access to nodes #295
Labels
No Label
0.2.0
0.3.0
0.4.0
0.5.0
0.5.1
0.6.0
0.6.1
0.7.0
0.8.0
0.9.0
1.0.0
1.1.0
1.10.0
1.10.1
1.10.2
1.10a2
1.11.0
1.12.0
1.12.1
1.13.0
1.14.0
1.15.0
1.15.1
1.2.0
1.3.0
1.4.1
1.5.0
1.6.0
1.6.1
1.7.0
1.7.1
1.7β
1.8.0
1.8.1
1.8.2
1.8.3
1.8β
1.9.0
1.9.0-s3branch
1.9.0a1
1.9.0a2
1.9.0b1
1.9.1
1.9.2
1.9.2a1
LeastAuthority.com automation
blocker
cannot reproduce
cloud-branch
code
code-dirnodes
code-encoding
code-frontend
code-frontend-cli
code-frontend-ftp-sftp
code-frontend-magic-folder
code-frontend-web
code-mutable
code-network
code-nodeadmin
code-peerselection
code-storage
contrib
critical
defect
dev-infrastructure
documentation
duplicate
enhancement
fixed
invalid
major
minor
n/a
normal
operational
packaging
somebody else's problem
supercritical
task
trivial
unknown
was already fixed
website
wontfix
worksforme
No Milestone
No Assignees
2 Participants
Notifications
Due Date
No due date set.
Reference: tahoe-lafs/trac-2024-07-25#295
Loading…
Reference in New Issue
No description provided.
Delete Branch "%!s(<nil>)"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Zooko and I just got of the phone, and we sketched out a proposal for a
distributed introduction scheme that can accomplish the following goals:
improves reliability.
we want for client nodes
clients of the commercial grid to be confident that their shares are
placed on reliable+available servers
that have been "blessed", to allow the commercial grid to enforce quotas
and reject data from non-customers
The idea is that the introduction mechanism is uncontrolled and gossip-based,
making it highly robust but unsuitable for enforcing access control. An
additional later of public-key signatures is used to filter down the
unmanaged server list into a set of "blessed" servers.
This builds upon the scheme described in #68. Each Introducer has a publish
and subscribe interface that deal in terms of "announcements". Each
announcement is a tuple of (FURL, Purpose, Blessing). The 'Purpose' might be
a string like "storage". The 'Blessing' is a public-key signature (by some
specific "blesser" private key) of the (FURL+purpose) pair, which might
include a timestamp, validity period, revocation info, etc.
Subscribers will typically be connected to several Introducers. They'll tell
each one that they are interested in hearing about all announcements for a
specific Purpose which are signed by a set of blesser public keys. As an
efficiency measure, Introducers will only forward announcements that match
those restrictions, but the subscriber is ultimately responsible for checking
the signatures themselves.
As described in #68, introducers subscribe to each other, Introducers are
announced to each other just like all other servers, and anybody can publish
announcements into the mesh. As a result, any one Introducer FURL is
sufficient to bootstrap a connection to many/all of them, and peer-to-peer
gossip will provide some sort of limited-flood broadcast of all server
announcements.
Clients will be configured with a set of "blesser pubkeys" for each purpose.
Clients will use all storage servers that arrive with correctly signed
announcements.
(eventually) To protect storage servers against unauthorized clients, we'll
change the meaning of the server FURL to point to a credential-accepting
interface. Clients will present credentials to the server to convince it to
provide a client-specific reference (the "personal storage server facet"),
and then use that facet to upload share data. These credentials will come in
the form of a signed message (TODO: signing what, exactly? a tubid? a
callback FURL? how should we identify the "sender" of a message? we must make
sure that the server can't steal the credentials and use them on other
storage servers) that blesses the client's public key as authorized to
consume storage space (and bound to an "account ID", for accounting
purposes). The server will be configured with a set of client blesser pubkeys
for this purpose.
The server will also be configured with the FURL of the server-blesser. All
storage servers for the commercial grid will be configured with the same
FURL. When the storage server boots, it submits its server FURL to the
blesser, and gets back the blessing (signed message). After that, it can
submit the pair to the introducers. The client can be configured with an
analogous client-blesser.
This means all servers will be configured with two FURLs: one that gives them
access to the introduction mesh, and a second that distinguishes them from
non-blessed storage servers. Clients get the introducer FURL and an account
FURL. All nodes get the same introducer data, but not the blessing FURLs.
There is probably a way to express this in strict ocap terms, with Sealers
and Unsealers, but right now we seem to have a better handle on the
public-key approach.
The immediate questions are:
Oh, and also I think we really need to nail down "grid ID" issues as we build this. Moving to gossip-based distributed introduction makes it all the more important to allow clean separation between one grid and a different one. Experience with the previous allmydata.com architecture emphasized this point: several problems were attributed to a node on the test network managing to announce itself to the production network.
I don't even know how to define a "grid ID".. a random number? With a centralized introducer, it's easy.. just use the FURL or tubid of that one introducer. With decentralized introduction, I can't think of anything better to use than a random (i.e. hopefully unique) string.
This would suggest that all introduction messages include the grid id in them, and the publish interface should completely ignore any messages that use a different grid id than their own. I think it's reasonable to declare that no node exist on multiple grids at once (or at least make this a special case, that must be enabled separately.. I can imagine wanting "bridges" between otherwise-distinct grids, for scaling purposes, especially if filenode/dirnode caps included a grid id, which could point you to a suitable bridge to retrieve the data).
Hm. I think maybe the public key of the storage-server-blesser is the grid id. If the problem was nodes on the test network managing to announce themselves to the production network, then a solution is to require a public key signature from the production network blesser.
hm. My first reaction is "why should there only be one storage-server-blesser?". I.e., there might be a friendnet-type scenario in which everybody is running their own storage-server-blesser, and they achieve uniformity by accepting all the blessers at once. OTOH, it might be easier to accomplish this case by having just one blesser key and share the privkey with everyone in the grid.
I see no reason why you wouldn't want to have more than one storage-server-blesser.
But still, wouldn't requiring a blessing from at least one of your storage-server-blessers be sufficient to solve this problem of errant nodes wandering into parties where they aren't welcome?
To make it easier to deploy this, we should update clients to ignore parts of announcements which they don't know how to deal with:
http://allmydata.org/trac/tahoe/browser/src/allmydata/introducer.py?rev=55dfb697a448dbc7#L291
could be changed to
In the future we might prefer to use named arguments instead of positional, for example the ann[6] element could be a dict and could be used to hold arguments by name for future evolutions of introduction.
Is this a duplicate of #466?
not exactly. Let's say that #295 is about a distributed introducer, while #466 is about signed/blessed extendable announcements. I'm changing the summary to match.
distributed introduction and public-key-based server blessingto distributed introduction: robust, gossip-basedChanging the name to reflect my understanding of #68 as being about distributed introducer and this ticket as being about distributed control of access to nodes.
distributed introduction: robust, gossip-basedto distributed control of access to nodesdistributed control of access to nodesto distributed authorization of access to nodes