anonymous friendly tahoe create-node/create-client #2490
Labels
No Label
0.2.0
0.3.0
0.4.0
0.5.0
0.5.1
0.6.0
0.6.1
0.7.0
0.8.0
0.9.0
1.0.0
1.1.0
1.10.0
1.10.1
1.10.2
1.10a2
1.11.0
1.12.0
1.12.1
1.13.0
1.14.0
1.15.0
1.15.1
1.2.0
1.3.0
1.4.1
1.5.0
1.6.0
1.6.1
1.7.0
1.7.1
1.7β
1.8.0
1.8.1
1.8.2
1.8.3
1.8β
1.9.0
1.9.0-s3branch
1.9.0a1
1.9.0a2
1.9.0b1
1.9.1
1.9.2
1.9.2a1
LeastAuthority.com automation
blocker
cannot reproduce
cloud-branch
code
code-dirnodes
code-encoding
code-frontend
code-frontend-cli
code-frontend-ftp-sftp
code-frontend-magic-folder
code-frontend-web
code-mutable
code-network
code-nodeadmin
code-peerselection
code-storage
contrib
critical
defect
dev-infrastructure
documentation
duplicate
enhancement
fixed
invalid
major
minor
n/a
normal
operational
packaging
somebody else's problem
supercritical
task
trivial
unknown
was already fixed
website
wontfix
worksforme
No Milestone
No Assignees
4 Participants
Notifications
Due Date
No due date set.
Reference: tahoe-lafs/trac-2024-07-25#2490
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
subset of tasks from https://tahoe-lafs.org/trac/tahoe-lafs/ticket/517#comment:29
rough sketch of create-node cli here:
https://github.com/david415/tahoe-lafs/tree/2490.anonymous-create-node-client.0
--listen
and--location
docs should be at the top of the list if they are required (since the list has no defined ordering at present). They should probably be above theOptions
section to emphasize this, although that would likely require more backend changes to the config flag handling code.I agree with daira's comments on the commit - the distinction between the two arguments is unclear. Examples for each would help, as well as something to clarify the difference between them, or connecting them to the corresponding
tahoe.cfg
variables. Again, maybe a section above "Options" would help. (This is stepping into man-page territory though.) Alternatively, maybe--location
could be renamed--advertize
(or--broadcast
for something with consistent spelling) to better convey its meaning.warner's email mentioned additional arguments like
--tor-only
and--listen-on-tor
. As I say in /tahoe-lafs/trac-2024-07-25/issues/5579#comment:33, I don't think these scale. Additionally, they duplicate information from--listen
:--tor-only
is equivalent to--listen=tor:foo
.--listen-on-tor
is equivalent to--listen=tcp:0,tor:foo
(or--listen=tcp:0 --listen=tor:foo
, it's unclear which will be used).If
--listen
is to be required and will be the authoritative source of what-to-listen-on information, then additional arguments for configuring the type of setup are IMHO unnecessary.For arguments specific to configuring Tor/I2P/etc, I think
--tor-*
,--i2p-*
etc. make the most sense. For clients, they can easily be collected and passed straight to the plugins; for servers, the relevant server code can just use them directly. I can't think of any arguments that a user might want to pass to both Tor and I2P.Note: this ticket (#2490) is about --listen=tor/i2p and automatic server setup. #2773 is just about --port/--location/--hostname, the non-automatic setup (well, it's automatic for plain TCP listeners, if you use --hostname=, since it'll allocate a TCP port for you. But it won't do any automatic tor/i2p stuff).
#2773 may include --listen=tcp and/or --listen=none, but should not block on the async-ification of node creation, or on anything involving onion server creation. All those Tor/I2P pieces belong here in #2490.
#2773 will be in 1.12 . This ticket doesn't need to be, and I've currently scheduled it for 1.13, but if it gets done early, we'll change the target milestone.
pulling this forward into 1.12, it's looking likely
From today's devchat:
CLI arguments:
tahoe create-node --listen=tor
will try to connect to the default control port (9051, 9151, orunix:/var/run/tor/control
)tahoe create-node --listen=tor --tor-control-port=ENDPOINT
tahoe create-node --listen=tor --launch-tor
(or maybe--tor-launch
to make all tor-related arguments start with the same prefix?)tor control.port=
or (tor launch=
andtor tor.executable=
)Then create-node will:
NODEDIR/private/tor-onion-private-key
(TODO: is there a special file suffix we should use?)tub.location = tor:XYZ.onion:EXTPORT
tub.port = disabled
tor control.port=
orlaunch=
/tor.executable=
tor onion.privatekeyfile = NODEDIR/private/tor-onion-private-key
tor onion.external-port = EXTPORT
onion.*
keys must be present, or neither, but not one without the otherAnd at runtime (
tahoe start
/Client.*init*
), if theonion.*
keys are present, we do the following:control.port
/launch
/tor.executable
explicitly point to a Tor, and forbid the use of default search-for-tor behaviorcontrol.port
/etc keys to tell txtorcon how we want to access Torep = txtorcon.TorHiddenServiceEndpoint(key-material)
tub.listenOn(ep)
This means we're adding an additional listener (using whatever localhost or unix-domain socket that txtorcon picked for its endpoint), on top of whatever
tcp.port=
specified (which is usually none, leaving us with just the one onion listener). It also means we don't need to commit to the local socket at create-node time, nor mention it in the config file.We record EXTPORT separately in the config file do avoid needing to parse
tub.location
for it at startup time, which might be confused by manually-added hints.We have
tahoe.cfg
record the private key as a filename, rather than putting the (moderately-sized RSA) key bytes inline inside the config file. We're trying to stick to a policy of not adding new secrets totahoe.cfg
.We forbid search-for-Tor in the onion case because we're going to be revealing the onion private key to whichever Tor we use, and I don't want to enable a different user (on the same box) from being able to camp out on 9150 and steal these keys. For the use-Tor-for-outbound case, this isn't as big a deal, because 1: Foolscap provides the crypto and authentication, and 2: such an attacker already knows your IP address. An attacker who camps out on 9150 early enough to catch your
tahoe create-node
interaction will be the one to provide you with the onion key in the first place. So I think the only threat is when you spoke with a real Tor at create-node time, but are talking to a spoofed Tor at start-node time, which is why I'm thinking that it's sufficient to just record how we contacted Tor the first time and do the same thing the second time.TODO: currently tahoe.cfg must either have both
tub.location
andtub.port
, or neither, so thetub.port = disabled
above won't work. We need to decide on a fix for that.tahoe.cfg
key names should probably be:tor onion.external_port =
tor onion.private_key_file =
(since almost everything else is using underscores, not hyphens)
I think we'll need a Foolscap tor-connection-handler that takes a "call me to get a Tor control protocol object" function. Then tahoe-side code will be responsible for creating one (based upon it's config: either
control.port=
to use an existing one, orlaunch=true
to spin up a new one). The function will remember the protocol object it created, so it will use the same one for both local onion-service listening and outbound connection-handling.I've opened foolscap#270 for this.
I've written a TorProvider class, here:
https://github.com/david415/tahoe-lafs/tree/2490.tor_provider.0
However this is a work in progress because I've yet to actually put the TorProvider class to use... that is for node listen and client connect code paths.
I'm working on a branch which changes dawuud's TorProvider to provide three methods:
allocate_onion()
(run at create-node time, launches/connects-to Tor, creates onion service, listens long enough to get .onion address)get_tub_handler()
(run at start-node time): immediately returns a Tub handler for the configured tor-launch/connect methodget_tub_listener()
(run at start-node time): immediately returns an Endpoint which the Tub should listen onBut now I'm wondering if we should commit to the local port at create-node time, and then write the matching listener endpoint spec into
tub.port=
. That would:tub.port
andtub.location
must either both be provided, or both disabled" requirementtub.port
It would also require that if we're doing an e.g. unix-domain listener, it gets put into tub.port as a string. It prevents the use of listeners that can't be represented as normal strings.
So I think we'd either need to pick a TCP/unix-domain listening port at
allocate_onion()
time, or use some fancyonion:
server endpoint string with enough arguments to rebuild it at start-node time (hopefully putting the private key in a file, rather than in the endpoint string).meejah: is txtorcon equipped to use pre-allocated keys/ports/onion-names? The
TCPHiddenServiceEndpoint
I'm looking at seems keen on generating all those values at construction time.New idea. There are two objects involved:
The first object is the "creator". It's actually just a function that returns a Deferred. It's invoked with the
twisted.python.usage.Usage
"options" object (which records the--listen=tor
,--tor-launch
, and other CLI argument flags). Its job is:tub.port
tub.location
from the .onion hostname and the external portBASEDIR/private/tor-onion.privkey
tub.port
,tub.location
, and a dictionary of key/values for thetor
section of tahoe.cfgtahoe create-node
process will terminate a moment later, but it might make a difference for tests, and we need to make sure the pre-existing tor won't get confused if we temporarily start the onion during create-node and then start it a second time at runtime)The second object is the "provider". It's created with a synchronous function that's called with the
tor
portion of tahoe.cfg (not sure of the best way to manage this, maybe just give it a reference toNode.get_config
, or maybe the whole Node). Its job:Node.init_connections()
, and subsumesNode._make_tor_handler
. It needs to parse the tahoe.cfg data and decide if we're launching or control-port-ing or socks-port-ing. For control.port and socks.port, it can return a foolscap-provided handler. For "launch", it needs to prepare to launch Tor (but not actually launch it yet, since we're still in synchronous startup land). It will use the newfoolscap.connections.tor.control_endpoint_maker()
to build a handler that will launch tor on demand, and it will share that launched Tor with the onion listener.MultiService
, and whenstartService
happens, if tahoe.cfg says to start an onion listener, it needs to launch Tor (if not already launched by the connection handler, probably because theIntroducerClient
FURL was processed), wait for it to start, establish the control protocol, then use the control protocol to start the onion service that was allocated by the creator.tub.port
andtub.location
are not controlled by the provider. These values are written intotahoe.cfg
like the user-provided--port=
and--location=
values. That means we're allocating the local port at create-node time, and making Tor re-use that same local port at runtime.The last part (starting the onion service) might require something unusual from txtorcon.
All the information about the onion service to be started will be recorded in tahoe.cfg. We'll document the keys it uses, but mark them as normally generated by create-node, and thus not really intended for user modification.
I'm sketching this out in my github
2490-tor
branch, in case anyone wants to take a look.I'm making good progress on this in my
2490-tor
branch: I think all the code is in place. I'm writing unit tests now, and will try to do real (manual) tests today.warner: regarding handling pre-allocated keys, in 0.15.1 txtorcon only really handles "
HiddenServiceDir
"-based ones. Then, you passhidden_service_dir=
to any of.system_tor
,.global_tor
or.private_tor
class-methods.In the 1.0 branch, ephemeral services are supported, and
TCPHiddenServiceEndpoint
learns aboutephemeral=
andprivate_key=
kwargs. See http://txtorcon.readthedocs.io/en/release-1.x/txtorcon-endpoints.html#txtorcon.TCPHiddenServiceEndpointYou can also pass
privateKey=
in the endpoint strings.There may still be changes to the details/terminology (and e.g. some things are called
*Onion*
because tor is trying to encourage use of "onion services" vs. "hidden services").Okay, I made a PR to your branch fixing a few things, and have tested against a small grid with 2 tor-only servers (one launched, one on system tor) and everything appears to work. I haven't checked for "outside Tor" traffic leaks.
You'll need "master" branch from txtorcon for now, to get the Unix-socket
ControlPort
stuff to work properly.In 5a195e2/trunk: