tahoe restart barfs on non-node directories #315
Labels
No Label
0.2.0
0.3.0
0.4.0
0.5.0
0.5.1
0.6.0
0.6.1
0.7.0
0.8.0
0.9.0
1.0.0
1.1.0
1.10.0
1.10.1
1.10.2
1.10a2
1.11.0
1.12.0
1.12.1
1.13.0
1.14.0
1.15.0
1.15.1
1.2.0
1.3.0
1.4.1
1.5.0
1.6.0
1.6.1
1.7.0
1.7.1
1.7β
1.8.0
1.8.1
1.8.2
1.8.3
1.8β
1.9.0
1.9.0-s3branch
1.9.0a1
1.9.0a2
1.9.0b1
1.9.1
1.9.2
1.9.2a1
LeastAuthority.com automation
blocker
cannot reproduce
cloud-branch
code
code-dirnodes
code-encoding
code-frontend
code-frontend-cli
code-frontend-ftp-sftp
code-frontend-magic-folder
code-frontend-web
code-mutable
code-network
code-nodeadmin
code-peerselection
code-storage
contrib
critical
defect
dev-infrastructure
documentation
duplicate
enhancement
fixed
invalid
major
minor
n/a
normal
operational
packaging
somebody else's problem
supercritical
task
trivial
unknown
was already fixed
website
wontfix
worksforme
No Milestone
No Assignees
3 Participants
Notifications
Due Date
No due date set.
Reference: tahoe-lafs/trac-2024-07-25#315
Loading…
Reference in New Issue
No description provided.
Delete Branch "%!s(<nil>)"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Allmydata.com has a convention of having all operational nodes in a directory called ~/prodnet. This makes restarting all nodes easy, doing something like "for i in *; do tahoe restart $i; done" (or even tahoe restart -m *).
Unfortunately, there are two things that belong in ~/prodnet that aren't actually nodes. (stats-gatherer and log-gatherer) These cause problems because while tahoe start and tahoe stop will ignore them, if tahoe restart encounters an error during the stop, it won't do the start.
tahoe restart should ignore "not a node directory" errors, at least until the stats and log gatherers can be wrapped up in something that can be controlled by tahoe start.
I don't understand the problem -- you mean
? This will go ahead and restart all the things in
prodnet
that are nodes, right? And the presence of non-nodes won't prevent the restart of the nodes?Also, as per #165 -- "restart should start if the node isn't running" -- you can add
-f
or--force
to causerestart
to start the node even if it can't stop the node first.Replying to zooko:
That will work, but
tahoe restart -m ~/prodnet/*
is tripped up by the presence of non-nodes. It will stop everything and start nothing if it finds a non-node.The real fix for this is to get the stats-gatherer and flogtool running in containers that can be controlled by tahoe restart. I thought a ticket existed on that, but I couldn't locate it so I created a new one, #330 , which I pointed at warner.
FYI, the new foolscap-0.2.5 (released last night) provides a 'flogtool create-gatherer' command, which creates a working directory with a .tac file, just like 'tahoe create-client'. This working directory can be launched with 'tahoe start' and shut down with 'tahoe stop'.
I plan to make the tahoe stats-gatherer behave the same way. I also intend to fix tahoe restart to tolerate shutdown failures.
Milestone 1.0.1 deleted
changeset:fdb0200ecad291e2 changes 'tahoe restart' to use --force by default (#165), which will be in 1.3.1 . The stats-gatherer was turned into a regular 'tahoe start'-able node in 1.3.0, and the foolscap log-gatherer was converted in foolscap-0.3.0 . So I think we're done with this ticket.