'tahoe run' should not fail silently #355
Labels
No Label
0.2.0
0.3.0
0.4.0
0.5.0
0.5.1
0.6.0
0.6.1
0.7.0
0.8.0
0.9.0
1.0.0
1.1.0
1.10.0
1.10.1
1.10.2
1.10a2
1.11.0
1.12.0
1.12.1
1.13.0
1.14.0
1.15.0
1.15.1
1.2.0
1.3.0
1.4.1
1.5.0
1.6.0
1.6.1
1.7.0
1.7.1
1.7β
1.8.0
1.8.1
1.8.2
1.8.3
1.8β
1.9.0
1.9.0-s3branch
1.9.0a1
1.9.0a2
1.9.0b1
1.9.1
1.9.2
1.9.2a1
LeastAuthority.com automation
blocker
cannot reproduce
cloud-branch
code
code-dirnodes
code-encoding
code-frontend
code-frontend-cli
code-frontend-ftp-sftp
code-frontend-magic-folder
code-frontend-web
code-mutable
code-network
code-nodeadmin
code-peerselection
code-storage
contrib
critical
defect
dev-infrastructure
documentation
duplicate
enhancement
fixed
invalid
major
minor
n/a
normal
operational
packaging
somebody else's problem
supercritical
task
trivial
unknown
was already fixed
website
wontfix
worksforme
No Milestone
No Assignees
4 Participants
Notifications
Due Date
No due date set.
Reference: tahoe-lafs/trac-2024-07-25#355
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
I use supervisord, so I'd like to run tahoe in the foreground.
'tahoe start BASEDIR' is really just a wrapper around 'cd BASEDIR; twistd -y *.tac', so you could have supervisord run 'twistd --nodaemon -y foo.tac', and that would leave it in the foreground.
We could also add a --nodaemon option to 'tahoe start', although in that case I'd be tempted to call it 'tahoe run' instead.
I see 'tahoe run' is already there, and I'm sorry I missed that before.
But, I have some usability complaints about 'tahoe run':
Simply running 'tahoe run' silently exits with status 1. Nothing is written to console nor ~/.tahoe/logs/twistd.log. Note that I have only a partial ~/.tahoe, since my real instance is somewhere else.
I deliberately tried running in the correct root while my daemonized tahoe was already going, to see what would happen. The result is that "Couldn't listen on any:34279: (98, 'Address already in use')" goes to INSTANCE/logs/tahoesvc.log and then the reactor keeps running. I wish it would write that kind of startup error to the console and then exit 1.
I guess you missed the existence of
tahoe run
because it isn't mentioned in docs/running.html. Perhaps we should change that document to describetahoe run
instead oftahoe start
? (That document is intended for the beginner, who might prefer the simplicity oftahoe run
.)Something else that might help is if the output of
tahoe --help
separated node control (start
,stop
,restart
,run
,create-client
,create-introducer
) from filesystem usage (put
,get
,rm
, etc.)Drew: as someone with "fresh eyes", what do you think about these two proposed changes?
Heh, it probably says something about the documentation when the author of a command forgets about its existence :). Oops.
The ~/.tahoe/ directory is meant to be useable as either a complete Tahoe node (which is why 'tahoe create-client' and 'tahoe start' default to using it) or as a pointer to the URL at which your real node lives (i.e. the 'tahoe ls' command looks in ~/.tahoe/node.url for a starting point).
I need to think more about the startup problems that you hit. Maybe we need a 'chdir' into the target directory in src/allmydata/scripts/startstop_node.py:run().
Separating the help into sections would be fine, but that's minor IMHO. It might also help to make start and run look more like peers:
In my case, I don't think I'll put a pointer in ~/.tahoe/node.url since I expect to be running at least two nodes all the time: the test grid and a production friendnet one. I'd rather have tahoe commands fail loudly when I fail to specify an instance directory (as opposed to sometimes accidentally operating on the wrong instance).
Anyway, the silent errors and hangs are much more important to me.
Documentation improvements split into #856.
run daemon in foregroundto 'tahoe run' should not fail silentlyReplying to warner:
The [current code does that]source:src/allmydata/scripts/startstop_node.py@4641#L212. I don't think that would have addressed the second point in comment:65421, though, or similar errors that occur on startup.
It looks like this is part of the problem making #1360 fail silently (well, at least the "silent" part). Errors during startup are written to logs/tahoesvc.log, which is not where we usually expect them to go (normally I look in logs/twistd.log).
I think we could get rid of "tahoe run" and leave "tahoe start --nodaemon", or change the way "tahoe run" does logging so that any problems during startup get dumped to stderr (in addition to the logfile).
I'm still +1 in favor of eliminating
tahoe run
in favor oftahoe start --nodaemon
."tahoe start" and the other daemonization commands are now deprecated (https://tahoe-lafs.org/trac/tahoe-lafs/ticket/3273) and "tahoe run" will be the preferred and only way to run a tahoe node after that deprecation period ends.
"tahoe run" exits currently with code 1 when there is no runnable node in ~/.tahoe.
When "tahoe run" is used twice against the same node directory, the second attempt exits with code 1 and an error like "Another twistd server is running, PID ...".