SFTP: handle download failures correctly; remove use of IFinishableConsumer #1525
Labels
No Label
0.2.0
0.3.0
0.4.0
0.5.0
0.5.1
0.6.0
0.6.1
0.7.0
0.8.0
0.9.0
1.0.0
1.1.0
1.10.0
1.10.1
1.10.2
1.10a2
1.11.0
1.12.0
1.12.1
1.13.0
1.14.0
1.15.0
1.15.1
1.2.0
1.3.0
1.4.1
1.5.0
1.6.0
1.6.1
1.7.0
1.7.1
1.7β
1.8.0
1.8.1
1.8.2
1.8.3
1.8β
1.9.0
1.9.0-s3branch
1.9.0a1
1.9.0a2
1.9.0b1
1.9.1
1.9.2
1.9.2a1
LeastAuthority.com automation
blocker
cannot reproduce
cloud-branch
code
code-dirnodes
code-encoding
code-frontend
code-frontend-cli
code-frontend-ftp-sftp
code-frontend-magic-folder
code-frontend-web
code-mutable
code-network
code-nodeadmin
code-peerselection
code-storage
contrib
critical
defect
dev-infrastructure
documentation
duplicate
enhancement
fixed
invalid
major
minor
n/a
normal
operational
packaging
somebody else's problem
supercritical
task
trivial
unknown
was already fixed
website
wontfix
worksforme
No Milestone
No Assignees
3 Participants
Notifications
Due Date
No due date set.
Reference: tahoe-lafs/trac-2024-07-25#1525
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Brian noticed that a Deferred was being dropped at source:src/allmydata/frontends/sftpd.py@5179#L676. (Although this code changed when MDMF landed changeset:a87fc8ffab55dfd2, the immutable path of the [previous code]source:src/allmydata/frontends/sftpd.py@5127#L675 also dropped a Deferred, at line 680.)
It is incorrect to drop the Deferred, because that will cause SFTP read requests to hang if downloading the previous file contents fails. However fixing it while still allowing streaming reads is a bit tricky. The intention is that we download the file in parallel with accepting SFTP requests, and block each read request until the relevant part of the file is available (and prior writes have completed). If the download fails, then any SFTP read requests past the prefix that was read correctly should also fail. Also, if the file was written and not truncated, the SFTP close request should fail, at least in the case when the new contents depend on the part of the old contents that we weren't able to download.
Attachment fix-1525.darcs.patch (64091 bytes) added
SFTP: make sure that download failures are not dropped. fixes #1525
Attachment broken-tests-for-1525.darcs.patch (57845 bytes) added
SFTP: tests for ref #1525, but test_openFile_write is failing depending on the order of previous tests :-( Can anyone help debugging this?
In broken-tests-for-1525.darcs.patch, there seems to be a dependency on the test order:
Notice that the error in test_openfile_write refers to the directory
'sftp/openFile_read/servers/7r4gd6xu/storage/shares/hc/hcatcbxpnrcch37ihub4vi2shy/3'
being missing, but that is the wrong directory -- all the basedirs in each test should be unique, and the ones in test_openFile_write should start withsftp/openFile_write
. Can anyone see what is going on?Attachment more-deterministically-wrong.darcs.patch (77500 bytes) added
Version of broken test that is more deterministic (allmydata.test.test_sftp2)
note: this will also fix the deprecation warning seen in #1564
This ticket is marked
review-needed
but the comments make it sound like it isn't ready for review because some of the tests are wrong. If that's true, please removereview-needed
.The tests are indeed wrong.
On this morning's dev call, we agreed that if the tests can be made to pass soon (in the next week), we can land this, else we'll kick it out to 1.11. davidsarah is still on the hook to fix them.
The current work in progress on this ticket, not including tests, is at https://github.com/davidsarah/tahoe-lafs/commits/1525-sftp-failures. Currently the SFTP tests hang at
test_openFile_read
, and it doesn't look as though I'll have time to fix that for 1.10, so I'm bumping this to 1.11.I'm bringing this back into the 1.10 milestone, because (as mentioned in #1295), Twisted 12.3.0 has removed
IFinishableConsumer
. This patch removes use of that interface.SFTP: handle download failures correctlyto SFTP: handle download failures correctly; remove use of IFinishableConsumerSo the next step is for davidsarah to fix the unit test?
Replying to zooko:
Yes (and rebase the patch).
The import error part of this (formerly ticket #1926) is a blocker for 1.10.
Now, maybe we should re-open #1926 and fix it in the easy way (by constraining the Twisted version to not allow the latest Twisted). But I don't think so. I'd rather fix this ticket, #1525.
I'm working on this now.
Review needed for https://github.com/tahoe-lafs/tahoe-lafs/pull/33.Committed in 48a2989e and 50c65629. Still needs review.
marking as "critical" to indicate this is a 1.10 blocker. All it needs is someone (probably me) to review-after-the-fact.
I spent a couple of hours on the plane studying the SFTP code. Wow, it's complex. It may need to be that complex, but it sure was hard to get my head around the control flows. I've got a list of code-cleanup suggestions, but I suspect most of them were applicable to the pre-#1525 code as well, so don't really qualify as review for this ticket. I think I'll open a separate ticket to discuss them. If Daira's manual and automated testing shows these changes to have worked correctly, I'm +1 on them too.
Replying to warner:
Please do, while they are fresh in your mind!