Tahoe backup should WARN and go on when finding errors like: links to deleted files or access/read permission denied in local files/directories #729
Labels
No Label
0.2.0
0.3.0
0.4.0
0.5.0
0.5.1
0.6.0
0.6.1
0.7.0
0.8.0
0.9.0
1.0.0
1.1.0
1.10.0
1.10.1
1.10.2
1.10a2
1.11.0
1.12.0
1.12.1
1.13.0
1.14.0
1.15.0
1.15.1
1.2.0
1.3.0
1.4.1
1.5.0
1.6.0
1.6.1
1.7.0
1.7.1
1.7β
1.8.0
1.8.1
1.8.2
1.8.3
1.8β
1.9.0
1.9.0-s3branch
1.9.0a1
1.9.0a2
1.9.0b1
1.9.1
1.9.2
1.9.2a1
LeastAuthority.com automation
blocker
cannot reproduce
cloud-branch
code
code-dirnodes
code-encoding
code-frontend
code-frontend-cli
code-frontend-ftp-sftp
code-frontend-magic-folder
code-frontend-web
code-mutable
code-network
code-nodeadmin
code-peerselection
code-storage
contrib
critical
defect
dev-infrastructure
documentation
duplicate
enhancement
fixed
invalid
major
minor
n/a
normal
operational
packaging
somebody else's problem
supercritical
task
trivial
unknown
was already fixed
website
wontfix
worksforme
No Milestone
No Assignees
3 Participants
Notifications
Due Date
No due date set.
Reference: tahoe-lafs/trac-2024-07-25#729
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
I am facing this problem when backing up my files:
This destination does not exists in my local filesystem: /var/named/etc/namedb but I have this (broken) link in my local "to backup" directory: stockrt/Roger/etc
I think Tahoe should warn it but do not break just because of it.
The same should occur for read/access permission denied for files and directories.
Regards,
Rogério Schneider
reformatted traceback
The failure for dangling symlinks is #641. (This ticket is not quite a duplicate because it also asks to tolerate other failures.)
Attachment tahoe-backup-tolerate-errors.dpatch (39327 bytes) added
This patch renders tahoe backup more tolerant to special or unreadable files. A warning is displayed on stderr for each file which was skipped.
I think it's a good first step which allows tahoe backup to work out of the box for most users before it can store all those Unix specialties (see ticket #641).
Vista has symbolic links (NTFS junction points), but Windows Python probably doesn't have
os.symlink
yet. There is amklink
command that could be used to test behaviour with junction points. I don't think that the lack of a test on Windows should hold up this fix, though.Wouldn't it be better to catch the exception caused by an access error, rather than testing using
os.access
etc?I agree, although I'd also like to avoid even trying to open some things. It would be pretty unfortunate to discover a copy of /dev/zero or /dev/urandom lying in your backup source tree.
Our discussions on #833 (dealing with illegal mutable children inside an immutable directory) led us to want "tahoe cp" to warn-and-skip "bad" things found on the tahoe side, and we figured that the same policy would be appropriate for "bad" things found on the local disk side (specifically symlinks and device files). I think it'd be reasonable to have "tahoe backup" do the same.
If so, the way I'd do it would be to have both "tahoe backup" and "tahoe cp" test the potential local source filesystem object with
os.path.isfile
andos.path.isdir
before opening or entering it. These tests should rule out special files and symlinks (whether dangling or not). Skipping symlinks to directories is more important than skipping symlinks to files, but I think it'd be ok to skip both unless/until we come up with some reasonable way to record these sorts of things in tahoe directly.If these tests pass, then we try to open the thing, and if that raises
EnvironmentError
(as would be caused by permission problems), we should emit a warning and skip over it. It might be nice to count how many such warnings we emit and dump a summary at the end, so that folks using --verbose never get to see that something went wrong. Also, we should consider having the "tahoe backup" exit code be non-zero if anything was skipped due to errors or non-normal-fileness.Thanks Brian and David-Sarah for the comments, here is a newer version of the patch which takes your comments into account. What do you think now?
Attachment tahoe-backup-tolerate-errors-v2.dpatch (47449 bytes) added
Looks good, just some nitpicks:
except OSError:
when listing a directory, butexcept IOError:
when processing a file?upload
: "when call" should be "when called"Replying to davidsarah:
It seems to be the way the Python stdlib works. But as Brian mentioned, I should probably catch the parent exception
EnvironmentError
instead in both cases.Thanks.
I'll send an updated patch.
Attachment tahoe-backup-tolerate-errors-v3.dpatch (51943 bytes) added
Brian, Zooko, do you feel comfortable commiting this patch before 1.6?
Reviewed, can't see any problems. Tests look sufficient.
applied in changeset:b03406af9d216278. Thank you very much!
I kinda think we should also be skipping symlinks in general (not just dangling ones). The
os.path.isdir
andos.path.isfile
tests will return True for symlinks that point to directories and files. So if we want to skip symlinks, those tests need to turn intoif os.path.isdir() and not os.path.islink()
, etc.