don't re-use metadata from earlier snapshots, in a "tahoe backup" #2250

Open
opened 2014-06-27 01:12:48 +00:00 by zooko · 6 comments

Currently if you run tahoe backup on a directory, and the contents of the children of that directory have not changed, then tahoe backup will reuse the already-created immutable LAFS-directory. Therefore, the metadata on the children, such as the timestamps, owner information, etc., will be re-used. This can lead to a snapshot which says "Here's the state of the directory at time T", and then it shows the children, with their metadata from an earlier snapshot. This is very misleading, and only a very sophisticated user would be able to figure out that the metadata was actually re-used from a previous snapshot, and would be able to figure out in what cases metadata gets re-used vs. gets read afresh from the filesystem.

To fix this, make it so that if any of the metadata of any of the children has changed, then we make a new LAFS-directory to hold the current metadata of all the children, even if the contents of the children (and therefore their immutable file read-caps) haven't changed. (Excluding st_atime, which we do not record anyway, and which would cause intolerable thrashing if we did.)

Currently if you run `tahoe backup` on a directory, and the *contents* of the children of that directory have not changed, then `tahoe backup` will reuse the already-created immutable LAFS-directory. Therefore, the *metadata* on the children, such as the timestamps, owner information, etc., will be re-used. This can lead to a snapshot which says "Here's the state of the directory at time T", and then it shows the children, with their *metadata* from an earlier snapshot. This is very misleading, and only a very sophisticated user would be able to figure out that the metadata was actually re-used from a previous snapshot, and would be able to figure out in what cases metadata gets re-used vs. gets read afresh from the filesystem. To fix this, make it so that if any of the metadata of any of the children has changed, then we make a new LAFS-directory to hold the current metadata of all the children, even if the contents of the children (and therefore their immutable file read-caps) haven't changed. (*Excluding* `st_atime`, which we do not record anyway, and which would cause intolerable thrashing if we did.)
zooko added the
code
normal
defect
1.10.0
labels 2014-06-27 01:12:48 +00:00
zooko added this to the undecided milestone 2014-06-27 01:12:48 +00:00
Author

See related tickets:

  • #897 ("tahoe backup" thinks "ctime" means "creation time")
  • #1946 (consider removing some st_* fields from metadata)
See related tickets: * #897 ("tahoe backup" thinks "ctime" means "creation time") * #1946 (consider removing some st_* fields from metadata)
daira commented 2014-07-24 00:20:28 +00:00
Owner

Agreed.

Agreed.
tahoe-lafs modified the milestone from undecided to 1.12.0 2014-07-24 00:20:28 +00:00
tahoe-lafs added
code-frontend-cli
and removed
code
labels 2014-07-24 00:21:01 +00:00

Milestone renamed

Milestone renamed
warner modified the milestone from 1.12.0 to 1.13.0 2016-03-22 05:02:25 +00:00

renaming milestone

renaming milestone
warner modified the milestone from 1.13.0 to 1.14.0 2016-06-28 18:17:14 +00:00

Moving open issues out of closed milestones.

Moving open issues out of closed milestones.
exarkun modified the milestone from 1.14.0 to 1.15.0 2020-06-30 14:45:13 +00:00
Owner

Ticket retargeted after milestone closed

Ticket retargeted after milestone closed
meejah modified the milestone from 1.15.0 to soon 2021-03-30 18:40:19 +00:00
Sign in to join this conversation.
No Milestone
No Assignees
5 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Reference: tahoe-lafs/trac-2024-07-25#2250
No description provided.