leasedb: performance regression #1870
The 1818-leasedb branch has a performance regression that shows up when running the test suite -- in fact, the test suite cannot pass at the moment purely due to timeouts.
Since the regression does not show up when using make tmpfstest (which uses a memory-based tmpfs filesystem rather than disk), my tentative conclusion is that it is due to the latency of leasedb database syncs. There are currently many redundant syncs, both because every SQL query/update is in a separate transaction and because there are more SQL queries and updates than necessary per storage API request. We could also use a more relaxed consistency mode, if that is safe.
Here are my notes about this:
https://tahoe-lafs.org/pipermail/tahoe-dev/2012-December/007877.html
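To make the transaction overhead concrete: with sqlite's default rollback journal, every transaction ends in at least one fsync, so issuing each statement in its own transaction pays that cost per statement. A minimal sketch of the difference, using Python's sqlite3 module (the table and column names are made up for illustration and are not the actual leasedb schema or code):

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # stand-in for the on-disk leasedb
conn.execute("CREATE TABLE shares (storage_index TEXT, shnum INTEGER, "
             "used_space INTEGER, PRIMARY KEY (storage_index, shnum))")
conn.execute("INSERT INTO shares VALUES (?,?,?)", ("si1", 0, 0))
conn.commit()

# One transaction -- and, on disk, one journal sync -- per statement
# (roughly the per-statement behaviour described above):
conn.execute("UPDATE shares SET used_space=? WHERE storage_index=? AND shnum=?",
             (1024, "si1", 0))
conn.commit()
conn.execute("UPDATE shares SET used_space=? WHERE storage_index=? AND shnum=?",
             (2048, "si1", 0))
conn.commit()

# One transaction -- and one sync -- for the whole batch:
with conn:  # the connection context manager commits once on exit
    conn.execute("UPDATE shares SET used_space=? WHERE storage_index=? AND shnum=?",
                 (4096, "si1", 0))
    conn.execute("DELETE FROM shares WHERE storage_index=? AND shnum=?",
                 ("si1", 0))
```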
Bottom line: I believe we should turn on sqlite's synchronous = NORMAL, journal mode = WAL.
Also, I believe we should fix ticket #1893, which would reduce this load.
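For reference, a minimal sketch of how those pragmas would be applied on a Python sqlite3 connection (not the actual dbutil.py code). With journal_mode=WAL plus synchronous=NORMAL, sqlite only syncs the write-ahead log at checkpoints rather than on every commit, so a crash can lose the most recently committed transactions but should not corrupt the database:

```python
import sqlite3

conn = sqlite3.connect("leasedb.sqlite")  # hypothetical path

# journal_mode is persistent for the database file; the pragma returns the
# mode actually in effect.
print(conn.execute("PRAGMA journal_mode = WAL;").fetchone())  # e.g. ('wal',)

# synchronous is a per-connection setting.
conn.execute("PRAGMA synchronous = NORMAL;")
```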
I thought that mode caused a problem with file handle leakage? Or am I misremembering?
No, I remembered correctly. In current leasedb.py:
Replying to zooko:
I'm skeptical. We don't even know whether most of the increased latency is for mutable or immutable operations, and #1893 would only affect mutable writes. In any case, that can't be the cause of the regression, since trunk has always renewed leases on mutable writes.
Replying to daira (comment:8):
I didn't say that #1893 could be the cause of the regression!
But judging from my comments in https://tahoe-lafs.org/pipermail/tahoe-dev/2012-December/007877.html, it sounds like calls to leasedb might be happening 3X as often as they need to, or even more, so that might have a significant effect on performance.
I've split the file descriptor issue out to #2015.
Replying to zooko (comment:9):
Yes, but most of those calls would happen regardless of #1893, I think.
Now that #2015 is fixed, I think this no longer blocks merging the cloud branch, even though I would very much like to reduce or eliminate the remaining performance regression before merging.
I just ran all unit tests of trunk, and of 1819-cloud-merge-opensource, and the latter took a lot longer:
My next question was: is the 1819-cloud-merge-opensource branch taking longer to do the same tests as the master branch? Or is it that the extra tests on the new branch (1185 vs. 1139 on master) are what is making the test suite take longer? The answer is that it is at least partially that existing tests are slower, for example:
Next, I extended fdleakfinder to print out a histogram of how many times each filename was opened, and ran that on allmydata.test.test_cli.Backup.test_backup. This showed that the 1819 branch is not opening sqlite db files very often in this test (just once each), but that it is opening the share files a lot more often:
Interesting. Is it possible to identify where in the code the extra file opens are?
I'm not sure if this will help, but I ran the unit tests on master and 1819-cloud-merge-opensource and calculated the difference in time for all shared tests. Here are all the tests which are more than five seconds slower on the current cloud branch:
You can view the rest of the data I collected here: https://gist.github.com/markberger/136765c5e87480b8017e
Replying to zooko:
I'm not sure if this is relevant, but that test runs significantly faster on the cloud branch:
Edit: I posted the wrong test data.
Replying to zooko:
According to data I collected, the extra tests on 1819-cloud-merge-opensource take a total of 232.95s to execute, and the tests that both branches have in common account for a difference of about 628s.
markberger: can you test the hypothesis that the main problem is an increase in the latency of filesystem operations? To do that, repeat the comparison of trunk with 1819-cloud-merge-opensource, but using make tmpfstest for both.
After running the tests again using make tmpfstest, the increased latency does seem to be the largest factor in the regression. The 22s regression appears to occur in mutable update:
The rest of the tests have insignificant differences.
Thanks, that's very helpful.
I'm not too worried about the 22s regression. It's the rest of the difference shown in comment:90360 that we need to focus on, by reducing the number of syncs.
I was getting confused by the different measurements (mine and markberger's) of the timings of different test runs (comment:90360, comment:90361, comment:90364, comment:20, comment:21, comment:23).
So, I tried to reproduce markberger's observation from comment:90364 that the test case which takes the most additional time on the cloud branch is allmydata.test.test_cli.Cp.test_copy_using_filecap.
I also observe that this test takes about twice as long on the cloud branch as on trunk:
So then I ran it under strace, as instructed by the fdleakfinder docs. It took about 64 seconds to run this test under strace on the branch:
… and about 41 seconds to do it on trunk:
I ran fdleakfinder on the strace files. It says that on trunk 6499 files were opened during the course of the test, and on branch 13,999 files were opened.
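(For reference, a rough sketch of this kind of per-filename open count -- this is not fdleakfinder itself, and the open()/openat() line format below is only an assumption about what an strace -f log contains:)

```python
import re
import sys
from collections import Counter

# Count how many times each filename is opened in an strace log, given lines
# of the form:  open("/some/path", O_RDONLY) = 7   or  openat(AT_FDCWD, "/some/path", ...)
OPEN_RE = re.compile(r'\bopen(?:at)?\((?:[^,"]*,\s*)?"([^"]+)"')

counts = Counter()
with open(sys.argv[1]) as f:
    for line in f:
        m = OPEN_RE.search(line)
        if m:
            counts[m.group(1)] += 1

# Print the 20 most frequently opened paths.
for path, n in counts.most_common(20):
    print("%6d  %s" % (n, path))
```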
From eyeballing fdleakfinder's output, I see one kind of file that is used a lot on the branch and not at all on trunk. There are 6520 uniquely-named tempfiles located in /var/tmp used on the branch and 0 on trunk. Each one is opened only once on the branch. Here's an excerpt from fdleakfinder's report:
Here's the count:
A search engine query for "sqlite /var/tmp/" took me to this thread, in which sqlite's author Richard Hipp seems to be saying that sqlite uses these temp files when computing a SQL query that could require too much RAM, and that the files in /var/tmp hold a temporary table that sqlite generates to compute such a query: http://www.mail-archive.com/sqlite-users@sqlite.org/msg73481.html
The example in the sqlite mailing list discussion is the "UNION ALL" query, which we don't use. But apparently some SQL query that we are using is causing sqlite to generate a temporary table.
Eek. If there are queries using space linear in the number of shares stored, that won't scale at all.
I wonder which queries are creating the temporary tables? I'll see if I can find out.
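One way to poke at that is to ask sqlite for a query plan and look for "USE TEMP B-TREE" in the output. This is only a sketch with an invented query (and, as the later comments suggest, the files here turn out to be statement journals, which EXPLAIN QUERY PLAN does not report):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE shares (storage_index TEXT, shnum INTEGER, "
             "used_space INTEGER, PRIMARY KEY (storage_index, shnum))")

# A query ordering by an unindexed column reports something like
# 'USE TEMP B-TREE FOR ORDER BY' in its plan; lookups keyed on the
# primary key should not.
for row in conn.execute("EXPLAIN QUERY PLAN "
                        "SELECT storage_index FROM shares ORDER BY used_space"):
    print(row)
```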
Only the following queries are used in that test:
(found by instrumenting leasedb.py and piping the output through |sort |uniq).
These are not queries that I would expect to use temporary tables, given that (storage_index, shnum) is the primary key on shares.
The temporary files are opened during INSERT OR REPLACE INTO and UPDATE queries. They are not opened during SELECT queries.
This is informative: http://www.sqlite.org/tempfiles.html
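The queries themselves are elided above; purely as hypothetical stand-ins with the same shape -- single-row statements keyed on the (storage_index, shnum) primary key -- they would look something like the following (column names other than the key are invented):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE shares (storage_index TEXT, shnum INTEGER, "
             "used_space INTEGER, state INTEGER, "
             "PRIMARY KEY (storage_index, shnum))")

# Single-row upsert keyed on the primary key:
conn.execute("INSERT OR REPLACE INTO shares VALUES (?,?,?,?)",
             ("si1", 0, 1024, 1))

# Single-row update; the WHERE clause pins down the full primary key:
conn.execute("UPDATE shares SET used_space=? "
             "WHERE storage_index=? AND shnum=?",
             (2048, "si1", 0))
conn.commit()
```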
Aha, I believe these are Statement Journal Files (section 2.5 of http://www.sqlite.org/tempfiles.html).
What's confusing is that "A statement journal is only created for an UPDATE or INSERT statement that might change multiple rows of a database [...]".
INSERT OR REPLACE INTO queries by definition never affect multiple rows. The UPDATE statement in comment:90372 cannot update multiple rows because the WHERE clause specifies exact values for storage_index and shnum, which together form the primary key, although I can see why sqlite might not "know" that in advance.
I added this to dbutil.py, and the strace output no longer shows any files in /var/tmp being opened:
1819-cloud-merge branch with PRAGMA temp_store = MEMORY:
and without:
So a small improvement (1 minute on the full test suite) but nothing spectacular :-(
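The dbutil.py change itself isn't shown above; as a rough sketch of the idea (assuming the pragma is simply issued once per connection), temp_store = MEMORY tells sqlite to keep its temporary structures -- which, per the observation above, includes whatever it was writing under /var/tmp -- in RAM:

```python
import sqlite3

def open_leasedb(path):
    # Hypothetical helper, not the actual dbutil.py code: open the database
    # and keep sqlite's temporary tables/journals in memory rather than in
    # files under /var/tmp.
    conn = sqlite3.connect(path)
    conn.execute("PRAGMA temp_store = MEMORY;")
    return conn

conn = open_leasedb(":memory:")
```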
As we noticed in the dev-chat today, Cp.test_copy_using_filecap is mistakenly copying its entire basedir into the tahoe filesystem (which includes all 10 server basedirs, and their shares, which are being created during the cp by the files being copied in... it's remarkable that the process terminates at all). This resulted in something like 174 shares being created (per server), instead of 4. Fixing this doesn't address the overall regression, but does explain why this one test was taking 17 seconds even on trunk.
OTOH, leaving that test broken while we figure out the performance regression provides us with a convenient lots-of-shares stress test that's highly responsive to changes in the leasedb times. I'd much rather iterate on a 30-ish second test than on the full test suite.
Replying to warner:
Fixed in #2048.
That's fine, we'll just not pull that fix onto 1819-cloud-merge[-opensource] yet.
Actually we shouldn't pay any attention to the unfixed version of Cp.test_copy_using_filecap, because it would be expected to have different behaviour on trunk vs the cloud branch anyway, due to the different number of files in the base directory. So that was a red herring, although I'm glad we fixed it.
If we want a stress test, allmydata.test.test_mutable.Update.test_replace_locations_max_shares is pretty stressful :-p
Milestone renamed
renaming milestone
Moving open issues out of closed milestones.
Ticket retargeted after milestone closed