test_magic_folder timeout on windows #2914

Open
opened 2018-03-29 20:50:46 +00:00 by warner · 4 comments

Our Appveyor CI system shows test_magic_folder.RealTest.test_batched_process hitting a timeout error ("test still running after 120 seconds") on windows, on maybe one out of every 5 builds.

I've reconfigured .appveyor.yml to include the trial test.log as an artifact, so we can see if it's throwing an error, or if it's just taking too long and sometimes exceeds the limit. Then we can either be lazy and increase the timeout, or actually make it run faster, or if it's an intermittent deadlock then fix it for real.

Our Appveyor CI system shows `test_magic_folder.RealTest.test_batched_process` hitting a timeout error ("test still running after 120 seconds") on windows, on maybe one out of every 5 builds. I've reconfigured `.appveyor.yml` to include the trial test.log as an artifact, so we can see if it's throwing an error, or if it's just taking too long and sometimes exceeds the limit. Then we can either be lazy and increase the timeout, or actually make it run faster, or if it's an intermittent deadlock then fix it for real.
warner added the
code
normal
defect
1.12.1
labels 2018-03-29 20:50:46 +00:00
warner added this to the 1.13.0 milestone 2018-03-29 20:50:46 +00:00
Author

This test takes 0.441 seconds on our linux buildbot ("xenial"), and the longest magic-folder test takes 2.3 seconds (test_magic_folder.RealTestAliceBob.test_alice_bob), so I'm guessing this is an intermittent deadlock, rather than a very slow test that would complete if we just increased the timeout somewhat. I'm not sure why we've only seen it on appveyor, though; it suggests that there's something windows-specific about the bug.

This test takes 0.441 seconds on our linux buildbot ("xenial"), and the longest magic-folder test takes 2.3 seconds (`test_magic_folder.RealTestAliceBob.test_alice_bob`), so I'm guessing this is an intermittent deadlock, rather than a very slow test that would complete if we just increased the timeout somewhat. I'm not sure why we've only seen it on appveyor, though; it suggests that there's something windows-specific about the bug.
Author

I got some logs: when it passes on appveyor, it only takes about 400ms, just like on linux. I'll wait until we get a failing build to see what's happening there.

I got some logs: when it passes on appveyor, it only takes about 400ms, just like on linux. I'll wait until we get a failing build to see what's happening there.

Moving open issues out of closed milestones.

Moving open issues out of closed milestones.
exarkun modified the milestone from 1.13.0 to 1.15.0 2020-06-30 14:45:13 +00:00
Owner

Ticket retargeted after milestone closed

Ticket retargeted after milestone closed
meejah modified the milestone from 1.15.0 to soon 2021-03-30 18:40:19 +00:00
Sign in to join this conversation.
No Milestone
No Assignees
3 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Reference: tahoe-lafs/trac-2024-07-25#2914
No description provided.