don't let "Calling Stale Broker" trigger an incident -- that's a false alarm #1998

Open
opened 2013-06-07 21:25:23 +00:00 by zooko · 0 comments

A customer (thedod) sent me an incident report file. It turned out to be triggered by this:


    19:33:48.913 [237]: WEIRD Error in Segmentation FAILURE:

    [CopiedFailure instance: Traceback from remote host -- Traceback (most recent call last):
    Failure: allmydata.interfaces.NoSharesError: no shares (need 1). Last failure: [Failure instance: Traceback: <class 'foolscap.ipb.DeadReferenceError'>: Calling Stale Broker
    /home/nonboong/src/tahoe-lafs/src/allmydata/immutable/downloader/share.py:269:_do_loop
    /home/nonboong/src/tahoe-lafs/src/allmydata/immutable/downloader/share.py:744:_send_requests
    /home/nonboong/src/tahoe-lafs/src/allmydata/immutable/downloader/share.py:754:_send_request
    /usr/lib/python2.7/dist-packages/foolscap/referenceable.py:415:callRemote
    --- <exception caught here> ---
    /usr/local/lib/python2.7/site-packages/Twisted-12.0.0-py2.7-linux-i686.egg/twisted/internet/defer.py:134:maybeDeferred
    /usr/lib/python2.7/dist-packages/foolscap/referenceable.py:455:_callRemote
    /usr/lib/python2.7/dist-packages/foolscap/broker.py:477:newRequestID
    ]
    ]

    [INCIDENT-TRIGGER]

I'm pretty sure there's nothing out of the ordinary about this. It's just what happens whenever a network connection to a storage server fails during an operation. Is that right?

So I'm asking the customer to look for other incident report files to send to me. But if this was a false alarm, then it has impeded our investigation and should be silenced.

Assigning to Brian to confirm my understanding that this is common and harmless.
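
If the answer is yes, one way to silence it would be to log the failure at a lower severity whenever the underlying cause is a dead connection. The following is only a minimal sketch of that idea, not the actual downloader code. The exception classes are stand-ins for `foolscap.ipb.DeadReferenceError` and `allmydata.interfaces.NoSharesError`, the `last_failure` attribute is hypothetical, the numeric severity values are illustrative, and the helper names (`root_cause`, `severity_for`, `triggers_incident`) are made up for this example.

```python
# Stand-in exception classes so this sketch runs without foolscap or
# Tahoe-LAFS installed. They only mimic the names seen in the traceback.
class DeadReferenceError(Exception):
    """Stand-in for foolscap.ipb.DeadReferenceError."""


class NoSharesError(Exception):
    """Stand-in for allmydata.interfaces.NoSharesError."""

    def __init__(self, message, last_failure=None):
        super().__init__(message)
        # Hypothetical attribute: the error that made the last server unusable.
        self.last_failure = last_failure


# Illustrative severity values (assumed for this sketch).
OPERATIONAL = 20
WEIRD = 30
# Assumption: events logged at WEIRD or above trigger an incident report.
INCIDENT_THRESHOLD = WEIRD

# Errors that are expected whenever a connection to a storage server drops.
EXPECTED_NETWORK_ERRORS = (DeadReferenceError,)


def root_cause(exc):
    """Follow the chain of last_failure links down to the innermost error."""
    while getattr(exc, "last_failure", None) is not None:
        exc = exc.last_failure
    return exc


def severity_for(exc):
    """Pick a log severity: ordinary connection loss is not WEIRD."""
    if isinstance(root_cause(exc), EXPECTED_NETWORK_ERRORS):
        return OPERATIONAL
    return WEIRD


def triggers_incident(exc):
    """Return True if logging this error would reach the incident threshold."""
    return severity_for(exc) >= INCIDENT_THRESHOLD
```

With this classification, a `NoSharesError` whose last failure is "Calling Stale Broker" would be logged below the incident threshold, while a `NoSharesError` with no known network cause would still trigger an incident.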

zooko added the operational, normal, defect, 1.10.0 labels 2013-06-07 21:25:23 +00:00
zooko added this to the undecided milestone 2013-06-07 21:25:23 +00:00
warner was assigned by zooko 2013-06-07 21:25:23 +00:00
Reference: tahoe-lafs/trac-2024-07-25#1998