Skip to content

Commit cbf1f9a

Browse files
Andrew Boyerdledford
authored andcommitted
IB/rxe: Drop future atomic/read packets rather than retrying
If the completer is in the middle of a large read operation, one lost packet can cause havoc. Going to COMPST_ERROR_RETRY will cause the requester to resend the request. After that, any packet from the first attempt still in the receive queue will be interpreted as an error, restarting the error/retry sequence. The transfer will quickly exhaust its retries. This behavior is very noticeable when doing 512KB reads on a QEMU system configured with 1500B MTU. Also, a resent request here will prompt the responder on the other side to immediately start resending, but the resent packets will get stuck in the already-loaded receive queue and will never be processed. Rather than erroring out every time an unexpected future packet arrives, just drop it. Eventually the retry timer will send a duplicate request; the completer will be able to make progress since the queue will start relatively empty. Signed-off-by: Andrew Boyer <andrew.boyer@dell.com> Signed-off-by: Doug Ledford <dledford@redhat.com>
1 parent 37b3619 commit cbf1f9a

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

drivers/infiniband/sw/rxe/rxe_comp.c

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -224,7 +224,7 @@ static inline enum comp_state check_psn(struct rxe_qp *qp,
224224
else
225225
return COMPST_DONE;
226226
} else if ((diff > 0) && (wqe->mask & WR_ATOMIC_OR_READ_MASK)) {
227-
return COMPST_ERROR_RETRY;
227+
return COMPST_DONE;
228228
} else {
229229
return COMPST_CHECK_ACK;
230230
}

0 commit comments

Comments
 (0)