Skip to content

Commit ffe81d4

Browse files
committed
blk-mq: fix corruption with direct issue
If we attempt a direct issue to a SCSI device, and it returns BUSY, then we queue the request up normally. However, the SCSI layer may have already setup SG tables etc for this particular command. If we later merge with this request, then the old tables are no longer valid. Once we issue the IO, we only read/write the original part of the request, not the new state of it. This causes data corruption, and is most often noticed with the file system complaining about the just read data being invalid: [ 235.934465] EXT4-fs error (device sda1): ext4_iget:4831: inode #7142: comm dpkg-query: bad extra_isize 24937 (inode size 256) because most of it is garbage... This doesn't happen from the normal issue path, as we will simply defer the request to the hardware queue dispatch list if we fail. Once it's on the dispatch list, we never merge with it. Fix this from the direct issue path by flagging the request as REQ_NOMERGE so we don't change the size of it before issue. See also: https://bugzilla.kernel.org/show_bug.cgi?id=201685 Tested-by: Guenter Roeck <linux@roeck-us.net> Fixes: 6ce3dd6 ("blk-mq: issue directly if hw queue isn't busy in case of 'none'") Cc: stable@vger.kernel.org Signed-off-by: Jens Axboe <axboe@kernel.dk>
1 parent fd6f32f commit ffe81d4

File tree

1 file changed

+25
-1
lines changed

1 file changed

+25
-1
lines changed

block/blk-mq.c

Lines changed: 25 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1715,6 +1715,15 @@ static blk_status_t __blk_mq_issue_directly(struct blk_mq_hw_ctx *hctx,
17151715
break;
17161716
case BLK_STS_RESOURCE:
17171717
case BLK_STS_DEV_RESOURCE:
1718+
/*
1719+
* If direct dispatch fails, we cannot allow any merging on
1720+
* this IO. Drivers (like SCSI) may have set up permanent state
1721+
* for this request, like SG tables and mappings, and if we
1722+
* merge to it later on then we'll still only do IO to the
1723+
* original part.
1724+
*/
1725+
rq->cmd_flags |= REQ_NOMERGE;
1726+
17181727
blk_mq_update_dispatch_busy(hctx, true);
17191728
__blk_mq_requeue_request(rq);
17201729
break;
@@ -1727,6 +1736,18 @@ static blk_status_t __blk_mq_issue_directly(struct blk_mq_hw_ctx *hctx,
17271736
return ret;
17281737
}
17291738

1739+
/*
1740+
* Don't allow direct dispatch of anything but regular reads/writes,
1741+
* as some of the other commands can potentially share request space
1742+
* with data we need for the IO scheduler. If we attempt a direct dispatch
1743+
* on those and fail, we can't safely add it to the scheduler afterwards
1744+
* without potentially overwriting data that the driver has already written.
1745+
*/
1746+
static bool blk_rq_can_direct_dispatch(struct request *rq)
1747+
{
1748+
return req_op(rq) == REQ_OP_READ || req_op(rq) == REQ_OP_WRITE;
1749+
}
1750+
17301751
static blk_status_t __blk_mq_try_issue_directly(struct blk_mq_hw_ctx *hctx,
17311752
struct request *rq,
17321753
blk_qc_t *cookie,
@@ -1748,7 +1769,7 @@ static blk_status_t __blk_mq_try_issue_directly(struct blk_mq_hw_ctx *hctx,
17481769
goto insert;
17491770
}
17501771

1751-
if (q->elevator && !bypass_insert)
1772+
if (!blk_rq_can_direct_dispatch(rq) || (q->elevator && !bypass_insert))
17521773
goto insert;
17531774

17541775
if (!blk_mq_get_dispatch_budget(hctx))
@@ -1810,6 +1831,9 @@ void blk_mq_try_issue_list_directly(struct blk_mq_hw_ctx *hctx,
18101831
struct request *rq = list_first_entry(list, struct request,
18111832
queuelist);
18121833

1834+
if (!blk_rq_can_direct_dispatch(rq))
1835+
break;
1836+
18131837
list_del_init(&rq->queuelist);
18141838
ret = blk_mq_request_issue_directly(rq);
18151839
if (ret != BLK_STS_OK) {

0 commit comments

Comments
 (0)