From: Linus Torvalds <torvalds@linux-foundation.org>
Date: Mon, 29 Jan 2018 19:51:49 +0000 (-0800)
Subject: Merge branch 'for-4.16/block' of git://git.kernel.dk/linux-block
X-Git-Tag: v4.16-rc1~184
X-Git-Url: https://asedeno.scripts.mit.edu/gitweb/?a=commitdiff_plain;h=0a4b6e2f80aad46fb55a5cf7b1664c0aef030ee0;p=linux.git

Merge branch 'for-4.16/block' of git://git.kernel.dk/linux-block

Pull block updates from Jens Axboe:
 "This is the main pull request for block IO related changes for the
  4.16 kernel. Nothing major in this pull request, but a good amount of
  improvements and fixes all over the map. This contains:

   - BFQ improvements, fixes, and cleanups from Angelo, Chiara, and
     Paolo.

   - Support for SMR zones for deadline and mq-deadline from Damien and
     Christoph.

   - Set of fixes for bcache by way of Michael Lyle, including fixes
     from himself, Kent, Rui, Tang, and Coly.

   - Series from Matias for lightnvm with fixes from Hans Holmberg,
     Javier, and Matias. Mostly centered around pblk, and the removal
     of rrpc 1.2 in preparation for supporting 2.0.

   - A couple of NVMe pull requests from Christoph. Nothing major in
     here, just fixes and cleanups, and support for command tracing from
     Johannes.

   - Support for blk-throttle for tracking reads and writes separately.
     From Joseph Qi. A few cleanups/fixes also for blk-throttle from
     Weiping.

   - Series from Mike Snitzer that enables dm to register its queue more
     logically, something that's always been problematic on dm since
     it's a stacked device.

   - Series from Ming cleaning up some of the bio accessor use, in
     preparation for supporting multipage bvecs.

   - Various fixes from Ming closing up holes around queue mapping and
     quiescing.

   - BSD partition fix from Richard Narron, fixing a problem where we
     can't mount newer (10/11) FreeBSD partitions.

   - Series from Tejun reworking blk-mq timeout handling. The previous
     scheme relied on atomic bits, but it had races where we would think
     a request had timed out if it got reused at the wrong time.

   - null_blk now supports faking timeouts, to enable us to better
     exercise and test that functionality separately. From me.

   - Kill the separate atomic poll bit in the request struct. After
     this, we don't use the atomic bits on blk-mq anymore at all. From
     me.

   - sgl_alloc/free helpers from Bart.

   - Heavily contended tag case scalability improvement from me.

   - Various little fixes and cleanups from Arnd, Bart, Corentin,
     Douglas, Eryu, Goldwyn, and myself"
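
The blk-mq timeout rework mentioned above replaces the atomic bits with a
single word that carries both a state value and a generation number, so the
timeout path can tell one use of a request from a later reuse. The following
is a minimal userspace sketch of that packing idea, not the kernel code: the
sketch_* names, the two-bit state width, and the choice to bump the
generation when a request goes in flight are all assumptions made for
illustration.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical names and layout; the kernel's constants and helpers may differ. */
enum sketch_rq_state {
	SKETCH_RQ_IDLE = 0,
	SKETCH_RQ_IN_FLIGHT = 1,
	SKETCH_RQ_COMPLETE = 2,
};

#define SKETCH_STATE_BITS 2
#define SKETCH_STATE_MASK ((UINT64_C(1) << SKETCH_STATE_BITS) - 1)
#define SKETCH_GEN_INC    (UINT64_C(1) << SKETCH_STATE_BITS)

/* The lower bits hold the state. */
static inline enum sketch_rq_state sketch_state(uint64_t gstate)
{
	return (enum sketch_rq_state)(gstate & SKETCH_STATE_MASK);
}

/* The upper bits hold the generation number. */
static inline uint64_t sketch_generation(uint64_t gstate)
{
	return gstate >> SKETCH_STATE_BITS;
}

/*
 * Replace the state bits. Assumption: starting a request (IN_FLIGHT)
 * also increments the generation, marking a new reuse instance.
 */
static inline uint64_t sketch_update(uint64_t gstate, enum sketch_rq_state state)
{
	uint64_t next = (gstate & ~SKETCH_STATE_MASK) | (uint64_t)state;

	if (state == SKETCH_RQ_IN_FLIGHT)
		next += SKETCH_GEN_INC;
	return next;
}

/*
 * A timeout observed for one instance applies only if the request still
 * carries exactly the same generation and state.
 */
static inline bool sketch_timeout_applies(uint64_t current, uint64_t observed)
{
	return current == observed;
}
```

With this layout, a timeout that sampled the word before the request was
completed and reissued compares unequal afterwards, which is the kind of
reuse race the old atomic bits could not rule out.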

* 'for-4.16/block' of git://git.kernel.dk/linux-block: (186 commits)
  block: remove smart1,2.h
  nvme: add tracepoint for nvme_complete_rq
  nvme: add tracepoint for nvme_setup_cmd
  nvme-pci: introduce RECONNECTING state to mark initializing procedure
  nvme-rdma: remove redundant boolean for inline_data
  nvme: don't free uuid pointer before printing it
  nvme-pci: Suspend queues after deleting them
  bsg: use pr_debug instead of hand crafted macros
  blk-mq-debugfs: don't allow write on attributes with seq_operations set
  nvme-pci: Fix queue double allocations
  block: Set BIO_TRACE_COMPLETION on new bio during split
  blk-throttle: use queue_is_rq_based
  block: Remove kblockd_schedule_delayed_work{,_on}()
  blk-mq: Avoid that blk_mq_delay_run_hw_queue() introduces unintended delays
  blk-mq: Rename blk_mq_request_direct_issue() into blk_mq_request_issue_directly()
  lib/scatterlist: Fix chaining support in sgl_alloc_order()
  blk-throttle: track read and write request individually
  block: add bdev_read_only() checks to common helpers
  block: fail op_is_write() requests to read-only partitions
  blk-throttle: export io_serviced_recursive, io_service_bytes_recursive
  ...
---

0a4b6e2f80aad46fb55a5cf7b1664c0aef030ee0
diff --cc include/linux/blkdev.h
index 0ce8a372d506,afc43fb63c16..4f3df807cf8f
--- a/include/linux/blkdev.h
+++ b/include/linux/blkdev.h
@@@ -226,11 -225,37 +225,37 @@@ struct request 
  
  	unsigned int extra_len;	/* length of alignment and padding */
  
- 	unsigned short write_hint;
+ 	/*
+ 	 * On blk-mq, the lower bits of ->gstate (generation number and
+ 	 * state) carry the MQ_RQ_* state value and the upper bits the
+ 	 * generation number which is monotonically incremented and used to
+ 	 * distinguish the reuse instances.
+ 	 *
+ 	 * ->gstate_seq allows updates to ->gstate and other fields
+ 	 * (currently ->deadline) during request start to be read
+ 	 * atomically from the timeout path, so that it can operate on a
+ 	 * coherent set of information.
+ 	 */
+ 	seqcount_t gstate_seq;
+ 	u64 gstate;
+ 
+ 	/*
+ 	 * ->aborted_gstate is used by the timeout to claim a specific
+ 	 * recycle instance of this request.  See blk_mq_timeout_work().
+ 	 */
+ 	struct u64_stats_sync aborted_gstate_sync;
+ 	u64 aborted_gstate;
+ 
+ 	/* access through blk_rq_set_deadline, blk_rq_deadline */
+ 	unsigned long __deadline;
  
- 	unsigned long deadline;
  	struct list_head timeout_list;
  
+ 	union {
 -		call_single_data_t csd;
++		struct __call_single_data csd;
+ 		u64 fifo_time;
+ 	};
+ 
  	/*
  	 * completion callback.
  	 */
@@@ -239,21 -264,17 +264,27 @@@
  
  	/* for bidi */
  	struct request *next_rq;
+ 
+ #ifdef CONFIG_BLK_CGROUP
+ 	struct request_list *rl;		/* rl this rq is alloced from */
+ 	unsigned long long start_time_ns;
+ 	unsigned long long io_start_time_ns;    /* when passed to hardware */
+ #endif
  };
  
 +static inline bool blk_op_is_scsi(unsigned int op)
 +{
 +	return op == REQ_OP_SCSI_IN || op == REQ_OP_SCSI_OUT;
 +}
 +
 +static inline bool blk_op_is_private(unsigned int op)
 +{
 +	return op == REQ_OP_DRV_IN || op == REQ_OP_DRV_OUT;
 +}
 +
  static inline bool blk_rq_is_scsi(struct request *rq)
  {
 -	return req_op(rq) == REQ_OP_SCSI_IN || req_op(rq) == REQ_OP_SCSI_OUT;
 +	return blk_op_is_scsi(req_op(rq));
  }
  
  static inline bool blk_rq_is_private(struct request *rq)