blob: 32a96cc392151abf7e04f505069e5a6499c80860 [file] [log] [blame]
Linus Torvalds1da177e2005-04-16 15:20:36 -07001Linux for S/390 and zSeries
2
3Common Device Support (CDS)
4Device Driver I/O Support Routines
5
6Authors : Ingo Adlung
7 Cornelia Huck
8
9Copyright, IBM Corp. 1999-2002
10
11Introduction
12
13This document describes the common device support routines for Linux/390.
14Different than other hardware architectures, ESA/390 has defined a unified
15I/O access method. This gives relief to the device drivers as they don't
16have to deal with different bus types, polling versus interrupt
17processing, shared versus non-shared interrupt processing, DMA versus port
18I/O (PIO), and other hardware features more. However, this implies that
19either every single device driver needs to implement the hardware I/O
20attachment functionality itself, or the operating system provides for a
21unified method to access the hardware, providing all the functionality that
22every single device driver would have to provide itself.
23
24The document does not intend to explain the ESA/390 hardware architecture in
25every detail.This information can be obtained from the ESA/390 Principles of
26Operation manual (IBM Form. No. SA22-7201).
27
28In order to build common device support for ESA/390 I/O interfaces, a
29functional layer was introduced that provides generic I/O access methods to
30the hardware.
31
32The common device support layer comprises the I/O support routines defined
33below. Some of them implement common Linux device driver interfaces, while
34some of them are ESA/390 platform specific.
35
36Note:
37In order to write a driver for S/390, you also need to look into the interface
38described in Documentation/s390/driver-model.txt.
39
40Note for porting drivers from 2.4:
41The major changes are:
42* The functions use a ccw_device instead of an irq (subchannel).
43* All drivers must define a ccw_driver (see driver-model.txt) and the associated
44 functions.
45* request_irq() and free_irq() are no longer done by the driver.
46* The oper_handler is (kindof) replaced by the probe() and set_online() functions
47 of the ccw_driver.
48* The not_oper_handler is (kindof) replaced by the remove() and set_offline()
49 functions of the ccw_driver.
50* The channel device layer is gone.
51* The interrupt handlers must be adapted to use a ccw_device as argument.
52 Moreover, they don't return a devstat, but an irb.
53* Before initiating an io, the options must be set via ccw_device_set_options().
54
55read_dev_chars()
56 read device characteristics
57
58read_conf_data()
Cornelia Huck9fc14272005-05-01 08:59:00 -070059read_conf_data_lpm()
Linus Torvalds1da177e2005-04-16 15:20:36 -070060 read configuration data.
61
62ccw_device_get_ciw()
63 get commands from extended sense data.
64
65ccw_device_start()
Cornelia Huck9fc14272005-05-01 08:59:00 -070066ccw_device_start_timeout()
67ccw_device_start_key()
68ccw_device_start_key_timeout()
Linus Torvalds1da177e2005-04-16 15:20:36 -070069 initiate an I/O request.
70
71ccw_device_resume()
72 resume channel program execution.
73
74ccw_device_halt()
75 terminate the current I/O request processed on the device.
76
77do_IRQ()
78 generic interrupt routine. This function is called by the interrupt entry
79 routine whenever an I/O interrupt is presented to the system. The do_IRQ()
80 routine determines the interrupt status and calls the device specific
81 interrupt handler according to the rules (flags) defined during I/O request
82 initiation with do_IO().
83
84The next chapters describe the functions other than do_IRQ() in more details.
85The do_IRQ() interface is not described, as it is called from the Linux/390
86first level interrupt handler only and does not comprise a device driver
87callable interface. Instead, the functional description of do_IO() also
88describes the input to the device specific interrupt handler.
89
90Note: All explanations apply also to the 64 bit architecture s390x.
91
92
93Common Device Support (CDS) for Linux/390 Device Drivers
94
95General Information
96
97The following chapters describe the I/O related interface routines the
98Linux/390 common device support (CDS) provides to allow for device specific
99driver implementations on the IBM ESA/390 hardware platform. Those interfaces
100intend to provide the functionality required by every device driver
101implementaion to allow to drive a specific hardware device on the ESA/390
102platform. Some of the interface routines are specific to Linux/390 and some
103of them can be found on other Linux platforms implementations too.
104Miscellaneous function prototypes, data declarations, and macro definitions
105can be found in the architecture specific C header file
106linux/include/asm-s390/irq.h.
107
108Overview of CDS interface concepts
109
110Different to other hardware platforms, the ESA/390 architecture doesn't define
111interrupt lines managed by a specific interrupt controller and bus systems
112that may or may not allow for shared interrupts, DMA processing, etc.. Instead,
113the ESA/390 architecture has implemented a so called channel subsystem, that
114provides a unified view of the devices physically attached to the systems.
115Though the ESA/390 hardware platform knows about a huge variety of different
116peripheral attachments like disk devices (aka. DASDs), tapes, communication
117controllers, etc. they can all by accessed by a well defined access method and
118they are presenting I/O completion a unified way : I/O interruptions. Every
119single device is uniquely identified to the system by a so called subchannel,
120where the ESA/390 architecture allows for 64k devices be attached.
121
122Linux, however, was first built on the Intel PC architecture, with its two
123cascaded 8259 programmable interrupt controllers (PICs), that allow for a
124maximum of 15 different interrupt lines. All devices attached to such a system
125share those 15 interrupt levels. Devices attached to the ISA bus system must
126not share interrupt levels (aka. IRQs), as the ISA bus bases on edge triggered
127interrupts. MCA, EISA, PCI and other bus systems base on level triggered
128interrupts, and therewith allow for shared IRQs. However, if multiple devices
129present their hardware status by the same (shared) IRQ, the operating system
130has to call every single device driver registered on this IRQ in order to
131determine the device driver owning the device that raised the interrupt.
132
133In order not to introduce a new I/O concept to the common Linux code,
134Linux/390 preserves the IRQ concept and semantically maps the ESA/390
135subchannels to Linux as IRQs. This allows Linux/390 to support up to 64k
Matt LaPlanted6bc8ac2006-10-03 22:54:15 +0200136different IRQs, uniquely representing a single device each.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700137
138Up to kernel 2.4, Linux/390 used to provide interfaces via the IRQ (subchannel).
139For internal use of the common I/O layer, these are still there. However,
140device drivers should use the new calling interface via the ccw_device only.
141
142During its startup the Linux/390 system checks for peripheral devices. Each
143of those devices is uniquely defined by a so called subchannel by the ESA/390
144channel subsystem. While the subchannel numbers are system generated, each
145subchannel also takes a user defined attribute, the so called device number.
Matt LaPlante84eb8d02006-10-03 22:53:09 +0200146Both subchannel number and device number cannot exceed 65535. During driverfs
Linus Torvalds1da177e2005-04-16 15:20:36 -0700147initialisation, the information about control unit type and device types that
148imply specific I/O commands (channel command words - CCWs) in order to operate
149the device are gathered. Device drivers can retrieve this set of hardware
150information during their initialization step to recognize the devices they
151support using the information saved in the struct ccw_device given to them.
152This methods implies that Linux/390 doesn't require to probe for free (not
153armed) interrupt request lines (IRQs) to drive its devices with. Where
154applicable, the device drivers can use the read_dev_chars() to retrieve device
155characteristics. This can be done without having to request device ownership
156previously.
157
158In order to allow for easy I/O initiation the CDS layer provides a
159ccw_device_start() interface that takes a device specific channel program (one
160or more CCWs) as input sets up the required architecture specific control blocks
161and initiates an I/O request on behalf of the device driver. The
162ccw_device_start() routine allows to specify whether it expects the CDS layer
163to notify the device driver for every interrupt it observes, or with final status
164only. See ccw_device_start() for more details. A device driver must never issue
165ESA/390 I/O commands itself, but must use the Linux/390 CDS interfaces instead.
166
167For long running I/O request to be canceled, the CDS layer provides the
168ccw_device_halt() function. Some devices require to initially issue a HALT
169SUBCHANNEL (HSCH) command without having pending I/O requests. This function is
170also covered by ccw_device_halt().
171
172
173read_dev_chars() - Read Device Characteristics
174
175This routine returns the characteristics for the device specified.
176
Cornelia Huck9b10fe52006-10-18 18:30:55 +0200177The function is meant to be called with the device already enabled; that is,
Linus Torvalds1da177e2005-04-16 15:20:36 -0700178at earliest during set_online() processing.
179
Cornelia Huck9b10fe52006-10-18 18:30:55 +0200180The ccw_device must not be locked prior to calling read_dev_chars().
Linus Torvalds1da177e2005-04-16 15:20:36 -0700181
182The function may be called enabled or disabled.
183
184int read_dev_chars(struct ccw_device *cdev, void **buffer, int length );
185
186cdev - the ccw_device the information is requested for.
187buffer - pointer to a buffer pointer. The buffer pointer itself
188 must contain a valid buffer area.
189length - length of the buffer provided.
190
191The read_dev_chars() function returns :
192
193 0 - successful completion
194-ENODEV - cdev invalid
195-EINVAL - an invalid parameter was detected, or the function was called early.
196-EBUSY - an irrecoverable I/O error occurred or the device is not
197 operational.
198
199
Cornelia Huck9fc14272005-05-01 08:59:00 -0700200read_conf_data(), read_conf_data_lpm() - Read Configuration Data
Linus Torvalds1da177e2005-04-16 15:20:36 -0700201
202Retrieve the device dependent configuration data. Please have a look at your
203device dependent I/O commands for the device specific layout of the node
Cornelia Huck9fc14272005-05-01 08:59:00 -0700204descriptor elements. read_conf_data_lpm() will retrieve the configuration data
205for a specific path.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700206
Cornelia Huck9fc14272005-05-01 08:59:00 -0700207The function is meant to be called with the device already enabled; that is,
Linus Torvalds1da177e2005-04-16 15:20:36 -0700208at earliest during set_online() processing.
209
210The function may be called enabled or disabled, but the device must not be
211locked
212
Cornelia Huck9fc14272005-05-01 08:59:00 -0700213int read_conf_data(struct ccw_device, void **buffer, int *length);
214int read_conf_data_lpm(struct ccw_device, void **buffer, int *length, __u8 lpm);
Linus Torvalds1da177e2005-04-16 15:20:36 -0700215
216cdev - the ccw_device the data is requested for.
217buffer - Pointer to a buffer pointer. The read_conf_data() routine
218 will allocate a buffer and initialize the buffer pointer
219 accordingly. It's the device driver's responsibility to
220 release the kernel memory if no longer needed.
221length - Length of the buffer allocated and retrieved.
222lpm - Logical path mask to be used for retrieving the data. If
223 zero the data is retrieved on the next path available.
224
225The read_conf_data() function returns :
226 0 - Successful completion
227-ENODEV - cdev invalid.
228-EINVAL - An invalid parameter was detected, or the function was called early.
229-EIO - An irrecoverable I/O error occurred or the device is
230 not operational.
231-ENOMEM - The read_conf_data() routine couldn't obtain storage.
232-EOPNOTSUPP - The device doesn't support the read configuration
233 data command.
234
235
236get_ciw() - get command information word
237
238This call enables a device driver to get information about supported commands
239from the extended SenseID data.
240
241struct ciw *
242ccw_device_get_ciw(struct ccw_device *cdev, __u32 cmd);
243
244cdev - The ccw_device for which the command is to be retrieved.
245cmd - The command type to be retrieved.
246
247ccw_device_get_ciw() returns:
248NULL - No extended data available, invalid device or command not found.
249!NULL - The command requested.
250
251
252ccw_device_start() - Initiate I/O Request
253
254The ccw_device_start() routines is the I/O request front-end processor. All
255device driver I/O requests must be issued using this routine. A device driver
256must not issue ESA/390 I/O commands itself. Instead the ccw_device_start()
257routine provides all interfaces required to drive arbitrary devices.
258
259This description also covers the status information passed to the device
260driver's interrupt handler as this is related to the rules (flags) defined
261with the associated I/O request when calling ccw_device_start().
262
263int ccw_device_start(struct ccw_device *cdev,
264 struct ccw1 *cpa,
265 unsigned long intparm,
266 __u8 lpm,
267 unsigned long flags);
Cornelia Huck9fc14272005-05-01 08:59:00 -0700268int ccw_device_start_timeout(struct ccw_device *cdev,
269 struct ccw1 *cpa,
270 unsigned long intparm,
271 __u8 lpm,
272 unsigned long flags,
273 int expires);
274int ccw_device_start_key(struct ccw_device *cdev,
275 struct ccw1 *cpa,
276 unsigned long intparm,
277 __u8 lpm,
278 __u8 key,
279 unsigned long flags);
280int ccw_device_start_key_timeout(struct ccw_device *cdev,
281 struct ccw1 *cpa,
282 unsigned long intparm,
283 __u8 lpm,
284 __u8 key,
285 unsigned long flags,
286 int expires);
Linus Torvalds1da177e2005-04-16 15:20:36 -0700287
288cdev : ccw_device the I/O is destined for
289cpa : logical start address of channel program
290user_intparm : user specific interrupt information; will be presented
291 back to the device driver's interrupt handler. Allows a
292 device driver to associate the interrupt with a
293 particular I/O request.
294lpm : defines the channel path to be used for a specific I/O
295 request. A value of 0 will make cio use the opm.
Cornelia Huck9fc14272005-05-01 08:59:00 -0700296key : the storage key to use for the I/O (useful for operating on a
297 storage with a storage key != default key)
Linus Torvalds1da177e2005-04-16 15:20:36 -0700298flag : defines the action to be performed for I/O processing
Cornelia Huck9fc14272005-05-01 08:59:00 -0700299expires : timeout value in jiffies. The common I/O layer will terminate
300 the running program after this and call the interrupt handler
301 with ERR_PTR(-ETIMEDOUT) as irb.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700302
303Possible flag values are :
304
305DOIO_ALLOW_SUSPEND - channel program may become suspended
306DOIO_DENY_PREFETCH - don't allow for CCW prefetch; usually
307 this implies the channel program might
308 become modified
309DOIO_SUPPRESS_INTER - don't call the handler on intermediate status
310
311The cpa parameter points to the first format 1 CCW of a channel program :
312
313struct ccw1 {
314 __u8 cmd_code;/* command code */
315 __u8 flags; /* flags, like IDA addressing, etc. */
316 __u16 count; /* byte count */
317 __u32 cda; /* data address */
318} __attribute__ ((packed,aligned(8)));
319
320with the following CCW flags values defined :
321
322CCW_FLAG_DC - data chaining
323CCW_FLAG_CC - command chaining
Matt LaPlante2fe0ae72006-10-03 22:50:39 +0200324CCW_FLAG_SLI - suppress incorrect length
Linus Torvalds1da177e2005-04-16 15:20:36 -0700325CCW_FLAG_SKIP - skip
326CCW_FLAG_PCI - PCI
327CCW_FLAG_IDA - indirect addressing
328CCW_FLAG_SUSPEND - suspend
329
330
331Via ccw_device_set_options(), the device driver may specify the following
332options for the device:
333
334DOIO_EARLY_NOTIFICATION - allow for early interrupt notification
335DOIO_REPORT_ALL - report all interrupt conditions
336
337
338The ccw_device_start() function returns :
339
340 0 - successful completion or request successfully initiated
341-EBUSY - The device is currently processing a previous I/O request, or ther is
342 a status pending at the device.
343-ENODEV - cdev is invalid, the device is not operational or the ccw_device is
344 not online.
345
346When the I/O request completes, the CDS first level interrupt handler will
Matt LaPlante3f6dee92006-10-03 22:45:33 +0200347accumulate the status in a struct irb and then call the device interrupt handler.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700348The intparm field will contain the value the device driver has associated with a
349particular I/O request. If a pending device status was recognized,
350intparm will be set to 0 (zero). This may happen during I/O initiation or delayed
351by an alert status notification. In any case this status is not related to the
352current (last) I/O request. In case of a delayed status notification no special
353interrupt will be presented to indicate I/O completion as the I/O request was
354never started, even though ccw_device_start() returned with successful completion.
355
Cornelia Huck9fc14272005-05-01 08:59:00 -0700356The irb may contain an error value, and the device driver should check for this
357first:
358
359-ETIMEDOUT: the common I/O layer terminated the request after the specified
360 timeout value
361-EIO: the common I/O layer terminated the request due to an error state
362
Linus Torvalds1da177e2005-04-16 15:20:36 -0700363If the concurrent sense flag in the extended status word in the irb is set, the
364field irb->scsw.count describes the numer of device specific sense bytes
365available in the extended control word irb->scsw.ecw[0]. No device sensing by
366the device driver itself is required.
367
368The device interrupt handler can use the following definitions to investigate
369the primary unit check source coded in sense byte 0 :
370
371SNS0_CMD_REJECT 0x80
372SNS0_INTERVENTION_REQ 0x40
373SNS0_BUS_OUT_CHECK 0x20
374SNS0_EQUIPMENT_CHECK 0x10
375SNS0_DATA_CHECK 0x08
376SNS0_OVERRUN 0x04
377SNS0_INCOMPL_DOMAIN 0x01
378
379Depending on the device status, multiple of those values may be set together.
380Please refer to the device specific documentation for details.
381
382The irb->scsw.cstat field provides the (accumulated) subchannel status :
383
384SCHN_STAT_PCI - program controlled interrupt
385SCHN_STAT_INCORR_LEN - incorrect length
386SCHN_STAT_PROG_CHECK - program check
387SCHN_STAT_PROT_CHECK - protection check
388SCHN_STAT_CHN_DATA_CHK - channel data check
389SCHN_STAT_CHN_CTRL_CHK - channel control check
390SCHN_STAT_INTF_CTRL_CHK - interface control check
391SCHN_STAT_CHAIN_CHECK - chaining check
392
393The irb->scsw.dstat field provides the (accumulated) device status :
394
395DEV_STAT_ATTENTION - attention
396DEV_STAT_STAT_MOD - status modifier
397DEV_STAT_CU_END - control unit end
398DEV_STAT_BUSY - busy
399DEV_STAT_CHN_END - channel end
400DEV_STAT_DEV_END - device end
401DEV_STAT_UNIT_CHECK - unit check
402DEV_STAT_UNIT_EXCEP - unit exception
403
404Please see the ESA/390 Principles of Operation manual for details on the
405individual flag meanings.
406
407Usage Notes :
408
Cornelia Huck9b10fe52006-10-18 18:30:55 +0200409ccw_device_start() must be called disabled and with the ccw device lock held.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700410
411The device driver is allowed to issue the next ccw_device_start() call from
412within its interrupt handler already. It is not required to schedule a
Matt LaPlantefff92892006-10-03 22:47:42 +0200413bottom-half, unless an non deterministically long running error recovery procedure
Linus Torvalds1da177e2005-04-16 15:20:36 -0700414or similar needs to be scheduled. During I/O processing the Linux/390 generic
415I/O device driver support has already obtained the IRQ lock, i.e. the handler
416must not try to obtain it again when calling ccw_device_start() or we end in a
417deadlock situation!
418
419If a device driver relies on an I/O request to be completed prior to start the
420next it can reduce I/O processing overhead by chaining a NoOp I/O command
421CCW_CMD_NOOP to the end of the submitted CCW chain. This will force Channel-End
422and Device-End status to be presented together, with a single interrupt.
423However, this should be used with care as it implies the channel will remain
424busy, not being able to process I/O requests for other devices on the same
425channel. Therefore e.g. read commands should never use this technique, as the
426result will be presented by a single interrupt anyway.
427
428In order to minimize I/O overhead, a device driver should use the
429DOIO_REPORT_ALL only if the device can report intermediate interrupt
430information prior to device-end the device driver urgently relies on. In this
431case all I/O interruptions are presented to the device driver until final
432status is recognized.
433
434If a device is able to recover from asynchronosly presented I/O errors, it can
435perform overlapping I/O using the DOIO_EARLY_NOTIFICATION flag. While some
436devices always report channel-end and device-end together, with a single
437interrupt, others present primary status (channel-end) when the channel is
438ready for the next I/O request and secondary status (device-end) when the data
439transmission has been completed at the device.
440
441Above flag allows to exploit this feature, e.g. for communication devices that
442can handle lost data on the network to allow for enhanced I/O processing.
443
444Unless the channel subsystem at any time presents a secondary status interrupt,
445exploiting this feature will cause only primary status interrupts to be
446presented to the device driver while overlapping I/O is performed. When a
447secondary status without error (alert status) is presented, this indicates
448successful completion for all overlapping ccw_device_start() requests that have
449been issued since the last secondary (final) status.
450
451Channel programs that intend to set the suspend flag on a channel command word
452(CCW) must start the I/O operation with the DOIO_ALLOW_SUSPEND option or the
453suspend flag will cause a channel program check. At the time the channel program
454becomes suspended an intermediate interrupt will be generated by the channel
455subsystem.
456
457ccw_device_resume() - Resume Channel Program Execution
458
459If a device driver chooses to suspend the current channel program execution by
460setting the CCW suspend flag on a particular CCW, the channel program execution
461is suspended. In order to resume channel program execution the CIO layer
462provides the ccw_device_resume() routine.
463
464int ccw_device_resume(struct ccw_device *cdev);
465
466cdev - ccw_device the resume operation is requested for
467
Cornelia Huck9b10fe52006-10-18 18:30:55 +0200468The ccw_device_resume() function returns:
Linus Torvalds1da177e2005-04-16 15:20:36 -0700469
470 0 - suspended channel program is resumed
471-EBUSY - status pending
472-ENODEV - cdev invalid or not-operational subchannel
473-EINVAL - resume function not applicable
474-ENOTCONN - there is no I/O request pending for completion
475
476Usage Notes:
477Please have a look at the ccw_device_start() usage notes for more details on
478suspended channel programs.
479
480ccw_device_halt() - Halt I/O Request Processing
481
482Sometimes a device driver might need a possibility to stop the processing of
483a long-running channel program or the device might require to initially issue
484a halt subchannel (HSCH) I/O command. For those purposes the ccw_device_halt()
485command is provided.
486
Cornelia Huck9b10fe52006-10-18 18:30:55 +0200487ccw_device_halt() must be called disabled and with the ccw device lock held.
488
Linus Torvalds1da177e2005-04-16 15:20:36 -0700489int ccw_device_halt(struct ccw_device *cdev,
490 unsigned long intparm);
491
492cdev : ccw_device the halt operation is requested for
493intparm : interruption parameter; value is only used if no I/O
494 is outstanding, otherwise the intparm associated with
495 the I/O request is returned
496
497The ccw_device_halt() function returns :
498
Cornelia Huck9b10fe52006-10-18 18:30:55 +0200499 0 - request successfully initiated
Linus Torvalds1da177e2005-04-16 15:20:36 -0700500-EBUSY - the device is currently busy, or status pending.
501-ENODEV - cdev invalid.
502-EINVAL - The device is not operational or the ccw device is not online.
503
504Usage Notes :
505
506A device driver may write a never-ending channel program by writing a channel
507program that at its end loops back to its beginning by means of a transfer in
508channel (TIC) command (CCW_CMD_TIC). Usually this is performed by network
509device drivers by setting the PCI CCW flag (CCW_FLAG_PCI). Once this CCW is
510executed a program controlled interrupt (PCI) is generated. The device driver
511can then perform an appropriate action. Prior to interrupt of an outstanding
512read to a network device (with or without PCI flag) a ccw_device_halt()
513is required to end the pending operation.
514
Cornelia Huck9b10fe52006-10-18 18:30:55 +0200515ccw_device_clear() - Terminage I/O Request Processing
516
517In order to terminate all I/O processing at the subchannel, the clear subchannel
518(CSCH) command is used. It can be issued via ccw_device_clear().
519
520ccw_device_clear() must be called disabled and with the ccw device lock held.
521
522int ccw_device_clear(struct ccw_device *cdev, unsigned long intparm);
523
524cdev: ccw_device the clear operation is requested for
525intparm: interruption parameter (see ccw_device_halt())
526
527The ccw_device_clear() function returns:
528
529 0 - request successfully initiated
530-ENODEV - cdev invalid
531-EINVAL - The device is not operational or the ccw device is not online.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700532
533Miscellaneous Support Routines
534
535This chapter describes various routines to be used in a Linux/390 device
536driver programming environment.
537
538get_ccwdev_lock()
539
540Get the address of the device specific lock. This is then used in
541spin_lock() / spin_unlock() calls.
542
543
544__u8 ccw_device_get_path_mask(struct ccw_device *cdev);
545
546Get the mask of the path currently available for cdev.