Linus Torvalds | 1da177e | 2005-04-16 15:20:36 -0700 | [diff] [blame] | 1 | On some platforms, so-called memory-mapped I/O is weakly ordered. On such |
| 2 | platforms, driver writers are responsible for ensuring that I/O writes to |
| 3 | memory-mapped addresses on their device arrive in the order intended. This is |
| 4 | typically done by reading a 'safe' device or bridge register, causing the I/O |
| 5 | chipset to flush pending writes to the device before any reads are posted. A |
| 6 | driver would usually use this technique immediately prior to the exit of a |
| 7 | critical section of code protected by spinlocks. This would ensure that |
| 8 | subsequent writes to I/O space arrived only after all prior writes (much like a |
| 9 | memory barrier op, mb(), only with respect to I/O). |
| 10 | |
| 11 | A more concrete example from a hypothetical device driver: |
| 12 | |
| 13 | ... |
| 14 | CPU A: spin_lock_irqsave(&dev_lock, flags) |
| 15 | CPU A: val = readl(my_status); |
| 16 | CPU A: ... |
| 17 | CPU A: writel(newval, ring_ptr); |
| 18 | CPU A: spin_unlock_irqrestore(&dev_lock, flags) |
| 19 | ... |
| 20 | CPU B: spin_lock_irqsave(&dev_lock, flags) |
| 21 | CPU B: val = readl(my_status); |
| 22 | CPU B: ... |
| 23 | CPU B: writel(newval2, ring_ptr); |
| 24 | CPU B: spin_unlock_irqrestore(&dev_lock, flags) |
| 25 | ... |
| 26 | |
| 27 | In the case above, the device may receive newval2 before it receives newval, |
| 28 | which could cause problems. Fixing it is easy enough though: |
| 29 | |
| 30 | ... |
| 31 | CPU A: spin_lock_irqsave(&dev_lock, flags) |
| 32 | CPU A: val = readl(my_status); |
| 33 | CPU A: ... |
| 34 | CPU A: writel(newval, ring_ptr); |
| 35 | CPU A: (void)readl(safe_register); /* maybe a config register? */ |
| 36 | CPU A: spin_unlock_irqrestore(&dev_lock, flags) |
| 37 | ... |
| 38 | CPU B: spin_lock_irqsave(&dev_lock, flags) |
| 39 | CPU B: val = readl(my_status); |
| 40 | CPU B: ... |
| 41 | CPU B: writel(newval2, ring_ptr); |
| 42 | CPU B: (void)readl(safe_register); /* maybe a config register? */ |
| 43 | CPU B: spin_unlock_irqrestore(&dev_lock, flags) |
| 44 | |
| 45 | Here, the reads from safe_register will cause the I/O chipset to flush any |
| 46 | pending writes before actually posting the read to the chipset, preventing |
| 47 | possible data corruption. |