Count key data explained

Count key data should not be confused with Count data.

Count key data (CKD) is a direct-access storage device (DASD) data recording format introduced in 1964, by IBM with its IBM System/360 and still being emulated on IBM mainframes. It is a self-defining format with each data record represented by a Count Area that identifies the record and provides the number of bytes in an optional Key Area and an optional Data Area. This is in contrast to devices using fixed sector size or a separate format track.

Count key data (CKD) also refers to the set of channel commands (collectively Channel Command Words, CCWs) that are generated by an IBM mainframe for execution by a DASD subsystem employing the CKD recording format. The initial set of CKD CCWs, introduced in 1964, was substantially enhanced and improved into the 1990s.

CKD track format

The reason for CKD track format is to allow data field lengths to vary, each recorded block of data on a DASD track, called a record has an associated count field which identifies the record and indicates the size of the key, if used (user-defined up to 255 bytes), and the size of the data area, if used.[1] The count field has the identification of the record in cylinder-head-record format, the length of the key, and the length of the data. The key may be omitted or consist of a string of characters.

"The beginning of a track is signalled when the index marker (index point) is detected. ... The marker is automatically recognized by a special sensing device."[2] Following the index marker is the home address, which indicates the location of this track on the disk, and contains other control information internal to the control unit. A fixed-length gap follows the home address. Next, each track contains a Record 0 (R0), the track descriptor record, which is "designed to enable the entire content of a track to be moved to alternate tracks if a portion of the primary track becomes defective." Following R0 are the data records, separated by gaps.

Because of the gaps and other information, the recorded space is larger than that required for just the count data, key data, or user data. IBM provides a "reference card" for each device, which can be used to compute the number of records per track for various key and data field sizes, and to optimize the capacity of the device.[3] Later, programs were written to do these calculations. Because records are normally not split between tracks, specification of an incorrect record size create problems.

Most often, the key is omitted and the record is located sequentially or by direct cylinder-head-record addressing. If it is present, the key is any data used to find the record, usually using the Search Key Equal or Search Key High or Equal CCW. The key (and hence the record) is locatable via hardware commands.[4] Since the introduction of IBM's System/360 in 1964, nearly all IBM large and intermediate system DASDs have used the count key data record format.[5]

The advantages of count key data record format are:

Reduced CPU and memory prices and higher device and interface speeds have somewhat nullified the advantages of CKD, and it is retained only because IBM's flagship operating system z/OS does not support sector-oriented interfaces.

Originally CKD records had a one-to-one correspondence to a physical track of a DASD device; however over time the records have become more and more virtualized such that in modern IBM mainframes there is no longer a direct correspondence between a CKD record ID and the physical layout of a track.

IBM's CKD DASD subsystems

Packaging

See also: History of IBM CKD Controllers. Initially there was a high degree of correspondence between the logical view of DASD accesses and the actual hardware, as shown in the illustration. Three digit labels were typically affixed to identify the address of channel, control unit and device.

On low end systems the Channel and the Control Unit were frequently physically integrated but remained logically separate. IBM's New Attachment Strategy[6] beginning with the 3830 Model 2 in 1972 physically separated the SCU into two physical entities, a director and a controller while keeping them logically the same. The controller handles the CKD track formatting and is packaged with the first drive or drives in a string of drives and having a model number with the letter "A" as a prefix, an "A-Unit" (or "A-Box") as in 3350 Model A2 containing a controller and two DASDs. DASD without a controller, that is B-Units, have a "B" prefix in their model number.

CKD subsystems and directors were offered by IBM and plug compatible competitors until at least 1996 (2301 to 3390 Model 9);[7] in total 22 unique DASD offered by IBM configured in at least 35 different subsystem configurations. Plug-compatible offered many of the same DASD including 4 CKD subsystems featuring unique DASD.

Programming

See also: Channel I/O. Access to specific classes of I/O devices by an IBM mainframe is under the control of Channel Command Words (CCWs), some of which are generic (e.g. No Operation) but many of which are specific to the type of I/O device (e.g. Read Backwards for a tape drive). The group of CCWs defined by IBM for DASD fall into five broad categories:

CKD CCWs are the specific set of CCWs used to access CKD DASD subsystems. This is in contrast to fixed block architecture (FBA) CCWs which are used to access FBA DASD subsystems.

CKD DASD are addressed like other Input/Output devices; for System/360 and System/370 DASD are addressed directly, through channels and the associated control units (SCU or Storage Control Unit), initially using three hexadecimal digits, one for channel and two for control unit and device, providing addressing for up to 16 channels, for up to 256 DASD access mechanisms/channel and 4,096 DASD addresses total. Modern IBM mainframes use four hexadecimal digits as an arbitrary subchannel number within a channel subsystem subset, whose definition includes the actual channels, control units and device, providing addressing for up to 65,536 DASD per channel subsystem subset. In practice, physical and design constraints of the channel and of the controllers limited the maximum number of attached DASD attachable to a system to a smaller amount than the number that could be addressed.

Initial CKD feature set

The initial feature set provided by IBM with its 1964 introduction of the CKD track format and associated CCWs included: .

A Scan feature set was also provided but not continued into future CKD subsystems beyond the 2314.

Forty one CCWs implemented the feature set:

IBM S/360 DASD Channel Commands
Command ClassCommand‡230123022303
7320
231123212314
2319[9]
MT
Off
MT
On †
Count Length
ControlNo OpSSSSSS03
SeekSSSSSS076
Seek CylinderSSSSSS0B6
Seek HeadSSSSSS1B6
Set File MaskSSSSSS1F1
Space CountSSSSSS0F3
RecalibrateSS13Not zero
RestoreS17Not zero
SenseSense I/OSSSSSS046
Release DeviceOOOOOO946
Reserve DeviceOOOOOOB46
SearchHome Address EQSSSSSS39B94 (usually)
Identifier EQSSSSSS31B15 (usually)
Identifier HISSSSSS51D15 (usually)
Identifier EQ or HISSSSSS71FI5 (usually)
Key EQSSSSSS29A91 to 255
Key HISSSSSS49C91 to 255
Key EQ or HISSSSSS69E91 to 255
Key & Data EQOOOS2DADSee Note 2
Key & Data HIOOOS4DCDSee Note 2
Key & Data EQ or HIOOOS6DEDSee Note 2
Continue Scan
(see Note 1)  
Search EQOOOS25A5See Note 2
Search HIOOOS45C5See Note 2
Search HI or EQOOOS65E5See Note 2
Set CompareOOOS35B5See Note 2
Set CompareOOOS75F5See Note 2
No CompareOOOS55D5See Note 2
ReadHome AddressSSSSSS1A9A5
CountSSSSSS12928
Record 0SSSSSS1696Number of bytes transferred
DataSSSSSS0686
Key & DataSSSSSS0E8E
Count. Key & DataSSSSSS1E9E
IPLSSSSSS02
WriteHome AddressSSSSSS195 (usually)
Record 0SSSSSS158*KL*DL of RO
Count, Key & DataSSSSSS1D8+KL+DL
Special Count, Key & DataSSSSSS018+KL+DL
DataSSSSSS05DL
Key & DataSSSSSS0DKL*DL
EraseSSSSSS118*KL*DL
Total CCWs41303930404040

Notes:

O = optional feature

S = standard feature

MT = multitrack: when supported CCW will continue to operate on next heads in sequence to end of cylinder

‡ = TIC (Transfer In Channel) and other standard commands not shown.

† = code same as MT Off except as listed

1. File Scan Feature (9 CCWs) only available on 2841 for 2302, 2311 and 2321; they were not available on subsequent DASD controllers for DASD later than 2314.

2. Count is number of bytes in search argument, including mask bytes

The CCWs were initially executed by two types of SCU attached to the system's high speed Selector Channels. The 2820 SCU controlled the 2301 Drum while the 2841 SCU controlled combinations of the 2302 Disk Storage, 2311 Disk Drive, 2321 Data Cell and/or 7320 Drum Storage. IBM quickly replaced the 7320 with the faster and larger 2303.

Subsequently, the feature set was implemented on the 2314 family of storage controls and an integrated attachment of the System 370 Model 25.

The following example of a channel program reads a disk record identified by a Key field. The track containing the record and the desired value of the key is known. The SCU will search the track to find the requested record. In this example <> indicate that the channel program contains the storage address of the specified field.

  SEEK             <cylinder/head number>
  SEARCH KEY EQUAL <key value>
  TIC              *-8 Back to search if not equal
  READ DATA        <buffer> 

The TIC (transfer in channel) will cause the channel program to branch to the SEARCH command until a record with a matching key (or the end of the track) is encountered. When a record with a matching key is found the SCU will include Status Modifier in the channel status, causing the channel to skip the TIC CCW; thus the channel program will not branch and the channel will execute the READ command.

Block multiplexer channel enhancements

The block multiplexor channel was introduced beginning in 1971 on some high end System/360 systems along with the 2835 Control Unit and associated 2305 DASD, This channel was then standard on IBM System/370 and subsequent mainframes; when contrasted to the prior Selector channel it offered performance improvements for high speed devices such as DASD, including:

Multiple Requesting

Allowed multiple channel programs,to be simultaneously active in the facility as opposed to only one with a Selector channel. The actual number of subchannels provided depends upon the system model and its configuration.[10] Sometimes described as disconnected command chaining, the control unit could disconnect at various times during a chained set of CCWs, for example, disconnection for a Seek CCW, freeing the channel for another subchannel.

Command Retry

The channel and storage control under certain conditions can inter-operate to cause a CCW to be retried without an I/O interruption.This procedure is initiated by the storage control and used to recover from correctable errors.

Rotational Position Sensing

Rotational position sensing (RPS) was implemented with two new CCWs, SET SECTOR and READ SECTOR enabled the channel to delay command chaining until the disk rotated to a specified angular track position. RPS permits channel disconnection during most of the rotational delay period and thus contributes to increased channel utilization. The control unit implements RPS by dividing each track into equal angular segments.

Example Channel Program

The following example channel program will format a track with an R0 and three CKD records.

  SEEK             <cylinder/head number>
  SET FILE MASK    <allow write operations>
  SET SECTOR       <sector number=0>
  WRITE R0         <cylinder/head/R0, key length=0, data length=6>
  WRITE CKD        <cylinder/head/R1, key length, data length>
  WRITE CKD        <cylinder/head/R2, key length, data length>
  WRITE CKD        <cylinder/head/R3, key length, data length>  
In this example the Record 0 conforms to IBM programming standards. With a block multiplexer channel the channel is free during the time the DASD is seeking and again while the disk rotates to beginning of the track. A selector channel would be busy for the entire duration of this sample program.

Defect skipping

Defect skipping allows data to be written before and after one of more surface defects allowing all of a track to be used except for that portion that has the defect. This also eliminates the time that was formerly required to seek to an alternate track.[11] Only a limited number of defects could be skipped so alternate tracks remained supported for those tracks with excess defects.

Defect skipping was introduced in 1974 with the 3340 attached via the 3830 Model 2 Storage Control Unit[11] or integrated attachments on small systems. Defect skipping was essentially a factory only feature until 1981 when CCWs for management along with associated utilities were released.[12]

Dynamic paths

First introduced with the 3380 DASD on the 3880 Storage Control Unit in 1981 the feature was included with the later CKD DASD subsystems. The dynamic path selection function controls operation of the two controllers, including simultaneous data transfer over the two paths. When supported by the operating system, each controller can serve as an alternate path in the event the other controller is unavailable.

Three additional commands, Set Path Group ID, Sense Path Group ID, andSuspend Multipath Reconnection, are used to support attachment of the3380 Models having two controllers at the head of a string.

The Set Path Group ID command, with the dynamic path selection (DPS)function, provides greater flexibility in operations on reserved devices.Once a path group for a device has been established, it may be accessedover any path which is a member of the group to which it is reserved. Inaddition, on 370-XA systems which set the multipath mode bit in thefunction control byte (byte 0) to a 1, block multiplex reconnections willoccur on the first available path which is a member of the group over whichthe channel program was initiated (regardless of the reservation state of thedevice).

If the controller designated in the I/O address is busy or disabled, the dynamic path selection allows an alternatepath to the device to be established via another storagedirector and the other controller in the model AA.

Nonsynchronous operation

Prior to the 1981 introduction of the 3880 director, CKD records were synchronously accessed, all activities required that one CCW be ended and the next initiated in the gaps between the CKD fields. The gap size placed limitations on cable length but did provide for very high performance since complex chains of CCWs could be performed by the subsystem in real time without use of CPU memory or cycles.

Nonsynchronous operation provided by the Extended CKD ("ECKD") set of CCWs removed the gap timing constraint. The five additional ECKD CCWs are Define Extent, Locate Record, Write Update Data, Write Update Key and Data, and Write CKD Next Track.

In nonsynchronous operation, the transfer of data between the channel and the storage control is not synchronized with the transfer of data between the storage control and the device. Channel programs can be executed such that channel and storage control activities required to end execution of one command and advance to the next do not have to occur during the inter-record gap between two adjacent fields. An intermediate buffer in the storage control allows independent operations between the channel and the device. A major advantage of ECKDs is far longer cables; depending upon application it may improve performance.

ECKD CCWs are supported on all subsequent CKD subsystems.

This example nonsynchronous channel program reads records R1 and R2 from track X'0E' in cylinder X'007F'. Both records have a key length of 8 and a data length of X'64' (10010) bytes.

  Define Extent       <extent= X'007F 0000' through track X'0081 000E'>
  Locate Record       <cylinder = X'007F', head = X'000E'
  Read Key and Data   <key record = X'001038'>
  Read Data           <record = X'001108'>

Caching

Caching first introduced in DASD CKD subsystems by Memorex[13] (1978) and StorageTek (1981) was subsequently introduced in late 1981 by IBM on the 3880 Model 13 for models of the 3380 with dynamic pathing.

The cache is dynamically managed by an algorithm; high activity data is accessed from the high-performance cache and low activity data is accessed from less-expensive DASD storage. A large memory in the Director, the cache, is divided into track slots that store data from the 3380 tracks. A smaller area is a directory that contains entries that allow data to be located in the cache.

Caches were also provided on subsequently introduced storage controls.

Other extensions

Over time a number of path control, diagnostic and/or error recovery CCWs were implemented on one or more storage controls. For example:

Beyond System/370

Reduced CPU and memory prices and higher device and interface speeds have somewhat nullified the advantages of CKD, and support continues by IBM to this date because its flagship operating system z/OS continues to use CKD CCWs for many functions.

Originally CKD records had a one-to-one correspondence to a physical track of a DASD device; however over time the records have become more and more virtualized such that in a modern IBM mainframe there is no longer a direct correspondence between the a CKD record ID and a physical layout of a track. An IBM mainframe constructs CKD track images in memory and executes the ECKD and CKD channel programs against the image. To bridge between the native fixed block sized disks and the variable length ECKD/CKD record format, the CKD track images in memory are mapped onto a series of fixed blocks suitable for transfer to and from an FBA disk subsystem.

Of the 83 CKD CCWs implemented for System/360 and System/370 channels 56 are emulated on System/390 and later systems.

See also

Further reading

Notes and References

  1. Web site: Count key data . . IBM Knowledge Center . IBM . 6 August 2014.
  2. Book: IBM Corporation . IBM System/360 Component Descriptions 2314 Direct Access Storage Facility and 2844 Auxiliary Storage Control . September 1969 . Dec 5, 2019 . March 30, 2020 . https://web.archive.org/web/20200330172847/http://bitsavers.org/pdf/ibm/dasd/2314/A26-3599-4_2314_Sep69.pdf . dead .
  3. Book: IBM Corporation . 3330 Series Disk Storage 3333 Models 1 and 11 3330 Models 1, 2, and 11 Reference Summary . November 1973 . Dec 5, 2019.
  4. Book: Houtekamer . Gilbert E. . Artis . H. Pat . 1993 . MVS I/O Subsystems: Configuration Management and Performance Analysis . New York . McGraw-Hill . 978-0-07-002553-0 . 26096983.
  5. Book: . Introduction to Nonsynchronous Direct Access Storage Subsystems . Synchronous DASD Operations . International Business Machines Corporation . January 1990 . GC46–4519–0.
  6. Web site: Historical Narrative of the 1970s, US v IBM, Exhibit 14971 . July 1980 . 1051.
  7. Web site: Direct Access Storage * 22.7GB, 12 actuators . https://web.archive.org/web/20151222142233/http://www-01.ibm.com/common/ssi/ShowDoc.wss?docURL=/common/ssi/rep_oc/c/877/ENUS3390-B2C/index.html&lang=en&request_locale=en . December 22, 2015 . dead.
  8. I/O Subsystem Architecture . J. Buzen . . June 1975 . 63 . 6 . 871. 10.1109/PROC.1975.9852 . 68000 .
  9. Book: IBM System/360 Component Descriptions 2314 Direct Access Storage Facility and 2844 Auxiliary Storage Control . GA26-3599-6 . November 1971 . Seventh . cs2.
  10. Web site: Input/Output - A White Paper . J. Kettner . IBM . November 2007 . https://web.archive.org/web/20160304113124/http://idcp.marist.edu/pdfs/ztidbitz/An_IO_WhitePaperForZ.pdf . March 4, 2016 . dead.
  11. Book: Reference Manual for 3830 Model 1. March 1974 .
  12. Web site: Device Support Facilities, User's Guide and Reference. Release 4.0 . May 1981 . vi,46,61,87.
  13. Now Memorex fills the gap in your system's performance . Datamation . August 1978 . 85–86.