CA2304620C - Communication processor having buffer list modifier control bits - Google Patents

Communication processor having buffer list modifier control bits

Info

Publication number
CA2304620C
Authority
CA
Canada
Prior art keywords
frame
transmit
receive
frames
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CA002304620A
Other languages
French (fr)
Other versions
CA2304620A1 (en
Inventor
Bradley Roach
Peter Fiacco
Greg Scherer
Stuart Berman
David Duckman
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Emulex Corp
Original Assignee
Emulex Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Emulex Corp
Publication of CA2304620A1
Application granted
Publication of CA2304620C
Anticipated expiration
Expired - Fee Related

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00 Data switching networks
    • H04L49/00 Packet switching elements
    • H04L49/90 Buffering arrangements
    • H04L49/9084 Reactions to storage capacity overflow
    • H04L49/9089 Reactions to storage capacity overflow replacing packets in a storage arrangement, e.g. pushout
    • H04L47/00 Traffic control in data switching networks
    • H04L47/10 Flow control; Congestion control
    • H04L49/901 Buffering arrangements using storage descriptor, e.g. read or write pointers
    • H04L49/9047 Buffering arrangements including multiple buffers, e.g. buffer pools
    • H04L49/9052 Buffering arrangements including multiple buffers, e.g. buffer pools with buffers of different sizes
    • H04L49/9094 Arrangements for simultaneous transmit and receive, e.g. simultaneous reading/writing from/to the storage element
    • H04L9/00 Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols
    • H04L9/40 Network security protocols
    • H04L69/00 Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/12 Protocol engines

Abstract

A communication processor sends and receives frames of data and commands. A transmit and receive protocol engine is controlled by host driver software, which uses predetermined bits to indicate which frame is the last frame in a series of frames. This information is then placed in the transmit frame before it is sent.

Description

COMMUNICATION PROCESSOR HAVING BUFFER LIST MODIFIER CONTROL BITS
BACKGROUND OF THE INVENTION
1. Field of the Invention
This invention relates to devices for transferring data in computer networks, and more particularly to a device utilizing control bits to facilitate generating and transmitting frames of data across a computer network boundary.
2. Description of Related Art
The number of computers and peripherals has mushroomed in recent years. This has created a need for improved methods of interconnecting these devices. A wide variety of networking paradigms have been developed to enable different kinds of computers and peripheral components to communicate with each other.
There exists a bottleneck in the speed with which data can be exchanged along such networks. This is not surprising because increases in network architecture speeds have not kept pace with faster computer processing speeds. The processing power of computer chips has historically doubled about every 18 months, creating increasingly powerful machines and bandwidth-hungry applications. It has been estimated that one megabit per second of input/output is generally required per "MIPS" (millions of instructions per second) of processing power. With CPUs now easily exceeding 200 MIPS, it is difficult for network architecture to keep up with these faster speeds.
Area-wide networks (e.g., LANs and WANs) and channels are two approaches that have been developed for computer network architectures. Traditional networks offer a great deal of flexibility and relatively long distance capabilities. Channels, such as Enterprise System Connection (ESCON) and Small Computer System Interface (SCSI), have been developed for high performance and high reliability. Channels typically use dedicated short-distance connections between computers or between computers and peripherals.

Features of both channels and networks have been incorporated into a new network standard known as "Fibre Channel". Fibre Channel systems combine the speed and reliability of channels with the flexibility and connectivity of networks. Fibre Channel products currently can run at very high data rates, such as 266 or 1062 Mbps. These speeds are sufficient to handle quite demanding applications such as uncompressed, full motion, high-quality video.
There are generally three ways to deploy Fibre Channel: simple point-to-point connections; arbitrated loops; and switched fabrics. The simplest topology is the point-to-point configuration, which simply connects any two Fibre Channel systems directly. Arbitrated loops are Fibre Channel ring connections that provide shared access to bandwidth via arbitration. Switched Fibre Channel networks, called "fabrics", yield the highest performance by leveraging the benefits of cross-point switching.
The Fibre Channel fabric works something like a traditional phone system. The fabric can connect varied devices such as work stations, PCs, servers, routers, mainframes, and storage devices that have Fibre Channel interface ports. Each such device can have an origination port that "calls" the fabric by entering the address of a destination port in a frame header. The Fibre Channel specification defines the structure of this frame. (This frame structure raises data transfer issues that will be discussed below and addressed by the present invention.) The Fibre Channel fabric does all the work of setting up the desired connection, hence the frame originator does not need to be concerned with complex routing algorithms. There are no complicated permanent virtual circuits (PVCs) to set up. Fibre Channel fabrics can handle more than 16 million addresses, and so are capable of accommodating very large networks. The fabric can be enlarged by simply adding ports. The aggregate data rate of a fully configured Fibre Channel network can be in the terabit-per-second range.
Each of the three basic types of Fibre Channel connections is shown in FIGURE 1, which shows a number of ways of using Fibre Channel technology. In particular, point-to-point connections 10 are shown connecting mainframes to each other. A Fibre Channel arbitrated loop 11 is shown connecting disk storage units. A Fibre Channel switch fabric 12 connects work stations 13, mainframes 14, servers 15, disk drives 16 and local area networks (LANs) 17. The LANs include, for example, Ethernet, Token Ring and FDDI networks.
An ANSI specification (X3.230-1994) defines the Fibre Channel network. The specification distributes Fibre Channel functions among five layers. As shown in FIGURE 2, the five functional layers of the Fibre Channel are: FC-0 - the physical media layer; FC-1 - the coding and decoding layer; FC-2 - the actual transport mechanism, including the framing protocol and flow control between nodes; FC-3 - the common services layer; and FC-4 - the upper layer protocol.
While the Fibre Channel operates at relatively high speed, it would be desirable to increase speeds further to meet the needs of faster processors. One way to do this would be to eliminate, or reduce, delays that occur at interface points. One such delay occurs during the transfer of a frame from the FC-1 layer to the FC-2 layer. At this interface, devices linked by a Fibre Channel data link receive Fibre Channel frames serially. A protocol engine receives these frames and processes them at the next layer, the FC-2 layer shown in FIGURE 2. The functions of the protocol engine include validating each frame; queuing up DMA operations to transfer each frame to the host; and building transmit frames.
The high bit speeds of the Fibre Channel data link place extreme demands on the protocol engine. Hence, some protocol engines can only operate in half-duplex mode, which means that the protocol engine can process data in only one direction at a time. This significantly slows the data transfer, since either the transmit or the receive task must wait while the other task is performed.
Full-duplex protocol engines can process both received and transmitted frames simultaneously. Hence full-duplex protocol engines significantly improve data throughput. However, in full-duplex protocol engines, usually a microprocessor with local RAM handles the transmit and receive operations. The use of a microprocessor for this function greatly increases the cost of the protocol engine.
Conventional protocol engines also sometimes rely on the involvement of a host CPU on a frame-by-frame basis. For example, validation of received frames and generation of acknowledgments to received frames typically involve the host CPU. Involving the host CPU limits frame transmission and reception rates and prevents the host CPU from performing other tasks.
Moreover, a transmit protocol engine must have advance notice of the frame payload data size in order to build a transmit frame "header". One way to accomplish this is for the transmit protocol engine to access computer memory where a series of frames have been stored and modify the header field in the last frame. However, if the transmit protocol engine cannot determine whether or not the current frame is the final frame before transferring payload data, this extra step slows the process of building and transmitting the frame header, and hence slows the overall communication data rate.
In view of the foregoing, objects of the invention include: increasing data transfer processing speeds in high speed networks such as the Fibre Channel network; providing a technique that can speed up a protocol engine's processing of data frames; providing a protocol engine that can perform high speed full-duplex processing of data without involving the host CPU on a frame-by-frame basis; and providing a way for a transmit protocol engine to predetermine whether or not the current frame is the final frame, as well as having advance notice of the frame payload data size.
SUMMARY OF THE INVENTION
The invention is directed to the processing and transferring of frames of data in a computer data link. The invention is a full-duplex communication processor that uses dual micro-coded engines and specialized hardware to build transmit frames and to validate receive frames without involving a host CPU on a frame-by-frame basis and without a resident microprocessor. A preferred embodiment of the invention uses independent transmit and receive protocol processors. These independent processors communicate with each other using a transfer queue. A context manager provides context information that is used by the receive processor to validate received frames and by the transmit processor to build transmit frames.
The context information points to a buffer list which gives the address and size of buffer segments and control bits (Buffer List Modifier Bits) for each segment. The transmit protocol processor uses these control bits to determine how the data in the buffer segments can be transmitted in a sequence of frames. This information increases the speed with which the transmit protocol processor can build and transmit the transmit frame header.
In a preferred embodiment, the data channel is a Fibre Channel data link and the full-duplex communication processor is configured to process FC-2 protocol Fibre Channel frames.
The details of the preferred embodiment of the present invention are set forth in the accompanying drawings and the description below. Once the details of the invention are known, numerous additional innovations and changes will become obvious to one skilled in the art.
BRIEF DESCRIPTION OF THE DRAWINGS
FIGURE 1 is a block diagram of a prior art complex computer network utilizing Fibre Channel technology.
FIGURE 2 is a diagram of the five functional layers of the prior art Fibre Channel standard.
FIGURE 3 is a simplified block diagram of a communication processing system in accordance with a preferred embodiment of the invention.
FIGURE 4 is a diagram of a typical prior art Fibre Channel frame of data.
FIGURE 5 is a simplified block diagram of a full-duplex communication processor in accordance with a preferred embodiment of the invention.
FIGURE 6 is a diagram of the host data structures in a preferred embodiment of the invention.
FIGURE 7 is a diagram of an Exchange Context Resource Index (XRI) in accordance with a preferred embodiment of the invention.
FIGURE 8 is a diagram of a Buffer Pointer List Entry format in a preferred embodiment of the invention.
FIGURE 9 is a diagram of a Buffer Pointer List format in a preferred embodiment of the invention.
Like reference numbers and designations in the various drawings refer to like elements.
DETAILED DESCRIPTION OF THE INVENTION
The invention is a full-duplex communication processor that improves frame transmission and frame reception rates in high speed data links such as the Fibre Channel. By using independent transmit and receive microcoded engines communicating directly to host driver software, full-duplex inter-processor communication is accomplished without involving the host CPU. The use of Buffer List Modifier Control Bits speeds the building and transmitting of transmit frames by giving the transmit processor advance notice of which frame buffer contains the last frame.
FIGURE 3 shows a Fibre Channel communication system 20 utilizing the full-duplex communication processor 22 in accordance with a preferred embodiment of the invention.
Serial data is received along a Fibre Channel data link 24. Frames generally will comprise three portions: a preamble, a data or "payload" portion, and a trailer portion. In a Fibre Channel data link, for example, the Fibre Channel frame consists of a start of frame (SOF) word (4 bytes); a data portion comprising a frame header (six words, or 24 bytes), between zero and 2112 payload bytes, and a cyclical redundancy check (CRC) word (4 bytes); and an end of frame (EOF) word (4 bytes). The frame header is used to control link applications, control device protocol transfers, and detect missing or out of order frames. The CRC word indicates whether there is a problem in the transmission, such as a data corruption, or whether some part of the frame was dropped during transmission.
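The frame layout just described can be turned into a quick size calculation. The following is a minimal Python sketch, not part of the patent: the function name `frame_size` and the constant names are illustrative, and the header is assumed to be the standard FC-2 size of six 4-byte words (24 bytes).

```python
# Field sizes in bytes. The delimiter, CRC, and payload limits come from the
# text above; HEADER_BYTES assumes the standard FC-2 header of six 4-byte words.
SOF_BYTES = 4
HEADER_BYTES = 24
CRC_BYTES = 4
EOF_BYTES = 4
MAX_PAYLOAD_BYTES = 2112


def frame_size(payload_bytes: int) -> int:
    """Return the total on-the-wire size of one frame, delimiters included."""
    if not 0 <= payload_bytes <= MAX_PAYLOAD_BYTES:
        raise ValueError("payload must be between 0 and 2112 bytes")
    return SOF_BYTES + HEADER_BYTES + payload_bytes + CRC_BYTES + EOF_BYTES
```

Under these assumptions an empty frame occupies 36 bytes and a full frame 2148 bytes, so the fixed overhead is small relative to a maximum-size payload.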
Frames received from the Fibre Channel data link 24 are processed by an NL port 36 which decodes and parallelizes the incoming serial data into words. The NL port 36 assembles the words into frames. The NL port 36 also checks the CRC word for each frame received and adds a resulting "good-bad" CRC status indicator to other status information bits within an EOF status word that is generated from the EOF word. The NL port 36 then writes the frames into a receive frame FIFO buffer 28.
Fibre Channel frames are then received by the full-duplex communication processor 22, also referred to as a protocol engine. Several functions are performed by the full-duplex communication processor 22, including: 1) queuing up a host command to write the data in a received frame into host memory through direct memory access (DMA); 2) validating the frame header to ensure that the frame is the next logical frame that should be received; 3) determining whether the frame is defective or not; and 4) generating transmit frames in response to a received frame or host-generated transmit command.
Unlike conventional protocol engines, the full-duplex communication processor 22 does not include a microprocessor. Instead, dual microcoded engines are employed in order to separate the protocol engine receive tasks from the protocol engine transmit tasks. In particular, the full-duplex communication processor 22 includes a receive protocol engine 30 and a transmit protocol engine 32. These protocol engines communicate to each other through a transfer ready queue 60. The receive protocol engine 30 validates the receive frame headers received from the receive frame buffer 28. The transmit protocol engine 32 builds transmit frames and sends them to the Fibre Channel data link 24 through a transmit FIFO 66 and the NL port 36.
The full-duplex communication processor 22 works in conjunction with a host computer 40 that includes host driver software 38 and host memory 42. In particular, the receive and transmit protocol engines 30, 32 communicate directly with the host driver software 38. Full-duplex communication is achieved because the receive and transmit protocol engines operate independently and concurrently. An interlocked context information table is used to permit the receive and transmit protocol engines to operate on the same I/O command, as described in more detail below.
The full-duplex communication processor 22 is able to process frames without involving the host CPU on a frame-by-frame basis. For example, one function of the full-duplex communication processor 22 is to allow a remote device to send a frame along the Fibre Channel link 24 to the receive protocol engine 30 which will "wake up" the transmit protocol engine 32 to send data to the remote device through the NL port 36 to the Fibre Channel link 24. Such data may reside, for example, in the host memory 42.
FIGURE 5 shows additional details of the full-duplex communication processor 22 of a preferred embodiment of the invention. The full-duplex communication processor 22 includes data structures resident in host memory 42, which may include contiguous and non-contiguous physical memory.
A Fibre Channel frame is received by the receive protocol engine 30 through the NL port 36. An NL port status unit 44 performs the function of timing receive frame sequence and monitoring NL port interrupts. The received frame is sent through a sequencer 46 to a receive buffer control unit 48 which places the received frame in a receive buffer 50. The frame header in the receive buffer 50 is then automatically placed into the receive protocol engine 30.
WO 99/16177 PCT/US98/20011
A lookup field inside each frame header includes a pointer to an associated context. In general, the associated context is initialized by the host driver 38 within the host memory 42, and contains information indicating where to put a particular frame of data in host memory 42. More particularly, the context contains fields such as maximum frame size, current buffer pointer and length, and small computer systems interface (SCSI) state information, defined in a list of buffers.
The host memory unit 42 typically will comprise many megabytes of memory, and each particular frame will fit into one slot in that memory. Each frame header tells the receive protocol engine 30 which context to access or "pull down" for that particular frame so that the receive protocol engine can validate that frame. The context is pulled down from the host memory 42 under control of the context manager engine 52 through a host memory interface 54. The receive protocol engine sequencer 46 then validates the frame.
Once frame validation is complete, the context pointed to by a frame header will tell the receive protocol engine 30 what to do with the frame. There are a number of possibilities, including: 1) send the frame out the Routing Control/Type (R_CTL/TYPE) ring control unit 56 where it then is sent to host memory 42 through the host memory interface 54; 2) send the frame through the Buffer List ring control unit 58 to one segment in the buffer pointer list inside host memory; and 3) process a non-data receive frame and associated payload. (For example, the frame may be a communication frame such as a "transfer ready" that tells the transmitter that the target is now ready to accept data. This would cause the receive frame to pass to the transfer ready queue 60. The transmit command would then be sent to the transmit protocol engine 32.)
The second case involves sending a frame to a buffer pointer list, which is a sequential list of buffer descriptors. The first entry in the list contains the total transfer size in bytes. In the illustrated embodiment, only word transfers are performed by the full-duplex communication processor 22. Hence, if the total transfer size is not an integral number of 4-byte words, additional bytes are transferred to the next boundary. Subsequent entries in the buffer list consist of two parts each, one part being an address that points to the start of a buffer and the other part being the size and usage of the buffer.
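The rounding to the next 4-byte word boundary can be illustrated with a short calculation. This is a minimal Python sketch under the stated 4-byte word size; the function name `padded_transfer_bytes` is hypothetical and does not come from the patent.

```python
WORD_BYTES = 4  # the text states that only 4-byte word transfers are performed


def padded_transfer_bytes(total_bytes: int) -> int:
    """Round a transfer size up to the next whole 4-byte word."""
    if total_bytes < 0:
        raise ValueError("transfer size cannot be negative")
    words = (total_bytes + WORD_BYTES - 1) // WORD_BYTES  # ceiling division
    return words * WORD_BYTES
```

For example, a 10-byte transfer moves 12 bytes (three words), while an 8-byte transfer is already word aligned and moves exactly 8 bytes.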
In accordance with the invention, each buffer pointer list includes buffer list modifier (BLM) bits that describe the buffer usage and which are used to build an outgoing Fibre Channel frame header (the FC-2 header) for each transmit frame. The full-duplex communication processor 22 must build the frame header and corresponding frame control (F_CTL) bits, and transfer the frame header to a transmit FIFO 66 before transferring the payload via a DMA operation. The BLM bits and the buffer lengths in the buffer lists assist the full-duplex communication processor 22 in determining whether a frame is the last one in a series of frames. For the receive protocol engine, the BLM bits control proper placement of received data and status information into the buffer segments. The BLM bits are described in more detail below.
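To see why knowing the buffer lengths in advance matters, consider how a transfer splits into frames. The sketch below is a simplified Python illustration, not the patent's microcode: `plan_frames` and `max_payload` are hypothetical names, and it assumes the transfer is cut into maximum-size frames with one shorter final frame.

```python
def plan_frames(total_bytes: int, max_payload: int) -> list[tuple[int, bool]]:
    """Split a transfer into (payload_length, is_last_frame) pairs.

    Because the total size is known up front, the last-frame flag for each
    frame is available before any payload has been fetched, so the header
    can be built first.
    """
    if total_bytes < 0 or max_payload <= 0:
        raise ValueError("sizes must be non-negative and max_payload positive")
    frames = []
    offset = 0
    while offset < total_bytes:
        length = min(max_payload, total_bytes - offset)
        offset += length
        frames.append((length, offset == total_bytes))
    return frames
```

With a 5000-byte transfer and a 2048-byte maximum payload, this yields two full frames followed by a 904-byte frame marked as last.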
One example of a task performed by the full-duplex communication processor 22 is the processing of a command to write data to a disk drive on the Fibre Channel link 24 from a remote device. A write command is sent and the full-duplex communication processor transfers the command to the disk drive which sends back a transfer ready message to the receive protocol engine 30 indicating that the disk drive is ready to accept the data. This message goes to the transfer ready queue 60 which instructs the transmit protocol engine 32 to retrieve the data from host memory 42, generate a frame and transmit the data to the disk drive.
The transmit protocol engine 32 is triggered by either of two events: one is the presence of an entry in the transfer ready queue 60, and the other is by action of the command ring controller 62. An Exchange Context Resource Index (XRI), described below, is used to process each command. The command ring is a circular queue of command entries, generally read and write commands. These read and write commands can be used, for example, to communicate commands to a remote device such as a disk drive. The size and base memory address of the command ring are specified in a command ring base register which contains "put" and "get" pointers that are used for managing the command ring as follows: the host driver 38 manages the put pointer, incrementing the pointer whenever a command is queued to the command ring 62. The full-duplex communication processor 22 manages the get pointer, incrementing the pointer whenever a command is read from the ring.
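The put/get pointer scheme can be modeled in a few lines. The following Python sketch is illustrative only: the class and method names are hypothetical, and the convention of leaving one slot unused to tell a full ring from an empty one is an assumption that the text does not specify.

```python
class CommandRing:
    """Circular command queue with a producer (put) and consumer (get) index."""

    def __init__(self, size: int):
        if size < 2:
            raise ValueError("ring needs at least two slots")
        self.entries = [None] * size
        self.put = 0  # advanced by the host driver when it queues a command
        self.get = 0  # advanced by the processor when it reads a command

    def is_empty(self) -> bool:
        return self.put == self.get

    def is_full(self) -> bool:
        # Assumed convention: one slot stays unused to mark the full state.
        return (self.put + 1) % len(self.entries) == self.get

    def host_queue(self, command) -> bool:
        """Host side: store a command and advance the put pointer."""
        if self.is_full():
            return False
        self.entries[self.put] = command
        self.put = (self.put + 1) % len(self.entries)
        return True

    def processor_read(self):
        """Processor side: fetch the next command and advance the get pointer."""
        if self.is_empty():
            return None
        command = self.entries[self.get]
        self.get = (self.get + 1) % len(self.entries)
        return command
```

Because each side writes only its own pointer, the host and the processor can work on the ring concurrently without a shared lock in this model.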
A command other than a fill-frame transmission provides a pointer to a buffer pointer list. The buffer pointer list contains the total transfer size in the first buffer list entry and buffer pointer-size pairs in subsequent buffer list entries. The XRI field in the command will then be used to instruct the context manager 52 to pull down the appropriate context to the transmit protocol engine 32. This transfer, called an exchange, tells the transmit protocol engine 32 where the engine is in that particular buffer ring list, how much data the frame has and what stage it is in, etc. The context also contains the next frame header. The next frame header is initially built by the host driver 38 but thereafter the transmit protocol engine 32 builds subsequent frame headers. The context manager 52 retrieves each frame header from the host memory 42 and passes the header to the transmit header controller 68, which sends the frame header to the NL port 36 through the transmit FIFO 66.
Once a frame header is built, the system begins following the buffer list in a process that gathers data from host memory. The context for a command contains a pointer to the buffer list. One entry at a time is pulled down from the buffer list by the buffer list ring controller 70. The frame header is transferred to the transmit FIFO 66 through a transmit header control 68. A payload segmenter 72 begins to pull in payload (frame data) and put the payload data into the transmit FIFO 66. Once a frame header and the payload data are in the transmit FIFO 66, the last task is to write an end of frame (EOF) word to the transmit FIFO 66. The EOF word is an indication to the NL port 36 to begin transmitting the assembled frame onto the Fibre Channel link 24. Once all frames are sent out successfully, a response is generated which is sent to the host driver 38, indicating that the frames associated with the pending command were, in fact, sent out successfully.
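The ordering of writes into the transmit FIFO can be sketched as a small model. This Python example is an illustration under simplifying assumptions, not the hardware design: the tagged-tuple FIFO entries and the function names `assemble_frame` and `pop_complete_frame` are hypothetical.

```python
from collections import deque


def assemble_frame(fifo: deque, header_words, payload_words) -> None:
    """Queue one frame in the order given in the text: header, payload, EOF."""
    fifo.append(("HEADER", tuple(header_words)))
    for word in payload_words:
        fifo.append(("PAYLOAD", word))
    fifo.append(("EOF", None))  # the EOF entry marks the frame as ready to send


def pop_complete_frame(fifo: deque):
    """Port side: release a frame only once its EOF entry is in the FIFO."""
    if not any(kind == "EOF" for kind, _ in fifo):
        return None  # frame still being assembled, nothing to transmit yet
    frame = []
    while True:
        entry = fifo.popleft()
        frame.append(entry)
        if entry[0] == "EOF":
            return frame
```

In this model a header sitting alone in the FIFO is never transmitted; only the arrival of the EOF entry releases the whole frame to the port.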
Likewise, the receive protocol engine 30 contains an acknowledge FIFO 74 which generates an acknowledge frame (basically a modified form of the receive frame header) that is sent back over the Fibre Channel link 24 to the sender to acknowledge receipt.
The full-duplex communication processor 22 also includes receive and transmit protocol engine registers 76 and 78. These registers contain autonomous protocol management functions that are linked and synchronized through the context registers in the context manager 52. The context manager 52 manages coherency and caching of exchange context from the host memory 42, and also synchronizes accesses by the receive and transmit protocol engines 30, 32 to the cached exchange context contained in the context registers 80.
In the preferred embodiment, the context manager 52 and the receive and transmit protocol engines 30, 32 communicate with the host 40 through host memory interface 54 which includes a peripheral components interface (PCI), direct memory access (DMA) controller (not shown), and a PCI slave interface (not shown). The protocol engine registers 76, 78 contain the PCI slave interface and interrupt controller for the protocol engines 30, 32. The context manager 52, receive and transmit protocol engines 30, 32 provide status to and from the protocol engine registers 76, 78 for the PCI slave interface and interrupt controller.
The receive and transmit protocol engines 30, 32 implement the Fibre Channel protocol by using two independent programmable sequencers 46 and 63. The use of sequencers 46, 63 allows the protocol engine state machine to be implemented in a variable writable control store RAM, which is downloaded into the receive and transmit protocol engine registers 76, 78 during initialization. The host 40 can access this writable control store RAM and can read and write the writable control store RAM through a protocol register map. The use of the sequencers adds great flexibility to the protocol engine state machine implementation since, by changing code in the writable control store RAM, new or different functionality can be downloaded to the full-duplex communication processor 22.
The full-duplex communication processor 22 can be implemented on a single chip (such as an application specific integrated circuit (ASIC)), alone or together with other functions. For example, in the illustrated embodiment, the full-duplex communication processor 22 can cache one instance of the most recent transmit and receive context. However, by adding additional on-chip memory, additional instances of context can be cached.

Buffer Pointer List
FIGURE 6 shows the major data structures in the illustrated embodiment of the full-duplex communication processor 22. The Buffer Pointer List shown in FIGURE 6 is a sequential list of buffer descriptors. The first entry in the list contains the total transfer size in bytes; however, since the full-duplex communication processor 22 performs only word transfers, if the total transfer size is not an integral number of 4-byte words, additional bytes are transferred to the next word boundary. Subsequent entries in the buffer list consist of two parts each, one being an address that points to the start of a buffer and the other being the size in words of the buffer and control bits. Because a starting word address and a buffer word count parameter are used, the host must align the buffer starting address to a 32-bit word address.
In the illustrated embodiment, a Buffer Pointer List always starts on a quadword (16-byte) boundary. The last entry is always a NULL descriptor. A Buffer Pointer List must exist in contiguous physical memory. The format of the buffer descriptors and the layout of the list are illustrated in FIGURE 8, Buffer Pointer List Entry Format, and FIGURE 9, Buffer Pointer List Format.
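The list structure described above can be walked with a short loop. The following Python sketch models entries as simple objects rather than packed 32-bit words, since the exact bit packing is given separately in the figures; the names `BufferEntry` and `walk_buffer_list` are hypothetical, and treating a zero word count as the NULL descriptor is an assumption of this sketch.

```python
from dataclasses import dataclass


@dataclass
class BufferEntry:
    start_address: int  # byte address of the buffer, expected to be word aligned
    word_count: int     # buffer size in 4-byte words; 0 is treated as NULL here


def walk_buffer_list(entries: list[BufferEntry]) -> list[tuple[int, int]]:
    """Return (address, size_in_bytes) for each buffer before the NULL entry."""
    segments = []
    for entry in entries:
        if entry.word_count == 0:  # assumed NULL descriptor: end of the list
            return segments
        if entry.start_address % 4 != 0:
            raise ValueError("buffer address must be aligned to a 32-bit word")
        segments.append((entry.start_address, entry.word_count * 4))
    raise ValueError("buffer list is missing its terminating NULL descriptor")
```

A caller would pass only the entries that follow the first (total transfer size) entry, for example two buffers followed by a terminator.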
The bit layout of the Buffer Pointer List Entry format and Buffer Pointer List format of the illustrated embodiment is described in detail as follows:
Bits [31:0] Total Transfer Size (TTSZ)
The Total Transfer Size is the total number of bytes to be transmitted or received. In the case of FCP I/Os, the total transfer size does not include the FCP_CMD, FCP_XFR_RDY or FCP_RSP frames.
Bits [31:2] Buffer Starting Word Address (BSWA)
This field contains the following information depending on the buffer list entry (BLE) position in the buffer list. For the first BLE the BSWA field is not used. For all subsequent BLEs except the last, the BSWA field contains the starting word address of a buffer. For the last BLE, the BSWA field contains the IOTAG. The last BLE is indicated by setting a zero in the BWC (length) field. All BSWA values from 0000,0000h to FFFF,FFFCh are valid.
Bits [31:24] Buffer List Modifiers (BLM)
The BLM bits cause the transmit protocol engine 32 to set or clear certain bits in the FC-2 header. These bits are valid for every BLE except the first BLE, where these bits are redefined as the upper total transfer size bits 31-24. The BLM bits are used as follows:
Bit [31] Receive Buffer
This is a receive buffer and will not be sent.
Bit [30] First Frame Indication
When set, the transmit protocol engine 32 will set the SOF delimiter to SOFi3 for one frame.
Bit [29] F_CTL.fs bit
When set, the transmit protocol engine 32 will set the first-frame-of-exchange F_CTL bit.
Bit [28] F_CTL.si bit
When set, the transmit protocol engine 32 will set the sequence-initiative bit in the F_CTL field for the last frame transmitted for this sequence.
Bit [27] F_CTL.es bit
When set, the transmit protocol engine 32 will set the end-of-sequence bit in the F_CTL field for the last frame transmitted for this sequence.
Bit [26] F_CTL.ls bit
When set, the transmit protocol engine 32 will set the last-sequence bit in the F_CTL field for the last frame transmitted for this sequence to terminate the exchange.
Bit [25] SEQ_COMPLETE
When set, the transmit protocol engine 32 will complete the sequence when all the data pointed to by this BLE is sent.
Bit [24] FCP_DATA
When set, the transmit protocol engine 32 will replace the header TYPE field with FCP_DATA.

WO 99/16177 PCT/US9$/20011 Bits [23:18] Reserved BLM
The Reserved BLM bits are mapped to the sequencer for future use, except for the first BLE, where these bits-are defined as bits 23-18 of the total transfer size.
Bit [17:2] Buffer Word Count (BWC) These bits define the length in 32-bit words of the buffer or the Total Transfer size, depending on the BLE position in the buffer list. For the first BLE, these bits provide bits 17-2 of the Total Transfer size. For all subsequent BLEs except the last, this field defines the buffer word count. For the last BLE the BWC field is reserved and must have a zero 1o value.
Bit [1:0] Reserved Bits Must be written as zero, except for the first BLE, where bits 1-0 define the residual byte length of the total transfer.
Bits [31:0] I/O Tag (IOTAG) Instead of a BSWA, the last BLE can contain a value that is of use to the Host Driver to identify the operation on completion. This field is not processed by the full-duplex communication processor 22. For normal completion of a Transmit Sequence command, 2o word 1 of the response points to this word.
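As an illustration of the bit assignments listed above, the following Python sketch packs and unpacks the word of a non-first BLE that carries the BLM bits and the Buffer Word Count. The bit positions come from the field descriptions; the function names and the flag names used as dictionary keys are hypothetical and chosen only for this example.

```python
# Bit positions of the Buffer List Modifier flags, per the field list above.
# The key names are illustrative only.
BLM_BITS = {
    "receive_buffer": 31,
    "first_frame": 30,
    "fs": 29,
    "si": 28,
    "es": 27,
    "ls": 26,
    "seq_complete": 25,
    "fcp_data": 24,
}
BWC_SHIFT = 2        # Buffer Word Count occupies bits [17:2]
BWC_MASK = 0xFFFF    # 16 bits wide


def pack_control_word(word_count: int, **flags: bool) -> int:
    """Build the BLM/BWC word for a BLE that is neither first nor last."""
    if not 0 < word_count <= BWC_MASK:
        raise ValueError("word count must fit in bits [17:2] and be non-zero")
    word = word_count << BWC_SHIFT   # bits [1:0] and [23:18] stay zero
    for name, value in flags.items():
        if value:
            word |= 1 << BLM_BITS[name]
    return word


def unpack_control_word(word: int):
    """Return (word_count, flags) decoded from a BLM/BWC word."""
    flags = {name: bool((word >> bit) & 1) for name, bit in BLM_BITS.items()}
    return (word >> BWC_SHIFT) & BWC_MASK, flags


word = pack_control_word(256, si=True, es=True, seq_complete=True)
print(f"{word:#010x}")  # 0x1a000400
```

A zero word count is rejected because the text reserves a zero BWC for marking the last BLE.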
XRI Context
In the preferred embodiment, each context is divided into two host memory structures: remote port context and exchange context. An exchange context is contained in an Exchange Context Resource Index (XRI) which is used to process a command. In particular, an exchange context is a structure that describes a complete exchange or controls transmission of one or more sequences. The structure is pointed to by an entry in an exchange pointer table. An XRI context contains the supporting context needed for an operation to take place immediately or through separate sequences. The data to send, or the buffers to receive data, are described by a buffer pointer list consisting of a set of buffer list entries that point to the actual buffers. As described above, a buffer list entry contains the address and length of a buffer and control bits to indicate sequence initiative, end of exchange, end of sequence, etc.
For multiple-sequence operations the XRI context provides storage for working-register contents.
In the preferred embodiment, the XRI contexts are used by the full-duplex communication processor 22 for: Fibre Channel Protocol (FCP) exchanges that it originates; for transmission exchanges; for Receive Frame validation; and for temporary purposes to control transmission of a single frame or multiple frame sequences. The XRI context may be used by the host driver 38 for keeping track of exchanges for which data is received into buffer ring buffers.
An example of an XRI context is shown in FIGURE 7. The first word is the XRI control-status word. The XRI control-status word contains configuration fields that are set by the host driver. The Total Transfer Size word reflects sequencer activity. For Fibre Channel protocol (FCP) originated exchanges, the XRI control-status word shows the remaining byte count for write operations and the cumulative received byte count for read operations. For transmit sequence commands, the XRI control-status word shows the remaining byte count if the operation is halted before the complete sequence is transmitted. The Rxeng control-status word is used by the receive protocol engine 30 to validate frames. The Current Buffer List Address word reflects sequencer activity, as does the Current Buffer Offset Address word.
The buffer list modifier (BLM) bits are set from the corresponding bits of a buffer list entry (BLE) read under sequencer control. The Residual Buffer Length in word five reflects sequencer activity. Whenever a sequencer reads a BLE, this field receives the buffer word count. Whenever a sequencer issues a DMA operation to transfer data to or from the buffer, the word count is reduced by the length of the transfer data. The Current Buffer Burst Length word also reflects transmit sequencer activity.
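The bookkeeping described for the BLM bits and the Residual Buffer Length can be sketched as follows. This is an illustrative model only: the class `XriBufferState` and its method names are hypothetical, and the error raised on an oversized DMA is an assumption, since the text does not say how the hardware handles that case. The bit positions are those given in the Buffer Pointer List Entry description.

```python
class XriBufferState:
    """Hypothetical model of the BLM and Residual Buffer Length context words."""

    def __init__(self) -> None:
        self.blm = 0              # copy of BLE bits [31:24]
        self.residual_words = 0   # words still to be transferred for this buffer

    def read_ble(self, control_word: int) -> None:
        """On reading a BLE, latch its BLM bits and its buffer word count."""
        self.blm = (control_word >> 24) & 0xFF
        self.residual_words = (control_word >> 2) & 0xFFFF

    def dma(self, words: int) -> int:
        """Account for a DMA of `words` 32-bit words; return what remains."""
        if words > self.residual_words:
            raise ValueError("DMA is longer than the remaining buffer")
        self.residual_words -= words
        return self.residual_words


state = XriBufferState()
state.read_ble(0x1A000400)   # BLM = 0x1A, buffer word count = 256
print(state.dma(64))         # 192
```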
The Fibre Channel FC-2 header, in words 7-12, is used to generate header information for each frame transmitted by the transmit protocol engine 22.

R_CTL/TYPE Buffer Rings
Referring again to FIGURE 5, the R_CTL/TYPE ring control 56 controls buffer rings that are used to receive all frames except FCP responder frames, i.e., for a locally originated FCP exchange. Three R_CTL/TYPE buffer rings assist the host in demultiplexing incoming frames for the appropriate driver entry points. An R_CTL/TYPE buffer ring is a fixed-size, sequential list of buffer descriptors. The list is managed by hardware as a logical ring. Buffer descriptors are like buffer list entries in a buffer pointer list, but do not contain BLM bits.
The host driver 42 specifies the location and size of each buffer ring in the corresponding base register. Specific registers specify which entries in the R_CTL/TYPE buffer rings are valid. Each register consists of a put pointer and a get pointer. The receive buffers for each ring are used in the exact order in which the host driver put the corresponding buffer descriptions into the ring.
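A ring managed with a put pointer and a get pointer can be sketched in Python as shown below. The text does not spell out how empty and full rings are distinguished, so this sketch assumes a common convention: the ring is empty when the two pointers are equal, and one slot is left unused so that a full ring is distinguishable from an empty one. The class and method names are hypothetical.

```python
class BufferRing:
    """Hypothetical fixed-size ring of buffer descriptors with put/get pointers."""

    def __init__(self, size: int) -> None:
        self.entries = [None] * size
        self.put = 0   # next slot the host driver fills
        self.get = 0   # next slot the hardware consumes

    def valid_count(self) -> int:
        """Number of entries between the get pointer and the put pointer."""
        return (self.put - self.get) % len(self.entries)

    def post(self, descriptor) -> None:
        """Host driver side: add a descriptor and advance the put pointer."""
        if self.valid_count() == len(self.entries) - 1:
            raise RuntimeError("ring full")
        self.entries[self.put] = descriptor
        self.put = (self.put + 1) % len(self.entries)

    def take(self):
        """Hardware side: consume the oldest descriptor and advance the get pointer."""
        if self.put == self.get:
            raise RuntimeError("ring empty")
        descriptor = self.entries[self.get]
        self.get = (self.get + 1) % len(self.entries)
        return descriptor


ring = BufferRing(4)
for name in ("buf_a", "buf_b", "buf_c"):
    ring.post(name)
print(ring.take(), ring.take())  # buf_a buf_b
```

Because entries are only ever consumed at the get pointer, the buffers are used in exactly the order in which they were posted, which matches the ordering property stated above.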
Loop initialization is initiated locally under host control or remotely by some other port. The host driver 38, the transmit and receive sequencers 46, 63, and the NL port 36 logic all work together to complete the loop initialization procedure. During this procedure, the host driver either originates or passes on Fibre Channel Extended Link Services (ELS) frames that determine the addresses and capabilities of the ports on the loop. The host driver 38 is responsible for issuing Loop Initialization Select Master (LISM) ELS frames which facilitate the loop initialization process.
Initialization is needed because both the receive and transmit protocol engines 30, 32 are basically two autonomous engines running in full-duplex and they have very minimal communication between the two of them. During initialization, the transmit protocol engine 32 is turned off and the receive protocol engine 30 is allowed to receive frames and then send them through the transmit protocol engine 32. Thus, the receive protocol engine takes up "ownership" of the transmit protocol engine hardware and uses that hardware to forward frames, in particular, the LISM frames which are transmitted utilizing the transmit LISM control module 82.

Buffer List Modifier Bit Interpreter
The full duplex communication processor 22 uses the buffer list modifier (BLM) bits to build the outgoing Fibre Channel FC2 header. It must build the header and the corresponding F_CTL bits, and transfer it into the transmit FIFO before DMAing the payload. The BLM bits and the buffer lengths in the buffer list assist the processor 22 in determining whether the frame is the last one. These parameters are interpreted by the full-duplex communication processor 22 as shown by the following pseudo-code:

    if (first frame of command or xfr_rdy)
        if (BLM.first_frame) SOF = SOFi3 else SOF = SOFn3    // Set SOFi3/n3 delimiter
    else SOF = SOFn3
    if (BLM.first_sequence) F_CTL.fs = 1 else F_CTL.fs = 0   // Set F_CTL first frame bit
    if (BLM.FCP_DATA) R_CTL = FCP_DATA, F_CTL.rop = 1        // Set up FCP_DATA frame
    if (sequence_complete)
        if (BLM.si) F_CTL.si = 1
        if (BLM.es) F_CTL.es = 1
        if (BLM.ls) F_CTL.ls = 1
        F_CTL.fill_bytes = fill bytes from burst length running count
        SOF.Last_Frame = 1
    where:
        sequence_complete = (total transfer size <= max frame size) ||
                            (burst size <= max frame size) ||
                            (BLM.SEQ_COMP && (remaining buffer size <= max frame size))
        max frame size = min(N_Port max frame size, 1k)
        if FCP burst size = FCP_XFR_RDY burst length field
        else burst size = total transfer size

The BLM bits are updated in the XRI Context as each BLE is processed by the full-duplex communication processor 22. In certain exception conditions, the driver must update the BLM bits in the XRI Context before issuing a Restart command to resume sequence transmission. As an example, the BLM first frame bit must be clear to resume transmitting a sequence after the first frame has already been transmitted.
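The pseudo-code above can be approximated by a small runnable Python sketch. It is an illustration under stated assumptions, not the patented implementation: the header is modelled as a plain dictionary, "1k" is taken to mean 1024 bytes, the caller is assumed to supply the current running sizes, and the fill-byte and last-frame handling are omitted. All Python names (`Blm`, `max_frame_size`, `sequence_complete`, `build_header`) are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Blm:
    """Hypothetical view of the BLM flags used when building a header."""
    first_frame: bool = False
    first_sequence: bool = False
    si: bool = False
    es: bool = False
    ls: bool = False
    seq_complete: bool = False
    fcp_data: bool = False


def max_frame_size(n_port_max: int) -> int:
    """min(N_Port max frame size, 1k); 1k is assumed to mean 1024 bytes."""
    return min(n_port_max, 1024)


def sequence_complete(blm: Blm, total_size: int, burst_size: int,
                      remaining_buffer: int, max_frame: int) -> bool:
    """True when the frame about to be built is the last one of the sequence."""
    return (total_size <= max_frame
            or burst_size <= max_frame
            or (blm.seq_complete and remaining_buffer <= max_frame))


def build_header(blm: Blm, first_frame_of_command: bool, seq_done: bool) -> dict:
    """Select the SOF delimiter and F_CTL bits from the BLM flags."""
    header = {"SOF": "SOFn3", "F_CTL": set()}
    if first_frame_of_command and blm.first_frame:
        header["SOF"] = "SOFi3"
    if blm.first_sequence:
        header["F_CTL"].add("fs")
    if blm.fcp_data:
        header["R_CTL"] = "FCP_DATA"
        header["F_CTL"].add("rop")
    if seq_done:
        # These bits are applied only to the last frame of the sequence.
        for name in ("si", "es", "ls"):
            if getattr(blm, name):
                header["F_CTL"].add(name)
    return header


blm = Blm(first_frame=True, first_sequence=True, es=True, seq_complete=True)
done = sequence_complete(blm, total_size=4096, burst_size=4096,
                         remaining_buffer=512, max_frame=max_frame_size(2048))
header = build_header(blm, first_frame_of_command=True, seq_done=done)
print(header["SOF"], sorted(header["F_CTL"]))  # SOFi3 ['es', 'fs']
```

Note that the decision needs only the BLM flags and the running size counters, which is what allows the header to be built before any payload is fetched.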
By helping the processor 22 determine whether a frame is the last one, the BLM bits allow the transmit protocol engine 32 to build the transmit frame header in one pass, without advance notice of the frame payload data size. Also, the BLM bits permit the transmit protocol engine 32 to load the transmit header words into the transmit buffer prior to any knowledge of the transmit payload size. This simplifies the transmit buffer architecture without requiring repositioning of the address logic on the address pointer.

It will be appreciated by those skilled in the art that the BLM bits can be used with similar advantages in other communication processors, such as those which utilize a single protocol engine for transmit and receive processes.
A number of embodiments of the present invention have been described. Nevertheless, it will be understood that various modifications may be made without departing from the spirit and scope of the invention. Accordingly, it is to be understood that the invention is not to be limited by the specific illustrated embodiments, but only by the scope of the claims.

Claims (15)

WHAT IS CLAIMED IS:

  1. A communication processor adapted to be coupled to a computer network and to a host computer, the host computer including host memory, CPU, payload data and host driver software, the communication processor comprising:
    a receive processor and a transmit processor coupled to the computer network for receiving and validating received frames of data, and for constructing and transmitting frames of data, wherein the frames of data include headers, and receive and transmit processor being operably coupled to receive and validate said frames of data without frame-by-frame involvement of the CPU and without a resident microprocessor; and
    an interface for coupling said receive and transmit processors with said host computer, wherein said host driver software is configured to set predetermined bits in a frame which indicate the last frame in a series of frames, and said receive and transmit processors are configured to use said predetermined bits to build a transmit frame header containing information regarding the last frame in a series of frames.
  2. A communication processor according to claim 1, further comprising an interlocked information table containing context information wherein the receive processor uses said context information to process said received frames of data and the transmit processor uses said predetermined bits to build a transmit header from said context information so as to build transmit frames and without involving said CPU.
  3. A communication processor according to claim 1, wherein said transmit and receive processors comprise independent associated transmit and receive microcoded engines.
  4. A communication processor according to claim 1, wherein each said microcoded engine is a sequencer.
  5. A communication processor according to claim 1, wherein said received frames of data are Fibre Channel frames.
  6. A communication processor according to claim 5, wherein said receive and transmit processors each implement an FC-2 Fibre Channel communications protocol.
  7. A communication processor according to claim 1, comprising: a single integrated circuit containing said receive and transmit processors.
  8. A communication processor according to claim 1, wherein said interface includes a direct memory access (DMA) interface.
  9. In a communication processor including a receive processor and a transmit processor operably coupled to receive and validate frames of data without frame-by-frame involvement of an external CPU and without a resident microprocessor, a method of processing a series of frames of data comprising:
    receiving a first frame of data by the receive processor from a computer network;
    transferring said first frame of data from said receive processor to a communication module;
    storing contextual information relating to said first frame in an information table;
    using said contextual information to validate said first frame;
    setting predetermined bits in a frame which indicate the last frame in said series of frames;
    constructing, by the transmit processor, a transmit frame containing information indicating the last frame in a series of frames; and transferring said transmit frame to said computer network.
  10. A method according to claim 9, wherein said first frame is a Fibre Channel frame and said step of constructing a transmit frame includes constructing a Fibre Channel transmit frame.
  11. A method according to claim 10, further comprising the step of coupling said receive and transmit processors to said host computer through a direct memory access (DMA) interface.
  12. A computer network comprising:
    (a) source and destination computer devices;
    (b) a communication channel coupled to said source and destination computer devices;
    (c) a receive processor and a transmit processor coupled to said communication channel, wherein said receive processor receives and validates a series of frames of data from said source computer device where the frames of data include commands, and wherein said transmit processor constructs transmit frames;
    (d) a host computer including a CPU, memory and driver software;
    (e) an interface for coupling said receive and transmit processors with said host computer;
    (f) said driver software in said host computer arranged and configured to set predetermined bits in a frame which indicates which frame is the last frame in said series; and (g) said receive and transmit processors being configured to use said predetermined bits to build a transmit frame header containing information identifying the last frame in a series of frames, and (h) the receive and transmit processors being operably coupled to receive and validate frames of data without frame-by-frame involvement of an external CPU and without a resident microprocessor.
  13. A computer network according to claim 12, wherein said frames of data are Fibre Channel frames.
  14. A computer network according to claim 12, wherein said receive and transmit processors each implement an FC-2 Fibre Channel communications protocol.
  15. A computer network according to claim 12, wherein said interface includes a direct memory access (DMA) interface.
CA002304620A 1997-09-24 1998-09-24 Communication processor having buffer list modifier control bits Expired - Fee Related CA2304620C (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US08/937,065 US6304910B1 (en) 1997-09-24 1997-09-24 Communication processor having buffer list modifier control bits
US08/937,065 1997-09-24
PCT/US1998/020011 WO1999016177A2 (en) 1997-09-24 1998-09-24 Communication processor having buffer list modifier control bits

Publications (2)

Publication Number Publication Date
CA2304620A1 CA2304620A1 (en) 1999-04-01
CA2304620C true CA2304620C (en) 2004-08-10

Family

ID=25469442

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002304620A Expired - Fee Related CA2304620C (en) 1997-09-24 1998-09-24 Communication processor having buffer list modifier control bits

Country Status (6)

Country Link
US (1) US6304910B1 (en)
EP (1) EP1023668A4 (en)
JP (1) JP3457947B2 (en)
KR (1) KR100367949B1 (en)
CA (1) CA2304620C (en)
WO (1) WO1999016177A2 (en)

Also Published As

Publication number Publication date
WO1999016177B1 (en) 1999-10-07
JP3457947B2 (en) 2003-10-20
CA2304620A1 (en) 1999-04-01
JP2001517888A (en) 2001-10-09
KR20010078685A (en) 2001-08-21
EP1023668A2 (en) 2000-08-02
WO1999016177A2 (en) 1999-04-01
KR100367949B1 (en) 2003-01-14
EP1023668A4 (en) 2005-02-09
US6304910B1 (en) 2001-10-16
WO1999016177A3 (en) 1999-08-12

Legal Events

Date Code Title Description
EEER Examination request
MKLA Lapsed