US20110170546A1 - Logical separation and accessing of descriptor memories - Google Patents
- Publication number
- US20110170546A1 (application Ser. No. 13/072,382)
- Authority
- US
- United States
- Prior art keywords
- data
- header
- execution unit
- higher layer
- header information
- Prior art date
- Legal status
- Abandoned
Classifications
- H04L45/00—Routing or path finding of packets in data switching networks
- H04L45/60—Router architectures
- H04L49/3009—Header conversion, routing tables or routing tags
- H04L49/3072—Packet splitting
- H04L49/602—Multilayer or multiprotocol switching, e.g. IP switching
- H04L2012/5638—Services, e.g. multimedia, GOS, QOS
- H04L2012/5665—Interaction of ATM with other protocols
Description
- This application is related to six concurrently-filed applications (attorney docket nos. 0023-0063, 0023-0064, 023-0065, 023-0066, 023-0067, and 0023-0069), Ser. Nos. ______, ______, ______, ______, ______, and ______, respectively.
- The principles of the invention relate generally to high performance network routers, and more particularly, to the processing of network information in a high performance router.
- Routers receive data on a physical medium, such as optical fiber, analyze the data to determine its destination, and output the data on a physical medium in accordance with the destination.
- Routers were initially designed using a general purpose processor executing large software programs. As line rates and traffic volume increased, however, general purpose processors could not scale to meet these new demands. For example, as functionality was added to the software, such as accounting and policing functionality, these routers suffered performance degradation. In some instances, the routers failed to handle traffic at line rate when the new functionality was turned on.
- To meet the new demands, purpose-built routers were designed with components optimized for routing. They not only handle higher line rates and higher network traffic volume, they also add functionality without compromising line rate performance.
- A purpose-built router may include a number of input and output ports from which it transmits and receives information packets. A packet received at one port is directed to its appropriate output port based on an examination and processing of the packet's header information, which includes an indication of the packet's destination. A high-performance router must accordingly examine and process the header information at a high speed.
- Thus, it is desirable to improve the speed and efficiency of packet header processing in high-performance routers.
- Systems and methods consistent with the principles of the invention provide for improved memory access within a packet header processing engine.
- In accordance with one purpose of the invention as embodied and broadly described herein, a packet header processing engine may include a template memory configured to store template data and a tag descriptor memory configured to store tag descriptor data.
- The engine may also include a level x (Lx) descriptor memory configured to store Lx descriptor data.
- An Lx header execution unit may be connected to the template memory, the tag descriptor memory, and the Lx descriptor memory.
- The Lx header execution unit may be configured to construct an Lx header from at least one of the template data, the tag descriptor data, and the Lx descriptor data.
- In another implementation consistent with the principles of the invention, a packet header processing engine may include a memory configured to store at least one of template data, tag descriptor data, and level 2 (L2) descriptor data as continuous portions of the memory.
- A packet processing unit may be connected to the memory.
- The packet processing unit may be configured to construct packet header information according to a packet header protocol from the at least one of the template data, the tag descriptor data, and the L2 descriptor data.
- In a further implementation consistent with the principles of the invention, a method may include reading descriptor data from a memory and incrementing at least one pointer to the descriptor data in the memory upon the reading.
- Packet header information may be constructed from the descriptor data according to a packet header protocol.
- In yet another implementation consistent with the principles of the invention, a network device for processing packets may include a buffer configured to store the packets, including header data and payload data for the packets.
- A descriptor reader component may be coupled to the buffer. The descriptor reader component may be configured to receive the header data for the packets stored in the buffer and configured to look up descriptor information relating to the received packet header data.
- A memory may be coupled to the descriptor reader component. The memory may include a number of distinct portions for receiving the descriptor information from the descriptor reader component.
- A packet header processing unit may be coupled to the descriptor reader component and may include a number of pointers corresponding to the number of distinct memory portions. The packet header processing unit may be configured to receive the packet header data from the descriptor reader component and the descriptor information from the number of distinct memory portions.
- In a further implementation consistent with the principles of the invention, a packet header processing engine may include a memory having a number of distinct portions for respectively storing different types of descriptor information for a header of a packet.
- A packet header processing unit may include a number of pointers corresponding to the number of distinct memory portions. The packet header processing unit may be configured to retrieve the different types of descriptor information from the number of distinct memory portions and to generate header information from the different types of descriptor information.
- The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate an embodiment of the invention and, together with the description, explain the invention. In the drawings:
- FIG. 1 is a block diagram illustrating an exemplary routing system in which principles consistent with the invention may be implemented;
- FIG. 2 is an exemplary detailed block diagram illustrating portions of the routing system shown in FIG. 1;
- FIG. 3 is a diagram illustrating portions of FIG. 2 in additional detail;
- FIG. 4 is a diagram illustrating components of a packet;
- FIG. 5 is a diagram illustrating elements of the processing units shown in FIG. 3;
- FIG. 6 is a diagram illustrating an L2/L3 engine component having six parallel engines;
- FIG. 7 is a diagram illustrating one of the L2/L3 engines shown in FIG. 6;
- FIG. 8 is a flow chart illustrating operation of the L2/L3 engines shown in FIGS. 6 and 7;
- FIG. 9 is a diagram conceptually illustrating the loading and unloading of data in the L2/L3 engines 602;
- FIG. 10 is a diagram illustrating the operation of mailbox registers;
- FIG. 11 is a diagram illustrating the L2 execution unit of FIG. 7 in additional detail;
- FIG. 12 is a diagram illustrating the L3 execution unit of FIG. 7 in additional detail;
- FIG. 13 is a diagram illustrating the descriptor memories of FIG. 7 in additional detail;
- FIG. 14 is a diagram illustrating checksum processing hardware in conjunction with the L3 execution unit of FIG. 7; and
- FIG. 15 is a flow chart illustrating fragment handling by the L2/L3 engine component of FIG. 5.
- The following detailed description of the invention refers to the accompanying drawings. The same reference numbers may be used in different drawings to identify the same or similar elements. Also, the following detailed description does not limit the invention. Instead, the scope of the invention is defined by the appended claims and equivalents of the claim limitations.
- As described herein, a processing engine may include pointers to read descriptor data from a single descriptor memory having logical portions. In one implementation, three pointers enable first-in, first-out (FIFO) behavior from three contiguous portions of the descriptor memory.
- FIG. 1 is a block diagram illustrating an exemplary routing system 100 in which the present invention may be implemented.
- System 100 receives a data stream from a physical link, processes the data stream to determine destination information, and transmits the data stream out on a link in accordance with the destination information.
- System 100 may include packet forwarding engines (PFEs) 110 , a switch fabric 120 , and a routing engine (RE) 130 .
- RE 130 performs high level management functions for system 100 .
- For example, RE 130 communicates with other networks and systems connected to system 100 to exchange information regarding network topology.
- RE 130 creates routing tables based on network topology information, creates forwarding tables based on the routing tables, and forwards the forwarding tables to PFEs 110 .
- PFEs 110 use the forwarding tables to perform route lookup for incoming packets.
- RE 130 also performs other general control and monitoring functions for system 100 .
- PFEs 110 are each connected to RE 130 and switch fabric 120 .
- PFEs 110 receive data at ports on physical links connected to a network, such as a wide area network (WAN), a local area network (LAN), or a device.
- Each physical link could be one of many types of transport media, such as optical fiber or Ethernet cable.
- The data on the physical link is formatted according to one of several protocols, such as the synchronous optical network (SONET) standard or Ethernet.
- PFE 110 processes incoming data by stripping off the data link layer.
- PFE 110 converts header information from the remaining data into data structures referred to herein as “notifications” and “cells” (where a cell is a fixed length data unit).
- For example, in one embodiment, the data remaining after the data link layer is stripped off is packet data.
- PFE 110 includes the layer 2 (L2) and layer 3 (L3) packet header information, some control information regarding the packets, and the packet data in a series of cells called “D” cells.
- In one embodiment, the L2, L3, and the control information are stored in the first two cells of the series of cells.
- In general, L2 header information refers to header information used to steer packets within LANs. L3 header information, in contrast, provides connectivity between different LANs. In this sense, the L3 header information provides more general destination information than the L2 header.
- An incoming packet may include both L2 and L3 header information.
- Router 100 may modify the packet's L2 and L3 header information before transmitting the packet to its next destination.
- PFE 110 forms a notification based on the L2, L3, and control information, and performs a route lookup using the notification and the routing table from RE 130 to determine destination information. PFE 110 may also further process the notification to perform protocol-specific functions, policing, and accounting, and might even modify the notification to form a new notification.
- If the destination indicates that the packet should be sent out on a physical link connected to PFE 110, PFE 110 retrieves the cells for the packet, converts the notification or new notification into header information, forms a packet using the packet data from the cells and the header information, and transmits the packet from the port associated with the physical link.
- If the destination indicates that the packet should be sent to another PFE via switch fabric 120, PFE 110 retrieves the cells for the packet, modifies the first two cells with the new notification and new control information, if necessary, and sends the cells to the other PFE via switch fabric 120.
- PFE 110 appends a sequence number to each cell, which allows the receiving PFE to reconstruct the order of the transmitted cells. Additionally, the receiving PFE uses the notification to form a packet using the packet data from the cells, and sends the packet out on the port associated with the appropriate physical link of the receiving PFE.
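A minimal sketch of how a receiving PFE might use the appended sequence numbers to restore cell order. The window size, field widths, and function names are illustrative assumptions, not taken from the patent.

```c
#include <stdbool.h>
#include <stdint.h>

#define CELL_BYTES     64   /* fixed-length data unit, per the text        */
#define REORDER_WINDOW 32   /* illustrative reorder window (assumption)    */

struct cell {
    uint32_t seq;           /* sequence number appended by the sending PFE */
    uint8_t  data[CELL_BYTES];
};

static struct cell window[REORDER_WINDOW];
static bool        valid[REORDER_WINDOW];
static uint32_t    next_seq;  /* next sequence number owed to the packet builder */

/* Buffer a cell that may arrive out of order across switch fabric 120. */
void cell_arrive(const struct cell *c)
{
    window[c->seq % REORDER_WINDOW] = *c;
    valid[c->seq % REORDER_WINDOW]  = true;
}

/* Drain cells in transmitted order; returns false once a gap is reached. */
bool cell_next(struct cell *out)
{
    uint32_t slot = next_seq % REORDER_WINDOW;
    if (!valid[slot] || window[slot].seq != next_seq)
        return false;       /* the next in-order cell has not arrived yet */
    *out = window[slot];
    valid[slot] = false;
    next_seq++;
    return true;
}
```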
- RE 130 , PFEs 110 , and switch fabric 120 perform routing based on packet-level processing.
- PFEs 110 store each packet in cells while performing a route lookup using a notification, which is based on packet header information, including L2 and L3 layer header information.
- For example, a packet might be received on one PFE and go back out to the network on the same PFE, or be sent through switch fabric 120 to be sent out to the network on a different PFE.
- FIG. 2 is an exemplary detailed block diagram illustrating portions of routing system 100 .
- PFEs 110 connect to one another through switch fabric 120 .
- Each of the PFEs may include one or more physical interface cards (PICs) 210 and flexible port concentrators (FPCs) 220 .
- PIC 210 may transmit data between a physical link and FPC 220 .
- Different PICs are designed to handle different types of physical links.
- For example, one of PICs 210 may be an interface for an optical link while another PIC may be an interface for an Ethernet link.
- FPCs 220 perform routing functions and handle packet transfers to and from PICs 210 and switch fabric 120 . For each packet it handles, an FPC performs the previously-discussed route lookup function.
- Although FIG. 2 shows two of PICs 210 connected to each of FPCs 220 and three FPCs 220 connected to switch fabric 120, in other embodiments consistent with principles of the invention there can be more or fewer PICs 210 and FPCs 220.
- FIG. 3 illustrates an exemplary configuration of one of FPCs 220 according to an implementation consistent with the principles of the invention.
- Each FPC may include processing units 332 and 334 , first input/output (I/O) unit 336 , second I/O unit 338 , memory unit 340 , and R unit 342 .
- Each processing unit 332 and 334 corresponds to one or more of PICs 210 .
- Processing units 332 and 334 may process packet data flowing between PICs 210 and first I/O unit 336.
- Each processing unit 332 and 334 may process packet data received from the PIC connected to it, and data received from first I/O unit 336 .
- For example, processing unit 332 or 334 may process packets from PIC 210 to convert the packets into data cells, and transmit the data cells to first I/O unit 336.
- Data cells are the data structure used by FPC 220 internally for transporting and storing data. In one implementation, data cells are 64 bytes in length.
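The text fixes only the 64-byte cell size and that a packet's L2/L3 header and control information occupy the first two cells of its series. The struct below is a hypothetical C model of that arrangement; the field layout is assumed, since the patent does not specify one.

```c
#include <stdint.h>

#define CELL_BYTES 64            /* data cell length used inside FPC 220 */

/* A "D" cell: the fixed-length unit used for transporting and storing data. */
struct d_cell {
    uint8_t bytes[CELL_BYTES];
};

/* A packet as stored in cells. Per the embodiment above, the L2 header,
 * L3 header, and control information are stored in the first two cells;
 * the remaining cells hold packet data.                                  */
struct cell_series {
    struct d_cell  header_cells[2];   /* L2/L3 headers + control info */
    struct d_cell *payload_cells;     /* remaining packet data        */
    uint32_t       n_payload_cells;
};
```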
- In the other direction, processing unit 332 or 334 receives data cells and notifications from first I/O unit 336, extracts certain information and packet data from the data cells, and creates a packet based on the extracted information. Processing unit 332 or 334 creates the packet header based on the information extracted from the data cells. In one embodiment, processing unit 332 or 334 creates L2 and L3 header information based on the extracted information. The created L2 and L3 header information constitutes a new header that the packet uses as it is subsequently transmitted through the physical link.
- R unit 342 may receive notifications from first I/O unit 336 and second I/O unit 338 .
- R unit 342 may provide route lookup, accounting, and policing functionality based on the notifications.
- R unit 342 may receive one or more forwarding tables from RE 130 ( FIG. 1 ) and use the keys, forwarding table(s), and encapsulation tables to perform route or encapsulation lookups.
- R unit 342 may insert the lookup result into a notification, which it may store in memory unit 340 .
- Memory unit 340 may temporarily store data cells from first I/O unit 336 and second I/O unit 338 and notifications from R unit 342 .
- Memory unit 340 may dispatch the notifications to first I/O unit 336 and second I/O unit 338.
- For example, first I/O unit 336 and second I/O unit 338 may use the address information in the notification to read out data cells from memory unit 340 that correspond to a notification.
- The notification received from memory unit 340 may have been modified by R unit 342 with route or encapsulation lookup results.
- First I/O unit 336 and second I/O unit 338 may update the data cells read out of memory unit 340 with information from the modified notification.
- The data cells, which now include information from the modified notification, are sent to processing unit 332, processing unit 334, or switch fabric 120, depending on which of first I/O unit 336 or second I/O unit 338 is processing the notification.
- When receiving cells from first I/O unit 336, processing units 332 and 334 may create new packet L2 and L3 header information before sending the packet to PIC 210 for eventual transmission to the physical links.
- FIG. 4 illustrates possible changes made to a packet by processing units 332 and 334 .
- Packet 401 when received by processing unit 332 or 334 from first I/O unit 336 , may include “old” MPLS (multi-protocol label switching) tags 402 (optional), an “old” L3 header field 403 (i.e., the L3 header associated with the packet as originally received by router 100 ), and the payload 404 of the packet.
- At this point, the original L2 header received with the packet has previously been stripped from the packet by processing units 332/334. Accordingly, no old L2 header information is shown associated with packet 401.
- MPLS involves setting a specific path for a given sequence of packets, identified by a label in the MPLS tag field.
- Packet 405 includes newly generated L2 header field 406 , new MPLS tags 407 (optional), new L3 header field 408 , and payload 404 .
- New MPLS tags 407 and L3 header field 408 may contain new information, although, in certain cases, the information may be the same as old MPLS tags 402 and old L3 field 403 .
- Old L3 header field 403 and old MPLS tags 402 are contained within the first two cells of a packet.
- A portion of the payload 404 may also be contained in the first two cells.
- In other implementations, old L3 header field 403 and old MPLS tags 402 may be contained in more or fewer than the first two cells of the packet.
- More generally, the particular mapping of packet information to cells may vary in other implementations consistent with the principles of the present invention.
- FIG. 5 illustrates elements of processing units 332 / 334 as they relate to generation of the L2/L3 header information in fields 406 - 408 .
- As shown in FIG. 5, processing unit 332/334 may include an input interface component 501 connected to a buffer 502 and descriptor reader component 503.
- Descriptor reader component 503 transmits information to L2/L3 engine component 504 , which generates the new L2/L3 header information of fields 406 - 408 and forwards the generated information to output interface 505 .
- In general, packet data from first I/O unit 336 is received by input interface component 501 as a series of cells.
- Input interface component 501 checks the received data for errors, and if the packet data is acceptable, stores the packet data to buffer 502 .
- Buffer 502 may be implemented as a series of logical queues 520a-520d (collectively "520"), in which each of queues 520 may correspond to a series of cells for a packet stream.
- Input interface component 501 monitors the cells stored in queues 520 , and when enough cells are present in a queue to start header processing, interface component 501 transfers up to the first two cells (i.e., the cells containing the packet header information) in the queue to descriptor reader component 503 .
- Descriptor reader component 503 may access external SRAM 510 before forwarding the two cells, along with information retrieved from SRAM 510 , to L2/L3 engine component 504 .
- SRAM 510 may store descriptor information required by L2/L3 engine component 504 in generating the new L2/L3 header information. Through SRAM 510 , additional control information or data for the new L2/L3 header information can be associated with the packet.
- L2/L3 engine component 504 generates the new L2/L3 header information for a packet, such as the information in fields 406 , 407 , and 408 ( FIG. 4 ). The operation of L2/L3 engine component 504 will be described in more detail below.
- Output interface component 505 receives the new header information from L2/L3 engine component 504 and the corresponding remaining payload for the packet from buffer 502 . Output interface 505 may then assemble the packet, such as packet 405 ( FIG. 4 ), using the received header and payload.
- In some situations, L2/L3 engine component 504 may send the new L2/L3 header information for a packet to input interface component 501 in addition to output interface component 505. One instance of such header "recycling" is described below with reference to FIG. 15.
- Otherwise, the new L2/L3 header information is sent to output interface component 505.
- As mentioned previously, the L2/L3 engine component 504 generates the L2 and L3 header information for a packet.
- To increase processing bandwidth, multiple L2/L3 engines may be implemented in parallel within the L2/L3 engine component 504.
- FIG. 6 is a diagram illustrating an embodiment of the L2/L3 engine component 504 having six parallel L2/L3 engines 602. In other implementations, more or fewer than six engines 602 may be used.
- Dispatch component 601 directs incoming packet header information to one of L2/L3 engines 602 .
- Dispatch component 601 may function to evenly distribute the incoming packet header information to the L2/L3 engines 602 such that total bandwidth through the engines is maximized.
- Header FIFO component 603 receives the processed L2/L3 packet header information generated by L2/L3 engines 602 and queues the processed header for transmission to output interface component 505 ( FIG. 5 ). Before transmitting the header information, header FIFO component 603 may first tell the output interface 505 the size of the header and the size of the remaining data in data buffer 502 . In response, output interface 505 may start to read the remaining data from data buffer 502 .
- Header FIFO component 603 also ensures that the packets it is about to transmit are in the sequence in which the packets of a data flow were received. Ordering may be ensured because unit 502, prior to dispatching a packet to the descriptor reader 503 and eventually L2/L3 engine component 504, reserves a space in the header FIFO 603. Because packets are dispatched in order by unit 502, the space allocated is also in order in the header FIFO component 603. Even when packets reach header FIFO component 603 at different times, the packets will be stored in the header FIFO 603 in the order they are dispatched, and thus will be transmitted in order.
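A sketch of this slot-reservation scheme, under the assumption that a slot is reserved at dispatch time and completions may arrive in any order. The FIFO depth, header size, and function names are illustrative; full/empty checks are omitted for brevity.

```c
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

#define HDR_SLOTS    64            /* illustrative FIFO depth (assumption) */
#define MAX_HDR_LEN  256           /* illustrative max header size         */

struct hdr_entry {
    bool     done;                 /* an engine has written the new header */
    uint32_t hdr_len;
    uint8_t  hdr[MAX_HDR_LEN];
};

static struct hdr_entry slots[HDR_SLOTS];
static uint32_t reserve_ptr;       /* advanced in dispatch (arrival) order */
static uint32_t transmit_ptr;      /* advanced in the same order           */

/* Called in packet-arrival order, before the packet goes to one of the
 * six parallel engines: the slot index fixes the transmit position.    */
uint32_t header_fifo_reserve(void)
{
    return reserve_ptr++ % HDR_SLOTS;
}

/* Called by whichever engine finishes, possibly out of order. */
void header_fifo_complete(uint32_t slot, const uint8_t *hdr, uint32_t len)
{
    slots[slot].hdr_len = len;
    memcpy(slots[slot].hdr, hdr, len);
    slots[slot].done = true;
}

/* Transmit side: headers leave strictly in reservation order. */
bool header_fifo_pop(struct hdr_entry *out)
{
    struct hdr_entry *head = &slots[transmit_ptr % HDR_SLOTS];
    if (!head->done)
        return false;              /* head-of-line header not ready yet */
    *out = *head;
    head->done = false;
    transmit_ptr++;
    return true;
}
```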
- FIG. 7 is a diagram illustrating an embodiment of one of L2/L3 engines 602 consistent with the principles of the invention.
- The portion of the L2/L3 engine 602 that generates the L2 header information may be implemented in parallel with the portion that generates the L3 header information. Thus, the L2 header information can be generated simultaneously with the L3 header information. As shown in FIG. 7, L2 execution unit 701 and L3 execution unit 703 are implemented in parallel with one another.
- The L2 execution unit 701 is responsible for building the L2 header information. In one implementation, L2 execution unit 701 is implemented as a single processing unit that processes one instruction each clock cycle.
- The L3 execution unit 703 is responsible for building the L3 header information. The L3 execution unit 703 may be constructed as three parallel processing units.
- L2 execution unit 701 operates on input packet header information by executing instructions stored in instruction memory 702 .
- Dispatch component 601 writes the input packet header information to memory 713 as cells containing the packet header information.
- L2 execution unit 701 retrieves the header data from memory 713 and generates the new L2 header information based on execution of the instructions stored in instruction memory 702 .
- L2 execution unit 701 may additionally read from or write to template memory 710 , tag descriptor memory 711 , and L2 descriptor memory 712 .
- Template memory 710 may store on-chip template data for a packet. Such template data may be common to different headers, and may be initially stored on-chip in a memory (not shown).
- Tag descriptor memory 711 and L2 descriptor memory 712 may include the descriptor information read by descriptor reader 503 from SRAM 510 .
- L3 execution unit 703 may execute instructions stored in instruction memory 704 . In a manner similar to L2 execution unit 701 , L3 execution unit 703 retrieves the header information for a packet from memory 713 and generates the new L3 header information based on execution of the instructions stored in instruction memory 704 .
- Buffers 720 and 721 store the header information generated by the execution units. More specifically, as L2 execution unit 701 generates the L2 header information, it may write it into buffer 720. Similarly, as L3 execution unit 703 generates the L3 header information, it writes the information into buffer 721.
- Build component 740 assembles the L2 and L3 header information from L2 output buffer 720 and L3 output buffer 721 .
- Registers 730 and 735 are storage registers useable by L2 execution 701 and L3 execution 703 .
- Registers 730 are each 32-bit general purpose storage registers and registers 735 are 32-bit mailbox registers. There may be, for example, sixteen general purpose storage registers 730 and eight mailbox registers 735.
- The mailbox registers 735 allow execution units 701 and 703 to exchange information with one another.
- The mailbox registers 735 may be pre-designated such that only certain resources can write to a particular one of the mailbox registers while only certain other resources can read from the particular mailbox register. Thus, for example, only the L2 execution unit 701 may be allowed to write to the first mailbox register and only L3 execution unit 703 may be allowed to read from this mailbox register.
- The interaction of execution units 701 and 703 with output buffers 720 and 721 will next be described.
- Input packet header data stored in memory 713 is read by L2 execution unit 701 and L3 execution unit 703 , as required by these units.
- As it generates the L2 header information, L2 execution unit 701 periodically writes to buffer 720. Similarly, L3 execution unit 703 periodically generates writes to buffer 721. These write operations to buffers 720 and 721 are not time correlated with one another.
- Eventually, L2 execution unit 701 and L3 execution unit 703 each finish writing to their respective output buffers 720 and 721, possibly at different times.
- When both execution units have finished, build component 740 combines the results of both buffers to form the complete header for the packet.
- In one implementation, the final packet header generated by output interface 505 includes L2 header fields 406 and 407 followed by the L3 header field 408.
- L2 execution unit 701 and L3 execution unit 703 are able to operate in parallel. This is advantageous, as it allows for more efficient generation of a packet header relative to a serial implementation that first generates L2 header information and then generates L3 header information.
- FIG. 8 is a flow chart illustrating operation of L2/L3 engines 602 consistent with principles of the invention.
- Received packet header data is accessed by L2 execution unit 701 and L3 execution unit 703 (acts 801, 802, 803).
- L2 execution unit 701 and L3 execution unit 703 independently operate on the received packet header.
- L2 execution unit 701 may write new L2 packet header data to buffer 720 (act 804 ) and L3 execution unit 703 may write new L3 packet header data to buffer 721 (act 805 ).
- When the L2 execution unit finishes writing the L2 packet data to buffer 720, it signals its completion to build component 740.
- The L3 execution unit similarly signals its completion of writing the L3 packet data to buffer 721 (acts 807 and 809).
- In response to the signals indicating completion of the L2 and L3 header data, build component 740 loads the L2 packet header from buffer 720, loads the L3 packet header from buffer 721, loads any remaining packet data from memory 713, and builds the complete packet header (act 810).
- As described above, L2/L3 engines 602 receive old packet header data from dispatch component 601, generate the new packet header data, and transmit the new packet header data to build component 740 from buffers 720 and 721. In this manner, each of the L2/L3 engines operates on a single packet at any given time. Consistent with an aspect of the invention, L2/L3 engines 602 may increase the efficiency with which they load and unload packets by simultaneously loading header data for a next packet while unloading the packet header data that was most recently generated.
- FIG. 9 is a diagram conceptually illustrating the loading and unloading of data in the L2/L3 engines 602 .
- Packet header data for a first packet is received from dispatch component 601 and loaded into packet header memory 713 .
- Dispatch component 601 may also transmit data to on-chip template memory 710 , tag descriptor memory 711 , and/or L2 descriptor memory 712 .
- Additionally, dispatch component 601 may preload general purpose registers 730 with information such as, for example, packet length information, fragmentation information for the packet, or information relating to the MPLS tags. This loaded information is then processed by the L2 execution unit 701 and L3 execution unit 703, as discussed above.
- After the new header information has been written, build component 740 unloads the buffers by reading the new packet header data from them. Simultaneously, dispatch component 601 may begin to transmit data associated with the next packet into memories 710-713. In this manner, processing of the next packet by L2/L3 engine 602 can begin sooner than if the L2/L3 engine waited until a packet was fully removed from the L2/L3 engine before beginning processing on the next packet.
- The L2/L3 engines 602 include general purpose storage registers 730 and mailbox registers 735.
- Mailbox registers 735 allow execution units 701 and 703 to exchange information with one another and with external devices, such as an external host computer (not shown).
- FIG. 10 is a diagram illustrating an implementation of the mailbox registers 735 consistent with the principles of the invention.
- Both L2 execution unit 701 and L3 execution unit 703 can write and read from any of general purpose registers 730 .
- Mailbox registers 735 are each "one-way" registers that are designed to be written to by a single resource and read from by another resource. Specifically, in the exemplary implementation of FIG. 10:
- three mailbox registers 1001 are used to transfer information from L3 execution unit 703 to L2 execution unit 701;
- one mailbox register 1002 is used to transfer information from L2 execution unit 701 to L3 execution unit 703;
- one mailbox register 1003 is used to transfer information from L2 execution unit 701 to host processor 1007;
- one mailbox register 1004 is used to transfer information from host processor 1007 to L2 execution unit 701;
- one mailbox register 1005 is used to transfer information from L3 execution unit 703 to host processor 1007; and
- one mailbox register 1006 is used to transfer information from host processor 1007 to L3 execution unit 703.
- Host processor 1007 may be, for example, a processor used in programming and debugging L2/L3 engines 602 .
- Each of mailbox registers 1001-1006 is a 32-bit register associated with an additional valid ("V") bit.
- The valid bits inform the resources associated with the mailbox (i.e., the L2 execution unit, the L3 execution unit, and the host processor) whether the mailbox is waiting to be read or is available to be written.
- When the L2 execution unit 701 wishes to write to mailbox register 1002, for example, it first checks the state of the valid bit. If the valid bit is not set (i.e., it is a logic zero), L2 execution unit 701 writes its data to mailbox register 1002 and sets the valid bit.
- If the valid bit is set, the L2 execution unit waits until the L3 execution unit 703 resets the valid bit before writing to mailbox register 1002.
- L3 execution unit 703 waits until the valid bit is set before reading from mailbox register 1002 . After reading the mailbox register, L3 execution unit 703 resets the valid bit.
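A minimal C model of this one-way handshake: a single writer sets the valid bit, a single reader clears it. In hardware these would be register accesses; the busy-wait form and function names here are only illustrative.

```c
#include <stdbool.h>
#include <stdint.h>

struct mailbox {
    volatile uint32_t data;   /* models a 32-bit mailbox register         */
    volatile bool     valid;  /* set by the writer, cleared by the reader */
};

/* Writer side (e.g., L2 execution unit 701 writing mailbox register 1002). */
void mailbox_write(struct mailbox *mb, uint32_t value)
{
    while (mb->valid)
        ;                     /* previous value not yet consumed */
    mb->data  = value;
    mb->valid = true;
}

/* Reader side (e.g., L3 execution unit 703 reading mailbox register 1002). */
uint32_t mailbox_read(struct mailbox *mb)
{
    while (!mb->valid)
        ;                     /* nothing to read yet */
    uint32_t value = mb->data;
    mb->valid = false;        /* frees the mailbox for the writer */
    return value;
}
```

Because each mailbox has exactly one designated writer and one designated reader, the single valid bit is sufficient to avoid races without any further locking.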
- Although the implementation shown in FIG. 10 includes a certain number of mailbox registers 1001-1006, one of ordinary skill in the art will recognize that more or fewer mailbox registers could be used.
- FIGS. 11 and 12 are diagrams illustrating, in additional detail, L2 execution unit 701 and L3 execution unit 703 , respectively, in accordance with concepts consistent with the invention.
- L2 execution unit 701 receives instructions from instruction memory 702 .
- Instruction memory 702 may be a 64-bit wide memory.
- The particular instruction to retrieve from instruction memory 702 is determined by the value in program counter 1102.
- The selected instruction is received by execution section 1103.
- Execution section 1103 may be implemented as a two-stage pipeline.
- Stack 1101 is a single entry memory that may be used to temporarily store, for example, an incremented version of the value in program counter 1102 .
- L3 execution unit 703 is constructed similarly to L2 execution unit 701 , except that in addition to a single stack 1201 and a single program counter 1202 , L3 execution unit 703 contains three parallel execution sections 1203 , 1204 , and 1205 . Each execution section 1203 - 1205 simultaneously receives an instruction from instruction memory 704 .
- Instruction memory 704 may be a 192-bit wide memory. Each 192-bit word read from instruction memory 704 is divided equally between execution sections 1203 - 1205 as three 64-bit instructions.
- Although there are three execution sections 1203-1205 in L3 execution unit 703, only one of the execution sections, execution section 1203, is connected to program counter 1202. Thus, only execution section 1203 controls the program flow.
- Execution sections 1203-1205 may communicate with one another through the general purpose registers 730. Generally, however, instructions are stored in memory 704 that allow execution sections 1203-1205 to function largely independently of one another. For example, one of the execution sections may work on computing a checksum while another of the execution sections computes a packet length.
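One way to picture this arrangement in code: each cycle, one 192-bit word is fetched and split into three 64-bit instructions, and only section 0 (modeling execution section 1203) can return a new program counter. The control structure below is a sketch; the instruction semantics behind exec_section are assumptions, not taken from the patent.

```c
#include <stdint.h>

#define L3_ISSUE_WIDTH 3            /* execution sections 1203-1205 */

typedef uint64_t l3_instr;          /* one 64-bit instruction */

/* Each 192-bit word in instruction memory 704 holds three instructions. */
extern l3_instr instruction_memory[][L3_ISSUE_WIDTH];

/* Hypothetical: executes one instruction in one section; only section 0
 * may return a branch target, since only it is wired to the PC (1202). */
extern uint32_t exec_section(int section, l3_instr ins, uint32_t pc);

void l3_execute(uint32_t pc, uint32_t steps)
{
    while (steps--) {
        l3_instr *word = instruction_memory[pc];          /* one 192-bit fetch */
        uint32_t next_pc = exec_section(0, word[0], pc);  /* controls flow     */
        exec_section(1, word[1], pc);                     /* e.g., checksum    */
        exec_section(2, word[2], pc);                     /* e.g., length calc */
        pc = next_pc;
    }
}
```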
- L2 execution unit 701 may read from or write to template memory 710 , tag descriptor memory 711 , and descriptor memory 712 when calculating new L2 header information.
- One possible implementation of memories 710-712 is three separate first-in, first-out (FIFO) buffers, with different registers used in conjunction with the buffers to keep track of data within them. Such an implementation, however, entails a significant delay for L2 execution unit 701 to obtain data from the memories 710-712.
- FIG. 13 is a diagram illustrating the memories 710 , 711 , and 712 of FIG. 7 in additional detail in an implementation consistent with the principles of the invention.
- As shown in FIG. 13, a single descriptor memory 1310 may include template memory 710, tag descriptor memory 711, and L2 descriptor memory 712.
- That is, template memory 710, tag descriptor memory 711, and L2 descriptor memory 712 may be logical portions of descriptor memory 1310, rather than separate physical memories.
- In one implementation, descriptor memory 1310 may be 40 bytes in size, and may be configured to provide data from any memory location addressed (e.g., random access).
- Template memory 710 may store on-chip template data for a packet, and may begin at address 0 of descriptor memory 1310 . Such template data may be common to different headers, and may be loaded into template memory 710 from an on-chip memory (not shown).
- Tag descriptor memory 711 may store tag descriptor data for MPLS tags from SRAM 510 , and may begin at an address of descriptor memory 1310 that is determined by the size of the on-chip template data in template memory 710 .
- L2 descriptor memory 712 may include the L2 descriptor information read from SRAM 510 , and may begin at an address of descriptor memory 1310 that is determined by the size of both the on-chip template data in template memory 710 and the tag descriptor data in tag descriptor memory 711 . Data may be respectively written to template memory 710 , tag descriptor memory 711 , and L2 descriptor memory 712 by dispatch component 601 in a known order.
- In some implementations, descriptor memory 712 may include descriptor information from a level x (Lx) protocol, where x may indicate one of several protocols (e.g., L2, L3, etc.). Hence, descriptor memory 712 may be referred to generically as an "Lx descriptor memory" 712.
- Reads by L2 execution unit 701 from memories 710-712 may be specified by different instruction sequences, selected according to the L2 protocol (e.g., 802.3, SNAP, or PPP) of the packet whose L2 header is being processed.
- When data is read from template memory 710, for example, the template pointer may be incremented by one or more addresses, determined by the number of bytes read, to simulate reading from a FIFO buffer.
- In the implementation shown in FIG. 13, L2 execution unit 701 need store only three pointers in logic, and may retrieve data from memories 710-712 more efficiently than from separate FIFO buffers. Further, data may be alternately read from memories 710-712 by switching from one of the respective pointers to another. Because of the ability to rapidly obtain data from memories 710-712, L2 execution unit 701 may process L2 headers more quickly.
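A sketch of the single descriptor memory with three region pointers providing FIFO-style reads. The 40-byte size and back-to-back region layout follow the text; the structure and function names are illustrative, and bounds checks are omitted.

```c
#include <stdint.h>
#include <string.h>

#define DESC_MEM_BYTES 40          /* size given for descriptor memory 1310 */

enum region { TEMPLATE = 0, TAG = 1, L2_DESC = 2, N_REGIONS = 3 };

struct descriptor_memory {
    uint8_t  mem[DESC_MEM_BYTES];  /* one physical, randomly addressable RAM */
    uint32_t base[N_REGIONS];      /* start of each logical portion          */
    uint32_t ptr[N_REGIONS];       /* per-region read pointer (kept in logic) */
};

/* Dispatch writes template, tag, and L2 descriptor data back to back in a
 * known order, so each region's base is the running total of the earlier
 * regions' sizes.                                                         */
void desc_mem_layout(struct descriptor_memory *d,
                     uint32_t template_len, uint32_t tag_len)
{
    d->base[TEMPLATE] = 0;
    d->base[TAG]      = template_len;
    d->base[L2_DESC]  = template_len + tag_len;
    memcpy(d->ptr, d->base, sizeof d->ptr);
}

/* FIFO-style read: copy n bytes from the region's current pointer and
 * advance the pointer by the number of bytes read, simulating a pop
 * from a dedicated FIFO buffer.                                        */
void desc_mem_read(struct descriptor_memory *d, enum region r,
                   uint8_t *out, uint32_t n)
{
    memcpy(out, &d->mem[d->ptr[r]], n);
    d->ptr[r] += n;
}
```

Switching between regions costs nothing more than selecting a different pointer, which is why interleaved reads from the three logical memories can be faster here than with three physically separate FIFOs.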
- The L3 header 408 generated by L3 execution unit 703 may contain a checksum. The checksum may be used to detect corruption in a received packet. For example, the checksum for IPv4 may be 16 bits and may be contained within the L3 header at a known location.
- As explained above, L3 execution unit 703 generates L3 header information and writes the information into L3 output buffer 721.
- The L3 header information is written to L3 output buffer 721 sequentially, with the checksum information being initially written as zeros.
- One way to generate the checksum would be for L3 execution unit 703 to wait until L3 output buffer 721 contains an entire L3 header, read the entire header, compute the checksum, and write the checksum to the appropriate position in L3 output buffer 721.
- Such checksum computation would occupy L3 execution unit 703 for a large number of instruction cycles, thereby delaying further L3 header processing.
- FIG. 14 is a diagram illustrating checksum processing element 1400 and its relation to L3 execution unit 703 .
- Checksum processing element 1400 may be situated in the data flow between L3 execution unit 703 and L3 output buffer 721 , as shown in FIG. 14 .
- Checksum processing element 1400 may include a checksum generator 1410 and a checksum accumulator 1420 .
- One of the general purpose registers 730 described in FIG. 7 may be used as checksum accumulator 1420 .
- Checksum generator 1410 may include logic configured to compute a checksum using inputs of L3 information from L3 execution unit 703 and checksum data from checksum accumulator 1420 when instructed by the L3 execution unit 703 . In some implementations, such an instruction may be implicit (i.e., the checksum may be computed by default). In one implementation, L3 execution unit 703 may initially load checksum accumulator 1420 with an initial value when it writes the first L3 information for a new L3 header to L3 output buffer 721 . In another implementation consistent with the present invention, L3 execution unit 703 may pre-load checksum accumulator 1420 with a value before it writes the first L3 information for a new L3 header to L3 output buffer 721 .
- checksum generator 1410 may read stored checksum data from checksum accumulator 1420 and compute a new checksum using the stored checksum data and the L3 information being written to L3 output buffer 721 .
- Checksum generator 1410 may store this new (partial) checksum in checksum accumulator 1420 .
- In this manner, checksum processing element 1400 updates the L3 checksum "on the fly" (i.e., concurrently) as the L3 header information is written to L3 output buffer 721 by L3 execution unit 703.
- When the entire L3 header has been written, checksum accumulator 1420 contains the checksum for that L3 header.
- L3 execution unit 703 may then write the checksum from checksum accumulator 1420 to the appropriate address in L3 output buffer 721. Because of the checksum processing element 1400, L3 execution unit 703 may use only two instruction cycles for checksum processing: one instruction cycle to initially load checksum accumulator 1420, and one instruction cycle to write the complete checksum from checksum accumulator 1420 to L3 output buffer 721. Because of the ability to concurrently compute the L3 header checksum, L3 execution unit 703 may process L3 headers more quickly. Although described with respect to L3 headers, it is specifically envisioned that checksum processing elements similar to checksum processing element 1400 may be used to compute other checksums on the fly as well.
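A sketch of the accumulate-as-you-write scheme, using the standard 16-bit one's-complement sum that IPv4 headers use; the two instruction cycles in the text correspond to the init and finalize calls here. The word width and API names are assumptions. Note that absorbing the zeroed checksum field is harmless, since adding zero does not change a one's-complement sum.

```c
#include <stdint.h>

static uint32_t csum_acc;            /* models checksum accumulator 1420 */

/* Load the accumulator when the first header word is written
 * (the first of the two instruction cycles).                  */
void csum_init(uint16_t initial)
{
    csum_acc = initial;
}

/* Models checksum generator 1410: folds each 16-bit word into the
 * running one's-complement sum as the word passes to L3 output
 * buffer 721, costing the execution unit no extra instructions.   */
void csum_absorb(uint16_t word)
{
    csum_acc += word;
    csum_acc = (csum_acc & 0xFFFF) + (csum_acc >> 16);  /* end-around carry */
}

/* Finalize: the IPv4 header checksum is the one's complement of the
 * accumulated sum; writing it back is the second instruction cycle.  */
uint16_t csum_final(void)
{
    return (uint16_t)~csum_acc;
}
```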
- A packet may need to be fragmented when its length exceeds the maximum transfer unit (MTU) of its destination interface. For example, standard Ethernet has an MTU of about 1500 bytes, while other interface types may have, for example, MTUs around 9000 bytes. Internet protocols such as IPv4 provide for fragmenting such packets.
- FIG. 15 is a flow chart illustrating fragment handling by the L2/L3 engine component 504 .
- L2/L3 engine component 504 may receive the first two cells of a packet, which contain the packet header information, from descriptor reader 503 [act 1510 ].
- The header information may include the length of the packet. Alternately, the packet length may be detected from the packet itself.
- The header information may also be used to obtain the MTU size of the interface that is the packet's destination. For example, the header information may be used to construct a key, which may be used to look up MTU information from SRAM 510.
- L2/L3 engine component 504 may determine whether the length of the packet associated with the header information exceeds the MTU size of the destination interface [act 1520 ].
- If the packet length does not exceed the MTU size of the destination interface, L2/L3 engine component 504 may generate a new header for the packet, and may transmit the new header to output interface component 505 [act 1530].
- Output interface component 505 may assemble the new header information and packet data from buffer 502 into a packet.
- If the packet length exceeds the MTU size of the destination interface, L2/L3 engine component 504 may generate a first fragment header and send a first fragment with the header [act 1540].
- The first fragment header may include the size of any payload data in the first two cells and the size of the packet data to be read from buffer 502. This size information may be used to calculate an updated packet length.
- The updated packet length may be calculated by L2/L3 engine component 504 and included in the first fragment header. Alternately, the updated packet length may not be included in the first fragment header, but may be calculated from information in the first fragment header in later processing (e.g., act 1560).
- The first fragment header also may include recycle information, such as a bit indicating previous fragmentation and a length of the L2 header.
- The first fragment header may be transmitted to output interface component 505 for generation of a first packet fragment, and may also be transmitted to input interface component 501 [act 1550].
- Output interface component 505 may assemble the first fragment header information and packet data from buffer 502 into a first packet fragment. Meanwhile, input interface component 501 may send the first fragment header, via the descriptor reader 503, to L2/L3 engine component 504.
- L2/L3 engine component 504 may use the first fragment header to generate one or more subsequent fragment headers by determining whether the updated packet length (either contained in the first fragment header or calculated) exceeds the MTU size of the destination interface [act 1560 ]. If the updated packet length is less than the MTU size of the destination interface, L2/L3 engine component 504 may generate a new header for the packet, and may transmit the new header to output interface component 505 [act 1570 ]. Output interface component 505 may assemble the new header information and packet data from buffer 502 into a second packet fragment.
- If the updated packet length exceeds the MTU size of the destination interface, L2/L3 engine component 504 may generate a next fragment header [act 1580].
- The next fragment header may include the size of the packet data to be read from buffer 502. This size information may be used to calculate an updated packet length.
- Again, the updated packet length may be calculated by L2/L3 engine component 504 and included in the next fragment header. Alternately, the updated packet length may not be included in the next fragment header, but may be calculated from information in the next fragment header in later processing (e.g., act 1560).
- The next fragment header also may include recycle information, such as a bit indicating previous fragmentation and a length of the L2 header.
- The next fragment header (and any subsequent fragment headers) may be transmitted to output interface component 505 and input interface component 501, and may be processed as described in acts 1550-1580.
- In this manner, L2/L3 engine component 504 may recycle packet headers when the packet's destination interface requires that the packet be fragmented.
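The recycle loop of acts 1520-1580 reduces to the comparison below: while the remaining length exceeds the destination MTU, emit a fragment header and feed an updated header back through the engine. The helper names are hypothetical stand-ins for the acts in FIG. 15.

```c
#include <stdint.h>

/* Hypothetical helpers standing in for acts of FIG. 15. */
extern void emit_full_header(uint32_t pkt_len);          /* acts 1530/1570 */
extern uint32_t emit_fragment_header(uint32_t pkt_len,   /* acts 1540/1580;  */
                                     uint32_t mtu);      /* returns bytes consumed */

/* Each loop iteration models one trip of a header through L2/L3 engine
 * component 504: headers are "recycled" via input interface component 501
 * until the remaining packet data fits the destination interface's MTU.   */
void fragment_packet(uint32_t pkt_len, uint32_t mtu)
{
    while (pkt_len > mtu) {                    /* acts 1520 and 1560 */
        uint32_t consumed = emit_fragment_header(pkt_len, mtu);
        pkt_len -= consumed;                   /* updated packet length */
    }
    emit_full_header(pkt_len);                 /* final fragment fits the MTU */
}
```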
- As described above, an L2 engine may include pointers to efficiently read descriptor data from a single descriptor memory divided into a number of logical portions.
- The L3 engine may have dedicated hardware associated with it to generate and update a checksum for the L3 header when data is written to the L3 output buffer.
- Also, the L2/L3 engine may recycle packet headers when the packet's destination interface requires that the packet be fragmented. In this manner, an efficient L2/L3 generation engine is achieved.
- PFEs 110 may be implemented in hardware, software, or some combination thereof. For example, various portions of PFEs 110 may be implemented in application-specific integrated circuits (ASICs). The ASICs may be configured to perform some processing via dedicated logic, and may also be configured to perform some processing using microcode instructions that may be stored in memory. Those skilled in the router art will appreciate that the invention described herein might be practiced using a variety of hardware configurations in addition to, or instead of, ASICs. For example, some combination of general purpose processors, digital signal processors (DSPs), and programmable gate arrays (PGAs) may also be utilized to implement the functionality described herein.
Abstract
A packet header processing engine includes a memory having a number of distinct portions for respectively storing different types of descriptor information for a header of a packet. A packet header processing unit includes a number of pointers corresponding to the number of distinct memory portions. The packet header processing unit is configured to retrieve the different types of descriptor information from the number of distinct memory portions and to generate header information from the different types of descriptor information.
Description
- This application is related to six concurrently-filed applications (attorney docket nos. 0023-0063, 0023-0064, 023-0065, 023-0066, 023-0067, and 0023-0069), Ser. Nos. ______, ______, ______, ______, ______, and ______, respectively.
- 1. Field of the Invention
- The principles of the invention relate generally to high performance network routers, and more particularly, to the processing of network information in a high performance router.
- 2. Description of Related Art
- Routers receive data on a physical media, such as optical fiber, analyze the data to determine its destination, and output the data on a physical media in accordance with the destination. Routers were initially designed using a general purpose processor executing large software programs. As line rates and traffic volume increased, however, general purpose processors could not scale to meet these new demands. For example, as functionality was added to the software, such as accounting and policing functionality, these routers suffered performance degradation. In some instances, the routers failed to handle traffic at line rate when the new functionality was turned on.
- To meet the new demands, purpose-built routers were designed with components optimized for routing. They not only handle higher line rates and higher network traffic volume, they also add functionality without compromising line rate performance.
- A purpose-built router may include a number of input and output ports from which it transmits and receives information packets. A packet received at one port is directed to its appropriate output port based on an examination and processing of the packet's header information, which includes an indication of the packet's destination. A high-performance router must accordingly examine and process the header information at a high speed.
- Thus, it is desirable to improve the speed and efficiency of packet header processing in high-performance routers.
- Systems and methods consistent with the principles of the invention, among other things, provide for improved memory access within a packet header processing engine.
- In accordance with one purpose of the invention as embodied and broadly described herein, a packet header processing engine may include a template memory configured to store template data and a tag descriptor memory configured to store tag descriptor data. The engine may also include a level x (Lx) descriptor memory configured to store Lx descriptor data. An Lx header execution unit may be connected to the template memory, the tag descriptor memory, and the Lx descriptor memory. The Lx header execution unit may be configured to construct an Lx header from at least one of the template data, the tag descriptor data, and the Lx descriptor data.
- In another implementation consistent with the principles of the invention, a packet header processing engine may include a memory configured to store at least one of template data, tag descriptor data, and level 2 (L2) descriptor data as continuous portions of the memory. A packet processing unit may be connected to the memory. The packet processing unit may be configured to construct packet header information according to a packet header protocol from the at least one of the template data, the tag descriptor data, and the L2 descriptor data.
- In a further implementation consistent with the principles of the invention, a method may include reading descriptor data from a memory and incrementing at least one pointer to the descriptor data in the memory upon the reading. Packet header information may be constructed from the descriptor data according to a packet header protocol.
- In yet another implementation consistent with the principles of the invention, a network device for processing packets may include a buffer configured to store the packets, including header data and payload data for the packets. A descriptor reader component may be coupled to the buffer. The descriptor reader component may be configured to receive the header data for the packets stored in the buffer and configured to look up descriptor information relating to the received packet header data. A memory may be coupled to the descriptor reader component. The memory may include a number of distinct portions for receiving the descriptor information from the descriptor reader component. A packet header processing unit may be coupled to the descriptor reader component and may include a number of pointers corresponding to the number of distinct memory portions. The packet header processing unit may be configured to receive the packet header data from the descriptor reader component and the descriptor information from the number of distinct memory portions.
- In a further implementation consistent with the principles of the invention, a packet header processing engine may include a memory having a number of distinct portions for respectively storing different types of descriptor information for a header of a packet. A packet header processing unit may include a number of pointers corresponding to the number of distinct memory portions. The packet header processing unit may be configured to retrieve the different types of descriptor information from the number of distinct memory portions and to generate header information from the different types of descriptor information.
- The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate an embodiment of the invention and, together with the description, explain the invention. In the drawings,
-
- FIG. 1 is a block diagram illustrating an exemplary routing system in which principles consistent with the invention may be implemented;
- FIG. 2 is an exemplary detailed block diagram illustrating portions of the routing system shown in FIG. 1;
- FIG. 3 is a diagram illustrating portions of FIG. 2 in additional detail;
- FIG. 4 is a diagram illustrating components of a packet;
- FIG. 5 is a diagram illustrating elements of the processing units shown in FIG. 3;
- FIG. 6 is a diagram illustrating an L2/L3 engine component having six parallel engines;
- FIG. 7 is a diagram illustrating one of the L2/L3 engines shown in FIG. 6;
- FIG. 8 is a flow chart illustrating operation of the L2/L3 engines shown in FIGS. 6 and 7;
- FIG. 9 is a diagram conceptually illustrating the loading and unloading of data in the L2/L3 engines 602;
- FIG. 10 is a diagram illustrating the operation of mailbox registers;
- FIG. 11 is a diagram illustrating the L2 execution unit of FIG. 7 in additional detail;
- FIG. 12 is a diagram illustrating the L3 execution unit of FIG. 7 in additional detail;
- FIG. 13 is a diagram illustrating the descriptor memories of FIG. 7 in additional detail;
- FIG. 14 is a diagram illustrating checksum processing hardware in conjunction with the L3 execution unit of FIG. 7; and
- FIG. 15 is a flow chart illustrating fragment handling by the L2/L3 engine component of FIG. 5.
- The following detailed description of the invention refers to the accompanying drawings. The same reference numbers may be used in different drawings to identify the same or similar elements. Also, the following detailed description does not limit the invention. Instead, the scope of the invention is defined by the appended claims and equivalents of the claim limitations.
- As described herein, a processing engine may include pointers to read descriptor data from a single descriptor memory having logical portions. In one implementation, three pointers enable first-in, first-out (FIFO) behavior from three contiguous portions of the descriptor memory.
- FIG. 1 is a block diagram illustrating an exemplary routing system 100 in which the present invention may be implemented. System 100 receives a data stream from a physical link, processes the data stream to determine destination information, and transmits the data stream out on a link in accordance with the destination information. System 100 may include packet forwarding engines (PFEs) 110, a switch fabric 120, and a routing engine (RE) 130.
- RE 130 performs high level management functions for system 100. For example, RE 130 communicates with other networks and systems connected to system 100 to exchange information regarding network topology. RE 130 creates routing tables based on network topology information, creates forwarding tables based on the routing tables, and forwards the forwarding tables to PFEs 110. PFEs 110 use the forwarding tables to perform route lookup for incoming packets. RE 130 also performs other general control and monitoring functions for system 100.
- PFEs 110 are each connected to RE 130 and switch fabric 120. PFEs 110 receive data at ports on physical links connected to a network, such as a wide area network (WAN), a local area network (LAN), or a device. Each physical link could be one of many types of transport media, such as optical fiber or Ethernet cable. The data on the physical link is formatted according to one of several protocols, such as the synchronous optical network (SONET) standard or Ethernet.
- PFE 110 processes incoming data by stripping off the data link layer. PFE 110 converts header information from the remaining data into data structures referred to herein as "notifications" and "cells" (where a cell is a fixed length data unit). For example, in one embodiment, the data remaining after the data link layer is stripped off is packet data. PFE 110 includes the layer 2 (L2) and layer 3 (L3) packet header information, some control information regarding the packets, and the packet data in a series of cells called "D" cells. In one embodiment, the L2, L3, and the control information are stored in the first two cells of the series of cells.
- In general, L2 header information refers to header information used to steer packets within LANs. L3 header information, in contrast, provides connectivity between different LANs. In this sense, the L3 header information provides more general destination information than the L2 header. An incoming packet may include both L2 and L3 header information. Router 100 may modify the packet's L2 and L3 header information before transmitting the packet to its next destination.
- PFE 110 forms a notification based on the L2, L3, and control information, and performs a route lookup using the notification and the routing table from RE 130 to determine destination information. PFE 110 may also further process the notification to perform protocol-specific functions, policing, and accounting, and might even modify the notification to form a new notification.
- If the determined destination indicates that the packet should be sent out on a physical link connected to PFE 110, then PFE 110 retrieves the cells for the packet, converts the notification or new notification into header information, forms a packet using the packet data from the cells and the header information, and transmits the packet from the port associated with the physical link.
- If the destination indicates that the packet should be sent to another PFE via switch fabric 120, then PFE 110 retrieves the cells for the packet, modifies the first two cells with the new notification and new control information, if necessary, and sends the cells to the other PFE via switch fabric 120. Before transmitting the cells over switch fabric 120, PFE 110 appends a sequence number to each cell, which allows the receiving PFE to reconstruct the order of the transmitted cells. Additionally, the receiving PFE uses the notification to form a packet using the packet data from the cells, and sends the packet out on the port associated with the appropriate physical link of the receiving PFE.
- In summary, in one embodiment, RE 130, PFEs 110, and switch fabric 120 perform routing based on packet-level processing. PFEs 110 store each packet in cells while performing a route lookup using a notification, which is based on packet header information, including L2 and L3 layer header information. A packet might be received on one PFE and go back out to the network on the same PFE, or be sent through switch fabric 120 to be sent out to the network on a different PFE.
- FIG. 2 is an exemplary detailed block diagram illustrating portions of routing system 100. PFEs 110 connect to one another through switch fabric 120. Each of the PFEs may include one or more physical interface cards (PICs) 210 and flexible port concentrators (FPCs) 220.
- PIC 210 may transmit data between a physical link and FPC 220. Different PICs are designed to handle different types of physical links. For example, one of PICs 210 may be an interface for an optical link while another PIC may be an interface for an Ethernet link.
- FPCs 220 perform routing functions and handle packet transfers to and from PICs 210 and switch fabric 120. For each packet it handles, an FPC performs the previously-discussed route lookup function. Although FIG. 2 shows two of PICs 210 connected to each of FPCs 220 and three FPCs 220 connected to switch fabric 120, in other embodiments consistent with principles of the invention there can be more or fewer PICs 210 and FPCs 220.
- FIG. 3 is an exemplary configuration of one of FPCs 220 according to an implementation consistent with the principles of the invention. Each FPC may include processing units 332 and 334, first input/output (I/O) unit 336, second I/O unit 338, memory unit 340, and R unit 342. Each processing unit 332 and 334 connects to one or more of PICs 210.
- As will be described in greater detail below, processing units 332 and 334 process packet data flowing between PICs 210 and first I/O unit 336. Each processing unit 332 and 334 processes packet data received from the PICs connected to it, as well as data cells received from first I/O unit 336.
- For example, processing unit 332 or 334 may process packets received from PIC 210 to convert the packets into data cells, and transmit the data cells to first I/O unit 336. Data cells are the data structure used by FPC 220 internally for transporting and storing data. In one implementation, data cells are 64 bytes in length.
- In the other direction, processing unit 332 or 334 receives data cells from first I/O unit 336, extracts certain information and packet data from the data cells, and creates a packet based on the extracted information. Processing unit 332 or 334 then sends the packet to PIC 210 for transmission on the physical link.
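- As a rough software analogue of the cell conversion just described, the following C sketch chops a packet into fixed-length cells. The 64-byte size comes from the text above; the helper name and the zero-padding of the last cell are assumptions, and real cells additionally carry the header and control information in the first two cells of a series.

```c
#include <stdint.h>
#include <string.h>

#define CELL_SIZE 64   /* one implementation uses 64-byte cells */

struct cell {
    uint8_t data[CELL_SIZE];
};

/* Chop a packet into fixed-length cells, zero-padding the last one.
 * Returns the number of cells written (at most 'max'). */
static size_t packet_to_cells(const uint8_t *pkt, size_t len,
                              struct cell *cells, size_t max)
{
    size_t n = 0;
    while (len > 0 && n < max) {
        size_t chunk = len < CELL_SIZE ? len : CELL_SIZE;
        memset(cells[n].data, 0, CELL_SIZE);
        memcpy(cells[n].data, pkt, chunk);
        pkt += chunk;
        len -= chunk;
        n++;
    }
    return n;
}
```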
- R unit 342 may receive notifications from first I/O unit 336 and second I/O unit 338. R unit 342 may provide route lookup, accounting, and policing functionality based on the notifications. R unit 342 may receive one or more forwarding tables from RE 130 (FIG. 1) and use the keys, forwarding table(s), and encapsulation tables to perform route or encapsulation lookups. R unit 342 may insert the lookup result into a notification, which it may store in memory unit 340.
- Memory unit 340 may temporarily store data cells from first I/O unit 336 and second I/O unit 338 and notifications from R unit 342. Memory unit 340 may dispatch the notifications to first I/O unit 336 and second I/O unit 338. In response, first I/O unit 336 and second I/O unit 338 may use the address information in the notification to read out data cells from memory unit 340 that correspond to a notification. The notification received from memory unit 340 may have been modified by R unit 342 with route or encapsulation lookup results. First I/O unit 336 and second I/O unit 338 may update the data cells read out of memory unit 340 with information from the modified notification. The data cells, which now include information from the modified notification, are sent to processing unit 332, processing unit 334, or switch fabric 120, depending on which of first I/O unit 336 or second I/O unit 338 is processing the notification.
- As previously mentioned, processing units 332 and 334, upon receiving data cells from first I/O unit 336, may create new packet L2 and L3 header information before sending the packet to PIC 210 for eventual transmission to the physical links. FIG. 4 illustrates possible changes made to a packet by processing units 332 and 334.
- Packet 401, when received by processing unit 332 or 334 from first I/O unit 336, may include "old" MPLS (multi-protocol label switching) tags 402 (optional), an "old" L3 header field 403 (i.e., the L3 header associated with the packet as originally received by router 100), and the payload 404 of the packet. At this point, the original L2 header received with the packet has previously been stripped from the packet by processing units 332/334. Accordingly, no old L2 header information is shown associated with packet 401. MPLS involves setting a specific path for a given sequence of packets, identified by a label in the MPLS tag field.
- Processing unit 332 or 334 processes packet 401 and generates a modified version of the packet, labeled as packet 405. Packet 405 includes newly generated L2 header field 406, new MPLS tags 407 (optional), new L3 header field 408, and payload 404. New MPLS tags 407 and L3 header field 408 may contain new information, although, in certain cases, the information may be the same as old MPLS tags 402 and old L3 field 403.
- As is also illustrated in FIG. 4, old L3 header field 403 and old MPLS tags 402 are contained within the first two cells of a packet. A portion of the payload 404 may also be contained in the first two cells. In alternate implementations, header field 403 and old MPLS tags 402 may be contained in more or fewer than the first two cells of the packet. The particular mapping of packet information to cells may vary in other implementations consistent with the principles of the present invention.
- FIG. 5 illustrates elements of processing units 332/334 as they relate to generation of the L2/L3 header information in fields 406-408. As shown, processing unit 332/334 may include an input interface component 501 connected to a buffer 502 and descriptor reader component 503. Descriptor reader component 503 transmits information to L2/L3 engine component 504, which generates the new L2/L3 header information of fields 406-408 and forwards the generated information to output interface 505.
- In operation, packet data from first I/O unit 336 is received by input interface component 501 as a series of cells. Input interface component 501 checks the received data for errors, and if the packet data is acceptable, stores the packet data to buffer 502. Buffer 502 may be implemented as a series of logical queues 520a-520d (collectively "520"), in which each of queues 520 may correspond to a series of cells for a packet stream. Input interface component 501 monitors the cells stored in queues 520, and when enough cells are present in a queue to start header processing, interface component 501 transfers up to the first two cells (i.e., the cells containing the packet header information) in the queue to descriptor reader component 503. Descriptor reader component 503 may access external SRAM 510 before forwarding the two cells, along with information retrieved from SRAM 510, to L2/L3 engine component 504. SRAM 510 may store descriptor information required by L2/L3 engine component 504 in generating the new L2/L3 header information. Through SRAM 510, additional control information or data for the new L2/L3 header information can be associated with the packet.
- L2/L3 engine component 504 generates the new L2/L3 header information for a packet, such as the information in fields 406-408 (FIG. 4). The operation of L2/L3 engine component 504 will be described in more detail below.
- Output interface component 505 receives the new header information from L2/L3 engine component 504 and the corresponding remaining payload for the packet from buffer 502. Output interface 505 may then assemble the packet, such as packet 405 (FIG. 4), using the received header and payload.
- In certain circumstances (e.g., handling packet fragmentation), L2/L3 engine component 504 may send the new L2/L3 header information for a packet to input interface component 501 and to output interface component 505. One instance of such header "recycling" is described below with reference to FIG. 15. In typical circumstances, however, the new L2/L3 header information is sent to output interface component 505.
- As previously mentioned, the L2/L3 engine component 504 generates the L2 and L3 header information for a packet. To increase bandwidth, multiple L2/L3 engines may be implemented in parallel within the L2/L3 engine component 504. FIG. 6 is a diagram illustrating an embodiment of the L2/L3 engine component 504 having six parallel L2/L3 engines 602. In other implementations, more or fewer than six engines 602 may be used. Dispatch component 601 directs incoming packet header information to one of L2/L3 engines 602. Dispatch component 601 may function to evenly distribute the incoming packet header information to the L2/L3 engines 602 such that total bandwidth through the engines is maximized.
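- The text specifies only that dispatch component 601 spreads load evenly across the engines; the policy itself is not disclosed. One minimal policy consistent with that description is round-robin with busy-skipping, sketched here in C with hypothetical names:

```c
#define NENGINES 6   /* the illustrated embodiment has six engines */

struct engine {
    int busy;        /* nonzero while the engine holds a header */
};

/* Pick an engine for the next header: rotate through the engines,
 * skipping busy ones, so load spreads evenly over time. */
static int dispatch_pick(struct engine eng[NENGINES])
{
    static int next = 0;
    for (int tries = 0; tries < NENGINES; tries++) {
        int e = (next + tries) % NENGINES;
        if (!eng[e].busy) {
            next = (e + 1) % NENGINES;
            return e;
        }
    }
    return -1;       /* all engines busy: hold the header for now */
}
```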
- Header FIFO component 603 receives the processed L2/L3 packet header information generated by L2/L3 engines 602 and queues the processed header for transmission to output interface component 505 (FIG. 5). Before transmitting the header information, header FIFO component 603 may first tell the output interface 505 the size of the header and the size of the remaining data in data buffer 502. In response, output interface 505 may start to read the remaining data from data buffer 502.
- In one implementation, header FIFO component 603 ensures that the packets it is about to transmit are in the sequence in which the packets of a data flow were received. Ordering may be ensured because unit 502, prior to dispatching a packet to the descriptor reader 503 and eventually L2/L3 engine component 504, reserves a space in the header FIFO 603. Because packets are dispatched in order by unit 502, the spaces allocated are also in order in the header FIFO component 603. Even when packets reach header FIFO component 603 at different times, the packets will be stored in the header FIFO 603 in the order they were dispatched, and thus will be transmitted in order.
- FIG. 7 is a diagram illustrating an embodiment of one of L2/L3 engines 602 consistent with the principles of the invention. In engine 602, the portion of the L2/L3 engine 602 that generates the L2 header information may be implemented in parallel with the portion of the L2/L3 engine that generates the L3 header information. In this manner, the L2 header information can be generated simultaneously with the L3 header information. More specifically, L2 execution unit 701 and L3 execution unit 703 are implemented in parallel with one another. The L2 execution unit 701 is responsible for building the L2 header information. In one implementation, L2 execution unit 701 is implemented as a single processing unit that processes one instruction each clock cycle. The L3 execution unit 703 is responsible for building the L3 header information. The L3 execution unit 703 may be constructed as three parallel processing units.
- L2 execution unit 701 operates on input packet header information by executing instructions stored in instruction memory 702. Dispatch component 601 writes the input packet header information to memory 713 by storing cells including packet header information in memory 713. In operation, L2 execution unit 701 retrieves the header data from memory 713 and generates the new L2 header information based on execution of the instructions stored in instruction memory 702. When calculating the new L2 header information, L2 execution unit 701 may additionally read from or write to template memory 710, tag descriptor memory 711, and L2 descriptor memory 712. Template memory 710 may store on-chip template data for a packet. Such template data may be common to different headers, and may be initially stored on-chip in a memory (not shown). Tag descriptor memory 711 and L2 descriptor memory 712 may include the descriptor information read by descriptor reader 503 from SRAM 510.
- L3 execution unit 703 may execute instructions stored in instruction memory 704. In a manner similar to L2 execution unit 701, L3 execution unit 703 retrieves the header information for a packet from memory 713 and generates the new L3 header information based on execution of the instructions stored in instruction memory 704.
- Buffers 720 and 721 store the header information generated by the execution units. As L2 execution unit 701 generates the L2 header information, it may write it into buffer 720. Similarly, as L3 execution unit 703 generates the L3 header information, it writes the information into buffer 721.
- Build component 740 assembles the L2 and L3 header information from L2 output buffer 720 and L3 output buffer 721.
- Registers 730 and 735 may be used by L2 execution unit 701 and L3 execution unit 703. In one implementation, registers 730 are each 32-bit general purpose storage registers and registers 735 are 32-bit mailbox registers. There may be, for example, sixteen general purpose storage registers 730 and eight mailbox registers 735. The mailbox registers 735 allow execution units 701 and 703 to exchange information with one another. For example, only L2 execution unit 701 may be allowed to write to the first mailbox register and only L3 execution unit 703 may be allowed to read from this mailbox register.
execution units output buffers memory 713 is read byL2 execution unit 701 andL3 execution unit 703, as required by these units. Periodically,L2 execution unit 701 writes to buffer 720. Similarly,L3 execution unit 703 periodically generates writes to buffer 721. These write operations tobuffers L2 execution unit 701 andL3 execution unit 703 each finish writing to theirrespective output buffers L2 execution unit 701 andL3 execution unit 703 finish writing tooutput buffers build component 740 combines the results of both buffers to form the complete header for the packet. - As shown in
FIG. 4 , the final packet header generated byoutput interface 505 includes L2 header fields 406 and 407 followed by theL3 header field 408. By independently buffering the L2 header data and L3 header data, and then combining the L2 and L3 header data,L2 execution unit 701 andL3 execution unit 703 are able to operate in parallel. This is advantageous, as it allows for more efficient generation of a packet header relative to a serial implementation that first generates L2 header information and then generates L3 header information. -
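- A minimal software analogue of this join is sketched below, assuming a completion flag per output buffer; the buffer type and function name are hypothetical, since the actual signaling between the execution units and build component 740 is hardware and is not specified at this level:

```c
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

struct out_buf {
    unsigned char bytes[128];
    size_t len;
    bool done;    /* set when the execution unit finishes writing */
};

/* Concatenate the finished L2 and L3 buffers into one header.
 * Returns the header length, or 0 if either unit is still writing
 * or the caller's buffer is too small. */
static size_t build_header(const struct out_buf *l2, const struct out_buf *l3,
                           unsigned char *hdr, size_t cap)
{
    if (!l2->done || !l3->done)
        return 0;                     /* wait until both are complete */
    if (l2->len + l3->len > cap)
        return 0;
    memcpy(hdr, l2->bytes, l2->len);              /* L2 fields first */
    memcpy(hdr + l2->len, l3->bytes, l3->len);    /* then the L3 header */
    return l2->len + l3->len;
}
```

Note that neither execution unit ever waits for the other while writing; only the final concatenation synchronizes them, which is where the parallel speedup comes from.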
- FIG. 8 is a flow chart illustrating operation of L2/L3 engines 602 consistent with principles of the invention. To begin, received packet header data is accessed by L2 execution unit 701 and L3 execution unit 703 (acts 801 and 802). L2 execution unit 701 and L3 execution unit 703 independently operate on the received packet header. During operation, L2 execution unit 701 may write new L2 packet header data to buffer 720 (act 804) and L3 execution unit 703 may write new L3 packet header data to buffer 721 (act 805). When the L2 execution unit finishes writing the L2 packet data to buffer 720, it signals its completion to build component 740. The L3 execution unit similarly signals its completion of writing the L3 packet data to buffer 721 (acts 807 and 809). Finally, build component 740, in response to the signals indicating completion of the L2 and L3 header data, loads the L2 packet header from buffer 720, loads the L3 packet header from buffer 721, loads any remaining packet data from memory 713, and builds the complete packet header (act 810).
- As discussed above, L2/L3 engines 602 receive old packet header data from dispatch component 601, generate the new packet header data, and transmit the new packet header data to build component 740 from buffers 720 and 721. L2/L3 engines 602 may increase the efficiency with which they load and unload packets by simultaneously loading header data for a next packet while unloading the packet header data that was most recently generated.
- FIG. 9 is a diagram conceptually illustrating the loading and unloading of data in the L2/L3 engines 602. Packet header data for a first packet is received from dispatch component 601 and loaded into packet header memory 713. Dispatch component 601 may also transmit data to on-chip template memory 710, tag descriptor memory 711, and/or L2 descriptor memory 712. Additionally, dispatch component 601 may preload general purpose registers 730 with information such as, for example, packet length information, fragmentation information for the packet, or information relating to the MPLS tags. This loaded information is then processed by the L2 execution unit 701 and L3 execution unit 703, as discussed above.
- When both the L2 execution unit 701 and the L3 execution unit 703 have finished writing the new packet header data to buffers 720 and 721, build component 740 unloads the buffers by reading the new packet header data from the buffers. Simultaneously, dispatch component 601 may begin to transmit data associated with the next packet into memories 710-713. In this manner, processing of the next packet by L2/L3 engine 602 can begin sooner than if the L2/L3 engine waited until a packet was fully removed from the L2/L3 engine before beginning processing on the next packet.
- As shown in FIG. 7, the L2/L3 engines 602 include general purpose storage registers 730 and mailbox registers 735. Mailbox registers 735 allow execution units 701 and 703 to exchange information with one another and with a host processor.
- FIG. 10 is a diagram illustrating an implementation of the mailbox registers 735 consistent with the principles of the invention. Both L2 execution unit 701 and L3 execution unit 703 can write and read from any of general purpose registers 730. Mailbox registers 735, however, are each "one-way" registers that are designed to be written to by a single resource and read from by another resource. Specifically, as shown in the exemplary implementation of FIG. 10, three mailbox registers 1001 are used to transfer information from L3 execution unit 703 to L2 execution unit 701, one mailbox register 1002 is used to transfer information from L2 execution unit 701 to L3 execution unit 703, one mailbox register 1003 is used to transfer information from L2 execution unit 701 to host processor 1007, one mailbox register 1004 is used to transfer information from host processor 1007 to L2 execution unit 701, one mailbox register 1005 is used to transfer information from L3 execution unit 703 to host processor 1007, and one mailbox register 1006 is used to transfer information from host processor 1007 to L3 execution unit 703. Host processor 1007 may be, for example, a processor used in programming and debugging L2/L3 engines 602.
- In one implementation, each of mailbox registers 1001-1006 is a 32-bit register associated with an additional valid ("V") bit. The valid bits inform the resources associated with the mailbox (i.e., the L2 execution unit, the L3 execution unit, and the host processor) whether the mailbox is waiting to be read or is available to be written. Thus, for example, when the L2 execution unit 701 wishes to write to mailbox register 1002, it first checks the state of the valid bit. If the valid bit is not set (i.e., it is a logic zero), L2 execution unit 701 writes its data to mailbox register 1002 and sets the valid bit. If, however, the valid bit is set (i.e., it is a logic one), the L2 execution unit waits until the L3 execution unit 703 resets the valid bit before writing to mailbox register 1002. Correspondingly, L3 execution unit 703 waits until the valid bit is set before reading from mailbox register 1002. After reading the mailbox register, L3 execution unit 703 resets the valid bit.
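- The valid-bit handshake amounts to a one-way, single-entry mailbox. A C sketch of the protocol follows (names are hypothetical; a hardware implementation would stall rather than return a status, and concurrent software would additionally need atomics or memory barriers):

```c
#include <stdbool.h>
#include <stdint.h>

struct mailbox {
    volatile uint32_t data;
    volatile bool valid;   /* the "V" bit: set = waiting to be read */
};

/* Producer side (e.g., the L2 execution unit writing register 1002).
 * Returns false if the previous value has not been consumed yet. */
static bool mailbox_write(struct mailbox *mb, uint32_t value)
{
    if (mb->valid)
        return false;      /* busy: reader has not drained it */
    mb->data = value;
    mb->valid = true;      /* publish after the data is in place */
    return true;
}

/* Consumer side (e.g., the L3 execution unit). Returns false if empty. */
static bool mailbox_read(struct mailbox *mb, uint32_t *out)
{
    if (!mb->valid)
        return false;      /* nothing waiting */
    *out = mb->data;
    mb->valid = false;     /* free the mailbox for the next write */
    return true;
}
```

Because each mailbox has exactly one writer and one reader, the single valid bit is sufficient; no further arbitration is needed.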
-
- FIGS. 11 and 12 are diagrams illustrating, in additional detail, L2 execution unit 701 and L3 execution unit 703, respectively, in accordance with concepts consistent with the invention.
- As previously mentioned, L2 execution unit 701 receives instructions from instruction memory 702. Instruction memory 702 may be a 64-bit wide memory. The particular instruction to retrieve from instruction memory 702 is determined by the value in program counter 1102. The selected instruction is received by execution section 1103. Execution section 1103 may be implemented as a two-stage pipeline. Stack 1101 is a single entry memory that may be used to temporarily store, for example, an incremented version of the value in program counter 1102.
- Referring to FIG. 12, L3 execution unit 703 is constructed similarly to L2 execution unit 701, except that in addition to a single stack 1201 and a single program counter 1202, L3 execution unit 703 contains three parallel execution sections 1203, 1204, and 1205, which receive instructions from instruction memory 704. Instruction memory 704 may be a 192-bit wide memory. Each 192-bit word read from instruction memory 704 is divided equally between execution sections 1203-1205 as three 64-bit instructions.
L3 execution unit 703, only one of the execution sections,execution section 1203, is connected toprogram counter 1202. Thus, onlyexecution section 1203 controls the program flow. - Execution sections 1203-1205 may communicate with one another through the general purpose registers 735. Generally, however, instructions are stored in
memory 704 that allow execution sections 1203-1205 to function largely independently of one another. For example, one of the execution sections may work on computing a checksum while another of the execution sections computes a packet length. - As previously discussed,
L2 execution unit 701 may read from or write totemplate memory 710,tag descriptor memory 711, anddescriptor memory 712 when calculating new L2 header information. One implementation of memories 710-712 consistent with the principles of the invention is three separate first-in, first-out (FIFO) buffers. Different registers, for example, would be used in conjunction with the buffers to keep track of data within the FIFO buffers. Such an implementation, however, entails a significant delay forL2 execution unit 701 to obtain data from the memories 710-712. -
- FIG. 13 is a diagram illustrating memories 710, 711, and 712 of FIG. 7 in additional detail in an implementation consistent with the principles of the invention. A single descriptor memory 1310 may include template memory 710, tag descriptor memory 711, and L2 descriptor memory 712. In other words, template memory 710, tag descriptor memory 711, and L2 descriptor memory 712 may be logical portions of descriptor memory 1310, rather than separate physical memories. In one implementation, descriptor memory 1310 may be 40 bytes in size, and may be configured to provide data from any memory location addressed (e.g., random access).
- Template memory 710 may store on-chip template data for a packet, and may begin at address 0 of descriptor memory 1310. Such template data may be common to different headers, and may be loaded into template memory 710 from an on-chip memory (not shown). Tag descriptor memory 711 may store tag descriptor data for MPLS tags from SRAM 510, and may begin at an address of descriptor memory 1310 that is determined by the size of the on-chip template data in template memory 710.
- In one implementation, L2 descriptor memory 712 may include the L2 descriptor information read from SRAM 510, and may begin at an address of descriptor memory 1310 that is determined by the size of both the on-chip template data in template memory 710 and the tag descriptor data in tag descriptor memory 711. Data may be respectively written to template memory 710, tag descriptor memory 711, and L2 descriptor memory 712 by dispatch component 601 in a known order. In other implementations, descriptor memory 712 may include descriptor information from a level x (Lx) protocol, where x may indicate one of several protocols (e.g., L2, L3, etc.). Hence, descriptor memory 712 may be referred to generically as an "Lx descriptor memory" 712.
- Current locations within template memory 710, tag descriptor memory 711, and L2 descriptor memory 712 may be respectively designated by a template pointer, a tag pointer, and an L2 pointer, all of which initially point to the beginning addresses of their respective memories. Certain details of the order in which L2 execution unit 701 reads from memories 710-712 may be specified by different instruction sequences selected by the L2 protocol (e.g., 802.3, SNAP, or PPP) of the packet whose L2 header is being processed. When data is read from, for example, template memory 710, the template pointer may be incremented by one or more addresses, determined by the number of bytes read, to simulate reading from a FIFO buffer. In the implementation shown in FIG. 13, L2 execution unit 701 need store only three pointers in logic, and may retrieve data from memories 710-712 more efficiently than from separate FIFO buffers. Further, data may be alternately read from memories 710-712 by switching from one of the respective pointers to another. Because of the ability to rapidly obtain data from memories 710-712, L2 execution unit 701 may process L2 headers more quickly.
L3 header 408 that is generatedL3 execution unit 703 may contain a checksum. The checksum may be used to detect corruption in a received packet. For example, the checksum for IPv4 may be 16 bits and may be contained within L3 header at a known location. As explained above,L3 execution unit 702 generates L3 header information and writes the information intoL3 output buffer 721. Typically, the L3 header information is written toL3 output buffer 721 sequentially, with the checksum information being initially written as zeros. One way to generate the checksum would be forL3 execution unit 702 to wait untilL3 output buffer 721 contains an entire L3 header, read the entire header, compute the checksum, and write the checksum to the appropriate position inL3 output buffer 721. Such checksum computation, however, would occupyL3 execution unit 702 for a large number of instruction cycles, thereby delaying further L3 header processing. -
- FIG. 14 is a diagram illustrating checksum processing element 1400 and its relation to L3 execution unit 703. Checksum processing element 1400 may be situated in the data flow between L3 execution unit 703 and L3 output buffer 721, as shown in FIG. 14. Checksum processing element 1400 may include a checksum generator 1410 and a checksum accumulator 1420. One of the general purpose registers 730 described in FIG. 7 may be used as checksum accumulator 1420.
- Checksum generator 1410 may include logic configured to compute a checksum using inputs of L3 information from L3 execution unit 703 and checksum data from checksum accumulator 1420 when instructed by the L3 execution unit 703. In some implementations, such an instruction may be implicit (i.e., the checksum may be computed by default). In one implementation, L3 execution unit 703 may initially load checksum accumulator 1420 with an initial value when it writes the first L3 information for a new L3 header to L3 output buffer 721. In another implementation consistent with the present invention, L3 execution unit 703 may pre-load checksum accumulator 1420 with a value before it writes the first L3 information for a new L3 header to L3 output buffer 721. In subsequent writings of L3 information to L3 output buffer 721, checksum generator 1410 may read stored checksum data from checksum accumulator 1420 and compute a new checksum using the stored checksum data and the L3 information being written to L3 output buffer 721. Checksum generator 1410 may store this new (partial) checksum in checksum accumulator 1420. In this manner, checksum processing element 1400 updates the L3 checksum "on the fly" (i.e., concurrently) as the L3 header information is written to L3 output buffer 721 by L3 execution unit 703.
- When L3 output buffer 721 contains an entire L3 header, checksum accumulator 1420 contains the checksum for that L3 header. L3 execution unit 703 may write the checksum from checksum accumulator 1420 to the appropriate address in L3 output buffer 721. Because of the checksum processing element 1400, L3 execution unit 703 may use only two instruction cycles for checksum processing: one instruction cycle to initially load checksum accumulator 1420, and one instruction cycle to write the complete checksum from checksum accumulator 1420 to L3 output buffer 721. Because of the ability to concurrently compute the L3 header checksum, L3 execution unit 703 may process L3 headers more quickly. Although described with respect to L3 headers, it is specifically envisioned that checksum processing elements similar to checksum processing element 1400 may be used to compute other checksums on the fly as well.
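- For the IPv4 case, the accumulator behavior corresponds to the standard ones'-complement sum, folded and inverted once at the end. A software sketch of the arithmetic follows; the interface of checksum generator 1410 itself is not specified here, so the function names are assumptions:

```c
#include <stddef.h>
#include <stdint.h>

/* Running ones'-complement accumulator, updated per 16-bit word as the
 * header streams past (the checksum field itself streams as zero). */
static uint32_t csum_update(uint32_t acc, uint16_t word)
{
    return acc + word;
}

/* Fold the carries and invert once to produce the final checksum. */
static uint16_t csum_final(uint32_t acc)
{
    while (acc >> 16)
        acc = (acc & 0xFFFF) + (acc >> 16);
    return (uint16_t)~acc;
}

/* Example: checksum an IPv4 header given as 16-bit words, where word
 * index 5 is the checksum field and is treated as zero on input. */
static uint16_t ipv4_header_checksum(const uint16_t *hdr, size_t words)
{
    uint32_t acc = 0;
    for (size_t i = 0; i < words; i++)
        acc = csum_update(acc, i == 5 ? 0 : hdr[i]);
    return csum_final(acc);
}
```

The per-word update is why the hardware can fold the computation into the writes themselves: each write to buffer 721 costs one accumulator update, leaving the execution unit only the initial load and the final write-back.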
-
- FIG. 15 is a flow chart illustrating fragment handling by the L2/L3 engine component 504. L2/L3 engine component 504 may receive the first two cells of a packet, which contain the packet header information, from descriptor reader 503 [act 1510]. The header information may include the length of the packet. Alternately, the packet length may be detected from the packet itself. The header information may also be used to obtain the MTU size of the interface that is the packet's destination. For example, the header information may be used to construct a key, which may be used to look up MTU information from SRAM 510. L2/L3 engine component 504 may determine whether the length of the packet associated with the header information exceeds the MTU size of the destination interface [act 1520].
- If the length of the packet is less than the MTU size of the destination interface, L2/L3 engine component 504 may generate a new header for the packet, and may transmit the new header to output interface component 505 [act 1530]. Output interface component 505 may assemble the new header information and packet data from buffer 502 into a packet.
- If the length of the packet is greater than the MTU size of the destination interface, L2/L3 engine component 504 may generate a first fragment header and send a first fragment with the header [act 1540]. The first fragment header may include the size of any payload data in the first two cells and the size of the packet data to be read from buffer 502. This size information may be used to calculate an updated packet length. The updated packet length may be calculated by L2/L3 engine component 504 and included in the first fragment header. Alternately, the updated packet length may not be included in the first fragment header, but may be calculated from information in the first fragment header in later processing (e.g., act 1560). The first fragment header also may include recycle information, such as a bit indicating previous fragmentation and a length of the L2 header.
- The first fragment header may be transmitted to output interface component 505 for generation of a first packet fragment, and may also be transmitted to input interface component 501 [act 1550]. Output interface component 505 may assemble the first fragment header information and packet data from buffer 502 into a first packet. Meanwhile, input interface component 501 may send the first fragment header, via descriptor reader 503, to L2/L3 engine component 504.
- L2/L3 engine component 504 may use the first fragment header to generate one or more subsequent fragment headers by determining whether the updated packet length (either contained in the first fragment header or calculated) exceeds the MTU size of the destination interface [act 1560]. If the updated packet length is less than the MTU size of the destination interface, L2/L3 engine component 504 may generate a new header for the packet, and may transmit the new header to output interface component 505 [act 1570]. Output interface component 505 may assemble the new header information and packet data from buffer 502 into a second packet fragment.
- If the updated packet length is greater than the MTU size of the destination interface, L2/L3 engine component 504 may generate a next fragment header [act 1580]. The next fragment header may include the size of the packet data to be read from buffer 502. This size information may be used to calculate an updated packet length. The updated packet length may be calculated by L2/L3 engine component 504 and included in the next fragment header. Alternately, the updated packet length may not be included in the next fragment header, but may be calculated from information in the next fragment header in later processing (e.g., act 1560). The next fragment header also may include recycle information, such as a bit indicating previous fragmentation and a length of the L2 header.
- The next fragment header (and any subsequent fragment headers) may be transmitted to output interface component 505 and input interface component 501, and may be processed as described in acts 1550-1580. In this manner, L2/L3 engine component 504 may recycle packet headers when the packet's destination interface requires that the packet be fragmented.
- The foregoing description of preferred embodiments of the invention provides illustration and description, but is not intended to be exhaustive or to limit the invention to the precise form disclosed. Modifications and variations are possible in light of the above teachings or may be acquired from practice of the invention. Moreover, while a series of acts has been presented with respect to
FIGS. 9 and 15 , the order of the acts may be different in other implementations consistent with principles of the invention. Additionally, lines with arrows are used in the figures to generally illustrate the flow of data. In practice, embodiments consistent with the principles of the invention may send data on these lines in both directions. -
PFEs 110 may be implemented in hardware, software, or some combination thereof. For example, various portions ofPFEs 110 may be implemented in application-specific integrated circuits (ASICs). The ASICs may be configured to perform some processing via dedicated logic, and may also be configured to perform some processing using microcode instructions that may be stored in memory. Those skilled in the router art will appreciate that the invention described herein might be practiced using a variety of hardware configurations in addition to, or instead of, ASICs. For example, some combination of general purpose processors, digital signal processors (DSPs), and programmable gate arrays (PGAs) may also be utilized to implement the functionality described herein. - No element, act, or instruction used in the description of the present application should be construed as critical or essential to the invention unless explicitly described as such. Also, as used herein, the article “a” is intended to include one or more items. Where only one item is intended, the term “one” or similar language is used.
- The scope of the invention is defined by the claims and their equivalents.
Claims (21)
1-27. (canceled)
28. A device comprising:
a plurality of data unit processing components, each of the plurality of data unit processing components, including:
a higher layer execution unit to:
receive header information, where the header information includes template data and tag descriptor data, and
process the header information to obtain higher layer header data, the higher layer execution unit performing the processing of the received header information in parallel with processing of the header information by a lower layer execution unit associated with the one of the plurality of data unit processing components.
29. The device of claim 28 , where the header information further includes header data associated with a previously processed data unit.
30. The device of claim 28 , further comprising:
a plurality of mailbox registers to:
store information to allow the higher layer execution unit, of the one of the plurality of data unit processing components, to share data with the lower layer execution unit, associated with the one of the plurality of data unit processing components.
31. The device of claim 30 , where at least one mailbox register, of the plurality of mailbox registers, accepts data from the higher layer execution unit and does not accept data from the lower layer execution unit.
32. The device of claim 30 , where at least one mailbox register, of the plurality of mailbox registers, includes a validity bit that indicates whether the one mailbox register is ready to be written or read.
33. The device of claim 28 , further comprising:
a build component to:
receive the higher layer header data from the higher layer execution unit, and
form a header based on the higher layer header data and the lower layer header data received from the lower layer execution unit.
34. The device of claim 28 , where a data unit processing component, of the plurality of data unit processing components, loads information relating to a first header simultaneously with acquiring information relating to a second header.
35. The device of claim 28 , where the higher layer execution unit includes a plurality of execution sections, including:
a first execution section, of the plurality of execution sections, to process the header information, and
a second execution section, of the plurality of execution sections, to compute a checksum.
36. The device of claim 28 , further comprising:
a dispatch component to:
receive the header information associated with a data unit,
parse the received header information, and
direct the parsed header information to two or more of the plurality of data unit processing components.
37. A device comprising:
an input unit to:
receive a data unit, and
determine header information associated with the data unit, where the header information includes template data and tag descriptor data;
a higher layer execution unit to:
receive the header information, and
generate higher layer header data based on the header information, the higher layer execution unit generating the higher layer header data in parallel with generation of lower layer header data, associated with the header information, by an associated lower layer execution unit.
38. The device of claim 37 , further comprising:
a dispatch component to:
parse the header information to create parsed header information portions, and
direct one of the parsed header information portions to the higher layer execution unit.
39. The device of claim 37 , further comprising:
at least one mailbox register to store information, the at least one mailbox register allowing the higher layer execution unit to share data with the lower layer execution unit.
40. The device of claim 39 , where the at least one mailbox register includes a first mailbox register and a second mailbox register, where the first mailbox register cannot be written to by the higher layer execution unit, and where the second mailbox register cannot be written to by the lower layer execution unit.
41. The device of claim 39 , where the at least one mailbox register includes a validity bit that indicates whether data associated with the at least one mailbox register is ready to be written or read.
42. The device of claim 37 , further comprising:
a build component to combine the higher layer header data and the lower layer header data to form a header.
43. The device of claim 37 , where the higher layer execution unit is further to compute a checksum for the received header information.
44. A method implemented in a device comprising a plurality of data unit processing components, the method comprising:
receiving, at a higher layer execution unit in one of the plurality of data unit processing components, header information including at least one of template data, tag descriptor data, or higher layer descriptor data;
generating, at the higher layer execution unit, higher layer header data based on the header information, where generating the higher layer header data occurs in parallel with generating, by one of the plurality of data unit processing components, lower layer header data associated with the header information; and
forming, based on the higher layer header data and the lower layer header data, a header.
45. The method of claim 44 , further comprising:
transferring data to and from the higher layer execution unit using a first mailbox register.
46. The method of claim 44 , further comprising:
transferring data to and from the lower layer execution unit using a second, different mailbox register that cannot be accessed by the higher layer execution unit.
47. The method of claim 44 , further comprising:
computing a checksum for the header information, and
verifying the header information based on the checksum.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/072,382 US20110170546A1 (en) | 2002-03-22 | 2011-03-25 | Logical separation and accessing of descriptor memories |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/102,933 US7215662B1 (en) | 2002-03-22 | 2002-03-22 | Logical separation and accessing of descriptor memories |
US11/695,723 US7782857B2 (en) | 2002-03-22 | 2007-04-03 | Logical separation and accessing of descriptor memories |
US12/773,494 US7936758B2 (en) | 2002-03-22 | 2010-05-04 | Logical separation and accessing of descriptor memories |
US13/072,382 US20110170546A1 (en) | 2002-03-22 | 2011-03-25 | Logical separation and accessing of descriptor memories |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/773,494 Continuation US7936758B2 (en) | 2002-03-22 | 2010-05-04 | Logical separation and accessing of descriptor memories |
Publications (1)
Publication Number | Publication Date |
---|---|
US20110170546A1 true US20110170546A1 (en) | 2011-07-14 |
Family
ID=38001057
Family Applications (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/102,933 Expired - Fee Related US7215662B1 (en) | 2002-03-22 | 2002-03-22 | Logical separation and accessing of descriptor memories |
US11/695,723 Expired - Fee Related US7782857B2 (en) | 2002-03-22 | 2007-04-03 | Logical separation and accessing of descriptor memories |
US12/773,494 Expired - Fee Related US7936758B2 (en) | 2002-03-22 | 2010-05-04 | Logical separation and accessing of descriptor memories |
US13/072,382 Abandoned US20110170546A1 (en) | 2002-03-22 | 2011-03-25 | Logical separation and accessing of descriptor memories |
Family Applications Before (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/102,933 Expired - Fee Related US7215662B1 (en) | 2002-03-22 | 2002-03-22 | Logical separation and accessing of descriptor memories |
US11/695,723 Expired - Fee Related US7782857B2 (en) | 2002-03-22 | 2007-04-03 | Logical separation and accessing of descriptor memories |
US12/773,494 Expired - Fee Related US7936758B2 (en) | 2002-03-22 | 2010-05-04 | Logical separation and accessing of descriptor memories |
Country Status (1)
Country | Link |
---|---|
US (4) | US7215662B1 (en) |
Families Citing this family (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7212530B1 (en) | 2002-03-22 | 2007-05-01 | Juniper Networks, Inc. | Optimized buffer loading for packet header processing |
US7215662B1 (en) * | 2002-03-22 | 2007-05-08 | Juniper Networks, Inc. | Logical separation and accessing of descriptor memories |
US7283528B1 (en) | 2002-03-22 | 2007-10-16 | Raymond Marcelino Manese Lim | On the fly header checksum processing using dedicated logic |
US7236501B1 (en) | 2002-03-22 | 2007-06-26 | Juniper Networks, Inc. | Systems and methods for handling packet fragmentation |
US7317721B1 (en) * | 2002-04-12 | 2008-01-08 | Juniper Networks, Inc. | Systems and methods for memory utilization during packet forwarding |
US20060140203A1 (en) * | 2004-12-28 | 2006-06-29 | Sanjeev Jain | System and method for packet queuing |
US7467325B2 (en) | 2005-02-10 | 2008-12-16 | International Business Machines Corporation | Processor instruction retry recovery |
US7478276B2 (en) * | 2005-02-10 | 2009-01-13 | International Business Machines Corporation | Method for checkpointing instruction groups with out-of-order floating point instructions in a multi-threaded processor |
JP4649315B2 (en) * | 2005-11-02 | 2011-03-09 | キヤノン株式会社 | Communication apparatus and communication method |
US8223788B1 (en) * | 2007-10-24 | 2012-07-17 | Ethernity Networks Ltd | Method and system for queuing descriptors |
US8160071B2 (en) * | 2008-12-02 | 2012-04-17 | Electronics And Telecommunications Research Institute | Apparatus and method for processing L2/L3 packet |
US8687653B2 (en) * | 2009-10-30 | 2014-04-01 | Brocade Communications Systems, Inc. | Tunnel path MTU discovery |
US9253248B2 (en) * | 2010-11-15 | 2016-02-02 | Interactic Holdings, Llc | Parallel information system utilizing flow control and virtual channels |
KR102516027B1 (en) | 2016-07-20 | 2023-03-31 | 삼성전자주식회사 | Header processing device, processor and electronic device |
US11627185B1 (en) * | 2020-09-21 | 2023-04-11 | Amazon Technologies, Inc. | Wireless data protocol |
US11627093B1 (en) * | 2020-10-30 | 2023-04-11 | Juniper Networks, Inc. | Generic layer independent fragmentation of non-internet protocol frames |
Family Cites Families (53)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2692696A1 (en) * | 1992-06-19 | 1993-12-24 | Sgs Thomson Microelectronics | Method for loading a program into a program memory associated with a processor. |
US5408866A (en) * | 1992-11-25 | 1995-04-25 | Nissan Motor Co., Ltd. | Leak diagnosis system for evaporative emission control system |
CA2139081C (en) | 1994-12-23 | 1999-02-02 | Alastair Gordon | Unified messaging system and method |
US5630054A (en) * | 1995-04-18 | 1997-05-13 | Mti Technology Center | Method and apparatus for storing and retrieving error check information |
US20020116595A1 (en) | 1996-01-11 | 2002-08-22 | Morton Steven G. | Digital signal processor integrated circuit |
US5826032A (en) * | 1996-02-12 | 1998-10-20 | University Of Southern California | Method and network interface logic for providing embedded checksums |
US6791947B2 (en) * | 1996-12-16 | 2004-09-14 | Juniper Networks | In-line packet processing |
US6182258B1 (en) * | 1997-06-03 | 2001-01-30 | Verisity Ltd. | Method and apparatus for test generation during circuit design |
US5938736A (en) | 1997-06-30 | 1999-08-17 | Sun Microsystems, Inc. | Search engine architecture for a high performance multi-layer switch element |
US6289023B1 (en) | 1997-09-25 | 2001-09-11 | Hewlett-Packard Company | Hardware checksum assist for network protocol stacks |
US6807581B1 (en) | 2000-09-29 | 2004-10-19 | Alacritech, Inc. | Intelligent network storage interface system |
US6172981B1 (en) | 1997-10-30 | 2001-01-09 | International Business Machines Corporation | Method and system for distributing network routing functions to local area network stations |
US6275508B1 (en) | 1998-04-21 | 2001-08-14 | Nexabit Networks, Llc | Method of and system for processing datagram headers for high speed computer network interfaces at low clock speeds, utilizing scalable algorithms for performing such network header adaptation (SAPNA) |
US6067300A (en) | 1998-06-11 | 2000-05-23 | Cabletron Systems, Inc. | Method and apparatus for optimizing the transfer of data packets between local area networks |
US6195739B1 (en) | 1998-06-29 | 2001-02-27 | Cisco Technology, Inc. | Method and apparatus for passing data among processor complex stages of a pipelined processing engine |
US6560228B2 (en) | 1998-07-01 | 2003-05-06 | Agilent Technologies, Inc. | Generation of informative MAC headers for analysis of ATM routed LAN frames |
US6182267B1 (en) * | 1998-11-20 | 2001-01-30 | Cisco Technology, Inc. | Ensuring accurate data checksum |
JP3645734B2 (en) | 1999-02-24 | 2005-05-11 | 株式会社日立製作所 | Network relay device and network relay method |
US6412092B1 (en) * | 1999-04-14 | 2002-06-25 | Hewlett-Packard Company | Method and apparatus to reduce the cost of preparing the checksum for out bound data in network communication protocols by caching |
US6671722B1 (en) * | 1999-07-08 | 2003-12-30 | Intel Corporation | Stack-less, CPU-less creation of valid SNMP-trap packets |
US6449656B1 (en) | 1999-07-30 | 2002-09-10 | Intel Corporation | Storing a frame header |
US6775284B1 (en) | 2000-01-07 | 2004-08-10 | International Business Machines Corporation | Method and system for frame and protocol classification |
US6721316B1 (en) | 2000-02-14 | 2004-04-13 | Cisco Technology, Inc. | Flexible engine and data structure for packet header processing |
US6944164B2 (en) | 2000-02-29 | 2005-09-13 | International Business Machines Corporation | Method and apparatus for hardware forwarding of LAN frames over ATM networks |
US6988235B2 (en) * | 2000-03-02 | 2006-01-17 | Agere Systems Inc. | Checksum engine and a method of operation thereof |
US6269023B1 (en) * | 2000-05-19 | 2001-07-31 | Advanced Micro Devices, Inc. | Method of programming a non-volatile memory cell using a current limiter |
JP4006169B2 (en) | 2000-05-30 | 2007-11-14 | 株式会社日立製作所 | Label switching type packet transfer device |
US6920133B1 (en) | 2000-06-07 | 2005-07-19 | At&T Corp. | Techniques for introducing in-band network management packets in multi-protocol label switching networks |
US6678746B1 (en) * | 2000-08-01 | 2004-01-13 | Hewlett-Packard Development Company, L.P. | Processing network packets |
JP4203979B2 (en) | 2000-08-02 | 2009-01-07 | 富士通株式会社 | Packet processing device |
JP2002057698A (en) * | 2000-08-09 | 2002-02-22 | Fujitsu Ltd | Packet data processing device |
US7046681B2 (en) | 2000-11-29 | 2006-05-16 | Raza Microelectronics, Inc. | Network switch for routing network traffic |
US6996105B1 (en) * | 2000-12-19 | 2006-02-07 | Adaptec, Inc. | Method for processing data packet headers |
US20030097481A1 (en) | 2001-03-01 | 2003-05-22 | Richter Roger K. | Method and system for performing packet integrity operations using a data movement engine |
EP1246407B1 (en) * | 2001-03-22 | 2004-09-29 | Alcatel | Network-unit and method providing switching by means of a quality of service packet field comprising destination information |
JP2002290399A (en) * | 2001-03-26 | 2002-10-04 | Fujitsu Ltd | Subscriber termination device and packet processing method |
US7454527B2 (en) * | 2001-05-02 | 2008-11-18 | Microsoft Corporation | Architecture and related methods for streaming media content through heterogeneous networks |
US6944168B2 (en) * | 2001-05-04 | 2005-09-13 | Slt Logic Llc | System and method for providing transformation of multi-protocol packets in a data stream |
JP4054550B2 (en) * | 2001-06-29 | 2008-02-27 | 株式会社エヌ・ティ・ティ・ドコモ | Transmission power control method and apparatus |
JP3965283B2 (en) * | 2001-07-02 | 2007-08-29 | 株式会社日立製作所 | Packet transfer device with multiple types of packet control functions |
US6963586B2 (en) * | 2001-08-21 | 2005-11-08 | Via Technologies, Inc. | Method and apparatus for general-purpose packet reception processing |
US7355971B2 (en) | 2001-10-22 | 2008-04-08 | Intel Corporation | Determining packet size in networking |
US20030172177A1 (en) * | 2001-12-06 | 2003-09-11 | Kersley Ian P. | System and method for verifying a device |
US7006533B2 (en) * | 2002-02-19 | 2006-02-28 | Intel Corporation | Method and apparatus for hublink read return streaming |
US7158520B1 (en) * | 2002-03-22 | 2007-01-02 | Juniper Networks, Inc. | Mailbox registers for synchronizing header processing execution |
US7180893B1 (en) * | 2002-03-22 | 2007-02-20 | Juniper Networks, Inc. | Parallel layer 2 and layer 3 processing components in a network router |
US7080308B2 (en) * | 2002-03-22 | 2006-07-18 | Intel Corporation | Method and apparatus to perform error control |
US7236501B1 (en) * | 2002-03-22 | 2007-06-26 | Juniper Networks, Inc. | Systems and methods for handling packet fragmentation |
US7239630B1 (en) * | 2002-03-22 | 2007-07-03 | Juniper Networks, Inc. | Dedicated processing resources for packet header generation |
US7215662B1 (en) * | 2002-03-22 | 2007-05-08 | Juniper Networks, Inc. | Logical separation and accessing of descriptor memories |
US7212530B1 (en) * | 2002-03-22 | 2007-05-01 | Juniper Networks, Inc. | Optimized buffer loading for packet header processing |
US7283528B1 (en) * | 2002-03-22 | 2007-10-16 | Raymond Marcelino Manese Lim | On the fly header checksum processing using dedicated logic |
Priority Applications
- 2002-03-22: US application US10/102,933 filed; granted as US7215662B1 (status: Expired - Fee Related)
- 2007-04-03: US application US11/695,723 filed; granted as US7782857B2 (status: Expired - Fee Related)
- 2010-05-04: US application US12/773,494 filed; granted as US7936758B2 (status: Expired - Fee Related)
- 2011-03-25: US application US13/072,382 filed; published as US20110170546A1 (status: Abandoned)
Also Published As
Publication number | Publication date
---|---
US20070183425A1 (en) | 2007-08-09
US20100215046A1 (en) | 2010-08-26
US7782857B2 (en) | 2010-08-24
US7215662B1 (en) | 2007-05-08
US7936758B2 (en) | 2011-05-03
Similar Documents
Publication | Title
---|---
US7773599B1 (en) | Packet fragment handling
US7616562B1 (en) | Systems and methods for handling packet fragmentation
US7936758B2 (en) | Logical separation and accessing of descriptor memories
US7680116B1 (en) | Optimized buffer loading for packet header processing
US8001335B2 (en) | Low latency request dispatcher
US8737403B2 (en) | Context-switched multi-stream pipelined reorder engine
US7764606B1 (en) | Maintaining packet order using hash-based linked-list queues
CN1520112B (en) | Network protocol offload engine
US9678866B1 (en) | Transactional memory that supports put and get ring commands
US9280297B1 (en) | Transactional memory that supports a put with low priority ring command
US20010053148A1 (en) | Network adapter with embedded deep packet processing
US7181541B1 (en) | Host-fabric adapter having hardware assist architecture and method of connecting a host system to a channel-based switched fabric in a data network
US20060010193A1 (en) | Parser table/production rule table configuration using CAM and SRAM
US7239630B1 (en) | Dedicated processing resources for packet header generation
US8996724B2 (en) | Context switched route look up key engine
US7158520B1 (en) | Mailbox registers for synchronizing header processing execution
US20050281281A1 (en) | Port input buffer architecture
US8050262B2 (en) | System and method for parsing frames
US7180893B1 (en) | Parallel layer 2 and layer 3 processing components in a network router
US7174394B1 (en) | Multi processor enqueue packet circuit
US7751422B2 (en) | Group tag caching of memory contents
US20020078246A1 (en) | Method and system for network protocol processing
US6636952B1 (en) | Systems and methods for processing packet streams in a network device
JP3775597B2 (en) | Data array method and apparatus
Legal Events
Date | Code | Title | Description
---|---|---|---
- | STCB | Information on status: application discontinuation | Free format text: ABANDONED -- FAILURE TO PAY ISSUE FEE