US20150381481A1 - Data path acceleration using hw virtualization - Google Patents

Data path acceleration using hw virtualization Download PDF

Info

Publication number
US20150381481A1
US20150381481A1 US14/846,190 US201514846190A US2015381481A1 US 20150381481 A1 US20150381481 A1 US 20150381481A1 US 201514846190 A US201514846190 A US 201514846190A US 2015381481 A1 US2015381481 A1 US 2015381481A1
Authority
US
United States
Prior art keywords
data packet
packet
processing
packets
processing operation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US14/846,190
Inventor
Timor Kardashov
Maxim Kovalenko
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Marvell World Trade Ltd
Original Assignee
Marvell World Trade Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Marvell World Trade Ltd filed Critical Marvell World Trade Ltd
Priority to US14/846,190 priority Critical patent/US20150381481A1/en
Publication of US20150381481A1 publication Critical patent/US20150381481A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L45/00Routing or path finding of packets in data switching networks
    • H04L45/38Flow based routing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/54Store-and-forward switching systems 
    • H04L12/56Packet switching systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L45/00Routing or path finding of packets in data switching networks
    • H04L45/74Address processing for routing
    • H04L45/745Address table lookup; Address filtering
    • H04L45/7453Address table lookup; Address filtering using hashing

Abstract

A processing core includes a packet classifier, implemented in a single processing core, configured to classify incoming data packets into first data packets of a known data packet flow and into second data packets of an unknown data packet flow, a first path thread, implemented in the single processing core, configured to process ones of the first data packets at least by forwarding the first data packets to a destination that corresponds to a previously determined destination associated with the known data packet flow, and a second path thread, implemented in the single processing core, configured to process a received second data packet at least to determine a new data packet flow for the second data packet.

Description

    CROSS-REFERENCE TO RELATED APPLICATIONS
  • The present application is a continuation application of U.S. application Drt. No. 13/681,139, filed Nov. 19, 2012, which is a non-provisional of U.S. Provisional Patent Application No. 61/561,392, filed Nov. 18, 2011 the disclosures of all of which are incorporated herein in their entirety by this reference.
  • BACKGROUND
  • 1. Field
  • The current disclosure relates to method and apparatuses for packet processing, and more particularly, to methods and apparatuses for performing packet processing operations using distinct threads implemented in a single processing core to process different types of packets.
  • 2. Background
  • The background description provided herein is for the purpose of generally presenting the context of the disclosure. Work of the presently named inventors, to the extent it is described in this background section, as well as aspects of the description that may not otherwise qualify as prior art at the time of filing, are neither expressly nor impliedly admitted as prior art against the present disclosure.
  • Packet processing operations can be divided into two different types. A first type includes operations performed by a control plane. These control plane operations include, for example, operations to control various aspects of the overall packet processing operation, including management of network devices, etc., as well as to generate rules used for packet processing. A second type includes operations performed by a data plane. These data plane operations relate to the processing of the data itself, for example, operations such as forwarding, modifying, and routing the data.
  • A conventional technique implements control plane operations and data plane operations using at least two separate central processing units (CPUs), with at least one of the CPUs being configured to perform control plane operations and the other at least one of the CPUs being configured to perform data plane operations.
  • SUMMARY
  • One or more embodiments of the disclosure relate to methods and apparatuses which implement a single processing core to perform control plane operations and data plane operations. The methods and apparatuses according to embodiments achieve a significant network performance boost. Furthermore, the methods and apparatuses according to embodiments are implemented in a single processing core, thereby reducing costs and effort required for installation and use. Moreover, the methods and apparatuses according to embodiments are flexible and scalable.
  • It is understood, of course, that the methods and apparatuses according to some embodiments are not required to achieve these benefits.
  • According to an embodiment, there is provided a processing core, including: a packet classifier, implemented in a single processing core, configured to classify incoming data packets into first data packets of a known data packet flow and into second data packets of an unknown data packet flow; a first path thread, implemented in the single processing core, configured to process ones of the first data packets at least by forwarding the first data packets to a destination that corresponds to a previously determined destination associated with the known data packet flow; and a second path thread, implemented in the single processing core, configured to process a received second data packet at least to determine a new data packet flow for the second data packet.
  • According to an embodiment, the packet classifier is implemented in the first path thread and classifies each of the first data packets into a specific packet flow from among a plurality of packet flows.
  • According to an embodiment, the classifier classifies the incoming data packets a single time.
  • According to an embodiment, one of the first path thread and the second path thread modifies each of the incoming data packets after being classified.
  • According to an embodiment, the processing core further includes a connections hash table comprising entries corresponding to known data packet flows, wherein, for each incoming data packet, the classification unit is configured to determine a hash corresponding to the incoming data packet, classify the incoming data packet as a first data packet of the known data packet flow when the hash matches an entry in the connections hash table, and classify the incoming data packet as a second data packet of the unknown data packet flow when the hash does not match any entries in the hash table.
  • According to an embodiment, the first path thread processes the first data packets of the known data packet flow according to a previously generated rule that is stored in a memory.
  • According to an embodiment, the processing core further includes a rules hash table including rules to process different packet flows, wherein, for each second data packet, the second path thread is configured to process the second data packet by identifying a rule in the rules hash table associated with the second data packet, process the second data packet according to the rule, and update the connections hash table such that additional incoming packets of the same data packet flow as the second data packet, which are received after the second data packet, will be classified as first data packets of the known data packet flow and processed in the first path thread according to the rule generated earlier in the second path.
  • According to an embodiment, the first path thread uses fast interrupt requests (FIQ), and the second path thread uses a host operating system network stack.
  • According to an embodiment, the second path thread is further configured to determine a forwarding rule for forwarding data packets corresponding to the new data packet flow, and update the first path thread to use the forwarding rule on data packets corresponding to the new data packet flow.
  • According to an embodiment, the classifier is configured to classify each of the incoming data packets as being associated with a corresponding packet flow indicative of a stream of packets of a particular type, having a same source and a having same destination.
  • According to an embodiment, the second path thread employs Just-in-time (JIT) rule compilation to identify a rule to process the new packet type.
  • According to an embodiment, the processing core further includes a hardware abstraction layer (HAL) network driver to receive the incoming data packets and another HAL network driver to transmit the data packets, after being processed by one of the first path thread and the second path thread, to corresponding destinations.
  • According to another embodiment, a packet processing method includes classifying, in a single processing core, incoming data packets into first data packets of a known data packet flow and into second data packets of an unknown data packet flow; processing, in the single processing core, ones of the first data packets at least by forwarding the first data packets to a destination that corresponds to a previously determined destination associated with the known data packet flow; and processing, in the single processing core, a received second data packet at least to determine a new data packet flow for the second data packet.
  • According to another embodiment, the method further includes performing the classifying and the processing of the first data packets in a same first path thread.
  • According to another embodiment, for each of the incoming data packets, the method further includes: determining a hash corresponding to the incoming data packet; classifying the incoming data packet as a first data packet of the known data packet flow when the hash matches an entry in a connections hash table; and classifying the incoming data packet as a second data packet of the unknown data packet flow when an entry matching the hash is absent from the connections hash table.
  • According to another embodiment, for each packet classified as a second data packet, the method further includes: identifying based on a collection of packet attributes, the packet attributes including at least one or more of: a source, a destination, and a type of the packet; identifying a rule in a rules hash table associated with the second data packet; processing the second data packet according to the rule; and updating the connections hash table such that additional incoming packets of the same type as the second data packet, which are received after the second data packet, will be classified as first data packets of the known data packet flow.
  • According to another embodiment, the method further includes performing the processing of the first data packets using fast interrupt requests (FIQ), and performing the processing of the second data packets using a host operating system network stack.
  • According to yet another embodiment, a processing core includes a first path thread, implemented in a single processing core, configured to receive a stream of data packets associated with one or more packet flows and to process data packets which are of a known class according to a rule associated with a known packet flow; and a second path thread, implemented in the single processing core, configured to receive, from the first path thread, data packets which are of an unknown class and to process the data packets of the unknown class.
  • DRAWINGS
  • FIG. 1 is a block diagram illustrating a packet processor implemented as a single processing core system on chip, according to an embodiment;
  • FIGS. 2A and 2B are block diagrams illustrating additional details of packet classification and packet processing components utilized by the single processing core 10 shown in FIG. 1; and
  • FIG. 3 is a flow diagram illustrating a packet processing technique according to an embodiment.
  • DETAILED DESCRIPTION
  • FIG. 1 is a block diagram illustrating a packet processor implemented as a single processing core system on chip, according to an embodiment. As shown in FIG. 1, the single processing core 10 includes an ingress port 11 a, egress ports 11 b, 11 c, 11 d, and 11 e, a first path thread 12, a second path thread 14, a classification unit 16, a first processing unit 18, a second processing unit 20, a shared memory 22, hardware abstraction layer (HAL) network drivers 24 a, 24 b and virtual network drivers 26 a, 26 b. The single processing core 10 is connected by a connection 21, for example to a computer network. In an embodiment, the single processing core 10 is implemented as a system on chip that is part of a central processing unit (CPU) in a network server, a home or office gateway, or any other suitable network device, etc., and the connection 21 connects the single processing core 10 to other electronic components inside of the computing device. In an embodiment, the single processing core 10 has an Advanced RISC Machine (ARM) architecture.
  • The single processing core 10 is configured to receive and process packets and output the processed packets to appropriate destinations (e.g., ports), in an embodiment. Packets are typically received as a stream of packets. The stream of packets typically includes packets from different flows. The packets include information, for example, in a header, and this information is then used to classify the packets into flows. As shown in FIG. 1, the single processing core receives and processes several packet flows. The packets 1 belong to a known packet flow 1, the packets 3 belong to a known packet flow 3, and the packets 4 belong to a known packet flow 4. Thus, the packets 1, 3 and 4 are referred to as known packets, and belong to known packet flows. The first processing unit 18 processes the packets 1, 3 and 4 to generate processed packets 1′, 3′, and 4′. Packets 2 belong to an unknown packet flow that is processed in the second processing unit 20 to generate processed packets 2′. Thus, the packets 2 are referred to as unknown packets, and belong to unknown packet flows. According to an embodiment, a “known” packet flow refers to a flow of packets which originate from the same source, are directed to the same target or destination, and are of the same type, such as voice, streaming media (video, music, etc.), data, etc. Further, an “unknown” packet flow refers to a flow of packets that originate from an unknown source, are directed to an unknown target or destination, and/or are of an unknown type. The single processing core 10 performs different types of packet processing operations depending on whether the input packet is of a known packet flow or an unknown packet flow.
  • The ingress port 11 a receives a stream of packets belonging to packet flows 1, 2, 3 and 4. The four egress ports 11 b, 11 c, 11 d, and 11 e are ports from which processed packets are egressed from the single processing core 10 after FIQ or slow thread processing respectively. It is understood that more or less than four packet flows may be used according to other embodiments, and that more than one packet flow may be egressed on the same port.
  • According to an embodiment, a known packet, such as the packets 1, 3 and 4 shown in FIG. 1, are already classified into a known packet flow. Packets of the same type (e.g., streaming video) passing through the single processing core 10 may be treated differently if the packets come from different sources or are destined for different targets. For example, packets 1 and 3 may both be the same type (e.g., packet of movies), but packet 1 may be for a different movie than packet 3, and thus packets 1 and 3 may have different sources and destinations. The known packets include information (characteristics) indicating a known destination, and/or may include information indicating a known class/type (e.g., HTTP, VOIP, video streaming, etc.). Since this information is already known when the known data packets are input to the single processing core 10, the single processing core 10 can perform minimal packet processing operations, as compared to a case when the information is unknown.
  • The data packets 2 belonging to an unknown packet flow include information which is unknown about the packets when the packets are received at the single processing core 10. The second data packets 2, therefore, are packets of an unknown data packet flow. For example, the second data packets 2 may have been received from a source that is different from the source of previously received packets, may be directed to a destination that is different from the destination of previously received packets, or may be of a class or type that is unknown or different from the class/type of previously received packets. Since this at least one characteristic is unknown or different from the characteristics of previously received packets when the second data packets 2 are received at the single processing core 10, the single processing core 10 performs more extensive packet processing operations, as compared to a case when the received packet is associated with a previously known flow of packets.
  • The known data packets ( data packets 1, 3 and 4) and unknown data packets (data packets 2) are each part of a respective packet flow, in an embodiment. A “packet flow” according to embodiments may be defined in many different ways according to many different technical standards. For example, a packet flow is defined, in an embodiment, as a stream of packets of a particular type, having a same source and a having same destination. In another example, a packet flow may be defined as an artificial logical equivalent to a call or connection. In another example, a packet flow may be defined as a set of Internet Protocol (IP) packets passing an observation point in a network during a certain time interval. Thus, embodiments disclosed herein may be implemented in accordance with many different types of packets and packet flows.
  • The first path thread 12 (also referred to as the “fast path thread 12”) is implemented in the single processing core 10 and is configured to initially receive input data packets, process the data packets 1, 3 and 4 of the known packet flows, and forward the data packets 2 of the unknown packet flow to the second path thread 14 for processing. At classification unit 16 incoming packets are simply classified as belonging to a known flow (there may be several known flows), or to an unknown flow. If the flow is known (which is typically the majority of packets in an active session), then the packet is processed in the fast path. If the flow is not known, for example when a new flow starts up, the packet is processed in the regular path. In reality, there are typically several flows in any time interval that are known. Similarly, in any time interval there may be one or more flows that are unknown and therefore require more intensive processing, including flow classification, in the second path thread 14.
  • The fast path thread 12 includes a hardware abstraction layer (HAL) network driver 24 a to receive input packets and a HAL network driver 24 b to transmit processed packets to their destinations.
  • The fast path thread 12 (also referred to as the FIQ 12) further includes a classification unit 16 (also referred to as a “packet classifier”). The classification unit 16 receives the input packets from the HAL network driver 24 and classifies the input packets as either known data packets or unknown data packets. The classification unit 16 classifies the packets according to various techniques, such as, for example, parsing out relevant data from a packet header and/or a bit-string hashing technique on parsed header data as described later with reference to FIG. 2. Furthermore, although FIG. 1 illustrates the classification unit 16 as being included within the fast path thread 12, the classification unit 16 is not required to be included within the fast path thread 12, and may instead be located outside of the fast path thread 12 according to other embodiments.
  • The fast path thread 12 further includes a first processing unit 18. The first processing unit 18 receives the known data packets from the classification unit 16 and processes the known data packets. In an embodiment, the first processing unit 18 is configured to process the known data packets using fast interrupt request (FIQ), thereby achieving efficient processing of the known data packets. FIQ processing is a high priority type of processing which is not affected by regular interrupt requests (IRQ) or other actions, such as actions taken by the host operating system (OS). It is noted that other embodiments may use other processing techniques instead of or in addition to FIQ processing to process the known data packets. For example, other processing techniques may be selected based on the context of the packet processing operation, such as, for example, when the processing should be fast, protected, not interrupted, etc.
  • The fast path thread 12 also includes, in an embodiment, a shared memory 22 to store rules which are used to process packets during FIQ processing. These stored rules may be various types, for example, forwarding rules, rules for dropping malicious packets, rules for multicasting, etc. The first processing unit 18 accesses and uses rules stored in the shared memory 22 to process the known data packets.
  • The second path thread 14 (also referred to as the “slow path thread 14”) receives the unknown data packets forwarded from the fast path thread 12, parses information from the packet, typically the packet header, uses the information to determine a flow classification and determine rules, such as forwarding rules, by which to process the present packet and then to be used when processing subsequent packets of the same flow in the FIQ 12. Although the unknown data packets (packets 2) are seen in the figure as being processed by the slow thread, once the flow is classified and the rules are determined, in an alternative embodiment, the rules are provided to the FIQ 12 and the newly classified packet is returned to the stream which is then processed by the FIQ 12 as if belonging to a known flow. In most situations, the quantity of packets processed by the slow path thread 14 is substantially less than the quantity of packets processed by the fast path thread 12, because typically the substantial majority of input packets belong to known flows, such as first data packets 1, 3 and 4. However, since the second data packets 2 include at least one unknown characteristic, the second data packets 2 require additional packet processing operations to be performed, as compared to the known data packets 1, 3 and 4. Additional packet processing operations include, for instance, learning the packet attributes, developing rules for subsequent processing, and updating FIQ processes, in an embodiment.
  • The slow path thread 14 includes a virtual network driver 26 a to receive second data packets 2 forwarded from the classifier 16 in the fast path thread 12 and a virtual network driver 26 b to transmit the second data packets 2 back to the fast path thread 12 after processing, in an embodiment. Flows are directed to an appropriate port among several ports (in a home gateway, for example, there are typically 4 wired ports and several more additional wireless ports). In an embodiment, HAL network driver 24 b is configured as an egress unit that handles transmission to the respective ports 11 b, 11 c, 11 d, and 11 e. Alternatively, each of the respective FIQ and slow threads may handle their own transmission to the respective physical ports.
  • The slow path thread 14 includes a second processing unit 20. The second processing unit 20 receives the second data packets 2 forwarded from the classification unit (which in the embodiment shown in FIG. 1 happens to be in the fast path thread 12) and processes the second data packets 2. The operation of “processing” the second data packets 2, when used in connection with the processing performed by the second processing unit 20, refers to a process of identifying a flow and formatting various rules, such as forwarding rules, corresponding to the flow for processing packets in the flow. The second processing unit 20 is configured to process the second data packets 2 using a host operating system network stack (e.g., Linux, etc.).
  • When an initial packet in a packet flow of second data packets 2 is forwarded to the second processing unit 20 by the classification unit 16, the second processing unit 20 first parses or extracts information from the packet header, then analyzes the extracted information, identifies at least one rule (e.g., from a database) to process the packet, processes the initial packet using the rule, and initiates a process to classify subsequent packets of the same type as known packets instead of unknown packets. In an alternative embodiment, all that the slow thread does is learn the packet to set up a new flow. Setting up a new flow may require learning/classifying several packets all having the same flow information. As a result, packets of the same type as the initial packet (e.g., subsequent packets in the same packet flow), which are received after the initial packet, can be classified by the classification unit 16 as a known data packet belonging to a known data packet flow, instead of an unknown data packet belonging to an unknown data packet flow, as explained in more detail below. Thus, the classification unit 16 may be able to classify packets belonging to the same packet flow only one time, and save information related to the classified packets in a memory. The flow may be saved for a period of time (e.g., several minutes, hours or days), after which the flow may need to be re-learned due to changes in control plane rules affecting the flow.
  • In an embodiment, the first path thread 12 and the second path thread 14 are distinct hardware threads implemented in the same single processing core 10. This process of using the single processing core 10 instead of separate processing cores for each path thread, as is conventionally done, is referred to as hardware virtualization. The single processing core 10 employing hardware virtualization achieves several benefits over the conventional 2-CPU model, including reduced costs.
  • In general, most packet flow traffic relates to only a few flows. Specific flows require very simply handing, and as such, can be handled using a short execution path (e.g., FIQ/fast path thread 12), in an embodiment. Conventionally, classification codes used to classify packet flows and modification codes used to modify packet flows are inter-mixed in a processor, resulting in a large overhead for specific packet flows in the processor. In contrast, according to an embodiment, the processes of classifying and modifying packet flows is divided such that the first path thread 12 classifies the packet flows based on whether the packet flows are known or unknown, and only in the event that a packet flow is determined to be an unknown packet flow, then the packet belonging to an unknown flow is provided to the second path thread 14 which then becomes involved in processing the packet belonging to an unknown flow. Since typically the substantial majority of packet flows are known packet flows, classification and processing of these known packet flows is performed at the fast path thread 12, resulting in a significant performance boost because the fast path thread processes packets faster than the slow path thread. Furthermore, for unknown packet flows, the slow path thread 14 generates a code to process the unknown packet flow after identifying a rule associated with the unknown packet flow, a process referred to as “Just-in-time” (JIT) rule compilation, which is a type of dynamic translation to improve the runtime performance of computer programs by using continuously occurring translation as well as caching of translated code to minimize performance degradation. By employing JIT rule compilation, the single processing core 10 requires fewer memory accesses than a processing core which does not employ JIT rule compilation, further increasing network performance.
  • FIGS. 2A and 2B are block diagrams illustrating additional details of packet classification and packet processing components utilized by the single processing core 10 shown in FIG. 1. FIG. 2A shows the route taken by a known packet (a packet having known characteristics and belonging to a known packet flow). FIG. 2B shows the route taken by an unknown packet (a packet having unknown characteristics and belonging to an unknown packet flow). For illustrative purposes, both FIGS. 2A and 2B illustrate the same components, and the components which are not used for the routing shown in the particular figure are hatched out.
  • As shown in FIG. 2A, the packet classification and packet processing components utilized by the fast path thread 12 to process a known packet include a connections hash table 44. The packet classification and packet processing components are virtual, defined in code, and as such are not located in any fixed physical location, in an embodiment.
  • As shown in FIGS. 2A and 2B, an incoming packet 30 is classified as either part of a known data packet flow or an unknown data packet flow, and based on this classification, the packet 30 either follows a first processing route through the fast path thread 12 (FIG. 2A) or a second processing route through the slow path thread 14 (FIG. 2B).
  • According to an embodiment, to classify an incoming packet 30, the classification unit 16 uses the components shown in FIG. 2A to perform a bit-string hashing technique. To perform the bit-string hashing technique, the classification unit 16 extracts relevant bits of packet headers from the packet 30 (e.g., using a parser) and the relevant bits are arranged into a connection bit-string key 42. Then, the classification unit 16 determines a connections hash 42 a (e.g., “y”) based on the connection bit-string key 42. The classification unit 16 then compares the connections hash 42 a to hashed entries stored in the connections hash table 44, where the hashed entries in the hash table 44 correspond to known data packet flows, respectively. This process is done on a packet-by-packet basis. Thus, for example, the packets numbered 3 (FIG. 1) are part of known packet flow 3 (same type, same source, same destination). Thus, a hashed entry is stored in the connections hash table 44 for known packet flow 3.
  • Thus, the connections hash table 44 is used to determine whether an incoming packet 30 can be determined to have a set of known characteristics that are stored in the table 44. The connection entry mechanism then determines whether there were previously received packets all having the same characteristics, i.e., identifies the packets belonging to a known flow. Then, the first processing unit 18 identifies a connection entry 46 corresponding to the entry in the connections hash table 44 which matches the connection hash 42 a. The connection entry 46 includes information related to the specific type of the packet 30 from among a plurality of types of packets. The packet types may be based on many different criteria, for example, source/destination addresses, packet type (VOIP, HTTP, etc). Then, the first processing unit 16 determines a state 48 corresponding to the connection entry 46. The state 48 includes information associated with the specific type of modifications to be performed on the packet 30 to thereby process the packet 30 (in other words, includes information related to a previously generated rule). Then, the first processing unit 16 determines a packet processing unit 50 corresponding to the state 48. The packet processing unit 50, which is a code, may be considered to be a “modification pipe” which modifies the incoming packet 30. The packet processing unit 50 processes the packet 30 and outputs the packet after processing as a processed packet 52. For example, a forwarding decision is made (e.g., by the packet processing unit 50), and when it is determined that a packet 30 is to be forwarded, the packet processing unit 50 then modifies various header information to be consistent with the forwarding decision and to enable further processing downstream in the network.
  • In an embodiment, each of the packet processing units 50 is a compiled code which makes a forwarding decision and, if necessary, modifies the packet 30 in any of various ways, for example, changing a destination address, changing an encoding/decoding method, etc. Furthermore, a plurality of the packet processing units 50 are provided which correspond to a plurality of the states 48, and the plurality of the packet processing units 50 are stored inside of the shared memory 22 in the fast path thread 12. Thus, since the classification unit 16 and the first processing unit 18 are located in the fast path thread 12, in an embodiment, each packet which is of a known packet flow (e.g., each first data packet 1, 3 and 4) can be classified and processed in the fast path thread 12 without requiring additional processing to be performed in the slow path thread 14, because the packets of a known type are routed through and processed entirely by the fast path thread 12.
  • FIG. 2B shows the route taken by a packet (e.g., packet 2) belonging to an unknown packet flow. As shown in FIG. 2B, the packet classification and packet processing components utilized by the slow path thread 14 to process an unknown packet include a rules hash table 44.
  • As shown in FIG. 2B, when the packet 30 is of an unknown packet flow (e.g., unknown packet type, unknown source, etc.), the connections hash 42 a (e.g., x) of the packet 30 will not match any of the entries stored in the connections hash table 44, and the classification unit 16 will therefore classify the incoming packet 30 as an unknown data packet and forward the packet 30 to the slow path thread 14. Then, the slow path thread 14 extracts relevant bits of packet headers from the packet 30 and arranges the extracted bits into a rule bit-string key 32. Then, the slow path thread 14 determines a rule hash 32 a (e.g., a) based on the rule bit-string key 32. Then, the slow path thread 14 compares the rule hash 32 a to entries stored in the rules hash table 34, the entries indicating rules corresponding to different packet types. In an embodiment, the rules hash table 34 is larger than the connections hash table 44, since the rules hash table 34 stores many different types of rules.
  • When the slow path thread 14 identifies an entry stored in the rules hash table 34 which matches the rule hash 32 a, the slow path thread 14 then determines a rule condition 36 corresponding to the matching entry. This is possible because although the packet 30 is of an unknown packet flow, the packet 30 has certain characteristics which correspond to rules. Then, the second processing unit 20 initiates a process to create/compile a packet processing unit 50 for processing the packet 30, based on information included in the rule condition 36 as well as information included in a library of network primitives which are used to build the packet processing units 50. The process of creating packet processing units 50 for unknown packets is performed in the slow path thread 14 because the slow path thread 14 does not have the same low latency requirements as the fast path thread 12. The slow path thread 14 is able to build the packet processing units 50 for packet flows. Additionally, a connection entry corresponding to the packet 30 of unknown packet flow is created and added to the connections hash table 44, as represented by the arrow between the packet processing unit 50 and the connections hash table 44 shown in FIG. 2B. In other words, feedback is provided from the packet processing unit 50 back to the connections hash table 44 to store a connection entry for the unknown packet. Additionally, an updating operation is performed which updates the action that is to be performed on the packets from the same packet flow as the packet with the newly stored connection entry in the connections hash table 44, as represented by the arrow between the packet processing unit 50 and the connections hash table 44. As a result, when subsequent packets from the same packet flow as the unknown packet are received, the classification unit 16 will classify the other packets as known packet types instead of unknown packet types and perform the appropriate action on the subsequent packets once their flow becomes known.
  • It is understood that other techniques instead of bit-string hashing may also be used to classify the packet flows according to other embodiments.
  • FIG. 3 is a flow diagram illustrating a packet processing technique according to an embodiment. In the following description of FIG. 3, references are made to various components illustrated in FIGS. 1 and 2 for illustrative purposes only. However, these references are by way of example only, and it is understood that the method of FIG. 3 may be performed by components other than those shown in FIGS. 1 and 2, according to other embodiments.
  • In operation 302, the classification unit 16 classifies incoming data packets into first data packets which are known data packets (belonging to a known packet flow) and second data packets which are unknown data packets (belonging to an unknown packet flow). In an embodiment, the classification unit 16 uses a bit-string hashing technique to perform the classification.
  • In operation 304, a first path thread 12 implemented in a single processing core 10 processes ones of the first data packets 1 at least by forwarding the first data packets 1 to a destination that corresponds to a previously determined destination associated with the first data packets 1.
  • In operation 306, a second path thread 14 implemented in the single processing 10 core processes a second data packet 2 at least to determine a new data packet flow for the second data packet.
  • While embodiments have been described, additional variations and modifications of the embodiments may occur to those skilled in the art once they learn of the basic concepts. Therefore, it is intended that the appended claims shall be construed to include both the above embodiments and all such variations and modifications that fall within the spirit and scope of the concepts of the embodiments.

Claims (20)

What is claimed is:
1. A processing core, comprising:
a packet classifier, implemented in a single processing core, configured to classify incoming data packets into first data packets of a known data packet flow and second data packets of an unknown data packet flow, wherein the packet classifier is further configured to:
form an identifier using information obtained from a packet header of one of the incoming data packets, the information comprising an attribute of the one incoming data packet; and
map the one incoming data packet to one of the known data packet flow or the unknown data packet flow based on the attribute;
a first path thread, implemented in the single processing core, configured to process ones of the first data packets mapped to the known data packet flow by performing a first processing operation in a protected path, the protected path protecting the first processing operation from actions that interrupt the first processing operation; and
a second path thread, implemented in the single processing core, configured to process a received second data packet by performing a second processing operation that determines a new data packet flow for the second data packet.
2. The processing core of claim 1, wherein the first processing operation comprises making a forwarding decision to decide a forwarding destination for the first data packets mapped to the known data packet flow, the forwarding destination comprising a previously determined destination associated with others of the first data packets mapped to the known data packet flow.
3. The processing core of claim 1, wherein the attribute of the one incoming data packet comprises a destination of the one incoming data packet, and the packet classifier maps the one incoming data packet to the known data packet flow in response to determining that the attribute of the one incoming data packet indicates a known destination, and maps the one incoming data packet to the unknown data packet flow in response to determining that the attribute of the one incoming data packet indicates an unknown destination.
4. The processing core of claim 1, wherein the protected path protects the first processing operation from actions that interrupt the first processing operation by prioritizing the first processing operation over a regular processing operation having a lower priority than a priority of the first processing operation.
5. The processing core of claim 1, wherein the protected path implements fast interrupt requests (FIQ).
6. The processing core of claim 1, wherein the processing core comprises an Advanced RISC Machine (ARM) architecture.
7. The processing core of claim 1, wherein the known data packet flow comprises an artificial logical equivalent to a call or a connection.
8. The processing core of claim 1, wherein the second processing operation comprises performing a Just-in-time (JIT) rule compilation based on the second data packet.
9. The processing core of claim 8, wherein the JIT rule compilation comprises determining a rule to process the second data packet based on an attribute of the second data packet.
10. The processing core of claim 1, wherein the first processing operation satisfies a first latency requirement, and the second processing operation satisfies a second latency requirement different from the first latency requirement.
11. The processing core of claim 10, wherein the first latency requirement is a lower latency than the second latency requirement.
12. A method of data processing, comprising:
classifying, in a single processing core, incoming data packets into first data packets of a known data packet flow and second data packets of an unknown data packet flow, wherein classifying comprises:
forming an identifier using information obtained from a packet header of one of the incoming data packets, the information comprising an attribute of the one incoming data packet; and
mapping the one incoming data packet to one of the known data packet flow or the unknown data packet flow based on the attribute;
processing, in the single processing core, ones of the first data packets mapped to the known data packet flow by performing a first processing operation in a protected path, the protected path protecting the first processing operation from actions that interrupt the first processing operation; and
processing, in the single processing core, a received second data packet by performing a second processing operation that determines a new data packet flow for the second data packet.
13. The method of claim 12, wherein the first processing operation comprises making a forwarding decision to decide a forwarding destination for the first data packets mapped to the known data packet flow, the forwarding destination comprising a previously determined destination associated with others of the first data packets mapped to the known data packet flow.
14. The method of claim 12, wherein the attribute of the one incoming data packet comprises a destination of the one incoming data packet, and the mapping comprises mapping the one incoming data packet to the known data packet flow in response to determining that the attribute of the one incoming data packet indicates a known destination, and mapping the one incoming data packet to the unknown data packet flow in response to determining that the attribute of the one incoming data packet indicates an unknown destination.
15. The method of claim 12, wherein the protected path protects the first processing operation from actions that interrupt the first processing operation by prioritizing the first processing operation over a regular processing operation having a lower priority than a priority of the first processing operation.
16. The method of claim 12, wherein the protected path implements fast interrupt requests (FIQ).
17. The method of claim 12, wherein the processing core comprises an Advanced RISC Machine (ARM) architecture.
18. The method of claim 12, wherein the known data packet flow comprises an artificial logical equivalent to a call or a connection.
19. The method of claim 12, wherein the second processing operation comprises performing a Just-in-time (JIT) rule compilation based on the second data packet.
20. The method of claim 19, wherein the JIT rule compilation comprises determining a rule to process the second data packet based on an attribute of the second data packet.
US14/846,190 2011-11-18 2015-09-04 Data path acceleration using hw virtualization Abandoned US20150381481A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US14/846,190 US20150381481A1 (en) 2011-11-18 2015-09-04 Data path acceleration using hw virtualization

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201161561392P 2011-11-18 2011-11-18
US13/681,139 US9130776B2 (en) 2011-11-18 2012-11-19 Data path acceleration using HW virtualization
US14/846,190 US20150381481A1 (en) 2011-11-18 2015-09-04 Data path acceleration using hw virtualization

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US13/681,139 Continuation US9130776B2 (en) 2011-11-18 2012-11-19 Data path acceleration using HW virtualization

Publications (1)

Publication Number Publication Date
US20150381481A1 true US20150381481A1 (en) 2015-12-31

Family

ID=48426884

Family Applications (2)

Application Number Title Priority Date Filing Date
US13/681,139 Active 2032-11-21 US9130776B2 (en) 2011-11-18 2012-11-19 Data path acceleration using HW virtualization
US14/846,190 Abandoned US20150381481A1 (en) 2011-11-18 2015-09-04 Data path acceleration using hw virtualization

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US13/681,139 Active 2032-11-21 US9130776B2 (en) 2011-11-18 2012-11-19 Data path acceleration using HW virtualization

Country Status (2)

Country Link
US (2) US9130776B2 (en)
WO (1) WO2013072773A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160219009A1 (en) * 2013-09-20 2016-07-28 Tcs John Huxley Europe Limited Messaging system

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013072773A2 (en) * 2011-11-18 2013-05-23 Marvell World Trade Ltd. Data path acceleration using hw virtualization
US9413651B2 (en) * 2012-12-14 2016-08-09 Broadcom Corporation Selective deep packet inspection
US9489445B2 (en) * 2013-03-13 2016-11-08 Nice Systems Ltd. System and method for distributed categorization
US9912555B2 (en) * 2013-03-15 2018-03-06 A10 Networks, Inc. System and method of updating modules for application or content identification
US9722918B2 (en) 2013-03-15 2017-08-01 A10 Networks, Inc. System and method for customizing the identification of application or content type
KR101467942B1 (en) * 2013-04-24 2014-12-02 주식회사 윈스 Fast Application Recognition System and Processing Method Therof
US9906422B2 (en) 2014-05-16 2018-02-27 A10 Networks, Inc. Distributed system to determine a server's health
US9787581B2 (en) 2015-09-21 2017-10-10 A10 Networks, Inc. Secure data flow open information analytics
US9838278B2 (en) * 2016-02-26 2017-12-05 Guavus, Inc. Self-learning device classifier
DE112017003494T5 (en) * 2016-07-11 2019-05-02 Harmonic Inc. MULTI-CORE SOFTWARE FORWARDING
US10812348B2 (en) 2016-07-15 2020-10-20 A10 Networks, Inc. Automatic capture of network data for a detected anomaly
US10341118B2 (en) 2016-08-01 2019-07-02 A10 Networks, Inc. SSL gateway with integrated hardware security module
US10250475B2 (en) 2016-12-08 2019-04-02 A10 Networks, Inc. Measurement of application response delay time
US10397270B2 (en) 2017-01-04 2019-08-27 A10 Networks, Inc. Dynamic session rate limiter
US10187377B2 (en) 2017-02-08 2019-01-22 A10 Networks, Inc. Caching network generated security certificates
US11494212B2 (en) * 2018-09-27 2022-11-08 Intel Corporation Technologies for adaptive platform resource assignment
US11399312B2 (en) * 2019-08-13 2022-07-26 International Business Machines Corporation Storage and retention intelligence in mobile networks
US11425230B2 (en) * 2021-01-28 2022-08-23 Mellanox Technologies, Ltd. Efficient parsing tuned to prevalent packet types
WO2022197483A1 (en) * 2021-03-16 2022-09-22 Arris Enterprises Llc Preservation of priority traffic in communications systems
US11711453B2 (en) 2021-10-24 2023-07-25 Mellanox Technologies, Ltd. Template-based packet parsing
US20230412496A1 (en) * 2022-06-21 2023-12-21 Oracle International Corporation Geometric based flow programming

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6226680B1 (en) * 1997-10-14 2001-05-01 Alacritech, Inc. Intelligent network interface system method for protocol processing
US20040240447A1 (en) * 2003-05-28 2004-12-02 Dorbolo Riccardo G. Method and system for identifying bidirectional packet flow
US20050050221A1 (en) * 2003-08-27 2005-03-03 Tasman Mitchell Paul Systems and methods for forwarding data units in a communications network
US7302667B1 (en) * 2004-04-15 2007-11-27 Altera Corporation Methods and apparatus for generating programmable device layout information
US20080002579A1 (en) * 2004-12-21 2008-01-03 Fredrik Lindholm Arrangement and a Method Relating to Flow of Packets in Communication Systems
US20080082789A1 (en) * 2006-08-14 2008-04-03 Jack Kang Interrupt handling
US20090328219A1 (en) * 2008-06-27 2009-12-31 Juniper Networks, Inc. Dynamic policy provisioning within network security devices
US9130776B2 (en) * 2011-11-18 2015-09-08 Marvell World Trade, Ltd. Data path acceleration using HW virtualization

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6876653B2 (en) * 1998-07-08 2005-04-05 Broadcom Corporation Fast flexible filter processor based architecture for a network device
US6598034B1 (en) * 1999-09-21 2003-07-22 Infineon Technologies North America Corp. Rule based IP data processing
US7209996B2 (en) * 2001-10-22 2007-04-24 Sun Microsystems, Inc. Multi-core multi-thread processor
US7804785B2 (en) * 2002-04-19 2010-09-28 Avaya Inc. Network system having an instructional sequence for performing packet processing and optimizing the packet processing
US7337241B2 (en) * 2002-09-27 2008-02-26 Alacritech, Inc. Fast-path apparatus for receiving data corresponding to a TCP connection
US7657706B2 (en) * 2003-12-18 2010-02-02 Cisco Technology, Inc. High speed memory and input/output processor subsystem for efficiently allocating and using high-speed memory and slower-speed memory
JP4373887B2 (en) * 2004-09-30 2009-11-25 Okiセミコンダクタ株式会社 Packet processing method
US8254285B2 (en) * 2005-02-25 2012-08-28 Ip Infusion, Inc. Hardware abstraction layer
US7508764B2 (en) * 2005-09-12 2009-03-24 Zeugma Systems Inc. Packet flow bifurcation and analysis
US20070226795A1 (en) * 2006-02-09 2007-09-27 Texas Instruments Incorporated Virtual cores and hardware-supported hypervisor integrated circuits, systems, methods and processes of manufacture
CN102667725B (en) * 2010-01-13 2015-09-16 马维尔以色列(M.I.S.L.)有限公司 For the hardware virtualization of media processing

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6226680B1 (en) * 1997-10-14 2001-05-01 Alacritech, Inc. Intelligent network interface system method for protocol processing
US20040240447A1 (en) * 2003-05-28 2004-12-02 Dorbolo Riccardo G. Method and system for identifying bidirectional packet flow
US20050050221A1 (en) * 2003-08-27 2005-03-03 Tasman Mitchell Paul Systems and methods for forwarding data units in a communications network
US7302667B1 (en) * 2004-04-15 2007-11-27 Altera Corporation Methods and apparatus for generating programmable device layout information
US20080002579A1 (en) * 2004-12-21 2008-01-03 Fredrik Lindholm Arrangement and a Method Relating to Flow of Packets in Communication Systems
US20080082789A1 (en) * 2006-08-14 2008-04-03 Jack Kang Interrupt handling
US20090328219A1 (en) * 2008-06-27 2009-12-31 Juniper Networks, Inc. Dynamic policy provisioning within network security devices
US9130776B2 (en) * 2011-11-18 2015-09-08 Marvell World Trade, Ltd. Data path acceleration using HW virtualization

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160219009A1 (en) * 2013-09-20 2016-07-28 Tcs John Huxley Europe Limited Messaging system

Also Published As

Publication number Publication date
US20130128885A1 (en) 2013-05-23
US9130776B2 (en) 2015-09-08
WO2013072773A2 (en) 2013-05-23
WO2013072773A3 (en) 2013-07-18

Similar Documents

Publication Publication Date Title
US9130776B2 (en) Data path acceleration using HW virtualization
US11677851B2 (en) Accelerated network packet processing
US11418632B2 (en) High speed flexible packet classification using network processors
US10097466B2 (en) Data distribution method and splitter
EP3314827B1 (en) Method and system for managing data traffic in a computing network
US7760733B1 (en) Filtering ingress packets in network interface circuitry
US10778583B2 (en) Chained longest prefix matching in programmable switch
US8818921B2 (en) Content search mechanism that uses a deterministic finite automata (DFA) graph, a DFA state machine, and a walker process
US8284776B2 (en) Recursive packet header processing
US7852850B2 (en) Double-hash lookup mechanism for searching addresses in a network device
KR101969194B1 (en) Offloading packet processing for networking device virtualization
US9385957B1 (en) Flow key lookup involving multiple simultaneous cam operations to identify hash values in a hash bucket
CN111371779A (en) Firewall based on DPDK virtualization management system and implementation method thereof
US9356844B2 (en) Efficient application recognition in network traffic
WO2021226948A1 (en) Data packet processing method and device based on open virtual switch (ovs)
US20130138920A1 (en) Method and apparatus for packet processing and a preprocessor
WO2015101113A1 (en) Packet processing method and device
TWI593256B (en) Methods and systems for flexible packet classification
US9817769B1 (en) Methods and apparatus for improved access to shared memory
US20220217085A1 (en) Server fabric adapter for i/o scaling of heterogeneous and accelerated compute systems
Okuno et al. Cache-based network processor architecture: Evaluation with real network traffic
Chen et al. HBD: Towards efficient reactive rule dispatching in software-defined networks

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION