US20170365341A1 - Algorithmic n search/m write ternary content addressable memory (tcam) - Google Patents
- Publication number
- US20170365341A1 (application US15/185,956)
- Authority
- US
- United States
- Prior art keywords
- cam
- matrix
- search
- operations
- instances
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11C—STATIC STORES
- G11C15/00—Digital stores in which information comprising one or more characteristic parts is written into the store and in which information is read-out by searching for one or more of these characteristic parts, i.e. associative or content-addressed stores
- G11C15/04—Digital stores in which information comprising one or more characteristic parts is written into the store and in which information is read-out by searching for one or more of these characteristic parts, i.e. associative or content-addressed stores using semiconductor elements
Definitions
- the present disclosure relates to a content addressable memory (CAM), and more particularly, to an algorithmic ternary content addressable memory (TCAM) that instantiates multiple copies of X-Y TCAM instances.
- CAM content addressable memory
- TCAM ternary content addressable memory
- a CAM is designed such that the user supplies a data word and the CAM searches its entire memory to see whether that data word is stored in the CAM. If the data word is found, the CAM returns a list of one or more storage addresses where the word was found.
- a CAM is designed to search its entire memory in a single operation, which makes a CAM faster than Random Access Memory (RAM) in most search operations.
- RAM Random Access Memory
- each individual memory bit in a parallel CAM must have its own associated comparison circuit to detect a match between the stored bit and the input bit. Further, match outputs from each cell in the data word must be combined to yield a complete data word match signal.
- High performance caches require large and fully associative translation lookaside buffers (TLBs).
- TLBs translation lookaside buffers
- a TCAM allows three matching states, which include 1, 0, and “X” (i.e., don't care) for one or more bits in the stored data word.
- X i.e., don't care
- Another alternative is to implement the TCAM using Bit-Stacked-Latches.
- the TCAM instances and multi-port options using the Bit-Stacked-Latches are area inefficient and limit performance.
- a structure includes a content addressable memory (CAM) and a matrix which is structured to instantiate multiple replicated copies of the CAM in a row direction and a column direction of the array.
- a structure comprises a matrix of multiple replicated copies of a content addressable memory (CAM) structure comprising a plurality of write rows and search columns structured such that multiple operations are captured at a time allowing for a search command to be executed in the first cycle and a write operation to be executed in the second cycle.
- a method comprises: executing multiple searches in parallel in replicated CAM instances, in a first cycle; and executing multiple writes or reads in parallel in the replicated CAM instances, in a second cycle.
- FIG. 1 shows a CAM structure in accordance with aspects of the present disclosure.
- FIG. 2 shows a first cycle of executing a search operation using the CAM structure in accordance with aspects of the present disclosure.
- FIG. 3 shows a second cycle of executing a write operation using the CAM structure in accordance with aspects of the present disclosure.
- FIG. 4 shows a CAM structure in accordance with additional aspects of the present disclosure.
- FIG. 5 shows a flowchart of steps using the CAM structure in accordance with aspects of the present disclosure.
- FIG. 6 shows another flowchart of steps using the CAM structure in accordance with aspects of the present disclosure.
- the present disclosure relates to content addressable memories (CAM), and more particularly, to an algorithmic ternary content addressable memory (TCAM) that instantiates multiple copies of X-Y TCAM instances.
- TCAM ternary content addressable memory
- the present disclosure uses an algorithmic TCAM that instantiates multiple copies of X-Y TCAM instances to achieve the required N-search and M-write per cycle TCAM operations.
- the present disclosure allows for execution of three searches and two writes per cycle.
- the present disclosure provides improved search performance of the searchable TCAM structure by replicating multiple copies of X-Y TCAM to achieve an N-search and M-write per cycle TCAM.
- a method can implement a fully coherent algorithmic multi-port CAM solution using multiple one-port CAM instances, where an M-write/N-search CAM is implemented using an array of 2*N by 2*M CAM instances.
- two operations, 2*M writes and 2*N searches can be captured every second cycle. Also, each search operation is supplied to one of the 2*N search columns, and each write operation is supplied to one of the 2*M write rows.
- two sets of operations can be captured at a time allowing for a search command to be executed in the first cycle and a write operation to be executed in the second cycle. Further, a comparison between the write operations can be performed to maintain coherency during the search execution. Then, after the search is executed, the write operation can be executed based on the results of the search operation.
- the processes herein can execute N searches and M writes, capture them in one cycle and prioritize all searches and prioritize the writes in subsequent writes.
- an X-Y TCAM instance can be addressed by increasing a number of cycles to write 1 for the X portion and 1 for the Y portion of the TCAM.
- the CAM can capture 3*N number of operations with the CAM enlarged (i.e., grown) to have 3*N columns and 2*M rows.
- two cycles can perform the write operation.
- the array can comprise a matrix of 2*M write rows by 2*N search columns to capture 2*(N+M) operations at a time.
- FIG. 1 shows a CAM structure in accordance with aspects of the present disclosure.
- the CAM structure 100 may be a Ternary CAM (TCAM) which comprises data search words comprising 1s, 0s, and “X” or “don't care”, for one or more bits in the stored word.
- TCAM Ternary CAM
- the CAM structure 100 of FIG. 1 may be a binary CAM which comprises data search words comprising entirely of 1s and 0s, for one or more bits in the stored word.
- the CAM structure 100 includes an array or matrix of replicated CAM instances C 1 -C 24 (e.g., C 1 , C 2 , . . . Cn).
- the array or matrix of CAM instances C 1 -C 24 can be a matrix of 2*N by 2*M, where N and M represent replicated instances of the CAM instance C 1 .
- the CAM structure 100 can include a plurality of columns 100 A, 100 B, 100 C, 100 D, 100 E, and 100 F and a plurality of rows 200 A, 200 B, 200 C, and 200 D, comprising replicated CAM instances represented by C 2 -C 24 . That is, the plurality of rows and columns are replicated CAM instances of the CAM instance C 1 .
- CAM instances C 1 -C 24 are physical hardware circuits.
- each CAM is 512 words × 40 bits, as an example.
- FIG. 1 shows a 4×6 matrix of 512×40 byte CAM instances, it should be understood that any matrix is contemplated by the present disclosure.
- the CAM structure 100 can be implemented as an N×M matrix of 512×40 value TCAM using 512×80 instances of X-Y TCAM, where a value TCAM requires one clock cycle to update a TCAM entry, and X-Y TCAM requires two clock cycles to update a TCAM entry.
- the CAM instance C 1 is representative of a 512×40 byte CAM
- embodiments of the present disclosure can include other CAM configurations, such as a 512×80 byte CAM, a 512×160 byte CAM (i.e., 2 entry/match line implemented with a 4×3 matrix), or any other CAM configurations.
- search 350 includes six operations represented by “A”, “B”, “C”, “D”, “E”, and “F”.
- six operations are captured at a time.
- “A”, “B”, “C”, “D”, “E”, and “F” may comprise any combination of a data word or multiple data words.
- each of the operations “A”, “B”, “C”, “D”, “E”, and “F” can represent different combinations of a data word or multiple data words from the remaining operations.
- search 350 will search whether “A”, “B”, “C”, “D”, “E”, and “F” are stored in the CAM structure 100 . If the six operations (i.e., “A”, “B”, “C”, “D”, “E”, and “F”) are stored in the CAM structure 100 , a hit will be returned and no write operation need be performed. Alternatively, if any of the six operations (i.e., “A”, “B”, “C”, “D”, “E”, and “F”) are not stored in the CAM structure 100 , a miss will be returned, and a write operation will be performed as described herein. For example, the write operation can include writing “A”, “B”, “D”, and “E”, each of which would be duplicated across a respective row.
- each of the words A, B, C, D, E, and F is independent in nature: if A misses, but B, C, D, E, and F match, then A is the only one that will need to be updated.
- the processes disclosed herein can execute multiple searches in parallel in replicated CAM instances, in a first cycle, and execute multiple writes or reads in parallel in the replicated CAM instances, in a second cycle, independent of the search operations. Also, if the CAM write or search operations take multiple cycles, the CAM matrix can grow the rows or columns necessary to accommodate the parallelism.
- the write operations can be different from the search operations (e.g., “A”, “B”, “C”).
- the search operations can include “A”, “B”, “C”, “D”, “E”, and “F”
- the write operations can include “G”, “H”, “I”, and “J”.
- FIG. 2 shows a first cycle of executing a search in the CAM structure in accordance with aspects of the present disclosure.
- the CAM structure 100 is searched using the search 350 , e.g., each of the operations “A”, “B”, “C”, “D”, “E”, and “F” by respective columns 100 A- 100 F.
- column 100 A is searched using the operation “A”
- column 100 B is searched using the operation “B”, etc.
- if one of the CAM instances of column 100 A (e.g., C 1 , C 7 , C 13 , and C 19 ) has data words which match the data words of operation “A”, then the data words of operation “A” do not need to be written on the next cycle
- if one of the CAM instances of column 100 B (e.g., C 2 , C 8 , C 14 , and C 20 ) has data words which match the data words of operation “B”, then the data words of operation “B” do not need to be written on the next cycle
- if one of the CAM instances of column 100 C (e.g., C 3 , C 9 , C 15 , and C 21 ) has data words which match the data words of operation “C”, then the data words of operation “C” do not need to be written on the next cycle
- if one of the CAM instances of column 100 D (e.g., C 4 , C 10 , C 16 , and C 22 ) has data words which match the data words of operation “D”, then the data words of operation “D” do not need to be written on the next cycle
- one of the CAM instances of column 100 A has data words which match the data words of operation “A”
- one of the CAM instances of column 100 B has data words which match the data words of operation “B”
- one of the CAM instances of column 100 C has data words which match the data words of operation “C”
- one of the CAM instances of column 100 D has data words which match the data words of operation “D”
- one of the CAM instances of column 100 E has data words which match the data words of operation “E”
- one of the CAM instances of column 100 F has data words which match the data words of operation “F”
- “A”, “B”, “C”, “D”, “E”, and “F” are illustrative, non-limiting examples of a search, and other operations (e.g., searches 360 and 370 ) can also be performed within the context of the present disclosure. Also, although “A”, “B”, “C”, “D”, “E”, and “F” in search 350 have been described as operations, one of ordinary skill in the art would understand that each of “A”, “B”, “C”, “D”, “E”, and “F” can also be referred to as search keys.
- FIG. 3 shows a second cycle of executing a write operation in the CAM structure in accordance with aspects of the present disclosure.
- embodiments of the present disclosure assume that the data words of operations “A”, “B”, “C”, “D”, “E”, and “F” are not matched, a miss is returned, and write operations of “A”, “B”, “D”, and “E” will be written.
- the write operations of “A”, “B”, “D”, and “E” will be written in respective rows 200 A, 200 B, 200 C, and 200 D comprising replicated CAM instances.
- a data word of “A” operation and duplicate copies will be written in a row 200 A.
- a data word of “B” operation and duplicate copies will be written in row 200 B.
- a data word of “D” operation and duplicate copies will be written in a row 200 C, and a data word of “E” operation and duplicate copies will be written in a row 200 D.
- each word would be duplicated across their respective row of N number of TCAMs.
- the write operations of “A”, “B”, “D”, and “E” in FIG. 3 allow evaluation to occur in parallel in the next set of searching.
- the write operations can occur in different memory areas of the CAM structure 100 than memory areas which are searched.
- FIG. 4 shows a CAM structure in accordance with additional aspects of the present disclosure.
- the CAM structure 100 ′ of FIG. 4 may be a TCAM or, alternatively, a binary CAM.
- the CAM structure 100 ′ comprises a CAM instance C 1 ′ which is replicated in the row and column directions.
- the CAM instance C 1 ′ can be replicated in a 2*M×2*N matrix, where M and N can be any integer value.
- M can be any integer value.
- N is an integer value greater than or equal to an integer value of M.
- the writes are not conditional on a hit or miss from the search, there need be no relationship between N and M, i.e., it is possible to have 100 searches and 2 writes, or 2 searches and 100 writes.
- the CAM instance C 1 ′ is a 512×40 byte CAM; although other CAM configurations are contemplated herein.
- the CAM instance C 1 ′ can be a 512×80 byte CAM, a 512×160 byte CAM (i.e., 2 entry/match line implemented with a 4×3 matrix), or any other configuration. In this way, multiple copies of the CAM instance C 1 ′ are instantiated in physical hardware circuitry in a 2*M×2*N matrix value CAM.
- the CAM structure 100 ′ executes N searches and M writes every cycle on a, e.g., 512×40 bytes CAM.
- 2N operations are captured at a time.
- FIG. 5 shows a flowchart of steps using the CAM structure in accordance with aspects of the present disclosure.
- a 2*N×2*M array of CAMs is instantiated in physical hardware circuitry to implement N-search/M-write per cycle CAM arrays.
- the CAM arrays are set up such that the CAM arrays capture two cycles' worth of search and write operations at one time.
- each search key that was captured in step 510 is applied to a unique column of the CAM array in parallel with step 510 , and then the search results are combined across a CAM column.
- each write is applied to a unique row of the CAM arrays in parallel and then the data is written across all the CAMs in that row.
- the process is completed.
- FIG. 6 shows another flowchart of steps using the CAM structure in accordance with aspects of the present disclosure.
- a 2*N×2*M array of CAMs is instantiated in physical hardware to implement N-search/M-write per cycle CAM arrays.
- the CAM arrays are set up such that the CAM arrays capture two cycles' worth of search and write operations at one time.
- each write data can be applied to a unique row of the CAM arrays in parallel and then the data is written across all the CAMs in that row.
- each search key that was captured in the step 610 is applied to a unique column of the CAM array, and then the search results are combined across a CAM column.
- the process is completed.
- the searchable CAM structure of the present disclosure can be manufactured in a number of ways using a number of different tools.
- the methodologies and tools are used to form structures with dimensions in the micrometer and nanometer scale.
- the methodologies, i.e., technologies, employed to manufacture the searchable CAM structure of the present disclosure have been adopted from integrated circuit (IC) technology.
- the structures are built on wafers and are realized in films of material patterned by photolithographic processes on the top of a wafer.
- the fabrication of the searchable CAM structure uses three basic building blocks: (i) deposition of thin films of material on a substrate, (ii) applying a patterned mask on top of the films by photolithographic imaging, and (iii) etching the films selectively to the mask.
- the method(s) as described above is used in the fabrication of integrated circuit chips.
- the resulting integrated circuit chips can be distributed by the fabricator in raw wafer form (that is, as a single wafer that has multiple unpackaged chips), as a bare die, or in a packaged form.
- the chip is mounted in a single chip package (such as a plastic carrier, with leads that are affixed to a motherboard or other higher level carrier) or in a multichip package (such as a ceramic carrier that has either or both surface interconnections or buried interconnections).
- the chip is then integrated with other chips, discrete circuit elements, and/or other signal processing devices as part of either (a) an intermediate product, such as a motherboard, or (b) an end product.
- the end product can be any product that includes integrated circuit chips, ranging from toys and other low-end applications to advanced computer products having a display, a keyboard or other input device, and a central processor.
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Memory System (AREA)
Abstract
Description
- The present disclosure relates to a content addressable memory (CAM), and more particularly, to an algorithmic ternary content addressable memory (TCAM) that instantiates multiple copies of X-Y TCAM instances.
- Content Addressable Memory (CAM) is a special type of computer memory used in high-speed searching operations. A CAM is designed such that the user supplies a data word and the CAM searches its entire memory to see whether that data word is stored in the CAM. If the data word is found, the CAM returns a list of one or more storage addresses where the word was found.
- A CAM is designed to search its entire memory in a single operation, which makes a CAM faster than Random Access Memory (RAM) in most search operations. However, in a CAM, each individual memory bit in a parallel CAM must have its own associated comparison circuit to detect a match between the stored bit and the input bit. Further, match outputs from each cell in the data word must be combined to yield a complete data word match signal.
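- As a rough software illustration of the behavior described above, the following sketch models a binary CAM lookup: each bit cell compares a stored bit with the input bit, the per-cell results are combined into one word-level match signal, and every matching address is returned. The function name `cam_search`, the integer word encoding, and the `width` parameter are assumptions made for illustration; real hardware evaluates all entries at once, whereas this loop only imitates the result.

```python
def cam_search(stored_words, key, width):
    """Return every address whose stored word equals `key` (hypothetical helper)."""
    hits = []
    for address, word in enumerate(stored_words):
        # One comparison per bit cell: stored bit versus input bit.
        bit_matches = [((word >> i) & 1) == ((key >> i) & 1) for i in range(width)]
        # Combine the per-cell outputs into a single word-level match signal.
        if all(bit_matches):
            hits.append(address)
    return hits


memory = [0b1010, 0b0111, 0b1010, 0b0000]
print(cam_search(memory, 0b1010, width=4))  # [0, 2]
```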
- High performance caches require large and fully associative translation lookaside buffers (TLBs). In order to perform multiple N searches and M writes in a cycle in a high performance cache, one alternative is to create a multi-write-multi-search ternary CAM (TCAM) cell that would support the required performance. A TCAM allows three matching states, which include 1, 0, and “X” (i.e., don't care) for one or more bits in the stored data word. However, this approach requires dedicated TCAM cells, which is very costly in terms of hardware circuitry. Another alternative is to implement the TCAM using Bit-Stacked-Latches. However, the TCAM instances and multi-port options using the Bit-Stacked-Latches are area inefficient and limit performance.
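- The three matching states can be illustrated with a small sketch. Stored entries are written here as strings over "0", "1", and "X", which is a notation assumed for this example rather than anything specified by the disclosure; a stored "X" matches either input bit.

```python
def tcam_match(pattern, key):
    """True if `key` (a '0'/'1' string) matches `pattern` (a '0'/'1'/'X' string)."""
    if len(pattern) != len(key):
        raise ValueError("pattern and key must have the same width")
    # A stored 'X' is a don't-care; any other stored bit must equal the key bit.
    return all(p == "X" or p == k for p, k in zip(pattern, key))


def tcam_search(entries, key):
    """Return the addresses of all stored patterns that match `key`."""
    return [addr for addr, pattern in enumerate(entries) if tcam_match(pattern, key)]


entries = ["10X1", "1XX0", "0000"]
print(tcam_search(entries, "1011"))  # [0]
print(tcam_search(entries, "1100"))  # [1]
```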
- In an aspect of the disclosure, a structure includes a content addressable memory (CAM) and a matrix which is structured to instantiate multiple replicated copies of the CAM in a row direction and a column direction of the array.
- In another aspect of the disclosure, a structure comprises a matrix of multiple replicated copies of a content addressable memory (CAM) structure comprising a plurality of write rows and search columns structured such that multiple operations are captured at a time allowing for a search command to be executed in the first cycle and a write operation to be executed in the second cycle.
- In another aspect of the disclosure, a method comprises: executing multiple searches in parallel in replicated CAM instances, in a first cycle; and executing multiple writes or reads in parallel in the replicated CAM instances, in a second cycle.
- The present disclosure is described in the detailed description which follows, in reference to the noted plurality of drawings by way of non-limiting examples of exemplary embodiments of the present disclosure.
- FIG. 1 shows a CAM structure in accordance with aspects of the present disclosure.
- FIG. 2 shows a first cycle of executing a search operation using the CAM structure in accordance with aspects of the present disclosure.
- FIG. 3 shows a second cycle of executing a write operation using the CAM structure in accordance with aspects of the present disclosure.
- FIG. 4 shows a CAM structure in accordance with additional aspects of the present disclosure.
- FIG. 5 shows a flowchart of steps using the CAM structure in accordance with aspects of the present disclosure.
- FIG. 6 shows another flowchart of steps using the CAM structure in accordance with aspects of the present disclosure.
- The present disclosure relates to content addressable memories (CAM), and more particularly, to an algorithmic ternary content addressable memory (TCAM) that instantiates multiple copies of X-Y TCAM instances. In particular, the present disclosure uses an algorithmic TCAM that instantiates multiple copies of X-Y TCAM instances to achieve the required N-search and M-write per cycle TCAM operations. In a specific example, the present disclosure allows for execution of three searches and two writes per cycle. Advantageously, the present disclosure provides improved search performance of the searchable TCAM structure by replicating multiple copies of X-Y TCAM to achieve an N-search and M-write per cycle TCAM.
- In embodiments of the present disclosure, a method can implement a fully coherent algorithmic multi-port CAM solution using multiple one-port CAM instances, where an M-write/N-search CAM is implemented using an array of 2*N by 2*M CAM instances. Further, in embodiments of the present disclosure, two operations, 2*M writes and 2*N searches, can be captured every second cycle. Also, each search operation is supplied to one of the 2*N search columns, and each write operation is supplied to one of the 2*M write rows.
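- The sizing rule in this paragraph can be checked with a short sketch. The helper names `matrix_shape` and `assign_lanes` are hypothetical; the example values N = 3 and M = 2 are taken from the three-search, two-write example given elsewhere in the disclosure.

```python
def matrix_shape(n_searches, m_writes):
    """Return (write_rows, search_columns) for an M-write/N-search CAM."""
    return 2 * m_writes, 2 * n_searches


def assign_lanes(operations, lane_count):
    """Give each captured operation its own lane (a column or a row) index."""
    if len(operations) > lane_count:
        raise ValueError("more operations than lanes")
    return {lane: op for lane, op in enumerate(operations)}


rows, cols = matrix_shape(n_searches=3, m_writes=2)
print(rows, cols, rows * cols)  # 4 6 24
search_lanes = assign_lanes(["A", "B", "C", "D", "E", "F"], cols)
write_lanes = assign_lanes(["A", "B", "D", "E"], rows)
```

With these values the matrix has 4 write rows and 6 search columns, for 24 one-port CAM instances in total.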
- In more specific embodiments of the present disclosure, two sets of operations can be captured at a time allowing for a search command to be executed in the first cycle and a write operation to be executed in the second cycle. Further, a comparison between the write operations can be performed to maintain coherency during the search execution. Then, after the search is executed, the write operation can be executed based on the results of the search operation. In embodiments, the processes herein can execute N searches and M writes, capture them in one cycle and prioritize all searches and prioritize the writes in subsequent writes.
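- One possible reading of this two-cycle flow is sketched below. It is a simplified software model, not the disclosed circuit: the stored contents are abstracted as a Python set, and the rule "write only the words that missed" is an assumption about how the write depends on the search results. The name `two_cycle_step` is hypothetical.

```python
def two_cycle_step(stored, search_keys, pending_writes):
    """Model one captured batch: searches in cycle 1, writes in cycle 2."""
    # Cycle 1: every search sees the contents as they were before any write.
    hits = {key: key in stored for key in search_keys}
    # Cycle 2: perform only the writes whose word did not hit in cycle 1.
    to_write = [word for word in pending_writes if not hits.get(word, False)]
    return hits, to_write, stored | set(to_write)


hits, written, stored = two_cycle_step(
    stored={"C", "F"},
    search_keys=["A", "B", "C", "D", "E", "F"],
    pending_writes=["A", "B", "D", "E"],
)
print(written)  # ['A', 'B', 'D', 'E']
```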
- In embodiments of the present disclosure, an X-Y TCAM instance can be addressed by increasing the number of write cycles to two, one for the X portion and one for the Y portion of the TCAM. Further, the CAM can capture 3*N number of operations with the CAM enlarged (i.e., grown) to have 3*N columns and 2*M rows. In these embodiments, two cycles can perform the write operation. Also, the array can comprise a matrix of 2*M write rows by 2*N search columns to capture 2*(N+M) operations at a time.
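- To make the two-step X-Y write concrete, the sketch below stores each ternary bit as an (X, Y) pair and writes the X portion and the Y portion in separate steps. The specific encoding ("1" as (1, 0), "0" as (0, 1), "X" as (0, 0)) is one common convention and is an assumption here, as are the helper names; the disclosure does not state which encoding its X-Y TCAM uses.

```python
# Assumed encoding of a ternary bit as an (X, Y) pair; not taken from the disclosure.
ENCODING = {"1": (1, 0), "0": (0, 1), "X": (0, 0)}


def write_entry(table, address, pattern):
    """Write one ternary pattern in two steps and return the cycle count."""
    x_bits = [ENCODING[c][0] for c in pattern]
    y_bits = [ENCODING[c][1] for c in pattern]
    table[address] = {"x": x_bits, "y": [0] * len(pattern)}  # cycle 1: X portion
    table[address]["y"] = y_bits                             # cycle 2: Y portion
    return 2


def xy_match(entry, key):
    """A cell mismatches if the key bit is 1 and Y is set, or 0 and X is set."""
    for x, y, k in zip(entry["x"], entry["y"], key):
        if (k == "1" and y) or (k == "0" and x):
            return False
    return True


table = {}
cycles = write_entry(table, 0, "1X0")
print(cycles, xy_match(table[0], "110"), xy_match(table[0], "101"))  # 2 True False
```

Under this encoding a pattern of width w occupies 2*w stored bits, which is consistent with the 512×40 value TCAM being built from 512×80 X-Y TCAM instances.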
- FIG. 1 shows a CAM structure in accordance with aspects of the present disclosure. In embodiments, the CAM structure 100 may be a Ternary CAM (TCAM) which comprises data search words comprising 1s, 0s, and “X” or “don't care”, for one or more bits in the stored word. Alternatively, the CAM structure 100 of FIG. 1 may be a binary CAM which comprises data search words consisting entirely of 1s and 0s, for one or more bits in the stored word.
- As shown in FIG. 1, the CAM structure 100 includes an array or matrix of replicated CAM instances C1-C24 (e.g., C1, C2, . . . Cn). In embodiments, the array or matrix of CAM instances C1-C24 can be a matrix of 2*N by 2*M, where N and M represent replicated instances of the CAM instance C1. More specifically, as shown in FIG. 1, the CAM structure 100 can include a plurality of columns 100A, 100B, 100C, 100D, 100E, and 100F and a plurality of rows 200A, 200B, 200C, and 200D, comprising replicated CAM instances represented by C2-C24. That is, the plurality of rows and columns are replicated CAM instances of the CAM instance C1. The CAM instances C1-C24 are physical hardware circuits, and each CAM is 512 words × 40 bits, as an example.
- Although FIG. 1 shows a 4×6 matrix of 512×40 byte CAM instances, it should be understood that any matrix is contemplated by the present disclosure. The CAM structure 100 can be implemented as an N×M matrix of 512×40 value TCAM using 512×80 instances of X-Y TCAM, where a value TCAM requires one clock cycle to update a TCAM entry, and X-Y TCAM requires two clock cycles to update a TCAM entry. Also, although the CAM instance C1 is representative of a 512×40 byte CAM, embodiments of the present disclosure can include other CAM configurations, such as a 512×80 byte CAM, a 512×160 byte CAM (i.e., 2 entry/match line implemented with a 4×3 matrix), or any other CAM configurations.
- As shown in FIG. 1, among a set of searches, the search 350 includes six operations represented by “A”, “B”, “C”, “D”, “E”, and “F”. For the search 350, six operations are captured at a time. In embodiments of the present disclosure, “A”, “B”, “C”, “D”, “E”, and “F” may comprise any combination of a data word or multiple data words. Further, each of the operations “A”, “B”, “C”, “D”, “E”, and “F” can represent different combinations of a data word or multiple data words from the remaining operations.
- In operation, as represented in FIG. 1, search 350 will search whether “A”, “B”, “C”, “D”, “E”, and “F” are stored in the CAM structure 100. If the six operations (i.e., “A”, “B”, “C”, “D”, “E”, and “F”) are stored in the CAM structure 100, a hit will be returned and no write operation need be performed. Alternatively, if any of the six operations (i.e., “A”, “B”, “C”, “D”, “E”, and “F”) are not stored in the CAM structure 100, a miss will be returned, and a write operation will be performed as described herein. For example, the write operation can include writing “A”, “B”, “D”, and “E”, each of which would be duplicated across a respective row.
- In embodiments, each of the words A, B, C, D, E, and F is independent in nature: if A misses, but B, C, D, E, and F match, then A is the only one that will need to be updated. By way of another example, the processes disclosed herein can execute multiple searches in parallel in replicated CAM instances, in a first cycle, and execute multiple writes or reads in parallel in the replicated CAM instances, in a second cycle, independent of the search operations. Also, if the CAM write or search operations take multiple cycles, the CAM matrix can grow the rows or columns necessary to accommodate the parallelism.
- Further, in embodiments, the write operations (e.g., “D”, “E”, “F”) can be different from the search operations (e.g., “A”, “B”, “C”). In another example of the embodiments, the search operations can include “A”, “B”, “C”, “D”, “E”, and “F”, and the write operations can include “G”, “H”, “I”, and “J”. In other words, in the embodiments, there is no requirement to reuse the words of the search operations for write operations.
- FIG. 2 shows a first cycle of executing a search in the CAM structure in accordance with aspects of the present disclosure. In FIG. 2, the CAM structure 100 is searched using the search 350, e.g., each of the operations “A”, “B”, “C”, “D”, “E”, and “F” by respective columns 100A-100F. For example, in the CAM structure, column 100A is searched using the operation “A”, column 100B is searched using the operation “B”, etc.
- As shown representatively in FIG. 2, if one of the CAM instances of column 100A (e.g., C1, C7, C13, and C19) has data words which match the data words of operation “A”, then the data words of operation “A” do not need to be written on the next cycle. The same applies to each remaining column and its operation: column 100B (e.g., C2, C8, C14, and C20) for operation “B”, column 100C (e.g., C3, C9, C15, and C21) for operation “C”, column 100D (e.g., C4, C10, C16, and C22) for operation “D”, column 100E (e.g., C5, C11, C17, and C23) for operation “E”, and column 100F (e.g., C6, C12, C18, and C24) for operation “F”.
- Therefore, if one of the CAM instances of column 100A has data words which match the data words of operation “A”, one of the CAM instances of column 100B has data words which match the data words of operation “B”, one of the CAM instances of column 100C has data words which match the data words of operation “C”, one of the CAM instances of column 100D has data words which match the data words of operation “D”, one of the CAM instances of column 100E has data words which match the data words of operation “E”, or one of the CAM instances of column 100F has data words which match the data words of operation “F”, then a hit is returned and a write operation of “A”, “B”, “D”, and “E” need not be performed. On the other hand, if the data words of any of the operations “A”, “B”, “C”, “D”, “E”, and “F” are not matched across any of the CAM instances in a respective column (i.e., column 100A does not contain “A”, column 100B does not contain “B”, column 100C does not contain “C”, column 100D does not contain “D”, column 100E does not contain “E”, or column 100F does not contain “F”), then a miss is returned for the associated word, and a write operation for the respective words is initiated.
- At the same time as the CAM structure 100 is being searched with operations “A”, “B”, “C”, “D”, “E”, and “F”, a comparison can be performed between the four operations (i.e., search keys) that were supposed to be sequentially written (e.g., “A”, “B”, “D”, and “E”). Therefore, searching operations can be performed at the same time as a comparison between writing operations in order to maintain coherency. One of ordinary skill in the art would understand that all of these operations in FIG. 2 are performed during the first cycle. Also, one of ordinary skill in the art would understand that “A”, “B”, “C”, “D”, “E”, and “F” are illustrative, non-limiting examples of a search, and that other operations (e.g., searches 360 and 370) can also be performed within the context of the present disclosure. Also, although “A”, “B”, “C”, “D”, “E”, and “F” in search 350 have been described as operations, one of ordinary skill in the art would understand that each of “A”, “B”, “C”, “D”, “E”, and “F” can also be referred to as search keys.
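- The per-column search of the first cycle might be modeled as follows. Each CAM instance is represented by a Python set of stored words, and the results of the four instances in a column are combined with a logical OR; the set-based model, the pre-loaded words "C" and "F", and the names `make_matrix` and `search_columns` are assumptions for illustration only.

```python
ROWS, COLS = 4, 6  # the 4x6 example matrix of FIG. 1


def make_matrix():
    """matrix[row][col] is the set of words held by one CAM instance."""
    return [[set() for _ in range(COLS)] for _ in range(ROWS)]


def search_columns(matrix, keys):
    """Apply key i to column i and combine the results down that column."""
    results = {}
    for col, key in enumerate(keys):
        per_instance = [key in matrix[row][col] for row in range(len(matrix))]
        results[key] = any(per_instance)  # hit if any instance in the column matches
    return results


matrix = make_matrix()
for instance in matrix[0]:
    instance.add("C")  # assume "C" was written earlier across row 0
for instance in matrix[1]:
    instance.add("F")  # assume "F" was written earlier across row 1

results = search_columns(matrix, ["A", "B", "C", "D", "E", "F"])
misses = [key for key, hit in results.items() if not hit]
print(misses)  # ['A', 'B', 'D', 'E']
```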
- FIG. 3 shows a second cycle of executing a write operation in the CAM structure in accordance with aspects of the present disclosure. In FIG. 3, embodiments of the present disclosure assume that the data words of operations “A”, “B”, “C”, “D”, “E”, and “F” are not matched, a miss is returned, and write operations of “A”, “B”, “D”, and “E” will be performed. Specifically, and by way of illustrative, non-limiting example, in FIG. 3, the write operations of “A”, “B”, “D”, and “E” will be written in respective rows 200A, 200B, 200C, and 200D. A data word of the “A” operation and duplicate copies will be written in row 200A. Further, a data word of the “B” operation and duplicate copies will be written in row 200B. A data word of the “D” operation and duplicate copies will be written in row 200C, and a data word of the “E” operation and duplicate copies will be written in row 200D. Thus, each word would be duplicated across its respective row of N number of TCAMs. In this way, the write operations of “A”, “B”, “D”, and “E” in FIG. 3 allow evaluation to occur in parallel in the next set of searching. Also, one of ordinary skill in the art would understand that the write operations can occur in different memory areas of the CAM structure 100 than memory areas which are searched.
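The row-wise duplication described for FIG. 3 can be sketched as follows. This is an illustrative model only: each CAM instance is again a Python set, the helper names (`make_matrix`, `write_rows`, `column_contents`) are hypothetical, and the choice of six columns is an assumption taken from the six search keys of the example.

```python
from typing import List, Set

ROWS = ["200A", "200B", "200C", "200D"]  # one row per write, as in FIG. 3
NUM_COLUMNS = 6                           # assumed: one column per search key


def make_matrix(num_rows: int, num_columns: int) -> List[List[Set[str]]]:
    """Build an empty matrix; each cell stands in for one CAM instance."""
    return [[set() for _ in range(num_columns)] for _ in range(num_rows)]


def write_rows(matrix: List[List[Set[str]]], words: List[str]) -> None:
    """Write word i into every instance of row i (the duplicate copies)."""
    if len(words) > len(matrix):
        raise ValueError("more writes than rows")
    for row, word in zip(matrix, words):
        for instance in row:
            instance.add(word)


def column_contents(matrix: List[List[Set[str]]], column: int) -> Set[str]:
    """Union of the words held by all instances in one column."""
    return set().union(*(row[column] for row in matrix))


matrix = make_matrix(len(ROWS), NUM_COLUMNS)
write_rows(matrix, ["A", "B", "D", "E"])
```

Because every row places a copy of its word in every column, each column taken as a whole holds all four written words, which is what lets any later search key be evaluated in its own column in parallel.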
- FIG. 4 shows a CAM structure in accordance with additional aspects of the present disclosure. In embodiments of the present disclosure, the CAM structure 100′ of FIG. 4 may be a TCAM or, alternatively, a binary CAM. In any configuration, the CAM structure 100′ comprises a CAM instance C1′ which is replicated in the row and column directions. As shown in this configuration, the CAM instance C1′ can be replicated in a 2*M×2*N matrix, where M and N can be any integer value. In embodiments, for example, N is an integer value greater than or equal to an integer value of M. However, in a more general embodiment, where the writes are not conditional on a hit or miss from the search, there need be no relationship between N and M, i.e., it is possible to have 100 searches and 2 writes, or 2 searches and 100 writes.
- In embodiments shown representatively in FIG. 4, the CAM instance C1′ is a 512×40 byte CAM, although other CAM configurations are contemplated herein. For example, the CAM instance C1′ can be a 512×80 byte CAM, a 512×160 byte CAM (i.e., 2 entry/match line implemented with a 4×3 matrix), or any other configuration. In this way, multiple copies of the CAM instance C1′ are instantiated in physical hardware circuitry in a 2*M×2*N matrix.
- In the configuration of FIG. 4, the CAM structure 100′ executes N searches and M writes every cycle on, e.g., a 512×40 byte CAM. In FIG. 4, 2N operations are captured at a time. Searches 350′, 360′, and 370′ can be variously modified to include 2N operations. For example, if three searches are required per cycle (i.e., N=3), then six operations would be performed (e.g., “A”, “B”, “C”, “D”, “E”, and “F”). If any of the 2N operations is not stored in the CAM structure 100′, then a miss is returned, and 2M operations are written. For example, if two writes are required per cycle (i.e., M=2), then four write operations would be performed (e.g., “A”, “B”, “D”, and “E”). In FIG. 4, write operations and search operations are performed in a similar fashion as in FIGS. 2 and 3.
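The sizing arithmetic in the preceding paragraphs can be checked with a few lines of code. The sketch below is illustrative; the names `MatrixShape` and `matrix_shape` are hypothetical, and it assumes that rows correspond to writes and columns to searches, as in the 2*M×2*N matrix described above.

```python
from typing import NamedTuple


class MatrixShape(NamedTuple):
    write_rows: int      # 2*M rows, one per captured write
    search_columns: int  # 2*N columns, one per captured search key
    instances: int       # total replicated CAM instances


def matrix_shape(n_searches: int, m_writes: int) -> MatrixShape:
    """Shape of the replicated matrix for N searches and M writes per cycle."""
    if n_searches < 1 or m_writes < 1:
        raise ValueError("N and M must be positive integers")
    rows = 2 * m_writes
    columns = 2 * n_searches
    return MatrixShape(rows, columns, rows * columns)


# The example from the text: N=3 searches and M=2 writes per cycle.
shape = matrix_shape(3, 2)
```

For N=3 and M=2 this gives four rows and six columns, matching the four write operations and six search operations of the example, for a total of 24 CAM instances.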
- FIG. 5 shows a flowchart of steps using the CAM structure in accordance with aspects of the present disclosure. As should be understood by one of ordinary skill in the art, aspects of the present disclosure are described herein with reference to flowchart illustrations which can be representative of diagrams of methods and apparatus (systems).
- Referring to FIG. 5, at step 500, a 2*N×2*M array of CAMs is instantiated in physical hardware circuitry to implement N-search/M-write per cycle CAM arrays. Then, at step 510, the CAM arrays are set up such that the CAM arrays capture two cycles' worth of search and write operations at one time. At step 520, each search key that was captured in step 510 is applied to a unique column of the CAM array in parallel with step 510, and then the search results are combined across a CAM column. At step 530, each write is applied to a unique row of the CAM arrays in parallel and then the data is written across all the CAMs in that row. At step 540, the process is completed.
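The flow of FIG. 5 can be walked through in a short behavioral model. This is a sketch under several assumptions rather than the disclosed hardware: the class name `CamMatrix` and its methods are hypothetical, each CAM instance is a Python set with exact matching only, and the writes are applied unconditionally instead of being gated on a miss.

```python
from typing import List, Set


class CamMatrix:
    """Behavioral stand-in for the replicated CAM array (step 500)."""

    def __init__(self, n_search: int, m_write: int) -> None:
        self.rows = 2 * m_write    # one row per captured write
        self.cols = 2 * n_search   # one column per captured search key
        self.cells: List[List[Set[str]]] = [
            [set() for _ in range(self.cols)] for _ in range(self.rows)
        ]

    def search(self, keys: List[str]) -> List[bool]:
        """Step 520: key i goes to column i; results are ORed down the column."""
        if len(keys) > self.cols:
            raise ValueError("more search keys than columns")
        return [
            any(key in self.cells[r][c] for r in range(self.rows))
            for c, key in enumerate(keys)
        ]

    def write(self, words: List[str]) -> None:
        """Step 530: word i goes to row i and is copied to every CAM in it."""
        if len(words) > self.rows:
            raise ValueError("more writes than rows")
        for r, word in enumerate(words):
            for c in range(self.cols):
                self.cells[r][c].add(word)

    def run_cycle(self, keys: List[str], words: List[str]) -> List[bool]:
        """Steps 510 to 540: capture the operations, search, then write."""
        hits = self.search(keys)
        self.write(words)
        return hits


cam = CamMatrix(n_search=3, m_write=2)
first = cam.run_cycle(list("ABCDEF"), list("ABDE"))  # empty array: all misses
second = cam.run_cycle(list("ABCDEF"), [])           # written words now hit
```

In the second pass the keys “A”, “B”, “D”, and “E” hit in their respective columns because each was duplicated across a full row, while “C” and “F” still miss.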
- FIG. 6 shows another flowchart of steps using the CAM structure in accordance with aspects of the present disclosure. As should be understood by one of ordinary skill in the art, aspects of the present disclosure are described herein with reference to flowchart illustrations which can be representative of diagrams of methods and apparatus (systems).
- Referring to FIG. 6, at step 600, a 2*N×2*M array of CAMs is instantiated in physical hardware to implement N-search/M-write per cycle CAM arrays. Then, at step 610, the CAM arrays are set up such that the CAM arrays capture two cycles' worth of search and write operations at one time. At step 620, each write data can be applied to a unique row of the CAM arrays in parallel and then the data is written across all the CAMs in that row. At step 630, each search key that was captured in step 610 is applied to a unique column of the CAM array, and then the search results are combined across a CAM column. At step 640, the process is completed.
- The searchable CAM structure of the present disclosure can be manufactured in a number of ways using a number of different tools. In general, though, the methodologies and tools are used to form structures with dimensions in the micrometer and nanometer scale. The methodologies, i.e., technologies, employed to manufacture the searchable CAM structure of the present disclosure have been adopted from integrated circuit (IC) technology. For example, the structures are built on wafers and are realized in films of material patterned by photolithographic processes on the top of a wafer. In particular, the fabrication of the searchable CAM structure uses three basic building blocks: (i) deposition of thin films of material on a substrate, (ii) applying a patterned mask on top of the films by photolithographic imaging, and (iii) etching the films selectively to the mask.
- The method(s) as described above is used in the fabrication of integrated circuit chips. The resulting integrated circuit chips can be distributed by the fabricator in raw wafer form (that is, as a single wafer that has multiple unpackaged chips), as a bare die, or in a packaged form. In the latter case the chip is mounted in a single chip package (such as a plastic carrier, with leads that are affixed to a motherboard or other higher level carrier) or in a multichip package (such as a ceramic carrier that has either or both surface interconnections or buried interconnections). In any case the chip is then integrated with other chips, discrete circuit elements, and/or other signal processing devices as part of either (a) an intermediate product, such as a motherboard, or (b) an end product. The end product can be any product that includes integrated circuit chips, ranging from toys and other low-end applications to advanced computer products having a display, a keyboard or other input device, and a central processor.
- The descriptions of the various embodiments of the present disclosure have been presented for purposes of illustration, but are not intended to be exhaustive or limited to the embodiments disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments. The terminology used herein was chosen to best explain the principles of the embodiments, the practical application or technical improvement over technologies found in the marketplace, or to enable others of ordinary skill in the art to understand the embodiments disclosed herein.
Claims (20)
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15/185,956 US9859006B1 (en) | 2016-06-17 | 2016-06-17 | Algorithmic N search/M write ternary content addressable memory (TCAM) |
TW105126730A TWI636458B (en) | 2016-06-17 | 2016-08-22 | Algorithmic n search/m write ternary content addressable memory (tcam) |
DE102016220328.2A DE102016220328A1 (en) | 2016-06-17 | 2016-10-18 | Algorithmic ternary associative memory (TCAM) for N search / M write operations |
CN201710457931.8A CN107527653B (en) | 2016-06-17 | 2017-06-16 | Ternary Content Addressable Memory (TCAM) for N search/M write |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15/185,956 US9859006B1 (en) | 2016-06-17 | 2016-06-17 | Algorithmic N search/M write ternary content addressable memory (TCAM) |
Publications (2)
Publication Number | Publication Date |
---|---|
US20170365341A1 (en) | 2017-12-21 |
US9859006B1 (en) | 2018-01-02 |
Family ID: 60481159
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15/185,956 Expired - Fee Related US9859006B1 (en) | 2016-06-17 | 2016-06-17 | Algorithmic N search/M write ternary content addressable memory (TCAM) |
Country Status (4)
Country | Link |
---|---|
US (1) | US9859006B1 (en) |
CN (1) | CN107527653B (en) |
DE (1) | DE102016220328A1 (en) |
TW (1) | TWI636458B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112994886A (en) * | 2021-05-21 | 2021-06-18 | 芯启源(南京)半导体科技有限公司 | Hardware for generating TCAM search keywords and implementation method |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11133065B1 (en) * | 2020-03-06 | 2021-09-28 | Micron Technology, Inc. | Architecture for ternary content-addressable memory search |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6169685B1 (en) * | 1999-05-17 | 2001-01-02 | Cselt-Centro Studi E Laboratori Telecomuicazioni S.P.A. | Content addressable memories |
US6711665B1 (en) * | 1993-12-12 | 2004-03-23 | Neomagic Israel Ltd. | Associative processor |
US7304873B1 (en) * | 2005-01-25 | 2007-12-04 | Netlogic Microsystems, Inc. | Method for on-the-fly error correction in a content addressable memory (CAM) and device therefor |
US8908407B1 (en) * | 2011-07-30 | 2014-12-09 | Rambus Inc. | Content addressable memory (“CAM”) |
Family Cites Families (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB806487A (en) * | 1956-04-06 | 1958-12-23 | British Tabulating Mach Co Ltd | Improvements in or relating to electrical data storage apparatus |
JP2775549B2 (en) * | 1992-05-08 | 1998-07-16 | 三菱電機株式会社 | Associative memory cell and associative memory circuit |
US6493791B1 (en) * | 1999-12-20 | 2002-12-10 | Intel Corporation | Prioritized content addressable memory |
US6259620B1 (en) * | 2000-03-08 | 2001-07-10 | Telefonaktiebolaget Lm Ericsson (Publ) | Multiple entry matching in a content addressable memory |
CA2307240C (en) * | 2000-05-01 | 2011-04-12 | Mosaid Technologies Incorporated | Matchline sense circuit and method |
JP5072145B2 (en) * | 2001-04-05 | 2012-11-14 | 富士通セミコンダクター株式会社 | Associative memory |
US7210003B2 (en) * | 2001-10-31 | 2007-04-24 | Netlogic Microsystems, Inc. | Comparand generation in a content addressable memory |
JP3808753B2 (en) * | 2001-10-31 | 2006-08-16 | 川崎マイクロエレクトロニクス株式会社 | Associative memory device |
US7092270B2 (en) * | 2004-06-30 | 2006-08-15 | International Business Machines Corporation | Apparatus and method for detecting multiple hits in CAM arrays |
US7345897B2 (en) * | 2005-10-18 | 2008-03-18 | Cisco Technology, Inc. | Error protected ternary content-addressable memories and lookup operations performed thereon |
US8120937B2 (en) * | 2009-03-06 | 2012-02-21 | International Business Machines Corporation | Ternary content addressable memory using phase change devices |
CN101546291A (en) * | 2009-05-12 | 2009-09-30 | 华为技术有限公司 | Access method and device for increasing robustness of memory data |
KR101115637B1 (en) * | 2009-06-30 | 2012-03-05 | 주식회사 하이닉스반도체 | Non-volatile memory device and operation method thereof |
KR101274841B1 (en) * | 2009-09-18 | 2013-06-13 | 가부시끼가이샤 도시바 | Semiconductor content addressable memory device |
US9882809B2 (en) * | 2009-11-24 | 2018-01-30 | Verizon Patent And Licensing Inc. | Just-in time forwarding information base |
US9001828B2 (en) * | 2011-03-21 | 2015-04-07 | Marvell World Trade Ltd. | Method and apparatus for pre-classifying packets |
US9177646B2 (en) * | 2013-05-06 | 2015-11-03 | International Business Machines Corporation | Implementing computational memory from content-addressable memory |
US20140369103A1 (en) * | 2013-06-14 | 2014-12-18 | Mediatek Inc. | Content addressable memory cells and ternary content addressable memory cells |
US9255962B2 (en) * | 2013-08-15 | 2016-02-09 | GlobalFoundries, Inc. | Determining intra-die variation of an integrated circuit |
US9183933B2 (en) * | 2014-01-10 | 2015-11-10 | Taiwan Semiconductor Manufacturing Company, Ltd. | Memory cell |
CN104200838B (en) * | 2014-08-28 | 2016-08-24 | 平湖凌云信息科技有限公司 | content addressable memory and similarity intelligent matching method |
2016
- 2016-06-17 US US15/185,956 patent/US9859006B1/en not_active Expired - Fee Related
- 2016-08-22 TW TW105126730A patent/TWI636458B/en not_active IP Right Cessation
- 2016-10-18 DE DE102016220328.2A patent/DE102016220328A1/en not_active Withdrawn
2017
- 2017-06-16 CN CN201710457931.8A patent/CN107527653B/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
DE102016220328A1 (en) | 2017-12-21 |
TWI636458B (en) | 2018-09-21 |
CN107527653B (en) | 2021-03-12 |
US9859006B1 (en) | 2018-01-02 |
CN107527653A (en) | 2017-12-29 |
TW201810272A (en) | 2018-03-16 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Durbeck et al. | The Cell Matrix: an architecture for nanocomputing | |
US8238173B2 (en) | Using storage cells to perform computation | |
US5184325A (en) | Dynamic associative memory with logic-in-refresh | |
US20090254697A1 (en) | Memory with embedded associative section for computations | |
US8004868B2 (en) | Method and apparatus for performing variable word width searches in a content addressable memory | |
US20120140540A1 (en) | Charge sharing in a tcam array | |
Garzón et al. | Am 4: Mram crossbar based cam/tcam/acam/ap for in-memory computing | |
US9859006B1 (en) | Algorithmic N search/M write ternary content addressable memory (TCAM) | |
US20090141529A1 (en) | Design structure for implementing matrix-based search capability in content addressable memory devices | |
US20190378572A1 (en) | Array organization and architecture to perform range-match operations with content addressable memory (cam) circuits | |
JP2003256265A (en) | Search memory, controller for memory search, and memory search method | |
Imani et al. | CAP: Configurable resistive associative processor for near-data computing | |
USRE42684E1 (en) | Word search in content addressable memory | |
US20190251127A1 (en) | Methods and apparatuses for searching data stored in a memory array using a replicated data pattern | |
US3890603A (en) | Associative store | |
Mattausch et al. | Fully-parallel pattern-matching engine with dynamic adaptability to Hamming or Manhattan distance | |
US9384824B2 (en) | List sort static random access memory | |
RU2762781C1 (en) | Matrix device for parallel search of occurrences and data processing | |
Karunakar et al. | Implementation of LFSR based Fast Error-Resilient Ternary Content-Addressable Memory | |
JPH03212896A (en) | Content-addressable memory | |
CN107342102B (en) | A kind of MRAM chip with search function and search method | |
Parhami | Extreme-value search and general selection algorithms for fully parallel associative memories | |
Kaftannikov et al. | Search for Substrings in Associative Decision Fields | |
Liu et al. | MemSort: In-Memory Sorting Architecture | |
Bird et al. | An Associative Parallel Processor With Application to Picture Processing | |
Legal Events
Code | Title | Description |
---|---|---|
AS | Assignment | Owner name: GLOBALFOUNDRIES INC., CAYMAN ISLANDS. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNORS: ARSOVSKI, IGOR; FRAGANO, MICHAEL T.; HOULE, ROBERT; REEL/FRAME: 038946/0766. Effective date: 20160617 |
STCF | Information on status: patent grant | Free format text: PATENTED CASE |
AS | Assignment | Owner name: WILMINGTON TRUST, NATIONAL ASSOCIATION, DELAWARE. Free format text: SECURITY AGREEMENT; ASSIGNOR: GLOBALFOUNDRIES INC.; REEL/FRAME: 049490/0001. Effective date: 20181127 |
AS | Assignment | Owner name: GLOBALFOUNDRIES U.S. INC., CALIFORNIA. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: GLOBALFOUNDRIES INC.; REEL/FRAME: 050122/0001. Effective date: 20190821 |
AS | Assignment | Owner name: MARVELL INTERNATIONAL LTD., BERMUDA. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: GLOBALFOUNDRIES U.S. INC.; REEL/FRAME: 051070/0625. Effective date: 20191105 |
AS | Assignment | Owner name: CAVIUM INTERNATIONAL, CAYMAN ISLANDS. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: MARVELL INTERNATIONAL LTD.; REEL/FRAME: 052918/0001. Effective date: 20191231 |
AS | Assignment | Owner name: MARVELL ASIA PTE, LTD., SINGAPORE. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: CAVIUM INTERNATIONAL; REEL/FRAME: 053475/0001. Effective date: 20191231 |
AS | Assignment | Owner name: GLOBALFOUNDRIES INC., CAYMAN ISLANDS. Free format text: RELEASE BY SECURED PARTY; ASSIGNOR: WILMINGTON TRUST, NATIONAL ASSOCIATION; REEL/FRAME: 054636/0001. Effective date: 20201117 |
FEPP | Fee payment procedure | Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
LAPS | Lapse for failure to pay maintenance fees | Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
STCH | Information on status: patent discontinuation | Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |
FP | Lapsed due to failure to pay maintenance fee | Effective date: 20220102 |