WO1992003779A1 - Procede assurant la communication efficace entre des coprocesseurs - Google Patents
Procede assurant la communication efficace entre des coprocesseurs Download PDFInfo
- Publication number
- WO1992003779A1 WO1992003779A1 PCT/US1991/004070 US9104070W WO9203779A1 WO 1992003779 A1 WO1992003779 A1 WO 1992003779A1 US 9104070 W US9104070 W US 9104070W WO 9203779 A1 WO9203779 A1 WO 9203779A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- queue
- wake
- processors
- data storage
- accordance
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 53
- 238000004891 communication Methods 0.000 title description 2
- 230000008569 process Effects 0.000 claims abstract description 32
- 230000000694 effects Effects 0.000 claims description 5
- 230000004044 response Effects 0.000 claims description 4
- 230000002618 waking effect Effects 0.000 claims description 4
- 238000013500 data storage Methods 0.000 claims 7
- 230000002596 correlated effect Effects 0.000 claims 1
- 238000012544 monitoring process Methods 0.000 claims 1
- 230000007246 mechanism Effects 0.000 abstract description 12
- 238000012545 processing Methods 0.000 abstract description 9
- 230000006870 function Effects 0.000 description 10
- 230000002093 peripheral effect Effects 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 238000007792 addition Methods 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/52—Program synchronisation; Mutual exclusion, e.g. by means of semaphores
Definitions
- the present invention relates generally to multiprocessor computer systems and specifically to allocating processors in a tightly-coupled configuration to execute the threads of one or more multithreaded programs that are running on the system simultaneously.
- the present invention involves the way the wake queue in the preferred embodiment of the computer system operates for coprocessors of unequal speed versus how an interrupt handler operates.
- the wake queue consists of a data structure used as a repository between communicating processors and the locking scheme implemented in the global registers which coordinates access to that data structure. Instead of interrupts, the computer system can use the wake queue to alert the consumer (processor) of a request from a producer (i.e., disk drive).
- a producer i.e., disk drive
- Prior art has typically involved interrupt handlers in computer systems, such as is done in conjunction with the UNIX operating system.
- the kernel is responsible for handling interrupts, whether they result from hardware (such as from the clock or from peripheral devices), from a programmed interrupt (execution of instructions designed to cause "software interrupts"), or from exceptions (such as page faults). If the CPU is executing at a lower processor execution level than the level of the interrupt, it accepts the interrupt before decoding the next instruction and raises the processor execution level, so that no other interrupts of that level (or lower) can happen while it handles the current interrupt, preserving the integrity of kernel data structures.
- the kernel handles the interrupt with the following sequence of operations:
- the system determines the "source” or cause of the interrupt, identifying the type of interrupt (such as clock or disk) and the unit number of the interrupt, if applicable (such as, which dis drive caused the interrupt) .
- the system receives an interrupt, it gets a number from the machine that it uses as an offset into a table, commonly called a interrupt vector.
- interrupt vectors vary from machine to machine, but they usually contain the address of the interrupt handler for the corresponding interrupt source and a way of finding a parameter for the interrupt handler. For example, consider the table of interrupt handlers in Figure 1. If a terminal interrupts the system, the kernel gets interrupt number 2 from the hardware and invokes the terminal interrupt handler ttyintr. The kernel invokes the interrupt handler.
- the kernel stack for the new context layer is logi ⁇ cally distinct from the kernel stack of the previous context layer.
- the interrupt handler completes its work and returns.
- the kernel executes a machine-specific sequence of instructions that restores the register context and kernel stack of the previous context layer as they existed at the time of the interrupt and then resumes execution of the restored context layer.
- the behavior of the process may be affected by the interrupt handler, since the interrupt handler may have altered global kernel data structures and awakened sleeping processes. Usually, however, the process continues execution as if the interrupt had never happened. (Refer to "The Design of the UNIX Operating System" by Maurice J. Bach, pages 162-163, Prentice-Hall, 1986.) SUMMARY OF THE INVENTION
- the wake queue In the present invention, items are place on a queue, referred to as the wake queue.
- An objective of the inven ⁇ tion is to provide a method of placing the items on the wake queue by a slower processor and removing them by a faster processor without having the faster processor wait for the slower processor.
- the wake queue invention also provides a method whereby processors of differing capabilities can efficiently communicate. This allows realization of the objective for the present invention of distributing the operating system efficiently.
- Slower coprocessors can queue entries in wake queues. Access to these wake queues are governed by a queuing mechanism which utilizes the global registers to synchronize multiple concurrent producers and multiple concurrent consumers. (Refer to the related patent GLOBAL REGISTERS FOR A MULTIPROCESSOR SYSTEM. )
- Counting semaphores are software mechanisms for synchronization.
- the semaphore consists of a count of the available resources to be managed and a list associated with entities waiting for that resource. To implement a lock, this count is set to one so that only one resource, the lock, exists. If the semaphore is going to govern multiple resources it is set to the number of resources available. This count is determined as a resourc is taken. When the semaphore count goes to zero, no more resources are available so the requester is put to sleep to wait for one to become available. As a process frees a resource, it increments the semaphore counter and wakes up a waiting process.
- Another implementation has the slower coprocessor queuing command blocks onto the wake queue which a kernel procedure in a fast processor dequeues and queues onto a work queue before waking the appropriate kernel daemon to process that work queue.
- this daemon runs, it consults the work queue to obtain and process the information communicated from the slower processor. In this way, the objective of allowing fast processors to do post interrupt processing without being interrupted is achieved.
- next service number (modulo number of chairs, initialized to zero)
- goes to the location checks to see if the previous worker has left, sits down, picks up the work ⁇ order, and as he leaves, increments the requestor seats available.
- This circular queue is basically a clock distri ⁇ bution mechanism that avoids convoying around a master queue lock and is still FIFO in response. An individual requestor/worker can still find themselves waiting on each other if one or the other is slow (or not operating) .
- the wake queue with the associated integrated dispatcher and the ability to create alternative processing entities is implemented in a tightly coupled, multithreaded, multiprocessing environ ⁇ ment, it is not limited to this implementation.
- the wake queue method of the present invention can be implemented in other functions and utilities that require efficient processor scheduling.
- Figure 1 is a table showing prior art interrupt handlers.
- Figure 2 is a schematic diagram showing the wake queue in the integrated dispatcher environment.
- Figure 3 is a pictorial representation of an excerpt showing the format of wake queue global registers.
- Figure 4 is an illustration of the interaction between global registers and certain portions of memory.
- Figure 5 shows an excerpt from the header file that descri the format of the wake queue-related global registers.
- the wake queue mechanism is a way for coprocessors of unequal speed to efficiently communicate in a multiprocess operating system.
- Slow speed producers can produce entries for high speed consumers without interrupting the consumer.
- the consumers in the present invention are typically processors in the preferred embodiment.
- Typical producers are anything attached to the input/output controller (IOC), i.e., a disk drive, a frame buffer, etc.; although the computer system processors can also be producers.
- IOC input/output controller
- Figure 2 shows the functions and data structures in the preferred embodiment of the wake queue.
- the producer communicates with the consumer by queuing an entry in the queue entry funciton 80 on a wake queue data structure 70.
- the entry is ultimately consumed by the consuming processor when it runs the integrated dispatcher function 1112, or any other kernel procedure assigned to process wake queue entries, which dequeues the entry via consume e .ry function 30.
- the related patent application entitled SCHEDULING MECHANISM FOR A MULTIPRO ⁇ CESSING OPERATING SYSTEM contains information on the integrated dispatcher. Using the integrated dispatcher to process wake queue entries is in agreement with the preferred embodiment of the present invention? however, other means are available. That is, any kernel procedure can be enhanced with the intelligence needed to process wake queue entries.
- the processing done on the entry can vary widely, depending upon the intelligence added in the consume entry function 30. For example, if the wake queue's entries consist of semaphore addresses, the consume entry function can wake a process waiting on the semaphore address in that entry, thereby queuing that process on to the run queue data structure 10. This process is then eligible to be run. This allows coprocessors to schedule tasks they want fast processors to execute. If this wake queue's entries consist of command blocks, the consume entry function 30 could queue the command block onto a work queue data structure 90 and wake a waiting iproc, queuing it on the run queue data structure 10.
- An iproc is a minimal context process entity capable of performing all system-side tasks associated with kernel functions of a kernel based operatin system. It is described in greater detail in the commonly- assigned and copending patent application entitled METHOD OF IMPLEMENTING KERNEL FUNCTIONS USING MINIMAL-CONTEXT PROCESSES.
- This iproc is now eligible to run. When it does, it consults the appropriate work queue to obtain the informa ⁇ tion communicated from the coprocessor and processes it accordingly. This allows fast processors to do what is referred to in standard UNIX as "post interrupt processing" without having to interrupt the fast processor. It is important to note her that once the iproc runs, it will process all of its work queue entries, not just one. That is, multiple entries may have been queued onto the wake queue by coprocessors and then transferred to the work queue before this iproc was chosen to run. This allows the consumer (the iproc) to process the output of multiple producers [the coprocessor(s) ] much more efficiently. It should be noted that what is placed on the wake queue and what the consumer does with it upon removal is irrelevant to the present invention.
- the invention is an apparatus for placing and removing items only.
- the wake queue can be used to implement a coprocessor scheduling mechanism with limited interrupts. That is, interrupts are now limited to handle the case when the wak queue fills up.
- the wake queue is circular and of fixed size, so when it fills, an interrupt is generated such tha the integrated dispatcher can run and empty out the wake queue. This situation where the wake queue fill up could occur when he processors would otherwise be flooded by requests, as In the case of multiple striped disk requests occurring at the same time.
- the overall idea behind the wake queue in accordance with this invention is to avoid having slower coprocessors (e.g., disk controller) interrupting fast processors, because doing so slows down the fast processors.
- the wake queue allows this to happen by letting coprocessors queue entries in a certain section of memory, called the wake queue. Access to this section of memory is governed by a locking mechanism implemented in a set of global registers. The interface functions between memory areas and these global registers is illustrated in Figure 4. Both th the processors and the coprocessors must adhere to this locking mechanism when queueing or dequeuing 'entries' from the wake queue.
- Figure 5 shows an excerpt from the header file that describ the format of the wake queue-related global registers, with their format described pictorially in Figure 3.
- the locking mechanism implemented in these global register is used to govern access to the memory being used a a wake queue.
- This mechanism consists of having 'producers' gain access to queue an entry on the wake queue by decrementing the "available” field and incrementing the "requests” field, and conversely, having 'consumers' gain access to dequeue an entry from the wake queue by decrementing the "requests” filed and incrementing the "available” field.
- a producer calls queue_entry which performs the following:
- FCA fetch and Conditionally Add
- this operation is the FAA (Fetch and Add) instruction.
- FAA adds a value to the contents of the register and returns the result to that register.
- FCA is used to decrement the outstanding requests counter.
- the FAA is used to increment the out_ord field.
- This scheme allows coprocessors to communicate with faster processors without interrupting the faster processors.
- Some kernel routine must be embellished with the intelli ⁇ gence to handle each wake queue.
- this routine is the integrated dispatcher. When a fast processo executes whatever routine has been specified to handle the wake queue, it will dequeue entries and do whatever pro ⁇ cessing is appropriate for that entry.
- the information communicated between the coprocessor and the processor is completely implementation dependent as long as the coprocessor routine which is queuing entries and the kernel routine which is dequeuing entries mutually understands the contents of the entry and the processing that is appropriate for that entry.
- Each wake queue could have entirely unique entry formats.
- Each wake queue would depend upon setting up a set of global registers to govern access to it and enhancing some kernel routine with the intelligence to process its entries.
- Each entry on a specific wake queue will be of the same format/size, but entries from different wake queues need have nothing in common. For example, one coprocessor may queue eight byte entries on a certain wake queue, while another coprocessor could be queuing twenty-four byte entries onto a different wake queue.
- a command block is set up to detail specific processing. It is often associated with 10.
- the operating system can pass command blocks to a disk controller to inform the controller where data should be read/written from/to, the amount of data to transfer, etc.
- Command blocks can be queued on a wake queue to detail peripheral activity. That is, a fast processor could be the producer queuing an entry on a wake queue and the controller would then be a consumer dequeuing entries to decipher details of disk"trans ers. Note”that this implies that a coproces ⁇ r routine rather than a kernel routine would be enhanceJ with the intelligence to process queue entries. Upon completion of the disk activity, the roles could be reversed. The controller would then be the producer queuing an entry on a different wake queue and the fast processor would then be a consumer dequeuing the entry to determine which process should be woken up as a result of the completed disk activity. (This is the example mentioned above of the coprocessor queuing an address on the wake queue and the integrated dispatcher waking up the process sleeping on that address.)
- a processor or coprocessor calls the queue_entry routine whenever it has information it wants communicated via the wake queue mechanism. Although not limited to this use, it was designed with the intent of allowing slower coprocessors to communicate with fast processors without interrupts.
- the co ⁇ processor queues an address on the wake queue which the integrated dispatcher dequeues.
- the integrated dispatcher knows that this address is the address of a semaphore upon which a process has been sleeping on while waiting for the coprocessor to finish this task.
- the integrated dispatcher performs a wake operation on that semaphore which dequeues the process on the run queue so that it can be chosen for execution.
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Multi Processors (AREA)
Abstract
L'invention (70) concernant la file d'attente de réveil constitue un procédé grâce auquel des processeurs ayant des fonctions différentes peuvent communiquer de manière efficace. Ceci permet d'obtenir une meilleure efficacité de la répartition du système d'exploitation. Des coprocesseurs plus lents peuvent mettre en files d'attente des entrées (80), dans des zones appelées files d'attente de réveil. Les accès à ces files d'attente de réveil sont gérés par un mécanisme (90) de mise en file d'attente qui utilise les registres communs pour synchroniser la pluralité de producteurs (80) concurrents et la pluralité de consommateurs (30) concurrents. Lorsqu'un processeur rapide exécute le processus noyau affecté à la gestion de la file d'attente de réveil (70), les entrées (80) sont extraites de la file d'attente de réveil et les informations se trouvant dans une entrée sont traitées. Les mises en application peuvent varier grandement en fonction des informations et du traitement recherché.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US07/571,951 US5202988A (en) | 1990-06-11 | 1990-08-23 | System for communicating among processors having different speeds |
US571,951 | 1990-08-23 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO1992003779A1 true WO1992003779A1 (fr) | 1992-03-05 |
Family
ID=24285730
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1991/004070 WO1992003779A1 (fr) | 1990-08-23 | 1991-06-10 | Procede assurant la communication efficace entre des coprocesseurs |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO1992003779A1 (fr) |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0954796A1 (fr) * | 1996-03-19 | 1999-11-10 | Bull HN Information Systems Inc. | Procedure pour detecter et disperser des convois associes a des serveurs a portes a files d'attente |
WO2020097177A1 (fr) * | 2018-11-06 | 2020-05-14 | Dover Microsystems, Inc. | Systèmes et procédés pour suspendre le fonctionnement d'un processeur hôte |
US11748457B2 (en) | 2018-02-02 | 2023-09-05 | Dover Microsystems, Inc. | Systems and methods for policy linking and/or loading for secure initialization |
US11797398B2 (en) | 2018-04-30 | 2023-10-24 | Dover Microsystems, Inc. | Systems and methods for checking safety properties |
US11841956B2 (en) | 2018-12-18 | 2023-12-12 | Dover Microsystems, Inc. | Systems and methods for data lifecycle protection |
US12079197B2 (en) | 2019-10-18 | 2024-09-03 | Dover Microsystems, Inc. | Systems and methods for updating metadata |
US12124566B2 (en) | 2018-11-12 | 2024-10-22 | Dover Microsystems, Inc. | Systems and methods for metadata encoding |
US12124576B2 (en) | 2020-12-23 | 2024-10-22 | Dover Microsystems, Inc. | Systems and methods for policy violation processing |
US12248564B2 (en) | 2018-02-02 | 2025-03-11 | Dover Microsystems, Inc. | Systems and methods for transforming instructions for metadata processing |
US12253944B2 (en) | 2020-03-03 | 2025-03-18 | Dover Microsystems, Inc. | Systems and methods for caching metadata |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4644461A (en) * | 1983-04-29 | 1987-02-17 | The Regents Of The University Of California | Dynamic activity-creating data-driven computer architecture |
US4807111A (en) * | 1987-06-19 | 1989-02-21 | International Business Machines Corporation | Dynamic queueing method |
US4943913A (en) * | 1988-02-10 | 1990-07-24 | International Business Machines Corporation | Operating system accessing control blocks by using home address space segment table to control instruction and operand fetch and store operations |
US5031089A (en) * | 1988-12-30 | 1991-07-09 | United States Of America As Represented By The Administrator, National Aeronautics And Space Administration | Dynamic resource allocation scheme for distributed heterogeneous computer systems |
-
1991
- 1991-06-10 WO PCT/US1991/004070 patent/WO1992003779A1/fr unknown
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4644461A (en) * | 1983-04-29 | 1987-02-17 | The Regents Of The University Of California | Dynamic activity-creating data-driven computer architecture |
US4807111A (en) * | 1987-06-19 | 1989-02-21 | International Business Machines Corporation | Dynamic queueing method |
US4943913A (en) * | 1988-02-10 | 1990-07-24 | International Business Machines Corporation | Operating system accessing control blocks by using home address space segment table to control instruction and operand fetch and store operations |
US5031089A (en) * | 1988-12-30 | 1991-07-09 | United States Of America As Represented By The Administrator, National Aeronautics And Space Administration | Dynamic resource allocation scheme for distributed heterogeneous computer systems |
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0954796A4 (fr) * | 1996-03-19 | 2006-10-04 | Bull Hn Information Syst | Procedure pour detecter et disperser des convois associes a des serveurs a portes a files d'attente |
EP0954796A1 (fr) * | 1996-03-19 | 1999-11-10 | Bull HN Information Systems Inc. | Procedure pour detecter et disperser des convois associes a des serveurs a portes a files d'attente |
US12242575B2 (en) | 2018-02-02 | 2025-03-04 | Dover Microsystems, Inc. | Systems and methods for policy linking and/or loading for secure initialization |
US11748457B2 (en) | 2018-02-02 | 2023-09-05 | Dover Microsystems, Inc. | Systems and methods for policy linking and/or loading for secure initialization |
US11977613B2 (en) | 2018-02-02 | 2024-05-07 | Dover Microsystems, Inc. | System and method for translating mapping policy into code |
US12248564B2 (en) | 2018-02-02 | 2025-03-11 | Dover Microsystems, Inc. | Systems and methods for transforming instructions for metadata processing |
US11797398B2 (en) | 2018-04-30 | 2023-10-24 | Dover Microsystems, Inc. | Systems and methods for checking safety properties |
WO2020097177A1 (fr) * | 2018-11-06 | 2020-05-14 | Dover Microsystems, Inc. | Systèmes et procédés pour suspendre le fonctionnement d'un processeur hôte |
WO2020097179A1 (fr) * | 2018-11-06 | 2020-05-14 | Dover Microsystems, Inc. | Systèmes et procédés pour suspendre le fonctionnement d'un processeur hôte |
US11875180B2 (en) | 2018-11-06 | 2024-01-16 | Dover Microsystems, Inc. | Systems and methods for stalling host processor |
US12124566B2 (en) | 2018-11-12 | 2024-10-22 | Dover Microsystems, Inc. | Systems and methods for metadata encoding |
US11841956B2 (en) | 2018-12-18 | 2023-12-12 | Dover Microsystems, Inc. | Systems and methods for data lifecycle protection |
US12079197B2 (en) | 2019-10-18 | 2024-09-03 | Dover Microsystems, Inc. | Systems and methods for updating metadata |
US12253944B2 (en) | 2020-03-03 | 2025-03-18 | Dover Microsystems, Inc. | Systems and methods for caching metadata |
US12124576B2 (en) | 2020-12-23 | 2024-10-22 | Dover Microsystems, Inc. | Systems and methods for policy violation processing |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5202988A (en) | System for communicating among processors having different speeds | |
US5452452A (en) | System having integrated dispatcher for self scheduling processors to execute multiple types of processes | |
US12131186B2 (en) | Hardware accelerated dynamic work creation on a graphics processing unit | |
US5390329A (en) | Responding to service requests using minimal system-side context in a multiprocessor environment | |
US6006247A (en) | Method and system for scheduling threads and handling exceptions within a multiprocessor data processing system | |
US7137116B2 (en) | Method and system for performing a task on a computer | |
Anderson et al. | Scheduler activations: Effective kernel support for the user-level management of parallelism | |
EP1839146B1 (fr) | Mecanisme pour la programmation d'unites d'execution sur des sequenceurs mis sous sequestre par systeme d'exploitation sous sans intervention de systeme d'exploitation | |
US6732138B1 (en) | Method and system for accessing system resources of a data processing system utilizing a kernel-only thread within a user process | |
US7373640B1 (en) | Technique for dynamically restricting thread concurrency without rewriting thread code | |
US4387427A (en) | Hardware scheduler/dispatcher for data processing system | |
JP2514299B2 (ja) | プロセスレベルプログラミングのための割込み処理の直列化方法 | |
US6983462B2 (en) | Method and apparatus for serving a request queue | |
JPH03126158A (ja) | スケジユーリング方法及び装置 | |
US20040078799A1 (en) | Interpartition communication system and method | |
WO1992003779A1 (fr) | Procede assurant la communication efficace entre des coprocesseurs | |
Schwan et al. | Multiprocessor real-time threads | |
JPS6334490B2 (fr) | ||
JP7346649B2 (ja) | 同期制御システムおよび同期制御方法 | |
Rothberg | Interrupt handling in Linux | |
Wani | Operating System | |
Papadimitriou et al. | Mac OS versus FreeBSD: A comparative evaluation | |
WO1992003783A1 (fr) | Procede de mise en ×uvre des fonctions du noyau | |
WO1992003784A1 (fr) | Procede d'organisation pour un systeme d'exploitation a multiprocessor | |
Seo et al. | Supporting preemptive user-level threads for embedded real-time systems |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): JP KR |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH DE DK ES FR GB GR IT LU NL SE |