WO1982000381A1 - Speech synthesis aid for the vocally handicapped - Google Patents
Speech synthesis aid for the vocally handicapped Download PDFInfo
- Publication number
- WO1982000381A1 WO1982000381A1 PCT/GB1981/000144 GB8100144W WO8200381A1 WO 1982000381 A1 WO1982000381 A1 WO 1982000381A1 GB 8100144 W GB8100144 W GB 8100144W WO 8200381 A1 WO8200381 A1 WO 8200381A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- speech
- speech synthesis
- microcomputer
- synthesis device
- read
- Prior art date
Links
- 230000015572 biosynthetic process Effects 0.000 title claims abstract description 26
- 238000003786 synthesis reaction Methods 0.000 title claims abstract description 26
- 230000015654 memory Effects 0.000 claims abstract description 26
- 239000011159 matrix material Substances 0.000 claims abstract description 8
- 238000004891 communication Methods 0.000 claims abstract description 5
- 239000004020 conductor Substances 0.000 claims description 10
- 239000012528 membrane Substances 0.000 claims description 6
- 230000004044 response Effects 0.000 claims description 5
- 238000000034 method Methods 0.000 claims description 3
- 230000002093 peripheral effect Effects 0.000 claims description 3
- 230000001105 regulatory effect Effects 0.000 claims description 3
- 230000006399 behavior Effects 0.000 claims description 2
- 230000001360 synchronised effect Effects 0.000 claims description 2
- 229910000859 α-Fe Inorganic materials 0.000 claims description 2
- 230000005055 memory storage Effects 0.000 claims 3
- 125000006850 spacer group Chemical group 0.000 claims 1
- 230000009466 transformation Effects 0.000 claims 1
- 230000006870 function Effects 0.000 abstract description 15
- 238000005286 illumination Methods 0.000 abstract description 3
- 238000010586 diagram Methods 0.000 description 10
- 101100084053 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) ppi-2 gene Proteins 0.000 description 6
- 101100084040 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) ppi-1 gene Proteins 0.000 description 5
- 239000000872 buffer Substances 0.000 description 4
- 230000001225 therapeutic effect Effects 0.000 description 4
- 229920000333 poly(propyleneimine) Polymers 0.000 description 3
- 240000007320 Pinus strobus Species 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 230000001276 controlling effect Effects 0.000 description 2
- 230000003111 delayed effect Effects 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 229910052732 germanium Inorganic materials 0.000 description 2
- GNPVGFCGXDBREM-UHFFFAOYSA-N germanium atom Chemical compound [Ge] GNPVGFCGXDBREM-UHFFFAOYSA-N 0.000 description 2
- 238000003825 pressing Methods 0.000 description 2
- 230000001960 triggered effect Effects 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- 102100025840 Coiled-coil domain-containing protein 86 Human genes 0.000 description 1
- 101000932708 Homo sapiens Coiled-coil domain-containing protein 86 Proteins 0.000 description 1
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- HPNSNYBUADCFDR-UHFFFAOYSA-N chromafenozide Chemical compound CC1=CC(C)=CC(C(=O)N(NC(=O)C=2C(=C3CCCOC3=CC=2)C)C(C)(C)C)=C1 HPNSNYBUADCFDR-UHFFFAOYSA-N 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 238000013479 data entry Methods 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 230000000881 depressing effect Effects 0.000 description 1
- 230000000994 depressogenic effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- LELOWRISYMNNSU-UHFFFAOYSA-N hydrogen cyanide Chemical compound N#C LELOWRISYMNNSU-UHFFFAOYSA-N 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000004377 microelectronic Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 239000002985 plastic film Substances 0.000 description 1
- 229920006255 plastic film Polymers 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 210000001747 pupil Anatomy 0.000 description 1
- 230000008672 reprogramming Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000009738 saturating Methods 0.000 description 1
- 230000001953 sensory effect Effects 0.000 description 1
- 229910052709 silver Inorganic materials 0.000 description 1
- 239000004332 silver Substances 0.000 description 1
- 230000005236 sound signal Effects 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 230000001755 vocal effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B21/00—Teaching, or communicating with, the blind, deaf or mute
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
Definitions
- This invention concerns a speech synthesis aid for the vocally handicapped which takes advantage of recent advances in the field of electronic speech synthesis to provide a relatively low-cost portable device exhibiting substantial flexibility of application ranging from a communication aid operable by a person incapable of speech to enable speech utterances selected from a preselected range to be made, to a device useful for example as a learning aid which couples synthesised speech utterances with the playing of a simple game or the performance of simple routine operations.
- speech synthesiser circuits have been developed to a point where devices are available which are capable of selective operation to output signals which when applied to drive an acoustic transducer produce clear and intelligible speech.
- Such devices commonly employ more or less complicated microelectronic circuitry including read-only-memories (ROM's) pre-programmed with data information selectively accessible by a microcontroller for outputting the requisite signals for speech generation.
- ROM's read-only-memories
- Such circuits have found application in electronic calculators, instruments and computer-type peripherals where a spoken word output is desired in addition to for example a visual display, and it has been proposed to incorporate speech synthesis systems into operating routines such as for example are involved in pre-flight checking of aircraft.
- the present invention is not concerned with the actual circuits and methods employed for synthesising speech, rather the invention is concerned with applications for known speech synthesis devices.
- a speech synthesis communication aid for the vocally handicapped which employs a small microcomputer to monitor control inputs and appropriately drive a visual display and a speech synthesiser unit.
- the aim is to provide a portable and easily used aid for persons with speech handicaps.
- the invention allows the user to communicate via synthesised speech, independent of natural voice function. Since a broad vodabulary is required to cover the range of application, a modular approach employing interchangeable repertoire modules and associated word-selector overlays is used, the repertoire modules being designed so that the user may plug them into the speech aid as required.
- each repertoire module Associated with each repertoire module is a translucent overlay sheet on which appropriate symbols or words are printed, the overlay sheet corresponding to the currently installed repertoire module being adapted to be placed over a selector panel on the speech aid.
- This panel is comprised of a rectangular array of lamps any one of which may be selected by the user to be illuminated under microcomputer control, the arrangement being such that when an overlay is placed on the selector panel, the words or symbols of the overlay are positioned to locate over the lamps in the array and may be selectively highlighted to indicate selection of the corresponding speech output.
- Selector controls are provided for changing, via the microcomputer, the lamp or lamps currently illuminated in the selector panel. Use of these controls allows the apparent movement of an index light behind the overlay sheet, permitting the choice of desired words or symbols printed thereon.
- a string of utterances may be constructed by repeatedly selecting and storing words or symbols.
- the desired assembly may be synthetically spoken by operating the appropriate control.
- the capabilities to repeat and extend existing utterance strings are provided.
- the invention includes options for extra therapeutic functions. By means of external connection facilities and appropriate programming, units may be interconnected or linked to a master console. These modes are intended for teaching applications. The latter mode allows a teacher to monitor and guide pupils in the style of a language laboratory. Sentence construction exercises might be conducted through such a network equipped with suitable overlay sheets and vocabulary modules.
- the aid may function as a game or puzzle, simple concept-ordering problems being provided in pre-programmed form associated with specific overlay sheets and vocabulary modules and the vocal output of the unit being used to guide and encourage the user.
- the invention enables an unlimited repertoire, a simple means of utterance selection and outstanding therapeutic capabilities.
- a lack of restriction on repertoire size is obtained by virtue of the use of interchangeable vocabulary modules and overlay sheets.
- the back-illuminated selector panel with uncomplicated controls facilitates utterance selection. Due to its microcomputer-based design, the invention is amenable to functional expansions including network and games modes.
- Fig. 1 is a top plan view of the operating panel of the first embodiment
- Fig. 2 shows the circuit diagram (in block form) of the embodiment of Fig. 1;
- Fig. 3 shows the circuit diagram of an input . switch sub-system of the circuit of Fig. 2;
- Fig. 4 shows the circuit diagram of a display sub-system of the circuit of Fig. 2; and Figs. 5 and 6 together show the basic system software flow-chart of the first embodiment; and wherein:-
- Fig. 7 shows a schematic circuit diagram of a microcomputer section of the second embodiment
- Fig. 8 shows a schematic circuit diagram of display and keyboard sections of the second embodiment
- Fig. 9 shows a schematic circuit diagram of speech synthesis and control key sections of the second embodiment
- Fig. 10 shows a schematic circuit diagram of a power supply section of the second embodiment
- Fig. 11 shows input/output port allocation in the second embodiment
- Fig. 12 shows a simplified system software flow-chart of the second embodiment.
- Fig. 1 shows the front face appearance of the first described embodiment which may be comparable in size and weight with a portable cassette tape recorder.
- the controls include an 8 x 8 cell individually and selectively back-illuminated selector panel 1 with slider controls 3 and 4 to allow choice of the cell illuminated on the selector panel, the controls selecting columns and rows respectively.
- Translucent word- defining overlay sheets (not shown) are laid on panel 1. Power to the unit is switched by 2, whilst keys 5 and 6 control "speak” and “store” functions of the unit, the "speak” key being used to initiate the device output and for repetition of utterances and the "store” key being used when sequentially selected utterances are to be strung together.
- the device of Fig. 1 is adapted to provide readily intelligible synthesised speech from a Tele- sensory Systems Inc. TSI-S2 large scale integrated two chip synthesiser with interchangeable vocabulary ROMS embodied as plug-in modules.
- the device operates under microcomputer control, a device from the Intel MCS-48 single-chip microcomputer family for example being selected for this purpose.
- the 8748 unit of this family might by selected, this unit. being UV erasable and reprogrammable and providing 1K x 8 bits of control memory, 64 x 8 bits of read/write memory and 27 input/ output (I/O) lines.
- Such a device is capable of around one minute of speech output per vocabulary module, divided among 64 selectable utterances. In the embodiment described only the speech output mode is provided.
- the system may be considered as five sections, namely:-
- a single-chip microcomputer with support components (i) input switches and associated bus-sharing components; (iii) the speech synthesiser chip set and associated audio-path components; (iv) decoders controlling the selector panel and system enable lines; and (v) the selector panel driver and lamp array sub-system.
- an Intel 8748-8 microcomputer 15 (Fig. 2) operating with a cycle time of 4.17 ⁇ s is the central controlling element of the system, ports 1 and 2 of the microcomputer being used for output and mixed I/O respectively.
- Output port 1 provides a six bit select bus to speech synthesiser integrated circuit 10 and to display decoders 8 and 11.
- the speech synthesiser 10 comprises for example a Telesensory Systems Inc. device no. S2. Bits 6 and 7 remaining from Port 1 are decoded to provide four system module enable lines via decoder 13. These lines are used to control the speech synthesiser 10 and the input subsystem constituted.by selection switches 16, 17 and function keys l8.
- the low order four bits of Port 2 are not used to control intrinsic system functions, but are dedicated to the role of providing the top four bits of addresses to external program memory in conduction with 8 lines from address latch 19. These low order address bits are strobed from the data bus into the address latch during ALE cycles.
- Bits 4 to 7 of Port 2 are used to input data from the. select and function switches 16, 17 and 18. As shown in Fig. 3 , all three switch modules share the bus and are selected onto it by pulling switch common lines to ground via transistors 21, 23 and 26. These devices are controlled by system enable lines.
- the structure of the 8748 Ports 1 and 2 is such that, when configured as inputs, lines are internally pulled high by 50k resistors, thereby obviating the need for external pull-ups.
- the diodes 20, 22, 24 and 25 associated with the switches prevent the state of disabled switches from effecting valid data.
- Fig. 2 shows the speech synthesiser LS1, 10 interfaced to the microcomputer 15 via eight lines; the previously mentioned six-bit utterance select input, and "start” and “busy” lines.
- a level-shifting network is provided as shown in Fig. 2 on appropriate synthesiser 1/0 lines to allow interfacing of the TTL and PMOS logic signals.
- Start is an input and is controlled by a system enable line.
- Bussy is a status output indicating synthesiser non-availability, and is monitored by a single-bit testable input, TO. Connections are also made to a vocabulary ROM 7, housed in a plug-in module. This contains the parameters associated with the utterance set from which the synthesiser reconstructs speech.
- Resulting audio signals are low-pass filtered by filter circuit 9 to remove sampling frequency components associated with digital signal processing. They are then fed, after amplification by a small monolithic audio amplifier 12, to a loudspeaker 14.
- Fig. 4 is a simplified diagram showing only four element lamps and associated drivers, the full system comprising eight such row and column drivers connected to 64 incandescent lamps each with an associated diode.
- Lamps 30 etc. are driven by row and column PNP transistor buffers 27 and 28 respectively. Eight transistors make up each buffer. These devices are activated by low- true signals derived from two 74LS138three-to-eight line decoders (8 and 11 in Fig. 2) controlled by bits 0 to 5 of Port 1 of microcomputer 15. Diodes 29 etc. are employed in the array to prevent spurious current paths through unselected lamps.
- System expansion facilities have been provided by incorporating an address bus latch (Fig. 2, 19) and by making PSEN (program Store Enable), RD (Read), WR (Write) and the data and address busses externally accessible. Memory mapped interface devices and up to
- UTT ADDR is the synthesiser and selector panel control address
- UTT PTR is a pointer to the current UTT ADDR storage location in data memory.
- Actuation of the "Speak" key causes the FO flag to be tested. If it is set, the selector panel controls have been altered since the last speak or store operation, and a new item is assumed. Thus the UTT PTR and FO flag are reset before the selected utterance is output. If the F0 flag is not set, repeat mode is assumed and the current stored string is output, leaving the UTT PTR unchanged, ready for a repeated output if the "Speak" key is re-depressed. If the
- the function key input subroutine is edgesensitive and includes debounce delays to avoid spurious data entries.
- the "speak string" subroutine outputs to both the speech synthesiser and selector panel. Data is output from an UTT ADDR buffer starting at the buffer base and ending on the last position of the UTT PTR. BUSY, generated by the speech synthesiser, is tested within the subroutine, allowing correct output timing. Functional enhancements are attainable within the IK byte on-chip program memory space. External additional program and data memory may be added to further expand functional capabilities.
- a single-chip microcomputer controller would be employed in production versions of the speech aid hereinafter described to minimise parts cost
- the described embodiment uses a Z8 ⁇ microprocessor with associated support components, this approach having been dictated by the availability to the inventors of Z ⁇ O software development tools.
- the embodiment hereinafter described comprises four readily identifiable sections which are illustrated in Figs. 7, to 10, namely a microcomputer controller (Fig. 7) an integrated display/keyboard (Fig. 8), aspeech synthesiser unit with vocabulary ROMs and audio output components (Fig. 9) and a logic-controlled power supply (Fig. 10).
- the microcomputer controller is based on a Z8 ⁇ microprocessor 50, and employs a memory-mapped architecture with partially decoded linear select device addressing.
- Program memory consists of 2k bytes of PROM space in a single 2716 device 51 residing between hexadecimal addresses 0000 and 07FF in the system memory space. By virtue of its erasable/reprogrammable nature, the 2716 was a logical choice for program development.
- Read/write memory is provided in the form of two 2114 RAM chips 52 offering Ik bytes of temporary workspace mapped between hexadecimal addresses 3000 and 33FF.
- PPI programmable peripheral interface
- the former device controls display rows, display columns and various enables through its three ports.
- the latter strobes keyboard rows and provides speech synthesiser utterance addresses through its A port, whilst its B and C ports, configured as inputs, monitor keyboard columns and miscellaneous system status bits.
- System support functions (clock and reset generation and device selection) are performed by the remaining microcomputer components.
- a discrete crystal-controlled transistor oscillator 54 operating at 1MHz generates the system clock signal. This is buffered by a TTL schmitt inverter 55 before being fed to the microprocessor clock terminal.
- An inverted version of the same signal is used to synchronise, via a D type flip-flop 56, an asynchronous reset line derived from a schmitt inverter-based power-on reset circuit 57.
- the synchronised reset signal is fed in inverted and non-inverted forms to the reset terminals of the microprocessor 50 and PPIs 53 respectively.
- the RAM enable signal is further gated with RD and WR (read and write) signals by gates 59 before being routed to the 2114 chip select terminals. This prevents data bus contention during write cycles when the RAM enable signal is true before a write pulse.
- RD and WR read and write
- this assembly comprises a set of 64 back-illuminated switches 60 (only some of which are shown) realised via membrane switch technology. This allows cells of the flexible translucent keyboard overlays, associated with particular vocabulary repertoires to be selectively high-lighted under software control.
- the display consists of 64 T1 incandescent lamps 61 (used because of their "white" light output) isolated against spurious back currents by series diodes 62 aiid driven by 16 bipolar transistors 63 arranged as eight row and eight column drivers, the former being configured as emitter followers while the latter operate in the saturating common emitter mode. As with the lamps 61, only some of transistors 63 are shown in Fig. 8. Both driver sets are arranged to be enabled by high-true signals.
- a set of eight 4K7 resistors 64 pull up the row driver inputs which are connected to port A of PPI 1 (Fig. 7). Eight 390 ⁇ . resistors 65 limit drive current sourced by port B of PPI 1 to the bases of the column driver transistors 63.
- the keyboard may be based on a matrix of 8 x 8 thin silver contact strips laid down on two stable plastic films.
- a thin perforated non-conducting membrane separates the films which are arranged so that the conductor sets lie at right angles to each other. The perforations are aligned with the crosspoints of the conductor-carrying films so that external pressure (e.g. from a finger) causes deformation of the film through a perforation and consequential contact closure.
- Keyboard rows are strobed by port A of PPI 2 (Fig. 7) which, under software control, sequentially pulls row lines .low. Column lines, pulled high by 47K resisto ⁇ »s 66 are montored by port B of PPI 2. If a coincidence of row strobe and contact closure occurs, the associated column line is pulled low. Software then decodes the location of the depressed key.
- a Telesensory Systems Inc. S2B mini speech synthesis board 70 is used in the embodiment, this circuit having been modified to include a vocabulary ROM containing a "standard" set of 64 utterances mounted "off-board” as a plug-in module. Different plug-in modules will contain repertoires appropriate to the diverse needs of the speech handicapped.
- the computer interface to the board 70 comprises six address lines SA 0-5 (to select one of 64 utterances), a "start" line ST, a ROM power down control line VR0M, and a "busy” output.
- the S2B requires its inputs to be pulled high via resistors and its "busy” output to be level-shifted and buffered when interfaced to TTL circuitry.
- Utterance address and ROM power down signals derived from port A, PPI 2 and a "start" signal from port C, PPI 1 are buffered by a 74LS244 device 71 and pulled high via a 4K7 resistor package 72 before being fed to the appropriate terminals of the S2B.
- the "busy" signal from the synthesiser 70 is level- shifted and buffered by a discrete transistor inverter 73, then fed to port C of PPI 2 along with low true “speak” and "clear” control key lines.
- the analogue output AAS of the S2B is processed by an amplifier 74 and filter 75 before being routed to a loudspeaker 76.
- Filter characteristics are variable by means of a preset Baxandall circuit to allow intellegibility to be optimised.
- the basic filter response is bandpass and, with Baxandall flat, has corner frequencies at 400Hz and 1KHz. Roll-on and roll-off are at approximately 6dB per octave.
- Filter and amplifier are realised via 3l4 ⁇ operational amplifier chips.
- the output level is adjustable by a gain trimmer in the amplifier stage.
- a complementary emitter follower buffer 77 using germanium transistors for low base-emitter voltage drop drives the loudspeaker from the filter output.
- a red/green light emitting diode (LED) indicator 78, driven by a pair of TTL inverters 79 enabled by signals from port C of PPI 1 is employed to show system status.
- LED red/green light emitting diode
- the power supply (Fig. 10) incorporates several unusual features. It is required that the aid be portable and independent of external power sources, hence internal batteries are dictated. Most internal circuitry requires a +5 volt supply, hence three 2-volt Cyclon accumulators 80 were chosen for the power source. These 2.5 Ah devices yield a series EMF of 6 volts, allowing an accompanying regulating circuit only a one volt drop. In addition it was desired that a low battery condition be signalled to the microcomputer to allow an orderly automatic shutdown procedure. Further, a requirement for toggle on/ off action and automatic time-out in standby mode was specified. Finally the S2B speech synthesiser requires a minus 10 volt rail which must be derived from the 6 volt power source.
- a germanium saturable series pass element 8l This transistor is driven by the collector current of one side of a long-tailed pair 82 which operates as a comparator between a zener derived reference potential and a proportion of the output voltage. Emitter current for the long-tailed pair 82 is sunk by a resistor 83 in series with a transistor switch 84. When the transistor 84 is disabled, base current to the series pass transistor 8l is cut off and the regulator output falls to zero. A keep-alive current is fed to the reference zener 85 under these conditions. This facilitates orderly power-up behaviour.
- a PNP transistor 86 with baseemitter connected via a current limiting resistor 87 across the series pass transistor 8l monitors the input- output differential voltage and produces, via a CMOS schmitt trigger gate 88 , a "battery low" logic signal which is routed to port C of PPI 2. Under powered down conditions the regulator draws less than one milliamp, and hence may be permanently connected to the six volt supply.
- the base of the long-tailed pair emitter current switch transistor 84 is fed via a current limiting resistor 89 from a set/reset flip-flop 90 configured using CMOS schmitt NAND gates, permanently powered from the six volt supply.
- Flip-flop 90 may be triggered “on” by the momentary closure of the "on/ off” key switch 91.
- a differentiating network 92 between the switch and gate ensures a short ( 10 microsecond) trigger pulse.
- the set/reset flip-flop 90 is triggered “off” by a high to low transition of a line from port C of PPI 1.
- a "lock-out” circuit 93 prevents the flip-flop 90 from adopting a stable "on” state. This is achieved by resetting the flip-flop 90 with a suitably delayed version of the "battery low” logic signal. Automatic power-down on low battery voltage is disabled after a few milliseconds by a delayed output from the flip-flop 90 derived by CR network 94.
- a flyback type converter is included in the power supply sub-system.
- a 1:1 ferrite pulse transformer 95 is driven by a transistor 96 in series with a current sensing resistor 97 from the six volt supply.
- Transformer primary current is monitored by the inverting input of a 3140 operational amplifier 98 c-onfigured as a schmitt trigger with threshold varied by feedback from the transformer secondary circuit.
- Two paths provide both AC positive feedback ensuring high switching efficiency, and negative DC feedback regulating the output voltage.
- Secondary current is rectified and smoothed to form the minus 10 volt output.
- Logic controlled on/off switching is effected by virtue of the power supply to operational amplifier 98 being derived from the +5 volt rail.
- Fig. 12 is a software flowchart of the basic system function together with the following functional description of the machine operation.
- LED flashes red during settling time and until key is released, then lights green to indicate "ready" state. If battery voltage is initially low, power-up is inhibited by hardware.
- display areas are pressed in the order that utterances are required. As areas are pressed they back-illuminate to show current store contents.
- Pressing the SPEAK key allows the stored utt erance string to be verbally output in the order of entry. As each utterance is made the associated display area blanks. At the end of the string all previously selected display areas are re-illuminated.
- Pressing the CLEAR key during any user entry phase clears the utterance store and blanks the display. Entering an utterance already stored clears that utterance from memory and blanks the associated display area.
- the LED displays red and the machine automatically powers-down after the current utterance is spoken. Under standby conditions, low battery voltage is indicated by red LED illumination for 5 seconds during which time all control and entry functions except ON/ OFF are locked out. At the end of the time-out period automatic power-down is invoked.
- the LED flashes green for 10 seconds to warn of incipient power-down. Any entries made during this time restore normal operation. If no entry is made, the machine powers itself down.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- General Health & Medical Sciences (AREA)
- Business, Economics & Management (AREA)
- Educational Administration (AREA)
- Educational Technology (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Input From Keyboards Or The Like (AREA)
Abstract
A speech synthesis communication aid for the vocally handicapped comprises a selector panel for use with translucent overlays bearing words or symbols arranged in an 8x8 matrix array and having controls (16, 17) operable by a user to enable selection and consequent back-illumination of one or more of the matrix cells. A microcomputer (15) is programmed to respond to cell selection to organise the back-illumination and also correspondingly to address a speech synthesiser circuit (10) having a read-only memory (7) containing data adapted for generation of speech signals so that an output is provided from the speech synthesiser circuit corresponding to the selected cell. The read-only memory is a plug-in module preprogrammed with a vocabulary corresponding to the words or symbols on a respective overlay, and a plurality of such modules and overlays are provided to obtain the requisite vocabulary repertoire. The microcomputer is also programmed to enable strings of speech utterances to be assembled by selection sequentially of a plurality of matrix cells and to be verbalised by the speech synthesiser, function keys (18) being provided for user operation to utilise this facility.
Description
Speech synthesis aid for the vocally handicapped
This invention concerns a speech synthesis aid for the vocally handicapped which takes advantage of recent advances in the field of electronic speech synthesis to provide a relatively low-cost portable device exhibiting substantial flexibility of application ranging from a communication aid operable by a person incapable of speech to enable speech utterances selected from a preselected range to be made, to a device useful for example as a learning aid which couples synthesised speech utterances with the playing of a simple game or the performance of simple routine operations.
In recent years speech synthesiser circuits have been developed to a point where devices are available which are capable of selective operation to output signals which when applied to drive an acoustic transducer produce clear and intelligible speech. Such devices commonly employ more or less complicated microelectronic circuitry including read-only-memories (ROM's) pre-programmed with data information selectively accessible by a microcontroller for outputting the requisite signals for speech generation. Such circuits
have found application in electronic calculators, instruments and computer-type peripherals where a spoken word output is desired in addition to for example a visual display, and it has been proposed to incorporate speech synthesis systems into operating routines such as for example are involved in pre-flight checking of aircraft. The present invention is not concerned with the actual circuits and methods employed for synthesising speech, rather the invention is concerned with applications for known speech synthesis devices.
Hereinafter described, in accordance with the present invention is a speech synthesis communication aid for the vocally handicapped which employs a small microcomputer to monitor control inputs and appropriately drive a visual display and a speech synthesiser unit. The aim is to provide a portable and easily used aid for persons with speech handicaps. By means of simple controls, the invention allows the user to communicate via synthesised speech, independent of natural voice function. Since a broad vodabulary is required to cover the range of application, a modular approach employing interchangeable repertoire modules and associated word-selector overlays is used, the repertoire modules being designed so that the user may plug them into the speech aid as required. Associated with each repertoire module is a translucent overlay sheet on which appropriate symbols or words are printed, the overlay sheet corresponding to the currently installed repertoire module being adapted to be placed over a selector panel on the speech aid. This panel is comprised of a rectangular array of lamps any one of which may be selected by the user to be illuminated under microcomputer control, the arrangement being such that when an overlay is placed on the
selector panel, the words or symbols of the overlay are positioned to locate over the lamps in the array and may be selectively highlighted to indicate selection of the corresponding speech output. Selector controls are provided for changing, via the microcomputer, the lamp or lamps currently illuminated in the selector panel. Use of these controls allows the apparent movement of an index light behind the overlay sheet, permitting the choice of desired words or symbols printed thereon. Having selected a word or symbol, other controls may be used to cause the invention to utter a spoken equivalent to that word, or to store it. In the latter case, a string of utterances may be constructed by repeatedly selecting and storing words or symbols. When the desired assembly has been created it may be synthetically spoken by operating the appropriate control. The capabilities to repeat and extend existing utterance strings are provided. Further to basic speech-aid capabilities, the invention includes options for extra therapeutic functions. By means of external connection facilities and appropriate programming, units may be interconnected or linked to a master console. These modes are intended for teaching applications. The latter mode allows a teacher to monitor and guide pupils in the style of a language laboratory. Sentence construction exercises might be conducted through such a network equipped with suitable overlay sheets and vocabulary modules. Provision is made for data to flow bidirectionally between speech-aid units and the control console. In a standalone therapeutic mode, the aid may function as a game or puzzle, simple concept-ordering problems being provided in pre-programmed form associated with specific overlay sheets and vocabulary modules and the vocal
output of the unit being used to guide and encourage the user.
The invention enables an unlimited repertoire, a simple means of utterance selection and outstanding therapeutic capabilities. A lack of restriction on repertoire size is obtained by virtue of the use of interchangeable vocabulary modules and overlay sheets. The back-illuminated selector panel with uncomplicated controls (joystick or slider, or push-buttons, for example) facilitates utterance selection. Due to its microcomputer-based design, the invention is amenable to functional expansions including network and games modes.
The invention in its various aspects, and features and advantages thereof, will become fully apparent to those possessed of appropriate skills from consideration of the following descriptions of two exemplary embodiments of the invention, the first embodiment being described in outline and the second in detail, which are illustrated in the accompanying drawings wherein:-
Fig. 1 is a top plan view of the operating panel of the first embodiment;
Fig. 2 shows the circuit diagram (in block form) of the embodiment of Fig. 1;
Fig. 3 shows the circuit diagram of an input . switch sub-system of the circuit of Fig. 2;
Fig. 4 shows the circuit diagram of a display sub-system of the circuit of Fig. 2; and Figs. 5 and 6 together show the basic system software flow-chart of the first embodiment; and wherein:-
Fig. 7 shows a schematic circuit diagram of a microcomputer section of the second embodiment; Fig. 8 shows a schematic circuit diagram of
display and keyboard sections of the second embodiment;
Fig. 9 shows a schematic circuit diagram of speech synthesis and control key sections of the second embodiment; Fig. 10 shows a schematic circuit diagram of a power supply section of the second embodiment;
Fig. 11 shows input/output port allocation in the second embodiment; and
Fig. 12 shows a simplified system software flow-chart of the second embodiment. Fig. 1 shows the front face appearance of the first described embodiment which may be comparable in size and weight with a portable cassette tape recorder. The controls include an 8 x 8 cell individually and selectively back-illuminated selector panel 1 with slider controls 3 and 4 to allow choice of the cell illuminated on the selector panel, the controls selecting columns and rows respectively. Translucent word- defining overlay sheets (not shown) are laid on panel 1. Power to the unit is switched by 2, whilst keys 5 and 6 control "speak" and "store" functions of the unit, the "speak" key being used to initiate the device output and for repetition of utterances and the "store" key being used when sequentially selected utterances are to be strung together.
The device of Fig. 1 is adapted to provide readily intelligible synthesised speech from a Tele- sensory Systems Inc. TSI-S2 large scale integrated two chip synthesiser with interchangeable vocabulary ROMS embodied as plug-in modules. The device operates under microcomputer control, a device from the Intel MCS-48 single-chip microcomputer family for example being selected for this purpose. The 8748 unit of this family might by selected, this unit. being UV erasable and reprogrammable and providing 1K x 8 bits of control
memory, 64 x 8 bits of read/write memory and 27 input/ output (I/O) lines. Such a device is capable of around one minute of speech output per vocabulary module, divided among 64 selectable utterances. In the embodiment described only the speech output mode is provided. Symbols selected by the user are uttered by the invention. Utterances may be stored to produce strings of words and may be repeated. Expansion to include the network and therapeutic puzzle modes mentioned hereinbefore is readily available by appropriate reprogramming of the microcomputer control memory and the addition of a simple hardware interface.
The system may be considered as five sections, namely:-
(i) a single-chip microcomputer with support components; (ii) input switches and associated bus-sharing components; (iii) the speech synthesiser chip set and associated audio-path components; (iv) decoders controlling the selector panel and system enable lines; and (v) the selector panel driver and lamp array sub-system.
Referring to Figs. 2, 3 and 4 of the accompanying drawings, an Intel 8748-8 microcomputer 15 (Fig. 2) operating with a cycle time of 4.17μs is the central controlling element of the system, ports 1 and 2 of the microcomputer being used for output and mixed I/O respectively. Output port 1 provides a six bit select bus to speech synthesiser integrated circuit 10 and to display decoders 8 and 11. The speech synthesiser 10 comprises for example a Telesensory Systems Inc. device no. S2. Bits 6 and 7 remaining from Port 1
are decoded to provide four system module enable lines via decoder 13. These lines are used to control the speech synthesiser 10 and the input subsystem constituted.by selection switches 16, 17 and function keys l8. The low order four bits of Port 2 are not used to control intrinsic system functions, but are dedicated to the role of providing the top four bits of addresses to external program memory in conduction with 8 lines from address latch 19. These low order address bits are strobed from the data bus into the address latch during ALE cycles.
Bits 4 to 7 of Port 2 are used to input data from the. select and function switches 16, 17 and 18. As shown in Fig. 3 , all three switch modules share the bus and are selected onto it by pulling switch common lines to ground via transistors 21, 23 and 26. These devices are controlled by system enable lines. The structure of the 8748 Ports 1 and 2 is such that, when configured as inputs, lines are internally pulled high by 50k resistors, thereby obviating the need for external pull-ups. The diodes 20, 22, 24 and 25 associated with the switches prevent the state of disabled switches from effecting valid data.
Fig. 2 shows the speech synthesiser LS1, 10 interfaced to the microcomputer 15 via eight lines; the previously mentioned six-bit utterance select input, and "start" and "busy" lines. A level-shifting network is provided as shown in Fig. 2 on appropriate synthesiser 1/0 lines to allow interfacing of the TTL and PMOS logic signals. "Start" is an input and is controlled by a system enable line. "Busy" is a status output indicating synthesiser non-availability, and is monitored by a single-bit testable input, TO. Connections are also made to a vocabulary ROM 7, housed in a plug-in module. This contains the parameters associated with the
utterance set from which the synthesiser reconstructs speech. Resulting audio signals are low-pass filtered by filter circuit 9 to remove sampling frequency components associated with digital signal processing. They are then fed, after amplification by a small monolithic audio amplifier 12, to a loudspeaker 14.
The circuit diagram of the eight by eight incandescent lamp display matric constituting the selector panel is represented in Fig. 4 which is a simplified diagram showing only four element lamps and associated drivers, the full system comprising eight such row and column drivers connected to 64 incandescent lamps each with an associated diode. Lamps 30 etc. are driven by row and column PNP transistor buffers 27 and 28 respectively. Eight transistors make up each buffer. These devices are activated by low- true signals derived from two 74LS138three-to-eight line decoders (8 and 11 in Fig. 2) controlled by bits 0 to 5 of Port 1 of microcomputer 15. Diodes 29 etc. are employed in the array to prevent spurious current paths through unselected lamps. System expansion facilities have been provided by incorporating an address bus latch (Fig. 2, 19) and by making PSEN (program Store Enable), RD (Read), WR (Write) and the data and address busses externally accessible. Memory mapped interface devices and up to
3K bytes of program memory and 190 bytes of data memory may be added to the system via these expansion facilities. System software is illustrated by the flowchart of Figs. 5 and 6 wherein the following terminology is employed:- UTT ADDR is the synthesiser and selector panel control address; and UTT PTR is a pointer to the current UTT ADDR storage location in data memory.
Referring to Figs. 5 and 6, the majority of CPU time is spent in an idle loop in which the selector panel controls and function keys are scanned. Display refreshing is done in this loop, illuminating the current lamp for 20 ms per cycle. After reading the row and column selector switches, the data obtained is translated from Grey code (produced by the slide logic switches) to an UTT ADDR. This is then compared with the UTT ADDR obtained in the last scan. A repeat flag, FO is set if a change is detected.
When the "Store" key is actuated, the current UTT ADDR is stored at the UTT PTR location, the FO flag is cleared and the UTT PTR is incremented. Thus a string of utterances may be stored by alternately setting the selector panel controls and depressing the "Store" key.
Actuation of the "Speak" key causes the FO flag to be tested. If it is set, the selector panel controls have been altered since the last speak or store operation, and a new item is assumed. Thus the UTT PTR and FO flag are reset before the selected utterance is output. If the F0 flag is not set, repeat mode is assumed and the current stored string is output, leaving the UTT PTR unchanged, ready for a repeated output if the "Speak" key is re-depressed. If the
"Store" key is actuated at this point, new utterances may be added to the end of the existing string.
It will be appreciated that the flow diagrams (Figs. 5 and 6) are at a high level and do not show the structure of specified operations.
The function key input subroutine is edgesensitive and includes debounce delays to avoid spurious data entries. The "speak string" subroutine outputs to both the speech synthesiser and selector panel. Data is output from an UTT ADDR buffer starting
at the buffer base and ending on the last position of the UTT PTR. BUSY, generated by the speech synthesiser, is tested within the subroutine, allowing correct output timing. Functional enhancements are attainable within the IK byte on-chip program memory space. External additional program and data memory may be added to further expand functional capabilities.
Having thus described, in outline, a first embodiment of the invention, a second embodiment will hereinafter be described in detail with reference to Figs. 7 to 12 of the accompanying drawings.
Whilst a single-chip microcomputer controller would be employed in production versions of the speech aid hereinafter described to minimise parts cost, the described embodiment uses a Z8θ microprocessor with associated support components, this approach having been dictated by the availability to the inventors of ZδO software development tools. The embodiment hereinafter described comprises four readily identifiable sections which are illustrated in Figs. 7, to 10, namely a microcomputer controller (Fig. 7) an integrated display/keyboard (Fig. 8), aspeech synthesiser unit with vocabulary ROMs and audio output components (Fig. 9) and a logic-controlled power supply (Fig. 10).
Referring to Fig. 7, the microcomputer controller is based on a Z8θ microprocessor 50, and employs a memory-mapped architecture with partially decoded linear select device addressing. Program memory consists of 2k bytes of PROM space in a single 2716 device 51 residing between hexadecimal addresses 0000 and 07FF in the system memory space. By virtue of its erasable/reprogrammable nature, the 2716 was a logical choice for program development. Read/write memory is
provided in the form of two 2114 RAM chips 52 offering Ik bytes of temporary workspace mapped between hexadecimal addresses 3000 and 33FF.
Communication with display, keyboard, speech synthesiser and power supply control lines is furnished via two 8255A programmable peripheral interface (PPI) devices 53 each offering three eight-bit I/O ports, A, B and C. These are configured under software control to give four output and two input ports. Mapped in memory space, the PPIs have hexadecimal addresses of
5000 to 5003 and 9000 to 9003. respectively. The former device controls display rows, display columns and various enables through its three ports. The latter strobes keyboard rows and provides speech synthesiser utterance addresses through its A port, whilst its B and C ports, configured as inputs, monitor keyboard columns and miscellaneous system status bits.
System support functions (clock and reset generation and device selection) are performed by the remaining microcomputer components. A discrete crystal-controlled transistor oscillator 54 operating at 1MHz generates the system clock signal. This is buffered by a TTL schmitt inverter 55 before being fed to the microprocessor clock terminal. An inverted version of the same signal is used to synchronise, via a D type flip-flop 56, an asynchronous reset line derived from a schmitt inverter-based power-on reset circuit 57. The synchronised reset signal is fed in inverted and non-inverted forms to the reset terminals of the microprocessor 50 and PPIs 53 respectively. An array of two-input NAND gates 58 each fed by an inverted version of the Z8θ MREQ. (memory request) signal together with levels derived from the top four bits of the address bus provide device enable lines. Following a system reset, the Z80 program counter commences
addressing memory from hexadecimal 0000, hence address bit A12 is initially low. This bit is inverted before being routed to the PROM enable NAND gate, thereby allowing program memory access following system reset. The other device-enabling gates are fed directly by address lines A13, A14 and A15 and enable RAM and PPIs 1 and 2 respectively. As is generally the case with linear select systems, only one address selecting bit may be true at any one time, hence A12 A13 A12 and A15 are considered mutually exclusive in the system software. The RAM enable signal is further gated with RD and WR (read and write) signals by gates 59 before being routed to the 2114 chip select terminals. This prevents data bus contention during write cycles when the RAM enable signal is true before a write pulse. Reference should be made to the Zilog Z80 Technical Manual for a detailed description of Z80 system timing.
Referring now to Fig. 8, the display/keyboard circuit is schematically represented. Physically, this assembly comprises a set of 64 back-illuminated switches 60 (only some of which are shown) realised via membrane switch technology. This allows cells of the flexible translucent keyboard overlays, associated with particular vocabulary repertoires to be selectively high-lighted under software control.
The display consists of 64 T1 incandescent lamps 61 (used because of their "white" light output) isolated against spurious back currents by series diodes 62 aiid driven by 16 bipolar transistors 63 arranged as eight row and eight column drivers, the former being configured as emitter followers while the latter operate in the saturating common emitter mode. As with the lamps 61, only some of transistors 63 are shown in Fig. 8. Both driver sets are arranged to be enabled
by high-true signals. A set of eight 4K7 resistors 64 pull up the row driver inputs which are connected to port A of PPI 1 (Fig. 7). Eight 390Ω. resistors 65 limit drive current sourced by port B of PPI 1 to the bases of the column driver transistors 63.
The keyboard may be based on a matrix of 8 x 8 thin silver contact strips laid down on two stable plastic films. A thin perforated non-conducting membrane separates the films which are arranged so that the conductor sets lie at right angles to each other. The perforations are aligned with the crosspoints of the conductor-carrying films so that external pressure (e.g. from a finger) causes deformation of the film through a perforation and consequential contact closure. Keyboard rows are strobed by port A of PPI 2 (Fig. 7) which, under software control, sequentially pulls row lines .low. Column lines, pulled high by 47K resistoι»s 66 are montored by port B of PPI 2. If a coincidence of row strobe and contact closure occurs, the associated column line is pulled low. Software then decodes the location of the depressed key.
The speech synthesiser and control key subsection is shown in Fig. 9 . A Telesensory Systems Inc. S2B mini speech synthesis board 70 is used in the embodiment, this circuit having been modified to include a vocabulary ROM containing a "standard" set of 64 utterances mounted "off-board" as a plug-in module. Different plug-in modules will contain repertoires appropriate to the diverse needs of the speech handicapped. The computer interface to the board 70 comprises six address lines SA 0-5 (to select one of 64 utterances), a "start" line ST, a ROM power down control line VR0M, and a "busy" output. The S2B requires its inputs to be pulled high via resistors and its "busy" output to be level-shifted and buffered
when interfaced to TTL circuitry.
Utterance address and ROM power down signals derived from port A, PPI 2 and a "start" signal from port C, PPI 1 are buffered by a 74LS244 device 71 and pulled high via a 4K7 resistor package 72 before being fed to the appropriate terminals of the S2B. The "busy" signal from the synthesiser 70 is level- shifted and buffered by a discrete transistor inverter 73, then fed to port C of PPI 2 along with low true "speak" and "clear" control key lines. The analogue output AAS of the S2B is processed by an amplifier 74 and filter 75 before being routed to a loudspeaker 76. Filter characteristics are variable by means of a preset Baxandall circuit to allow intellegibility to be optimised. The basic filter response is bandpass and, with Baxandall flat, has corner frequencies at 400Hz and 1KHz. Roll-on and roll-off are at approximately 6dB per octave. Filter and amplifier are realised via 3l4θ operational amplifier chips. The output level is adjustable by a gain trimmer in the amplifier stage. A complementary emitter follower buffer 77 using germanium transistors for low base-emitter voltage drop drives the loudspeaker from the filter output. A red/green light emitting diode (LED) indicator 78, driven by a pair of TTL inverters 79 enabled by signals from port C of PPI 1 is employed to show system status. Typically green will be lit to show the normal "on" state. Red might indicate low battery conditions. In order to meet certain special functional requirements of the speech aid, the power supply (Fig. 10) incorporates several unusual features. It is required that the aid be portable and independent of external power sources, hence internal batteries are dictated. Most internal circuitry requires a +5 volt
supply, hence three 2-volt Cyclon accumulators 80 were chosen for the power source. These 2.5 Ah devices yield a series EMF of 6 volts, allowing an accompanying regulating circuit only a one volt drop. In addition it was desired that a low battery condition be signalled to the microcomputer to allow an orderly automatic shutdown procedure. Further, a requirement for toggle on/ off action and automatic time-out in standby mode was specified. Finally the S2B speech synthesiser requires a minus 10 volt rail which must be derived from the 6 volt power source.
The requirement of low regulator dropout voltage dictated a germanium saturable series pass element 8l. This transistor is driven by the collector current of one side of a long-tailed pair 82 which operates as a comparator between a zener derived reference potential and a proportion of the output voltage. Emitter current for the long-tailed pair 82 is sunk by a resistor 83 in series with a transistor switch 84. When the transistor 84 is disabled, base current to the series pass transistor 8l is cut off and the regulator output falls to zero. A keep-alive current is fed to the reference zener 85 under these conditions. This facilitates orderly power-up behaviour. A PNP transistor 86 with baseemitter connected via a current limiting resistor 87 across the series pass transistor 8l monitors the input- output differential voltage and produces, via a CMOS schmitt trigger gate 88 , a "battery low" logic signal which is routed to port C of PPI 2. Under powered down conditions the regulator draws less than one milliamp, and hence may be permanently connected to the six volt supply. The base of the long-tailed pair emitter current switch transistor 84 is fed via a current limiting resistor 89 from a set/reset flip-flop 90 configured
using CMOS schmitt NAND gates, permanently powered from the six volt supply. Flip-flop 90 may be triggered "on" by the momentary closure of the "on/ off" key switch 91. A differentiating network 92 between the switch and gate ensures a short ( 10 microsecond) trigger pulse. The set/reset flip-flop 90 is triggered "off" by a high to low transition of a line from port C of PPI 1. Under conditions of low battery voltage, a "lock-out" circuit 93 prevents the flip-flop 90 from adopting a stable "on" state. This is achieved by resetting the flip-flop 90 with a suitably delayed version of the "battery low" logic signal. Automatic power-down on low battery voltage is disabled after a few milliseconds by a delayed output from the flip-flop 90 derived by CR network 94. After this initial automatic power-down time "window", only a logic signal from the microcomputer can switch off the system. To this end an output (low true) is derived from the "on/off" control key 91 and routed to port C of PPI 2, thereby enabling the microcomputer to detect when the user wishes to power-down.
In order to cater for the -10 volt rail requirements of the S2B, a flyback type converter is included in the power supply sub-system. A 1:1 ferrite pulse transformer 95 is driven by a transistor 96 in series with a current sensing resistor 97 from the six volt supply. Transformer primary current is monitored by the inverting input of a 3140 operational amplifier 98 c-onfigured as a schmitt trigger with threshold varied by feedback from the transformer secondary circuit. Two paths provide both AC positive feedback ensuring high switching efficiency, and negative DC feedback regulating the output voltage. Secondary current is rectified and smoothed to form the minus 10 volt output. Logic controlled on/off switching is effected by virtue of the
power supply to operational amplifier 98 being derived from the +5 volt rail. When this is disabled, no base current flows in the switching transistor 96, hence no current is drawn from the six volt supply. Having thus described the construction and organization of the second embodiment, reference may now be had to Fig. 12 which is a software flowchart of the basic system function together with the following functional description of the machine operation. On power-up (depression of ON/OFF key) LED flashes red during settling time and until key is released, then lights green to indicate "ready" state. If battery voltage is initially low, power-up is inhibited by hardware. To enter an utterance or string of utterances, display areas are pressed in the order that utterances are required. As areas are pressed they back-illuminate to show current store contents.
Pressing the SPEAK key allows the stored utt erance string to be verbally output in the order of entry. As each utterance is made the associated display area blanks. At the end of the string all previously selected display areas are re-illuminated.
Pressing the CLEAR key during any user entry phase clears the utterance store and blanks the display. Entering an utterance already stored clears that utterance from memory and blanks the associated display area.
If more than l6 utterances are entered, LED flashes red/green and further entries are ignored; control key functions continue to operate normally.
If battery volts drop below the "battery low" threshold during operation and the machine is speaking, the LED displays red and the machine automatically powers-down after the current utterance is spoken.
Under standby conditions, low battery voltage is indicated by red LED illumination for 5 seconds during which time all control and entry functions except ON/ OFF are locked out. At the end of the time-out period automatic power-down is invoked.
If the machine is left "on" for greater than 10 minutes without any entries being made, the LED flashes green for 10 seconds to warn of incipient power-down. Any entries made during this time restore normal operation. If no entry is made, the machine powers itself down.
There follows appended hereto a complete software listing in respect of the operations and external and internal control functions in respect of the embodiment abovedescribed with reference to Figs. 7 to 12 of the accompanying drawings.
Whilst there have thus been described two embodiments of the invention, it will be well appreciated by those possessed of appropriate skills that many variations, alterations and modifications, could be made without departure from the spirit and scope of the invention. The foregoing is therefore intended to be read as exemplifying the invention and not in a limitative sense.
Claims
1. A speech synthesis device unable by the vocally handicapped to provide a speech facility comprising:- a housing incorporating a selector panel and associated selector controls enabling identification by a user of one or more of a plurality of available selectable speech, utterances, and indicator means associated Vith said panel for providing an indication of selected ones of said speech utterances; a speech synthesiser circuit in said housing and capable of outputting electrical signals transformable into audible speech utterances in response to corresponding inputs thereto; a read only memory associated with said speech synthesiser circuit and having pre-stored therein data to enable said speech synthesiser circuit to output electrical signals representative of a predetermined limited speech vocabulary, said read only memory being a modular unit selectively connectable with and disconnectable from said speech synthesiser circuit whereby a requisite vocabulary repertoire may be provided by means of a plurality of such modular units; and a microcomputer interfaced with said selector panel and associated selector controls and with said speech synthesis circuit, said microcomputer being programmed such that the indentification by a user of a selected speech utterance causes the addressing of corresponding data items stored in said read only memory and the enabling of the speech synthesiser circuit to output corresponding electrical signals for transformation into speech utterances corresponding to the user selection.
2. A speech synthesis device as claimed in claim 1 wherein said selector panel comprises a matrix array of selectable cells each having associated indicator means actuable via said microcomputer in response to a corresponding user selection to illuminate a corresponding location of a translucent overlay bearing graphical representations of selectable speech utterances corresponding to data stored in the read only memory, the arrangement being such that with a plurality of such overlays and a plurality of correspondingly pre-stored read only memories the requisite vocabulary repertoire may be provided.
3. A speech synthesis device as claimed in claim
2 wherein said selector controls comprise a matrix array of manually operable switches associated with said matrix array of selectable selector panel cells.
4. A speech synthesis device as claimed in claim
3 wherein said matrix array of manually operable switches constitutes the selector panel, the said switch array comprising a plurality of row conductors mounted upon one face of a first flexible membrane, a plurality of column conductors mounted upon an opposed face of a second membrane, a spacer layer interposed between said first and second membranes and having defined therein openings whereat the row conductors traverse the column conductors whereby pressure upon said first membrane in selected areas thereof will cause a selected row conductor to contact a selected column conductor, and driver circuitry associated with said row and column conductors for causing a unique signal to be developed at an input port of the microcomputer in response to contact of a selected row con ductor with a selected column conductor consequential to selection by the user of a corresponding cell of the array.
5. A speech synthesis device as claimed in any of the preceding claims wherein said microcomputer includes a read/write memory storage facility, and said selector controls include means enabling a user selectively to -store signals representative of a plurality of serially selected speech utterances to be vocalized in a continuous speech utterance string.
6. A speech synthesis device as claimed in any of the preceding claims wherein said microcomputer comprises a microprocessor employing a memory-mapped architecture with partially decoded linear select device addressing.
7. A speech synthesis device as claimed in claim
6 wherein said microcomputer comprises a microprocessor, a program memory storage device configured as a programmable read only memory (PROM), a read/write memory storage device configured as a random access memory (RAM), and programmable peripheral interface (PPI) means configured under software control to provide communication between the microprocessor and indicator, selector panel, speech synthesiser and power control lines.
8. A speech synthesis device as claimed in claim
7 wherein the microcomputer comprises a power-on reset circuit for deriving a reset signal synchronised with system clock which is applied to reset terminals of the microprocessor and the PPI means.
9. A speech synthesis device as claimed in claim 7 or 8 wherein a RAM enable signal from the microprocessor is gated with signals (RD = read and WR = write) before being routed to the RAM so as to prevent data bus contention during write cycles when the RAM enable signal is true before a write pulse.
10. A speech synthesis device as claimed in any of claims 6 to 9 wherein said microprocessor is a Zilog Z80 device.
11. A speech synthesis device as claimed in any of the preceding claims including a software controlled T.attery power supply system arranged to derive a "battery low" logic signal in response to a detected low battery condition, the occurrence of a said "battery low" logic signal being arranged to signal an orderly automatic shut-down procedure by the microcomputer.
12. A speech synthesis device as claimed in claim 11 wherein the power supply system comprises a low volt drop saturable pass element in series with the battery for providing a regulated output voltage, the said element being controlled in dependence upon a comparison between a zener-derived reference voltage and the power supply output voltage, and a further element monitors the input-output differential voltage across the said series element for providing said "battery low" logic signal.
13. A speech synthesis device as claimed in claim 12 wherein a selectively operable on-off switching device is associated with said series pass element for determining the operation thereof to determine the on- off condition of the power supply, and circuitry is provided for feeding a keep-alive current to the reference zener when the power supply is in its off condition for facilitating orderly power-up behaviour.
14. A speech synthesis device as claimed in any of the preceding claims wherein the power supply subsystem includes a converter circuit comprising a 1:1 ferrite pulse transformer driven by a transistor in series with a current sensing resistor from a battery supply rail, the transformer primary current being monitored by an operational amplifier configured as a schmitt trigger with threshold varied by feedback from the transformer secondary circuit via paths providing both AC positive feedback and negative DC feedback, and the secodnary current being rectified and smoothed for providing the requisite output.
15. A speech synthesis device as claimed in any of the preceding claims wherein the speech synthesiser circuit provides an analogue output which is processed by a filter and amplifier before being routed to an acoustic output transducer, and wherein filter characteristics are variable by means of a Baxandall circuit to allow the intelligibility of the speech utterances to be optimised.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU73782/81A AU7378281A (en) | 1980-07-24 | 1981-07-24 | Speech synthesis aid for the vocally handicapped |
JP50009182A JPS58500085A (en) | 1981-01-19 | 1981-12-16 | Blood type determination device |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB8024236800724 | 1980-07-24 | ||
GB8024236 | 1980-07-24 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO1982000381A1 true WO1982000381A1 (en) | 1982-02-04 |
Family
ID=10515004
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/GB1981/000144 WO1982000381A1 (en) | 1980-07-24 | 1981-07-24 | Speech synthesis aid for the vocally handicapped |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP0056046A1 (en) |
JP (1) | JPS57501252A (en) |
WO (1) | WO1982000381A1 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1992012491A1 (en) * | 1990-12-28 | 1992-07-23 | Semantic Compaction System | System and method for automatically selecting among a plurality of input modes |
FR2733103A1 (en) * | 1995-04-12 | 1996-10-18 | Philips Electronics Nv | AUTORADIO RECEIVER WITH MEMORY FOR STORING PREDETERMINAL VOCABULAR ELEMENTS |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0962180A (en) * | 1995-08-28 | 1997-03-07 | Nippon Denki Ido Tsushin Kk | Speech guide device |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3400319A (en) * | 1966-06-03 | 1968-09-03 | Automatic Elect Lab | Regulated voltage converter circuit for converting a dc voltage into a higher dc voltage |
US4085302A (en) * | 1976-11-22 | 1978-04-18 | Control Data Corporation | Membrane-type touch panel |
CH602990A5 (en) * | 1977-04-05 | 1978-08-15 | Carba Sa | Communication system for handicapped person |
CA1057855A (en) * | 1976-09-14 | 1979-07-03 | Michael P. Beddoes | Generator for spelled speech and for speech |
FR2418495A1 (en) * | 1978-02-27 | 1979-09-21 | Rockwell International Corp | POWER SUPPLY OF A COMPUTING DEVICE WITH DATA PROTECTION IN THE EVENT OF A POWER INTERRUPTION |
US4215240A (en) * | 1977-11-11 | 1980-07-29 | Federal Screw Works | Portable voice system for the verbally handicapped |
DE3017517A1 (en) * | 1979-05-07 | 1980-11-13 | Texas Instruments Inc | LANGUAGE SYNTHESIS ARRANGEMENT |
-
1981
- 1981-07-24 EP EP19810902039 patent/EP0056046A1/en not_active Withdrawn
- 1981-07-24 JP JP50243981A patent/JPS57501252A/ja active Pending
- 1981-07-24 WO PCT/GB1981/000144 patent/WO1982000381A1/en not_active Application Discontinuation
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3400319A (en) * | 1966-06-03 | 1968-09-03 | Automatic Elect Lab | Regulated voltage converter circuit for converting a dc voltage into a higher dc voltage |
CA1057855A (en) * | 1976-09-14 | 1979-07-03 | Michael P. Beddoes | Generator for spelled speech and for speech |
US4085302A (en) * | 1976-11-22 | 1978-04-18 | Control Data Corporation | Membrane-type touch panel |
CH602990A5 (en) * | 1977-04-05 | 1978-08-15 | Carba Sa | Communication system for handicapped person |
US4215240A (en) * | 1977-11-11 | 1980-07-29 | Federal Screw Works | Portable voice system for the verbally handicapped |
FR2418495A1 (en) * | 1978-02-27 | 1979-09-21 | Rockwell International Corp | POWER SUPPLY OF A COMPUTING DEVICE WITH DATA PROTECTION IN THE EVENT OF A POWER INTERRUPTION |
DE3017517A1 (en) * | 1979-05-07 | 1980-11-13 | Texas Instruments Inc | LANGUAGE SYNTHESIS ARRANGEMENT |
Non-Patent Citations (7)
Title |
---|
Electronic Design, Volume 28, No. 6, March 15, 1980 (Rochelle Park, US) D.R. GUBBLE "Singel-Board Speech Synthesizer Extends Microcomputer-I/o Capability", pages 251-255, see in particular figures 1,2 * |
Electronics International, Volume 50, No. 23, November 10, 1977 (New Yort, US), "Electronic Voice System Generates Messages for Vocally Handicapped", pages 32,33 * |
Electronics International, Volume 51, No. 26, December 21, 1978 (New York, US), R.J. BOSCHERT: "Flyback Converters: Solid-State Solution to Low-Cost Switching Power Supplies", pages 100-104 see in particular figures 3,4 * |
Electronics International, Volume 53, No. 8, April 10, 1980, (New York, US), D.W. WEINRICH:, "Speech-Synthesis Chip Borrows Human Intonation", pages 113-118 see in particular figures 5,6 * |
IBM Technical Disclosure Bulletin, Volume 13 No. 11, April 1972 (New York, US) C.H. CLAASSEN: "Low-Voltage series Regulator without Auxiliary Voltage", see page 3431 * |
ICASSP80, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, April 9-11th, 1980, Volume 3, (Denver, Colorado, US), M.A. KARJALAINEN et al. "Aids for the Handicapped Based on "Synte 2" Speech Synthesizer", pages 851-854, see in particular pages 851,852: "Introduction" "Speaking Machine for the Speech Impared"; figures 1,2 * |
Medical and Biological Engineering and Computing, Volume 17, No. 1, January 1979, (Stevenage, GB), R.B. FREEDMAN et al. "Verbal Communication Aid for Nonvocal Patients", pages 103-105, see in particular page 105, left-hand column, line 25 to right-hand column, line 6 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1992012491A1 (en) * | 1990-12-28 | 1992-07-23 | Semantic Compaction System | System and method for automatically selecting among a plurality of input modes |
US5210689A (en) * | 1990-12-28 | 1993-05-11 | Semantic Compaction Systems | System and method for automatically selecting among a plurality of input modes |
FR2733103A1 (en) * | 1995-04-12 | 1996-10-18 | Philips Electronics Nv | AUTORADIO RECEIVER WITH MEMORY FOR STORING PREDETERMINAL VOCABULAR ELEMENTS |
EP0739104A1 (en) * | 1995-04-12 | 1996-10-23 | Philips Electronique Grand Public | Car radio receiver comprising a memory for storing certain elements of vocabulary |
CN1084971C (en) * | 1995-04-12 | 2002-05-15 | 曼内斯曼Vdo股份公司 | Car radio receiver comprising memory for storing predetermined vocabulary elements |
Also Published As
Publication number | Publication date |
---|---|
EP0056046A1 (en) | 1982-07-21 |
JPS57501252A (en) | 1982-07-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US4215240A (en) | Portable voice system for the verbally handicapped | |
US6405167B1 (en) | Interactive book | |
US4968255A (en) | Electronic instructional apparatus | |
US4218760A (en) | Electronic dictionary with plug-in module intelligence | |
US5813861A (en) | Talking phonics interactive learning device | |
EP0835500B1 (en) | Method and apparatus for automated learning and performance evaluation | |
EP0327408A2 (en) | Voice language translator | |
US4465465A (en) | Communication device for handicapped persons | |
US5556283A (en) | Card type of electronic learning aid/teaching apparatus | |
US20070275357A1 (en) | Electronic learning aid for teaching arithmetic skills | |
US20020133350A1 (en) | Interactive book | |
JPS59144978A (en) | portable translation device | |
US4658374A (en) | Alphabetizing Japanese words in a portable electronic language interpreter | |
CN102394015B (en) | Speech learning machine and player with bilingual model | |
JPS645347B2 (en) | ||
US4031797A (en) | Pocket-size musical chord study aid | |
US5295070A (en) | Electronic dual language dictionary | |
US3414985A (en) | Teaching apparatus | |
WO1982000381A1 (en) | Speech synthesis aid for the vocally handicapped | |
US7912723B2 (en) | Talking book | |
EP0094502A1 (en) | Electronic learning aid for assistance in speech pronunciation | |
GB2140943A (en) | Improvements relating to computers | |
JP2837729B2 (en) | Word learning device | |
CN110858455A (en) | Early education enlightenment learning machine with brand-new interactive system and suitable for infants | |
Owen | Linguistics: An Overview |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Designated state(s): AU DK FI JP NO US |
|
AL | Designated countries for regional patents |
Designated state(s): AT CH DE FR GB LU NL SE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1981902039 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 1981902039 Country of ref document: EP |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 1981902039 Country of ref document: EP |