Support external transcription providers #909

timothycarambat · 2024-03-14T22:30:43Z

Pull Request Type

Relevant Issues

resolves #850
resolves #849

What is in this change?

Supports OpenAI whisper along-side as an additional configuration option to allow quicker transcription of files (mp3, mp4, etc) - 25MB LIMIT!

The localWhisper model still can run if needed but it often crashes instances when on underpowered devices with restricted RAM or CPU. OpenAI is a quick way to resolve this and satisfy the user need.

Additional Information

Developer Validations

I ran yarn lint from the root of the repo & committed changes
Relevant documentation has been updated
I have tested my code functionality
Docker build succeeds locally

timothycarambat · 2024-03-14T22:43:23Z

This sets up the ability for us to add speech-to-text chatting with workspace 👍

AIbottesting · 2024-05-10T18:33:20Z

Anything-LLM is awesome! Thank you and your team for all your hard work and passion. Because I am visually impaired, my only feature request is a text-to-speech button to read AI output. I currently copy and paste every single time into eSpeak which uses Microsoft Windows 10 built-in text-to-speech engine. This is my humble wish. However, if I could dream, being able to press a button for speech-to-text and be able to have a two-way conversation would be out of this world. Furthermore, could you imagine being able to scan a paper receipt and Anything-LLM puts the data into an Excel file for you! Wow

timothycarambat · 2024-05-10T18:38:28Z

@AIbottesting thank you for raising this. I think we can easily support TTS using the built-in browser TTS. I am sorry we overlooked that kind of accessibility feature. Please let me know if you have any further accessibility issues and we will try to get those handled

AIbottesting · 2024-05-11T00:02:30Z

Thank you for being kind and no need to be sorry. On a side note, the government (Department of Rehabilitation) spends money on my behalf for a software called Kurzweil 3000 to read my college texts. I think this software is used by many disabled people at their jobs. I also believe software companies go through some kind of accessibility certification which may help your business side to be able to claim you are compliant. Just a thought. Keep on kicking butt as you do.

* Support External Transcription providers * patch files * update docs * fix return data

timothycarambat added 4 commits March 14, 2024 15:27

Support External Transcription providers

2baaf4d

patch files

3cb8aa8

update docs

d86028f

fix return data

96f2a04

timothycarambat merged commit 0ada882 into master Mar 14, 2024

timothycarambat deleted the 850-external-transcription-providers branch March 14, 2024 22:43

timothycarambat mentioned this pull request May 11, 2024

[FEAT]: Accessibility TTS on messages #1353

Closed

cabwds pushed a commit to cabwds/anything-llm that referenced this pull request Jul 3, 2025

Support external transcription providers (Mintplex-Labs#909)

dbb055c

* Support External Transcription providers * patch files * update docs * fix return data

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Support external transcription providers #909

Support external transcription providers #909

Uh oh!

timothycarambat commented Mar 14, 2024

Uh oh!

timothycarambat commented Mar 14, 2024

Uh oh!

AIbottesting commented May 10, 2024

Uh oh!

timothycarambat commented May 10, 2024

Uh oh!

AIbottesting commented May 11, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Support external transcription providers #909

Support external transcription providers #909

Uh oh!

Conversation

timothycarambat commented Mar 14, 2024

Pull Request Type

Relevant Issues

What is in this change?

Additional Information

Developer Validations

Uh oh!

timothycarambat commented Mar 14, 2024

Uh oh!

AIbottesting commented May 10, 2024

Uh oh!

timothycarambat commented May 10, 2024

Uh oh!

AIbottesting commented May 11, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants