这是indexloc提供的服务,不要输入任何密码
Skip to content

Gemini does not detect video or voice. #108

@Toltally-suck-at-code

Description

@Toltally-suck-at-code

Description of the bug:

Hi there!
When running this app, whenever I turn on my mic and my webcam, it doesn't respond anything to me, I checked the logs, and see that it's repeatedly sending video data and preventing from the actual API to process

Here's the log in the app provides

8:26:51client.openConnected
8:26:51server.sendsetupComplete
8:26:55client.realtimeInputaudio39
8:26:55client.realtimeInputvideo
8:26:57client.realtimeInputaudio31
8:26:57client.realtimeInputvideo
8:26:59client.realtimeInputaudio29
8:26:59client.realtimeInputvideo
8:27:07client.realtimeInputaudio248
8:27:07client.realtimeInputvideo
8:27:08client.realtimeInputaudio45

When turning off the webcam and the mic, and only using pure text, it responds normally, but seems to hallucinate and tells me that it hears me and sees my video feed???

Secondly, when using the direct SDK provided by Google Studio AI, sometimes it seems to also hallucinate and pretends that it doesn't hear me and doesn't see my webcam (even though I am speaking to it and streaming via OpenCV) (PS The second issue, I used the latest 2.5 native thinking audio model, when I switch back to the older 2.0 Version, it stops hallucinating. And yes, I tried changing API keys and restarting to try to "reset" the chatlogs (Or tried to since it still remember what I have said earlier???)

Actual vs expected behavior:

No response

Any other information you'd like to share?

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions