I tried some of the web-ai-demos on https://chrome.dev/, such as https://chrome.dev/web-ai-demos/perf-client-side-gemma-worker/
Some demos say that the model will take about 30 seconds or 1 minute to load. This took much longer: it turned out the demo was downloading a model of more than 1 GB, which eventually took 15 minutes to complete.
Please add a warning message as per https://web.dev/articles/client-side-ai-performance#signal_large_downloads guidelines.
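
To illustrate the kind of warning I mean, here is a minimal sketch. The helper names `formatSize` and `downloadWarning` are hypothetical, and I am assuming the demo knows the model size in bytes before it starts downloading; this is not taken from the demo's code.

```typescript
// Hypothetical helper: turn a byte count into a short human-readable size.
function formatSize(bytes: number): string {
  const units = ["B", "KB", "MB", "GB"];
  let value = bytes;
  let i = 0;
  // Divide by 1024 until the value fits the current unit or units run out.
  while (value >= 1024 && i < units.length - 1) {
    value /= 1024;
    i++;
  }
  // Whole numbers for plain bytes, one decimal place otherwise.
  return `${value.toFixed(i === 0 ? 0 : 1)} ${units[i]}`;
}

// Hypothetical helper: build the text to show before the download starts.
function downloadWarning(modelBytes: number): string {
  return (
    `This demo downloads a model of about ${formatSize(modelBytes)}. ` +
    `On a slow connection this can take several minutes.`
  );
}
```

The demo could show this text next to the "load model" control, so that people on slow or metered connections can decide before the download begins.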
From the looks of it, the model doesn’t get cached on disk properly, so people end up downloading the model over and over again.
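
I have not looked at how the demo stores the model, so the following is only a sketch of the behaviour I would expect: check a persistent store first and download only on a miss. `ModelStore` and `loadModel` are hypothetical names; in a browser the store could be backed by something like the Cache API, but that part is left abstract here.

```typescript
// Hypothetical minimal storage interface; the real backing store is an
// assumption and is intentionally not specified in this sketch.
interface ModelStore {
  get(key: string): Promise<Uint8Array | undefined>;
  set(key: string, bytes: Uint8Array): Promise<void>;
}

// Return the model bytes, downloading only when the store has no copy yet.
async function loadModel(
  url: string,
  store: ModelStore,
  download: (url: string) => Promise<Uint8Array>,
): Promise<Uint8Array> {
  const cached = await store.get(url);
  if (cached !== undefined) {
    return cached; // cache hit: no network traffic
  }
  const bytes = await download(url);
  await store.set(url, bytes); // persist for the next visit
  return bytes;
}
```

With this shape, a second visit should resolve from the store without calling `download` again, which is what does not seem to happen at the moment.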