Description
Flakes are so bad that each PR needs like at least like 3 or more retires until the wsl task passes.
One thing we should do is to add WSL logging to see if anything shows up there, I started here #25882 so someone should finish that up
There are like two different symptoms, total CI task timeout after 60 mins (likely should lower that again because the task time is much faster), in this case most commands just seem to timeout after 10mins.
https://cirrus-ci.com/task/5839274018340864
C:\Users\Administrator\AppData\Local\cirrus-ci-build\repo\bin\windows\podman.exe machine init --disk-size 11 --image Z:\podman-machine.x86_64.wsl.tar 7d5770bf3d96
[FAILED] Timed out after 600.000s.
command timed out after 600.000000s: [C:\Users\Administrator\AppData\Local\cirrus-ci-build\repo\bin\windows\podman.exe machine init --disk-size 11 --image Z:\podman-machine.x86_64.wsl.tar 7d5770bf3d96]
Expected process to exit. It did not.
C:\Users\Administrator\AppData\Local\cirrus-ci-build\repo\bin\windows\podman.exe machine rm --force fd408f939745
[FAILED] Timed out after 600.000s.
command timed out after 600.000000s: [C:\Users\Administrator\AppData\Local\cirrus-ci-build\repo\bin\windows\podman.exe machine rm --force fd408f939745]
Expected process to exit. It did not.
And then there is a "fast" fail where at some point most tasks just keep failing with WSL connection errors
https://cirrus-ci.com/task/4649069861142528
https://api.cirrus-ci.com/v1/artifact/task/4649069861142528/html/machine-wsl-podman-windows-rootless-host-sqlite.log.html#t--run-basic-podman-commands-Volume-ops--1
C> podman.exe machine init --disk-size 11 --image Z:\podman-machine.x86_64.wsl.tar --now ef245288215b
Importing operating system into WSL (this may take a few minutes on a new WSL install)...
The operation completed successfully.
Configuring system...
Machine init complete
Starting machine "ef245288215b"
The operation timed out because a response was not received from the virtual machine or container.
Error code: Wsl/Service/CreateInstance/HCS_E_CONNECTION_TIMEOUT
Error: the WSL bootstrap script failed: command wsl [-u root -d podman-ef245288215b /root/bootstrap] failed: exit status 0xffffffff
C> podman.exe machine init --disk-size 11 --image Z:\podman-machine.x86_64.wsl.tar --now a2b3385349e9
Importing operating system into WSL (this may take a few minutes on a new WSL install)...
Catastrophic failure
Error code: Wsl/Service/RegisterDistro/E_UNEXPECTED
Error: the WSL import of guest OS failed: command wsl [--import podman-a2b3385349e9 Z:\podman_test1271291991\.local\share\containers\podman\machine\wsl\wsldist\a2b3385349e9 Z:\podman_test1271291991\.local\share\containers\podman\machine\wsl\a2b3385349e9-amd64 --version 2] failed: exit status 0xffffffff
C> podman.exe machine init --disk-size 11 --image Z:\podman-machine.x86_64.wsl.tar --now fe0e4d84c6a5
Importing operating system into WSL (this may take a few minutes on a new WSL install)...
A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
Error code: Wsl/Service/RegisterDistro/0x8007274c
Error: the WSL import of guest OS failed: command wsl [--import podman-fe0e4d84c6a5 Z:\podman_test1181544560\.local\share\containers\podman\machine\wsl\wsldist\fe0e4d84c6a5 Z:\podman_test1181544560\.local\share\containers\podman\machine\wsl\fe0e4d84c6a5-amd64 --version 2] failed: exit status 0xffffffff