+
Skip to content

CI: wsl machine flakes #26547

Open
Open
@Luap99

Description

@Luap99

Flakes are so bad that each PR needs like at least like 3 or more retires until the wsl task passes.
One thing we should do is to add WSL logging to see if anything shows up there, I started here #25882 so someone should finish that up

There are like two different symptoms, total CI task timeout after 60 mins (likely should lower that again because the task time is much faster), in this case most commands just seem to timeout after 10mins.

https://cirrus-ci.com/task/5839274018340864

  C:\Users\Administrator\AppData\Local\cirrus-ci-build\repo\bin\windows\podman.exe machine init --disk-size 11 --image Z:\podman-machine.x86_64.wsl.tar 7d5770bf3d96
  [FAILED] Timed out after 600.000s.
  command timed out after 600.000000s: [C:\Users\Administrator\AppData\Local\cirrus-ci-build\repo\bin\windows\podman.exe machine init --disk-size 11 --image Z:\podman-machine.x86_64.wsl.tar 7d5770bf3d96]
  Expected process to exit.  It did not.
  C:\Users\Administrator\AppData\Local\cirrus-ci-build\repo\bin\windows\podman.exe machine rm --force fd408f939745
  [FAILED] Timed out after 600.000s.
  command timed out after 600.000000s: [C:\Users\Administrator\AppData\Local\cirrus-ci-build\repo\bin\windows\podman.exe machine rm --force fd408f939745]
  Expected process to exit.  It did not.

And then there is a "fast" fail where at some point most tasks just keep failing with WSL connection errors
https://cirrus-ci.com/task/4649069861142528
https://api.cirrus-ci.com/v1/artifact/task/4649069861142528/html/machine-wsl-podman-windows-rootless-host-sqlite.log.html#t--run-basic-podman-commands-Volume-ops--1

C> podman.exe machine init --disk-size 11 --image Z:\podman-machine.x86_64.wsl.tar --now ef245288215b
  Importing operating system into WSL (this may take a few minutes on a new WSL install)...
  The operation completed successfully. 

  Configuring system...
  Machine init complete
  Starting machine "ef245288215b"
  The operation timed out because a response was not received from the virtual machine or container. 

  Error code: Wsl/Service/CreateInstance/HCS_E_CONNECTION_TIMEOUT

  Error: the WSL bootstrap script failed: command wsl [-u root -d podman-ef245288215b /root/bootstrap] failed: exit status 0xffffffff
C> podman.exe machine init --disk-size 11 --image Z:\podman-machine.x86_64.wsl.tar --now a2b3385349e9
  Importing operating system into WSL (this may take a few minutes on a new WSL install)...
  Catastrophic failure 

  Error code: Wsl/Service/RegisterDistro/E_UNEXPECTED

  Error: the WSL import of guest OS failed: command wsl [--import podman-a2b3385349e9 Z:\podman_test1271291991\.local\share\containers\podman\machine\wsl\wsldist\a2b3385349e9 Z:\podman_test1271291991\.local\share\containers\podman\machine\wsl\a2b3385349e9-amd64 --version 2] failed: exit status 0xffffffff
  C> podman.exe machine init --disk-size 11 --image Z:\podman-machine.x86_64.wsl.tar --now fe0e4d84c6a5
  Importing operating system into WSL (this may take a few minutes on a new WSL install)...
  A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond. 

  Error code: Wsl/Service/RegisterDistro/0x8007274c

  Error: the WSL import of guest OS failed: command wsl [--import podman-fe0e4d84c6a5 Z:\podman_test1181544560\.local\share\containers\podman\machine\wsl\wsldist\fe0e4d84c6a5 Z:\podman_test1181544560\.local\share\containers\podman\machine\wsl\fe0e4d84c6a5-amd64 --version 2] failed: exit status 0xffffffff

Metadata

Metadata

Assignees

No one assigned

    Labels

    flakesFlakes from Continuous Integrationmachinewindowsissue/bug on Windows

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions

      点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载