
ValueError: invalid literal for int() with base 10: 'blocks' #431

@zhangxy16-gbits

Description

I used RWKV-PEFT to train RWKV v6 models at two sizes, 1.6B and 3B.
The training parameters for the 1.6B model are as follows:

[Image: screenshot of the training parameters]

After training finished, I merged the weights. Loading the merged model in RWKV Runner fails with the following error:
Strategy Devices: {'cuda'}
state cache enabled
D:\rwkv\py310\Lib\site-packages\_distutils_hack\__init__.py:11: UserWarning: Distutils was imported before Setuptools, but importing Setuptools also replaces the distutils module in sys.modules. This may lead to undesirable behaviors or errors. To avoid these issues, avoid using distutils directly, ensure that setuptools is installed in the traditional way (e.g. not an editable install), and/or make sure that setuptools is always imported before distutils.
warnings.warn(
D:\rwkv\py310\Lib\site-packages\_distutils_hack\__init__.py:26: UserWarning: Setuptools is replacing distutils.
warnings.warn("Setuptools is replacing distutils.")
RWKV_JIT_ON 1 RWKV_CUDA_ON 1 RESCALE_LAYER 6

Loading D:\rwkv\models\v6lora1B6.pth ...
invalid literal for int() with base 10: 'blocks'
Traceback (most recent call last):
File "D:\rwkv\backend-python\routes\config.py", line 80, in switch_model
RWKV(
File "D:\rwkv\backend-python\utils\rwkv.py", line 716, in RWKV
model = Model(model_path, strategy)
File "D:\rwkv\py310\Lib\site-packages\torch\jit\_script.py", line 309, in init_then_script
original_init(self, *args, **kwargs)
File "D:\rwkv\backend-python\rwkv_pip\model.py", line 1078, in __init__
layer_id = int(x.split(".")[1]) if ("blocks." in x) else 0
ValueError: invalid literal for int() with base 10: 'blocks'

INFO: 127.0.0.1:1736 - "POST /switch-model HTTP/1.1" 500 Internal Server Error

What is causing this?
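
One way to narrow this down, going only by the failing line in the traceback: the loader takes the second dot-separated field of every key containing `blocks.` and converts it to an integer, so it expects names shaped like `blocks.<N>...`. The error therefore suggests that at least one key in the merged file contains `blocks.` but has the literal text `blocks` in that second position, for example because of an extra leading prefix. Whether that is what happened here is an assumption, not something the log confirms. Below is a minimal sketch that mirrors that check; the helper name `find_unparseable_keys` and the example key names are hypothetical.

```python
def find_unparseable_keys(keys):
    """Return keys that would trip the int() conversion shown in the traceback.

    Mirrors the failing line: for every key containing "blocks.", the second
    dot-separated field must be convertible to an integer layer id.
    """
    bad = []
    for key in keys:
        if "blocks." in key:
            try:
                int(key.split(".")[1])
            except ValueError:
                bad.append(key)
    return bad


# Hypothetical key names for illustration only; the real names depend on
# how the checkpoint was trained and merged.
example_keys = [
    "emb.weight",
    "blocks.0.ln1.weight",
    "model.blocks.0.ln1.weight",  # extra prefix: split(".")[1] is "blocks"
]

print(find_unparseable_keys(example_keys))
```

With the real file, the key list could come from something like `torch.load(path, map_location="cpu").keys()`, assuming the merged checkpoint is a flat state dict. Any key the helper reports shows which naming pattern the loader cannot parse.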
