查看: 33|回复: 5

换脸时出问题

[复制链接]

381

积分

31

帖子

56

符石

金丹师

Rank: 3Rank: 3Rank: 3

积分
381
发表于 昨天 16:03 | 显示全部楼层 |阅读模式
论坛购买了无敌小康的底丹,然后自己切脸训练,但在加载时弹出以下内容,不知道什么问题导致,还请各位大佬指点


使用设备: 0-NVIDIA GeForce RTX 5060 Laptop GPU (5.15gb/5.15gb)
ICE训练器: GPU 优化 RO 已关闭

装载类型:me-model: 100%|██████████████████████████| 8/8 [00:17<00:00,  2.23s/it]
装载训练样本: 100%|██████████████████████████| 151/151 [00:00<00:00, 442.53it/s]
装载训练样本: 100%|████████████████████████| 1373/1373 [00:02<00:00, 511.84it/s]


ICE 1.818 version by kingboy! QQ group:366893641
开始训练,当前迭代目标为:3000000,达到此目标数量将自动终结训练任务。

[当前时间][i:迭代数量][延迟ms]-[src损失][dst损失]
Traceback (most recent call last):
  File "D:\Tools\FaceAI-ICE1.85\_internal\python-3.8.5\lib\site-packages\tensorflow\python\client\session.py", line 1375, in _do_call
    return fn(*args)
  File "D:\Tools\FaceAI-ICE1.85\_internal\python-3.8.5\lib\site-packages\tensorflow\python\client\session.py", line 1359, in _run_fn
    return self._call_tf_sessionrun(options, feed_dict, fetch_list,
  File "D:\Tools\FaceAI-ICE1.85\_internal\python-3.8.5\lib\site-packages\tensorflow\python\client\session.py", line 1451, in _call_tf_sessionrun
    return tf_session.TF_SessionRun_wrapper(self._session, options, feed_dict,
tensorflow.python.framework.errors_impl.ResourceExhaustedError: 2 root error(s) found.
  (0) Resource exhausted: OOM when allocating tensor with shape[640,640,3,3] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc
         [[{{node Conv2D_46}}]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.

         [[gradients_1/Reshape_74_grad/Reshape/_521]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.

  (1) Resource exhausted: OOM when allocating tensor with shape[640,640,3,3] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc
         [[{{node Conv2D_46}}]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.

0 successful operations.
0 derived errors ignored.

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "models_ice\IceModel_Train.py", line 174, in models_ice.IceModel_Train.trainerThread
  File "models_ice\IceModel_Class.py", line 1025, in models_ice.IceModel_Class.IceModel_Class.train_one_iter
  File "models_ice\Ice_MULAN\Model.py", line 1580, in models_ice.Ice_MULAN.Model.MULANModel.onTrainOneIter
  File "models_ice\Ice_MULAN\Model.py", line 1269, in models_ice.Ice_MULAN.Model.MULANModel.on_init_model.train_sd
  File "D:\Tools\FaceAI-ICE1.85\_internal\python-3.8.5\lib\site-packages\tensorflow\python\client\session.py", line 967, in run
    result = self._run(None, fetches, feed_dict, options_ptr,
  File "D:\Tools\FaceAI-ICE1.85\_internal\python-3.8.5\lib\site-packages\tensorflow\python\client\session.py", line 1190, in _run
    results = self._do_run(handle, final_targets, final_fetches,
  File "D:\Tools\FaceAI-ICE1.85\_internal\python-3.8.5\lib\site-packages\tensorflow\python\client\session.py", line 1368, in _do_run
    return self._do_call(_run_fn, feeds, fetches, targets, options,
  File "D:\Tools\FaceAI-ICE1.85\_internal\python-3.8.5\lib\site-packages\tensorflow\python\client\session.py", line 1394, in _do_call
    raise type(e)(node_def, op, message)
tensorflow.python.framework.errors_impl.ResourceExhaustedError: 2 root error(s) found.
  (0) Resource exhausted: OOM when allocating tensor with shape[640,640,3,3] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc
         [[node Conv2D_46 (defined at \threading.py:870) ]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.

         [[gradients_1/Reshape_74_grad/Reshape/_521]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.

  (1) Resource exhausted: OOM when allocating tensor with shape[640,640,3,3] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc
         [[node Conv2D_46 (defined at \threading.py:870) ]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.

0 successful operations.
0 derived errors ignored.

Original stack trace for 'Conv2D_46':
  File "\threading.py", line 890, in _bootstrap
    self._bootstrap_inner()
  File "\threading.py", line 932, in _bootstrap_inner
    self.run()
  File "\threading.py", line 870, in run
    self._target(*self._args, **self._kwargs)
  File "\site-packages\tensorflow\python\util\dispatch.py", line 206, in wrapper
    return target(*args, **kwargs)
  File "\site-packages\tensorflow\python\ops\nn_ops.py", line 2269, in conv2d
    return gen_nn_ops.conv2d(
  File "\site-packages\tensorflow\python\ops\gen_nn_ops.py", line 968, in conv2d
    _, _, _op, _outputs = _op_def_library._apply_op_helper(
  File "\site-packages\tensorflow\python\framework\op_def_library.py", line 748, in _apply_op_helper
    op = g._create_op_internal(op_type_name, inputs, dtypes=None,
  File "\site-packages\tensorflow\python\framework\ops.py", line 3557, in _create_op_internal
    ret = Operation(
  File "\site-packages\tensorflow\python\framework\ops.py", line 2045, in __init__
    self._traceback = tf_stack.extract_stack_for_node(self._c_op)

Zhatv换脸论坛免责声明
全站默认解压密码:zhatv.cn
【Zhatv】论坛里的文章仅代表作者本人的观点,与本网站立场无关。
所有文章、内容、信息、资料,都不保证其准确性、完整性、有效性、时效性,请依据情况自身做出判断。
因阅读本站内容而被误导等其他因素所造成的损失责任自负,【Zhatv】不承担任何责任。

381

积分

31

帖子

56

符石

金丹师

Rank: 3Rank: 3Rank: 3

积分
381
 楼主| 发表于 昨天 16:08 | 显示全部楼层
我这个结果是关闭了R0优化器的,如果开了优化器的话就是弹出开始训练后长时间没任何反应,之前预训练也是这样,使用V7模型就可以训练,但这个底丹是V71的,我看论坛大部分都是V71的,也不知道什么情况导致的
回复

使用道具 举报

2675

积分

93

帖子

1604

符石

化神丹师

Rank: 5

积分
2675

最佳新人热心会员咸鱼勋章

发表于 昨天 16:13 | 显示全部楼层
本帖最后由 wtxx8888 于 2025-9-17 16:17 编辑

00M 炸显存了。
模型目前的参数,你的显卡 带不动。

最简单的,减低BS的数值,去尝试。
或者关闭,其他的 显存占用项目。
参数的各功能,最基础的i教程里 就有。

评分

参与人数 1金钱 +5 贡献 +10 符石 +10 收起 理由
奸商 + 5 + 10 + 10 助人为乐!

查看全部评分

回复

使用道具 举报

381

积分

31

帖子

56

符石

金丹师

Rank: 3Rank: 3Rank: 3

积分
381
 楼主| 发表于 昨天 16:17 | 显示全部楼层
wtxx8888 发表于 2025-9-17 16:13
00M 炸显存了。
模型目前的参数,你的显卡 带不动。

好的,谢谢,全是英文看不明白,麻烦了
[发帖际遇]: smdongxi1 偷丞相的丹被发现 金钱 降了 2 . 幸运榜 / 衰神榜
回复

使用道具 举报

2675

积分

93

帖子

1604

符石

化神丹师

Rank: 5

积分
2675

最佳新人热心会员咸鱼勋章

发表于 昨天 16:20 | 显示全部楼层
本帖最后由 wtxx8888 于 2025-9-17 16:22 编辑
smdongxi1 发表于 2025-9-17 16:17
好的,谢谢,全是英文看不明白,麻烦了

看主要的标记,谁去看 一吐鲁的英文?  
00M 就三字符。故障贴里面  就有别人的经验。。。
挂机时没事 就多看贴,不要闭门造车
回复

使用道具 举报

7291

积分

540

帖子

1万

符石

太乙金仙

Rank: 10Rank: 10

积分
7291

灌水之王论坛元老咸鱼勋章

发表于 昨天 16:31 | 显示全部楼层
@wtxx8888 大佬已经说了,有很明显的OOM

另外,50系第一次启动训练会卡很长时间才开始训练
通用直播丹代练:QQ1453174
回复

使用道具 举报

小黑屋|ZhaTV ( 滇ICP备15003127号-4 ) |网站地图

GMT+8, 2025-9-18 06:28

Powered by Zhatv.cn

© 2022-2023

快速回复 返回顶部 返回列表