
Conversation

@Achazwl (Collaborator) commented Feb 24, 2023

No description provided.

@liweiqing1997 commented:

The saved weights are:
encoder.layers.33.self_att.self_attention.project_v.lora.lora_A tensor([[ 5.5656e-03,  1.1871e-02,  1.4404e-02, ...,  1.3145e-02,
        -1.3046e-03, -2.7542e-03], ...]], dtype=torch.float16)
<class 'collections.OrderedDict'>

But the weight of the model is:
encoder.layers.33.self_att.self_attention.project_v.lora.lora_A Parameter containing:
Parameter(DistributedParameter([ 0.0030, 0.0088, 0.0114, ..., 0.0004, -0.0066,
-0.0021], device='cuda:0', dtype=torch.float16,
requires_grad=True))

Is it because of a type inconsistency? If so, how can I solve this?

@Achazwl (Collaborator, Author) commented Apr 17, 2023

This is not a type inconsistency; Parameter is just a wrapper around the tensor that adds some training-related information (e.g. requires_grad).
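
If it helps to convince yourself of this, a quick check along the following lines shows that the wrapper is still an ordinary tensor underneath and that the stored values can be compared directly. This is only a sketch: model and ckpt_path are placeholder names (not from this PR), and it assumes the checkpoint was produced with torch.save(model.state_dict(), ckpt_path).

    import torch

    state = torch.load(ckpt_path, map_location="cpu")
    key = "encoder.layers.33.self_att.self_attention.project_v.lora.lora_A"

    saved = state[key]                            # plain torch.Tensor stored in the OrderedDict
    loaded = dict(model.named_parameters())[key]  # Parameter (DistributedParameter) in the model

    print(type(saved), saved.shape, saved.dtype)
    print(type(loaded), loaded.shape, loaded.dtype)

    # Parameter is a Tensor subclass, so the raw values can be compared directly.
    print(isinstance(loaded, torch.Tensor))  # True

    # On a single device, where the parameter is not sharded across ranks,
    # the saved tensor and the loaded parameter should match element-wise.
    if saved.shape == loaded.shape:
        print(torch.allclose(saved.float(), loaded.detach().cpu().float()))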

@a710128 merged commit b0c4b3c into OpenBMB:main on Apr 19, 2023