model,tokenizer,subword tokenization(huggingface)

Posted Jul 25, 2023 Updated Mar 16, 2024

By 1 min read

Model

*Config这个类，用于给出某个模型的网络结构，通过config来加载模型，得到的就是一个模型的架子，没有预训练的权重。

  
from transformers import BertModel, BertConfig

config = BertConfig()
model = BertModel(config)  # 模型是根据config来构建的，这时构建的模型是参数随机初始化的

更常用的做法则是直接加载预训练模型，然后微调。

  
from transformers import BertModel

model = BertModel.from_pretrained('bert-base-cased')

模型的保存：

  
model.save_pretrained("directory_on_my_computer")# 会在当前目录下创建名为directory_on_my_computer的文件夹，保存预训练模型

This post is licensed under CC BY 4.0 by the author.