pytorch-day08

Created 2025-09-17 | Updated 2025-09-18

Linear regression and a DNN are used as examples for the demonstration.

First, create the training dataset by hand.

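import numpy as np
import torch

n = 400  # number of samples
X = 10*torch.rand([n,2])-5.0  # features sampled uniformly from [-5, 5)
w0 = torch.tensor([[2.0,3.0]])  # true weights, shape [1, 2]
b0 = torch.tensor([[10.0]]) # Why create a 2-D tensor here?
Y = X@w0.t() + b0 + torch.normal(0.0,2.0,size=[n,1])  # w0 is [1, 2], so transpose it before the matmul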
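One plausible answer to the question in the comment, offered as a sketch rather than a definitive reason: keeping every tensor 2-D makes the shapes explicit, so the prediction and the noise are both [n, 1]. With a 1-D weight vector (the hypothetical `w_flat` below), the matmul returns shape [n], and adding the [n, 1] noise silently broadcasts the result to [n, n].

```python
import torch

n = 400
X = 10 * torch.rand([n, 2]) - 5.0
noise = torch.normal(0.0, 2.0, size=[n, 1])

# 2-D parameters: every intermediate result keeps the shape [n, 1]
w0 = torch.tensor([[2.0, 3.0]])    # shape [1, 2]
b0 = torch.tensor([[10.0]])        # shape [1, 1]
Y = X @ w0.t() + b0 + noise
print(Y.shape)

# 1-D weights (hypothetical variant): the matmul drops the last dimension
w_flat = torch.tensor([2.0, 3.0])  # shape [2]
Y_wrong = X @ w_flat + noise       # [n] + [n, 1] broadcasts to [n, n]
print(Y_wrong.shape)
```

The second shape is not an error as far as PyTorch is concerned, which is what makes this kind of mistake easy to miss.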

Create the data pipeline

def data_iter(features, labels, batch_size=8):
    num_examples = len(features)
    indices = list(range(num_examples))
    np.random.shuffle(indices)
    for i in range(0,num_examples,batch_size):
        indexs = torch.LongTensor(indices[i:min(i + batch_size,num_examples)])
        yield features.index_select(0, indexs), labels.index_select(0,indexs)
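A quick way to check the generator is to pull a single batch and count how many batches one pass produces. The sketch below repeats `data_iter` so it runs on its own; `X_demo` and `Y_demo` are stand-in tensors introduced here only for illustration.

```python
import numpy as np
import torch

def data_iter(features, labels, batch_size=8):
    # same logic as the generator above, repeated so this block is self-contained
    num_examples = len(features)
    indices = list(range(num_examples))
    np.random.shuffle(indices)
    for i in range(0, num_examples, batch_size):
        idx = torch.tensor(indices[i:i + batch_size], dtype=torch.long)
        yield features.index_select(0, idx), labels.index_select(0, idx)

# stand-in data with the same shapes as X and Y
X_demo = torch.rand([400, 2])
Y_demo = torch.rand([400, 1])

# take the first batch of a fresh pass over the data
batch_features, batch_labels = next(data_iter(X_demo, Y_demo, batch_size=8))
print(batch_features.shape, batch_labels.shape)

# one full pass yields ceil(400 / 8) batches
num_batches = sum(1 for _ in data_iter(X_demo, Y_demo, batch_size=8))
print(num_batches)
```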

Low-level API

PyTorch's low-level API mainly covers tensor operations, the computation graph, and automatic differentiation.

# define the model
class LinearRegression:
    def __init__(self):
        self.w = torch.randn_like(w0,requires_grad = True)
        self.b = torch.zeros_like(b0,requires_grad = True)
    def forward(self,x):
        # self.w has shape [1, 2], so transpose it to map [batch, 2] -> [batch, 1]
        return x@self.w.t() + self.b
    def loss_func(self,y_pred,y_true):
        return torch.mean((y_pred - y_true)**2/2)
# train the model
def train_step(model, features, labels):
    predictions = model.forward(features)
    loss = model.loss_func(predictions, labels)
    loss.backward()
    with torch.no_grad():
        # plain gradient descent with a learning rate of 0.001
        model.w -= 0.001*model.w.grad
        model.b -= 0.001*model.b.grad

        # reset the gradients so they do not accumulate across steps
        model.w.grad.zero_()
        model.b.grad.zero_()

    return loss
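The post stops at `train_step`. A minimal sketch of how the pieces could be driven for a few epochs is shown below. It repeats the data, generator, and model so it runs on its own; the epoch count, the random seeds, and the `lr` parameter on `train_step` are assumptions added for illustration.

```python
import numpy as np
import torch

torch.manual_seed(0)
np.random.seed(0)

# synthetic data, same recipe as at the top of the post
n = 400
X = 10 * torch.rand([n, 2]) - 5.0
w0 = torch.tensor([[2.0, 3.0]])
b0 = torch.tensor([[10.0]])
Y = X @ w0.t() + b0 + torch.normal(0.0, 2.0, size=[n, 1])

def data_iter(features, labels, batch_size=8):
    indices = list(range(len(features)))
    np.random.shuffle(indices)
    for i in range(0, len(indices), batch_size):
        idx = torch.tensor(indices[i:i + batch_size], dtype=torch.long)
        yield features.index_select(0, idx), labels.index_select(0, idx)

class LinearRegression:
    def __init__(self):
        self.w = torch.randn_like(w0, requires_grad=True)
        self.b = torch.zeros_like(b0, requires_grad=True)

    def forward(self, x):
        return x @ self.w.t() + self.b

    def loss_func(self, y_pred, y_true):
        return torch.mean((y_pred - y_true) ** 2 / 2)

def train_step(model, features, labels, lr=0.001):
    loss = model.loss_func(model.forward(features), labels)
    loss.backward()
    with torch.no_grad():
        model.w -= lr * model.w.grad
        model.b -= lr * model.b.grad
        model.w.grad.zero_()
        model.b.grad.zero_()
    return loss

model = LinearRegression()
epoch_losses = []
for epoch in range(20):  # the epoch count is an arbitrary choice
    batch_losses = [train_step(model, feats, labs).item()
                    for feats, labs in data_iter(X, Y)]
    epoch_losses.append(sum(batch_losses) / len(batch_losses))

print(f"first epoch: {epoch_losses[0]:.3f}, last epoch: {epoch_losses[-1]:.3f}")
print("w =", model.w.data, "b =", model.b.data)
```

With a learning rate this small, the bias tends to approach its target more slowly than the weights, so more epochs or a larger `lr` may be needed for a close fit.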

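Mid-level API

Includes model layers, loss functions, optimizers, data pipelines, and so on.

In general, this is the API level you will use; the low-level API only becomes necessary when a model needs custom layers.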

import torch
from torch import nn
from torch.utils.data import TensorDataset, DataLoader

# load the data
ds = TensorDataset(X,Y)
dl = DataLoader(ds,batch_size = 10,shuffle = True,num_workers = 2)
# define the model
model = nn.Linear(2,1)
model.loss_fn = nn.MSELoss()
model.optimizer = torch.optim.SGD(model.parameters(),lr =0.01)
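The snippet above defines the model, loss, and optimizer but not the loop that uses them. The following is a minimal sketch of such a loop, not a definitive implementation. It keeps the post's convention of attaching `loss_fn` and `optimizer` to the model. The epoch count, the seed, and `num_workers=0` (chosen so the block runs as a plain script on any platform) are assumptions.

```python
import torch
from torch import nn
from torch.utils.data import TensorDataset, DataLoader

torch.manual_seed(0)

# synthetic data, same recipe as at the top of the post
n = 400
X = 10 * torch.rand([n, 2]) - 5.0
w0 = torch.tensor([[2.0, 3.0]])
b0 = torch.tensor([[10.0]])
Y = X @ w0.t() + b0 + torch.normal(0.0, 2.0, size=[n, 1])

dl = DataLoader(TensorDataset(X, Y), batch_size=10, shuffle=True, num_workers=0)

model = nn.Linear(2, 1)
model.loss_fn = nn.MSELoss()
model.optimizer = torch.optim.SGD(model.parameters(), lr=0.01)

def train_step(model, features, labels):
    model.optimizer.zero_grad()            # clear gradients from the previous step
    loss = model.loss_fn(model(features), labels)
    loss.backward()                        # autograd fills in .grad for each parameter
    model.optimizer.step()                 # SGD update
    return loss.item()

epoch_losses = []
for epoch in range(20):  # the epoch count is an arbitrary choice
    losses = [train_step(model, feats, labs) for feats, labs in dl]
    epoch_losses.append(sum(losses) / len(losses))

print(f"first epoch: {epoch_losses[0]:.3f}, last epoch: {epoch_losses[-1]:.3f}")
print("weight =", model.weight.data, "bias =", model.bias.data)
```

Compared with the low-level version, the manual parameter updates and gradient resets are replaced by `optimizer.step()` and `optimizer.zero_grad()`.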

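High-level API

For each model you can define your own API wrapper, so this level varies from person to person.
In general, common operations such as summary and eval can be wrapped as API calls.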
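As one possible shape for such a wrapper, here is a sketch under stated assumptions. The class name `SimpleTrainer` and its methods `fit`, `evaluate`, and `summary` are hypothetical names invented for this example; they are not part of PyTorch or any particular library.

```python
import torch
from torch import nn
from torch.utils.data import TensorDataset, DataLoader

class SimpleTrainer:
    """Hypothetical wrapper bundling a model with its loss and optimizer."""

    def __init__(self, model, loss_fn, optimizer):
        self.model = model
        self.loss_fn = loss_fn
        self.optimizer = optimizer

    def fit(self, dataloader, epochs=1):
        """Train for `epochs` passes and return the mean loss of each epoch."""
        history = []
        self.model.train()
        for _ in range(epochs):
            total, count = 0.0, 0
            for features, labels in dataloader:
                self.optimizer.zero_grad()
                loss = self.loss_fn(self.model(features), labels)
                loss.backward()
                self.optimizer.step()
                total += loss.item() * len(features)
                count += len(features)
            history.append(total / count)
        return history

    @torch.no_grad()
    def evaluate(self, dataloader):
        """Return the mean loss over a dataloader without updating the model."""
        self.model.eval()
        total, count = 0.0, 0
        for features, labels in dataloader:
            total += self.loss_fn(self.model(features), labels).item() * len(features)
            count += len(features)
        return total / count

    def summary(self):
        """Return the number of trainable parameters."""
        return sum(p.numel() for p in self.model.parameters() if p.requires_grad)

# usage on the same kind of synthetic data as in the post
torch.manual_seed(0)
X = 10 * torch.rand([400, 2]) - 5.0
Y = X @ torch.tensor([[2.0, 3.0]]).t() + 10.0 + torch.normal(0.0, 2.0, size=[400, 1])
dl = DataLoader(TensorDataset(X, Y), batch_size=10, shuffle=True)

net = nn.Linear(2, 1)
trainer = SimpleTrainer(net, nn.MSELoss(), torch.optim.SGD(net.parameters(), lr=0.01))
history = trainer.fit(dl, epochs=5)
eval_loss = trainer.evaluate(dl)
print(trainer.summary(), history[-1], eval_loss)
```

The design choice here is only to keep the training loop out of user code; which operations deserve a method is a matter of taste.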

Something else

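What is `yield`

`yield` turns an ordinary function into a "generator function": calling it does not run the body to completion but returns a generator object.
Each time execution reaches a `yield`, the function produces a value and suspends its state; the next iteration resumes from the point where it was suspended.
`return` ends the function, whereas `yield` can produce multiple values over successive iterations.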
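The suspend-and-resume behaviour can be seen in a small example. The function name `count_up` is made up for this illustration.

```python
def count_up(limit):
    print("generator started")
    for i in range(limit):
        yield i              # hand back i and pause here
    print("generator finished")

gen = count_up(3)            # nothing is printed yet: the body has not started running
first = next(gen)            # runs the body until the first yield
second = next(gen)           # resumes right after the previous yield
rest = list(gen)             # drains the remaining values and lets the function finish
print(first, second, rest)
```

This is the same mechanism that lets `data_iter` hand out one batch at a time instead of building all batches up front.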

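What is `num_workers`

The number of subprocesses the DataLoader uses to load and prefetch batches in parallel.
0: load in the main process; the most stable option, uses the least memory, and has the fewest pitfalls.
1 or more: worker processes call `__getitem__` in parallel and read data asynchronously in the background; this commonly speeds things up when there is slow I/O or CPU preprocessing (image decoding, data augmentation).
More workers usually load faster, up to the point where the CPU or disk becomes the bottleneck, but they also use more memory.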
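A rough way to see the effect is to time one pass over a dataset with different settings. The sketch below assumes a small in-memory `TensorDataset`; the helper `time_one_pass` is a hypothetical name. For data this cheap to load, worker startup overhead can easily outweigh any gain, so the timings are only meaningful relative to your own workload.

```python
import time
import torch
from torch.utils.data import TensorDataset, DataLoader

ds = TensorDataset(torch.rand([400, 2]), torch.rand([400, 1]))

def time_one_pass(num_workers):
    """Iterate once over the dataset and return (batch count, elapsed seconds)."""
    dl = DataLoader(ds, batch_size=10, shuffle=True, num_workers=num_workers)
    start = time.perf_counter()
    num_batches = sum(1 for _ in dl)
    return num_batches, time.perf_counter() - start

# The guard matters when num_workers > 0: on platforms that start workers by
# spawning a fresh interpreter, each worker re-imports this module.
if __name__ == "__main__":
    for workers in (0, 2):
        batches, seconds = time_one_pass(workers)
        print(f"num_workers={workers}: {batches} batches in {seconds:.3f}s")
```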

Author: Adam Chen

Link: https://517adam.github.io/blog/2025/09/17/pytorch-day08/

Copyright Notice: All articles on this blog are licensed under CC BY-NC-SA 3.0 CN unless otherwise stated.
