Pytorch实现自己的残差网络图片分类器_残差块(ResidualBlock)资源-CSDN下载

共3个文件

py：2个

docx：1个

人工智能

Resnet

pytorch

4星 · 超过85%的资源需积分: 48 47 浏览量 2018-06-25 14:56:27 上传评论 6 收藏 268KB RAR 举报

**PyTorch实现自己的残差网络图片分类器** 在深度学习领域，图像分类是一个核心任务，而Residual Network（残差网络）因其独特的结构设计，有效地解决了深度神经网络中的梯度消失和爆炸问题，使得训练更深的网络成为可能。本教程将详细介绍如何使用PyTorch框架构建一个自定义的ResNet模型进行图像分类。我们来理解ResNet的基本概念。ResNet的核心是残差块，其创新之处在于引入了跳接（skip connection）机制。通过这种设计，网络的输入可以直接加到输出上，从而让信息可以直接传递到深层，解决了深度网络训练的难题。在每个残差块里，数据流会经过两个或三个卷积层，然后通过加法操作与输入连接，形成“恒等映射”。接下来，我们将探讨如何在PyTorch中构建ResNet模型： 1. **基础模块**：我们需要定义基础的卷积层和批量归一化层。在PyTorch中，`nn.Conv2d`用于创建卷积层，`nn.BatchNorm2d`用于批量归一化，它们是构建ResNet的基础模块。 2. **基本残差块**：残差块由两个卷积层组成，每个卷积层后跟一个批量归一化层和ReLU激活函数。使用短路结构（加法操作）将输入与经过两层卷积后的特征图相加。关键代码如下： ```python class BasicBlock(nn.Module): def __init__(self, in_channels, out_channels, stride=1): super(BasicBlock, self).__init__() self.conv1 = nn.Conv2d(in_channels, out_channels, kernel_size=3, stride=stride, padding=1, bias=False) self.bn1 = nn.BatchNorm2d(out_channels) self.conv2 = nn.Conv2d(out_channels, out_channels, kernel_size=3, stride=1, padding=1, bias=False) self.bn2 = nn.BatchNorm2d(out_channels) self.shortcut = nn.Sequential() if stride != 1 or in_channels != out_channels: self.shortcut = nn.Sequential( nn.Conv2d(in_channels, out_channels, kernel_size=1, stride=stride, bias=False), nn.BatchNorm2d(out_channels) ) def forward(self, x): out = F.relu(self.bn1(self.conv1(x))) out = self.bn2(self.conv2(out)) out += self.shortcut(x) out = F.relu(out) return out ``` 3. **ResNet模型**：ResNet模型由多个基本残差块堆叠而成，根据网络深度不同，可以选择不同的数量。此外，还需要一个全局平均池化层和全连接层用于分类。代码示例： ```python class ResNet(nn.Module): def __init__(self, block, layers, num_classes=10): super(ResNet, self).__init__() self.in_channels = 64 self.conv = nn.Conv2d(3, 64, kernel_size=3, stride=1, padding=1, bias=False) self.bn = nn.BatchNorm2d(64) self.layer1 = self.make_layer(block, 64, layers[0]) self.layer2 = self.make_layer(block, 128, layers[1], stride=2) self.layer3 = self.make_layer(block, 256, layers[2], stride=2) self.layer4 = self.make_layer(block, 512, layers[3], stride=2) self.avg_pool = nn.AvgPool2d(7) self.fc = nn.Linear(512 * block.expansion, num_classes) def make_layer(self, block, out_channels, blocks, stride=1): downsample = None if (stride != 1) or (self.in_channels != out_channels * block.expansion): downsample = nn.Sequential( nn.Conv2d(self.in_channels, out_channels * block.expansion, kernel_size=1, stride=stride, bias=False), nn.BatchNorm2d(out_channels * block.expansion) ) layers = [] layers.append(block(self.in_channels, out_channels, stride, downsample)) self.in_channels = out_channels * block.expansion for _ in range(1, blocks): layers.append(block(self.in_channels, out_channels)) return nn.Sequential(*layers) def forward(self, x): out = F.relu(self.bn(self.conv(x))) out = self.layer1(out) out = self.layer2(out) out = self.layer3(out) out = self.layer4(out) out = self.avg_pool(out) out = out.view(out.size(0), -1) out = self.fc(out) return out ``` 4. **训练过程**：在`train.py`文件中，我们可以设置优化器（如SGD）、损失函数（如交叉熵损失）和训练循环。在训练循环中，我们将数据送入模型，计算损失，执行反向传播并更新权重。同时，还需要定期验证模型性能。 5. **预测过程**：`predict.py`文件通常包含加载预训练模型、处理单个图像并进行预测的逻辑。它会读取图片，将其转化为模型所需的格式，然后通过模型得到预测类别。通过以上步骤，我们就完成了ResNet模型的构建、训练和预测。在实际应用中，可以根据需求调整网络结构（如ResNet18、ResNet34、ResNet50等）和参数，以适应不同的任务和数据集。在PyTorch中，ResNet模型的强大灵活性和易用性使其成为深度学习研究和实践中的常用工具。

资源推荐

资源详情

资源评论

收起资源包目录

ResNetTraining.rar （3个子文件）

train.py 7KB

predict.py 7KB

README.docx 270KB

import sys import getopt import argparse import os import shutil import time import torch import torch.nn as nn import torch.nn.parallel import torch.backends.cudnn as cudnn import torch.distributed as dist import torch.optim import torch.utils.data import torch.utils.data.distributed import torchvision.transforms as transforms import torchvision.datasets as dsets import torchvision.models as models def main(argv): inputfile = '' savefile = '' testfile='' try: opts, args = getopt.getopt(argv,'hi:t:s:',["ifile =","tfile =""sfile ="]) except getopt.GetoptError: print("train.py -i <input_data_file> -t <test_data_file> -s <save_model_file>") sys.exit(2) for opt,arg in opts: if opt =="-h": print("train.py -i <input_data_file> -t <test_data_file> -s <save_model_file>") sys.exit() elif opt in ("-i","--ifile"): inputfile = arg elif opt in("-s","--sfile"): savefile = arg elif opt in("-t","--tfile"): testfile = arg print("the input datasets is :",inputfile) print("the test datasets is :",testfile) print("the save model file is :",savefile) model = models.resnet18(pretrained=True) fc_features = model.fc.in_features model.fc = nn.Linear(fc_features, 102) model = torch.nn.DataParallel(model).cuda() # parameters arch = 'resnet18' lr = 0.05 momentum = 0.9 weight_decay = 1e-4 resume = '' epochs = 30 start_epoch = 0 evaluate = 0 best_prec1 = 0 print_freq = 10 # define loss function (criterion) and optimizer criterion = nn.CrossEntropyLoss().cuda() optimizer = torch.optim.SGD(model.parameters(), lr, momentum=momentum, weight_decay=weight_decay) cudnn.benchmark = True # data prepearing train_dir = inputfile valid_dir = testfile normalize = transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]) batch_size=10 data_transforms = transforms.Compose([ transforms.Resize(256), transforms.CenterCrop(224), transforms.ToTensor(), normalize, ]) # TODO: Load the datasets with ImageFolder train_datasets = dsets.ImageFolder(train_dir, data_transforms) # TODO: Using the image datasets and the trainforms, define the dataloaders trainloader = torch.utils.data.DataLoader(dataset = train_datasets, batch_size=batch_size, shuffle=True, drop_last=False, pin_memory=True) valid_datasets = dsets.ImageFolder(valid_dir, data_transforms) validloader = torch.utils.data.DataLoader(dataset = valid_datasets, batch_size=batch_size, shuffle=True, drop_last=False, pin_memory=True) # training for epoch in range(start_epoch, epochs): # train for one epoch train(trainloader, model, criterion, optimizer, epoch) prec1 = validate(validloader, model, criterion) # saving save_checkpoint(model,savefile) def train(train_loader, model, criterion, optimizer, epoch): losses = AverageMeter() top1 = AverageMeter() top5 = AverageMeter() print_freq = 10 # switch to train mode model.train() for i, (input, target) in enumerate(train_loader): # measure data loading time target = target.cuda(non_blocking=True) # compute output output = model(input) loss = criterion(output, target) # measure accuracy and record loss prec1, prec5 = accuracy(output, target, topk=(1, 5)) losses.update(loss.item(), input.size(0)) top1.update(prec1[0], input.size(0)) top5.update(prec5[0], input.size(0)) # compute gradient and do SGD step optimizer.zero_grad() loss.backward() optimizer.step() # measure elapsed time if i % print_freq == 0: print('Epoch: [{0}][{1}/{2}]\t' 'Loss {loss.val:.4f} ({loss.avg:.4f})\t' 'Prec@1 {top1.avg:.3f}\t' 'Prec@5 {top5.avg:.3f}'.format( epoch, i, len(train_loader), loss=losses, top1=top1, top5=top5)) def validate(val_loader, model, criterion): losses = AverageMeter() top1 = AverageMeter() top5 = AverageMeter() print_freq = 10 # switch to evaluate mode model.eval() with torch.no_grad(): for i, (input, target) in enumerate(val_loader): target = target.cuda(non_blocking=True) # compute output output = model(input) loss = criterion(output, target) # measure accuracy and record loss prec1, prec5 = accuracy(output, target, topk=(1, 5)) losses.update(loss.item(), input.size(0)) top1.update(prec1[0], input.size(0)) top5.update(prec5[0], input.size(0)) # measure elapsed time if i % print_freq == 0: print('Test: [{0}/{1}]\t' 'Loss {loss.val:.4f} ({loss.avg:.4f})\t' 'Prec@1 {top1.avg:.3f}\t' 'Prec@5 {top5.avg:.3f}'.format( i, len(val_loader), loss=losses, top1=top1, top5=top5)) print(' * Prec@1 {top1.avg:.3f} Prec@5 {top5.avg:.3f}' .format(top1=top1, top5=top5)) return top1.avg def save_checkpoint(model, filename): torch.save(model, filename) class AverageMeter(object): """Computes and stores the average and current value""" def __init__(self): self.reset() def reset(self): self.val = 0 self.avg = 0 self.sum = 0 self.count = 0 def update(self, val, n=1): self.val = val self.sum += val * n self.count += n self.avg = self.sum / self.count def adjust_learning_rate(optimizer, epoch): """Sets the learning rate to the initial LR decayed by 10 every 10 epochs""" lr = lr * (0.1 ** (epoch // 10)) for param_group in optimizer.param_groups: param_group['lr'] = lr def accuracy(output, target, topk=(1,)): """Computes the precision@k for the specified values of k""" with torch.no_grad(): maxk = max(topk) batch_size = target.size(0) _, pred = output.topk(maxk, 1, True, True) pred = pred.t() correct = pred.eq(target.view(1, -1).expand_as(pred)) res = [] for k in topk: correct_k = correct[:k].view(-1).float().sum(0, keepdim=True) res.append(correct_k.mul_(100.0 / batch_size)) return res if __name__ =="__main__": main(sys.argv[1:])

评论收藏

内容反馈