面向深度卷积神经网络的确定性连接丢弃算法-陕西师范大学学报期刊社网站

陕西师范大学学报（自然科学版）

人工智能专题

面向深度卷积神经网络的确定性连接丢弃算法

李鸿杨1，潘静1，2，何宇清1，庞彦伟1*

（1 天津大学电气自动化与信息工程学院，天津 300072; 2 天津职业技术师范大学电子工程学院，天津 300222）

庞彦伟，男，教授，博士，主要从事深度学习、模式识别研究。E-mail: pyw@tju.edu.cn

摘要:

针对深度卷积神经网络中存在的过拟合问题，本文提出了一种确定性连接丢弃的正则化方法。核心思想是根据不同卷积滤波器权重对结果的贡献度不同，确定性丢弃卷积层层间连接，通过降低卷积滤波器权重的空间维度，使得卷积神经网络各层之间的连接更稀疏。通过将算法应用于图像分类任务来验证算法的性能,在MNIST、CIFAR-10和CIFAR-100数据集上，错误率分别为0.32%、5.33%、26.88%，相比于原始实验错误率分别降低0.15%、1.09%、1.36%。实验表明，本算法能够有效处理深度卷积神经网络的过拟合问题，并能提升网络的鲁棒性和泛化能力。

关键词：

深度学习；卷积神经网络；正则化；连接丢弃

收稿日期：

2019-07-01

中图分类号：

TP391.4

文献标识码：

文章编号：

1672-4291(2020)02-0032-06

基金项目：

国家自然科学基金（61472274）

Doi:

A deterministic DropConnect algorithm for deep convolutional neural networks

LI Hongyang1，PAN Jing1,2，HE Yuqing1，PANG Yanwei1*

(1 School of Electrical and Information Engineering， Tianjin University，Tianjin 300072， China;2 School of Electronic Engineering， Tianjin University of Technology and Education,Tianjin 300222， China)

Abstract:

In order to avoid overfitting in deep convolutional neural networks, a new regularization method called deterministic DropConnect is proposed. The main idea of the deterministic DropConnect is to reduce the spatial dimension of the convolution filter by selectively discarding the partial connection, according to the contributions of the different weights of the convolutional filters to the result. The connections between the layers of the convolutional neural networks are more sparse. In this paper, the effectiveness of the proposed deterministic DropConnect is evaluated on the task of image classification. The error rates are 0.32%, 5.33% and 26.88% in the MNIST, CIFAR-10 and CIFAR-100 datasets, respectively. Compared with the original experimental error rates, the error rates reduce by 0.15%, 1.09% and 1.36%, respectively. Experiments show that the proposed algorithm is an effective method to deal with the overfitting in neural networks, and can improve the robustness and generalization ability of the convolutional neural networks.

KeyWords:

deep learning; convolutional neural networks; regularization; DropConnect 深度学习（deep learning，DL）是近年来机器学习领域的一个研究热点。深度卷积神经网络（deep convolutional neural network）作为深度学习中一种极具代表性的网络结构，在计算机视觉、自动驾驶、广告推荐、自然语言处理和生命科学等领域均取得了突破性进展，有着广泛的应用前景。深度卷积神经网络通过构造深度学习模型，让模型自主地从海量训练数据中学习特征，用于提升对未来样本或未知事件的判断和预测能力。不同于传统的手工设计特征，通过大数据学习得到的特征更能刻画数据丰富的内在信息。这也是深度学习模型在大数据领域越来越受关注的本质原因。为了更好地将深度卷积神经网络应用于现实生活中，很多研究都致力于构建更好\[1-2\]和更快\[3-4\]的深度学习模型。