学术论文

      基于混合maxout单元的卷积神经网络性能优化

      Improving deep convolutional neural networks with mixed maxout units

      摘要:
      针对深度卷积神经网络中maxout单元非最大特征无法传递、特征图像子空间池化表达能力不足的局限性,提出混合maxout(mixout,mixed maxout)单元.首先,计算相同输入在不同卷积变换下所形成的特征图像子空间的指数概率分布;其次,根据概率分布计算特征图像子空间的期望;最后,利用伯努利分布对子空间的最大值与期望值加权,均衡单元模型.分别构建基于mixout单元的简单模型和网中网模型进行实验,结果表明mixout单元模型性能较好.
      Abstract:
      The maxout units have the problem of not delivering non-max features, resulting in the insufficient of pooling operation over a subspace that is composed of several linear feature mappings, when they are applied in deep convolu-tional neural networks. The mixed maxout (mixout) units were proposed to deal with this constrain. Firstly, the exponen-tial probability of the feature mappings getting from different linear transformations was computed. Then, the averaging of a subspace of different feature mappings by the exponential probability was computed. Finally, the output was ran-domly sampled from the max feature and the mean value by the Bernoulli distribution, leading to the better utilizing of model averaging ability of dropout. The simple models and network in network models was built to evaluate the perfor-mance of mixout units. The results show that mixout units based models have better performance.
      Author: ZHAO Hui-zhen LIU Fu-xian LI Long-yue LUO Chang
      作者单位: 空军工程大学防空反导学院,陕西西安,710051
      刊 名: 通信学报 ISTICEIPKU
      年,卷(期): 2017, 38(7)
      分类号: TP391.3
      在线出版日期: 2017年8月24日
      基金项目: 国家自然科学基金资助项目(No.61601499) The National Natural Science Foundation of China