Learning Notes: Normalization
Published: 2019-03-07


Normalization Methods in Deep Learning

Normalization techniques are crucial in deep learning, particularly for training deep neural networks efficiently. They address problems such as Internal Covariate Shift: the change in the distribution of each layer's inputs as the parameters of the preceding layers are updated during training. Here, we explore the key normalization methods and their applications.

Batch Normalization (BN)

Batch Normalization (BN) was introduced to address the Internal Covariate Shift problem. By normalizing each feature over the mini-batch to zero mean and unit variance, BN stabilizes the input distribution of each layer, which helps prevent vanishing or exploding gradients. BN also introduces the learnable parameters γ (scale) and β (shift), applied after normalization, so the model can still represent whatever distribution the following layer needs.
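
To make the batch statistics concrete, here is a minimal NumPy sketch of the BN forward pass for a fully connected layer (training mode only; a real implementation would also keep running estimates of the mean and variance for use at inference):

```python
import numpy as np

def batch_norm(x, gamma, beta, eps=1e-5):
    """Batch norm over an (N, C) mini-batch: one mean/variance per feature."""
    mean = x.mean(axis=0)                    # per-feature mean, shape (C,)
    var = x.var(axis=0)                      # per-feature variance, shape (C,)
    x_hat = (x - mean) / np.sqrt(var + eps)  # zero mean, unit variance
    return gamma * x_hat + beta              # learnable scale and shift

x = np.random.randn(32, 64)                  # 32 samples, 64 features
y = batch_norm(x, gamma=np.ones(64), beta=np.zeros(64))
```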

Advantages of BN

  • Reduces Internal Covariate Shift, accelerating training.
  • Mitigates gradient issues, especially with activation functions like sigmoid or tanh.
  • Less sensitive to initial parameter values, enhancing training stability.

Disadvantages of BN

  • Batch-size sensitivity: small batches give noisy, inaccurate estimates of the mean and variance.
  • Requires tracking running statistics during training so they can be used at inference time, which adds bookkeeping overhead.
  • Hard to apply to RNNs, where sequence lengths vary from sample to sample.

Layer Normalization (LN)

Layer Normalization (LN) normalizes all of the features of a single sample within a layer, independently of the other samples in the batch. Unlike BN, LN does not depend on the batch size at all, making it suitable for small batches. It computes the mean and variance over a layer's features separately for each sample, so the normalization is the same no matter how the batch is composed. This makes LN more versatile, particularly for RNNs, where sequence lengths vary.
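
For contrast with the BN sketch above, here is a minimal LN forward pass; only the axis the statistics are computed over changes, so the batch size never enters the calculation:

```python
import numpy as np

def layer_norm(x, gamma, beta, eps=1e-5):
    """Layer norm over an (N, C) batch: one mean/variance per sample."""
    mean = x.mean(axis=1, keepdims=True)     # per-sample mean, shape (N, 1)
    var = x.var(axis=1, keepdims=True)       # per-sample variance, shape (N, 1)
    x_hat = (x - mean) / np.sqrt(var + eps)
    return gamma * x_hat + beta              # gamma, beta shaped (C,)

x = np.random.randn(1, 64)                   # works even with a batch of one
y = layer_norm(x, gamma=np.ones(64), beta=np.zeros(64))
```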

Differences Between BN and LN

  • BN normalizes each feature over the batch dimension; LN normalizes each sample over its feature dimension.
  • LN does not need to store running batch statistics, saving memory and making training and inference behave identically.

Weight Normalization (WN)

Weight Normalization (WN) normalizes the filter weights rather than the activations. It reparameterizes each weight vector as w = g · v / ‖v‖, decoupling its magnitude g from its direction v so the two can be learned independently. Unlike BN and LN, WN does not depend on the distribution of the input data, making it a form of parameter normalization rather than activation normalization.
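
A minimal sketch of this reparameterization for a linear layer, with one direction vector v and one scalar magnitude g per output unit (the names g and v follow the WN formulation):

```python
import numpy as np

def weight_norm_linear(x, v, g, b):
    """Linear layer with w = g * v / ||v|| per output unit."""
    w = g[:, None] * v / np.linalg.norm(v, axis=1, keepdims=True)
    return x @ w.T + b

x = np.random.randn(32, 64)                  # batch of 32, 64 input features
v = np.random.randn(16, 64)                  # direction, one row per output unit
y = weight_norm_linear(x, v, g=np.ones(16), b=np.zeros(16))
```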

Instance Normalization (IN)

Instance Normalization (IN) is ideal for tasks like image style transfer, where the output depends on an individual image instance rather than on the whole batch. IN normalizes each channel of each image instance with its own statistics, removing instance-specific contrast while allowing styles to differ across instances.
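
A minimal sketch for feature maps in (N, C, H, W) layout; each channel of each image gets its own mean and variance:

```python
import numpy as np

def instance_norm(x, eps=1e-5):
    """Instance norm over (N, C, H, W): stats per sample and per channel."""
    mean = x.mean(axis=(2, 3), keepdims=True)   # shape (N, C, 1, 1)
    var = x.var(axis=(2, 3), keepdims=True)
    return (x - mean) / np.sqrt(var + eps)

x = np.random.randn(4, 3, 8, 8)                 # 4 images, 3 channels, 8x8
y = instance_norm(x)
```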

Group Normalization (GN)

Group Normalization (GN) addresses the limitations of BN by grouping features along the channel dimension. Each group is normalized separately, removing the dependence of the statistics on the batch size. GN is particularly effective for small batches and can be seen as a middle ground between BN and LN, as sketched after the list below.

Key Characteristics of GN

  • Divides features into groups and normalizes each group separately.
  • Reduces dependency on batch size, making it suitable for small batches.
  • Combines aspects of BN and LN, providing a flexible normalization approach.
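
A minimal sketch following the reshape-normalize-reshape recipe from the GN paper:

```python
import numpy as np

def group_norm(x, gamma, beta, G, eps=1e-5):
    """Group norm over (N, C, H, W): statistics per sample and channel group."""
    N, C, H, W = x.shape
    xg = x.reshape(N, G, C // G, H, W)              # split channels into G groups
    mean = xg.mean(axis=(2, 3, 4), keepdims=True)   # stats per (sample, group)
    var = xg.var(axis=(2, 3, 4), keepdims=True)
    xg = (xg - mean) / np.sqrt(var + eps)
    return xg.reshape(N, C, H, W) * gamma + beta    # gamma, beta shaped (1, C, 1, 1)

x = np.random.randn(2, 32, 8, 8)                    # a batch of 2 is fine for GN
y = group_norm(x, np.ones((1, 32, 1, 1)), np.zeros((1, 32, 1, 1)), G=8)
```

Setting G = 1 puts every channel in one group (LN-style per-sample statistics over all channels), while G = C gives one channel per group (IN), which is the sense in which GN interpolates between existing methods.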

Switchable Normalization (SN)

Switchable Normalization (SN) lets each layer learn which normalization statistics suit it best: the layer blends BN, IN, and LN statistics with weights that are trained end-to-end along with the rest of the network. This adaptability addresses the limitation of committing to a single fixed normalization method and removes the need for manual per-task tuning.
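
A minimal sketch of the blending idea, assuming the SN formulation in which the IN statistics are computed first and the LN and BN statistics are derived from them; w_mean and w_var stand for the learnable three-way logits, and only training-mode BN statistics are shown:

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def switchable_norm(x, gamma, beta, w_mean, w_var, eps=1e-5):
    """Blend IN, LN, and BN statistics of an (N, C, H, W) tensor."""
    mu_in = x.mean(axis=(2, 3), keepdims=True)      # per (sample, channel)
    var_in = x.var(axis=(2, 3), keepdims=True)
    mu_ln = mu_in.mean(axis=1, keepdims=True)       # LN stats from IN stats
    var_ln = (var_in + mu_in**2).mean(axis=1, keepdims=True) - mu_ln**2
    mu_bn = mu_in.mean(axis=0, keepdims=True)       # BN stats from IN stats
    var_bn = (var_in + mu_in**2).mean(axis=0, keepdims=True) - mu_bn**2
    p, q = softmax(w_mean), softmax(w_var)          # learned 3-way weights
    mean = p[0] * mu_in + p[1] * mu_ln + p[2] * mu_bn
    var = q[0] * var_in + q[1] * var_ln + q[2] * var_bn
    return gamma * (x - mean) / np.sqrt(var + eps) + beta

x = np.random.randn(4, 8, 4, 4)
y = switchable_norm(x, np.ones((1, 8, 1, 1)), np.zeros((1, 8, 1, 1)),
                    w_mean=np.zeros(3), w_var=np.zeros(3))
```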

Summary

Normalization techniques like BN, LN, WN, IN, GN, and SN each address specific challenges in deep learning. Understanding their properties and intended use cases helps in choosing the right method for a given task, ensuring efficient and stable training of deep neural networks.

Reprinted from: http://tiajz.baihongyu.com/
