當前位置：首頁 > 编程资源 > 编程问答 >内容正文

编程问答

tensorflow 进阶五 ---TensorFlow conv2d原理及实践

發(fā)布時間：2025/4/5 编程问答 22 豆豆

生活随笔收集整理的這篇文章主要介紹了 tensorflow 进阶五 ---TensorFlow conv2d原理及实践小編覺得挺不錯的,現(xiàn)在分享給大家,幫大家做個參考.

tf.nn.conv2d(input, filter, strides, padding, use_cudnn_on_gpu=None, data_format=None, name=None)

官方教程說明：

給定四維的input和filter tensor，計算一個二維卷積

Args:

input: A Tensor. type必須是以下幾種類型之一: half, float32, float64. filter: A Tensor. type和input必須相同 strides: A list of ints.一維，長度4，在input上切片采樣時，每個方向上的滑窗步長，必須和format指定的維度同階 padding: A string from: "SAME", "VALID". padding 算法的類型 use_cudnn_on_gpu: An optional bool. Defaults to True. data_format: An optional string from: "NHWC", "NCHW"，默認為"NHWC"。指定輸入輸出數(shù)據(jù)格式，默認格式為"NHWC", 數(shù)據(jù)按這樣的順序存儲： [batch, in_height, in_width, in_channels] 也可以用這種方式："NCHW", 數(shù)據(jù)按這樣的順序存儲： [batch, in_channels, in_height, in_width] name: 操作名，可選.

Returns:

A Tensor. type與input相同

Given an input tensor of shape [batch, in_height, in_width, in_channels]
and a filter / kernel tensor of shape
[filter_height, filter_width, in_channels, out_channels]

conv2d實際上執(zhí)行了以下操作：

將filter轉(zhuǎn)為二維矩陣，shape為 [filter_height * filter_width * in_channels, output_channels]. 從input tensor中提取image patches，每個patch是一個virtual tensor，shape[batch, out_height, out_width, filter_height * filter_width * in_channels]. 將每個filter矩陣和image patch向量相乘

具體來講，當data_format為NHWC時：

output[b, i, j, k] =sum_{di, dj, q} input[b, strides[1] * i + di, strides[2] * j + dj, q] *filter[di, dj, q, k]

input 中的每個patch都作用于filter，每個patch都能獲得其他patch對filter的訓練
需要滿足strides[0] = strides[3] = 1. 大多數(shù)水平步長和垂直步長相同的情況下：strides = [1, stride, stride, 1].

下面舉例來進行說明

在最基本的例子中，沒有padding和stride = 1。讓我們假設你的input和kernel有：

當您的內(nèi)核您將收到以下輸出：在此輸入圖像說明，它按以下方式計算：

14 = 4 * 1 + 3 * 0 + 1 * 1 + 2 * 2 + 1 * 1 + 0 * 0 + 1 * 0 + 2 * 0 + 4 * 1 6 = 3 * 1 + 1 * 0 + 0 * 1 + 1 * 2 + 0 * 1 + 1 * 0 + 2 * 0 + 4 * 0 + 1 * 1 6 = 2 * 1 + 1 * 0 + 0 * 1 + 1 * 2 + 2 * 1 + 4 * 0 + 3 * 0 + 1 * 0 + 0 * 1 12 = 1 * 1 + 0 * 0 + 1 * 1 + 2 * 2 + 4 * 1 + 1 * 0 + 1 * 0 + 0 * 0 + 2 * 1

TF的conv2d函數(shù)批量計算卷積，并使用稍微不同的格式。對于一個輸入，它是[batch, in_height, in_width, in_channels]內(nèi)核的[filter_height, filter_width, in_channels, out_channels]。所以我們需要以正確的格式提供數(shù)據(jù)：

import tensorflow as tf k = tf.constant([[1, 0, 1],[2, 1, 0],[0, 0, 1] ], dtype=tf.float32, name='k') i = tf.constant([[4, 3, 1, 0],[2, 1, 0, 1],[1, 2, 4, 1],[3, 1, 0, 2] ], dtype=tf.float32, name='i') kernel = tf.reshape(k, [3, 3, 1, 1], name='kernel') image = tf.reshape(i, [1, 4, 4, 1], name='image')res = tf.squeeze(tf.nn.conv2d(image, kernel, [1, 1, 1, 1], "VALID"))pp=tf.nn.conv2d(image, kernel, [1, 1, 1, 1], "VALID") # VALID means no padding with tf.Session() as sess:print ('kernel=',sess.run(kernel)) print ('image=',sess.run(image))print('pp=',sess.run(pp))print ('res=',sess.run(res)) kernel= [[[[ 1.]][[ 0.]][[ 1.]]][[[ 2.]][[ 1.]][[ 0.]]][[[ 0.]][[ 0.]][[ 1.]]]] image= [[[[ 4.][ 3.][ 1.][ 0.]][[ 2.][ 1.][ 0.][ 1.]][[ 1.][ 2.][ 4.][ 1.]][[ 3.][ 1.][ 0.][ 2.]]]] pp= [[[[ 14.][ 6.]][[ 6.][ 12.]]]] res= [[ 14. 6.][ 6. 12.]]

原文鏈接

相關文章鏈接

總結

以上是生活随笔為你收集整理的tensorflow 进阶五 ---TensorFlow conv2d原理及实践的全部內(nèi)容，希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網(wǎng)站內(nèi)容還不錯，歡迎將生活随笔推薦給好友。

上一篇： tensorflow 进阶(四)---C
下一篇： VALID SAME

3atv精品不卡视频,97人人超碰国产精品最新,中文字幕av一区二区三区人妻少妇,久久久精品波多野结衣,日韩一区二区三区精品

编程问答

tensorflow 进阶 五 ---TensorFlow conv2d原理及实践

總結

tensorflow 进阶五 ---TensorFlow conv2d原理及实践