TensorFlow autoencoder with custom training examples from binary files

I am trying to adapt the TensorFlow autoencoder code (https://github.com/aymericdamien/TensorFlow-Examples/blob/master/examples/3_NeuralNetworks/autoencoder.py) to use my own training examples. My training examples are single-channel 29*29 (grayscale) images, saved consecutively as raw uint8 values in a binary file. I created a module that builds the data batches which will feed the training. Here is the module:

import tensorflow as tf 

# various initialization variables 
BATCH_SIZE = 128 
N_FEATURES = 9 

def batch_generator(filenames, record_bytes):
    """filenames is the list of binary files to read from.
    Each record holds one 29x29 grayscale image stored as raw uint8 values.
    """

    record_bytes = 29**2  # 29x29 image per record
    filename_queue = tf.train.string_input_producer(filenames)
    reader = tf.FixedLengthRecordReader(record_bytes=record_bytes)  # reads fixed-size binary records
    _, value = reader.read(filename_queue)
    print(value)

    # Leftover from the CSV example this module was adapted from:
    # record_defaults describe the column types/defaults of a CSV file
    # (9 feature columns, one string column, and an integer label)
    #record_defaults = [[1.0] for _ in range(N_FEATURES)]
    #record_defaults[4] = ['']
    #record_defaults.append([1])

    # decode the raw record bytes into a vector of uint8 values
    content = tf.decode_raw(value, out_type=tf.uint8)
    #print(content)

    # convert the 5th column (present/absent) to the binary values 0 and 1
    # (also leftover from the CSV example)
    #condition = tf.equal(content[4], tf.constant('Present'))
    #content[4] = tf.where(condition, tf.constant(1.0), tf.constant(0.0))

    # pack all uint8 values into a tensor
    features = tf.stack(content)
    #print(features)

    # assign the last column to the label
    #label = content[-1]

    # The bytes read represent the image, which we reshape
    # from [depth * height * width] to [depth, height, width].
    depth_major = tf.reshape(
        tf.strided_slice(content, [0], [record_bytes]),
        [1, 29, 29])

    # Convert from [depth, height, width] to [height, width, depth].
    uint8image = tf.transpose(depth_major, [1, 2, 0])

    # minimum number of elements in the queue after a dequeue, used to ensure
    # that the samples are sufficiently mixed
    # I think 10 times the BATCH_SIZE is sufficient
    min_after_dequeue = 10 * BATCH_SIZE

    # the maximum number of elements in the queue
    capacity = 20 * BATCH_SIZE

    # shuffle the data to generate BATCH_SIZE sample batches
    data_batch = tf.train.shuffle_batch([uint8image], batch_size=BATCH_SIZE,
                                        capacity=capacity,
                                        min_after_dequeue=min_after_dequeue)

    return data_batch
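
For reference, the binary file simply contains the raw 29*29 = 841 uint8 values of each image written back to back, with no header. A minimal sketch of how such a file could be produced with NumPy (the images below are just dummy data):

import numpy as np

# 1000 dummy 29x29 grayscale images; each record is 841 raw uint8 bytes,
# written consecutively with no header or separator
images = np.random.randint(0, 256, size=(1000, 29, 29), dtype=np.uint8)
with open('data/building_extract_train.bin', 'wb') as f:
    f.write(images.tobytes())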

I then adapted the autoencoder code so that batch_xs is loaded from my input-feeding code:

from __future__ import division, print_function, absolute_import 

# Various initialization variables 
DATA_PATH1 = 'data/building_extract_train.bin' 

import tensorflow as tf 
import numpy as np 
import matplotlib.pyplot as plt 

# custom imports 
import data_reader 

# Import MNIST data 
from tensorflow.examples.tutorials.mnist import input_data 
mnist = input_data.read_data_sets("MNIST_data", one_hot=True) 

# Parameters 
learning_rate = 0.01 
training_epochs = 20 
batch_size = 256 
display_step = 1 
examples_to_show = 10 

# Network Parameters 
n_hidden_1 = 256 # 1st layer num features 
n_hidden_2 = 128 # 2nd layer num features 
#n_input = 784 # edge-data input (img shape: 28*28) 
n_input = 841 # edge-data input (img shape: 29*29) 

# tf Graph input (only pictures) 
X = tf.placeholder("float", [None, n_input]) 

# create the data batches (queue) 
# Takes two parameters: the list of binary file names and the size of a record in bytes
data_batch = data_reader.batch_generator([DATA_PATH1],29**2) 

weights = {
    'encoder_h1': tf.Variable(tf.random_normal([n_input, n_hidden_1])),
    'encoder_h2': tf.Variable(tf.random_normal([n_hidden_1, n_hidden_2])),
    'decoder_h1': tf.Variable(tf.random_normal([n_hidden_2, n_hidden_1])),
    'decoder_h2': tf.Variable(tf.random_normal([n_hidden_1, n_input])),
}
biases = {
    'encoder_b1': tf.Variable(tf.random_normal([n_hidden_1])),
    'encoder_b2': tf.Variable(tf.random_normal([n_hidden_2])),
    'decoder_b1': tf.Variable(tf.random_normal([n_hidden_1])),
    'decoder_b2': tf.Variable(tf.random_normal([n_input])),
}

# Building the encoder 
def encoder(x):
    # Encoder hidden layer with sigmoid activation #1
    layer_1 = tf.nn.sigmoid(tf.add(tf.matmul(x, weights['encoder_h1']),
                                   biases['encoder_b1']))
    # Encoder hidden layer with sigmoid activation #2
    layer_2 = tf.nn.sigmoid(tf.add(tf.matmul(layer_1, weights['encoder_h2']),
                                   biases['encoder_b2']))
    return layer_2


# Building the decoder 
def decoder(x):
    # Decoder hidden layer with sigmoid activation #1
    layer_1 = tf.nn.sigmoid(tf.add(tf.matmul(x, weights['decoder_h1']),
                                   biases['decoder_b1']))
    # Decoder hidden layer with sigmoid activation #2
    layer_2 = tf.nn.sigmoid(tf.add(tf.matmul(layer_1, weights['decoder_h2']),
                                   biases['decoder_b2']))
    return layer_2

# Construct model 
encoder_op = encoder(X) 
decoder_op = decoder(encoder_op) 

# Prediction 
y_pred = decoder_op 
# Targets (Labels) are the input data. 
y_true = X 

# Define loss and optimizer, minimize the squared error 
cost = tf.reduce_mean(tf.pow(y_true - y_pred, 2)) 
optimizer = tf.train.RMSPropOptimizer(learning_rate).minimize(cost) 

# Initializing the variables 
init = tf.global_variables_initializer() 

# Launch the graph 
with tf.Session() as sess:
    coord = tf.train.Coordinator()
    threads = tf.train.start_queue_runners(coord=coord)
    sess.run(init)
    total_batch = int(mnist.train.num_examples/batch_size)
    # Training cycle
    for epoch in range(training_epochs):
        # Loop over all batches
        for i in range(total_batch):
            #batch_xs, batch_ys = mnist.train.next_batch(batch_size)
            batch_xs = sess.run([data_batch])
            #print(batch_xs)
            #batch_xs = tf.reshape(batch_xs, [-1, n_input])
            # Run optimization op (backprop) and cost op (to get loss value)
            _, c = sess.run([optimizer, cost], feed_dict={X: batch_xs})
        # Display logs per epoch step
        if epoch % display_step == 0:
            print("Epoch:", '%04d' % (epoch+1),
                  "cost=", "{:.9f}".format(c))
    coord.request_stop()
    coord.join(threads)
    print("Optimization Finished!")

Unfortunately, when running the code I get this error: ValueError: Cannot feed value of shape (1, 128, 29, 29, 1) for Tensor 'Placeholder:0', which has shape '(?, 841)'

My first question is why I get tensors of shape (1, 128, 29, 29, 1) when I was expecting (128, 29, 29, 1). Am I missing something here?

I also do not understand the following code, and how I could change it so that it evaluates against my own data set:

# Applying encode and decode over test set 
encode_decode = sess.run(
    y_pred, feed_dict={X: mnist.test.images[:examples_to_show]}) 

As far as I understand, this code runs the y_pred part of the graph, feeding the first 10 test images to the previously defined placeholder X. If I were to use a second data queue for my test images (also 29x29), how would I feed these into the dictionary above? For example, could a data_batch_eval be defined with my code as follows?

data_batch_eval = data_reader.batch_generator([DATA_PATH_EVAL],29**2) # eval set 

Still, how would I then extract the first 10 test images to feed the dictionary? My rough attempt is sketched below.
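
This is roughly what I have in mind (just a sketch, assuming the eval queue is built exactly like the training one and that the batch has to be flattened to 841 values per image; it reuses the names from the script above):

with tf.Session() as sess:
    coord = tf.train.Coordinator()
    threads = tf.train.start_queue_runners(coord=coord)
    # pull one shuffled batch from the eval queue, expected shape (128, 29, 29, 1)
    eval_batch = np.asarray(sess.run(data_batch_eval))
    # flatten each image to 841 values and keep only the first 10
    eval_images = eval_batch.reshape(-1, 29**2)[:examples_to_show]
    encode_decode = sess.run(y_pred, feed_dict={X: eval_images})
    coord.request_stop()
    coord.join(threads)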


What is the purpose of your 'features' variable? It looks like you create it but never use it. – Anis


I simply forgot to comment it out. It needs to be removed. – divined

Answer


My first question is why do I have Tensors of shape (1, 128, 29, 29, 1) when I was expecting (128,29,29,1)? Am I missing something here?

You need to remove the brackets in sess.run:

batch_xs = sess.run(data_batch) 

Unfortunately, when running the code I get this error: ValueError: Cannot feed value of shape (1, 128, 29, 29, 1) for Tensor 'Placeholder:0', which has shape '(?, 841)'

You have declared the placeholder X with shape [None, 841] but are feeding it an input of shape [128, 29, 29, 1]:

X = tf.placeholder("float", [None, n_input]) 

Either change your feed input or your placeholder so that both have the same size.
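
For example, a minimal sketch of the first option inside your training loop (assuming the fetched batch is a uint8 array of shape (128, 29, 29, 1)):

batch_xs = sess.run(data_batch)                   # no extra brackets around data_batch
batch_xs = np.asarray(batch_xs).reshape(-1, 841)  # flatten each 29x29x1 image to 841 values
_, c = sess.run([optimizer, cost], feed_dict={X: batch_xs})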

Note: your handling of the queue is inefficient; you should pass data_batch directly as input to your network rather than going through the feeding mechanism.
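
A rough sketch of what that could look like with your names (assuming data_batch is the (128, 29, 29, 1) uint8 tensor returned by batch_generator):

# reshape and cast the queue output so it can take the place of the placeholder X
# (optionally also divide by 255.0 to scale the inputs to [0, 1] for the sigmoids)
images = tf.cast(tf.reshape(data_batch, [-1, n_input]), tf.float32)

encoder_op = encoder(images)
decoder_op = decoder(encoder_op)
cost = tf.reduce_mean(tf.pow(images - decoder_op, 2))
optimizer = tf.train.RMSPropOptimizer(learning_rate).minimize(cost)

# inside the session the queue feeds the graph directly, so no feed_dict is needed:
# _, c = sess.run([optimizer, cost])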


Thank you for your answer. It worked by removing the brackets and reshaping to [29**2]. Unfortunately, it is extremely slow for such a simple network. It seems I need to use the feed-in mechanism. – divined


Yes, as I said in the note, your handling of the queue is wrong. Pass 'data_batch' directly to the network: encoder_op = encoder(data_batch). If you want to use both the queue and feed_dict at the same time (e.g. during inference), you can use 'tf.placeholder_with_default()'. Anyway, that is an entirely different problem; open a new question if you still have issues and I will address it. –
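
For illustration, a sketch of the tf.placeholder_with_default() variant (again assuming data_batch is the (128, 29, 29, 1) uint8 tensor from batch_generator): during training nothing is fed and the queue output is used; at inference time a (num_images, 841) array can still be fed to override it.

queue_images = tf.cast(tf.reshape(data_batch, [-1, n_input]), tf.float32)

# defaults to the queue output when nothing is fed, but still accepts feed_dict
X = tf.placeholder_with_default(queue_images, shape=[None, n_input])

encoder_op = encoder(X)
decoder_op = decoder(encoder_op)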
