2016-09-29 63 views

Answer

5

Change the preprocessing pipeline to read from a BigQuerySource, using the same Features class as in the CSV sample. Here is an example:

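import apache_beam as beam
import google.cloud.ml as ml  # Cloud ML SDK module, assumed to be imported as in the CSV sample

# CsvFeatures is the Features class from the CSV sample; `pipeline` is an existing beam.Pipeline.
feature_set = CsvFeatures()
train_query = 'SELECT …'
valid_query = 'SELECT …'
# Read the training and validation rows from BigQuery instead of from CSV files.
train = pipeline | 'read_train' >> beam.Read(beam.io.BigQuerySource(query=train_query))
eval = pipeline | 'read_valid' >> beam.Read(beam.io.BigQuerySource(query=valid_query))
(metadata, train_features, eval_features) = ((train, eval) |
    ml.Preprocess('Preprocess', feature_set))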