2016-12-23 99 views
1

我正在嘗試將csv文件讀入數據框以便在生成的sql select語句中使用這些字段。數據是這樣的:循環遍歷數據幀的行

  0 1  2  3  4   5   6 
0 status1 A0 NaN NaN  3 Customer1 customer Id 
1 status2 A1 NaN NaN  3 Customer2 customer Id 
2 status3 A2 NaN 1253  3 Customer3 customer Id 
3 status4 A3 27.0 L0  12 Customer4 customer Id 
4 status5 A3 30.0 NaN  12 Customer5 customer Id 

,當我遍歷每一行和嘗試插入域到字符串:

for row in M: 
print("(Select '{}' as disposition, '{}' as category_code, '{}' as status_code, '{:02f}' as Payer_reason_code, {} as precedence, {} as source_id, '{}' as reco_id) union" 
      .format(M[row][0], M[row][1], M[row][2], M[row][3], M[row][4], M[row][5], M[row][6])) 

引發此錯誤:

line 14, in print("(Select '{}' as disposition, '{}' as category_code, '{}' as status_code, '{:02f}' as Payer_reason_code, {} as precedence, {} as source_id, '{}' as reco_id) union".format(M[row][0], M[row][1], M[row][2], M[row][3], M[row][4], M[row][5], M[row][6])) IndexError: arrays used as indices must be of integer (or boolean) type

如何我是否在一個numpy二維數組上循環?

這裏是完整的腳本:

import pandas as pd 
import os 
import numpy as np 

path = '../resources' 

X = pd.read_csv('../resources/data.csv', header=None).as_matrix() 

for row in X: 
    print("(Select '{}' as disposition, '{}' as category_code, '{}' as status_code, '{:02f}' as Payer_reason_code, {} as precedence, {} as source_id, '{}' as reco_id) union".format(X[row][0], X[row][1], X[row][2], X[row][3], X[row][4], X[row][5], X[row][6])) 
+0

這是大熊貓不NumPy的。 –

+1

如果你想要numpy,使用'X.values'。 –

+0

[通過熊貓數據幀循環]可能的重複(http://stackoverflow.com/questions/34958526/looping-through-a-pandas-dataframe) –

回答

0

您無法通過這樣的一個數據幀進行迭代。

for i, row in M.iterrows(): 
    print("(Select '{}' as disposition, '{}' as category_code, '{}' as status_code, '{:02f}' as Payer_reason_code, {} as precedence, {} as source_id, '{}' as reco_id) union" 
     .format(row[0], row[1], row[2], row[3], row[4], row[5], row[6]))