2016-11-16 203 views
-1

Remove near-duplicate items from a list in Python

I have a list, in no particular order. I want to remove the near-duplicate items from it.

I only want URLs (items) like this one in my list:

/watch?v=iOpJywrdCuQ&lc=z13ohzibgwq3yjw4u04cglzhxmioy5phrn00k 

I want to remove only URLs (items) like this one from my list:

/watch?v=iOpJywrdCuQ&lc=z13ohzibgwq3yjw4u04cglzhxmioy5phrn00k.1479303517063659 

urlList is the variable that holds the list:

/watch?v=iOpJywrdCuQ&lc=z12ntbeg5nvptj2pv04chz5roqfpyxpp1to0k 
/watch?v=iOpJywrdCuQ&lc=z13ohzibgwq3yjw4u04cglzhxmioy5phrn00k 
/watch?v=iOpJywrdCuQ&lc=z13ohzibgwq3yjw4u04cglzhxmioy5phrn00k.1479303517063659 
/watch?v=iOpJywrdCuQ&lc=z12zt3fagtymh3nta23dhpsohmjocfwqd04 
/watch?v=iOpJywrdCuQ&lc=z12mv3rgatfcf3jvq22tuzgzcmfadbtvi 
/watch?v=iOpJywrdCuQ&lc=z13ohrfy3mb5xphyh04cd3shfq2bzhr5up00k 
/watch?v=iOpJywrdCuQ&lc=z13ohrfy3mb5xphyh04cd3shfq2bzhr5up00k.1475086029972480 
/watch?v=iOpJywrdCuQ&lc=z13ohrfy3mb5xphyh04cd3shfq2bzhr5up00k.1475086070389366 
/watch?v=iOpJywrdCuQ&lc=z13qulw4jvafgxo1322hdfnitwv5u35d0 
/watch?v=iOpJywrdCuQ&lc=z13qulw4jvafgxo1322hdfnitwv5u35d0.1478197028336117 
/watch?v=iOpJywrdCuQ&lc=z12xxv3r3py3jv5en23vjft4gxb3jhnkx 
/watch?v=iOpJywrdCuQ&lc=z12xxv3r3py3jv5en23vjft4gxb3jhnkx.1478703511076217 
/watch?v=iOpJywrdCuQ&lc=z12fzxwqwlr4xdkao04cflhbawivd1gq2jc 
/watch?v=iOpJywrdCuQ&lc=z12oczcbksudef3cq23psfd4cqjycro5e04 
/watch?v=iOpJywrdCuQ&lc=z12oczcbksudef3cq23psfd4cqjycro5e04.1478790261949273 
/watch?v=iOpJywrdCuQ&lc=z135h1sxqprthvfdl23hsh2qdrmggpzsc 

What I want:

/watch?v=iOpJywrdCuQ&lc=z12ntbeg5nvptj2pv04chz5roqfpyxpp1to0k 
/watch?v=iOpJywrdCuQ&lc=z13ohzibgwq3yjw4u04cglzhxmioy5phrn00k 
/watch?v=iOpJywrdCuQ&lc=z12zt3fagtymh3nta23dhpsohmjocfwqd04 
/watch?v=iOpJywrdCuQ&lc=z12mv3rgatfcf3jvq22tuzgzcmfadbtvi 
/watch?v=iOpJywrdCuQ&lc=z13ohrfy3mb5xphyh04cd3shfq2bzhr5up00k 
/watch?v=iOpJywrdCuQ&lc=z13qulw4jvafgxo1322hdfnitwv5u35d0 
/watch?v=iOpJywrdCuQ&lc=z12xxv3r3py3jv5en23vjft4gxb3jhnkx 
/watch?v=iOpJywrdCuQ&lc=z12fzxwqwlr4xdkao04cflhbawivd1gq2jc 
/watch?v=iOpJywrdCuQ&lc=z12oczcbksudef3cq23psfd4cqjycro5e04 
/watch?v=iOpJywrdCuQ&lc=z135h1sxqprthvfdl23hsh2qdrmggpzsc 

How would I do that?

+4

Discard everything from the '.' onward in each string, then put the results in a set.
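The suggestion in this comment could be sketched roughly as follows. The function name dedupe_by_comment_id is hypothetical, and the sketch assumes that the only '.' in each URL is the one that introduces the reply suffix, as in the sample data. A plain set does not keep order, so the set is used here only to track what has already been seen, and the results are collected in a list.

```python
def dedupe_by_comment_id(urls):
    """Keep one URL per top-level comment, in first-seen order.

    Assumption: the text before the first '.' identifies the comment;
    anything after it is a reply suffix.
    """
    seen = set()
    result = []
    for url in urls:
        # strip() removes the trailing spaces present in the sample data
        base = url.strip().split('.', 1)[0]
        if base and base not in seen:
            seen.add(base)
            result.append(base)
    return result


urlList = [
    "/watch?v=iOpJywrdCuQ&lc=z13ohzibgwq3yjw4u04cglzhxmioy5phrn00k",
    "/watch?v=iOpJywrdCuQ&lc=z13ohzibgwq3yjw4u04cglzhxmioy5phrn00k.1479303517063659",
    "/watch?v=iOpJywrdCuQ&lc=z12zt3fagtymh3nta23dhpsohmjocfwqd04",
]

for url in dedupe_by_comment_id(urlList):
    print(url)
```

Unlike simply dropping every URL that contains a '.', this keeps a comment even when only one of its reply URLs appears in the list.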

Answers

0

Test each line for the . character with a regular expression (written as \. because a bare . is reserved to mean any char):

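import re
without_dot = []
# 'file.txt' is expected to hold one URL per line
with open('file.txt') as f:
    for x in f.read().split('\n'):
        # re.search scans the whole string; re.match only tests the start
        if not re.search(r'\.', x):
            without_dot.append(x)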
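Since the question already holds the URLs in urlList, the same filter can probably be applied to the list directly, without a file or a regular expression. This is a minimal sketch, assuming every unwanted URL contains a '.' and no wanted URL does; the helper name drop_reply_urls is hypothetical.

```python
def drop_reply_urls(urls):
    """Return the URLs that contain no '.', in their original order."""
    cleaned = (u.strip() for u in urls)  # sample data has trailing spaces
    return [u for u in cleaned if u and '.' not in u]


urlList = [
    "/watch?v=iOpJywrdCuQ&lc=z13qulw4jvafgxo1322hdfnitwv5u35d0 ",
    "/watch?v=iOpJywrdCuQ&lc=z13qulw4jvafgxo1322hdfnitwv5u35d0.1478197028336117 ",
    "/watch?v=iOpJywrdCuQ&lc=z135h1sxqprthvfdl23hsh2qdrmggpzsc ",
]

urlList = drop_reply_urls(urlList)
for url in urlList:
    print(url)
```

For a fixed single character like this, the substring test with in expresses the same check as the regular expression and avoids the need to escape the dot.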