生活随笔
收集整理的這篇文章主要介紹了
爬虫笔记8实例淘宝商品比价爬虫
小編覺得挺不錯的,現在分享給大家,幫大家做個參考.
這個也不錯
import requests
import re
def getHTMLText(url
):headers
= {'cookie': 'miid=1296267545453648768; t=b4d385e2145f596a67961e4dd08e9a8f; cna=pqwcFXxbJjACAXWIA7AFEfA8; thw=cn; tracknick=tb487881011; lgc=tb487881011; _cc_=UIHiLt3xSw%3D%3D; tg=0; enc=%2FTqA3gAexHOKU0cyPYbSWM1pGS8vgnlEK3EMnkYd2T%2BlB%2BJh18hxryREG48c%2BYmdk7yfvbSMCBDQExP23eUm3w%3D%3D; hng=CN%7Czh-CN%7CCNY%7C156; x=e%3D1%26p%3D*%26s%3D0%26c%3D0%26f%3D0%26g%3D0%26t%3D0%26__ll%3D-1%26_ato%3D0; cookie2=19ef67fdfc3f433776e5e9cafaf6a8ea; v=0; _tb_token_=08b7e3e7e183; _m_h5_tk=62383241b06635c64b07942e50e47d9d_1562004576179; _m_h5_tk_enc=0465da475a8335f8fd8d9ef6bb280a71; unb=4235284520; sg=101; _l_g_=Ug%3D%3D; skt=c571ae590b7580cb; cookie1=AnQIvxj44XbyESoVNTVtwfJRB8W%2BbAPV%2BVZMWhAghjk%3D; csg=23f40375; uc3=vt3=F8dBy34cs3fc7ebsEqk%3D&id2=Vy67WD1MZomrsw%3D%3D&nk2=F5RBzeKtOazPVJc%3D&lg2=UtASsssmOIJ0bQ%3D%3D; existShop=MTU2MTk5NTE3MQ%3D%3D; dnk=tb487881011; _nk_=tb487881011; cookie17=Vy67WD1MZomrsw%3D%3D; mt=ci=21_1; uc1=cookie14=UoTaGdT0tHdY5w%3D%3D&lng=zh_CN&cookie16=VT5L2FSpNgq6fDudInPRgavC%2BQ%3D%3D&existShop=false&cookie21=VFC%2FuZ9aj3yE&tag=8&cookie15=UIHiLt3xD8xYTw%3D%3D&pas=0; whl=-1%260%260%261561995222497; isg=BHNzJqpkKgCWtOesccf13ZRUAnddACwkF8iwAyUQzxLJJJPGrXiXutG23hRvn19i; l=bBMxcfBPv539-OTkBOCanurza77OSIRYYuPzaNbMi_5K-6T_2qQOkAuQFF96Vj5Rs4YB4G2npwJ9-etkq','User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/74.0.3729.169 Safari/537.36'}try:r
=requests
.get
(url
,headers
=headers
)r
.raise_for_status
()r
.encoding
=r
.apparent_encoding
return r
.text
except:print('get失敗')return ""
def parsePage(ilt
,html
):try:plt
= re
.findall
(r
'\"view_price\"\:\"[\d\.]*"', html
)tlt
= re
.findall
(r
'\"raw_title\"\:\".*?"', html
)for i
in range(len(plt
)):price
= eval(plt
[i
].split
(':')[1])title
= eval(tlt
[i
].split
(':')[1])ilt
.append
([price
,title
])except:print("parsePage錯誤")return ""
def printGoodsList(ilt
):tplt
="{:4}\t{:8}\t{:16}"print(tplt
.format("序號","價格","名稱"))count
=0for g
in ilt
:count
= count
+1print(tplt
.format(count
,g
[0],g
[1]))def main():goods
='書包'depth
=3start_url
='https://s.taobao.com/search?q='+goodsinfoList
= []for i
in range(depth
):try:url
=start_url
+'&s='+str(44*i
)html
=getHTMLText
(url
)parsePage
(infoList
,html
)except:continueprintGoodsList
(infoList
)main
()
總結
以上是生活随笔為你收集整理的爬虫笔记8实例淘宝商品比价爬虫的全部內容,希望文章能夠幫你解決所遇到的問題。
如果覺得生活随笔網站內容還不錯,歡迎將生活随笔推薦給好友。