Python爬虫(五)
                                                            生活随笔
收集整理的這篇文章主要介紹了
                                Python爬虫(五)
小編覺得挺不錯的,現在分享給大家,幫大家做個參考.                        
                                源碼:
1 import requests 2 from lxml import etree 3 from my_mysql import MysqlConnect 4 5 6 mc = MysqlConnect('127.0.0.1','root','123456','homework') 7 sql = 'insert into lianjia(title,addr,shape,area,dire,price) values(%s,%s,%s,%s,%s,%s)' 8 for page in range(3): 9 url = 'https://bj.lianjia.com/zufang/pg{}rp2rp1/'.format(page) 10 response = requests.get(url) 11 html = etree.HTML(response.text) 12 li_list = html.xpath('//ul[@id="house-lst"]/li') 13 # print(li_list) 14 for li_ele in li_list: 15 title = li_ele.xpath('./div[2]/h2/a')[0].text 16 addr = li_ele.xpath('./div[2]/div[1]/div[1]/a/span')[0].text 17 shape = li_ele.xpath('./div[2]/div[1]/div[1]/span[1]/span')[0].text 18 area = li_ele.xpath('./div[2]/div[1]/div[1]/span[2]')[0].text 19 dire = li_ele.xpath('./div[2]/div[1]/div[1]/span[3]')[0].text 20 price = li_ele.xpath('./div[2]/div[2]/div[1]/span')[0].text 21 # print(title,addr,shape,area,price) 22 data = (title,addr,shape,area,dire,price) 23 print(data) 24 mc.exec_data(sql,data) 25 # break?
轉載于:https://www.cnblogs.com/zhxd-python/p/9501310.html
總結
以上是生活随笔為你收集整理的Python爬虫(五)的全部內容,希望文章能夠幫你解決所遇到的問題。
 
                            
                        - 上一篇: Statues CodeForces -
- 下一篇: loj10200. 「一本通 6.2 练
