简书python爬虫权威_python爬虫 --- 简书评论
某些網站的一些數據是通過js加載的 ,所以爬取下來的數據拿不到,
找到評論的地址 .進行請求獲取評論數據
#coding=utf-8
import json
import requests
def requests_view(response):
import webbrowser
requests_url = response.url
base_url = '
' %(requests_url)base_url = base_url.encode('utf-8')
content = response.content.replace(b"
",base_url)tem_html = open('tmp.html','wb')
tem_html.write(content)
tem_html.close()
webbrowser.open_new_tab("tmp.html")
headers = {
"User-Agent": 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/55.0.2883.87 Safari/537.36'}
response = requests.get("https://www.jianshu.com/notes/26504955/comments?comment_id=&author_only=false&since_id=0&max_id=1586510606000&order_by=likes_count&page=1",headers=headers)
comments = json.loads(response.content)
if comments['comment_exist'] == True:
for item in comments['comments']:
print(item['user']['nickname'],item['compiled_content'])
總結
以上是生活随笔為你收集整理的简书python爬虫权威_python爬虫 --- 简书评论的全部內容,希望文章能夠幫你解決所遇到的問題。
- 上一篇: word文档只读怎么办 word文档无法
- 下一篇: 全志uboot修改_全志SDK编译问题解