Python Idiom Chain 1: Scraping Four-Character Idioms
# coding=utf-8
import requests
import random
import xpinyin
from bs4 import BeautifulSoup

# URLs of the list pages to scrape (pages 1 through 17)
base_url = "http://www.chengyudaquan.net/feisizichengyu/sizichengyu/list_{}.html"
urls = [base_url.format(page) for page in range(1, 18)]

# Output file for the idioms, one per line; the with block closes it when done
with open('/tensorflow/py_aiplat_demo/data/ciyu.txt', 'w', encoding='utf-8') as w:
    for url in urls:
        response = requests.get(url)
        response.raise_for_status()
        response.encoding = response.apparent_encoding
        soup = BeautifulSoup(response.text, 'lxml')
        for link in soup.find_all("span", class_="mainlia1 wzbtlist"):
            # Keep only four-character idioms
            if len(link.text) == 4:
                w.write(link.text + '\n')

print("Data scraped successfully!")
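The length check in the script keeps any four-character string, so an entry with stray whitespace would be dropped and a four-character non-idiom (for example Latin text or punctuation) would be kept. Below is a minimal sketch of a stricter filter, assuming that idioms consist only of characters in the basic CJK Unified Ideographs block (U+4E00 to U+9FFF). The helper names `is_four_char_idiom` and `filter_idioms` are hypothetical and not part of the original script.

```python
def is_four_char_idiom(text):
    """Return True if text, after stripping whitespace, is exactly four CJK ideographs."""
    cleaned = text.strip()
    # Assumption: idioms use only the basic CJK Unified Ideographs block.
    return len(cleaned) == 4 and all('\u4e00' <= ch <= '\u9fff' for ch in cleaned)


def filter_idioms(candidates):
    """Keep four-character idioms, stripped and de-duplicated in first-seen order."""
    seen = set()
    result = []
    for text in candidates:
        cleaned = text.strip()
        if is_four_char_idiom(cleaned) and cleaned not in seen:
            seen.add(cleaned)
            result.append(cleaned)
    return result
```

In the scraping loop, the text of each matched `span` could be collected into a list and passed through `filter_idioms` before being written to the file, which would also avoid duplicate lines if the same idiom appears on more than one page.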