Importing, Organizing, and Packaging Your Own .py Files in Python
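Contents
1. Simply importing a .py file you wrote yourself
2. Organizing several of your own .py files into external classes, and creating __init__.py
3. Packaging your own program as an external package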
1. Simply importing a .py file you wrote yourself
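Put a.py and b.py in the same project directory, then run import a inside b.py. Note that the import statement takes the module name, without the .py extension. After that, b.py can call the functions defined in a.py:
import a    # the module name, not the file name a.py
a.py()      # calls a function named py, assuming a.py defines one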
Reference: https://jingyan.baidu.com/article/08b6a591810daf14a8092204.html
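The step above can be tried out in a single script. This is only a minimal sketch: the content of a.py and the function name greet are made up for illustration, and a temporary directory stands in for the project directory.

```python
import importlib
import sys
import tempfile
from pathlib import Path

# Write a stand-in a.py into a temporary directory (hypothetical content).
project_dir = Path(tempfile.mkdtemp())
(project_dir / "a.py").write_text(
    "def greet(name):\n"
    "    return 'hello, ' + name\n",
    encoding="utf-8",
)

# When b.py is run directly, its own directory is on sys.path automatically.
# Here the temporary directory is added by hand to get the same effect.
sys.path.insert(0, str(project_dir))
importlib.invalidate_caches()

import a  # module name only, no ".py"

print(a.greet("b.py"))
```

In a real project no sys.path handling is needed as long as a.py sits next to the script being run.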
2. Organizing several of your own .py files into external classes, and creating __init__.py
Reference: https://github.com/fifths/python_baike_spider/tree/master/baike_spider
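The role of __init__.py can be illustrated with a small self-contained sketch. It assumes a hypothetical package name, demo_spider, and a stand-in html_downloader module whose download method does no network access; it is not the code of the referenced repository.

```python
import importlib
import sys
import tempfile
from pathlib import Path

root = Path(tempfile.mkdtemp())
package_dir = root / "demo_spider"  # hypothetical package name
package_dir.mkdir()

# An __init__.py, even an empty one, marks the directory as a regular package.
(package_dir / "__init__.py").write_text("", encoding="utf-8")

# Stand-in module: same class and method names as in this section,
# but the body only builds a string instead of fetching the URL.
(package_dir / "html_downloader.py").write_text(
    "class HtmlDownloader(object):\n"
    "    def download(self, url):\n"
    "        return None if url is None else 'downloaded ' + url\n",
    encoding="utf-8",
)

# Make the directory that CONTAINS the package importable.
sys.path.insert(0, str(root))
importlib.invalidate_caches()

from demo_spider import html_downloader

downloader = html_downloader.HtmlDownloader()
print(downloader.download("http://example.com"))
```

The import line has the same form as the from baike_spider import ... lines in the main script of this section: the package directory name comes first, then the module file name without .py.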
# html_downloader.py
import urllib.request


class HtmlDownloader(object):
    def download(self, url):
        if url is None:
            return None
        response = urllib.request.urlopen(url)
        if response.getcode() != 200:
            return None
        return response.read()
# html_parser.py
from bs4 import BeautifulSoup
import re
import urllib.parse


class HtmlParser(object):
    def _get_new_urls(self, page_url, soup):
        new_urls = set()
        links = soup.find_all('a', href=re.compile(r"/view/\d+\.htm"))
        for link in links:
            new_url = link['href']
            new_full_url = urllib.parse.urljoin(page_url, new_url)
            new_urls.add(new_full_url)
        return new_urls

    def _get_new_data(self, page_url, soup):
        res_data = {}
        # url
        res_data['url'] = page_url
        title_node = soup.find('dd', class_="lemmaWgt-lemmaTitle-title").find('h1')
        res_data['title'] = title_node.get_text()
        # lemma-summary
        summary_node = soup.find('div', class_="lemma-summary")
        res_data['summary'] = summary_node.get_text()
        return res_data

    def paser(self, page_url, html_cont):
        if page_url is None or html_cont is None:
            return
        soup = BeautifulSoup(html_cont, 'html.parser', from_encoding='utf-8')
        new_urls = self._get_new_urls(page_url, soup)
        new_data = self._get_new_data(page_url, soup)
        return new_urls, new_data
from baike_spider import url_manager
from baike_spider import html_downloader
from baike_spider import html_parser
from baike_spider import html_outputer


class SpiderMain(object):
    def __init__(self):
        self.urls = url_manager.UrlManager()
        self.downloader = html_downloader.HtmlDownloader()
        self.parser = html_parser.HtmlParser()
        self.outputer = html_outputer.HtmlOutputer()

    def craw(self, root_url):
        count = 1
        self.urls.add_new_url(root_url)
        while self.urls.has_new_url():
            try:
                new_url = self.urls.get_new_url()
                print("craw %d : %s" % (count, new_url))
                html_cont = self.downloader.download(new_url)
                new_urls, new_data = self.parser.paser(new_url, html_cont)
                self.urls.add_new_urls(new_urls)
                self.outputer.collect_data(new_data)
                if count == 10:
                    break
                count = count + 1
            except:
                print('craw failed')
        self.outputer.output_html()


if __name__ == '__main__':
    root_url = "http://baike.baidu.com/view/21087.htm"
    obj_spider = SpiderMain()
    obj_spider.craw(root_url)
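The main script calls url_manager.UrlManager, which is not listed in this article. Judging only from the four methods the script calls (add_new_url, add_new_urls, has_new_url, get_new_url), such a class might look roughly like the sketch below. The set-based bookkeeping and the attribute names new_urls and old_urls are assumptions, not the repository's actual code.

```python
class UrlManager(object):
    """Keeps track of URLs still to crawl and URLs already handed out."""

    def __init__(self):
        self.new_urls = set()  # waiting to be crawled (assumed attribute name)
        self.old_urls = set()  # already returned by get_new_url (assumed name)

    def add_new_url(self, url):
        if url is None:
            return
        # Skip URLs that are already queued or already crawled.
        if url not in self.new_urls and url not in self.old_urls:
            self.new_urls.add(url)

    def add_new_urls(self, urls):
        if not urls:
            return
        for url in urls:
            self.add_new_url(url)

    def has_new_url(self):
        return len(self.new_urls) != 0

    def get_new_url(self):
        # Move one URL from the waiting set to the crawled set.
        new_url = self.new_urls.pop()
        self.old_urls.add(new_url)
        return new_url


manager = UrlManager()
manager.add_new_urls(["http://example.com/a", "http://example.com/a"])
first = manager.get_new_url()
manager.add_new_url(first)    # ignored: this URL was already handed out
print(manager.has_new_url())  # False
```

Saved as url_manager.py inside the package directory, a class of this shape would satisfy the calls made in craw.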
3. Packaging your own program as an external package
References: http://www.zzvips.com/article/84558.html; https://www.cnblogs.com/smileyes/p/7657591.html; https://www.cnblogs.com/mangM/p/11619247.html
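The article gives only links for this step, so the following is a hedged sketch of one common approach, a setup.py based on setuptools, and not necessarily the method the linked pages describe. The project name and version number are placeholders. The script lays out a tiny project in a temporary directory so that it can be run without touching any real files.

```python
import tempfile
from pathlib import Path

# Content of a minimal setup.py; name and version are placeholders.
SETUP_PY = '''\
from setuptools import setup, find_packages

setup(
    name="baike_spider",
    version="0.1.0",
    packages=find_packages(),
)
'''


def make_project(root):
    """Create root/setup.py and root/baike_spider/__init__.py."""
    package_dir = root / "baike_spider"
    package_dir.mkdir(parents=True)
    (package_dir / "__init__.py").write_text("", encoding="utf-8")
    (root / "setup.py").write_text(SETUP_PY, encoding="utf-8")
    return root


project = make_project(Path(tempfile.mkdtemp()))
for path in sorted(project.rglob("*")):
    print(path.relative_to(project).as_posix())
```

With a layout like this, running pip install . from the project directory installs the package, after which import baike_spider works from any location and not only from the project directory.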