當前位置：首頁 > 编程语言 > python >内容正文

python

python操作hive数据库代码_python导出hive数据表的schema实例代码

發布時間：2024/8/23 python 22 豆豆

生活随笔收集整理的這篇文章主要介紹了 python操作hive数据库代码_python导出hive数据表的schema实例代码小編覺得挺不錯的,現在分享給大家,幫大家做個參考.

本文研究的主要問題是python語言導出hive數據表的schema，分享了實現代碼，具體如下。

為了避免運營提出無窮無盡的查詢需求，我們決定將有查詢價值的數據從mysql導入hive中，讓他們使用HUE這個開源工具進行查詢。想必他們對表結構不甚了解，還需要為之提供一個表結構說明，于是編寫了一個腳本，從hive數據庫中將每張表的字段即類型查詢出來，代碼如下：

#coding=utf-8

import pyhs2

from xlwt import *

hiveconn = pyhs2.connect(host='10.46.77.120',

port=10000,

authMechanism='PLAIN',

user='hadoop',

database='hibiscus_data',

)

def create_excel():

sql = 'show tables'

tables = []

with hiveconn.cursor() as cursor:

cursor.execute(sql)

res = cursor.fetch()

for table in res:

tables.append(table[0])

tableinfo = []

for table in tables:

tableinfo.append(get_column_info(table))

create_excel_ex(tableinfo)

def create_excel_ex(tableinfo):

w = Workbook()

sheet = w.add_sheet(u'表結構')

row = 0

for info in tableinfo:

row = write_tale_info(info,sheet,row)

w.save('hive_schema.xls')

def write_tale_info(tableinfo,sheet,row):

print row

sheet.write_merge(row,row,0,2,tableinfo['table'])

row += 1

sheet.write(row,0,u'名稱')

sheet.write(row,1,u'類型')

sheet.write(row,2,u'解釋')

row += 1

fields = tableinfo['fields']

for field in fields:

sheet.write(row,0,field['name'])

sheet.write(row,1,field['type'])

row += 1

return row + 1

def get_column_info(table):

sql = 'desc {table}'.format(table=table)

info = {'table':table,'fields':[]}

with hiveconn.cursor() as cursor:

cursor.execute(sql)

res = cursor.fetch()

for item in res:

if item[0] == '':

break

info['fields'].append({'name':item[0],'type':item[1]})

return info

if __name__ == '__main__':

create_excel()

其實，我們的hive數據庫將所有的元數據存儲在了mysql當中，分析這些元數據也可以獲得表結構信息。

總結

以上就是本文關于python導出hive數據表的schema實例代碼的全部內容，希望對大家有所幫助。感興趣的朋友可以繼續參閱本站其他相關專題，如有不足之處，歡迎留言指出。感謝朋友們對本站的支持！

總結

以上是生活随笔為你收集整理的python操作hive数据库代码_python导出hive数据表的schema实例代码的全部內容，希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網站內容還不錯，歡迎將生活随笔推薦給好友。

上一篇： python是在linux系统下运行的吗
下一篇： python pd Series 添加行

3atv精品不卡视频,97人人超碰国产精品最新,中文字幕av一区二区三区人妻少妇,久久久精品波多野结衣,日韩一区二区三区精品

python

python操作hive数据库代码_python导出hive数据表的schema实例代码

總結