python通过thrift操作hbase实例

thrift 是facebook开发并开源的一个二进制通讯中间件,通过thrift,我们可以充分利用各个语言的优势,编写高效的代码。

关于thrift的论文:http://pan.baidu.com/share/link?shareid=234128&uk=3238841275

安装thrift:http://thrift.apache.org/docs/install/ubuntu/

安装完成后到hbase的目录下,找到Hbase.thrift,该文件在

hbase-0.94.4/src/main/resources/org/apache/hadoop/hbase/thrift下可以找到

thrift –gen python hbase.thrift 会生成gen-py文件夹,将其修改成hbase

安装python的thrift库

sudo pip install thrift

启动hbase的thrift服务:bin/hbase-daemon.sh start thrift 默认端口是9090

创建hbase表:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
from thriftimport Thrift
from thrift.transportimport TSocket
from thrift.transportimport TTransport
from thrift.protocolimport TBinaryProtocol
 
from hbaseimport Hbase
from hbase.ttypesimport *
 
transport= TSocket.TSocket('localhost',9090);
 
transport= TTransport.TBufferedTransport(transport)
 
protocol= TBinaryProtocol.TBinaryProtocol(transport);
 
client= Hbase.Client(protocol)
transport.open()
 
 
contents= ColumnDescriptor(name='cf:', maxVersions=1)
client.createTable('test', [contents])
 
print client.getTableNames()

执行代码,成功后,进入hbase的shell,用命令list可以看到刚刚的test表已经创建成功。

插入数据:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
from thriftimport Thrift
from thrift.transportimport TSocket
from thrift.transportimport TTransport
from thrift.protocolimport TBinaryProtocol
 
from hbaseimport Hbase
 
from hbase.ttypesimport *
 
transport= TSocket.TSocket('localhost',9090)
 
transport= TTransport.TBufferedTransport(transport)
 
protocol= TBinaryProtocol.TBinaryProtocol(transport)
 
client= Hbase.Client(protocol)
 
transport.open()
 
row= 'row-key1'
 
mutations= [Mutation(column="cf:a", value="1")]
client.mutateRow('test', row, mutations,None)

获取一行数据:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
from thriftimport Thrift
from thrift.transportimport TSocket
from thrift.transportimport TTransport
from thrift.protocolimport TBinaryProtocol
 
from hbaseimport Hbase
from hbase.ttypesimport *
 
transport= TSocket.TSocket('localhost',9090)
transport= TTransport.TBufferedTransport(transport)
 
protocol= TBinaryProtocol.TBinaryProtocol(transport)
 
client= Hbase.Client(protocol)
 
transport.open()
 
tableName= 'test'
rowKey= 'row-key1'
 
result= client.getRow(tableName, rowKey,None)
print result
for rin result:
    print 'the row is ' , r.row
    print 'the values is ' , r.columns.get('cf:a').value

返回多行则需要使用scan:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
from thriftimport Thrift
from thrift.transportimport TSocket
from thrift.transportimport TTransport
from thrift.protocolimport TBinaryProtocol
 
from hbaseimport Hbase
from hbase.ttypesimport *
 
transport= TSocket.TSocket('localhost',9090)
transport= TTransport.TBufferedTransport(transport)
 
protocol= TBinaryProtocol.TBinaryProtocol(transport)
 
client= Hbase.Client(protocol)
transport.open()
 
scan= TScan()
tableName= 'test'
id = client.scannerOpenWithScan(tableName, scan,None)
 
result2= client.scannerGetList(id,10)
 
print result2

scannerGet则是每次只取一行数据:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
from thriftimport Thrift
from thrift.transportimport TSocket
from thrift.transportimport TTransport
from thrift.protocolimport TBinaryProtocol
 
from hbaseimport Hbase
from hbase.ttypesimport *
 
transport= TSocket.TSocket('localhost',9090)
transport= TTransport.TBufferedTransport(transport)
 
protocol= TBinaryProtocol.TBinaryProtocol(transport)
 
client= Hbase.Client(protocol)
transport.open()
 
scan= TScan()
tableName= 'test'
id = client.scannerOpenWithScan(tableName, scan,None)
result= client.scannerGet(id)
while result:
    print result
© 版权声明
THE END
喜欢就支持一下吧
点赞12 分享
评论 抢沙发
头像
欢迎您留下宝贵的见解!
提交
头像

昵称

取消
昵称表情代码图片

    暂无评论内容