600字范文,内容丰富有趣,生活中的好帮手!
600字范文 > Python调用百度OCR API实现图像文字识别

Python调用百度OCR API实现图像文字识别

时间:2019-11-07 08:35:52

相关推荐

Python调用百度OCR API实现图像文字识别

百度控制台地址:https://console./ai/#/ai/ocr/overview/index

直接通过urllib去按照API文档调用接口便可,唯一需要注意的是API文档里要求image在base64编码后进行urlencode,但是实际上只需要base64编码便可

# !/user/bin/env Python3# -*- coding:utf-8 -*-"""file:baidu_api.pycreate time:/4/10 15:14author:Loong Xudesc: 调用百度OCRapi实现文本识别"""import base64from urllib import parse, requestimport jsondef GetAccessToken(ak, sk):'''获取access_token代码:param ak:控制台应用API Key:param sk:控制台应用Secret Key:return:返回接口调用的access_token参数以及token的有效期(单位为秒)'''# client_id 为官网获取的AK, client_secret 为官网获取的SKhost = '/oauth/2.0/token?grant_type=client_credentials&client_id=%s&client_secret=%s' % (ak, sk)headers = {'Content-Type': 'application/json; charset=UTF-8'}req = request.Request(method='GET', url=host, headers=headers)response = request.urlopen(req)if (response.status == 200):data = json.loads(response.read().decode())access_token = data['access_token']expires_in = data['expires_in']return access_token, expires_indef RecogniseForm(access_token, image, templateSign=None, classifierId=None):"""自定义模板文字识别:param access_token::param image:图像数据(string),base64编码,注意大小不超过4M,最短边至少15px,最长边最大4096px,支持jpg/png/bmp格式:param templateSign:模板ID(string):param classifierId:分类器ID(int),这个参数与templateSign至少存在一个,优先使用templateSign,存在templateSign时,使用指定模板;如果没有templateSign而有classifierld,表示使用分类器去判断使用模板:return:返回识别结果"""host = '/rest/2.0/solution/v1/iocr/recognise?access_token=%s' % access_tokenheaders = {'Content-Type': 'application/x-www-form-urlencoded'}formdata = {'image': image}if templateSign is not None:formdata['templateSign'] = templateSignif classifierId is not None:formdata['classifierId'] = classifierIddata = parse.urlencode(formdata).encode('utf8')req = request.Request(method='POST', url=host, headers=headers, data=data)response = request.urlopen(req)if (response.status == 200):jobj = json.loads(response.read().decode())datas = jobj['data']recognise = {}for obj in datas['ret']:recognise[obj['word_name']] = obj['word']return recognisedef RecogniseGeneral(access_token, image=None, url=None, recognize_granularity='big', language_type='CHN_ENG',detect_direction=False, detect_language=False, vertexes_location=False, probability=False):'''通用文字识别(含位置信息):param access_token:URL参数,需要拼接到接口URL上:param image:图像数据,base64编码,要求base64编码后大小不超过4M,最短边至少15px,最长边最大4096px,支持jpg/png/bmp格式,当image字段存在时url字段失效:param url:图片完整URL,URL长度不超过1024字节,URL对应的图片base64编码后大小不超过4M,最短边至少15px,最长边最大4096px,支持jpg/png/bmp格式,当image字段存在时url字段失效,不支持https的图片链接:param recognize_granularity:是否定位单字符位置,big:不定位单字符位置,默认值;small:定位单字符位置:param language_type:识别语言类型,默认为CHN_ENG。可选值包括:- CHN_ENG:中英文混合;- ENG:英文;- POR:葡萄牙语;- FRE:法语;- GER:德语;- ITA:意大利语;- SPA:西班牙语;- RUS:俄语;- JAP:日语;- KOR:韩语:param detect_direction:是否检测图像朝向,默认不检测,即:false。朝向是指输入图像是正常方向、逆时针旋转90/180/270度。可选值包括:- true:检测朝向;- false:不检测朝向。:param detect_language:是否检测语言,默认不检测。当前支持(中文、英语、日语、韩语):param vertexes_location:是否返回文字外接多边形顶点位置,不支持单字位置。默认为false:param probability:是否返回识别结果中每一行的置信度:return:'''host = '/rest/2.0/ocr/v1/general?access_token=%s' % access_tokenheaders = {'Content-Type': 'application/x-www-form-urlencoded'}formdata = {'recognize_granularity': recognize_granularity, 'language_type': language_type,'detect_direction': detect_direction, 'detect_language': detect_language,'vertexes_location': vertexes_location, 'probability': probability}if image is not None:formdata['image'] = imageelif url is not None:formdata['url'] = urldata = parse.urlencode(formdata).encode('utf8')req = request.Request(method='POST', url=host, headers=headers, data=data)response = request.urlopen(req)if (response.status == 200):jobj = json.loads(response.read().decode())datas = jobj['words_result']recognise = {}for obj in datas:recognise[obj['words']] = obj['location']return recognisedef Recognise(img_path):access_token, expires_in = GetAccessToken(ak, sk) # 将此ak与sk替换成自己应用的值with open(file=img_path, mode='rb') as file:base64_data = base64.b64encode(file.read())# 调用iOCR自定义模板文字识别接口# recognise = RecogniseForm(access_token=access_token, image=base64_data,templateSign=templateSign) # 将此templateSign替换成自己设置的模板值recognise = RecogniseGeneral(access_token=access_token, image=base64_data)for k, v in recognise.items():print(k, v)Recognise(r'C:\Users\Administrator\Desktop\1.jpg')

本内容不代表本网观点和政治立场,如有侵犯你的权益请联系我们处理。
网友评论
网友评论仅供其表达个人看法,并不表明网站立场。