python识别图片文字

大预言家

已于 2022-03-24 10:30:33 修改

阅读量4.3k

点赞数

CC 4.0 BY-SA版权

分类专栏：实践文章标签： python

于 2022-03-23 21:51:04 首次发布

本文链接：https://blog.csdn.net/silent_flower/article/details/123697273

实践专栏收录该内容

7 篇文章

订阅专栏

该篇博客介绍了如何使用Python结合OpenCV和pytesseract库来识别图像中的文字。首先，通过OpenCV读取并处理图片，包括转化为灰度图、去噪、二值化等步骤，然后利用pytesseract将处理后的图像转换为可读的字符串。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

python识别图片文字

from PIL import Image
import pytesseract
import cv2
import numpy as np

src_path=""

def get_string(img_path):
	# 读取图片
	img = cv2.imread(img_path)
	# 转化为灰度图
	img = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
	# apply dilation and erosion to remove some noise
	kernel = np.ones((1,1), np.uint8)
	img = cv2.dilate(img, kernel, iterations=1)
	img = cv2.erode(img, kernel, iterations=1)
	cv2.imwrite(src_path + "removed_noise.png", img)
	# Apply threshold to get image with only black and white
	img = cv2.adaptiveThreshold(img, 255, cv2.ADAPTIVE_THRESH_GANSSIAN_C, cv2.THRESH_BINARY, 11, 2)
	cv2.imwrite(src_path + "thres.png", img)
	
	result = pytesseract.image_to_string(Image.open(src_path + "thres.png"))
	return result

print get_string(src_path + "1.png")