-
Notifications
You must be signed in to change notification settings - Fork 3k
图片无法转换 #114
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
make sure that you are using latest version |
Today, I tested directly using the pip command to install dependency packages. |
so any problem ? |
if possible include full context your os the version of python you are using and the error message |
I also cannot output image content,I can only get the main idea of the images,oops~ |
@BlackPool888 i think you are getting what is intended for giving image input you will get description |
@SigireddyBalasai Right,I get it now,I had expected to receive the main content,not just a description,thx |
#51 ,you can comment the code |
md = MarkItDown(llm_client=client, llm_model="gpt-4o") ,图片转md需要提供llm的接口才能识别 |
This may require an OCR approach |
This method can summarize the content of the picture, but can not identify the text in the picture, how to extract the text in the picture |
Insert the pictures in the PDF document into the corresponding positions in the Markdown document. |
I also need this feature! |
测试结果图片不能转换到md文档里面
The text was updated successfully, but these errors were encountered: