Skip to content

图片无法转换 #114

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
hgmsq opened this issue Dec 18, 2024 · 14 comments
Open

图片无法转换 #114

hgmsq opened this issue Dec 18, 2024 · 14 comments

Comments

@hgmsq
Copy link

hgmsq commented Dec 18, 2024

测试结果图片不能转换到md文档里面

@SigireddyBalasai
Copy link
Contributor

make sure that you are using latest version

@hgmsq
Copy link
Author

hgmsq commented Dec 18, 2024

Today, I tested directly using the pip command to install dependency packages.

@SigireddyBalasai
Copy link
Contributor

Today, I tested directly using the pip command to install dependency packages.

so any problem ?

@SigireddyBalasai
Copy link
Contributor

if possible include full context your os the version of python you are using and the error message

@BlackPool888
Copy link

I also cannot output image content,I can only get the main idea of the images,oops~

@BlackPool888
Copy link

Python:3.12

e.g:

Image

@SigireddyBalasai
Copy link
Contributor

@BlackPool888 i think you are getting what is intended for giving image input you will get description

@BlackPool888
Copy link

@SigireddyBalasai Right,I get it now,I had expected to receive the main content,not just a description,thx

@KmBase
Copy link

KmBase commented Dec 18, 2024

#51 ,you can comment the code

@happy-xlf
Copy link

md = MarkItDown(llm_client=client, llm_model="gpt-4o") ,图片转md需要提供llm的接口才能识别

@neverlatetolearn0
Copy link

@SigireddyBalasai Right,I get it now,I had expected to receive the main content,not just a description,thx

This may require an OCR approach

@neverlatetolearn0
Copy link

md = MarkItDown(llm_client=client, llm_model="gpt-4o") ,图片转md需要提供llm的接口才能识别

This method can summarize the content of the picture, but can not identify the text in the picture, how to extract the text in the picture

@wking2014
Copy link

Insert the pictures in the PDF document into the corresponding positions in the Markdown document.
This cannot be achieved.

@zenoda
Copy link

zenoda commented May 19, 2025

md = MarkItDown(llm_client=client, llm_model="gpt-4o") ,图片转md需要提供llm的接口才能识别

This method can summarize the content of the picture, but can not identify the text in the picture, how to extract the text in the picture

I also need this feature!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

8 participants