With the recent version of markdownify, the library no longer converts HTML content into proper Markdown. Instead, it often returns the original HTML as-is. This is especially noticeable when handling HTML extracted from PDFs, where the output lacks the expected Markdown formatting.
Also, to clarify: if HTML is extracted from a PDF, it won't automatically be converted to Markdown. This type of recursive processing is currently not automatic.
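Since that second pass is not automatic, one workaround is to run it explicitly. The following is only a rough sketch, not how the library behaves internally: `looks_like_html` and `to_markdown` are hypothetical helper names, the tag-matching regex is a crude heuristic, and the HTML-to-Markdown converter is passed in as a parameter rather than assumed.

```python
import re
from typing import Callable

# Crude heuristic (an assumption, not part of any library): text counts as
# HTML if it contains something shaped like an opening or closing tag.
_TAG_RE = re.compile(r"<\s*/?\s*[a-zA-Z][^>]*>")


def looks_like_html(text: str) -> bool:
    """Return True if the text appears to contain at least one HTML tag."""
    return bool(_TAG_RE.search(text))


def to_markdown(extracted: str, html_to_md: Callable[[str], str]) -> str:
    """Explicit second pass: convert only when the extracted text still holds HTML.

    `html_to_md` is whatever HTML-to-Markdown function you choose to supply.
    """
    if looks_like_html(extracted):
        return html_to_md(extracted)
    return extracted
```

If the `markdownify` package is installed, its `markdownify` function could be passed as `html_to_md`, for example `to_markdown(pdf_text, markdownify)`, where `pdf_text` stands for whatever your PDF extraction step returned.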