Issues: microsoft/markitdown
Issues list
PPTX: Extract images
enhancement
New feature or request
open for contribution
Invites open-source developers to contribute to the project.
#56
opened Dec 16, 2024 by
pstoeckle
updated Jan 29, 2025
Information about the markitdown library.
#308
opened Jan 31, 2025 by
wesleyrover
updated Jan 31, 2025
Docx Table of Contents is converted to broken anchors
#314
opened Feb 4, 2025 by
benaminc
updated Feb 4, 2025
Preserve tables, titles (structure) of PDF documents
enhancement
New feature or request
open for contribution
Invites open-source developers to contribute to the project.
#41
opened Dec 15, 2024 by
shcheklein
updated Feb 5, 2025
"Description" hardcoded in English when converting images to another language
#315
opened Feb 4, 2025 by
sglebs
updated Feb 9, 2025
'charmap' codec can't encode character '\u02da'
#151
opened Dec 19, 2024 by
dr-graviton
updated Feb 9, 2025
AttributeError: module 'pdfminer.high_level' has no attribute 'extract_text'
#297
opened Jan 20, 2025 by
subhrajit-mohanty
updated Feb 10, 2025
Support old .xls files
enhancement
New feature or request
open for contribution
Invites open-source developers to contribute to the project.
#137
opened Dec 19, 2024 by
scruel
updated Feb 10, 2025
convert_stream in case of xlsx and xls files is broken
#321
opened Feb 11, 2025 by
abab-dev
updated Feb 11, 2025
File support: chm support
enhancement
New feature or request
good first issue
Good for newcomers
#14
opened Dec 13, 2024 by
bewernick
updated Feb 16, 2025
Using markdown to develop some features
#1029
opened Feb 16, 2025 by
andreisaioc
updated Feb 16, 2025
PR SUBMITTED 331: PPTX shape groups are not accounted for, and text in these shapes is not included in the markdown
#332
opened Feb 13, 2025 by
C0dingMast3r
updated Feb 17, 2025
ChatGPT OCR results are generated in different languages
#864
opened Feb 16, 2025 by
tanreinama
updated Feb 18, 2025
Does markitdown have the ability to join lines into paragraphs?
#1037
opened Feb 19, 2025 by
DannyRavi
updated Feb 19, 2025
Exclude Hidden Sheets in Excel Conversion
#1073
opened Feb 28, 2025 by
matteo-tafuro
updated Feb 28, 2025
Run evaluation on OmniDocBench and Marker benchmark
#1081
opened Feb 28, 2025 by
dantetemplar
updated Mar 1, 2025