kotaemon/libs
Elias Judin 54320d08df
feat(loader): implement markdown parsing in MathpixPDFReader (#498)
*  feat(loader): implement markdown parsing in MathpixPDFReader

Add functionality to properly handle PDF content:
- Add parse_markdown_text_to_tables method to separate tables and text
- Fix load_data implementation to properly process documents
- Fix lazy_load_data method
- Improve document metadata handling for tables and text sections

The loader now correctly processes PDFs through Mathpix API and converts content to proper Document objects.

* fix(loader): remove super() calls blocking MathpixPDFReader implementation

Remove early returns using super() in load_data and lazy_load_data methods that were preventing the actual implementation from being executed. This fixes the "not implemented" error while maintaining the full PDF reader functionality.
2024-12-17 17:30:17 +07:00
..
kotaemon feat(loader): implement markdown parsing in MathpixPDFReader (#498) 2024-12-17 17:30:17 +07:00
ktem fix: using a base path breaks file preview (#575) 2024-12-17 17:12:27 +07:00