kotaemon

Files

Elias Judin 54320d08df feat(loader): implement markdown parsing in MathpixPDFReader (#498 )

* ✨ feat(loader): implement markdown parsing in MathpixPDFReader

Add functionality to properly handle PDF content:
- Add parse_markdown_text_to_tables method to separate tables and text
- Fix load_data implementation to properly process documents
- Fix lazy_load_data method
- Improve document metadata handling for tables and text sections

The loader now correctly processes PDFs through Mathpix API and converts content to proper Document objects.

* fix(loader): remove super() calls blocking MathpixPDFReader implementation

Remove early returns using super() in load_data and lazy_load_data methods that were preventing the actual implementation from being executed. This fixes the "not implemented" error while maintaining the full PDF reader functionality.

2024-12-17 17:30:17 +07:00

kotaemon

feat(loader): implement markdown parsing in MathpixPDFReader (#498 )

2024-12-17 17:30:17 +07:00

ktem

fix: using a base path breaks file preview (#575 )

2024-12-17 17:12:27 +07:00