kotaemon/knowledgehub
Tuan Anh Nguyen Dang (Tadashi_Cin) 6c3d614973 [AUR-432] Add layout-aware table parsing PDF reader (#27)
* add OCRReader, MathPixReader and ExcelReader

* update test case for ocr reader

* reformat

* minor fix
2023-09-26 15:52:44 +07:00
..
contribs [AUR-408] Export logs to Excel (#23) 2023-09-25 17:20:03 +07:00
docstores [AUR-338, AUR-406, AUR-407] Export pipeline to config for PromptUI. Construct PromptUI dynamically based on config. (#16) 2023-09-21 14:27:23 +07:00
documents [AUR-432] Add layout-aware table parsing PDF reader (#27) 2023-09-26 15:52:44 +07:00
embeddings [AUR-421] base output post-processor that works using regex. (#20) 2023-09-19 19:54:44 +07:00
llms [AUR-338, AUR-406, AUR-407] Export pipeline to config for PromptUI. Construct PromptUI dynamically based on config. (#16) 2023-09-21 14:27:23 +07:00
loaders [AUR-432] Add layout-aware table parsing PDF reader (#27) 2023-09-26 15:52:44 +07:00
pipelines [AUR-338, AUR-406, AUR-407] Export pipeline to config for PromptUI. Construct PromptUI dynamically based on config. (#16) 2023-09-21 14:27:23 +07:00
post_processing [AUR-390] Add prompt template and prompt component (#24) 2023-09-25 14:38:22 +07:00
prompt [AUR-390] Add prompt template and prompt component (#24) 2023-09-25 14:38:22 +07:00
vectorstores [AUR-430] Add test case for Chroma VectoStore save/load (#26) 2023-09-26 10:58:41 +07:00
__init__.py [AUR-385, AUR-388] Declare BaseComponent and decide LLM call interface (#2) 2023-08-29 15:47:12 +07:00
base.py [AUR-421] base output post-processor that works using regex. (#20) 2023-09-19 19:54:44 +07:00
cli.py Initiate repository 2023-08-16 14:56:48 +07:00
config.py Initiate repository 2023-08-16 14:56:48 +07:00
schema.py Initiate repository 2023-08-16 14:56:48 +07:00