From 39a055aa946ab32e50b3c1b014e44495da17e833 Mon Sep 17 00:00:00 2001
From: Viktor
Date: Sat, 7 Jun 2025 14:01:01 +0200
Subject: [PATCH] Update ROADMAP.md

---
 ROADMAP.md | 16 ++++++++--------
 1 file changed, 8 insertions(+), 8 deletions(-)

diff --git a/ROADMAP.md b/ROADMAP.md
index eecfc6ceb..2bb202fcd 100644
--- a/ROADMAP.md
+++ b/ROADMAP.md
@@ -38,14 +38,6 @@ associated with each language added, at least a models file or two, as well as s
 
 It would be very helpful to find a speaker of a large language other than English to help in the fine tuning.
 
-## Support for binary formats like PDF
-
-The crawler needs to be modified to retain them, and the conversion logic needs to parse them.
-The documents database probably should have some sort of flag indicating it's a PDF as well.
-
-PDF parsing is known to be a bit of a security liability so some thought needs to be put in
-that direction as well.
-
 ## Custom ranking logic
 
 Stract does an interesting thing where they have configurable search filters.
@@ -66,6 +58,14 @@ One of the search engine's biggest limitations right now is that it does not ind
 
 # Completed
 
+## Support for binary formats like PDF (COMPLETED 2025-05)
+
+The crawler needs to be modified to retain them, and the conversion logic needs to parse them.
+The documents database probably should have some sort of flag indicating it's a PDF as well.
+
+PDF parsing is known to be a bit of a security liability so some thought needs to be put in
+that direction as well.
+
 ## Web Design Overhaul (COMPLETED 2025-01)
 
 The design is kinda clunky and hard to maintain, and needlessly outdated-looking.