1
1
mirror of https://github.com/MarginaliaSearch/MarginaliaSearch.git synced 2025-10-05 21:22:39 +02:00

Commit Graph

  • f1a71e9033 (ndp) Deprioritize tumblr in the visitation order master Viktor Lofgren 2025-10-05 12:17:46 +02:00
  • 7b525918c9 (ndp) Deprioritize tumblr in the visitation order Viktor Lofgren 2025-10-05 12:16:05 +02:00
  • 0f3aede66f (ndp) Clean up code Viktor Lofgren 2025-10-05 11:56:41 +02:00
  • 88236f3836 (ndp) Use mariadb syntax instead of sqlite syntax when querying mariadb Viktor Lofgren 2025-10-05 11:56:31 +02:00
  • ad31a22fbb (ndp) Refresh the ndp queue on restart Viktor Lofgren 2025-10-05 10:32:05 +02:00
  • 2785ae8241 (language) Further amend the docs to mention the language configuration files Viktor Lofgren 2025-10-05 09:04:12 +02:00
  • 1ed1f2f299 (language) Update documentation for the language processing function Viktor Lofgren 2025-10-04 11:20:24 +02:00
  • b7d3b67a1d (language) Fix language configuration stub for German to not use French stemming Viktor Lofgren 2025-10-02 10:15:30 +02:00
  • d28010b7e6 (search) Fix pagination in light mode Viktor Lofgren 2025-10-02 09:04:49 +02:00
  • 2689bd9eaa (chore) Update to Java 25 Viktor Lofgren 2025-10-02 09:04:25 +02:00
  • eebd4b40fc (chore) Update to Java 25 detached Viktor Lofgren 2025-09-30 21:41:45 +02:00
  • f6d5d7f196 (chore) Update to Java 25 Viktor Lofgren 2025-09-30 15:59:09 +02:00
  • abf1186fa7 Merge pull request #231 from johnvonessen/feature/configurable-crawler-timeouts Viktor 2025-09-30 13:47:07 +02:00
  • 94a77ebddf Fix timeout configuration test to expect exceptions for invalid values John Von Essen 2025-09-06 21:58:31 +00:00
  • 4e2f76a477 feat: Make crawler timeouts configurable via system.properties John Von Essen 2025-09-05 09:31:25 -04:00
  • 4cd1834938 Merge pull request #232 from johnvonessen/socks-support Viktor 2025-09-30 13:32:14 +02:00
  • 5cbbea67ed (docs) Update documentation with more appropriate best practices Viktor Lofgren 2025-09-30 13:31:23 +02:00
  • b688f15550 (proxy) Fix late binding of proxy configuration Viktor Lofgren 2025-09-30 11:48:43 +02:00
  • f55af8ef48 (boot) Explicitly stop ndp and ping processes at first boot Viktor Lofgren 2025-09-30 09:29:04 +02:00
  • 3406b3c1e9 (dom-sample) JSONL export of the dom sample data export-dom-sample-data Viktor Lofgren 2025-09-29 18:07:21 +02:00
  • f2eaf5188d (dom-sample) GUI and plumbing for export utility for DOM sample data Viktor Lofgren 2025-09-29 13:15:31 +02:00
  • adc815e282 (language) Add outcome of a simulation of the complete outcome of keyword extraction to the language processing tool Viktor Lofgren 2025-09-28 12:45:25 +02:00
  • ca8455e049 (live-capture) Use threads instead of FJP for coordination of sampling Viktor Lofgren 2025-09-25 10:13:32 +02:00
  • 4ea724d2cb (live-capture) Use threads instead of FJP for coordination of sampling Viktor Lofgren 2025-09-25 10:10:46 +02:00
  • 40600e7297 (live-capture) Use threads instead of FJP for coordination of sampling Viktor Lofgren 2025-09-25 10:10:05 +02:00
  • 7795742538 (live-capture) Use threads instead of FJP for coordination of sampling Viktor Lofgren 2025-09-25 10:06:12 +02:00
  • 82d33ce69b (assistant) Add domain coordination module Viktor Lofgren 2025-09-25 09:57:32 +02:00
  • e49cc5c244 (live-capture) Add domain coordination, make sampling parallel Viktor Lofgren 2025-09-25 09:55:50 +02:00
  • 0af389ad93 (live-capture) Use availability information to select domains for sampling more intelligently Viktor Lofgren 2025-09-24 18:22:37 +02:00
  • 48791f56bd (index) Put back Chesterton's fence Viktor Lofgren 2025-09-24 16:09:54 +02:00
  • be83726427 (query) Remove log noise from query service Viktor Lofgren 2025-09-24 16:06:01 +02:00
  • 708caa8791 (index) Update verbatim match handling to account for matches that span multiple tags Viktor Lofgren 2025-09-24 15:43:00 +02:00
  • 32394f42b9 (index) Update verbatim match handling to account for matches that span multiple tags Viktor Lofgren 2025-09-24 15:41:53 +02:00
  • b8e3445ce0 (index) Update verbatim match handling to account for matches that span multiple tags Viktor Lofgren 2025-09-24 15:22:50 +02:00
  • 17a78a7b7e (query) Remove obsolete code Viktor Lofgren 2025-09-24 15:03:08 +02:00
  • 5a75dd8093 (index) Update james cook test Viktor Lofgren 2025-09-24 15:02:13 +02:00
  • a9713347a0 (query) Submit all segmentations as optional matching groups Viktor Lofgren 2025-09-24 15:01:59 +02:00
  • 4694d36ed2 (index) Tweak ranking bonuses for partial matches Viktor Lofgren 2025-09-24 15:01:29 +02:00
  • 70bdd1f51e (index) Add test case for 'captain james cook' Viktor Lofgren 2025-09-24 13:27:07 +02:00
  • 187b4828e6 (index) Sort doc ids passed to re-ranking Viktor Lofgren 2025-09-24 13:26:53 +02:00
  • 93fc14dc94 (index) Add sanity assertions to SkipListReader Viktor Lofgren 2025-09-24 13:26:31 +02:00
  • fbfea8539b (refac) Merge IndexResultScoreCalculator into IndexResultRankingService Viktor Lofgren 2025-09-24 11:51:16 +02:00
  • 0929d77247 (chore) Remove vestigial Serializable annotation from a few core models Viktor Lofgren 2025-09-24 10:42:10 +02:00
  • db8f8c1f55 (index) Fix bitmask handling in HtmlFeature Viktor Lofgren 2025-09-23 10:15:01 +02:00
  • dcb2723386 (index) Fix broken test case in the "slow" collection Viktor Lofgren 2025-09-23 10:13:51 +02:00
  • 00c1f495f6 (index) Fix incorrect document flag bitmask handling Viktor Lofgren 2025-09-23 10:12:14 +02:00
  • 73a923983a (language) Fix outdated test assertion Viktor Lofgren 2025-09-22 10:30:06 +02:00
  • e9ed0c5669 (language) Fix keyword pattern matching unicode handling Viktor Lofgren 2025-09-22 10:27:46 +02:00
  • 5b2bec6144 (search) Fix broken tests Viktor Lofgren 2025-09-22 10:17:38 +02:00
  • f26bb8e2b1 (loader) Clean up the code Viktor Lofgren 2025-09-22 10:14:54 +02:00
  • 4455495dc6 (system) Fix file loggers in the json config Viktor Lofgren 2025-09-21 19:02:18 +02:00
  • b84d17aa51 (system) Fix file loggers in the prod config Viktor Lofgren 2025-09-21 14:02:41 +02:00
  • 9d008390ae (language) Fix unicode issues in keyword extraction Viktor Lofgren 2025-09-21 13:54:01 +02:00
  • a40c2a8146 (index) Partition index journal by language to speed up index construction Viktor Lofgren 2025-09-21 13:53:43 +02:00
  • a3416bf48e (query) Fix timeout settings to use ms and not s Viktor Lofgren 2025-09-19 22:45:22 +02:00
  • ee2461d9fc (query) Fix timeout settings to use ms and not us Viktor Lofgren 2025-09-19 22:19:31 +02:00
  • 54c91a84e3 (query) Make the query client give up if the request exceeds its configured timeout by 50% Viktor Lofgren 2025-09-19 18:59:35 +02:00
  • a6371fc54c (query) Add a timeout to the query API Viktor Lofgren 2025-09-19 18:52:44 +02:00
  • 8faa9a572d (live-capture) Fix random puppeteer API churn Viktor Lofgren 2025-09-19 11:15:38 +02:00
  • fdce940263 (search) Fix redundant spam in <title> Viktor Lofgren 2025-09-19 10:20:14 +02:00
  • af8a13a7fb (index) Correct file name compatibility with previous versions Viktor Lofgren 2025-09-19 09:40:43 +02:00
  • 9e332de6b4 Merge pull request #223 from MarginaliaSearch/multilingual Viktor 2025-09-19 09:12:54 +02:00
  • d457bb5d44 (index) Fix index actor initialization multilingual Viktor Lofgren 2025-09-18 16:06:40 +02:00
  • c661ebb619 (refac) Move language-processing into functions Viktor Lofgren 2025-09-18 10:29:54 +02:00
  • 53e744398a Update gitignore to exclude eclipse-generated stuff Viktor Lofgren 2025-09-17 17:14:02 +02:00
  • 1d71baf3e5 (search) Display search query first in title Viktor Lofgren 2025-09-16 13:16:18 +02:00
  • bb5fc0f348 (language) Fix sketchy unicode handling in UnicodeNormalization Viktor Lofgren 2025-09-16 12:15:09 +02:00
  • c8f112d040 (lang+search) Clean up LanguageConfiguration initialization and LangCommandD Viktor Lofgren 2025-09-16 11:49:46 +02:00
  • ae31bc8498 (lang+search) Clean up LanguageConfiguration initialization and LangCommand Viktor Lofgren 2025-09-16 11:47:15 +02:00
  • da5046c3bf (lang) Remove language redirects for languages that are not configured Viktor Lofgren 2025-09-16 11:05:31 +02:00
  • f67257baf2 (lang) Remove lang:... keyword during LangCommand Viktor Lofgren 2025-09-16 11:01:11 +02:00
  • 924fb05661 (config) Fix language config pickup Viktor Lofgren 2025-09-16 10:43:27 +02:00
  • c231a82062 (search) Lang redirection works better if it's hooked in Viktor Lofgren 2025-09-16 10:40:24 +02:00
  • 2c1082d7f0 (search) Add notice about the current language selection to the UI Viktor Lofgren 2025-09-16 10:32:13 +02:00
  • 06947bd026 (search) Add redirect based on lang:-keyword in search query Viktor Lofgren 2025-09-16 10:00:20 +02:00
  • 519aebd7c6 (process) Make the use of zookeeper based domain coordination optional Viktor Lofgren 2025-09-15 19:13:57 +02:00
  • 42cc27586e (process) Reduce connection pool stats log spam Viktor Lofgren 2025-09-15 18:51:43 +02:00
  • 360881fafd (setup) Pull POS tags from control svc on first boot Viktor Lofgren 2025-09-15 10:05:17 +02:00
  • 4c6fdf6ebe (language) Make language configuration configurable Viktor Lofgren 2025-09-15 09:54:57 +02:00
  • 554de21f68 (converter) Disable language keyword Viktor Lofgren 2025-09-15 09:49:04 +02:00
  • 00194acbfe (search) Add language chooser to UI, clean up search service code Viktor Lofgren 2025-09-13 12:40:42 +02:00
  • 97dabcefaa (search) Add language chooser to UI, clean up search service code Viktor Lofgren 2025-09-13 12:34:34 +02:00
  • cc790644d4 (search) Persist language choice in the search form Viktor Lofgren 2025-09-12 11:14:54 +02:00
  • 8f893ee6c0 (search) Add basic support for configuring query language to the search service Viktor Lofgren 2025-09-11 15:55:09 +02:00
  • 938721b793 (index) Backwards compatible loading of old words file in index loading Viktor Lofgren 2025-09-11 15:42:31 +02:00
  • f68bcefc75 (index) Correct index construction to use the correct files for Fwd index Viktor Lofgren 2025-09-09 11:21:48 +02:00
  • 164a646af6 Fix SOCKS proxy property propagation to spawned processes John Von Essen 2025-09-09 01:02:00 +00:00
  • 0cfd759f85 (deps) Upgrade slop to 0.17 for better skip performance and faster index construction times Viktor Lofgren 2025-09-08 18:02:34 +02:00
  • b53002200c (index) SkipListWriter should not be in APPEND mode Viktor Lofgren 2025-09-08 17:55:14 +02:00
  • 78246b9a63 (index) Fix journal language enumeration Viktor Lofgren 2025-09-08 15:38:26 +02:00
  • b552e79927 (language) Make LanguageConfiguration a Singleton to avoid duplicate initializations Viktor Lofgren 2025-09-08 13:24:18 +02:00
  • bffc159486 (language) Make unicode normalization configurable Viktor Lofgren 2025-09-08 13:18:58 +02:00
  • b8000721bd Implement proper SOCKS proxy support for HTTP Components v5 John Von Essen 2025-09-07 21:49:21 +00:00
  • 2ee0b0e420 Fix SOCKS proxy implementation for HTTP Components v5 John Von Essen 2025-09-06 21:39:20 +00:00
  • 1432fc87d7 (index) Test languages via integration test Viktor Lofgren 2025-09-06 20:11:41 +02:00
  • ec5f32b1d8 Add SOCKS proxy support for crawler processes John Von Essen 2025-09-05 10:42:58 -04:00
  • edd453531e (index) Partition keyword lexicons by language Viktor Lofgren 2025-09-04 17:24:48 +02:00
  • 096496ada1 (refac) Fold ft-anchor-keywords into converting-process Viktor Lofgren 2025-09-03 13:04:30 +02:00
  • 8ca6209260 (refac) Fold ft-anchor-keywords into converting-process Viktor Lofgren 2025-09-03 13:03:38 +02:00
  • 673c65d3c9 (refac) Fold term-frequency-dict into language-processing Viktor Lofgren 2025-09-03 12:55:01 +02:00