Commit Graph

15 Commits

Author SHA1 Message Date
Shy
4dd5e70406 Fix #492 2025-01-23 21:40:37 -05:00
simon987
8fdb832c85 refactor index schema, remove sidecar parsing, remove TS 2023-09-05 18:59:18 -04:00
simon987
92478ec47c Remove debug statement 2023-07-13 21:12:43 -04:00
simon987
2596361af5 Use mupdf's OCR methods rather than raw tesseract, various fixes 2023-07-10 21:40:58 -04:00
simon987
610882112d Use WEBP to encode thumbnails 2023-05-20 13:12:12 -04:00
simon987
300c70883d Fixes and cleanup 2023-04-10 11:04:16 -04:00
simon987
fc36f33d52 use sqlite to save index, major thread pool refactor 2023-04-03 21:39:50 -04:00
simon987
f8abffba81 process pool mostly works, still WIP 2023-03-09 22:11:21 -05:00
simon987
2e3d648796 Update --thumbnail-quality argument, add documentation 2023-01-29 11:24:34 -05:00
simon987
16a4fb4874 Rework document IDs 2022-03-05 11:18:06 -05:00
simon987
3d4331b27d Add thumbnail-count option 2022-02-19 13:45:31 -05:00
simon987
ad95684771 Update --ocr-* args, enable OCR'ing images 2022-01-08 14:24:50 -05:00
simon987
255bc2d689 Tweak MIN_OCR_SIZE behavior, update gitignore 2022-01-08 10:33:02 -05:00
Yatao Li
94a5e0ac59 refactor: split ocr_extract_text from ebook 2022-01-07 23:20:35 +08:00
simon987
a41b5dcc1f Remove libscan git submodule 2021-11-07 09:30:14 -05:00