Commit Graph

93 Commits

Author SHA1 Message Date
simon987
c18557e360 Fix thumbnail copying for incremental index, fix incremental index when there are no new updates, add option for JSON logs output 2022-11-23 20:45:47 -05:00
simon987
84d9bf4323 Fix cmake libmobi build maybe 2022-04-17 12:23:45 -04:00
simon987
90aa90f3f3 Update antiword 2022-04-17 11:47:33 -04:00
simon987
901035da15 Build libmobi with cmake, update to 0.10 2022-04-15 16:01:40 -04:00
simon987
036ed9ea1e Update libmagic cmake things 2022-04-15 15:35:20 -04:00
simon987
474eb95aff Update antiword 2022-03-17 15:08:55 -04:00
simon987
acf7453057 Add test for large msdoc 2022-03-17 15:05:48 -04:00
simon987
c575fca91d Do not store duration or bitrate when the value is 0 or for images 2022-03-05 21:24:59 -05:00
simon987
e9f92330fd Cleanup macros 2022-03-05 11:18:07 -05:00
simon987
16a4fb4874 Rework document IDs 2022-03-05 11:18:06 -05:00
simon987
499eb2b2e4 Un-break raw file thumbnails 2022-03-05 11:18:05 -05:00
simon987
2882741926 Fix multiple content metadata bug (but without compilation error this time) 2022-02-20 10:52:22 -05:00
simon987
edba9b7917 Fix multiple content metadata bug 2022-02-20 10:43:34 -05:00
simon987
e89964d592 Fix antiword build 2022-02-20 09:37:24 -05:00
simon987
3d4331b27d Add thumbnail-count option 2022-02-19 13:45:31 -05:00
simon987
065146ff8a Docker fixes 2022-02-19 13:43:44 -05:00
simon987
ad95684771 Update --ocr-* args, enable OCR'ing images 2022-01-08 14:24:50 -05:00
simon987
b37e5a4ad4 Fix some warnings in media.c 2022-01-08 11:06:14 -05:00
simon987
15ae2190cf Fix tesseract lang validation, update README.md, fix tesseract memory leak 2022-01-08 11:04:52 -05:00
simon987
255bc2d689 Tweak MIN_OCR_SIZE behavior, update gitignore 2022-01-08 10:33:02 -05:00
simon987
cd2a44e016 Update ocr.h
Fix minimum image size validation in ocr_extract_text
2022-01-08 10:24:57 -05:00
Yatao Li
94a5e0ac59 refactor: split ocr_extract_text from ebook 2022-01-07 23:20:35 +08:00
simon987
81008d8936 Add --list-file argument 2021-12-29 18:54:13 -05:00
simon987
f2fd7ccf41 Fix raw parsing maybe, fix index picker css 2021-12-25 11:08:52 -05:00
simon987
08b2ca9d43 Update lcms -> lcms2 2021-11-12 11:29:50 -05:00
simon987
61ab68ce15 Update argparse repo URL 2021-11-07 09:42:17 -05:00
simon987
a41b5dcc1f Remove libscan git submodule 2021-11-07 09:30:14 -05:00
simon987
06f21d5f0f Remove libscan submodule 2021-11-07 09:17:02 -05:00
simon987
0887046b41 Fix sidecar files, better error handling in store_write 2021-09-20 20:34:05 -04:00
simon987
17fda1e540 Support for rewind buffer 2021-09-11 20:46:40 -04:00
simon987
34b363bfd8 Add argument to calculate checksums 2021-09-11 14:31:48 -04:00
simon987
7267d4bd2c Add basic JSON/NDJSON support 2021-09-07 08:14:32 -04:00
simon987
27560a82bb Basic support for WordPerfect files 2021-09-06 14:08:53 -04:00
simon987
f16ead1902 Parse page numbers from .docx files 2021-09-06 09:50:00 -04:00
simon987
f4e1d90a6b web UI rewrite, switch to ndjson.zst index format 2021-09-05 09:49:25 -04:00
simon987
9c0f3e0e31 Fix .docx segmentation fault 2021-08-16 17:50:54 -04:00
simon987
78f3c897e2 libscan version sync 2021-07-10 12:52:24 -04:00
simon987
3da2c8cae3 Update CI scripts, Dockerfiles, enable arm64 build again 2021-06-14 14:02:16 -04:00
simon987
c6fee7f6e2 update argparse 2021-06-13 09:41:18 -04:00
simon987
5b8c13fd13 Handle GPS metadata in the UI 2021-06-11 20:41:05 -04:00
simon987
81670ee107 Fix subtitle problems 2021-06-11 10:05:33 -04:00
simon987
f8d9b718c0 Fix memory leak in RAW parsing 2021-06-09 08:22:31 -04:00
simon987
6f5fdc2935 Fix for segfault in some comic files 2021-06-07 09:01:46 -04:00
simon987
a01f6dff1f Use 16-bit ints for meta keys (wip) 2021-06-07 08:40:12 -04:00
simon987
fc7f30d670 Add tests for subtitle 2021-05-05 16:10:55 -04:00
simon987
71f9dfcfe0 sync libscan 2021-05-05 14:21:01 -04:00
simon987
50771bd1dc Read subtitles from media files, fix bug in text_buffer 2021-03-26 19:48:16 -04:00
simon987
bc884e137c Change encoding for antiword PDF 2021-01-16 12:17:43 -05:00
simon987
ce1e241dea Workaround for UTF8 .doc files 2021-01-16 12:13:56 -05:00
simon987
f87eac1f90 Update submodules 2020-12-31 10:26:05 -05:00