karakeep

	Commit message	Author	Age	Files	Lines
*	fix: lower priority of mass admin actions	MohamedBassem	2026-02-04	1	-12/+33
*	fix: backfill old sessions and do queue backpressure (#2449)	Mohamed Bassem	2026-02-04	2	-1/+64
	* fix: backfill old sessions and do queue backpressure
	* fix typo
*	feat: Import workflow v3 (#2378)	Mohamed Bassem	2026-02-04	16	-375/+7576
	* feat: import workflow v3
	* batch stage
	* revert migration
	* cleanups
	* pr comments
	* move to models
	* add allowed workers
	* e2e tests
	* import list ids
	* add missing indices
	* merge test
	* more fixes
	* add resume/pause to UI
	* fix ui states
	* fix tests
	* simplify progress tracking
	* remove backpressure
	* fix list imports
	* fix race on claiming bookmarks
	* remove the codex file
*	feat: Add LLM-based OCR as alternative to Tesseract (#2442)	Mohamed Bassem	2026-02-01	2	-0/+18
	* feat(ocr): add LLM-based OCR support alongside Tesseract
	  Add support for using configured LLM inference providers (OpenAI or Ollama) for OCR text extraction from images as an alternative to Tesseract.
	  Changes:
	  - Add OCR_USE_LLM environment variable flag (default: false)
	  - Add buildOCRPrompt function for LLM-based text extraction
	  - Add readImageTextWithLLM function in asset preprocessing worker
	  - Update extractAndSaveImageText to route between Tesseract and LLM OCR
	  - Update documentation with the new configuration option
	  When OCR_USE_LLM is enabled, the system uses the configured inference model to extract text from images. If no inference provider is configured, it falls back to Tesseract.
	  https://claude.ai/code/session_01Y7h7kDAmqXKXEWDmWbVkDs
	* format
	---------
	Co-authored-by: Claude <noreply@anthropic.com>
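The routing between Tesseract and LLM OCR described in this commit could look roughly like the sketch below. It is only an illustration: `OcrConfig`, `OcrBackend`, and `chooseOcrBackend` are hypothetical names, and only the `OCR_USE_LLM` flag and the fallback to Tesseract come from the commit message.

```typescript
// Hypothetical config shape; only the OCR_USE_LLM flag name comes from the commit message.
interface OcrConfig {
  ocrUseLlm: boolean; // mirrors OCR_USE_LLM (default: false)
  inferenceProviderConfigured: boolean; // true when an inference provider (OpenAI or Ollama) is set up
}

type OcrBackend = "llm" | "tesseract";

// Use the LLM only when the flag is on AND a provider exists;
// in every other case fall back to Tesseract.
function chooseOcrBackend(config: OcrConfig): OcrBackend {
  if (config.ocrUseLlm && config.inferenceProviderConfigured) {
    return "llm";
  }
  return "tesseract";
}
```

With the flag enabled but no provider configured, this sketch returns `"tesseract"`, which matches the fallback the commit message describes.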
*	feat: batch meilisearch requests (#2441)	Mohamed Bassem	2026-02-01	4	-13/+205
	* feat: batch meilisearch requests
	* more fixes
*	fix(web): don't bundle tiktoken in client bundles	Mohamed Bassem	2026-02-01	3	-81/+90
*	feat: add support for redirectUrl after signup (#2439)	Mohamed Bassem	2026-02-01	6	-7/+153
	* feat: add support for redirectUrl after signup
	* pr review
	* more fixes
	* format
	* another fix
*	refactor: migrate trpc to the new react query integration mode (#2438)	Mohamed Bassem	2026-02-01	15	-407/+627
	* refactor: migrate trpc to the new react query integration mode
	* more fixes
	* more migrations
	* upgrade trpc client
*	chore: add an endpoint for propagating client configs to the mobile app	Mohamed Bassem	2026-02-01	3	-0/+41
*	refactor: lazy init background queues	Mohamed Bassem	2026-02-01	4	-48/+105
*	fix: use user's preferred language for manual summarization (#2429)	Mohamed Bassem	2026-01-28	1	-1/+9
*	feat(search): add tag: alias for # and ! alias for negation (#2425)	Mohamed Bassem	2026-01-26	2	-3/+142
	Add `tag:` as an alternative syntax to `#` for tag search queries, and `!` as an alternative to `-` for negating qualifiers. This provides more intuitive syntax options for users who prefer text-based qualifiers over special characters.
	Co-authored-by: Claude <noreply@anthropic.com>
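One way to picture the two aliases is as a rewrite of each search token into the older syntax. The sketch below is an assumption for illustration only: `normalizeToken` and `normalizeQuery` are hypothetical helpers, and the real search parser may handle the aliases quite differently.

```typescript
// Hypothetical helper: rewrites the alias forms into the original syntax.
function normalizeToken(token: string): string {
  let negated = false;
  let rest = token;
  // `!` and `-` both negate the qualifier that follows.
  if (rest.charAt(0) === "!" || rest.charAt(0) === "-") {
    negated = true;
    rest = rest.slice(1);
  }
  // `tag:name` is treated the same as `#name`.
  if (rest.indexOf("tag:") === 0) {
    rest = "#" + rest.slice("tag:".length);
  }
  return (negated ? "-" : "") + rest;
}

// Splits on whitespace, so quoted values containing spaces are out of scope for this sketch.
function normalizeQuery(query: string): string {
  return query
    .split(/\s+/)
    .filter((t) => t.length > 0)
    .map(normalizeToken)
    .join(" ");
}
```

Under these assumptions, `tag:news !is:fav` and `#news -is:fav` normalize to the same query.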
*	feat: disable karakeep 2025 wrapped	Mohamed Bassem	2026-01-19	1	-0/+8
*	feat: Add attachedBy field to update tags endpoint (#2281)	Mohamed Bassem	2026-01-18	5	-13/+198
	* feat: Add attachedBy field to updateTags endpoint
	  This change allows callers to specify the attachedBy field when updating tags on a bookmark. The field defaults to "human" if not provided, maintaining backward compatibility with existing code.
	  Changes:
	  - Added attachedBy field to zManipulatedTagSchema with default "human"
	  - Updated updateTags endpoint to use the specified attachedBy value
	  - Created mapping logic to correctly assign attachedBy to each tag
	* fix(cli): migrate bookmark source in migration command
	* fix
	* reduce queries
	---------
	Co-authored-by: Claude <noreply@anthropic.com>
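The defaulting behaviour described above can be sketched without the schema library. This is a plain TypeScript illustration, not the project's Zod schema: `TagInput`, `ResolvedTag`, and `resolveAttachedBy` are hypothetical names, and `"ai"` is an assumed second value (the commit message only names `"human"`).

```typescript
// "human" is the default named in the commit message; "ai" is an assumed second value.
type AttachedBy = "human" | "ai";

interface TagInput {
  tagName: string;
  attachedBy?: AttachedBy; // optional, so existing callers keep working
}

interface ResolvedTag {
  tagName: string;
  attachedBy: AttachedBy;
}

// Fill in the default for every tag that did not say who attached it.
function resolveAttachedBy(tags: TagInput[]): ResolvedTag[] {
  return tags.map(
    (tag): ResolvedTag => ({
      tagName: tag.tagName,
      attachedBy: tag.attachedBy === undefined ? "human" : tag.attachedBy,
    }),
  );
}
```

Callers that never send `attachedBy` get `"human"` for each tag, which is the backward-compatible behaviour the commit message describes.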
*	feat: track api key usage dates	Mohamed Bassem	2026-01-18	6	-1/+3049
*	feat(rules): add "Title Contains" condition to Rule Engine (#1670) (#2354)	Andrii Mokhovyk	2026-01-18	3	-0/+75
	* feat(rules): add "Title Contains" condition to Rule Engine (#1670)
	* feat(rules): hide title conditions for bookmark created trigger
	* fix typecheck
*	deps: upgrade react to 19.2.1	Mohamed Bassem	2026-01-15	1	-1/+1
*	feat: privacy-respecting bookmark debugger admin tool (#2373)	Mohamed Bassem	2026-01-11	4	-15/+592
	* fix: parallelize queue enqueues in bookmark routes
	* fix: guard meilisearch client init with mutex
	* feat: add bookmark debugging admin tool
	* more fixes
	* more fixes
	* more fixes
*	fix: depri mass admin actions	Mohamed Bassem	2026-01-10	1	-10/+31
*	fix: harden the restate implementation (#2370)	Mohamed Bassem	2026-01-10	8	-211/+611
	* fix: parallelize queue enqueues in bookmark routes
	* fix: guard meilisearch client init with mutex
	* fix: fix propagation of last error in restate
	* fix: don't fail invocations when the job fails
	* fix: add a timeout around the worker runner logic
	* fix: add leases to handle dangling semaphores
	* feat: separate dispatchers and runners
	* add a test
	* fix silent promise failure
*	fix: fix propagation of last error in restate	Mohamed Bassem	2026-01-10	1	-0/+1
*	fix: guard meilisearch client init with mutex	Mohamed Bassem	2026-01-10	1	-0/+12
*	fix: parallelize queue enqueues in bookmark routes	Mohamed Bassem	2026-01-10	1	-35/+42
*	feat: add openai service tier configuration option (#2339)	Robert Rosca	2026-01-03	2	-0/+10
*	feat: Add retry buttons for pending bookmarks in admin panel (#2341)	Mohamed Bassem	2026-01-03	1	-2/+2
*	fix: drop idProvider from restate hot path	Mohamed Bassem	2026-01-03	1	-2/+1
*	fix: Eliminate the O(n2) parsing of the netscape import parsing (#2338)	Mohamed Bassem	2026-01-03	2	-31/+348
	* fix: Eliminate the O(n2) parsing of the netscape import parsing
	* remove unneeded tests
*	release(cli,sdk): release cli and sdk v0.30	Mohamed Bassem	2026-01-01	2	-4/+14
*	fix: fix wrapped feature to only show bookmarks in 2025	Mohamed Bassem	2026-01-01	1	-1/+11
*	fix: don't switch the bookmark back to pending on recrawl	Mohamed Bassem	2026-01-01	1	-7/+0
*	fix: use the Ollama generate endpoint instead of chat (#2324)	Erik Tews	2026-01-01	1	-5/+4
	* Use the Ollama generate endpoint instead of chat
	  Ollama has two API endpoints for text generation. There is a chat endpoint for interactive, chat-like generation of text, and there is a generate endpoint that is used for one-shot prompts, such as summarization tasks and similar things. Karakeep used the chat endpoint, which resulted in odd summaries. This commit makes karakeep use the generate endpoint instead, which results in better and more compact summaries.
	* format
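The difference between the two Ollama endpoints can be seen in the request each one expects: `/api/chat` takes a list of `messages`, while `/api/generate` takes a single `prompt`. The sketch below only builds the request payloads and makes no network call; `OllamaRequest`, `buildChatRequest`, and `buildGenerateRequest` are hypothetical helpers, and the model name is a placeholder supplied by the caller.

```typescript
// Hypothetical container for an endpoint path plus its JSON body.
interface OllamaRequest {
  path: string;
  body: Record<string, unknown>;
}

// Chat endpoint: the prompt is wrapped as one user message in a conversation.
function buildChatRequest(model: string, prompt: string): OllamaRequest {
  return {
    path: "/api/chat",
    body: { model, messages: [{ role: "user", content: prompt }], stream: false },
  };
}

// Generate endpoint: the prompt is sent as-is, which suits one-shot tasks such as summarization.
function buildGenerateRequest(model: string, prompt: string): OllamaRequest {
  return {
    path: "/api/generate",
    body: { model, prompt, stream: false },
  };
}
```

Either body would be sent as JSON in a POST to the Ollama server; how Karakeep's client actually issues the request is not shown in this log.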
*	feat: add "URL Does Not Contain" condition to rule engine (#2280)	Mohamed Bassem	2025-12-30	3	-0/+31
	* feat: add "URL Does Not Contain" condition to rule engine
	  Add a new condition type `urlDoesNotContain` that allows users to create rules based on URLs that do NOT contain specific strings. This enables more flexible rule configurations, such as:
	  - Automatically adding bookmarks to a "Read Later" list if the URL does not contain "reddit.com" or "youtube.com"
	  Changes:
	  - Added `urlDoesNotContain` condition type to Zod schema
	  - Implemented evaluation logic in RuleEngine
	  - Added UI support in ConditionBuilder component
	  - Added translation key for new condition type
	  - Added test coverage for the new condition
	  Fixes #2259
	  🤖 Generated with [Claude Code](https://claude.com/claude-code)
	  Co-Authored-By: Mohamed Bassem <MohamedBassem@users.noreply.github.com>
	* fix type link
	---------
	Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com>
	Co-authored-by: Mohamed Bassem <MohamedBassem@users.noreply.github.com>
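The evaluation of the new condition can be sketched as a substring check with the result inverted. This is a minimal illustration, not the RuleEngine code: `UrlCondition`, `matchesUrlCondition`, and `shouldAddToReadLater` are hypothetical names, the `urlContains` counterpart is assumed, and the "Read Later" example is read as "contains neither string".

```typescript
// Hypothetical condition shapes; only the `urlDoesNotContain` name comes from the commit message.
type UrlCondition =
  | { type: "urlContains"; str: string }
  | { type: "urlDoesNotContain"; str: string };

// A "does not contain" condition is the negation of the substring check.
function matchesUrlCondition(condition: UrlCondition, url: string): boolean {
  const contains = url.indexOf(condition.str) !== -1;
  return condition.type === "urlContains" ? contains : !contains;
}

// The example rule from the commit message, assuming both conditions must hold.
function shouldAddToReadLater(url: string): boolean {
  const conditions: UrlCondition[] = [
    { type: "urlDoesNotContain", str: "reddit.com" },
    { type: "urlDoesNotContain", str: "youtube.com" },
  ];
  return conditions.every((c) => matchesUrlCondition(c, url));
}
```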
*	feat: 2025 wrapped (#2322)	Mohamed Bassem	2025-12-30	3	-1/+382
	* feat: 2025 wrapped
	* don't add wrapped for new users
*	ci: fix tests	Mohamed Bassem	2025-12-30	1	-1/+1
*	feat: change default for tag style to be title case with spaces	Mohamed Bassem	2025-12-30	4	-1/+3030
*	fix: more tagging tweaks	Mohamed Bassem	2025-12-29	1	-4/+3
*	fix: change prompt to better recognize error pages	Mohamed Bassem	2025-12-29	1	-3/+6
*	refactor: reduce duplication in compare-models tool	Mohamed Bassem	2025-12-29	1	-28/+79
*	chore: add tracing for email functions	Mohamed Bassem	2025-12-29	1	-115/+109
*	feat: Add open telemetry (#2318)	Mohamed Bassem	2025-12-29	8	-3/+376
	* feat: add OpenTelemetry tracing infrastructure
	  Introduce distributed tracing capabilities using OpenTelemetry:
	  - Add @opentelemetry packages to shared-server for tracing
	  - Create tracing utility module with span helpers (withSpan, addSpanEvent, etc.)
	  - Add tRPC middleware for automatic span creation on API calls
	  - Initialize tracing in API and workers entry points
	  - Add demo instrumentation to bookmark creation and crawler worker
	  - Add configuration options (OTEL_TRACING_ENABLED, OTEL_EXPORTER_OTLP_ENDPOINT, etc.)
	  - Document tracing configuration in environment variables docs
	  When enabled, traces are collected for tRPC calls, bookmark creation flow, and crawler operations, with support for any OTLP-compatible backend (Jaeger, Tempo, etc.)
	* refactor: remove tracing from workers for now
	  Keep tracing infrastructure but remove worker instrumentation:
	  - Remove tracing initialization from workers entry point
	  - Remove tracing instrumentation from crawler worker
	  - Fix formatting in tracing files
	  The tracing infrastructure remains available for future use.
	* add hono and next tracing
	* remove extra span logging
	* more fixes
	* update config
	* some fixes
	* upgrade packages
	* remove unneeded packages
	---------
	Co-authored-by: Claude <noreply@anthropic.com>
*	fix: reset tagging status on crawl failure (#2316)	Mohamed Bassem	2025-12-29	7	-3/+3057
	* feat: add the ability to specify a different changelog version
	* fix: reset tagging status on crawl failure
	* fix missing crawlStatus in loadMulti
*	feat: add the ability to specify a different changelog version	Mohamed Bassem	2025-12-29	1	-0/+2
*	feat: add customizable tag styles (#2312)	Mohamed Bassem	2025-12-27	9	-5/+3121
	* feat: add customizable tag styles
	* add tag lang setting
	* ui settings cleanup
	* fix migration
	* change look of the field
	* more fixes
	* fix tests
*	feat: add Matter import support (#2245)	Moondragon85	2025-12-27	1	-0/+50
	* Matter import
	* use zod
	* fix date parsing
	---------
	Co-authored-by: Mohamed Bassem <me@mbassem.com>
*	feat: support archiving as pdf (#2309)	Mohamed Bassem	2025-12-27	8	-0/+25
	* feat: support archiving as pdf
	* add support for manually triggering pdf downloads
	* fix submenu
	* menu cleanup
	* fix store pdf
*	feat: add OPENAI_PROXY_URL configuration and support for proxy in OpenAI client (#2231)	rzxczxc	2025-12-27	2	-0/+12
	* Add OPENAI_PROXY_URL configuration and support for proxy in OpenAIInferenceClient
	* docs: add OPENAI_PROXY_URL configuration for proxy support in OpenAI API requests
	* format
	---------
	Co-authored-by: Mohamed Bassem <me@mbassem.com>
*	fix(tests): fix the asset upload tests	Mohamed Bassem	2025-12-27	4	-21/+58
*	fix: reject spoofed content types on uploads	Mohamed Bassem	2025-12-27	2	-1/+12
*	fix(restate): change journal retention for services to 3d	Mohamed Bassem	2025-12-25	1	-0/+3
*	fix: preserve failure count when rescheduling rate limited domains (#2303)	Mohamed Bassem	2025-12-25	4	-10/+171
	* fix: preserve retry count when rate-limited jobs are rescheduled
	  Previously, when a domain was rate-limited in the crawler worker, the job would be re-enqueued as a new job, which reset the failure count. This meant rate-limited jobs could retry indefinitely without respecting the max retry limit.
	  This commit introduces a RateLimitRetryError exception that signals the queue system to retry the job after a delay without counting it as a failed attempt. The job is retried within the same invocation, preserving the original retry count.
	  Changes:
	  - Add RateLimitRetryError class to shared/queueing.ts
	  - Update crawler worker to throw RateLimitRetryError instead of re-enqueuing
	  - Update Restate queue service to handle RateLimitRetryError with delay
	  - Update Liteque queue wrapper to handle RateLimitRetryError with delay
	  This ensures that rate-limited jobs respect the configured retry limits while still allowing for delayed retries when domains are rate-limited.
	* refactor: use liteque's native RetryAfterError for rate limiting
	  Instead of manually handling retries in a while loop, translate RateLimitRetryError to liteque's native RetryAfterError. This is cleaner and lets liteque handle the retry logic using its built-in mechanism.
	* test: add tests for RateLimitRetryError handling in restate queue
	  Added comprehensive tests to verify that:
	  1. RateLimitRetryError delays retry appropriately
	  2. Rate-limited retries don't count against the retry limit
	  3. Jobs can be rate-limited more times than the retry limit
	  4. Regular errors still respect the retry limit
	  These tests ensure the queue correctly handles rate limiting without exhausting retry attempts.
	* lint & format
	* fix: prevent onError callback for RateLimitRetryError
	  Fixed two issues with RateLimitRetryError handling in restate queue:
	  1. RateLimitRetryError now doesn't trigger the onError callback since it's not a real error - it's an expected rate limiting behavior
	  2. Check for RateLimitRetryError in runWorkerLogic before calling onError, ensuring the instanceof check works correctly before the error gets further wrapped by restate
	  Updated tests to verify onError is not called for rate limit retries.
	* fix: catch RateLimitRetryError before ctx.run wraps it
	  Changed approach to use a discriminated union instead of throwing and catching RateLimitRetryError. Now we catch the error inside the ctx.run callback before it gets wrapped by restate's TerminalError, and return a RunResult type that indicates success, rate limit, or error. This fixes the issue where instanceof checks would fail because ctx.run wraps all errors in TerminalError.
	* more fixes
	* rename error name
	---------
	Co-authored-by: Claude <noreply@anthropic.com>
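The core idea in this commit, that a rate-limited attempt is retried after a delay without counting as a failure, can be sketched with a discriminated union similar to the `RunResult` type the message mentions. Everything else here is an assumption for illustration: the field names, `runWithRetries`, and the injected `sleep` callback are hypothetical, and the sketch is synchronous for brevity, whereas a real queue would delay asynchronously.

```typescript
// Assumed shape of a RunResult-like union: success, rate limit, or error.
type RunResult<T> =
  | { type: "success"; value: T }
  | { type: "rateLimit"; delayMs: number }
  | { type: "error"; error: Error };

type Outcome<T> = { ok: true; value: T } | { ok: false; error: Error };

// Hypothetical retry loop: only real errors consume the retry budget.
function runWithRetries<T>(
  attempt: () => RunResult<T>,
  maxRetries: number,
  sleep: (ms: number) => void,
): Outcome<T> {
  let failures = 0;
  while (true) {
    const result = attempt();
    if (result.type === "success") {
      return { ok: true, value: result.value };
    }
    if (result.type === "rateLimit") {
      // Expected behaviour, not a failure: wait and try again without touching the counter.
      sleep(result.delayMs);
      continue;
    }
    failures += 1;
    if (failures > maxRetries) {
      return { ok: false, error: result.error };
    }
  }
}
```

Because the rate-limit branch never increments `failures`, a job can be rate-limited more times than `maxRetries` and still succeed, while regular errors still exhaust the budget.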