karakeep

	Commit message	Author	Age	Files	Lines
*	refactor: Start tracking bookmark assets in the assets table	MohamedBassem	2024-10-06	1	-60/+83
\|
*	refactor: Include userId in the assets table	MohamedBassem	2024-10-06	1	-0/+5
\|
*	feature(web): Add ability to manually trigger full page archives. Fixes #398 (#418)	kamtschatka	2024-09-30	1	-3/+5
	* [Feature Request] Ability to select what to "crawl full page archive" #398
	  Added the ability to start a full page crawl for individual links and in bulk operations; added the ability to refresh links as a bulk operation as well.
	* minor icon and wording changes
	Co-authored-by: MohamedBassem <me@mbassem.com>
*	fix(workers): Log stacktrace on worker error. #424 (#429)	kamtschatka	2024-09-26	1	-1/+3
	Extended logging when an exception occurs, so that the stack trace of a failed execution is visible.
*	fix(workers): Shutdown workers on SIGTERM	MohamedBassem	2024-07-28	1	-0/+4
\|
*	fix: async/await issues with the new queue (#319)	kamtschatka	2024-07-21	1	-2/+2
\|
*	refactor: Replace the usage of bullMQ with the hoarder sqlite-based queue (#309)	Mohamed Bassem	2024-07-21	1	-31/+29
\|
*	fix: monolith not embedding SVG files correctly. Fixes #289 (#306)	kamtschatka	2024-07-14	1	-5/+2
	Passing in the URL of the page so that relative paths resolve against the proper URL.
*	refactor: added the bookmark type to the database (#256)	kamtschatka	2024-07-01	1	-0/+6
	* refactoring asset types
	  Extracted out functions to silently delete assets and to update them after crawling.
	  Generalized the mapping of assets to bookmark fields to make extending them easier.
	* Added the bookmark type to the database
	  Introduced an enum to have better type safety.
	  Cleaned up the code and based some code on the type directly.
	* add BookmarkType.UNKNWON
	* lint and remove unused function
	Co-authored-by: MohamedBassem <me@mbassem.com>
*	refactor: remove redundant code from crawler worker and refactor handling of asset types (#253)	kamtschatka	2024-06-29	1	-32/+49
	* refactoring asset types
	  Extracted out functions to silently delete assets and to update them after crawling.
	  Generalized the mapping of assets to bookmark fields to make extending them easier.
	* revert silentDeleteAsset and hide better-sqlite3
	Co-authored-by: MohamedBassem <me@mbassem.com>
*	feature: Automatically transfer image urls into bookmarked assets. Fixes #246	MohamedBassem	2024-06-23	1	-6/+16
\|
*	refactor: extract assets into their own database table. #215 (#220)	kamtschatka	2024-06-23	1	-29/+71
	* Allow downloading more content from a webpage and index it #215
	  Added a new table that contains the information about assets for link bookmarks.
	  Created migration code that transfers the existing data into the new table.
	* Allow downloading more content from a webpage and index it #215
	  Removed the old asset columns from the database.
	  Updated the UI to use the data from the linkBookmarkAssets array.
	* generalize the assets table to not be linked in particular to links
	* fix migrations post merge
	* fix missing asset ids in the getBookmarks call
	Co-authored-by: MohamedBassem <me@mbassem.com>
*	feature: add support for PDF links. Fixes #28 (#216)	kamtschatka	2024-06-22	1	-57/+163
	* feature request: pdf support #28
	  Added a new sourceUrl column to the asset bookmarks.
	  Added transforming a link bookmark pointing at a pdf to an asset bookmark.
	  Made sure the "View Original" link is also shown for asset bookmarks that have a sourceURL.
	  Updated gitignore for IDEA.
	* remove pdf parsing from the crawler
	* extract the http logic into its own function to avoid duplicating the post-processing actions (openai/index)
	* Add 5s timeout to the content type fetch
	Co-authored-by: MohamedBassem <me@mbassem.com>
*	fix: Trigger search re-index on bookmark tag manual updates. Fixes #208 (#210)	kamtschatka	2024-06-09	1	-5/+2
	* re-index of database is not scanning all places when bookmark tags are changed. Manual indexing is working as workaround #208
	  Introduced a new function to trigger a reindex to reduce copy/paste.
	  Added missing reindexes when tags are deleted/bookmarks are updated.
	* give functions a bit more descriptive name
	Co-authored-by: kamtschatka <simon.schatka@gmx.at>
	Co-authored-by: MohamedBassem <me@mbassem.com>
*	fix(crawler): Only update the database if full page archival is enabled	MohamedBassem	2024-05-26	1	-19/+19
\|
*	feature: Full page archival with monolith. Fixes #132	MohamedBassem	2024-05-26	1	-1/+65
\|
*	feature(crawler): Allow connecting to browser's websocket address and launching the browser on demand. This enables support for browserless	MohamedBassem	2024-05-15	1	-28/+55
*	feature: Take full page screenshots #143 (#148)	kamtschatka	2024-05-12	1	-1/+2
	Added the fullPage flag to take full screen screenshots.
	Updated the UI accordingly to properly show the screenshots instead of scaling them down.
	Co-authored-by: kamtschatka <simon.schatka@gmx.at>
*	feature(crawler): Allow increasing crawler concurrency and configure storing images and screenshots	MohamedBassem	2024-04-26	1	-0/+13
*	fix(crawler): Better extraction for amazon images	MohamedBassem	2024-04-23	1	-0/+2
\|
*	fix(workers): Set a modern user agent and update the default viewport size	MohamedBassem	2024-04-23	1	-0/+7
\|
*	feature: Allow recrawling bookmarks without running inference jobs	MohamedBassem	2024-04-20	1	-7/+29
\|
*	feature: Download images and screenshots	MohamedBassem	2024-04-20	1	-28/+130
\|
*	feature: Recrawl failed links from admin UI (#95)	Ahmad Mujahid	2024-04-11	1	-0/+20
	* feature: Retry failed crawling URLs
	* fix: Enhancing visuals and some minor changes.
*	fix: Increase default navigation timeout to 30s, make it configurable and add retries to crawling jobs	MohamedBassem	2024-04-11	1	-1/+1
*	fix(crawler): Skip validating URLs in metascraper as it was already being validated. Fixes #22	MohamedBassem	2024-04-09	1	-0/+3
*	fix(workers): Increase default timeout to 60s, make it configurable and improve logging	MohamedBassem	2024-04-06	1	-11/+21
*	fix(workers): Add a timeout to the crawling job to prevent it from getting stuck. Fixes #63	MohamedBassem	2024-04-02	1	-1/+2
*	chore(workers): Remove unused configuration options	MohamedBassem	2024-03-31	1	-2/+0
\|
*	format: Add missing lint and format, and format the entire repo	MohamedBassem	2024-03-30	1	-5/+6
\|
*	refactor: Validate env variables using zod	MohamedBassem	2024-03-27	1	-1/+1
\|
*	docker: Use external chrome docker container	MohamedBassem	2024-03-24	1	-10/+40
\|
*	fix(workers): Fix the leaky browser instances in workers during development	MohamedBassem	2024-03-21	1	-28/+30
\|
*	fix: Simple validations for crawled URLs	MohamedBassem	2024-03-21	1	-1/+17
\|
*	structure: Create apps dir and copy tooling dir from t3-turbo repo	MohamedBassem	2024-03-14	1	-0/+201