index : karakeep (branch: main)
path: root/apps/workers/crawlerWorker.ts
| Date | Commit message | Author | Files | Lines |
|---|---|---|---|---|
| 2025-04-16 | fix(workers): Close browser if connect on demand (#1151) | Chang-Yen Tseng | 1 | -0/+3 |
| 2025-04-12 | chore: Rename hoarder packages to karakeep | MohamedBassem | 1 | -8/+8 |
| 2025-03-27 | feat(workers): Add CRAWLER_SCREENSHOT_TIMEOUT_SEC (#1155) | Chang-Yen Tseng | 1 | -10/+18 |
| 2025-03-22 | feat(workers): Adds publisher and author og:meta tags to Bookmark (#1141) | erik-nilcoast | 1 | -0/+24 |
| 2025-02-17 | feat: Add PDF screenshot generation and display (#995) | Ahmad Mujahid | 1 | -0/+1 |
| 2025-02-02 | fix: Dont rearchive singlefile uploads and consider them as archives | Mohamed Bassem | 1 | -2/+6 |
| 2025-02-01 | fix: Abort all IO when workers timeout instead of detaching. Fixes #742 | Mohamed Bassem | 1 | -13/+62 |
| 2025-01-19 | feat: Change webhooks to be configurable by users | Mohamed Bassem | 1 | -2/+2 |
| 2025-01-19 | feat(webhook): Implement webhook functionality for bookmark events (#852) | 玄猫 | 1 | -0/+4 |
| 2025-01-11 | feat: Add support for singlefile extension uploads. #172 | Mohamed Bassem | 1 | -6/+30 |
| 2024-12-26 | refactor: Move asset preprocessing to its own worker out of the inference worker | Mohamed Bassem | 1 | -17/+18 |
| 2024-12-08 | feature: Store crawling status code and allow users to find broken links. Fix... | Mohamed Bassem | 1 | -4/+6 |
| 2024-11-30 | feature(workers): Allow running hoarder without chrome as a hard dependency. ... | Mohamed Bassem | 1 | -11/+35 |
| 2024-11-23 | fix(workers): Set a timeout on the screenshot call and completely skip it if ... | Mohamed Bassem | 1 | -13/+32 |
| 2024-11-21 | fix(workers): Don't block connection to chrome when failing to download adblo... | Mohamed Bassem | 1 | -6/+22 |
| 2024-11-21 | chore(workers): Add extra logging for browser connection errors | Mohamed Bassem | 1 | -1/+1 |
| 2024-11-09 | fix: Only update bookmark tagging/crawling status when worker is out of retries | Mohamed Bassem | 1 | -4/+4 |
| 2024-11-03 | fix: Pass arguments to monolith and yt-dlp as array for better escaping | Mohamed Bassem | 1 | -1/+1 |
| 2024-10-28 | feature: Archive videos using yt-dlp. Fixes #215 (#525) | kamtschatka | 1 | -49/+10 |
| 2024-10-27 | deps: Extract the queue implementation into its own repos | Mohamed Bassem | 1 | -1/+1 |
| 2024-10-06 | refactor: Start tracking bookmark assets in the assets table | MohamedBassem | 1 | -60/+83 |
| 2024-10-06 | refactor: Include userId in the assets table | MohamedBassem | 1 | -0/+5 |
| 2024-09-30 | feature(web): Add ability to manually trigger full page archives. Fixes #398 ... | kamtschatka | 1 | -3/+5 |
| 2024-09-26 | fix(workers): Log stacktrace on worker error. #424 (#429) | kamtschatka | 1 | -1/+3 |
| 2024-07-28 | fix(workers): Shutdown workers on SIGTERM | MohamedBassem | 1 | -0/+4 |
| 2024-07-21 | fix: async/await issues with the new queue (#319) | kamtschatka | 1 | -2/+2 |
| 2024-07-21 | refactor: Replace the usage of bullMQ with the hoarder sqlite-based queue (#309) | Mohamed Bassem | 1 | -31/+29 |
| 2024-07-14 | fix: monolith not embedding SVG files correctly. Fixes #289 (#306) | kamtschatka | 1 | -5/+2 |
| 2024-07-01 | refactor: added the bookmark type to the database (#256) | kamtschatka | 1 | -0/+6 |
| 2024-06-29 | refactor: remove redundant code from crawler worker and refactor handling of ... | kamtschatka | 1 | -32/+49 |
| 2024-06-23 | feature: Automatically transfer image urls into bookmared assets. Fixes #246 | MohamedBassem | 1 | -6/+16 |
| 2024-06-23 | refactor: extract assets into their own database table. #215 (#220) | kamtschatka | 1 | -29/+71 |
| 2024-06-22 | feature: add support for PDF links. Fixes #28 (#216) | kamtschatka | 1 | -57/+163 |
| 2024-06-09 | fix: Trigger search re-index on bookmark tag manual updates. Fixes #208 (#210) | kamtschatka | 1 | -5/+2 |
| 2024-05-26 | fix(crawler): Only update the database if full page archival is enabled | MohamedBassem | 1 | -19/+19 |
| 2024-05-26 | feature: Full page archival with monolith. Fixes #132 | MohamedBassem | 1 | -1/+65 |
| 2024-05-15 | feature(crawler): Allow connecting to browser's websocket address and launchi... | MohamedBassem | 1 | -28/+55 |
| 2024-05-12 | feature: Take full page screenshots #143 (#148) | kamtschatka | 1 | -1/+2 |
| 2024-04-26 | feature(crawler): Allow increasing crawler concurrency and configure storing ... | MohamedBassem | 1 | -0/+13 |
| 2024-04-23 | fix(crawler): Better extraction for amazon images | MohamedBassem | 1 | -0/+2 |
| 2024-04-23 | fix(workers): Set a modern user agent and update the default viewport size | MohamedBassem | 1 | -0/+7 |
| 2024-04-20 | feature: Allow recrawling bookmarks without running inference jobs | MohamedBassem | 1 | -7/+29 |
| 2024-04-20 | feature: Download images and screenshots | MohamedBassem | 1 | -28/+130 |
| 2024-04-11 | feature: Recrawl failed links from admin UI (#95) | Ahmad Mujahid | 1 | -0/+20 |
| 2024-04-11 | fix: Increase default navigation timeout to 30s, make it configurable and add... | MohamedBassem | 1 | -1/+1 |
| 2024-04-09 | fix(crawler): Skip validating URLs in metascrapper as it was already being va... | MohamedBassem | 1 | -0/+3 |
| 2024-04-06 | fix(workers): Increase default timeout to 60s, make it configurable and impro... | MohamedBassem | 1 | -11/+21 |
| 2024-04-02 | fix(workers): Add a timeout to the crawling job to prevent it from getting st... | MohamedBassem | 1 | -1/+2 |
| 2024-03-31 | chore(workers): Remove unused configuration options | MohamedBassem | 1 | -2/+0 |
| 2024-03-30 | format: Add missing lint and format, and format the entire repo | MohamedBassem | 1 | -5/+6 |