index
:
karakeep
main
Unnamed repository; edit this file 'description' to name the repository.
about
summary
refs
log
tree
commit
diff
stats
log msg
author
committer
range
path:
root
/
apps
/
workers
/
crawlerWorker.ts
(
unfollow
)
Commit message (
Expand
)
Author
Files
Lines
2024-10-06
refactor: Start tracking bookmark assets in the assets table
MohamedBassem
1
-60
/
+83
2024-10-06
refactor: Include userId in the assets table
MohamedBassem
1
-0
/
+5
2024-09-30
feature(web): Add ability to manually trigger full page archives. Fixes #398 ...
kamtschatka
1
-3
/
+5
2024-09-26
fix(workers): Log stacktrace on worker error. #424 (#429)
kamtschatka
1
-1
/
+3
2024-07-28
fix(workers): Shutdown workers on SIGTERM
MohamedBassem
1
-0
/
+4
2024-07-21
fix: async/await issues with the new queue (#319)
kamtschatka
1
-2
/
+2
2024-07-21
refactor: Replace the usage of bullMQ with the hoarder sqlite-based queue (#309)
Mohamed Bassem
1
-31
/
+29
2024-07-14
fix: monolith not embedding SVG files correctly. Fixes #289 (#306)
kamtschatka
1
-5
/
+2
2024-07-01
refactor: added the bookmark type to the database (#256)
kamtschatka
1
-0
/
+6
2024-06-29
refactor: remove redundant code from crawler worker and refactor handling of ...
kamtschatka
1
-32
/
+49
2024-06-23
feature: Automatically transfer image urls into bookmared assets. Fixes #246
MohamedBassem
1
-6
/
+16
2024-06-23
refactor: extract assets into their own database table. #215 (#220)
kamtschatka
1
-29
/
+71
2024-06-22
feature: add support for PDF links. Fixes #28 (#216)
kamtschatka
1
-57
/
+163
2024-06-09
fix: Trigger search re-index on bookmark tag manual updates. Fixes #208 (#210)
kamtschatka
1
-5
/
+2
2024-05-26
fix(crawler): Only update the database if full page archival is enabled
MohamedBassem
1
-19
/
+19
2024-05-26
feature: Full page archival with monolith. Fixes #132
MohamedBassem
1
-1
/
+65
2024-05-15
feature(crawler): Allow connecting to browser's websocket address and launchi...
MohamedBassem
1
-28
/
+55
2024-05-12
feature: Take full page screenshots #143 (#148)
kamtschatka
1
-1
/
+2
2024-04-26
feature(crawler): Allow increasing crawler concurrency and configure storing ...
MohamedBassem
1
-0
/
+13
2024-04-23
fix(crawler): Better extraction for amazon images
MohamedBassem
1
-0
/
+2
2024-04-23
fix(workers): Set a modern user agent and update the default viewport size
MohamedBassem
1
-0
/
+7
2024-04-20
feature: Allow recrawling bookmarks without running inference jobs
MohamedBassem
1
-7
/
+29
2024-04-20
feature: Download images and screenshots
MohamedBassem
1
-28
/
+130
2024-04-11
feature: Recrawl failed links from admin UI (#95)
Ahmad Mujahid
1
-0
/
+20
2024-04-11
fix: Increase default navigation timeout to 30s, make it configurable and add...
MohamedBassem
1
-1
/
+1
2024-04-09
fix(crawler): Skip validating URLs in metascrapper as it was already being va...
MohamedBassem
1
-0
/
+3
2024-04-06
fix(workers): Increase default timeout to 60s, make it configurable and impro...
MohamedBassem
1
-11
/
+21
2024-04-02
fix(workers): Add a timeout to the crawling job to prevent it from getting st...
MohamedBassem
1
-1
/
+2
2024-03-31
chore(workers): Remove unused configuration options
MohamedBassem
1
-2
/
+0
2024-03-30
format: Add missing lint and format, and format the entire repo
MohamedBassem
1
-5
/
+6
2024-03-27
refactor: Validate env variables using zod
MohamedBassem
1
-1
/
+1
2024-03-24
docker: Use external chrome docker container
MohamedBassem
1
-10
/
+40
2024-03-21
fix(workers): Fix the leaky browser instances in workers during development
MohamedBassem
1
-28
/
+30
2024-03-21
fix: Simple validations for crawled URLs
MohamedBassem
1
-1
/
+17
2024-03-14
structure: Create apps dir and copy tooling dir from t3-turbo repo
MohamedBassem
1
-0
/
+0
2024-03-05
feature: Store html content of links in the database
MohamedBassem
1
-0
/
+1
2024-03-05
fix: Use puppeteer adblocker to block cookies notices
MohamedBassem
1
-0
/
+6
2024-03-02
feature: Store full link content and index them
MohamedBassem
1
-1
/
+12
2024-03-01
feature: Add full text search support
MohamedBassem
1
-0
/
+8
2024-02-23
db: Migrate from prisma to drizzle
MohamedBassem
1
-10
/
+10
2024-02-20
branding: Rename app to Hoarder
MohamedBassem
1
-4
/
+4
2024-02-17
build: Fix docker images
MohamedBassem
1
-1
/
+5
2024-02-17
fix: Let the crawler wait a bit more for page load
MohamedBassem
1
-2
/
+12
2024-02-14
fix: Harden puppeteer against browser disconnections and exceptions
MohamedBassem
1
-16
/
+33
2024-02-14
feature: Add ability to refresh bookmark details
MohamedBassem
1
-1
/
+13
2024-02-11
fix: Fix build for workers package and add it to CI
MohamedBassem
1
-8
/
+42
2024-02-09
[feature] Use puppeteer for fetching websites
MohamedBassem
1
-4
/
+32
2024-02-09
[chore] Linting and formating tweaking
MohamedBassem
1
-5
/
+11
2024-02-09
[refactor] Extract the bookmark model to be a high level model to support oth...
MohamedBassem
1
-23
/
+9
2024-02-08
[refactor] Move the different packages to the package subdir
MohamedBassem
1
-0
/
+0
[next]