karakeep (branch: main)
path: root/apps/workers/crawlerWorker.ts
Date | Commit message | Author | Files | Lines
2024-04-26 | feature(crawler): Allow increasing crawler concurrency and configure storing ... | MohamedBassem | 1 | -0/+13
2024-04-23 | fix(crawler): Better extraction for amazon images | MohamedBassem | 1 | -0/+2
2024-04-23 | fix(workers): Set a modern user agent and update the default viewport size | MohamedBassem | 1 | -0/+7
2024-04-20 | feature: Allow recrawling bookmarks without running inference jobs | MohamedBassem | 1 | -7/+29
2024-04-20 | feature: Download images and screenshots | MohamedBassem | 1 | -28/+130
2024-04-11 | feature: Recrawl failed links from admin UI (#95) | Ahmad Mujahid | 1 | -0/+20
2024-04-11 | fix: Increase default navigation timeout to 30s, make it configurable and add... | MohamedBassem | 1 | -1/+1
2024-04-09 | fix(crawler): Skip validating URLs in metascrapper as it was already being va... | MohamedBassem | 1 | -0/+3
2024-04-06 | fix(workers): Increase default timeout to 60s, make it configurable and impro... | MohamedBassem | 1 | -11/+21
2024-04-02 | fix(workers): Add a timeout to the crawling job to prevent it from getting st... | MohamedBassem | 1 | -1/+2
2024-03-31 | chore(workers): Remove unused configuration options | MohamedBassem | 1 | -2/+0
2024-03-30 | format: Add missing lint and format, and format the entire repo | MohamedBassem | 1 | -5/+6
2024-03-27 | refactor: Validate env variables using zod | MohamedBassem | 1 | -1/+1
2024-03-24 | docker: Use external chrome docker container | MohamedBassem | 1 | -10/+40
2024-03-21 | fix(workers): Fix the leaky browser instances in workers during development | MohamedBassem | 1 | -28/+30
2024-03-21 | fix: Simple validations for crawled URLs | MohamedBassem | 1 | -1/+17
2024-03-14 | structure: Create apps dir and copy tooling dir from t3-turbo repo | MohamedBassem | 1 | -0/+0
2024-03-05 | feature: Store html content of links in the database | MohamedBassem | 1 | -0/+1
2024-03-05 | fix: Use puppeteer adblocker to block cookies notices | MohamedBassem | 1 | -0/+6
2024-03-02 | feature: Store full link content and index them | MohamedBassem | 1 | -1/+12
2024-03-01 | feature: Add full text search support | MohamedBassem | 1 | -0/+8
2024-02-23 | db: Migrate from prisma to drizzle | MohamedBassem | 1 | -10/+10
2024-02-20 | branding: Rename app to Hoarder | MohamedBassem | 1 | -4/+4
2024-02-17 | build: Fix docker images | MohamedBassem | 1 | -1/+5
2024-02-17 | fix: Let the crawler wait a bit more for page load | MohamedBassem | 1 | -2/+12
2024-02-14 | fix: Harden puppeteer against browser disconnections and exceptions | MohamedBassem | 1 | -16/+33
2024-02-14 | feature: Add ability to refresh bookmark details | MohamedBassem | 1 | -1/+13
2024-02-11 | fix: Fix build for workers package and add it to CI | MohamedBassem | 1 | -8/+42
2024-02-09 | [feature] Use puppeteer for fetching websites | MohamedBassem | 1 | -4/+32
2024-02-09 | [chore] Linting and formating tweaking | MohamedBassem | 1 | -5/+11
2024-02-09 | [refactor] Extract the bookmark model to be a high level model to support oth... | MohamedBassem | 1 | -23/+9
2024-02-08 | [refactor] Move the different packages to the package subdir | MohamedBassem | 1 | -0/+0
2024-02-07 | [feature] Add openAI integration for extracting tags from articles | MohamedBassem | 1 | -0/+6
2024-02-07 | [refactor] Rename the crawlers package to workers | MohamedBassem | 1 | -0/+0
2024-02-06 | Implement metadata fetching logic in the crawler | MohamedBassem | 1 | -2/+68
2024-02-06 | Init package and start bullmq workers | MohamedBassem | 1 | -0/+6