| Commit message (Collapse) | Author | Age | Files | Lines |
| | |
|
| | |
|
| |
|
|
|
|
|
|
|
|
|
|
|
| |
(#418)
* [Feature Request] Ability to select what to "crawl full page archive" #398
Added the ability to start a full page crawl for links and also in bulk operations
added the ability to refresh links as a bulk operation as well
* minor icon and wording changes
---------
Co-authored-by: MohamedBassem <me@mbassem.com>
|
| |
|
| |
extended logging when an exception occurrs, so it is possible to see the stacktrace of a failed execution
|
| | |
|
| | |
|
| | |
|
| |
|
| |
passing in the URL of the page to have the proper URL for resolving relative paths
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
* refactoring asset types
Extracted out functions to silently delete assets and to update them after crawling
Generalized the mapping of assets to bookmark fields to make extending them easier
* Added the bookmark type to the database
Introduced an enum to have better type safety
cleaned up the code and based some code on the type directly
* add BookmarkType.UNKNWON
* lint and remove unused function
---------
Co-authored-by: MohamedBassem <me@mbassem.com>
|
| |
|
|
|
|
|
|
|
|
|
|
|
| |
asset types (#253)
* refactoring asset types
Extracted out functions to silently delete assets and to update them after crawling
Generalized the mapping of assets to bookmark fields to make extending them easier
* revert silentDeleteAsset and hide better-sqlite3
---------
Co-authored-by: MohamedBassem <me@mbassem.com>
|
| | |
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
* Allow downloading more content from a webpage and index it #215
added a new table that contains the information about assets for link bookmarks
created migration code that transfers the existing data into the new table
* Allow downloading more content from a webpage and index it #215
removed the old asset columns from the database
updated the UI to use the data from the linkBookmarkAssets array
* generalize the assets table to not be linked in particular to links
* fix migrations post merge
* fix missing asset ids in the getBookmarks call
---------
Co-authored-by: MohamedBassem <me@mbassem.com>
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
* feature request: pdf support #28
Added a new sourceUrl column to the asset bookmarks
Added transforming a link bookmark pointing at a pdf to an asset bookmark
made sure the "View Original" link is also shown for asset bookmarks that have a sourceURL
updated gitignore for IDEA
* remove pdf parsing from the crawler
* extract the http logic into its own function to avoid duplicating the post-processing actions (openai/index)
* Add 5s timeout to the content type fetch
---------
Co-authored-by: MohamedBassem <me@mbassem.com>
|
| |
|
|
|
|
|
|
|
|
|
|
| |
* re-index of database is not scanning all places when bookmark tags are changed. Manual indexing is working as workaround #208
introduced a new function to trigger a reindex to reduce copy/paste
added missing reindexes when tags are deleted/bookmarks are updated
* give functions a bit more descriptive name
---------
Co-authored-by: kamtschatka <simon.schatka@gmx.at>
Co-authored-by: MohamedBassem <me@mbassem.com>
|
| | |
|
| | |
|
| |
|
|
| |
launching the browser on demand. This enables support for browserless
|
| |
|
|
|
|
| |
Added the fullPage flag to take full screen screenshots
updated the UI accordingly to properly show the screenshots instead of scaling it down
Co-authored-by: kamtschatka <simon.schatka@gmx.at>
|
| |
|
|
| |
images and screenshots
|
| | |
|
| | |
|
| | |
|
| | |
|
| |
|
|
|
| |
* feature: Retry failed crawling URLs
* fix: Enhancing visuals and some minor changes.
|
| |
|
|
| |
add retries to crawling jobs
|
| |
|
|
| |
validated. Fixes #22
|
| |
|
|
| |
improve logging
|
| |
|
|
| |
stuck. Fixes #63
|
| | |
|
| | |
|
| | |
|
| | |
|
| | |
|
| | |
|
| |
|