Skip to content

[Bug] Failed to capture the screenshot #1097

@startle09

Description

@startle09

Describe the Bug

Failed to fetch the specific webpage, URL: https://github.com/HqWu-HITCS/Awesome-Chinese-LLM

Other webpages are fetched normally (including other GitHub repositories).

The bug can be reproduced multiple times. Please let me know if you need more information.

Steps to Reproduce

pass

Expected Behaviour

pass

Screenshots or Additional Context

Log:

2025-03-07T05:51:46.188Z info: [Crawler][74] Will crawl "https://github.com/HqWu-HITCS/Awesome-Chinese-LLM" for link with id "qaxgbqoluf49akiqs5q9rbar"
2025-03-07T05:51:46.188Z info: [Crawler][74] Attempting to determine the content-type for the url https://github.com/HqWu-HITCS/Awesome-Chinese-LLM
2025-03-07T05:51:46.502Z info: [webhook][76] Starting a webhook job for bookmark with id "qaxgbqoluf49akiqs5q9rbar"
2025-03-07T05:51:46.502Z info: [webhook][76] Completed successfully
2025-03-07T05:51:46.759Z info: [search][75] Attempting to index bookmark with id qaxgbqoluf49akiqs5q9rbar ...
2025-03-07T05:51:46.826Z info: [search][75] Completed successfully
2025-03-07T05:51:48.976Z info: [Crawler][74] Content-type for the url https://github.com/HqWu-HITCS/Awesome-Chinese-LLM is "text/html; charset=utf-8"
2025-03-07T05:51:52.985Z info: [Crawler][74] Successfully navigated to "https://github.com/HqWu-HITCS/Awesome-Chinese-LLM". Waiting for the page to load ...
2025-03-07T05:51:54.380Z info: [Crawler][74] Finished waiting for the page to load.
2025-03-07T05:51:54.469Z info: [Crawler][74] Successfully fetched the page content.
--> 2025-03-07T05:51:59.470Z warn: [Crawler][74] Failed to capture the screenshot.
2025-03-07T05:52:01.068Z info: [Crawler][74] Will attempt to extract metadata from page ...
2025-03-07T05:52:02.530Z info: [Crawler][74] Will attempt to extract readable content ...
2025-03-07T05:52:04.596Z info: [Crawler][74] Done extracting readable content.
2025-03-07T05:52:04.597Z info: [Crawler][74] Skipping storing the screenshot as it's empty.
2025-03-07T05:52:05.379Z info: [Crawler][74] Done extracting metadata from the page.
2025-03-07T05:52:05.379Z info: [Crawler][74] Downloading image from "https://opengraph.githubassets.com/e5554c3072197574abc8846fb5cff36fa325b3e1699cbea42f2c1883e8eeabfd/HqWu-HITCS/Awesome-Chinese-LLM"
2025-03-07T05:52:06.049Z info: [Crawler][74] Downloaded image as assetId: 05e1e551-904f-464f-ad6d-34907ab661d6
2025-03-07T05:52:06.060Z info: [Crawler][74] Completed successfully

Device Details

Debian 12

Exact Hoarder Version

0.22.0

Have you checked the troubleshooting guide?

  • I have checked the troubleshooting guide and I haven't found a solution to my problem

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workingstatus/untriagedThis issue needs triaging to confirm it

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions