gnews.org
robots.txt

Robots Exclusion Standard data for gnews.org

Resource Scan

Scan Details

Site Domain gnews.org
Base Domain gnews.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-09-14T15:26:06+00:00
Next Scan 2024-09-28T15:26:06+00:00

Last Successful Scan

Scanned2024-08-23T15:25:41+00:00
URL https://gnews.org/robots.txt
Redirect https://gnews.org/sitemap/robots.txt
Domain IPs 104.18.24.88, 104.18.25.88, 2606:4700::6812:1858, 2606:4700::6812:1958
Response IP 104.18.24.88
Found Yes
Hash 8a2576e7e78930f520e4bbc220de0834c0be9d44374c119af4aa0bd775dbfdf7
SimHash 4045ce494793

Groups

*

Rule Path
Disallow /articles/
Disallow /post/
Disallow /zh-hans/
Disallow /threads/
Disallow /t/
Allow /m/
Allow /wiki/

Other Records

Field Value
sitemap https://gnews.org/sitemap/sitemap_index.xml