cgpress.org
robots.txt

Robots Exclusion Standard data for cgpress.org

Resource Scan

Scan Details

Site Domain cgpress.org
Base Domain cgpress.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-10-29T14:40:33+00:00
Next Scan 2025-11-28T14:40:33+00:00

Last Successful Scan

Scanned2025-09-30T08:18:55+00:00
URL https://cgpress.org/robots.txt
Domain IPs 213.109.149.132
Response IP 213.109.149.132
Found Yes
Hash 2be7e58d04e47da4d8bbd73efd00b30b8e416009e0f7a827b6a74e886851aced
SimHash 81625daee649

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-admin
Disallow /trackback
Disallow /feed
Disallow /comments
Disallow /archives/category/
Disallow /archives/date/
Disallow /archives/tuts_subjects/
Disallow /archives/tuts_software/
Disallow */trackback
Disallow */feed
Disallow */comments
Disallow /?s=
Disallow /site_search
Disallow /search-disabled

*

No rules defined. All paths allowed.