cgtricks.com
robots.txt

Robots Exclusion Standard data for cgtricks.com

Resource Scan

Scan Details

Site Domain cgtricks.com
Base Domain cgtricks.com
Scan Status Ok
Last Scan2026-02-03T08:27:09+00:00
Next Scan 2026-02-10T08:27:09+00:00

Last Scan

Scanned2026-02-03T08:27:09+00:00
URL https://cgtricks.com/robots.txt
Domain IPs 213.109.149.129
Response IP 213.109.149.129
Found Yes
Hash d476e6236d5a62c1f10bbb90ac87cba529ed31f74af06daddbcae8684d2dbe57
SimHash cb0408408301

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /search?q=*
Disallow *?replytocom
Disallow */attachment/*
Disallow /images/
Disallow /comments
Disallow /author
Disallow /archives
Allow /*.js$
Allow /*.css$

Other Records

Field Value
sitemap https://cgtricks.com/post-sitemap.xml