theblemish.com
robots.txt

Robots Exclusion Standard data for theblemish.com

Resource Scan

Scan Details

Site Domain theblemish.com
Base Domain theblemish.com
Scan Status Ok
Last Scan 2024-07-02T19:03:38+00:00
Next Scan 2024-07-09T19:03:38+00:00

Last Scan

Scanned 2024-07-02T19:03:38+00:00
URL https://theblemish.com/robots.txt
Domain IPs 104.21.234.188, 104.21.234.189
Response IP 104.21.234.188
Found Yes
Hash cd3806acbca55216b7a5205b9dce688e2da78795812d04d317e354c59d373681
SimHash 63a54c0ece90

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow */comment-page-*/
Disallow */all-comments/
Disallow */trackback
Disallow */email/
Disallow *?replytocom
Disallow /photos/
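
Several of the rules above use the `*` wildcard, which Python's standard `urllib.robotparser` treats as a literal prefix rather than a wildcard. The following is a minimal sketch of how these patterns might be matched against a URL path, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end, and every rule is otherwise a prefix match. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical, and real crawlers may differ in details such as percent-encoding and Allow/Disallow precedence.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW = [
    "/cgi-bin/",
    "/wp-admin/",
    "*/comment-page-*/",
    "*/all-comments/",
    "*/trackback",
    "*/email/",
    "*?replytocom",
    "/photos/",
]


def pattern_to_regex(pattern):
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


RULES = [pattern_to_regex(p) for p in DISALLOW]


def is_disallowed(path):
    """Return True if any Disallow rule matches from the start of the path."""
    return any(rule.match(path) for rule in RULES)


if __name__ == "__main__":
    for sample in ["/wp-admin/options.php", "/post/?replytocom=5", "/2024/some-post/"]:
        print(sample, is_disallowed(sample))
```

Because `re.match` only anchors at the start, a rule such as `/photos/` behaves as a prefix match, while `*/trackback` matches the segment at any depth.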

Other Records

Field Value
sitemap http://theblemish.com/sitemap_index.xml
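
The group and record tables above can be turned back into robots.txt syntax. The sketch below is one way to do that, assuming a single `*` group and a single sitemap record as listed. The function name `render_robots_txt` is hypothetical, and the output is a reconstruction: the original file's comments, blank lines, and line order are not recoverable from this scan.

```python
# Data transcribed from the Groups and Other Records tables above.
GROUPS = {
    "*": [
        ("Disallow", "/cgi-bin/"),
        ("Disallow", "/wp-admin/"),
        ("Disallow", "*/comment-page-*/"),
        ("Disallow", "*/all-comments/"),
        ("Disallow", "*/trackback"),
        ("Disallow", "*/email/"),
        ("Disallow", "*?replytocom"),
        ("Disallow", "/photos/"),
    ],
}
OTHER_RECORDS = [("sitemap", "http://theblemish.com/sitemap_index.xml")]


def render_robots_txt(groups, other_records):
    """Render groups and non-group records as robots.txt text."""
    lines = []
    for agent, rules in groups.items():
        lines.append(f"User-agent: {agent}")
        lines.extend(f"{rule}: {path}" for rule, path in rules)
        lines.append("")  # blank line separates groups
    lines.extend(f"{field.capitalize()}: {value}" for field, value in other_records)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_robots_txt(GROUPS, OTHER_RECORDS), end="")
```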