housedoghq.com
robots.txt

Robots Exclusion Standard data for housedoghq.com

Resource Scan

Scan Details

Site Domain housedoghq.com
Base Domain housedoghq.com
Scan Status Ok
Last Scan 2024-10-01T21:34:09+00:00
Next Scan 2024-10-08T21:34:09+00:00

Last Scan

Scanned 2024-10-01T21:34:09+00:00
URL https://housedoghq.com/robots.txt
Domain IPs 35.232.249.117
Response IP 35.232.249.117
Found Yes
Hash 8cb467ab70ccabc05fce02935ff02f5d5705ef4ba27e2304a494d2a62d999103
SimHash 4374d9760632
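
The Hash value is 64 hexadecimal digits, which is the length of a SHA-256 digest. The report does not name the algorithm, so the sketch below only assumes SHA-256 over the raw response body; `robots_digest` and `has_changed` are hypothetical helper names, not part of any scanner's API. It shows how such a digest could be used to detect whether robots.txt changed between two scans.

```python
import hashlib


def robots_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the report's Hash field is SHA-256 of the raw bytes.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hex: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return robots_digest(body) != previous_hex.lower()


if __name__ == "__main__":
    # Illustrative body only; this is not the real file content.
    sample = b"User-agent: *\nDisallow: /partner/\n"
    stored = robots_digest(sample)
    print(has_changed(stored, sample))            # same bytes
    print(has_changed(stored, sample + b"\n"))    # one extra byte
```

An exact digest flips on any byte change, including whitespace, which is presumably why the report also lists a SimHash for approximate comparison.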

Groups

*

Rule Path
Disallow /partner/
Disallow /sponsor/
Disallow /recommends/
Disallow */page/
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-content/cache
Disallow /wp-json/
Disallow /wp-login.php
Disallow /wp-register.php
Disallow /xmlrpc.php
Disallow /license.txt
Disallow /readme.html
Disallow /trackback/
Disallow /comments/feed/
Disallow /*?replytocom
Disallow */feed
Disallow */rss
Disallow /author/
Disallow /?
Disallow /*?
Disallow /?s=
Disallow *%26s%3D
Disallow /search
Disallow *?attachment_id=
Allow /*.css
Allow /*.js
Allow /wp-content/uploads/
Allow /wp-content/themes/
Allow /wp-content/plugins/
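
Because the group mixes broad Disallow patterns (such as `/*?`) with Allow patterns (such as `/*.css`), the outcome for a given URL depends on how a crawler resolves overlaps. The sketch below assumes the precedence described in RFC 9309: the longest matching pattern wins, and Allow wins a tie. It uses a subset of the rules listed above; `pattern_to_regex` and `is_allowed` are hypothetical helper names, and percent-encoding normalization is deliberately left out.

```python
import re

# Subset of the "*" group rules listed above, copied verbatim.
RULES = [
    ("disallow", "/partner/"),
    ("disallow", "*/page/"),
    ("disallow", "/wp-admin"),
    ("disallow", "/wp-json/"),
    ("disallow", "/*?"),
    ("disallow", "*/feed"),
    ("allow", "/*.css"),
    ("allow", "/*.js"),
    ("allow", "/wp-content/uploads/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumed precedence: longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                best_allow = is_allow
    return best_allow


if __name__ == "__main__":
    for p in ["/partner/acme", "/blog/page/2/", "/style.css?ver=1",
              "/wp-content/uploads/dog.jpg", "/?s=beds"]:
        print(p, is_allowed(p))
```

Under these assumptions `/style.css?ver=1` stays crawlable even though it contains a query string, because `/*.css` (6 characters) is longer than `/*?` (3 characters). Crawlers that resolve overlaps differently may reach a different result.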

Other Records

Field Value
sitemap https://housedoghq.com/sitemap.xml

Warnings

  • `host` is not a known field.