houseind.com
robots.txt

Robots Exclusion Standard data for houseind.com

Resource Scan

Scan Details

Site Domain houseind.com
Base Domain houseind.com
Scan Status Ok
Last Scan2024-05-23T04:21:32+00:00
Next Scan 2024-06-22T04:21:32+00:00

Last Scan

Scanned2024-05-23T04:21:32+00:00
URL https://houseind.com/robots.txt
Redirect https://houseindustries.com/robots.txt
Redirect Domain houseindustries.com
Redirect Base houseindustries.com
Domain IPs 104.26.4.250, 104.26.5.250, 172.67.75.93, 2606:4700:20::681a:4fa, 2606:4700:20::681a:5fa, 2606:4700:20::ac43:4b5d
Redirect IPs 104.21.40.118, 172.67.151.16, 2606:4700:3030::6815:2876, 2606:4700:3033::ac43:9710
Response IP 104.21.40.118
Found Yes
Hash c073263c3695249a81bad49c88a60b05c1b837ead8faf8608711ec452e149ef5
SimHash aed52d0f74c0

Groups

*

Rule Path
Disallow /checkout
Disallow /cart
Disallow /orders
Disallow /countries
Disallow /line_items
Disallow /password_resets
Disallow /states
Disallow /user_sessions
Disallow /user_registrations
Disallow /users
Disallow /account

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-agent: *
  • Disallow: /