getherbackguide.com
robots.txt

Robots Exclusion Standard data for getherbackguide.com

Resource Scan

Scan Details

Site Domain getherbackguide.com
Base Domain getherbackguide.com
Scan Status Ok
Last Scan2024-10-21T17:49:55+00:00
Next Scan 2024-11-20T17:49:55+00:00

Last Scan

Scanned2024-10-21T17:49:55+00:00
URL https://getherbackguide.com/robots.txt
Domain IPs 172.66.40.254, 172.66.43.2, 2606:4700:3108::ac42:28fe, 2606:4700:3108::ac42:2b02
Response IP 172.66.43.2
Found Yes
Hash 2478a91a7eea73ea59321ee0f2c784eec28235655e05c7ef46e0e42229753890
SimHash 6920d0404333

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

gptbot

Rule Path
Disallow /
Disallow */?wlfrom*
Disallow */wl-checkout.php*
Disallow /please-log-in/*
Disallow /developer-test-post/

Other Records

Field Value
sitemap https://getherbackguide.com/sitemap.xml