reece.com.au
robots.txt

Robots Exclusion Standard data for reece.com.au

Resource Scan

Scan Details

Site Domain reece.com.au
Base Domain reece.com.au
Scan Status Ok
Last Scan2024-11-06T03:27:39+00:00
Next Scan 2024-11-20T03:27:39+00:00

Last Scan

Scanned2024-11-06T03:27:39+00:00
URL https://reece.com.au/robots.txt
Redirect https://www.reece.com.au/robots.txt
Redirect Domain www.reece.com.au
Redirect Base reece.com.au
Domain IPs 108.157.254.114, 108.157.254.15, 108.157.254.49, 108.157.254.89
Redirect IPs 18.155.68.31, 18.155.68.59, 18.155.68.69, 18.155.68.73, 2600:9000:23d2:8000:11:fb5d:d080:93a1, 2600:9000:23d2:800:11:fb5d:d080:93a1, 2600:9000:23d2:8c00:11:fb5d:d080:93a1, 2600:9000:23d2:8e00:11:fb5d:d080:93a1, 2600:9000:23d2:a000:11:fb5d:d080:93a1, 2600:9000:23d2:bc00:11:fb5d:d080:93a1, 2600:9000:23d2:da00:11:fb5d:d080:93a1, 2600:9000:23d2:e00:11:fb5d:d080:93a1
Response IP 18.155.68.31
Found Yes
Hash 2063ea3b460714470d4ba9cbfde50882fe093c52ccc8e572a868d6db0fbdbe83
SimHash a81f6500af73

Groups

*

Rule Path
Allow /
Disallow /admin
Disallow /page-not-found
Disallow /server-error
Disallow /response-24-7
Disallow /product/*?query=*
Disallow /search/*?query=*
Disallow *?*=*
Disallow *?*=*&*=*
Disallow /project-inspiration-gallery/?*

ahrefsbot
almaden
arquivo-web-crawler
aspseek
baiduspider
dumbbot
generic
grub-client
mj12bot
msiecrawler
nexabot
npbot
owr_crawler
psbot
rpt-httpclient
scoutabout
semanticdiscovery
turnitinbot
twiceler
wget
yandex

Rule Path
Disallow /

Other Records

Field Value
sitemap http://www.reece.com.au/sitemap.xml

Warnings

  • 2 invalid lines.