thebigdomain.com
robots.txt

Robots Exclusion Standard data for thebigdomain.com

Resource Scan

Scan Details

Site Domain thebigdomain.com
Base Domain thebigdomain.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2026-01-10T20:10:15+00:00
Next Scan 2026-04-10T20:10:15+00:00

Last Successful Scan

Scanned2024-11-24T17:07:32+00:00
URL https://thebigdomain.com/robots.txt
Redirect https://www.thebigdomain.com/robots.txt
Redirect Domain www.thebigdomain.com
Redirect Base thebigdomain.com
Domain IPs 104.18.38.27, 172.64.149.229, 2606:4700:4400::6812:261b, 2606:4700:4400::ac40:95e5
Redirect IPs 104.18.38.27, 172.64.149.229, 2606:4700:4400::6812:261b, 2606:4700:4400::ac40:95e5
Response IP 172.64.149.229
Found Yes
Hash 902981e61b2ded0241c6d633f1413c135942f67149d24667bbc55a47cf6d205f
SimHash 580d4dc5c5d3

Groups

*

Rule Path
Disallow /success
Disallow /booking
Disallow /available
Disallow /*.aspx
Disallow /*?pl=
Disallow /blog?id=
Disallow /*blog/?page=
Disallow /*.asmx
Disallow /*?page=
Allow /AvailabilityNew.aspx

Other Records

Field Value
sitemap https://www.thebigdomain.co.uk/sitemap.xml
sitemap https://www.thebigdomain.co.uk/pages.xml
sitemap https://www.thebigdomain.co.uk/properties.xml

Comments

  • Disallow