
Robots Exclusion Standard data for thearchlondon.com

Resource Scan

Scan Details

Site Domain thearchlondon.com
Base Domain thearchlondon.com
Scan Status Ok
Last Scan 2024-09-01T09:54:10+00:00
Next Scan 2024-10-01T09:54:10+00:00

Last Scan

Scanned 2024-09-01T09:54:10+00:00
URL https://thearchlondon.com/robots.txt
Domain IPs 104.21.95.224, 172.67.171.126, 2606:4700:3035::6815:5fe0, 2606:4700:3036::ac43:ab7e
Response IP 172.67.171.126
Found Yes
Hash 7b4a2c36480e01b6cc863c8ac33e2596d9ad83713eb2f4ee7eb9703af5e31868
SimHash 410594452d13

Groups

*

Rule Path
Disallow (empty)

*

Rule Path
Disallow /wp-content/uploads/wpforms/

Other Records

Field Value
sitemap https://www.thearchlondon.com/sitemap_index.xml
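
Example

The groups and records above can be read back as a robots.txt file and checked against sample paths. The sketch below is illustrative only: the ROBOTS_TXT text is an assumed reconstruction from the fields in this report (the scan does not reproduce the file verbatim), and parse_robots and is_allowed are hypothetical helper names. It merges groups that share a user-agent and applies longest-prefix matching, treating an empty Disallow as matching nothing; wildcard characters (* and $) inside paths are not handled.

```python
from collections import defaultdict

# Assumed reconstruction of the scanned file from the groups listed above;
# the report does not reproduce the original text verbatim.
ROBOTS_TXT = """\
User-agent: *
Disallow:

User-agent: *
Disallow: /wp-content/uploads/wpforms/

Sitemap: https://www.thearchlondon.com/sitemap_index.xml
"""


def parse_robots(text):
    """Return (rules, sitemaps); rules maps user-agent -> list of (field, path)."""
    rules = defaultdict(list)
    sitemaps = []
    agents = []       # user-agents of the group currently being read
    in_rules = False  # True once the current group has at least one rule line
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if in_rules:  # a user-agent line after rules starts a new group
                agents, in_rules = [], False
            agents.append(value.lower())
        elif field in ("allow", "disallow"):
            in_rules = True
            for agent in agents:  # repeated groups for one agent are merged
                rules[agent].append((field, value))
        elif field == "sitemap":
            sitemaps.append(value)
    return dict(rules), sitemaps


def is_allowed(rules, agent, path):
    """Longest matching prefix wins; an empty rule path matches nothing."""
    group = rules.get(agent.lower(), rules.get("*", []))
    best_len, allowed = -1, True
    for field, prefix in group:
        if prefix and path.startswith(prefix):
            longer = len(prefix) > best_len
            tie_allow = len(prefix) == best_len and field == "allow"
            if longer or tie_allow:
                best_len, allowed = len(prefix), field == "allow"
    return allowed


if __name__ == "__main__":
    rules, sitemaps = parse_robots(ROBOTS_TXT)
    for sample in ("/", "/wp-content/uploads/wpforms/form.pdf"):
        print(sample, is_allowed(rules, "ExampleBot", sample))
    print(sitemaps)
```

Merging the two "*" groups matters here: a parser that keeps only the first "*" group would see just the empty Disallow and treat the wpforms upload directory as crawlable.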