companieshouse.id
robots.txt

Robots Exclusion Standard data for companieshouse.id

Resource Scan

Scan Details

Site Domain companieshouse.id
Base Domain companieshouse.id
Scan Status Ok
Last Scan2024-06-02T01:30:33+00:00
Next Scan 2024-06-09T01:30:33+00:00

Last Scan

Scanned2024-06-02T01:30:33+00:00
URL https://companieshouse.id/robots.txt
Domain IPs 104.26.4.133, 104.26.5.133, 172.67.74.128, 2606:4700:20::681a:485, 2606:4700:20::681a:585, 2606:4700:20::ac43:4a80
Response IP 104.26.4.133
Found Yes
Hash bb464f1789d1225c7ee967509d5e2b1819f03a559430420e2e18e95aee062728
SimHash a8142900ef90

Groups

*

Rule Path
Disallow /confirm$
Disallow /cdn-cgi/*