christianlindholm.com
robots.txt
Robots Exclusion Standard data for christianlindholm.com
Resource Scan
Scan Details
Site Domain | christianlindholm.com |
Base Domain | christianlindholm.com |
Scan Status | Ok |
Last Scan | 2024-05-30T00:31:13+00:00 |
Next Scan | 2024-06-06T00:31:13+00:00 |
Last Scan
Scanned | 2024-05-30T00:31:13+00:00 |
URL | https://christianlindholm.com/robots.txt |
Redirect | https://physiosa.org.za/robots.txt |
Redirect Domain | physiosa.org.za |
Redirect Base | physiosa.org.za |
Domain IPs | 104.21.13.122, 172.67.167.244, 2606:4700:3034::ac43:a7f4, 2606:4700:3037::6815:d7a |
Redirect IPs | 104.21.42.240, 172.67.213.107, 2606:4700:3033::ac43:d56b, 2606:4700:3035::6815:2af0 |
Response IP | 172.67.213.107 |
Found | Yes |
Hash | eb73c036ac930987d7dfff00f95733b1f9ca2da5f32bb060ab1b94a9f499724b |
SimHash | 0014d941e510 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /search/ |
Disallow | /download/ |
Disallow | /a/ |
Disallow | /search/* |
Disallow | /download/* |
Disallow | /a/* |
Disallow | /cdn-cgi/ |
Disallow | /cdn-cgi/* |
Disallow | /sw/ |
Disallow | /sw/* |
Other Records
Field | Value |
---|---|
sitemap | https://physiosa.org.za/sitemap.xml |