mylighthouse.com
robots.txt
Robots Exclusion Standard data for mylighthouse.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | mylighthouse.com |
Base Domain | mylighthouse.com |
Scan Status | Ok |
Last Scan | 2024-06-08T20:33:04+00:00 |
Next Scan | 2024-07-08T20:33:04+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-06-08T20:33:04+00:00 |
URL | https://mylighthouse.com/robots.txt |
Redirect | https://www.mylighthouse.com/robots.txt |
Redirect Domain | www.mylighthouse.com |
Redirect Base | mylighthouse.com |
Domain IPs | 75.2.60.5 |
Redirect IPs | 13.228.199.255, 13.251.96.10, 2406:da18:880:3802::c8, 2406:da18:b3d:e201::64 |
Response IP | 46.137.195.11 |
Found | Yes |
Hash | 13a821eafb02961e157687ea03b8f92426e165a6e8c2668f76f9de3ba1302ddd |
SimHash | 294454648502 |
Groups
User-agent: *
Rule | Path |
---|---|
Allow | / |
User-agent: *
Rule | Path |
---|---|
Disallow | /404 |
Disallow | /de/404 |
Disallow | /en/404 |
Disallow | /es-mx/404 |
Disallow | /fr/404 |
Disallow | /ja/404 |
Disallow | /pt/404 |
Disallow | /pt-br/404 |
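Both groups above address the same user agent (`*`), so a crawler following RFC 9309 would be expected to merge them and apply the longest matching path, with ties going to Allow. The sketch below illustrates that evaluation for the rules listed in this scan. The names `RULES` and `is_allowed` are hypothetical, and the code assumes plain prefix rules only (no `*` or `$` wildcards, which none of the scanned rules use). It is not the scanner's own logic.

```python
# Rules from the two "*" groups in this scan, merged into one list.
RULES = [
    ("allow", "/"),
    ("disallow", "/404"),
    ("disallow", "/de/404"),
    ("disallow", "/en/404"),
    ("disallow", "/es-mx/404"),
    ("disallow", "/fr/404"),
    ("disallow", "/ja/404"),
    ("disallow", "/pt/404"),
    ("disallow", "/pt-br/404"),
]


def is_allowed(path, rules=RULES):
    """Longest-prefix match; a tie goes to allow; no match means allowed.

    Assumption: rules are literal path prefixes (wildcards are not handled).
    """
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    # "/en/pricing" is a made-up path used only for illustration.
    for p in ["/", "/404", "/en/404", "/en/pricing"]:
        print(p, is_allowed(p))
```

Under this reading, the 404 pages are excluded and everything else stays crawlable. Parsers that stop at the first matching rule, or that keep only the first `*` group, may instead treat `Allow: /` as covering every path, so actual behavior depends on the crawler.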
Other Records
Field | Value |
---|---|
sitemap | https://www.mylighthouse.com/sitemap.xml |