leadpages.com
robots.txt
Robots Exclusion Standard data for leadpages.com
Resource Scan
Scan Details
Site Domain | leadpages.com |
Base Domain | leadpages.com |
Scan Status | Ok |
Last Scan | 2024-05-11T04:34:47+00:00 |
Next Scan | 2024-06-10T04:34:47+00:00 |
Last Scan
Scanned | 2024-05-11T04:34:47+00:00 |
URL | https://leadpages.com/robots.txt |
Redirect | https://www.leadpages.com/robots.txt |
Redirect Domain | www.leadpages.com |
Redirect Base | leadpages.com |
Domain IPs | 76.76.21.21 |
Redirect IPs | 76.76.21.22, 76.76.21.61 |
Response IP | 76.76.21.9 |
Found | Yes |
Hash | 270ad8a7294596ec00ded84057c2fd7423685fb6911b860d6fd3c3efc61a5fee |
SimHash | 6b4c9c514151 |
Groups
*
Rule | Path |
---|---|
Allow | / |
*
Rule | Path |
---|---|
Disallow | /studio |
*
Rule | Path |
---|---|
Disallow | /story |
*
Rule | Path |
---|---|
Disallow | /_legacy |
*
Rule | Path |
---|---|
Disallow | /blog/search |
Other Records
Field | Value |
---|---|
sitemap | https://www.leadpages.com/sitemap.xml |
sitemap | https://www.leadpages.com/sitemap/blog/index.xml |
sitemap | https://www.leadpages.com/sitemap/templates/index.xml |
Warnings
- `host` is not a known field.
Comments