ccarlblom.com
robots.txt
Robots Exclusion Standard data for ccarlblom.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | ccarlblom.com |
Base Domain | ccarlblom.com |
Scan Status | Ok |
Last Scan | 2024-09-14T16:07:18+00:00 |
Next Scan | 2024-10-14T16:07:18+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-14T16:07:18+00:00 |
URL | https://www.ccarlblom.com/robots.txt |
Domain IPs | 54.36.204.21, 91.134.231.21 |
Response IP | 91.134.231.21 |
Found | Yes |
Hash | 3a4a0308f8f4a0ec0fed84b7518d117d3ea09d81d3eb49b82465b0167ad458ba |
SimHash | 610e826aacc2 |
Groups
*
Rule | Path |
---|---|
Disallow | *search%3D* |
Disallow | *.rss |
Disallow | /*?r=1 |
Disallow | /*?fis=* |
Disallow | /*?subgallery=* |
Disallow | /lightbox |
Disallow | /lightbox?* |
Disallow | /cart |
Disallow | /cart?* |
Disallow | /quotations/* |
Disallow | /users/* |
Disallow | /downloads/* |
Disallow | /invoices/* |
Disallow | /media/*/price |
Disallow | /media/*/price/* |
Disallow | /media/*/share |
Disallow | /media/*?download=* |
Disallow | /media/*/rate*rate%3D* |
Disallow | /-/*/medias/*/price |
Disallow | /-/*/medias/*/price/* |
Disallow | /-/*/medias/*/share |
Disallow | /-/*/medias/*?download=* |
Disallow | /-/*/medias/*/rate*rate%3D* |
Disallow | /m/lightbox |
Disallow | /m/lightbox?* |
Disallow | /m/cart |
Disallow | /m/cart?* |
Disallow | /m/quotations/* |
Disallow | /m/users/* |
Disallow | /m/downloads/* |
Disallow | /m/invoices/* |
Disallow | /m/media/*/price |
Disallow | /m/media/*/price/* |
Disallow | /m/media/*/share |
Disallow | /m/media/*?download=* |
Disallow | /m/media/*/rate*rate%3D* |
Disallow | /m/-/*/medias/*/price |
Disallow | /m/-/*/medias/*/price/* |
Disallow | /m/-/*/medias/*/share |
Disallow | /m/-/*/medias/*?download=* |
Disallow | /m/-/*/medias/*/rate*rate%3D* |
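
The Disallow paths above rely on the `*` wildcard, so a plain prefix comparison is not enough to evaluate them. The following is a minimal sketch of how such patterns could be matched against a URL path, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end, and every pattern is otherwise matched from the start of the path. The function names and the sample subset of rules are illustrative, and percent-encoding normalization is deliberately left out.

```python
import re

# Illustrative subset of the Disallow paths listed in the "*" group.
DISALLOW_PATTERNS = [
    "*search%3D*",
    "*.rss",
    "/*?r=1",
    "/cart",
    "/users/*",
    "/media/*/price",
    "/m/media/*?download=*",
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW_PATTERNS) -> bool:
    """Return True if any pattern matches from the start of the path."""
    # re.match anchors at the beginning, which gives prefix semantics.
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    for sample in ["/cart", "/media/123/price", "/gallery?r=1", "/about"]:
        print(sample, is_disallowed(sample))
```

Because matching is prefix-based under these assumptions, `/cart` also covers `/cart?item=1`, and would cover an unrelated path such as `/cartoons` as well.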
Other Records
Field | Value |
---|---|
crawl-delay | 30 |
sitemap | https://www.ccarlblom.com/sitemap.xml |
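
A crawler could read the `crawl-delay` and `sitemap` records with Python's standard `urllib.robotparser`. The sketch below parses a small hand-written excerpt rather than fetching the live file. The excerpt is an assumption reconstructed from the tables in this report, including the placement of `Crawl-delay` inside the `*` group, and is not a copy of the actual robots.txt. `Crawl-delay` is a nonstandard field that only some crawlers honor, and the standard-library parser compares rule paths by prefix, so it does not expand the `*` wildcards used in most of the rules above.

```python
from urllib.robotparser import RobotFileParser

# Assumed excerpt, rebuilt from the report's tables (not the real file).
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /cart",
    "Crawl-delay: 30",
    "Sitemap: https://www.ccarlblom.com/sitemap.xml",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)  # parse from memory; no network request is made

delay = parser.crawl_delay("*")   # seconds to wait between requests
sitemaps = parser.site_maps()     # list of sitemap URLs (Python 3.8+)
cart_allowed = parser.can_fetch("*", "https://www.ccarlblom.com/cart")

if __name__ == "__main__":
    print("crawl delay:", delay)
    print("sitemaps:", sitemaps)
    print("may fetch /cart:", cart_allowed)
```

A polite client would sleep for `delay` seconds between successive requests to the host, for example with `time.sleep(delay)`.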